You are on page 1of 2281

Geographic Information

Systems:
Concepts, Methodologies, Tools,
and Applications
Information Resources Management Association
USA

Volume I
Managing Director: Lindsay Johnston
Editorial Director: Joel Gamon
Assistant Acquisitions Editor: Kayla Wolfe
Book Production Manager: Jennifer Romanchak
Publishing Systems Analyst: Adrienne Freeland
Development Editor: Chris Wozniak
Assistant Production Editor: Deanna Jo Zombro
Cover Design: Nick Newcomer

Published in the United States of America by


Information Science Reference (an imprint of IGI Global)
701 E. Chocolate Avenue
Hershey PA 17033
Tel: 717-533-8845
Fax: 717-533-8661
E-mail: cust@igi-global.com
Web site: http://www.igi-global.com

Copyright © 2013 by IGI Global. All rights reserved. No part of this publication may be reproduced, stored or distributed in
any form or by any means, electronic or mechanical, including photocopying, without written permission from the publisher.
Product or company names used in this set are for identification purposes only. Inclusion of the names of the products or
companies does not indicate a claim of ownership by IGI Global of the trademark or registered trademark.

Library of Congress Cataloging-in-Publication Data


Geographic information systems (2013)
Geographic information systems : Concepts, methodologies, tools, and applications / Information Resources Management
Association, Editor.
pages cm
Includes bibliographical references and index.
Summary: “This reference is a collection of knowledge on the latest advancements and research of geographic information
systems, offering background for those involved in geographical data”--Provided by publisher.
ISBN 978-1-4666-2038-4 (hardcover) -- ISBN (invalid) 978-1-4666-2039-1 (ebook) -- ISBN (invalid) 978-1-4666-2040-7
(print & perpetual access) 1. Geographic information systems. 2. Geographic information systems--Research. 3. Geospa-
tial data. I. Information Resources Management Association. II. Title.
G70.212.G4257 2013
910.235--dc23
2012029283

British Cataloguing in Publication Data


A Cataloguing in Publication record for this book is available from the British Library.

The views expressed in this book are those of the authors, but not necessarily of the publisher.
Editor-in-Chief
Mehdi Khosrow-Pour, DBA
Editor-in-Chief
Contemporary Research in Information Science and Technology, Book Series

Associate Editors
Steve Clarke, University of Hull, UK
Murray E. Jennex, San Diego State University, USA
Annie Becker, Florida Institute of Technology, USA
Ari-Veikko Anttiroiko, University of Tampere, Finland

Editorial Advisory Board


Sherif Kamel, American University in Cairo, Egypt
In Lee, Western Illinois University, USA
Jerzy Kisielnicki, Warsaw University, Poland
Keng Siau, University of Nebraska-Lincoln, USA
Amar Gupta, Arizona University, USA
Craig van Slyke, University of Central Florida, USA
John Wang, Montclair State University, USA
Vishanth Weerakkody, Brunel University, UK
List of Contributors

Ababsa, Fakhreddine / University of Evry Val d’Essonne, France................................................. 2164


Abadía, Mercedes Farjas / Universidad Politécnica de Madrid, Spain............................................ 970
Abdelouahad, Abdelkaher Ait / University of Mohammed V-Agdal, Morocco................................ 616
Abdullah, Alias / International Islamic University Malaysia, Malaysia............................................ 991
Abdullah, Muhammad Faris / International Islamic University Malaysia, Malaysia..................... 991
Abid, Mohamed Amine / University of Manouba, Tunisia.............................................................. 1345
Aboutajdine, Driss / University of Mohammed V-Agdal, Morocco................................................... 616
Adamchuk, Viacheslav / McGill University, Canada...................................................................... 1368
Aguilar, José Alberto Hernandez / Autonomous University of Morelos, Mexico........................... 1809
Aguilera, Fatima Sayuri Quezada / Autonomous University of Aguacalientes, Mexico................ 1809
Akinyede, J. O. / African Regional Centre for Space and Technology Education, Nigeria............. 1515
Akinyemi, Felicia O. / Kigali Institute of Science and Technology, Rwanda................................... 2040
Alabi, Tunrayo / International Institute of Tropical Agriculture, Nigeria......................................... 912
Alam, Bhuiyan Monwar / The University of Toledo, USA.............................................................. 1385
Algeo, Katie / Western Kentucky University, USA............................................................................ 1309
Ali, Hesham A. / Mansoura University, Egypt................................................................................. 1448
Ali, Mohammed H. / Mansoura University, Egypt.......................................................................... 1448
Allen, Jeffery / Clemson University, USA......................................................................................... 1297
Al-Marri, Salem / Leeds Metropolitan University LEEDS, UK....................................................... 1604
Annegarn, Harold J. / University of Johannesburg, South Africa..................................................... 880
Argyreas, Nikolaos / National Center of Scientific Research “Demokritos,” Greece....................... 726
Arnold, Lesley / Landgate, Australia................................................................................................. 638
Ayansina, Ayanlade / Obafemi Awolowo University, Nigeria............................................................ 938
Ayats, Victor Pascual / Spatial Data Infrastructure of Catalonia (IDEC), Spain............................. 465
Balasubramaniam, S. / JSS University, India.................................................................................. 1988
Barker, Bradley / University of Nebraska-Lincoln, USA................................................................. 1368
Barth, Andreas / Beak Consultants GmbH, Germany........................................................................ 765
Batsakis, Sotirios / Technical University of Crete, Greece............................................................... 1751
Bekkers, Victor / Erasmus University Rotterdam, The Netherlands................................................ 1974
Belghith, Abdelfettah / University of Manouba, Tunisia................................................................. 1345
Beniston, Martin / University of Geneva, Switzerland.................................................................... 1260
Bennewitz, Evelyn / Beak Consultants GmbH, Germany.................................................................. 765
Berglund, Ulla / Swedish University of Agricultural Sciences, Sweden............................................. 799
Bermudez, Luis / Southeastern Universities Research Association, USA......................................... 890
Bernabé-Poveda, Miguel-Ángel / Technical University of Madrid, Spain............................................ 1
Bernadó, Oriol / Science and Technology Park for Tourism and Leisure, Spain............................. 1100
Bigorra, Albert / Science and Technology Park for Tourism and Leisure, Spain............................ 1100
Bimonte, Sandro / Laboratoire d’Informatique en Images et Systèmes d’information, France &
Cemagref, UR TSCF, France..................................................................................................... 91, 377
Bizani, Erasmia / Aristotle University of Thessaloniki, Greece....................................................... 1559
Blackwell, P. R. / Stephen F. Austin State University, USA................................................................ 926
Blank, Daniel / University of Bamberg, Germany.............................................................................. 666
Bol, Peter K. / Harvard University, USA.......................................................................................... 1764
Borràs, Joan / Science and Technology Park for Tourism and Leisure, Spain................................. 2208
Brimicombe, Allan J. / University of East London, UK..................................................................... 151
Brisaboa, Nieves R. / University of A Coruña, Spain......................................................................... 358
Brister, Evelyn / Rochester Institute of Technology, USA................................................................ 2063
Brown, Stacey R. / Oklahoma State University, USA...................................................................... 1864
Brunner, Dominik / Fraunhofer Institute of Optronics, Germany................................................... 1587
Bruno, Fernanda / Universidade Federal do Rio de Janeiro, Brazil.................................................. 66
Brunt, Matthew / Western Kentucky University, USA..................................................................... 1309
Bunch, Rick / University of North Carolina at Greensboro, USA.................................................... 1612
Burns, Meghan D. / Montana Natural Heritage Program, USA..................................................... 1170
Butterworth, Edwin / CACI International, USA............................................................................. 1665
Buurma, J.S. / Wageningen UR, The Netherlands........................................................................... 1883
Calvo, Pedro Ángel Merino / Universidad Politécnica de Madrid, Spain........................................ 970
Cameron, Nairne / Algoma University, Canada.............................................................................. 1665
Cangel, Ayça / International Competitiveness Research Institute, Turkey....................................... 1392
Carroll, Jon W. / Michigan State University, USA........................................................................... 1328
Cerney, Dawna L. / Youngstown State University, USA................................................................... 1665
Chandramouli, Magesh / Purdue University, USA............................................................................ 216
Chen, Yenming J. / National Kaohsiung First University of Science and Technology, Taiwan.......... 81
Cherifi, Hocine / University of Burgundy, France.............................................................................. 616
Christophoridis, Christophoros / Aristotle University of Thessaloniki, Greece............................. 1559
Clave, Salvador Anton / University Rovira i Virgili, Spain................................................... 1100, 2208
Coetzee, Serena / University of Pretoria, South Africa ................................................................... 1794
Comer, Jonathan C. / Oklahoma State University, USA.................................................................. 1864
Connolly, Matthew H. / Texas State University-San Marcos, USA................................................. 1953
Conover, Helen / The University of Alabama in Huntsville, USA...................................................... 890
Cooper, Antony K / CSIR, South Africa & University of Pretoria, South Africa............................. 1794
Corcho, Oscar / Universidad Politécnica de Madrid, Spain.............................................................. 434
Crumlish, John / University of Ulster, UK......................................................................................... 540
Curran, Kevin / University of Ulster, UK.......................................................................................... 540
Curtin, Kevin M. / George Mason University, USA............................................................................ 39
D’Ulizia, Arianna / Consiglio Nazionale delle Ricerche, IRPPS, Italy.............................................. 848
Dao, Hy / University of Geneva and UNEP, Switzerland.................................................................. 1260
Dartevelle, Sébastien / Los Alamos National Laboratory, USA....................................................... 1697
Day, Patrice / University of Wisconsin-Milwaukee, USA................................................................. 1525
De Bono, Andrea / University of Geneva and UNEP, Switzerland.................................................. 1260
de Kool, Dennis / Center for Public Innovation, The Netherlands.................................................. 1974
de la Flor, Joan / Science and Technology Park for Tourism and Leisure, Spain............................ 2208
Dean, Denis J. / University of Texas at Dallas, USA........................................................................ 1828
DeGroote, John / University of Northern Iowa, USA......................................................................... 480
DeMers, Michael N. / New Mexico State University, USA............................................................... 1541
Díaz, Laura / Universitat Jaume I de Castellón, Spain & Institute of New Imaging Technologies,
Universitat Jaume I, Spain....................................................................................................... 305, 952
Didier, Jean-Yves / University of Evry Val d’Essonne, France........................................................ 2164
Dolney, Timothy J. / Penn State University – Altoona, USA........................................................... 2128
Duckham, Matt / University of Melbourne, Australia......................................................................... 24
Dugas, Daniel P. / New Mexico State University, USA..................................................................... 1541
Duke-Williams, Oliver / University of Leeds, UK........................................................................... 1675
Eckert, Jeanette / The University of Toledo, USA............................................................................ 1385
El Asmar, Toufic / University of Florence, Italy................................................................................. 228
El Hassouni, Mohammed / University of Mohammed V-Agdal, Morocco........................................ 616
Epelbaum, Samuel / Pace University, USA..................................................................................... 1724
Epperson, Ann / Western Kentucky University, USA....................................................................... 1309
Everding, Thomas / University of Muenster, Germany.................................................................... 2020
Fernández-Wyttenbach, Alberto / Technical University of Madrid, Spain.......................................... 1
Ferri, Fernando / Consiglio Nazionale delle Ricerche, IRPPS, Italy................................................ 848
Fisher, Gavin / University of Ulster, UK............................................................................................. 540
Fitzner, Daniel / Fraunhofer IGD, Germany.................................................................................... 1619
Fonollosa, Arturo Beltran / Universitat Jaume I de Castellón, Spain............................................ 2176
Fraser, Kym / University of South Australia, Australia................................................................... 2150
Fridman, Alexander / Russian Academy of Science, Russia........................................................... 1724
Fugazza, Cristiano / Institute of Environment and Sustainability, Italy............................................ 305
Fytianos, Konstantinos / Aristotle University of Thessaloniki, Greece........................................... 1559
Gallegos, Julio Cesar Ponce / Autonomous University of Aguacalientes, Mexico.......................... 1809
Gallo, Giovanni / University of Catania, Italy................................................................................. 1132
Gangopadhyay, Aryya / University of Maryland - Baltimore County, USA........................................ 47
Gardarin, Georges / PRiSM Laboratory, France.............................................................................. 550
Gaur, S. / Ecole Nationale Supérieure des Mines, France................................................................. 736
Gelernter, Judith / Carnegie Mellon University, USA..................................................................... 1941
Gensel, Jerome / Laboratoire d’Informatique de Grenoble, France.................................................. 377
Ghose, Rina / University of Wisconsin-Milwaukee, USA................................................................. 1525
Gil, José / Institute of New Imaging Technologies, Universitat Jaume I, Spain................................. 952
Gillavry, Edward Mac / Webmapper, The Netherlands................................................................... 1231
Giuliani, Gregory / University of Geneva and UNEP, Switzerland................................................. 1260
Goh, Mark / University of South Australia, Australia & National University of Singapore,
Singapore....................................................................................................................................... 2150
Gorod, Alex / Stevens Institute of Technology, USA......................................................................... 1724
Gould, Michael / Universitat Jaume I, Spain..................................................................................... 493
Graham, Amy K. / Oklahoma State University, USA...................................................................... 1864
Graillot, D. / Ecole Nationale Supérieure des Mines, France............................................................ 736
Grandgenett, Neal / University of Nebraska at Omaha, USA.......................................................... 1368
Granell, Carlos / Universitat Jaume I de Castellón, Spain & Institute for Environment and
Sustainability, Italy.................................................................................................................. 305, 952
Graves, Sara / The University of Alabama in Huntsville, USA.......................................................... 890
Greenlee, Janet C. / New Mexico State University, USA................................................................. 1541
Gribb, William J. / University of Wyoming, USA............................................................................ 1665
Grifoni, Patrizia / Consiglio Nazionale delle Ricerche, IRPPS, Italy................................................ 848
Guan, Weihe (Wendy) / Harvard University, USA.......................................................................... 1764
Guarnera, Giuseppe Claudio / University of Catania, Italy........................................................... 1132
Guo, Zhen-Sheng / Hokkaido University, Japan.............................................................................. 1216
Gupta, Phalguni / Indian Institute of Technology Kanpur, India....................................................... 703
Hammitzsch, Martin / Helmholtz Centre Potsdam, GFZ German Research Centre for Geosciences,
Germany........................................................................................................................................... 449
Hane, Elizabeth / Rochester Institute of Technology, USA.............................................................. 2063
Hanke, Henrik / University of Duisburg-Essen, Germany................................................................... 17
Hansen, Stefan / Spatial/Information Systems Ltd./LISAsoft, Australia............................................ 250
Hawthorne, Timothy L. / Columbus State University, USA............................................................ 1848
Haynes, Kingsley E. / George Mason University, USA.................................................................... 1665
Hegedüs, Péter / Budapest University of Technology and Economics, Hungary............................... 609
Henrich, Andreas / University of Bamberg, Germany....................................................................... 666
Herold, Sam / University of Ottawa, Canada..................................................................................... 175
Ho, Chao-Ching / National Yunlin University of Science and Technology, Taiwan........................... 783
Hodge, Bill / City of Midland, USA................................................................................................... 1665
Honea, Robert B. / KU Transportation Research Institute, USA..................................................... 1665
Hosszú, Gábor / Budapest University of Technology and Economics, Hungary............................... 609
Huang, Bo / Chinese University of Hong Kong, China...................................................................... 216
Huerta, Joaquín / Institute of New Imaging Technologies, Universitat Jaume I, Spain.................... 952
Hyyppä, Juha / Finnish Geodetic Institute, Finland........................................................................ 1011
Ioja, Cristian / University of Bucharest, Romania........................................................................... 1276
Iqbal, Muhammad Usman / University of New South Wales, Australia........................................... 399
Irkhin, Alexey / Beak Consultants GmbH, Germany......................................................................... 765
Isern, David / Universitat Rovira i Virgili, Spain............................................................................. 2208
Jaakkola, Anttoni / Finnish Geodetic Institute, Finland................................................................. 1011
Jegede, O. O. / Obafemi Awolowo University, Nigeria....................................................................... 938
Jenihhin, Maksim / Tallinn University of Technology, Estonia......................................................... 407
Jha, Manoj K. / Morgan State University, USA................................................................................. 857
Joshi, P. K. / TERI University, India................................................................................................. 1913
Jutman, Artur / Tallinn University of Technology, Estonia............................................................... 407
Kaartinen, Harri / Finnish Geodetic Institute, Finland................................................................... 1011
Kakaletris, George / University of Athens, Greece............................................................................ 258
Kathlene, Lyn / Colorado State University, USA............................................................................... 685
Katsianis, Dimitris / University of Athens, Greece............................................................................ 258
Kaushik, Vandana Dixit / Harcourt Butler Technological Institute, India........................................ 703
Keiser, Ken / The University of Alabama in Huntsville, USA............................................................. 890
Khant, Aung Phey / Asian Institute of Technology, Thailand............................................................ 815
Kholladi, Mohamed-Khireddine / University Mentouri of Constantine, Algeria............................ 750
Klimaszewski-Patterson, Anna / New Mexico State University, USA............................................ 1541
Klippel, Alexander / University of Melbourne, Australia.................................................................. 250
Korfmacher, Karl / Rochester Institute of Technology, USA........................................................... 2063
Kourie, Derrick G / University of Pretoria, South Africa . ............................................................. 1794
Koussouris, Sotirios / National Technical University of Athens, Greece......................................... 1243
Kovács, Ferenc / Budapest University of Technology and Economics, Hungary............................... 609
Ku, Wei-Shinn / Auburn University, USA.......................................................................................... 631
Lancaster, Anne / Independent Researcher, USA............................................................................... 529
Larsen, Lillian I. / University of Redlands, USA................................................................................ 832
Lehmann, Anthony / University of Geneva and UNEP, Switzerland.............................................. 1260
Li, Yang / University of East London, UK.......................................................................................... 151
Lim, Samsung / University of New South Wales, Australia............................................................... 399
Lindquist, Peter S. / The University of Toledo, USA........................................................................ 1385
Liu, Zhengjun / Chinese Academy of Surveying and Mapping, China............................................ 1011
Llaves, Alejandro / University of Muenster, Germany..................................................................... 2020
Loi, Nguyen Kim / Nong Lam University, Vietnam.......................................................................... 2080
Lourenço, Júlia M. / University of Minho, Portugal....................................................................... 1476
Lu, Kang Shou / Towson University, USA........................................................................................ 1297
Luaces, Miguel R. / University of A Coruña, Spain........................................................................... 358
Lutz, Collin / Virginia Polytechnic Institute and State University, USA.......................................... 1368
Manso-Callejo, Miguel-Angel / Universidad Politécnica de Madrid, Spain.................................. 2176
Mansouri, Mo / Stevens Institute of Technology, USA..................................................................... 1724
Martins, Hugo / Universidade Nova de Lisboa, Portugal & Universidade Técnica de Lisboa,
Portugal......................................................................................................................................... 1773
Maskey, Manil / The University of Alabama in Huntsville, USA....................................................... 890
Mathews, Leah Greden / University of North Carolina at Asheville, USA....................................... 529
Matyas, Corene J. / University of Florida, USA.............................................................................. 1069
Mazimpaka, Jean Damascène / National University of Rwanda, Rwanda....................................... 295
Mazzetti, Paolo / National Research Council, Italy........................................................................... 305
McDonald, Darrel / Stephen F. Austin State University, USA............................................................ 926
McGuire, Steven / Aberystwyth University, UK................................................................................. 108
McKnight, Kevin P. / Michigan Department of Transportation & Michigan State University,
USA................................................................................................................................................ 1170
Meenar, Mahbubur R. / Temple University, USA.............................................................................. 122
Men, Hao / Stevens Institute of Technology, USA............................................................................... 502
Messina, Joseph P. / Michigan State University, USA..................................................................... 1170
Messineo, Antonio / Kore University of Enna, Italy......................................................................... 1411
Mimoun, D. / Ecole Nationale Supérieure des Mines, France........................................................... 736
Misra, Santosh K. / Cleveland State University, USA........................................................................ 336
Moody, Rebecca / Erasmus University Rotterdam, The Netherlands.............................................. 1974
Moore, Alyssa K. / University of Redlands, USA............................................................................... 832
Moreno, Antonio / Universitat Rovira i Virgili, Spain..................................................................... 2208
Morgan, John / Towson University, USA......................................................................................... 1297
Morgan, Kathy / University of Nebraska-Lincoln, USA.................................................................. 1368
Nakato, Teddy / Ministry of Agriculture, Uganda.............................................................................. 938
Nativi, Stefano / National Research Council, Italy............................................................................ 305
Neumann, Alf / University of Cologne, Germany................................................................................ 17
Niţă, Mihai / University of Bucharest, Romania............................................................................... 1276
Noack, Silke / Beak Consultants GmbH, Germany............................................................................ 765
Norder, John W. / Michigan State University, USA......................................................................... 1328
Nordin, Kerstin / Swedish University of Agricultural Sciences, Sweden........................................... 799
Nugent, Gwen / University of Nebraska-Lincoln, USA.................................................................... 1368
Oduwole, Olusoji / Cocoa Research Institute of Nigeria, Nigeria..................................................... 912
Okafor, Christopher / International Institute of Tropical Agriculture, Nigeria................................ 912
Olaleye, V. F. / Obafemi Awolowo University, Nigeria....................................................................... 938
Olufemi, Bolarin / University of Ilorin Nigeria................................................................................. 938
Onose, Diana / University of Bucharest, Romania........................................................................... 1276
Opeyemi, Zubair A. / National Space Research and Development Agency, Nigeria...................... 1515
Orosz, Mihály / Budapest University of Technology and Economics, Hungary................................ 609
Oryspayev, Dossay / University of Northern Iowa, USA.................................................................. 480
Panno, Domenico / University of Palermo, Italy.............................................................................. 1411
Pappathanasi, Kym / Salem State University, USA......................................................................... 1090
Paradzayi, Charles / University of Johannesburg, South Africa & Midlands State University,
Zimbabwe......................................................................................................................................... 880
Patent-Nygren, Megan / University of Nebraska-Lincoln, USA..................................................... 1368
Pathak, Prasad A. / University of North Carolina at Greensboro, USA.................................. 130, 1612
Patroescu, Maria / University of Bucharest, Romania..................................................................... 1276
Pazos, Andrés / Universitat Jaume I, Spain........................................................................................ 493
Peduzzi, Pascal / UNEP, Switzerland............................................................................................... 1260
Pérez, Yolanda / University Rovira i Virgili, Spain................................................................ 1100, 2208
Peters, Rob / University of Amsterdam, The Netherlands................................................................ 1243
Petrakis, Euripides G.M. / Technical University of Crete, Greece................................................. 1751
Pigozzi, Bruce W. / Michigan State University, USA....................................................................... 1170
Pochiraju, Kishore / Stevens Institute of Technology, USA............................................................... 502
Poveda, José / University of Texas, USA............................................................................................. 493
Prasetyo, Lilik B. / Bogor Agriculture University, Indonesia.......................................................... 1901
Priyanka, Neena / TERI University, India........................................................................................ 1913
Pultar, Edward / University of California – Santa Barbara, USA................................................... 2006
Quddus, Mohammed A. / Loughborough University, UK................................................................. 658
Quintana, Manuel Sillero / Universidad Politécnica de Madrid, Spain........................................... 970
Rahaman, Khan R. / University of Minho, Portugal....................................................................... 1476
Raik, Jaan / Tallinn University of Technology, Estonia...................................................................... 407
Ramachandran, Muthu / Leeds Metropolitan University, LEEDS UK........................................... 1604
Ray, Nicolas / University of Geneva and UNEP, Switzerland.......................................................... 1260
Reid, Neil / The University of Toledo, USA....................................................................................... 1053
Rex, Art / Appalachian State University, USA.................................................................................... 529
Riccioli, Francesco / University of Florence, Italy............................................................................. 228
Richter, Kai-Florian / Universität Bremen, Germany....................................................................... 250
Rocha, Jorge G. / Universidade do Minho, Portugal....................................................................... 1773
Rozylowicz, Laurenţiu / University of Bucharest, Romania............................................................ 1276
Russo, Antonio P. / University Rovira i Virgili, Spain............................................................ 1100, 2208
Saleh, Ahmed I. / Mansoura University, Egypt................................................................................ 1448
Samanta, Sutapa / Maryland State Highway Administration, USA................................................... 857
Santoro, Mattia / National Research Council, Italy........................................................................... 305
Saquicela, Victor / Universidad Politécnica de Madrid, Spain......................................................... 434
Sauser, Brian / Stevens Institute of Technology, USA....................................................................... 1724
Savary, Lionel / PRiSM Laboratory, France...................................................................................... 550
Sawada, Michael C. / University of Ottawa, Canada........................................................................ 175
Schmidt, Frank / Beak Consultants GmbH, Germany....................................................................... 765
Schwarzer, Stefan / UNEP, Switzerland........................................................................................... 1260
Seco, Diego / University of A Coruña, Spain...................................................................................... 358
Setiawan, Yudi / Bogor Agriculture University, Indonesia.............................................................. 1901
Shampur, Nikhil V. / Cleveland State University, USA........................................................................ 47
Shortridge, Ashton M. / Michigan State University, USA............................................................... 1170
Siabato, Willington / Technical University of Madrid, Spain................................................................ 1
Sikder, Iftikhar U. / Cleveland State University, USA............................................. 47, 336, 1183, 1421
Sinton, Diana Stuart / University of Redlands, USA......................................................................... 832
Smith, Bruce W. / Bowling Green State University, USA................................................................ 1053
Sonder, Kai / International Maize and Wheat Improvement Center, Mexico..................................... 912
Sorrentino, John A. / Temple University, USA................................................................................... 122
Sphicopoulos, Thomas / University of Athens, Greece...................................................................... 258
Sportolaro, Carlo / Agronomist, Perugia, Italy................................................................................ 1429
Sridhar, R. / Sri Ramakrishna Mission Vidyalaya College of Arts and Science, India.................... 1988
Srivastava, Rajeev / Institute of Technology, Banaras Hindu University (ITBHU), India................ 569
Stanco, Filippo / University of Catania, Italy................................................................................... 1132
Stefansson, Gunnar / Chalmers University of Technology, Sweden & University of Iceland,
Iceland............................................................................................................................................ 2051
Stillwell, John / University of Leeds, UK......................................................................................... 1675
Sugumaran, Ramanathan / University of Northern Iowa, USA....................................................... 480
Tanaka, Yuzuru / Hokkaido University, Japan................................................................................ 1216
Tanasi, Davide / Arcadia University, USA........................................................................................ 1132
Tapp, Anna / University of North Carolina at Greensboro, USA..................................................... 1612
Thomopoulos, Stelios C. A. / National Center of Scientific Research “Demokritos,” Greece......... 726
Todaro, Rudj Maria / Insula S.p.A., Italy........................................................................................ 1392
Torquati, Biancamaria / University of Perugia, Italy...................................................................... 1429
Towers, George / Concord University, USA..................................................................................... 1151
Toxopeus, Albertus G. / University of Twente, The Netherlands..................................................... 2102
Tripathi, Nitin Kumar / Asian Institute of Technology, Thailand..................................................... 815
Trisurat, Yongyut / Kasetsart University, Thailand......................................................................... 2102
Tsitsanis, Anastasios / National Technical University of Athens, Greece........................................ 1243
Ubar, Raimund / Tallinn University of Technology, Estonia.............................................................. 407
Valcik, Nicolas A. / The University of Texas at Dallas, USA............................................................ 1502
Valls, Aida / Universitat Rovira i Virgili, Spain................................................................................ 2208
van Westen, Cees / University of Twente, The Netherlands............................................................. 1120
Van Woerden, Jaap / UNEP, Switzerland........................................................................................ 1260
Varoutas, Dimitris / University of Athens, Greece............................................................................. 258
Veenendaal, Bert / Curtin University, Australia................................................................................ 638
Vilches-Blázquez, Luis. M. / Universidad Politécnica de Madrid, Spain......................................... 434
Villalón, Christian José Correa / Institute of Education of Aguascalientes, Mexico...................... 1809
Villanova-Oliver, Marlène / Laboratoire d’Informatique de Grenoble, France............................... 377
Vizzari, Marco / University of Perugia, Italy................................................................................... 1429
Vogt, Brandon J. / University of Colorado at Colorado Springs, USA................................. 1200, 1665
Volpe, Roberto / University of Palermo, Italy.................................................................................. 1411
Wallgrün, Jan Oliver / Universität Bremen, Germany.................................................................... 1639
Wang, Haojun / University of Southern California, USA.................................................................. 631
West, Geoff / Curtin University, Australia.......................................................................................... 638
Whalen, Stephen / University of North Carolina at Chapel Hill, USA.............................................. 130
Whitesides, Clayton J. / Texas State University-San Marcos, USA................................................. 1953
Whitford, Walter G. / New Mexico State University, USA.............................................................. 1541
Wijaya, Chandra Irawadi / Bogor Agriculture University, Indonesia............................................ 1901
Witt, Ron / UNEP, Switzerland......................................................................................................... 1260
Woldai, Tsehaie / University of Twente, The Netherlands................................................................ 1120
Wolter, Diedrich / Universität Bremen, Germany............................................................................ 1639
Woodside, Joseph M. / Cleveland State University, USA................................................................ 1183
Wu, Xiaoying / Curtin University, Australia...................................................................................... 638
Xia, Jianhong (Cecilia) / Curtin University, Australia...................................................................... 638
Yesmin, Sharmin / Temple University, USA....................................................................................... 122
Young, Stephen / Salem State University, USA................................................................................ 1090
Zadorozhny, Vladimir I. / University of Pittsburgh, USA................................................................... 30
Zahari, Rustam Khairi / International Islamic University Malaysia, Malaysia............................... 991
Zeitouni, Karine / PRiSM Laboratory, France.................................................................................. 550
Zendjebil, Iman Maissa / University of Evry Val d’Essonne, France............................................. 2164
Zeug, Gunter / European Environment Agency, Denmark............................................................... 1587
Zhu, Lingli / Finnish Geodetic Institute, Finland............................................................................. 1011
Zimmermann, Roger / National University of Singapore, Singapore............................................... 631
Table of Contents

Volume I

Section 1
Fundamental Concepts and Theories
This section serves as a foundation for this exhaustive reference tool by addressing underlying principles
essential to the understanding of Geographic Information Systems. Chapters found within these pages
provide an excellent framework in which to position Geographic Information Systems within the field of
information science and technology. Insight regarding the critical incorporation of global measures into
Geographic Information Systems is addressed, while crucial stumbling blocks of this field are explored.
With 15 chapters comprising this foundational section, the reader can learn and chose from a compen-
dium of expert research on the elemental theories and technological building blocks underscoring the
Geographic Information Systems discipline.

Chapter 1
Showing the Past: Integrating Cartographic Collections through Virtual Map Rooms........................... 1
Willington Siabato, Technical University of Madrid, Spain
Alberto Fernández-Wyttenbach, Technical University of Madrid, Spain
Miguel-Ángel Bernabé-Poveda, Technical University of Madrid, Spain

Chapter 2
Location-Based Services: A Taxonomy on Theory and Practice........................................................... 17
Henrik Hanke, University of Duisburg-Essen, Germany
Alf Neumann, University of Cologne, Germany

Chapter 3
Location Privacy.................................................................................................................................... 24
Matt Duckham, University of Melbourne, Australia

Chapter 4
Location-Based Performance Tuning in Mobile Sensor Networks....................................................... 30
Vladimir I. Zadorozhny, University of Pittsburgh, USA
Chapter 5
Routing ................................................................................................................................................. 39
Kevin M. Curtin, George Mason University, USA

Chapter 6
Web-Based Geospatial Services: Implementing Interoperability Specifications................................... 47
Iftikhar U. Sikder, Cleveland State University, USA
Aryya Gangopadhyay, University of Maryland - Baltimore County, USA
Nikhil V. Shampur, Cleveland State University, USA

Chapter 7
A Brief Cartography of Smart Cameras: Proactive Surveillance and Control....................................... 66
Fernanda Bruno, Universidade Federal do Rio de Janeiro, Brazil

Chapter 8
GIS, Grid Computing and RFID in Healthcare Information Supply Chain: A Case for Infectious
Disaster Management............................................................................................................................. 81
Yenming J. Chen, National Kaohsiung First University of Science and Technology, Taiwan

Chapter 9
On Modeling and Analysis of Multidimensional Geographic Databases . ........................................... 91
Sandro Bimonte, Laboratoire d’Informatique en Images et Systèmes d’information, France

Chapter 10
Technology and the Multipolar Global Economy: Implications for European Competitiveness........ 108
Steven McGuire, Aberystwyth University, UK

Chapter 11
Coupling GPS and GIS........................................................................................................................ 122
Mahbubur R. Meenar, Temple University, USA
John A. Sorrentino, Temple University, USA
Sharmin Yesmin, Temple University, USA

Chapter 12
Using Geospatial Techniques to Analyze Landscape Factors Controlling Ionic Composition of
Arctic Lakes, Toolik Lake Region, Alaska.......................................................................................... 130
Prasad A. Pathak, University of North Carolina at Greensboro, USA
Stephen Whalen, University of North Carolina at Chapel Hill, USA

Chapter 13
Mobile Geographic Information Systems . ......................................................................................... 151
Yang Li, University of East London, UK
Allan J. Brimicombe, University of East London, UK
Chapter 14
A Review of Geospatial Information Technology for Natural Disaster Management in Developing
Countries ............................................................................................................................................. 175
Sam Herold, University of Ottawa, Canada
Michael C. Sawada, University of Ottawa, Canada

Chapter 15
Virtual Environments for Geospatial Applications.............................................................................. 216
Magesh Chandramouli, Purdue University, USA
Bo Huang, Chinese University of Hong Kong, China

Section 2
Development and Design Methodologies
This section provides in-depth coverage of conceptual architecture frameworks to provide the reader
with a comprehensive understanding of the emerging developments within the field of Geographic In-
formation Systems. Research fundamentals imperative to the understanding of developmental processes
within Geographic Information Systems are offered. From broad examinations to specific discussions on
methodology, the research found within this section spans the discipline while offering detailed, specific
discussions. From basic designs to abstract development, these chapters serve to expand the reaches
of development and design technologies within the Geographic Information Systems community. This
section includes 15 contributions from researchers throughout the world on the topic of Geographic
Information Systems.

Chapter 16
GIS Technique for Territorial Analysis: Spatial MultiCriteria Decision Analysis............................... 228
Francesco Riccioli, University of Florence, Italy
Toufic El Asmar, University of Florence, Italy

Chapter 17
Cognitively Ergonomic Route Directions............................................................................................ 250
Alexander Klippel, University of Melbourne, Australia
Kai-Florian Richter, Universität Bremen, Germany
Stefan Hansen, Spatial/Information Systems Ltd./LISAsoft, Australia

Chapter 18
Design and Implementation Approaches for Location-Based, Tourism-Related Services.................. 258
George Kakaletris, University of Athens, Greece
Dimitris Varoutas, University of Athens, Greece
Dimitris Katsianis, University of Athens, Greece
Thomas Sphicopoulos, University of Athens, Greece

Chapter 19
Methodical Spatial Database Design with Topological Polygon Structures ...................................... 295
Jean Damascène Mazimpaka, National University of Rwanda, Rwanda
Chapter 20
Methodologies for Augmented Discovery of Geospatial Resources .................................................. 305
Mattia Santoro, National Research Council, Italy
Paolo Mazzetti, National Research Council, Italy
Stefano Nativi, National Research Council, Italy
Cristiano Fugazza, Institute of Environment and Sustainability, Italy
Carlos Granell, Universitat Jaume I de Castellón, Spain
Laura Díaz, Universitat Jaume I de Castellón, Spain

Chapter 21
Semantic Interoperability of Geospatial Services................................................................................ 336
Iftikhar U. Sikder, Cleveland State University, USA
Santosh K. Misra, Cleveland State University, USA

Chapter 22
New Discovery Methodologies in GIS: Improving the Information Retrieval Process...................... 358
Nieves R. Brisaboa, University of A Coruña, Spain
Miguel R. Luaces, University of A Coruña, Spain
Diego Seco, University of A Coruña, Spain

Chapter 23
A Multidimensional Model for Correct Aggregation of Geographic Measures ................................. 377
Sandro Bimonte, Cemagref, UR TSCF, France
Marlène Villanova-Oliver, Laboratoire d’Informatique de Grenoble, France
Jerome Gensel, Laboratoire d’Informatique de Grenoble, France

Chapter 24
Location Privacy in Automotive Telematics . ..................................................................................... 399
Muhammad Usman Iqbal, University of New South Wales, Australia
Samsung Lim, University of New South Wales, Australia

Chapter 25
Diagnostic Modeling of Digital Systems with Multi-Level Decision Diagrams . .............................. 407
Raimund Ubar, Tallinn University of Technology, Estonia
Jaan Raik, Tallinn University of Technology, Estonia
Artur Jutman, Tallinn University of Technology, Estonia
Maksim Jenihhin, Tallinn University of Technology, Estonia

Chapter 26
Semantic Annotation of Geospatial RESTful Services Using External Resources............................. 434
Victor Saquicela, Universidad Politécnica de Madrid, Spain
Luis. M. Vilches-Blázquez, Universidad Politécnica de Madrid, Spain
Oscar Corcho, Universidad Politécnica de Madrid, Spain
Chapter 27
Framework for Graphical User Interfaces of Geospatial Early Warning Systems ............................. 449
Martin Hammitzsch, Helmholtz Centre Potsdam, GFZ German Research Centre for
Geosciences, Germany

Chapter 28
User-Friendly Geoportal Interfaces for Geospatial Resource Discovery ........................................... 465
Victor Pascual Ayats, Spatial Data Infrastructure of Catalonia (IDEC), Spain

Chapter 29
Development and Application of a Spreadsheet-Based Spatial Decision Support
System (SDSS).................................................................................................................................... 480
Dossay Oryspayev, University of Northern Iowa, USA
Ramanathan Sugumaran, University of Northern Iowa, USA
John DeGroote, University of Northern Iowa, USA

Chapter 30
A Package-Based Architecture for Customized GIS .......................................................................... 493
Andrés Pazos, Universitat Jaume I, Spain
José Poveda, University of Texas, USA
Michael Gould, Universitat Jaume I, Spain

Section 3
Tools and Technologies
This section presents extensive coverage of various tools and technologies available in the field of
Geographic Information Systems that practitioners and academics alike can utilize to develop different
techniques. These chapters enlighten readers about fundamental research on the many tools facilitat-
ing the burgeoning field of Geographic Information Systems. It is through these rigorously researched
chapters that the reader is provided with countless examples of the up-and-coming tools and technologies
emerging from the field of Geographic Information Systems. With 18 chapters, this section offers a broad
treatment of some of the many tools and technologies within the Geographic Information Systems field.

Chapter 31
Algorithms for 3D Map Segment Registration.................................................................................... 502
Hao Men, Stevens Institute of Technology, USA
Kishore Pochiraju, Stevens Institute of Technology, USA

Chapter 32
TVAL-Farm: A Qualitative Enhancement of the LESA Model........................................................... 529
Art Rex, Appalachian State University, USA
Leah Greden Mathews, University of North Carolina at Asheville, USA
Anne Lancaster, Independent Researcher, USA
Chapter 33
OpenStreetMap.................................................................................................................................... 540
Kevin Curran, University of Ulster, UK
John Crumlish, University of Ulster, UK
Gavin Fisher, University of Ulster, UK

Chapter 34
GeoCache: A Cache for GML Geographical Data............................................................................... 550
Lionel Savary, PRiSM Laboratory, France
Georges Gardarin, PRiSM Laboratory, France
Karine Zeitouni, PRiSM Laboratory, France

Volume II
Chapter 35
PDE-Based Image Processing: Image Restoration.............................................................................. 569
Rajeev Srivastava, Institute of Technology, Banaras Hindu University (ITBHU), India

Chapter 36
Multicast over Location-Based Services ............................................................................................ 609
Péter Hegedüs, Budapest University of Technology and Economics, Hungary
Mihály Orosz, Budapest University of Technology and Economics, Hungary
Gábor Hosszú, Budapest University of Technology and Economics, Hungary
Ferenc Kovács, Budapest University of Technology and Economics, Hungary

Chapter 37
A New Image Distortion Measure Based on Natural Scene Statistics Modeling................................ 616
Abdelkaher Ait Abdelouahad, University of Mohammed V-Agdal, Morocco
Mohammed El Hassouni, University of Mohammed V-Agdal, Morocco
Hocine Cherifi, University of Burgundy, France
Driss Aboutajdine, University of Mohammed V-Agdal, Morocco

Chapter 38
Modern Navigation Systems and Related Spatial Query . .................................................................. 631
Wei-Shinn Ku, Auburn University, USA
Haojun Wang, University of Southern California, USA
Roger Zimmermann, National University of Singapore, Singapore

Chapter 39
Managing Schema Evolution in a Federated Spatial Database System............................................... 638
Xiaoying Wu, Curtin University, Australia
Jianhong (Cecilia) Xia, Curtin University, Australia
Geoff West, Curtin University, Australia
Lesley Arnold, Landgate, Australia
Bert Veenendaal, Curtin University, Australia
Chapter 40
Map Matching Algorithms for Intelligent Transport Systems............................................................. 658
Mohammed A. Quddus, Loughborough University, UK

Chapter 41
Describing and Selecting Collections of Georeferenced Media Items in Peer-to-Peer Information
Retrieval Systems................................................................................................................................. 666
Daniel Blank, University of Bamberg, Germany
Andreas Henrich, University of Bamberg, Germany

Chapter 42
Cognitive Mapping and GIS for Community-Based Resource Identification .................................... 685
Lyn Kathlene, Colorado State University, USA

Chapter 43
Complementary Part Detection and Reassembly of 3D Fragments ................................................... 703
Vandana Dixit Kaushik, Harcourt Butler Technological Institute, India
Phalguni Gupta, Indian Institute of Technology Kanpur, India

Chapter 44
Biometric Authentication in Broadband Networks for Location-Based Services............................... 726
Stelios C. A. Thomopoulos, National Center of Scientific Research “Demokritos,” Greece
Nikolaos Argyreas, National Center of Scientific Research “Demokritos,” Greece

Chapter 45
Multi-Criteria Decision Analysis for Identifying a Suitable Location for Groundwater Pumping
Wells.................................................................................................................................................... 736
D. Mimoun, Ecole Nationale Supérieure des Mines, France
S. Gaur, Ecole Nationale Supérieure des Mines, France
D. Graillot, Ecole Nationale Supérieure des Mines, France

Chapter 46
Last Online Deposits Spatial Data in the Web..................................................................................... 750
Mohamed-Khireddine Kholladi, University Mentouri of Constantine, Algeria

Chapter 47
Spatial Modeling of Natural Phenomena and Events with Artificial Neural Networks
and GIS ............................................................................................................................................... 765
Silke Noack, Beak Consultants GmbH, Germany
Andreas Barth, Beak Consultants GmbH, Germany
Alexey Irkhin, Beak Consultants GmbH, Germany
Evelyn Bennewitz, Beak Consultants GmbH, Germany
Frank Schmidt, Beak Consultants GmbH, Germany
Chapter 48
Stereo-Vision-Based Fire Detection and Suppression Robot for Buildings ....................................... 783
Chao-Ching Ho, National Yunlin University of Science and Technology, Taiwan

Section 4
Utilization and Application
This section discusses a variety of applications and opportunities available that can be considered by
practitioners in developing viable and effective Geographic Information Systems programs and pro-
cesses. This section includes 16 chapters that review topics from case studies in Africa to best practices
in Asia and ongoing research in the United States. Further chapters discuss Geographic Information
Systems in a variety of settings (environment, city planning, disease/disaster management, etc.). Con-
tributions included in this section provide excellent coverage of today’s IT community and how research
into Geographic Information Systems is impacting the social fabric of our present-day global village.

Chapter 49
Children’s Maps in GIS: A Tool for Communicating Outdoor Experiences in Urban Planning......... 799
Kerstin Nordin, Swedish University of Agricultural Sciences, Sweden
Ulla Berglund, Swedish University of Agricultural Sciences, Sweden

Chapter 50
Landscape Biodiversity Characterization in Ecoregion 29 Using MODIS ........................................ 815
Nitin Kumar Tripathi, Asian Institute of Technology, Thailand
Aung Phey Khant, Asian Institute of Technology, Thailand

Chapter 51
The Spatially Interactive Literature Analysis System Study Tool: A GIS-Based Approach to
Interpreting History in the Classroom.................................................................................................. 832
Alyssa K. Moore, University of Redlands, USA
Lillian I. Larsen, University of Redlands, USA
Diana Stuart Sinton, University of Redlands, USA

Chapter 52
Geographic Visual Query Languages and Ambiguities Treatment...................................................... 848
Arianna D’Ulizia, Consiglio Nazionale delle Ricerche, IRPPS, Italy
Fernando Ferri, Consiglio Nazionale delle Ricerche, IRPPS, Italy
Patrizia Grifoni, Consiglio Nazionale delle Ricerche, IRPPS, Italy

Chapter 53
Multi Depot Probabilistic Vehicle Routing Problems with a Time Window: Theory, Solution and
Application........................................................................................................................................... 857
Sutapa Samanta, Maryland State Highway Administration, USA
Manoj K. Jha, Morgan State University, USA
Chapter 54
Estimating Potential Woody Biomass in Communal Savanna Woodlands from Synthetic Aperture
Radar (SAR)......................................................................................................................................... 880
Charles Paradzayi, University of Johannesburg, South Africa & Midlands State University,
Zimbabwe
Harold J. Annegarn, University of Johannesburg, South Africa

Chapter 55
OOSTethys/Oceans IE Service Registry Based on Catalog Service for Web...................................... 890
Manil Maskey, The University of Alabama in Huntsville, USA
Helen Conover, The University of Alabama in Huntsville, USA
Ken Keiser, The University of Alabama in Huntsville, USA
Luis Bermudez, Southeastern Universities Research Association, USA
Sara Graves, The University of Alabama in Huntsville, USA

Chapter 56
A Multi-Criteria GIS Site Selection for Sustainable Cocoa Development in West Africa:
A Case Study of Nigeria...................................................................................................................... 912
Tunrayo Alabi, International Institute of Tropical Agriculture, Nigeria
Kai Sonder, International Maize and Wheat Improvement Center, Mexico
Olusoji Oduwole, Cocoa Research Institute of Nigeria, Nigeria
Christopher Okafor, International Institute of Tropical Agriculture, Nigeria

Chapter 57
The Columbia Regional Geospatial Service Center System: A Proven Model for the
United States........................................................................................................................................ 926
P. R. Blackwell, Stephen F. Austin State University, USA
Darrel McDonald, Stephen F. Austin State University, USA

Chapter 58
Mapping the Distribution of Tsetse Flies in Eastern Uganda: A Geoinformatics Approach............... 938
Teddy Nakato, Ministry of Agriculture, Uganda
O. O. Jegede, Obafemi Awolowo University, Nigeria
Ayanlade Ayansina, Obafemi Awolowo University, Nigeria
V. F. Olaleye, Obafemi Awolowo University, Nigeria
Bolarin Olufemi, University of Ilorin Nigeria

Chapter 59
Open Source Based Deployment of Environmental Data into Geospatial Information
Infrastructures...................................................................................................................................... 952
José Gil, Institute of New Imaging Technologies, Universitat Jaume I, Spain
Laura Díaz, Institute of New Imaging Technologies, Universitat Jaume I, Spain
Carlos Granell, Institute for Environment and Sustainability, Italy
Joaquín Huerta, Institute of New Imaging Technologies, Universitat Jaume I, Spain
Chapter 60
Application of Topographical Capture Techniques for Modelling Virtual Reality: From the Static
Object to the Human Figure................................................................................................................. 970
Mercedes Farjas Abadía, Universidad Politécnica de Madrid, Spain
Manuel Sillero Quintana, Universidad Politécnica de Madrid, Spain
Pedro Ángel Merino Calvo, Universidad Politécnica de Madrid, Spain

Chapter 61
GIS Implementation in Malaysian Statutory Development Plan System . ......................................... 991
Muhammad Faris Abdullah, International Islamic University Malaysia, Malaysia
Alias Abdullah, International Islamic University Malaysia, Malaysia
Rustam Khairi Zahari, International Islamic University Malaysia, Malaysia

Chapter 62
3D City Modeling and Visualization for Smart Phone Applications ................................................ 1011
Juha Hyyppä, Finnish Geodetic Institute, Finland
Lingli Zhu, Finnish Geodetic Institute, Finland
Zhengjun Liu, Chinese Academy of Surveying and Mapping, China
Harri Kaartinen, Finnish Geodetic Institute, Finland
Anttoni Jaakkola, Finnish Geodetic Institute, Finland

Chapter 63
Assessing the Success of an Industrial Cluster ................................................................................. 1053
Neil Reid, The University of Toledo, USA
Bruce W. Smith, Bowling Green State University, USA

Chapter 64
A Geospatial Analysis of Convective Rainfall Regions within Tropical Cyclones after
Landfall ............................................................................................................................................. 1069
Corene J. Matyas, University of Florida, USA

Section 5
Organizational and Social Implications
This section includes a wide range of research pertaining to the social and behavioral impact of Geo-
graphic Information Systems around the world. Chapters introducing this section critically analyze and
discuss trends in Geographic Information Systems, such as historical GIS, volunteered GIS, and health-
care applications of GIS. Additional chapters included in this section look at collaborative mapping,
agriculture, and conservation. Also investigating a concern within the field of Geographic Information
Systems is research which discusses the effect of trust and security on Geographic Information Systems.
With 19 chapters, the discussions presented in this section offer research into the integration of global
Geographic Information Systems as well as implementation of ethical and workflow considerations for
all organizations.
Chapter 65
The Digital Geography Lab at Salem State University: The Evolution of One of the Oldest
Educational Digital Geospatial Labs................................................................................................. 1090
Kym Pappathanasi, Salem State University, USA
Stephen Young, Salem State University, USA

Chapter 66
Analysis of Tourist Behavior Based on Tracking Data Collected by GPS........................................ 1100
Oriol Bernadó, Science and Technology Park for Tourism and Leisure, Spain
Albert Bigorra, Science and Technology Park for Tourism and Leisure, Spain
Yolanda Pérez, University Rovira i Virgili, Spain
Antonio P. Russo, University Rovira i Virgili, Spain
Salvador Anton Clave, University Rovira i Virgili, Spain

Chapter 67
The RiskCity Training Package on Multi-Hazard Risk Assessment................................................. 1120
Cees van Westen, University of Twente, The Netherlands
Tsehaie Woldai, University of Twente, The Netherlands

Volume III
Chapter 68
Automatic Classification of Decorative Patterns in the Minoan Pottery of Kamares Style.............. 1132
Filippo Stanco, University of Catania, Italy
Davide Tanasi, Arcadia University, USA
Giuseppe Claudio Guarnera, University of Catania, Italy
Giovanni Gallo, University of Catania, Italy

Chapter 69
Cultural Dasymetric Population Mapping with Historical GIS: A Case Study from the Southern
Appalachians...................................................................................................................................... 1151
George Towers, Concord University, USA

Chapter 70
Using Volunteered Geographic Information to Assess the Spatial Distribution of West Nile
Virus in Detroit, Michigan................................................................................................................. 1170
Kevin P. McKnight, Michigan Department of Transportation & Michigan State University, USA
Joseph P. Messina, Michigan State University, USA
Ashton M. Shortridge, Michigan State University, USA
Meghan D. Burns, Montana Natural Heritage Program, USA
Bruce W. Pigozzi, Michigan State University, USA
Chapter 71
GIS Application of Healthcare Data for Advancing Epidemiological Studies ................................. 1183
Joseph M. Woodside, Cleveland State University, USA
Iftikhar U. Sikder, Cleveland State University, USA

Chapter 72
Colorado 14ers, Pixel by Pixel........................................................................................................... 1200
Brandon J. Vogt, University of Colorado at Colorado Springs, USA

Chapter 73
A Component-Based 3D Geographic Simulation Framework and its Integration with a
Legacy GIS........................................................................................................................................ 1216
Zhen-Sheng Guo, Hokkaido University,, Japan
Yuzuru Tanaka, Hokkaido University, Japan

Chapter 74
Collaborative Mapping and GIS: An Alternative Geographic Information Framework................... 1231
Edward Mac Gillavry, Webmapper, The Netherlands

Chapter 75
Provision of Web 2.0 Services by Interoperable GIS-Powered Local Administration Portal
Systems ............................................................................................................................................. 1243
Anastasios Tsitsanis, National Technical University of Athens, Greece
Sotirios Koussouris, National Technical University of Athens, Greece
Rob Peters, University of Amsterdam, The Netherlands

Chapter 76
Sharing Environmental Data through GEOSS................................................................................... 1260
Gregory Giuliani, University of Geneva and UNEP, Switzerland
Nicolas Ray, University of Geneva and UNEP, Switzerland
Stefan Schwarzer, UNEP, Switzerland
Andrea De Bono, University of Geneva and UNEP, Switzerland
Pascal Peduzzi, UNEP, Switzerland
Hy Dao, University of Geneva and UNEP, Switzerland
Jaap Van Woerden, UNEP, Switzerland
Ron Witt, UNEP, Switzerland
Martin Beniston, University of Geneva, Switzerland
Anthony Lehmann, University of Geneva and UNEP, Switzerland

Chapter 77
Agriculture and Conservation in the Natura 2000 Network: A Sustainable Development
Approach of the European Union...................................................................................................... 1276
Cristian Ioja, University of Bucharest, Romania
Laurenţiu Rozylowicz, University of Bucharest, Romania
Maria Patroescu, University of Bucharest, Romania
Mihai Niţă, University of Bucharest, Romania
Diana Onose, University of Bucharest, Romania
Chapter 78
A Neural Network for Modeling Multicategorical Parcel Use Change . .......................................... 1297
Kang Shou Lu, Towson University, USA
John Morgan, Towson University, USA
Jeffery Allen, Clemson University, USA

Chapter 79
Historical GIS as a Platform for Public Memory at Mammoth Cave National Park . ...................... 1309
Katie Algeo, Western Kentucky University, USA
Ann Epperson, Western Kentucky University, USA
Matthew Brunt, Western Kentucky University, USA

Chapter 80
Applied Geospatial Perspectives on the Rock Art of the Lake of the Woods Region of Ontario,
Canada................................................................................................................................................ 1328
John W. Norder, Michigan State University, USA
Jon W. Carroll, Michigan State University, USA

Chapter 81
Cartography and Stability to Enhance and Self Regulate Proactive Routing in MANETs .............. 1345
Mohamed Amine Abid, University of Manouba, Tunisia
Abdelfettah Belghith, University of Manouba, Tunisia

Chapter 82
Learning Geospatial Concepts as Part of a Non-Formal Education Robotics Experience................ 1368
Viacheslav Adamchuk, McGill University, Canada
Bradley Barker, University of Nebraska-Lincoln, USA
Gwen Nugent, University of Nebraska-Lincoln, USA
Neal Grandgenett, University of Nebraska at Omaha, USA
Megan Patent-Nygren, University of Nebraska-Lincoln, USA
Collin Lutz, Virginia Polytechnic Institute and State University, USA
Kathy Morgan, University of Nebraska-Lincoln, USA

Chapter 83
A Reflection on the Ph.D. Program in Spatially Integrated Social Science at the University of
Toledo ............................................................................................................................................... 1385
Bhuiyan Monwar Alam, The University of Toledo, USA
Jeanette Eckert, The University of Toledo, USA
Peter S. Lindquist, The University of Toledo, USA
Section 6
Managerial Impact
This section presents contemporary coverage of the social implications of Geographic Information
Systems, more specifically related to the corporate and managerial utilization of information sharing
technologies and applications, and how these technologies can be extrapolated to be used in Geographic
Information Systems. Core ideas such as urban geospatial management, environmental quality monitor-
ing, city competitiveness, and other determinants that affect the intention to adopt technological innova-
tions in Geographic Information Systems are discussed. Equally as crucial, chapters within this section
discuss how leaders can utilize Geographic Information Systems applications to get the best outcomes
from their governors and their citizens.

Chapter 84
Urban Geospatial Management System............................................................................................. 1392
Rudj Maria Todaro, Insula S.p.A., Italy
Ayça Cangel, International Competitiveness Research Institute, Turkey

Chapter 85
Technical and Economical Feasibility of Biomass Use for Power Generation in Sicily . ................ 1411
Antonio Messineo, Kore University of Enna, Italy
Domenico Panno, University of Palermo, Italy
Roberto Volpe, University of Palermo, Italy

Chapter 86
Managing Uncertainty in Geospatial Predictive Models .................................................................. 1421
Iftikhar U. Sikder, Cleveland State University, USA

Chapter 87
Participatory GIS for Integrating Local and Expert Knowledge in Landscape Planning . ............... 1429
Biancamaria Torquati, University of Perugia, Italy
Marco Vizzari, University of Perugia, Italy
Carlo Sportolaro, Agronomist, Perugia, Italy

Chapter 88
Location Management in PCS Networks Using Base Areas (BAs) and 2 Level Paging (2LP)
Schemes............................................................................................................................................. 1448
Hesham A. Ali, Mansoura University, Egypt
Ahmed I. Saleh, Mansoura University, Egypt
Mohammed H. Ali, Mansoura University, Egypt

Chapter 89
The Role of GIS in City Competitiveness......................................................................................... 1476
Khan R. Rahaman, University of Minho, Portugal
Júlia M. Lourenço, University of Minho, Portugal
Chapter 90
Using Geospatial Information Systems for Strategic Planning and Institutional Research............... 1502
Nicolas A. Valcik, The University of Texas at Dallas, USA

Chapter 91
Harnessing Nigeria’s Investment in Satellite Technology for Sustainable Agriculture and Food
Security ............................................................................................................................................. 1515
Zubair A. Opeyemi, National Space Research and Development Agency, Nigeria
J. O. Akinyede, African Regional Centre for Space and Technology Education, Nigeria

Chapter 92
E-Planning through the Wisconsin Land Information Program: The Contexts of Power, Politics
and Scale............................................................................................................................................ 1525
Patrice Day, University of Wisconsin-Milwaukee, USA
Rina Ghose, University of Wisconsin-Milwaukee, USA

Chapter 93
Rapid Evaluation of Arid Lands (REAL): A Methodology............................................................... 1541
Daniel P. Dugas, New Mexico State University, USA
Michael N. DeMers, New Mexico State University, USA
Janet C. Greenlee, New Mexico State University, USA
Walter G. Whitford, New Mexico State University, USA
Anna Klimaszewski-Patterson, New Mexico State University, USA

Chapter 94
Environmental Quality Monitoring, Using GIS as a Tool of Visualization, Management and Decision-
Making: Applications Emerging from the EU Water Framework Directive EU 2000/60................. 1559
Christophoros Christophoridis, Aristotle University of Thessaloniki, Greece
Erasmia Bizani, Aristotle University of Thessaloniki, Greece
Konstantinos Fytianos, Aristotle University of Thessaloniki, Greece

Chapter 95
Disaster Management and Virtual Globes: A High Potential for Developing Countries................... 1587
Gunter Zeug, European Environment Agency, Denmark
Dominik Brunner, Fraunhofer Institute of Optronics, Germany

Chapter 96
Global Emergency-Response System Using GIS.............................................................................. 1604
Salem Al-Marri, Leeds Metropolitan University LEEDS, UK
Muthu Ramachandran, Leeds Metropolitan University, LEEDS UK

Chapter 97
Leveraging the Science of Geographic Information Systems............................................................ 1612
Rick Bunch, University of North Carolina at Greensboro, USA
Anna Tapp, University of North Carolina at Greensboro, USA
Prasad Pathak, University of North Carolina at Greensboro, USA
Chapter 98
Formalizing Cross-Parameter Conditions for Geoprocessing Service Chain Validation ................. 1619
Daniel Fitzner, Fraunhofer IGD, Germany

Section 7
Critical Issues
This section contains 16 chapters, providing a wide variety of perspectives on Geographic Information
Systems and its implications. Such perspectives include readings in ethics, theory, practice, and a great
deal more. The section also discusses new educational possibilities for uses of GIS. Within the chapters,
the reader is presented with an in-depth analysis of the most current and relevant issues within this
growing field of study. Crucial questions are addressed and alternatives offered, such as what makes
up communities, and how geographic information systems and other spatial technologies help define
such groups.

Chapter 99
Qualitative Spatial Reasoning for Applications: New Challenges and the SparQ Toolbox.............. 1639
Diedrich Wolter, Universität Bremen, Germany
Jan Oliver Wallgrün, Universität Bremen, Germany

Chapter 100
Applied Geography Education in Focus: Strategic Panel Session..................................................... 1665
Nairne Cameron, Algoma University, Canada
Edwin Butterworth, CACI International, USA
Dawna L. Cerney, Youngstown State University, USA
William J. Gribb, University of Wyoming, USA
Kingsley E. Haynes, George Mason University, USA
Bill Hodge, City of Midland, USA
Robert B. Honea, KU Transportation Research Institute, USA
Brandon J. Vogt, University of Colorado Colorado Springs, USA

Chapter 101
Temporal and Spatial Consistency . .................................................................................................. 1675
Oliver Duke-Williams, University of Leeds, UK
John Stillwell, University of Leeds, UK

Volume IV
Chapter 102
Validation Approaches to Volcanic Explosive Phenomenology ....................................................... 1697
Sébastien Dartevelle, Los Alamos National Laboratory, USA
Chapter 103
Target Evaluation and Correlation Method (TECM) as an Assessment Approach to Global Earth
Observation System of Systems (GEOSS)........................................................................................ 1724
Samuel Epelbaum, Pace University, USA
Mo Mansouri, Stevens Institute of Technology, USA
Alex Gorod, Stevens Institute of Technology, USA
Brian Sauser, Stevens Institute of Technology, USA
Alexander Fridman, Russian Academy of Science, Russia

Chapter 104
Integrated Ontologies for Spatial Scene Descriptions ...................................................................... 1751
Sotirios Batsakis, Technical University of Crete, Greece
Euripides G.M. Petrakis, Technical University of Crete, Greece

Chapter 105
Embracing Geographic Analysis beyond Geography: Harvard’s Center for Geographic Analysis
Enters 5th Year................................................................................................................................... 1764
Weihe (Wendy) Guan, Harvard University, USA
Peter K. Bol, Harvard University, USA

Chapter 106
Distributed Geospatial Data Management for Entomological and Epidemiological Studies . ......... 1773
Hugo Martins, Universidade Nova de Lisboa, Portugal & Universidade Técnica de Lisboa, Portugal
Jorge G. Rocha, Universidade do Minho, Portugal

Chapter 107
An Assessment of Several Taxonomies of Volunteered Geographic Information............................. 1794
Antony K Cooper, CSIR, South Africa & University of Pretoria, South Africa
Serena Coetzee, University of Pretoria, South Africa
Derrick G Kourie, University of Pretoria, South Africa

Chapter 108
Logistics for the Garbage Collection through the use of Ant Colony Algorithms ........................... 1809
Julio Cesar Ponce Gallegos, Autonomous University of Aguacalientes, Mexico
Fatima Sayuri Quezada Aguilera, Autonomous University of Aguacalientes, Mexico
José Alberto Hernandez Aguilar, Autonomous University of Morelos, Mexico
Christian José Correa Villalón, Institute of Education of Aguascalientes, Mexico

Chapter 109
The Efficacy of Aerial Search during the Battle of Midway............................................................. 1828
Denis J. Dean, University of Texas at Dallas, USA

Chapter 110
Communities, Cartography and GIS: Enhancing Undergraduate Geographic Education with Service
Learning............................................................................................................................................. 1848
Timothy L. Hawthorne, Columbus State University, USA
Chapter 111
Relating Transportation Quality Indicators to Economic Conditions in the South-Central U.S....... 1864
Jonathan C. Comer, Oklahoma State University, USA
Amy K. Graham, Oklahoma State University, USA
Stacey R. Brown, Oklahoma State University, USA

Chapter 112
Monitoring, Analyzing and Understanding the Dynamics of Complex Processes: The Case of the
Public Debate on Pesticides in The Netherlands............................................................................... 1883
J.S. Buurma, Wageningen UR, The Netherlands

Chapter 113
Spatial Model Approach for Deforestation: Case Study in Java Island, Indonesia........................... 1901
Lilik B. Prasetyo, Bogor Agriculture University, Indonesia
Chandra Irawadi Wijaya, Bogor Agriculture University, Indonesia
Yudi Setiawan, Bogor Agriculture University, Indonesia

Chapter 114
Geo-Informatics for Land Use and Biodiversity Studies . ................................................................ 1913
P. K. Joshi, TERI University, India
Neena Priyanka, TERI University, India

Section 8
Emerging Trends
This section highlights research potential within the field of Geographic Information Systems while
exploring uncharted areas of study for the advancement of the discipline. Introducing this section are
chapters that set the stage for future research directions and topical suggestions for continued debate,
centering on the new venues and forums for discussion. A pair of chapters on data mining and urban
planning makes up the middle of the section of the final 16 chapters, and the book concludes with a
look ahead into the future of the Geographic Information Systems field, with “Toward a Roadmap to
E-Government for a Better Governance.” In all, this text will serve as a vital resource to practitioners
and academics interested in the best practices and applications of the burgeoning field of Geographic
Information Systems.

Chapter 115
Neogeography ................................................................................................................................... 1941
Judith Gelernter, Carnegie Mellon University, USA

Chapter 116
Estimating Fractional Snow Cover in Mountain Environments with Fuzzy Classification ............. 1953
Clayton J. Whitesides, Texas State University-San Marcos, USA
Matthew H. Connolly, Texas State University-San Marcos, USA
Chapter 117
Virtual Neighborhoods and E-Government: A Case Study Comparison........................................... 1974
Rebecca Moody, Erasmus University Rotterdam, The Netherlands
Dennis de Kool, Center for Public Innovation, The Netherlands
Victor Bekkers, Erasmus University Rotterdam, The Netherlands

Chapter 118
Fractal Estimation Using Extended Triangularisation and Box Counting Algorithm for any Geo-
Referenced Point Data in GIS............................................................................................................ 1988
R. Sridhar, Sri Ramakrishna Mission Vidyalaya College of Arts and Science, India
S. Balasubramaniam, JSS University, India

Chapter 119
Data Mining Location-Based Social Networks for Geospatial Discovery........................................ 2006
Edward Pultar, University of California – Santa Barbara, USA

Chapter 120
Discovering Geosensor Data By Means of an Event Abstraction Layer........................................... 2020
Alejandro Llaves, University of Muenster, Germany
Thomas Everding, University of Muenster, Germany

Chapter 121
Towards a Rwandan NSDI: An Update............................................................................................. 2040
Felicia O. Akinyemi, Kigali Institute of Science and Technology, Rwanda

Chapter 122
Intelligent Freight Transportation System: The Effects on Transportation Operations
Performance....................................................................................................................................... 2051
Gunnar Stefansson, Chalmers University of Technology, Sweden & University of Iceland, Iceland

Chapter 123
Visualizing Plant Community Change Using Historical Records .................................................... 2063
Evelyn Brister, Rochester Institute of Technology, USA
Elizabeth Hane, Rochester Institute of Technology, USA
Karl Korfmacher, Rochester Institute of Technology, USA

Chapter 124
Sustainable Land Use and Watershed Management in Response to Climate Change Impacts:
Overview and Proposed Research Techniques.................................................................................. 2080
Nguyen Kim Loi, Nong Lam University, Vietnam

Chapter 125
Modeling Species Distribution.......................................................................................................... 2102
Yongyut Trisurat, Kasetsart University, Thailand
Albertus G. Toxopeus, University of Twente, The Netherlands
Chapter 126
A GIS Methodology for Assessing the Safety Hazards of Abandoned Mine Lands (AMLs):
Application to the State of Pennsylvania........................................................................................... 2128
Timothy J. Dolney, Penn State University – Altoona, USA

Chapter 127
Innovative ICT Applications in Transport and Logistics: Some Evidence from Asia....................... 2150
Mark Goh, University of South Australia, Australia & National University of Singapore, Singapore
Kym Fraser, University of South Australia, Australia

Chapter 128
3D Camera Tracking for Mixed Reality using Multi-Sensors Technology....................................... 2164
Fakhreddine Ababsa, University of Evry Val d’Essonne, France
Iman Maissa Zendjebil, University of Evry Val d’Essonne, France
Jean-Yves Didier, University of Evry Val d’Essonne, France

Chapter 129
Automatic Metadata Generation for Geospatial Resource Discovery............................................... 2176
Miguel-Angel Manso-Callejo, Universidad Politécnica de Madrid, Spain
Arturo Beltran Fonollosa, Universitat Jaume I de Castellón, Spain

Chapter 130
Semantic Enrichment for Geospatial Information in a Tourism Recommender System................... 2208
Joan de la Flor, Science and Technology Park for Tourism and Leisure, Spain
Joan Borràs, Science and Technology Park for Tourism and Leisure, Spain
David Isern, Universitat Rovira i Virgili, Spain
Aida Valls, Universitat Rovira i Virgili, Spain
Antonio Moreno, Universitat Rovira i Virgili, Spain
Antonio Russo, Universitat Rovira i Virgili, Spain
Yolanda Pérez, Universitat Rovira i Virgili, Spain
Salvador Anton-Clavé, Universitat Rovira i Virgili, Spain
xxxii

Preface

The constantly changing landscape of Geographic Information Systems makes it challenging for experts
and practitioners to stay informed of the field’s most up-to-date research. That is why Information Sci-
ence Reference is pleased to offer this three-volume reference collection that will empower students,
researchers, and academicians with a strong understanding of critical issues within Geographic Informa-
tion Systems by providing both broad and detailed perspectives on cutting-edge theories and develop-
ments. This reference is designed to act as a single reference source on conceptual, methodological,
technical, and managerial issues, as well as provide insight into emerging trends and future opportunities
within the discipline.
Geographic Information Systems: Concepts, Methodologies, Tools, and Applications is organized
into eight distinct sections that provide comprehensive coverage of important topics. The sections are:
(1) Fundamental Concepts and Theories, (2) Development and Design Methodologies, (3) Tools and
Technologies, (4) Utilization and Application, (5) Organizational and Social Implications, (6) Manage-
rial Impact, (7) Critical Issues, and (8) Emerging Trends. The following paragraphs provide a summary
of what to expect from this invaluable reference tool.
Section 1, Fundamental Concepts and Theories, serves as a foundation for this extensive reference
tool by addressing crucial theories essential to the understanding of Geographic Information Systems.
Introducing the book is “Showing the Past” by Willington Siabato, Alberto Fernández-Wyttenbach, and
Miguel-Ángel Bernabé-Poveda, a great foundation laying the groundwork for the basic concepts and
theories that will be discussed throughout the rest of the book. Another chapter of note in Section 1 is
titled “GIS, Grid Computing and RFID in Healthcare Information Supply Chain” by Yenming J. Chen,
which discusses the growing use of GIS and distributed computing in the health informatics and clinical
technologies industry. Section 1 concludes, and leads into the following portion of the book with a nice
segue chapter, “Virtual Environments for Geospatial Applications,” by Magesh Chandramouli and Bo
Huang. Where Section 1 leaves off with fundamental concepts, Section 2 discusses architectures and
frameworks in place for Geographic Information Systems.
Section 2, Development and Design Methodologies, presents in-depth coverage of the conceptual
design and architecture of Geographic Information Systems, focusing on aspects including territorial
analysis, spatial database design, augmented discovery, semantic interoperability, location privacy, semantic
annotation, and many more topics. Opening the section is “GIS Technique for Territorial Analysis” by
Francesco Riccioli and Toufic El Asmar. This section is vital for developers and practitioners who want to
measure and track the progress of Geographic Information Systems on a local, national, or international
level. Through case studies, this section lays excellent groundwork for later sections that will get into
present and future applications for Geographic Information Systems, including, of note: “Methodolo-
gies for Augmented Discovery of Geospatial Resources” by Mattia Santoro, Paolo Mazzetti, Stefano
xxxiii

Nativi, Cristiano Fugazza, Carlos Granell, and Laura Díaz, and “User-Friendly Geoportal Interfaces
for Geospatial Resource Discovery” by Victor Pascual Ayats. The section concludes with an excellent
work by Hazel Owen, titled “A Package-Based Architecture for Customized GIS.”
Section 3, Tools and Technologies, presents extensive coverage of the various tools and technologies
used in the implementation of Geographic Information Systems. Section 3 begins where Section 2 left
off, though this section describes more concrete tools at place in the modeling, planning, and applications
of Geographic Information Systems. The first chapter, “Algorithms for 3D Map Segment Registration,”
by Hao Men and Kishore Pochiraju, contains a great example of the types of works that can be found in
this section, a perfect resource for practitioners looking into new technologies within the field. Section
3 is full of excellent chapters like this one, including such titles as “A New Image Distortion Measure
Based on Natural Scene Statistics Modeling,” “Map Matching Algorithms for Intelligent Transport
Systems,” and “Multi-Criteria Decision Analysis for Identifying a Suitable Location for Groundwater
Pumping Wells” to name a few. Where Section 3 described specific tools and technologies at the disposal
of practitioners, Section 4 describes successes, failures, best practices, and different applications of the
tools and frameworks discussed in previous sections.
Section 4, Utilization and Application, describes how the broad range of Geographic Information
Systems efforts has been utilized and offers insight on and important lessons for their applications and
impact. Section 4 includes the widest range of topics because it describes case studies, research, method-
ologies, frameworks, architectures, theory, analysis, and guides for implementation. Topics range from
landscape biodiversity characterization, geographic visual query languages, and estimating biomass, to
open source deployment, topographical capture, and geospatial analysis. The first chapter in the section
is titled “Children’s Maps in GIS,” which was written by Kerstin Nordin and Ulla Berglund. The breadth
of topics covered in the chapter is also reflected in the diversity of its authors, from countries all over
the globe, including Malaysia, Thailand, Sweden, Italy, Nigeria, Uganda, Zimbabwe, South Africa,
the United States and more. Section 4 concludes with an excellent view of a case study in technology
implementation and use, “A Geospatial Analysis of Convective Rainfall Regions within Tropical Cyclones
after Landfall” by Corene J. Matyas.
Section 5, Organizational and Social Implications, includes chapters discussing the organizational
and social impact of Geographic Information Systems. The section opens with “The Digital Geography
Lab at Salem State University” by Stephen S. Young and Kym Pappathanasi. Where Section 4 focused
on the broad, many applications of Geographic Information Systems technology, Section 5 focuses ex-
clusively on how these technologies affect human lives, either through the way they interact with each
other, or through how they affect behavioral/workplace situations. Other interesting chapters of note
in Section 5 include “Using Volunteered Geographic Information to Assess the Spatial Distribution of
West Nile Virus in Detroit, Michigan” by Kevin P. McKnight, Joseph P. Messina, Ashton M. Shortridge,
Meghan D. Burns, and Bruce W. Pigozzi, and “Cartography and Stability to Enhance and Self Regulate
Proactive Routing in MANETs” by Mohamed Amine Abid and Abdelfettah Belghith. Section 5 concludes
with a fascinating study of a new development in Geographic Information Systems, in “A Reflection
on the Ph.D. Program in Spatially Integrated Social Science at the University of Toledo,” written by
Bhuiyan Monwar Alam, Jeanette Eckert, and Peter S. Lindquist.
Section 6, Managerial Impact, presents focused coverage of Geographic Information Systems as it
relates to effective uses of urban geospatial management, location management, city competitiveness,
e-planning, environmental quality monitoring, disaster management, and much more. This section serves
as a vital resource for developers who want to utilize the latest research to bolster the capabilities and
xxxiv

functionalities of their processes. The section begins with “Urban Geospatial Management System,” a
great look into how city planners and civil engineers can use the latest geospatial technologies to more
effectively practice their work. The 15 chapters in this section offer unmistakable value to managers
looking to implement new strategies that work at larger bureaucratic levels. The section concludes with
“Formalizing Cross-Parameter Conditions for Geoprocessing Service Chain Validation” by Daniel
Fitzner. Where Section 6 leaves off, section seven picks up with a focus on some of the more content-
theoretical material of this compendium.
Section 7, Critical Issues, presents coverage of academic and research perspectives on Geographic
Information Systems tools and applications. The section begins with “Qualitative Spatial Reasoning for
Applications,” by Diedrich Wolter and Jan Oliver Wallgrün. Other issues covered in detail in Section
7 include geography education, taxonomies of volunteered information, community and GIS, complex
processes in GIS, and much more. The section concludes with “Geo-Informatics for Land Use and Biodi-
versity Studies” by P. K. Joshi and Neena Priyanka, a great transitional chapter between Sections 7 and 8
because it examines an important question going into the future of the field. The last chapter manages to
show a theoretical look into future and potential technologies, a topic covered in more detail in Section 8.
Section 8, Emerging Trends, highlights areas for future research within the field of Geographic
Information Systems, opening with “Neogeography” by Judith Gelernter. Section 8 contains chapters
that look at what might happen in the coming years that can extend the already staggering amount of
applications for Geographic Information Systems. Other chapters of note include “Data Mining Location-
Based Social Networks for Geospatial Discovery” and “3D Camera Tracking for Mixed Reality using
Multi-Sensors Technology.” The final chapter of the book looks at an emerging field within Geographic
Information Systems, in the excellent contribution, “Semantic Enrichment for Geospatial Information
in a Tourism Recommender System” by David Isern, Antonio Moreno, Joan de la Flor, Joan Borràs, Aida
Valls, Antonio Russo, Yolanda Pérez, and Salvador Anton-Clavé.
Although the primary organization of the contents in this multi-volume work is based on its eight
sections, offering a progression of coverage of the important concepts, methodologies, technologies,
applications, social issues, and emerging trends, the reader can also identify specific contents by utilizing
the extensive indexing system listed at the end of each volume. Furthermore to ensure that the scholar,
researcher, and educator have access to the entire contents of this multi volume set as well as additional
coverage that could not be included in the print version of this publication, the publisher will provide
unlimited multi-user electronic access to the online aggregated database of this collection for the life
of the edition, free of charge when a library purchases a print copy. This aggregated database provides
far more contents than what can be included in the print version, in addition to continual updates. This
unlimited access, coupled with the continuous updates to the database ensures that the most current
research is accessible to knowledge seekers.
As a comprehensive collection of research on the latest findings related to using technology to
providing various services, Geographic Information Systems: Concepts, Methodologies, Tools, and
Applications, provides researchers, administrators and all audiences with a complete understanding
of the development of applications and concepts in Geographic Information Systems. Given the vast
number of issues concerning usage, failure, success, policies, strategies, and applications of Geographic
Information Systems in countries around the world, Geographic Information Systems: Concepts, Meth-
odologies, Tools, and Applications addresses the demand for a resource that encompasses the most
pertinent research in technologies being employed to globally bolster the knowledge and applications
of Geographic Information Systems.
Section 1
Fundamental Concepts and
Theories

This section serves as a foundation for this exhaustive reference tool by addressing underlying principles essential
to the understanding of Geographic Information Systems. Chapters found within these pages provide an excellent
framework in which to position Geographic Information Systems within the field of information science and
technology. Insight regarding the critical incorporation of global measures into Geographic Information Systems
is addressed, while crucial stumbling blocks of this field are explored. With 15 chapters comprising this founda-
tional section, the reader can learn and chose from a compendium of expert research on the elemental theories and
technological building blocks underscoring the Geographic Information Systems discipline.
1

Chapter 1
Showing the Past:
Integrating Cartographic Collections
through Virtual Map Rooms

Willington Siabato
Technical University of Madrid, Spain

Alberto Fernández-Wyttenbach
Technical University of Madrid, Spain

Miguel-Ángel Bernabé-Poveda
Technical University of Madrid, Spain

ABSTRACT
Spatial Data Infrastructures have become a methodological and technological benchmark enabling
distributed access to historical-cartographic archives. However, it is essential to offer enhanced virtual
tools that imitate the current processes and methodologies that are carried out by librarians, historians
and academics in the existing map libraries around the world. These virtual processes must be supported
by a generic framework for managing, querying, and accessing distributed georeferenced resources and
other content types such as scientific data or information. The authors have designed and developed
support tools to provide enriched browsing, measurement and geometrical analysis capabilities, and
dynamical querying methods, based on SDI foundations. The DIGMAP engine and the IBERCARTO
collection enable access to georeferenced historical-cartographical archives. Based on lessons learned
from the CartoVIRTUAL and DynCoopNet projects, a generic service architecture scheme is proposed.
This way, it is possible to achieve the integration of virtual map rooms and SDI technologies bringing
support to researchers within the historical and social domains.

INTRODUCTION accessing and visualizing historical cartographic


information keeping in mind that this information
At the present time, almost all map libraries on has a degree of quality that depends upon elements
the Internet are image collections generated by the such as the accuracy of the digitization process
digitization of early maps. These types of graphic and proprietary constraints (e.g. visualization,
files provide researchers with the possibility of resolution downloading options, copyright, use
constraints). In most cases, access to these map
DOI: 10.4018/978-1-4666-2038-4.ch001 libraries is useful only as a first approach and it

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Showing the Past

is not possible to use those maps for scientific the early maps on Google´s globe. This way, it is
work due to the sparse tools available to measure, possible to visualize them georeferenced and to
match, analyze and/or combine those resources compare them with contemporaneous cartography.
with different kinds of cartography (Fernández- Over one hundred and twenty historical maps
Wyttenbach, Siabato, Bernabé, & Wachowicz, have been selected by David Rumsey from his
2010). It is necessary to design a set of tools that collection of more than 150,000 historical maps
offer the advantages of real map libraries into a to be shown on the Google viewers. This small
virtual environment, offering advanced online sample is a good example of how geographic
measurement and georeferencing tools, providing environments can be used for sharing early maps.
the capability of comparative analysis between One step ahead is The Alexandria Digital
two or more maps and specific tools in order to Library (2009) geographical browser. This in-
facilitate querying and visualizing process. teresting geographic approach provides tools for
Two such online sources are The American discovering and accessing maps. It offers a rich
Geographical Society Library Digital Map Collec- interface for querying the resources providing up
tion (2009) and the LUNA Browser (Rumsey, 2010) to four different panels of search parameters. The
of the David Rumsey Map Collection (Rumsey, main difference between Alexandria’s interface
2011). The American Geographical Society col- and the Rumsey approach is that the first one
lection contains over 500 maps ranging from early comprises a single interface for accessing any
maps of Asia to historical maps of Wisconsin and map from the collections and it is possible to see
Milwaukee. The David Rumsey Map Collection, more than one resource at the same time. This
by contrast, has over 25,000 maps and images service is also based on Google Maps in order to
online, this collection focuses mainly on rare provide the geographical context. A comprehen-
18th and 19th century maps of North and South sive description of the evolution of Digital Map
America as well as other cartographic materials. Libraries and the most relevant online services,
The user interface allows accessing and querying stressing the challenges in the design of the next
certain characteristics but they do not provide tools generation of Virtual Map Rooms (VMR), is
to analyze the maps, much less integrate them in available in Evolution of Digital Map Libraries
a geographical context. Another relevant online towards Virtual Map Rooms: new challenges for
historical content provider is The World Digital the historical research (Fernández-Wyttenbach,
Library (U.S. Library of Congress & UNESCO, Siabato, Bernabé, & Wachowicz, 2010).
2010). The WDL makes it possible to discover, But while the Web mapping/visualization tools
study, and enjoy cultural treasures from around developed by Google and Microsoft provide very
the world on a single website. Its cultural treasures fast, easy-to-access views of images and maps,
include but are not limited to maps. Nevertheless, they are not suited for complex work and analy-
WDL is once again an interesting data repository ses. Nonetheless, the pervasive use of these new
in which the opportunity to provide geographic environments offers an opportunity for sharing
tools was not taken. data and specifically early maps. Therefore, it is
Applications that are more recent provide the essential provide users with environments like
geographical context but there are not enough these but offering enough tools and elements for
tools for getting the most from the maps. A first using early maps properly, not just for seeing them.
approach was the Google Maps Rumsey Historical Thus, setting up robust VMR as a tool for sup-
Maps portal of the David Rumsey Historical Maps porting historical projects in which a large num-
collection (Rumsey, 2008). These new interfaces ber of national and international researches and
(Google Maps and Google Earth) allow visualizing institutions are involved, represents an advance

2
Showing the Past

in reducing the lack of applications that could a Virtual Map Room could be useful for historians
provide historians and documentary experts with and researchers in general. Part three describes the
remote access to the existing information from implementation of the VMR, including concepts
any map library around the world and also with about the data repository, the architecture of the
a comprehensive set of tools for exploiting and service, the data server and the implementation of
analyzing these documents. Web-based mapping a Web client. Part four presents an enriched graphic
has been very important and now we are moving user interface that contains new tools for interact-
from simple mapping and geospatial visualization ing with the data repository and maps, including a
to full online geoservices with which, as Dan- querying-timeline component for accessing maps
germond (2008) asserted, ‘the GeoWeb allows easily and the MapAnalyst component software
extensive sharing of maps and data and opens up for geometrical analysis. Finally, conclusions are
access to geographic applications to everyone. presented, and further steps to strengthen the plat-
This, together with the growing availability of form implementing other services are discussed.
georeferenced content and the ability to easily
search, discover, and mash up these service, is
enabling a whole new pattern and architecture for RESEARCH CONTEXT
geographical applications.’ This pattern empha-
sizes open and interoperable services that can be It is possible to state that Spatial Data Infrastruc-
used to support a broad array of geographically tures (SDI) are by now a methodological and
related applications such as the proposed VMR. technological benchmark for publication of the
Because of that and following the evolution of cartographic heritage (Fernández-Wyttenbach,
the geographic services on the Web, the VMR Álvarez, Bernabé, & Borbinha, 2007). The use
and tools will be available through one single and of the standards and recommendations defined by
usable portal on the Internet. the Open Geospatial Consortium (OGC) enables
The proposed VMR facilitates access to access to historical-cartographic information with
digitized resources (restricted and unrestricted) a degree of detail only restricted by the quality of
and working with maps located in different map the digitalized documents and associated informa-
libraries all over the world. It also provides a set of tion issued by each supplier institution.
tools implemented for the geographical treatment Thus we underline the high potential of map
and manipulation of these resources. In addition, servers consistent with the OpenGIS standards and
it simplifies the task of the institutions that are in specifications, especially OGC Web Map Service
charge of preserving those archives: optimising Interface Standard (WMS), (Open Geospatial
human resources, guarding against misappropria- Consortium, 2006a) and its capability to set up a
tions and avoiding direct manipulation of those platform enabling distributed access to different
archives being in a fragile conservation state. historical archives. Despite this, it is necessary to
This job is at the same time an instance of find the way for creating and providing the most
how the Geographic Information Technologies appropriate virtual tools in order to undertake the
may be useful for the professional and research usual studies that are being carried out on the old
work of historians and social scientists in general, maps by librarians or researchers, either intrinsi-
remarkably contributing to the dissemination of the cally on the document or in relation to the real
History of Cartography and Cartographic Heritage world. This way, metric tools will be available
in the most specialized academic environments. to find out real distances and surfaces among
The paper is structured as follows; part two other characteristics and also to make possible
presents the research context of the project and how analysis that researchers can formulate on the

3
Showing the Past

maps (georeferenced documents). In addition to Both projects have kicked off with the aim
the conventional GIS tools, new ones have been of creating specialized virtual map libraries. The
developed to facilitate access to data repositories results of the DIGMAP (2009) project have been
and manipulation of the results of the queries. taken advantage of, providing solutions for general
From the beginning, it was observed that the users and map libraries with the purpose of pro-
appropriate scenario for development and integra- moting the cartographic heritage and supporting
tion of a virtual map room would be within the research as well as library and documentary in-
context of historical social research projects and formation systems. Beyond the merely distributed
better if these projects would have international access to all resources and simple visualization,
and local repercussion. Such is the case of the the projects intend to provide new virtual tools
DynCoopNet and CartoVIRTUAL projects in for analysis and query of documents, favoring
which the need to rely on distributed access of the safekeeping of documents and restricting
the diverse cartographic archives was recognised. physical access. A set of methods will be sought
On the one hand, the aim of DynCoopNet for digitization, geo-referencing, cataloguing,
is to provide new theoretical insights about the publishing, querying and visualization of histori-
dynamics of complex systems (the evolution of cal cartographic documents within the historical
cooperation tying together the self-organizing map libraries on the Internet.
commercial networks of the first global age) From our viewpoint, it is advisable to develop
through the implementation of GIS as data inte- and set up a VMR based on open standards and
gration engine, fed by distributed databases which taking into account the mentioned above tools. This
store large data sets and information collections, VMR will enable historians and researchers from
and by using tools for management and discovery CartoVIRTUAL, DynCoopNet, and other projects
of information and tools for temporal analysis to access the huge amount of information which
and visualization. encompasses the First Global Age, and/or any kind
On the other hand, the CartoVIRTUAL proj- of data collections that are available to be shared.
ect, from the R&D programme of the Ministry Figure 1 shows a snapshot of the proposed Web
of Science and Innovation (Spain), intends to client for the VMR.
design a methodology for the creation and
implementation of distributed virtual historical
map libraries based on geographic information IMPLEMENTATION OF THE VIRTUAL
and geomatics standards (ISO 19100 standards MAP ROOM
series), OGC specifications, and SDI services.
We set up a prototype (Mercator, 2011) having the The VMR has been implemented as a network of
advantages of real map libraries with advanced cooperating physical servers providing data and
online measurement and geo-referencing tools services. This enables savings in development
(Jenny, Weber, & Hurni, 2007) integrated into time, operating costs, and allows for uniformity
a virtual environment. In this project, we also of data supplier and resources. The organizational
take advantage of digital rights considerations structure, operating environment, technical ar-
(GeoDRM Reference Model (Open Geospatial guments and technological components of this
Consortium, 2006b)) and the MARC-ISO19115 proposal, including the relationships between its
metadata crosswalk developed in order to facilitate parts, and the principles and guidelines, entirely
the publication of cartographic heritage through belong to a typical Spatial Data Infrastructure
SDI (Crespo et al., 2010). architecture. These principles are robust enough
and mature for sharing any kind of geographic data.

4
Showing the Past

Figure 1. Some searching results in the Virtual Map Room (http://www.cartovirtual.es/navegar)

In order to achieve the integration of map li- The software platform of the project is based
braries and their resources and archives, the team on Open Source projects in both the server and
project has set up a virtual space based on Web client side; most of them using the GNU General
servers, map servers, CGI and PHP/MapScript Public License which allows others to use, change,
support, AJAX support, and other technical ele- and share the software and secondary applications
ments which provide enough tools and services without restrictions and makes sure it remains free
to offer a rich, robust and reliable VMR. These software for all its users. This means the proposed
components have been selected as part of the SDI framework is also under the GNU GPL.
architecture. The main element is the map server,
through which the publishing of geographic and Historical Data Repository
spatial data is feasible. This map server has been
recompiled for fulfilling the requirements of re- The information used in the test and implementa-
searchers, fine-tuning the service, and improving tion processes has been provided by DIGMAP, an
the performance of OGC specifications. online repository of historical-cartographic docu-
On the other hand, a specific Web thin client ments. This portal is the outcome of a European
has also been designed for enabling access to project co-funded by the Community programme
the DIGMAP search engine and to the historical eContentplus and published on November 2008
cartography repository. This Web client, which is (DIGMAP, 2008). The aim of DIGMAP was
an online interface, synchronises the search engine to turn its portal into an international reference
with a geographic browser that allows carrying gateway to old maps and related bibliography. The
out geographic browsing, querying operations, and project proposed the development of solutions for
analytical processes once the searching process digital libraries especially focused on cartography
has been done. that promotes the cultural and scientific heritage

5
Showing the Past

(Borbinha, Pedrosa, Gil, Martins, Freire, Dobreva, made up of over 40 map libraries from national
& Fernández-Wyttenbach, 2007). libraries, archives, universities, and army and
This project provides easy access to the civil map services.
thousands of early maps of national libraries and Figure 2 shows the georeferencing process
collections around the world through the Internet. in the DIGMAP Geographic Index module. It
This potential is undoubtedly useful for the cre- should be noted that the geographic referencing
ation of a virtual map room that aims to provide process in DIGMAP is not quite accurate, it just
information for historical and social research in- supplies an approximate location of the maps
terests, since it allows access to a huge amount of based on the maximum and minimum coordinates
cartographic information and related documents, of a bounding box. This reduces the accuracy of
no matter where it is stored. The main service is a measurements and operations on the map. It is,
specialised digital library that recovers metadata nonetheless, valid enough as a first approximation.
from European national libraries and offers online There are methods that deal with the geometric
map searching and access to the contents. Meta- and projective characteristics and properties of
data from other sources are also reused as well early cartographic representations (Livieratos,
as descriptions and references to other relevant 2006; Boutoura & Livieratos, 2006) that could
external resources (Pedrosa, Luzio, Manguinhas, be taken into account in the future in order to
Martins, & Borbinha, 2008). improve this weakness for online resources.
It is important to stress that DIGMAP is a However, if the user needs further accuracy and
virtual digital library, in the sense that it holds to avoid distortion, the VRM have integrated the
only the metadata that describes the resources, but MapAnalyst tool (Jenny, Weber, & Hurni, 2007)
not the resources themselves, which remain in the to offer transforming options and to analyse
local libraries or Web sites (this means that the geometrical aspects such as the distortion grid,
VMR access the original resources through these vectors of displacement, and scale isolines of the
metadata). The resources also can be (1) digital- maps. This tool is quite useful for manipulating
born, (2) digitized, or even (3) physical resources and transforming old maps if necessary.
existing only in the shelves of the libraries. When Although DIGMAP data engine is a great
the resources are digitized maps, it is possible to source of information as described so far, its
index them by their geographic boundaries. This graphical user interface is not very efficient. The
characteristic allows the geographic browser geographic browsing tools associated with the
to show the map in the right position when the search engine are excessively simple and do not
resource is queryable, that is to say, when the follow appropriate usability criteria (Fernández-
resource is georeferenced; otherwise, it will not Wyttenbach, Moya, Álvarez, & Bernabé, 2009).
be possible to load and display the map on the For this reason, it is necessary to develop a new
geographic Web client. user interface that allows researchers to access
Independent of its technical and technological this huge collection more easily and to use the
contributions, the real added value of the DIGMAP maps in a better way.
Portal as data engine for the development of virtual So that this interface could be really useful and
map rooms is the large number of digital maps that assure system interoperability, it has been chosen
has previously been indexed and georeferenced. It to incorporate with the virtual map room the basic
is noteworthy that the main resources come from concepts of the SDI reference framework. Thus
the IBERCARTO Working Group, a specific as- the results (maps and documents) are presented
sociation in which Spanish and Portuguese map on a web-mapping framework in accordance with
libraries are working together. It is a consortium the usual OGC services, providing cartographic

6
Showing the Past

Figure 2. Geographical indexing and referencing of historical contents in DIGMAP

information using independent layers, assuring instance, transparency and anti-aliasing properties
process and platform compatibility, and therefore, are quite efficient for showing early cartography
making easier the documentary research for librar- on a reference WMS layer as shown in Figure 3.
ians, historians and other researchers. The next MapServer is a general platform for publishing
sections describe the server and client processes. spatial data and other geographic applications to
the Web. Based on the language ANSI C/C++, it
Geographic Data Server may be executed, configured and customised in
a large number of operating systems and environ-
After having identified the characteristics of the ments. This means that the framework developed
data repository, and knowing the benefits and on this proposal can run almost on any platform
disadvantages of geographic servers available configuration of any map library. Also, MapServ-
currently on the market (Jansen, 2009; Aime, er works with numerous optional external librar-
2009; Aime & McKenna, 2009), the geographic ies, allowing its extensibility, and supports many
data server MapServer has been chosen as the raster, vector and database formats. This charac-
most appropriate option for management and teristic proves essential when accessing, extract-
publishing of cartographic and historical informa- ing and serving data from different historical data
tion. This server offers enough quality, reliability repositories, such as the ones described above in
and functionality for sharing data on the Internet. the DIGMAP Project.
It supports display and querying of hundreds of The flexibility of use that the server offers
cartographic data sources and formats, render- through the exchange of parameters using con-
ing and showing them using different reference ventional HTTP request methods, GET and POST,
systems and projections through the on-the-fly allows it to dynamically configure the appropriate
projection capabilities. In addition, it assures high variables to meet the specific needs of a particular
quality rendering. These characteristics are very user at a given instant. It thus solves the drawback
useful for the integration of early maps, WMS of generating maps using a static configuration file
data layers and other cartographic resources. For which does not allow modifications by external

7
Showing the Past

Figure 3. Transparency and anti-aliasing examples

users. Due to the development characteristics of Once the access to the repository data has been
the DIGMAP components in which the message solved and the platform has been defined, next step
interchange is handled through this technique, is to visualize the information and to offer users
MapServer is a candidate that covers interaction (librarians, historians, researchers) the necessary
with the historical contents of the map libraries tools (Saracevic, 2000; Marchionini, Plaisant, &
and other future collections that someone wants Komlodi, 2003) to be able to interact with the
to integrate. identified resources.
Thus from a technical point of view, it is
possible extract the information and visualize Service Architecture Scheme
it on the client by means of parameterized URL
query strings. On one hand, the geoinformation The typical application flow for data services
is extracted from the historical documentation based on MapServer is diagrammed in Figure 4.
repositories to generate a raster file as a result, The basic architecture of MapServer applications
and on the other hand, a text file with additional contains a Mapfile, geographic data, HTML pages,
information is also obtained for subsequent geo- MapServer CGI application and the HTTP Server.
referencing process. The generated information The Mapfile is a structured text configuration
is accessible from a temporary URL that points file that defines where the data is and where to
to a public directory of the Web server in which output the generated images. It also defines map
MapServer is executed as a Common Gateway layers, including their data source, projections,
Interface application (CGI). Thus, the server has and symbology. Through these elements we can
available the image and the information required define sources for early cartography and load them.
for georeferencing.

8
Showing the Past

Figure 4. Typical application flow of MapServer

These sources define the second element of (researchers) clicks on the browser. In this case,
the basic architecture: geographic data. The geo- the Apache 2 HTTP Server supports the VMR.
graphic server supports several data input formats Based on Figure 4, it is quite easy to describe
by default, but in our case we compiled it with data flow of the VMR. In brief, MapServer gets
the open source libraries GDAL and OGR in the resources through the DIGMAP collection
order to enable more formats and offer higher by URL requests, and displays them on the Web
quality images. For this service, we are support- client (p.mapper). Figure 5 illustrates the process
ing, among others, WMS layers, shapefile spatial graphically.
data format, and alpha channel values (through
the AGG driver); the last one is very important The Early Cartography Web Client
because it lets the system overlay maps and im-
prove query tasks. This way, we ensure the ap- To interact with the queried resources it is neces-
plication can read and load any file from any library sary to rely on a stable and functional interface
or data source. which suits the purpose of a virtual map room.
HTML pages are the interface between the user Taking into account that currently on the market
and MapServer. These are fully provided by the there is a large number of robust and high-per-
p.mapper framework (described more fully in the formance thin-client web-mapping frameworks, it
next section), thus it is possible to focus only on was decided to build our proposal based on these
the data and functionality. The MapServer CGI clients and not to develop a new one from scratch
application and the HTTP Server are the core of since most of them comply with the minimum
the systems. The CGI is an executable file that historians’ requirements (Crespo-Sanz, 2009)
receives requests and returns images, data, and to interact with early cartography. Among the
other elements that are used by the HTTP Server desirable selection criteria should be emphasized:
for serving up the HTML pages when the user software license, programming language, model

9
Showing the Past

Figure 5. VMR flow chart

and structure, ease of customisation, language used so useful for increasing functionality without
by the API or application interface, OGC services modifying the original structure.
supported, dependence on the map server, inclu- As a thin client, p.mapper takes advantage of
sion of metadata components, interoperability and the latest advances in Web technologies such as
usability of its tools, and perhaps most relevant, an AJAX, enabling transfer of data from client to
Open Source framework for its easy adaptability server and return of results asynchronously, updat-
to specific needs. ing information for the client without having to
After having analyzed the available options, reload the website. This functionality is exploited
it was decided that the visualisation framework to achieve a more natural and intuitive interaction
best fitting the specific needs of map libraries is with early maps. In addition, p.mapper supports
p.mapper, a thin Web client that offers wide range several relational DBMS and contains JavaScript
of functionality and multiple options for facilitat- libraries that facilitate the programmer’s task when
ing the configuration and installation of applica- it is necessary to update functionalities or create
tions based on MapServer and PHP/MapScript. new ones. It handles the light format for data
It works following the multi-layer model and the interchange JSON that minimises the bandwidth
client-server architecture which suits perfectly for expended in client-server communications, with
geospatial data visualisation and its manipulation the consequent increment in response time and
through basic browsing and querying tools. As we speed.
said above, it is characterized by using the GNU This way, we have used MapServer and
General Public Licence which ensures being p.mapper for providing access to historical-
able to modify the application and redistribute it cartographic archives through a graphical Web
without restrictions. On the other hand, it relies on interface. The server platform is composed as
a modular programming structure favouring the follows: Linux Debian Server as Operating Sys-
development and incorporation of new functional- tem, Apache 2 as HTTP Server, PHP5 support as
ities or modification of the existing ones, according scripting language; on the client side, usual Web
to requirements. In addition to the characteristics mechanism such as JavaScript (language script),
already mentioned, p.mapper is programmed un- Cascading Style Sheets (layout presentation) and
der the object-oriented programming paradigm, XML (configuration) were used. As we said above,
communication between server and client has

10
Showing the Past

been established through AJAX to provide natural sary to synchronize both the data engine and the
interaction with maps. These tools allow any user framework. This job was done using session vari-
access to data, regardless their choice of computer, ables. Through this method, we have achieved our
software, browser or other specifications. goal of incorporating metadata and data naturally
The results and performance obtained so far so that users can manipulate them easily.
are reliable and the tests carried out indicate that Once the synchronization process has been
this framework has been a good choice, espe- done, a new Table of Contents (TOC) is loaded in
cially for developing new specific tools for early the browser. This new TOC shows the footprints
cartography and integrating with the DIGMAP of the available georeferenced sources, it must
data repository. be kept in mind that not all the metadata records
contain indexed geospatial sources. Figure 6 shows
Enriched Graphic User Interface a customized TOC.
Once the metadata have been loaded it is pos-
In order to offer historians and researchers an in- sible to describe the queried map. For this task,
tuitive interface for accessing and querying maps, there is a specific tool that loads and shows the
we have improved the client-framework develop- information by clicking on the map. After this, a
ing new tools and functionalities. The framework table describing maps is showed. This table con-
has a flexible layout structure for including new tains a link to the original resource and a further
elements into it through frame windows. Some description. Transparency properties are also
frames that remain empty can be used for almost handled for this table in order not to obscure the
any task if the right techniques are applied. We geographic information. Finally, when user needs
have used these frames for including the DIGMAP to change parameters or make new queries with-
engine and other external applications that provide out taking into account the previous ones, it is
users with a single graphic interface for searching, possible to reset the client and delete the meta-
querying, analyzing, and browsing data. data and data previously loaded. A single X
Due to the fact that the DIGMAP metadata posted in the TOC will delete the current map.
repository had already been set up, it was neces-

Figure 6. VMR customised functions for early cartography searching and querying

11
Showing the Past

Another useful tool is the incorporation of evolution of cities and places. Its main purpose
timeline browsing synchronized with text searches is to compute distortion grids and other types of
on the catalogue and the geographic browser. This visualizations that illustrate the geometrical ac-
tool is based on the SIMILE© Project Timeline curacy and distortion of old maps. The software
widget (http://www.simile-widgets.org/timeline). uses pairs of control points on an early map and
In the same way as we used an empty frame for on a modern reference map. The control points
including the DIGMAP engine, we have used a are used to construct geometrical analysis (Jenny
bottom frame available in the p.mapper’s layout. & Weber, 2011). Figure 7 shows and example of
This tool is useful for knowing the main description a distortion grid analysis.
of the map and, even better, to show an overall
view of the maps in a single timeline. Through
this tool, researchers will be able to query just CONCLUSION AND FUTURE
the maps they really need independent of search- ADVANCES
ing criteria, locating them by date. It works as a
complementary filter for the searching that the Virtual Map Rooms are tools of high interest for
user has previously done. historical and social research projects. They pro-
Finally, as explained in the preceding para- vide specialised access to distributed cartographic
graphs, the MapAnalyst component has been collections and especially early maps collections.
included in the VRM. In addition to the above, The success and sustainability of these tools will
it should be mentioned that MapAnalyst is soft- be influenced by their design and usability pat-
ware developed by Institute of Cartography of terns; this shall be carried out according to the
Swiss Federal Institute of Technology Zurich criteria of publishing of geographic information
for the accurate analysis of early maps. This issued by SDI specifications. They are at the
high-quality tool provides historians, librarians, same time an instance of how the Geographic
and others users with the possibility of analyzing Information Technologies may be useful for the
and browsing the old maps in a modern context, professional and research work of historians and
understanding distortions and checking historical social scientists in general, contributing to the

Figure 7. The MapAnalyst software component for geometrical analysis

12
Showing the Past

dissemination of the History of Cartography and a high degree of quality. Actually, the georefer-
Cartographic Heritage in the most specialized encing process in these collections is better than
academic environments. the one done in the DIGMAP engine, especially
This paper has described two projects in which on the Rumsey collection. Nonetheless, the VMR
a Web-mapping platform has been developed includes a geometrical analysis and correction
in order to supply valuable and comprehensive tool to improve maps if necessary.
instruments for both representing and promot- If one compares digital access to georeferenced
ing the knowledge of cartographic heritage to a early maps, using web-mapping interfaces, one
large number of people. As far as the platform finds that the tools available on the VMR are quite
has been developed, the result is a web-mapping useful, intuitive and usable. The server has been
portal accessible directly from the Internet that set up for completeness, reliability and availability.
makes historical-cartographic resources available The VMR relies on the SDI architecture founda-
for everybody. In this sense, some tasks for the tions. This work has supplied a supported web
internationalization of the platform have been mapping platform for larger configurations related
done but not enough for providing universal ac- with early cartography. Finally, we have added key
cess. Some improvements that cover this aspect components that simplify and enhance the user
will be included in the near future. experience and support the work of libraries in
As a Web mapping platform, the VMR brings charge of preserving cartographic archives. Some
together the SDI Architecture into a single, easy- usability aspects have been taken into account to
to-use Web Portal that provides access to histori- develop an intuitive GUI.
cal data repositories. Any data repository can be This project will incorporate some significant
included in this portal; any library which provides improvements into the Virtual Map Room in the
access to its digital contents through the DIGMAP near future by introducing new technical and
Architecture will be part of the proposed VMR conceptual contributions that will enhance its us-
automatically. This arrangement is the fastest way ability according to users’ requirements. To that
to provide access to the cartographic heritage on end, it is necessary to address the following issues:
the web, leveraging the power of some of the best
open source geospatial platform and the biggest 1. Incorporation of semantic and spatio-
repository of historic-cartographic data on the temporal components. These components
Internet. play an important role to complement
Despite the existing proposals, the Virtual the research work carried out within the
Map Room proposed in this paper is the larg- proposed platform. Their presentation and
est georeferenced historical-cartographic map implementation in a coordinated fashion
collection on the Internet. This is due to the fact will mean a considerable advance in initial
that the application uses the DIGMAP metadata management of information and will lead
repository, which provides access to over 2,500 to a substantial change in the search habits
georeferenced historical maps, unlike other col- that historians and researchers now use.
lections that provide access to 150 (The David 2. Quality of the contents. The Virtual Map
Rumsey Map Collection) or 300 (The World Digital Room cannot be responsible for the quality
Library) georeferenced digital maps. Thus the of the information accessed. It is not uncom-
VMR provides access up to eight times more than mon to find out that there are still collections
the other prestigious collections. However, the without georeferenced archives or with very
quality of the maps must be highlighted. Although poor descriptions (metadata). However,
these collections offer fewer maps, they maintain the application is capable of automatically

13
Showing the Past

incorporating all the improvements that ACKNOWLEDGMENT


may be made to the contents in the future,
as well as new catalogues or collections. At This research work has been partially funded by
the same time, in spite of incorporating the CartoVIRTUAL initiative (Ref. CSO2008-03248/
MapAnalyst tool to allow the user to check GEOG) from the R&D Programme 2008-2011 of
quality and reliability of the resources, it the Ministry of Science and Innovation of Spain.
would also be useful to improve the geo-
graphical indexing and referencing system
provided by DIGMAP. REFERENCES

Once viability is ascertained, the CartoVI- Aime, A. (2009). GeoServer in production. Paper
RTUAL project will move forward towards the presented at the Free and Open Source Software
creation of an institutional prototype of a National for Geospatial Workshop, Sydney, Australia.
Virtual Map Library: a geoportal providing access Aime, A., & McKenna, J. (2009). Web mapping
to historical map libraries, museums and archives. performance shoot-out. Paper presented at the
The success and sustainability of the project will Free and Open Source Software for Geospatial
also depend on adequate training programs for Workshop, Sydney, Australia.
installation and maintenance of map libraries.
We are currently designing an academic program Alexandria Digital Library. (2009). Globetrotter
focus on Geographic Information Technologies - NGDA interface. Retrieved from http://clients.
applied to map libraries. This course will soon be alexandria.ucsb.edu/globetrotter/
accessible from the Virtual Map Library, supported
American Geographical Society Library. (2009).
on an eLearning platform.
Digital map collection. Retrieved from http://
Finally, we are also working to access and
www.uwm.edu/Library/digilib/maps/index.html
compare most of the historical and cartographic
information that is scattered and inaccessible in Arrowsmith, J. R., Glenn, N., Crosby, C. J., &
the Latin American map libraries through a single Cowgill, E. (2008). Current capabilities and com-
Web site. To this end, the Ibero-American Virtual munity needs for software tools and educational
Map Libraries Network (CartoRED) project has resources for use with LiDAR high resolution
started. This network has initial support from topography data. Retrieved from http://opentopo.
eleven geographical institutions representing sdsc.edu/docs/LIDAR_software_tools_needs_
seven countries, which are responsible for safe- Aug_2008.pdf
guarding the historical and cartographic funds in
Borbinha, J., Pedrosa, G., Gil, J., Martins, B.,
Latin America. We expect to complete a 100%
Freire, N., Dobreva, M., et al. (2007). Digital
knowledge and technology transfer process in
libraries and digitised maps: An early overview
order to offer the whole possibilities of integra-
of the DIGMAP project. In D. Goh, T. Cao, I.
tion to the Latin American libraries and related
Sølvberg, & E. Rasmussen (Eds.), Proceedings of
institutions. The use of open source tools will be a
the 10th International Conference on Asian Digital
guarantee for the success of the network, in which
Libraries: Looking Back 10 Years and Forging
raw seeking low-cost technological solutions and
New Frontiers (LNCS 4822, pp. 383-386).
collaborative development networks.

14
Showing the Past

Boutoura, C., & Livieratos, E. (2006). Some fun- Jansen, M. (2009). Comparing apples and or-
damentals for the study of the geometry of early anges: Uncovering the mystery of component
maps by comparative methods. e-Perimetron. selection in WebGIS projects. Paper presented at
International Web Journal on Sciences and Tech- the Free and Open Source Software for Geospatial
nologies Affined to History of Cartography and Workshop, Sydney, Australia.
Maps, 1(1), 60–70.
Jenny, B., & Weber, A. (2011). MapAnalyst - The
Crespo, M., Criado, M., Capdevila, J., Sánchez, A., map historian’s tool for the analysis of old maps.
Soteres, C., & Juanatey, M. (2010). El patrimonio Retrieved from http://mapanalyst.cartography.ch/
cartográfico en las infraestructuras de datos espa-
Jenny, B., Weber, A., & Hurni, L. (2007). Visual-
ciales. Revista Catalana de Geografia, 15(41), 4.
izing the planimetric accuracy of historical maps
Crespo Sanz, A. (2009). The atlas of El Escorial, with MapAnalyst. Cartographica: The Interna-
a forgotten map. Boletín de la Real Sociedad tional Journal for Geographic Information and
Geográfica, 2626, 117–142. Geovisualization, 42(1), 89–94. doi:10.3138/
carto-v42-1-089
Dangermond, J. (2008). GIS and the GeoWeb.
ArcNews, 30, 1–21. Livieratos, E. (2006). On the study of the geometric
properties of historical cartographic representa-
DIGMAP. (2009). Discovering our past world
tions. Cartographica: The International Journal
with digitised maps. Retrieved from http://www.
for Geographic Information and Geovisualization,
digmap.eu
41(2), 165–176. doi:10.3138/RM86-3872-8942-
Fernández-Wyttenbach, A., Álvarez, M., Bernabé, 61P4
M. Á., & Borbinha, J. (2007). Digital map libraries
Marchionini, G., Plaisant, C., & Komlodi, A.
services in the spatial data infrastructure (SDI)
(Eds.). (2003). The people in digital libraries:
framework: The DIGMAP project. Paper presented
Multifaceted approaches to assessing needs and
at the 23rd International Cartographic Conference,
impact. Cambridge, MA: MIT Press.
Moscow, Russia.
Martins, B., Manguinhas, H., Borbinha, J., &
Fernández-Wyttenbach, A., Moya, J., Álvarez, M.,
Siabato, W. (2009). A geo-temporal information
& Bernabé, M. Á. (2008). First approaches to the
extraction service for processing descriptive
usability of digital map libraries. e-Perimetron:
metadata in digital libraries. e-Perimetron. Inter-
International Web journal on Sciences and Tech-
national Web Journal on Sciences and Technolo-
nologies Affined to History of Cartography and
gies Affined to History of Cartography and Maps,
Maps, 3(2), 63-76.
4(1), 25–37.
Fernández-Wyttenbach, A., Siabato, W., Bernabé,
Mercator Research Group. (2011). CartoVIR-
M. Á., & Wachowicz, M. (2010). Evolution of
TUAL - Prototipo de cartoteca histórica nacional
digital map libraries towards virtual map rooms:
distribuida. Retrieved from http://www.cartovir-
New challenges for the historical research. In Kriz,
tual.es/navegar
K., Cartwright, W., & Hurni, L. (Eds.), Mapping
different geographies (Vol. 12, pp. 129–140). Open Geospatial Consortium. (2006a). OpenGIS®
Heidelberg, Germany: Springer-Verlag. web map server implementation specification 1.3.0
(Tech. Rep. No. OGC 06-042). Bergen, Norway:
Open Geospatial Consortium Inc.

15
Showing the Past

Open Geospatial Consortium. (2006b). Geospatial Rumsey, D. (2010). David Rumsey map collec-
digital rights management reference model - Geo- tion - Luna browser. Retrieved from http://www.
DRM RM (Tech. Rep. No. OGC 06-004r3). Bergen, davidrumsey.com/view/luna
Norway: Open Geospatial Consortium Inc.
Rumsey, D. (2011). David Rumsey map collection.
Pedrosa, G., Luzio, J., Manguinhas, H., Martins, Retrieved from http://www.davidrumsey.com
B., & Borbinha, J. (2008). DIGMAP: A digital
Saracevic, T. (2000). Digital library evaluation:
library reusing metadata of old maps and enriching
Toward evolution of concepts. Library Trends,
it with geographic information. In B. Christensen-
49(2), 350–369.
Dalsgaard, D. Castelli, B. Ammitzbøll Jurik, &
J. Lippincott (Eds.), Proceedings of the 12th Eu- U.S. Library of Congress & UNESCO. (2010).
ropean Conference on Research and Advanced The world digital library (WDL). Retrieved from
Technology for Digital Libraries (LNCS 5173, http://www.wdl.org/en/
pp. 434-435).
Wachowicz, M., & Owens, J. B. (2009). Space-
Rumsey, D. (2008). David Rumsey historical time representations of complex networks: What
map collection. Retrieved from http://rumsey. is next? GeoFocus, 9(1), 1–8.
geogarage.com/

This work was previously published in the International Journal of Applied Geospatial Research (IJAGR), Volume 2, Issue 4,
edited by Donald Patrick Albert, pp. 93-108, copyright 2011 by IGI Publishing (an imprint of IGI Global).

16
17

Chapter 2
Location-Based Services:
A Taxonomy on Theory and Practice

Henrik Hanke
University of Duisburg-Essen, Germany

Alf Neumann
University of Cologne, Germany

ABSTRACT
The provisioning of Location-Based Services (LBS) follows the chain of determination of a position,
mapping this information onto a natural language-based description of this position and performing the
service itself. The evolution of technologies regarding applications and infrastructure, standards and
contents has brought up various streams that have influenced the development of this chain over the past
years (Zeimpekis et al., 2003). On the one hand, emerging theoretical concepts have been showing the
way for many commercial and non-commercial services. On the other hand, the conceptual evolution
has been accompanied by significant investments of mobile technology companies and service providers
to the further development of practical solutions (Gessler and Jesse, 2001).

INTRODUCTION and consumed by customers. It sheds light on the


ubiquitous information management approach as
A wide field for technological innovation, the important foundation for advanced mobile data
conceptual discussion of LBS has widely remained services (Acharya et al., 2004).
a technology issue dominated by the development Furthermore, the chapter provides an overview
of positioning techniques, infrastructure and data of the essential service concepts and relevant
transmission concepts. This chapter re-emphasizes implications, challenges, and opportunities that
the term service, including information and func- can be derived from the application context of
tionality, which is offered by LBS applications LBS. Finally, a taxonomy on theory and practice
is presented that draws the link from the technol-
ogy to the service.
DOI: 10.4018/978-1-4666-2038-4.ch002

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Location-Based Services

UBIQUITOUS INFORMATION identification, contextual and environmental loca-


MANAGEMENT tion information as well as navigation between
different locations.
Along with the sophistication and increasing There exists a vast body of literature on po-
performance of communication devices, such as sitioning technologies reaching from the early
Personal Digital Assistants (PDAs), mobile phones Active Badge indoor locating solutions to the
as well as wireless communication networks, the Global Positioning System (GPS) and the more
environment and the world increasingly adopts a recent Wireless Local Area Network (WLAN) and
mobile character. In this respect, a very important Bluetooth concepts (King et al., 2006).
driver is constituted by a ubiquitous information The diversity of the underlying technological
management concept, which is free from temporal basis as well as the opportunities and limitations
and, in general, also from spatial constraints. In among these approaches in design and character-
such mobile computing, ubiquitous computing or istic means of data networks correspond to the
pervasive computing environments, mobile data- increasing need for adapted LBS infrastructures.
bases and the data dissemination infrastructure are These are tailored to the specific requirements
two integral components especially in the context of different types of locations, different modes
of LBS. Data dissemination can follow push-based of mobility and distance (Zeimpekis et al., 2003).
or pull-based information flows depending on The precision of location information and the
where location and application data is processed. distance of a mobile device to a Location Service
This can be done either on the server side or on Provider (LSP) are two factors that play a key role
the device, i.e., client side (Acharya et al., 2004). in this context. They present the overall frame-
In push-based systems, data is pushed into a work in which the service layer of key features
wireless channel and a user tunes in and down- of applications and infrastructure technology is
loads the required data. This approach can process embedded.
read-only transactions and may include popular
data like the stock quotes, news, weather infor- Physical and Symbolic Location
mation, traffic information. On the other side,
in pull-based wireless services, a user induces A LSP can provide two kinds of location informa-
the server to process specific transactions and tion: physical and symbolic. Physical positions are
send the result to the user through a back chan- determined by longitude, latitude and altitude. In
nel. These transactions can be simple queries or contrast, symbolic location encompasses abstract
update transactions. ideas of where something is: in a certain room, in
The two-tier concept of information manage- a certain town or next to a certain object (Gessler
ment to disseminate, process and further store and Jesse, 2001).
data can be collectively termed information layer. A system providing a physical position can
usually be augmented to provide the correspond-
ing symbolic location information with additional
LOCATION-ADAPTED SERVICES information, infrastructure, or both. For example,
a PDA equipped with a GPS receiver may access
LBS provide users of mobile devices personal- a separate database that contains the positions
ized services tailored to their current location. and geometric service regions of other objects to
These central information services fall into three provide applications with symbolic information
broad categories that also emphasize the added (Brumitt et al., 2000). Linking real-time train
value for consumers: positioning and location positions to the reservation and ticketing database

18
Location-Based Services

can help locate a passenger on a train. Applica- for the appropriate context. Remote mobility is
tions can use the physical position to determine an related to movements between different distant
entire range of symbolic information (Hightower locations.
& Borriello 2001). Following the interdependencies between dif-
The distinction between physical position ferent types of mobility and the design of LBS, a
and symbolic location is more pronounced with compatible concept of distance may be developed
some technologies than others. GPS is clearly a from short to long range with similar implications
physical positioning technology. Bluetooth Indoor on infrastructure requirements, techniques and
Positioning Service (BIPS) systems are symbolic precision of positioning systems, context and navi-
location technologies mostly based on proximity gation services. Table 1 provides a taxonomy of
to known objects. However, some systems such LBS functions within the service and information
as Cricket can be used in either mode, depending layer framework and summarizes the implications
on their specific configuration (King et al., 2006). on the key features of LBS data dissemination,
infrastructure and applications.
Location Distance and Mobility

Tilson et al. (2004) identify three types of mobility: LOCATION-ADAPTED TECHNOLOGY


micro, local and remote. Micro mobility is related
to moveable artifacts in very confined spaces. As mentioned above, LBS exploit knowledge of
Relative location information is more useful than the location of a mobile device to provide services
absolute position. The need for very short-range to it and utilize the location information in some
positioning favors particular technologies, for way or the other. Hence, tracking a mobile device
example, Ultra Wideband (UWB). Local mobility, and the delivery of location-adapted services
for example indoors, is related to the mobility of prerequisites the process of gaining concrete
people or objects at a particular location and the position information, that can be carried out in
need for local awareness. This normally implies different ways, with different location context-
stringent positioning requirements. Applications related technological implications, opportunities
in this context require location information to be and challenges.
determined down to a few meters and translated

Table 1. LBS functions – a service and information taxonomy

Service layer
Positioning Context Navigation
data available
global positioning, traffic
Push mobile advertisement from read-only
indoor positioning alerts Key features of
Informa- channel
data dissemina-
tion layer emergency response, mobile yellow pages, data processed and
routing, direction tion
Pull proximity informa- location-aware updated
finding
tion service billing on-demand
support of micro,
precision for local location description,
local and remote
and micro mobility contextual alerts
mobility
Key features of applications and
infrastructure technology

19
Location-Based Services

LBS management can be classified into two In order to get more precise location information,
dimensions: collection of position-specific data in the structure of the cell needs to be revived. A cell
order to realize the service and the implementation usually consists of three or more antennas, which
and operation of an application in terms of the raises the opportunity to aim at the difference at
service itself (Weinberg, 2004). Each dimension which devices signals reach theses three or more
itself can be subdivided in relation to the orga- towers. This process is called Time Difference Of
nization of the infrastructure. On the one hand, Arrival (TDOA). As these need no further hand-
a dimension might be network-based. It would set, client-side, technology other than a simple
consist of centralized and aggregated information, antenna, supplementary technology is needed in
which are dependent on core network features order to generate high-level tracking precision.
and resources, such as mobile phone antennas. By measuring the time at which signals from the
On the other hand, distributed network-centric towers reach the handset, precision levels up to
features can be classified as organizational edge 30 m are possible. Hence, the TDOA functionality
of the infrastructure. is being reversed. The Enhanced Observed Time
In order for LBS to add value at all, minimum Difference (EOTD) thus needs the handset to be
precision requirements concerning tracking ex- much more involved in position determination.
ist. While network coverage and hence tracking Encountering the core mechanism, location
and positioning precision of a device is high in awareness is referred to as the situation in which
densely populated areas, it is rather low outside the handset does not need any further server-side
of cities, as the distances between reference cells technologies beneath the retrieval of location-
grow. While within cities a coverage using tradi- initiated data, such as geographic position, in
tional technology up to 100m is possible, it can order to provide LBS (Steinfield, 2004). Hence,
reduce down to 10 km in rural areas (Steinfield, the position-originating and position-finding
2004). Hence, coverage is negatively correlated technologies are integrated within the handset.
to population density. In this respect, the handset gains ability to gener-
ate knowledge of position by itself. Requiring a
Location Tracking and highly equipped handset, accuracy of the locating-
Location Awareness process can achieve a maximum level, enabling a
location-range of less than 10 m (Agarwal et al.,
Traditionally, a mobile device is traceable by 2002). Therefore, if satellite communication can
using the service infrastructure it communicates be ensured, handsets might be traced anywhere,
with or is tied to. As a handset involved in the anytime.
process of offering LBS has no built-in position- Enhancing GPS capabilities, it is possible to
finder or location-determination technology, make a combined use of network technologies and
the process is referred to as location tracking equipment. By implementing such hybrid systems,
(Seydim et al., 2001). The infrastructure is made a major weakness of GPS is addressed. Hence, the
up of several antennas, called towers or cells. provision of tracking-capabilities in aggravating
The tower transmitting the signal to devices is circumstances (Zeimpekis et al., 2003), such as
referred to as Cell Of Origin (COO). On a basis erratic satellite communication, can be ensured in
of the knowledge of the COO, the Time Of Ar- use of assisted GPS (A-GPS). Therefore, A-GPS
rival (TOA) of signals between the handset and can provide and speed up tracking with further
the tower is measured in order to determine the robustness against up-link and down-link data
device’s position. Although this method provides transmission failures and uncertainty.
distinct location information, it lacks precision.

20
Location-Based Services

Location Data Interpretation arise and conflict with the LBS primary goal of ro-
bustness, for example, data communication issues,
Following the collection of adequate and crucial server-side processing errors or even downtimes.
position information, these data need to be inter- Providing faster services in terms of appli-
preted and prepared in the next step according to cation processing, flexibility and mobility can
the LBS task. This process can again be structured be ensured by the translocation of the services
into core and edge technology based systems operation. Hence, the computing task regarding
(Weinberg, 2004). the processing of the application itself might be
Using network resources for the provisioning executed on the client-side. Provided the handset
of LBS in terms of prepared data, the handset does has appropriate software implemented, it can there-
not process any intelligent software by itself. It is fore perform the LBS self-sufficiently: once the
rather relying on server-side application outputs. location is determined, the application runs locally
Active uplink data transmission provides the server with the user, hence being independent from core
with the prerequisiting location information. The network resources. The built-in software processes
server-side application analysis, prepares and the information gained through communication
outputs the data in order for the handset to draw by itself, for example, placing the user on a map
the requested information. This might be accom- stored in the memory of the handset.
plished by using the simplest handset software,
such as the Wireless Application Protocol (WAP)
browser to which the data is downloaded and
displayed in according to the LBS. In this sense,
many sources of uncertainty and instability can

Figure 1. Table showing LBS technology: A taxonomy on edge and core properties

21
Location-Based Services

CONCLUSION the diversity that stands out regarding the currently


state-of-the-art of LBS devices, applications and
While it is technically possible to precisely locate the technology infrastructure at the micro, local
a device within a few meters or less, the device and remote level. The different location context-
needs to meet specific requirements. These might related prerequisites make it hard to integrate
reduce the proposed usefulness of LBS, as they the existing variety of systems into one scalable,
possibly conflict with other non-LBS provision- capable and cost efficient solution (Hightower
ing goals, such as small and handy terminals. and Borriello, 2001). Figure 1 provides a final
Higher precision in localization demands higher overview on the location- and positioning-related
costs in infrastructure as well as in handset setup technology issues within the dichotomy of edge
(Patterson et al., 2003). Implementing GPS and core properties and makes clear reference to
competency into a handset raises the need for the precision challenge.
further built-in hardware-components, as well as
higher conductible battery and processing power,
which is resulting in higher expenses, as well as REFERENCES
distorted devices.
The interaction between the above mentioned Acharya, D., Kumar, V., & Dunham, M. H. (2004).
edge-core and collection-operation relation Infospace: Hybrid and adaptive public data dis-
delimit not only the implementation of a LBS semination system for ubiquitous computing.
concerning the underlying infrastructural oppor- Journal for Wireless Communications and Mobile
tunities in terms of the alignment of a service, but Computing.
also LBS goals such us precision or processing Agarwal, N., Basch, J., Beckmann, P., Bharti,
power (Weinberg, 2004). For example, while it P., Bloebaum, S., Casadei, C., Chou, A., Enge,
is possible to use core technologies within a data P., Fong, W., & Hathi, W., Mann, w., Stone, J.,
collection process and provide a precise location Tsitsiklis, J., & Van Roy, B. (2002). Algorithms
information, the further generation of a value- for GPS operation indoors and downtown. GPS
added services might not be: routing information Solutions, 6, 149–160. doi:10.1007/s10291-002-
within a navigation application requires continu- 0028-0
ous communication in order to compute the correct
route. Continuous communication itself induces Brumitt, B., Krumm, J., Meyers, B., & Shafer,
higher network traffic and hence network affliction S. (2000). Ubiquitous computing and the role of
and requires stronger battery-power. On the other geometry. IEEE Personal Communications (Spe-
hand, in order to use this service with a GPS-based cial Issue on Smart Spaces and Environments),
technology, stronger processing power and other 7-5, 41-43.
handset hardware is required; yet, network afflic-
Gessler, S., & Jesse, K. (2001). Advanced location
tion would be reduced.
modeling to enable sophisticated LBS provision-
The challenges and opportunities of LBS in
ing in 3G networks. In M. Beigl, P. Gray, & D.
terms of services adapted to the specificities of
Salber (Eds.) Proceedings of the Workshop on
different locations and context-related infor-
Location Modeling for Ubiquitous Computing.
mation needs of LBS users fall into two broad
Atlanta, USA.
categories: mobility and distance requirements,
accuracy and precision. Therefore, it is primarily

22
Location-Based Services

Hightower, J., & Borriello, G. (2001). A survey KEY TERMS AND DEFINITIONS
and taxonomy of location systems for ubiquitous
computing. In Computer Science and Engineering Bluetooth Indoor Positioning Service
(Ed.) Technical Report (document number UW- (BIPS): A symbolic location technology based
CSE 01-08-03). Washington, DC: University of on proximity to known objects.
Washington. Cell of Origin (COO): In a traditional com-
munication environment a COO is referred to as
King, T., Haenselmann, T., Kopf, S., & Effelsberg, the cell or tower, a handset device is tied to.
W. (2006). Positionierung mit wireless-LAN und Enhanced Observed Time Difference
bluetooth. Praxis der Informationsverarbeitung (EODT): Reversing the TDOA-principle, EOTD
und Kommunikation, 29(1), 9–17. doi:10.1515/ measures the transferring-time of signals from the
PIKO.2006.9 towers to the device.
Patterson, C. A., Muntz, R. R., & Pancake, C. M. Location Service Provider (LSP): A provider
(2003). Challenges in location-aware computing. of a value-adding service, which is dependent on
In IEEE CS & IEEE ComSoc. location information.
Time of Arrival (TOA): The position of
Seydim, A. Y., Dunham, M. H., & Kumar, V. a device can be determined by measuring the
(2001). Location dependent query processing. transferring-time of a signal between the device
Proceedings of the 2nd ACM international work- and the COO.
shop on data engineering for wireless and mobile Time Difference of Arrival (TDOA): Deter-
access, pp.47-53, Santa Barbara,CA, USA. mining a more precise position information of a
Steinfield, C. (2004). The development of loca- device by taking advantage of a cells infrastructure
tion based services in mobile commerce. In: B. and measuring the transferring time of a device
Priessl, H. Bouwman, & C. Steinfield (Eds.), to three or more antennas.
E-Life after the dot.com bust. Berlin, Germany: Ubiquitous Information Management
Springer Verlag. (UIM): A communication concept, which is
free from temporal and, in general, from spatial
Weinberg, G. (2004). Location based services constraints.
case study. MIT Center for E-Business. Boston. Ultra Wideband (UWB): A technology which
Zeimpekis, V., Giaglis, G. M., & Lekakos, G. enables very short-range positioning information.
(2003). A taxonomy of indoor and outdoor
positioning techniques for mobile location
services. SIGecom Exchange, 3(4), 19–27.
doi:10.1145/844351.844355

This work was previously published in Handbook of Research on Geoinformatics, edited by Hassan A. Karimi, pp. 269-276,
copyright 2009 by Information Science Reference (an imprint of IGI Global).

23
24

Chapter 3
Location Privacy
Matt Duckham
University of Melbourne, Australia

ABSTRACT
In this chapter, the author raises a number of issues surrounding the ever-growing capabilities of geo-
informatics. Location privacy can be defined as the claim of individuals to determine for themselves
when, how, and to what extent information about their geographic location is communicated to others.
Location privacy has become an especially important issue in geoinformatics because of the emergence
of location-aware computing. This implication of the interaction between these technology and personal
rights is explored in this chapter.

INTRODUCTION • Inferring other personal information:


Geographic location fundamentally lim-
A person’s geographic location is personal and its many of the activities a person can be
sensitive information that needs to be protected. engaged in. For this reason, location can
Unprotected information about where a person be used to infer other personal informa-
is located can be misused in a number of ways, tion about an individual. For example, the
including: information that Anna is regularly located
in a hospital oncology unit might be used
• Causing personal harm: Information to infer that Anna is suffering from cancer.
about a person’s location may be used to An insurance company might then unfairly
cause that person harm. For example, a va- use this inference to deny health insurance
riety of criminal activities, such as stalking to Anna.
or even physical assault, require that indi-
vidual’s location is known to the criminal. Location privacy can be defined as the claim
of individuals to determine for themselves when,
DOI: 10.4018/978-1-4666-2038-4.ch003 how, and to what extent information about their

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Location Privacy

geographic location is communicated to others. communicate that information to interested par-


Location privacy is a special type of information ties anywhere in the work (e.g., via the Internet).
privacy. Information privacy is usually defined as Users of location-aware technology can po-
the “claim of individuals, groups, or institutions to tentially take advantage of many new and highly
determine for themselves when, how, and to what beneficial location-based services. Examples of
extent information about them is communicated to increasingly common location-based services
others” (Westin, 1967, p. 7). As Figure 1 shows, include navigation and emergency response sys-
information privacy is just one of several types tems. People using location-based services must
of privacy. Privacy in general is regarded as a reveal information about their location to a service
fundamental human right, recognized in Article provider. One of the first location-based services,
12 of the UN Universal Declaration of Human the AT&T “Find Friends” service, enabled friends
Rights (1948). and family to share information about their location
to each other via their mobile phones (Strassman
and Collier, 2004).
PRIVACY IN LOCATION-AWARE In some situations individuals may not even
ENVIRONMENTS be explicitly aware that their location is being
tracked. For example, not all cell phone users
Location privacy has become an especially are aware that cellular phone systems must track
important issue in geoinformatics because of the approximate location of service subscribers
the emergence of location-aware computing. in order to route calls to and from subscribers’
Location-aware computing environments com- mobile handsets. With respect to location privacy,
bine high-power mobile computer platforms, three types of location-sensing technique can be
like personal digital assistants (PDAs) or cellular distinguished (Kaasinen, 2003):
phones; location-sensing technology, like global
positioning system (GPS); and wireless commu- 1. Client-based positioning systems: use
nication, like WiFi or Bluetooth. Using location- a person’s mobile device to compute that
aware computing environments it is possible to person’s location. Client-based position sys-
track an individual’s location in real-time, and tems may be used without revealing location

Figure 1. Classification of types of privacy, after Rotenberg and Laurant (2004)

25
Location Privacy

information to a third party. However, most ample, most countries have fair informa-
location-based services still require such tion practice or data protection laws that
information be disclosed. GPS is an example govern the use of all personal informa-
of a client-based positioning system. tion, including location information (e.g.,
2. Network-based positioning systems: com- OECD, 1980).
pute a person’s location using the network • Privacy policies: are machine-readable
infrastructure. Network-based positioning specifications of acceptable uses of loca-
systems require that a person’s location must tion information used to monitor or en-
be revealed to the network administrators. force location privacy automatically. For
For example, CGI (cell global identity) is a example, PAWS (privacy awareness sys-
common network-based positioning tech- tem) provides a technological framework
nique used in cell-phone networks that enables individuals to automatically
3. Network-assisted positioning systems: use keep track of the storage and usage of per-
a combination of client-based and network- sonal location information by third parties
based technology to compute a person’s (Langheinrich, 2001).
location. Network assisted positioning sys- • Anonymity strategies: dissociate infor-
tems require that some information about mation about an individual’s location from
a person’s location must be revealed to the information about that individual’s iden-
network administrators, but not all. For ex- tity. For example, the “geonotes” system
ample, A-GPS (assisted GPS) combines CGI allows users to read and write digital notes
positioning with GPS to provide increased attached to particular geographic locations.
positioning speed and efficiency. Users of the geonotes system can protect
their privacy by using an alias (pseud-
As location-aware technology becomes more onym) instead of their real name (Espinoza
commonplace, increasing volumes of information et al., 2001).
about individuals’ locations is being generated, • Obfuscation strategies: deliberately de-
communicated, and stored. Correspondingly, con- grade the quality of information about an
cern for protecting the location privacy of these individual’s location. For example, instead
individuals is increasing. An example of one area of specifying a precise coordinate location,
of concern is the so-called “location-based spam,” a person might obfuscate his or her location
where an unscrupulous business sends a person by only specifying in which street, suburb,
unsolicited marketing material for products or city, or even region he or she is currently
services related to that person’s location (Schilit located (Duckham and Kulik, 2005).
et al., 2003).
Unfortunately, no single privacy protection
strategy can offer a complete solution to location
PROTECTING LOCATION PRIVACY privacy issues. Regulatory strategies and privacy
policies provide mechanisms to proscribe certain
There are four different approaches to protecting uses of personal information, but such strategies
an individual’s location privacy: cannot prevent inadvertent or malicious disclosure
of this information. Anonymity and obfuscation
• Regulatory strategies: aim to establish provide mechanisms to hide personal informa-
and enforce rules that govern the fair use tion. In the case of anonymity, a user’s identity
of personal location information. For ex- is hidden. In the case of obfuscation, a user’s

26
Location Privacy

Figure 2. Summary of location privacy protection strategies

exact location is hidden. However, anonymity and For example, in 1999 Scott McNealy (CEO of
obfuscation are not suitable for all situations and Sun Microsystems) controversially exclaimed
may be vulnerable to attacks aimed at invading in response to privacy concerns: “You have zero
an individual’s location privacy. For example, privacy anyway, get over it!” (Sprenger, 1999). It
because our precise location can uniquely identify is possible that in the future we may, as a society,
us, analysis of an anonymous user’s location over discard the notion of a right to location privacy.
time may be used to infer that user’s identity. However, such a dramatic change in attitudes
Consequently, combinations of these different seems unlikely to happen soon. Already, location
privacy protection strategies, summarized in privacy concerns have led to a number of high
Figure 2, need to be used. profile legal cases (e.g., Chicago Tribune, 1999;
Lee, 2004). Whichever way societal attitudes
change in the longer term, it seems certain that
FUTURE OF LOCATION PRIVACY in the near future concerns about location privacy
will continue to increase. Therefore, an essential
Society’s attitudes to privacy change over time. component of the successful development of new
As an example, in 1753 an attempt to establish a applications of location-aware technology is the
national census was defeated in the British parlia- continuing development of ways to enable access
ment as “subversive” to British citizens’ privacy to location-based information and services at the
and liberty. Today, censuses are a basic function same time as protecting the individual’s right to
of modern governments that few people object to location privacy.
(Rule, 1973). Changing attitudes have led some
authors to argue for greater transparency instead
of greater privacy (Brin, 1999). Other authors REFERENCES
point to practical difficulties of maintaining
high-levels of privacy in the face of changing Brin, D. (1999). The Transparent Society. Perseus.
technologies, like location-aware computing.

27
Location Privacy

Chicago Tribune. (2001). Rental firm uses GPS in Schilit, B. N., Hong, J., & Gruteser, M. (2003).
speeding fine. Chicago, IL: Associated Press (p. 9). Wireless location privacy protection. IEEE Com-
puter, 36(12), 135–137.
Duckham, M., & Kulik, L. (2005). A formal
model of obfuscation and negotiation for loca- Sprenger, P. (1999). Sun on privacy: Get over
tion privacy. In H. W. Gellersen, R. Want, & A. it. Wired (San Francisco, Calif.), (January): 26.
Schmidt, (Eds.), Pervasive 2005. Springer: Berlin.
Strassman, M., & Collier, C. (2004). Case study:
(pp. 152-170).
Development of the Find Friend application. In
Espinoza, F., Persson, P., Sandin, A., Nyström, H., J. Schiller & A. Voisard (Eds.), Location-based
Cacciatore, E., & Bylund, M. (2001). GeoNotes: services, Morgan Kaufmann. (pp. 27-39).
Social and navigational aspects of location-based
United Nations (1948). Universal Declaration
information systems. In G. D. Abowd, B. Brumitt,
of Human Rights. United Nations Resolution
& S. Shafer (Eds.), Ubicomp 2001: Ubiquitous
217 A (III).
Computing. Berlin: Springer. (pp. 2-17).
Westin, A. F. (1967). Privacy and Freedom. New
Kaasinen, E. (2003). User needs for location-
York: Atheneum.
aware mobile services. Personal and Ubiquitous
Computing, 7(1), 70–79. doi:10.1007/s00779-
002-0214-7
KEY TERMS AND DEFINITIONS
Langheinrich, M. (2001). Privacy by design -
principles of privacy-aware ubiquitous systems. Cell Global Identify (CGI): Unique iden-
In G. D. Abowd, B. Brumitt, & S. Shafer (Eds.), tification number assigned to each cell within a
Ubicomp 2001: Ubiquitous Computing, Berlin: cellular communications network. Also the term
Springer. (pp. 273-291). for positioning systems that use CGI.
Global Positioning System (GPS): Position-
Lee, J. (2004). Location-Tracking Sparks Privacy ing system based on triangulation of radio signals
Concerns. Korea Times. http://times.hankooki. from orbiting GPS satellites.
com, Accessed 26 July 2005. Location-Aware Computing: Computing
Organization for Economic Co-operation and environment that combines high-power mobile
Development (OECD). (1980). Guidelines on the computer platform, location-sensing technology,
Protection of Privacy and Transborder Flows of and wireless communication.
Personal Data. http://www.oecd.org, Accessed Location-Based Services: Application of
25 July 2005. location-aware computing to provide mobile user
with information relevant to his or her current
Rotenberg, M., & Laurant, C. (2004). Privacy location.
and human rights 2004: An international survey Location-Based Spam: Unsolicited market-
of privacy laws and developments. Privacy In- ing material (spam) related to an individual’s
ternational. http://www.privacyinternational.org/ current or recent location.
survey, Accessed 26 July 2005. Location Privacy: The claim of individuals
Rule, J. B. (1973). Private Lives and Public Sur- to determine for themselves when, how, and to
veillance. London: Allen Lane. what extent information about their geographic
location is communicated to others.

28
Location Privacy

Personal Digital Assistant (PDA): Mobile Wireless Fidelity (WiFi): Wireless digital
handheld computing device for common business network communication system using any of the
and organizational applications. IEEE 802.11 family of standards.

This work was previously published in Handbook of Research on Geoinformatics, edited by Hassan A. Karimi, pp. 254-259,
copyright 2009 by Information Science Reference (an imprint of IGI Global).

29
30

Chapter 4
Location-Based
Performance Tuning in
Mobile Sensor Networks
Vladimir I. Zadorozhny
University of Pittsburgh, USA

ABSTRACT
The author of this chapter considers the location-based approach for performance tuning that significantly
facilitates the challenge of utilizing Mobile Sensor Networks. The authors introduce mobile nodes that
can be deployed in conjunction with stationary sensor nodes to perform mission critical surveillance
and monitoring tasks. It details the past advances in this field and discusses other approaches to this
challenge.

MOBILE SENSOR NETWORKS Sensor nodes can be attached to mobile devices


such as mobile robots forming a Mobile Sensor
Recent advances in wireless communications and Network (MSN).
microelectronics have enabled wide deployment There has been considerable research on
of smart sensor networks. Such networks naturally designing mobile platforms to transport wireless
apply to a broad range of applications that involve sensors. The Millibot project at Carnegie Mel-
system monitoring and information tracking (e.g., lon University (Bererton et al., 2000) focused on
airport security infrastructure, monitoring of constructing heterogeneous distributed robots that
children in metropolitan areas, product transition combine mobile platforms with different sensor
in warehouse networks, fine-grained weather/ devices such as cameras, temperature sensors,
environmental measurements, etc.). Wireless movement sensors, and so forth. Robomote (Sibley
sensors are small resource-constrained devices et al., 2002), designed in the Robotic Embebbed
with wireless communication capability, process- Systems Laboratory at the University of Southern
ing power, and environment sensing equipment. California is a robot platform that functions as a
single mobile node in a mobile sensor network.
DOI: 10.4018/978-1-4666-2038-4.ch004 Figure 1 shows actual pictures of the millibot and

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Location-Based Performance Tuning in Mobile Sensor Networks

Figure 1. Millibot (left) and robomote (right)

robomote platforms. Another example includes utilize ad-hoc localization methods based on
efforts on using commercial-off-the-shelf com- nodes calculating their coordinates using special
ponents to build inexpensive and modular robots beacon nodes whose positions are known. Further
(Bergbreiter and Pister, 2003). consideration of this subject is beyond the scope
Mobile Sensor Network consists of mobile of this chapter.
platform (e.g., mobile robots) carrying wireless
sensor devices that can be deployed in conjunction
with stationary sensor nodes to acquire and process PERFORMANCE TUNING IN
data for surveillance and tracking, environmental MOBILE SENSOR NETWORKS
monitoring for highly sensitive areas, or execute
search and rescue operations. Resource constraints Applications using MSNs have stringent require-
of MSNs make it difficult to utilize them for ments for efficient mechanisms of querying sensor
advanced environmental monitoring that requires data and delivering the query result. Minimizing
data intensive collaboration between the robots sensor query response time becomes crucial in mo-
(e.g., exchange of multimedia data streams) bile sensor networks. At the same time, minimizing
(Scerri et al., 2003; Scerri, Xu et al., 2004). To energy consumption per query is equally crucial
meet the application requirements, the data ex- for these battery-powered devices. In general, the
change must be performed over a wireless link. time/energy trade-offs involve energy and time
Meanwhile, even high rate wireless networks gain/loss associated with specific layouts of the
(e.g., 802.11 networks) use a best-effort service nodes. Proper positioning (relocation) of mobile
that has limitations of data intensive multimedia sensors combined with changing the transmission
applications since it can lead to packet loss, delay ranges of sensors have a considerable impact on
and jitter (Kurose and Ross, 2005). The problem the time/energy trade-off. Specifically, both factors
aggravates in low rate wireless sensor networks, impact the following characteristics of a Mobile
(e.g., 802.15.4 networks) (Zheng and Lee, 2004). Sensor Network:
In this chapter, we consider location-based
approach for performance tuning in MSNs. This • Collision-free concurrency: Packet col-
approach assumes that each node in MSN is aware lisions is one of the major sources of en-
of its geographic location. Note, that using Global ergy and time waste in MSN. Once any
Positioning System (GPS) is not always possible two or more nodes in the same Collision
in such systems because of severe energy and Domain (CD) transmit packets at the same
location precision constraints. Commonly MSNs time, a collision occurs, and packets are

31
Location-Based Performance Tuning in Mobile Sensor Networks

corrupted and discarded. Packet collisions in transmission or reception are in the same col-
can be avoided by minimizing the number lision domain CD(n1,n2) defined as the union of
of intersecting CDs and by synchronizing the transmission ranges of n1 and n2. In Figure
data transmissions among nodes within the 2, nodes n1, n2, n3, and n4, n5 and n6 are in the
same CD. Both relocation and changing the same collision domain. This implies that when n1
transmission range of sensor nodes could and n2 are communicating, n3, n4, n5 and n6 can-
result in changing the number of potential not participate in any communications. A typical
collision-free concurrent transmissions. wireless network handles collisions using carrier
• Filtering factor: Relocation and chang- sense multiple-access with collision avoidance
ing the transmission range of sensors can (CSCMA-CA) (Zheng and Lee, 2004). In general,
result in changing the number of hops and before starting a transmission, nodes must sense
intermediate transmission nodes involved the channel for a predetermined amount of time
in query execution. This, however, brings (waiting time). If the channel is busy, the nodes
both benefits and penalties. If the filtering wait for the predetermined amount of time after the
factor of the intermediate node is low (i.e., channel becomes free. In addition, nodes backoff
it just retransmits the data) then it can intro- for a random time to avoid the possibility that two
duce some time and energy loss due to ex- or more nodes transmit at the same time after the
tra hop. From the other side, the intermedi- waiting period. For this entire period, the node
ate node does reduce the data transmission must sense the channel and this consumes energy.
ranges, which results in energy conserva- Each packet also needs to be acknowledged by the
tion. If the intermediate node does a lot of receiver since wireless channels are unreliable.
filtering, the benefits include spending less In general, mobile sensors should position
energy in order to transmit less data. themselves and adjust their transmission power
so as to minimize overlap of CDs in the query
A mobile sensor query is characterized by large tree. In some cases, however, this general strat-
data streams among participating nodes with pos- egy may result in time and/or energy loss. In
sible in-node data filtering/aggregation, which can order to capture the associated tradeoff, we intro-
be described as a tree-like data delivery pattern duce a concept of collision-free concurrency (cfc)
(query routing tree). Mobile sensors are moved of a query tree. We say that query tree T1 has
into target positions according to the selected query higher cfc than an equivalent query tree T2, if T1
routing tree and mobile sensor deployment plans
taking into consideration the current topology of
Figure 2. Collision domain of two communicat-
stationary sensor nodes, the applications’ cover-
ing sensors
age requirements, and the collision domains of
the sensor nodes.
Figure 2 elaborates on the concept of colli-
sion domains in a typical wireless network such
as IEEE 802.15.4 (Zheng and Lee, 2004) and
illustrates how collisions are handled in such
a network. Consider two nodes n1 and n2 that
wish to communicate. Assuming that all sensor
nodes use the same frequency band for trans-
mission, two transmissions that overlap will get
corrupted (collide) if the sensor nodes involved

32
Location-Based Performance Tuning in Mobile Sensor Networks

enables more concurrent transmissions without query tree. Some of the mobile robots can facili-
risk of collisions than T2. A query tree has cfc=1, tate data delivery acting as intermediate nodes
if it allows for all potentially concurrent transmis- rather than data acquisition nodes. Such mobile
sion pairs to occur. Two elementary transmissions facilitators can introduce extra hops in order to
et1 and et2 are potentially concurrent in a query reduce transmission ranges of the data acquisition
tree T if they do not share the same destination nodes. In addition, the facilitators can also act as
and there is no strict order between et1 and et2. filters aggregating/compressing the input data and
For example, consider the query tree in Figure 3a so decreases the amount of data transmitted from
which was generated for some query Q. The the remote nodes to the root node.
query tree in Figure 3a is associated with cfc=1/3, Figure 4 shows a query tree topology with four
since it allows for one transmission pair (n4~n2, previously positioned nodes s0, s1, s2, s3 and three
n5~n3) to occur without risk of collisions. Note different positions of a mobile facilitator m. The
that the query tree includes three potentially con- facilitators consume extra energy and introduce
current transmission pairs: (n4~n2, n5~n3), some extra processing delay. However, by reduc-
(n4~n2, n3~n1), and (n5~n3, n2~n1). Here ni~nj ing the transmission range and data stream sizes,
denotes an elementary one-hop transmission from they are also capable of reducing the overall query
a sensor node ni to node nj. On the other hand, time and energy consumption. To sum up, given a
Figure 3b illustrates a query tree with cfc=0 since query, the coverage requirements, and the initial
none of the three potentially concurrent transmis- position of the mobile robots, the query optimizer
sion pairs can be performed without risk of col- shifts through possible mobile robots positions
lisions. in order to generate the candidate trees with ac-
An optimization technique should be applied ceptable response time and energy consumption.
to teamwork planning in the MSN to take into
account the time/energy trade-offs. In general, the
trade-offs involve energy and time gain/loss as- ALGEBRAIC OPTIMIZATION
sociated with specific layouts of the wireless
sensors. The mobile robots should position them- In this section we consider an algebraic query
selves and adjust their transmission power so as optimization technique based on a Data Transmis-
to minimize overlap of CDs in the query tree. The sion Algebra (DTA) (Zadorozhny et al., 2005) that
optimizer is responsible for the choice of the best allows a query optimizer to generate query routing
query routing tree. The mobile robots should move trees to maximize collision-free concurrent data
into target positions according to the selected

Figure 3. Explanation of collision-free concurrency

33
Location-Based Performance Tuning in Mobile Sensor Networks

Figure 4. Re-positioning of mobile facilitator

transmissions taking into account intermediate transmissions n2~n1 and n3~n1 since they share
hops and filtering factors of facilitators. the same destination. However this order is not
The DTA consists of a set of operations that strict. Operation a(n4~n2, n5~n3) specifies that
take transmissions between wireless sensor nodes n4~n2 can be executed concurrently with n5~n3,
as input and produce a schedule of transmissions since neither n3 nor n5 belongs to CD(n4,n2), and
as their result. A one-hop elementary transmis- neither n4 nor n2 are in CD(n5,n3).
sion from sensor node ni to node nj is denoted Each operation of the DTA specification de-
as ni~nj. Each transmission ni~nj is associated fines a simple transmission schedule that consists
with a collision domain CD(ni, nj) as defined in of two elementary transmissions. The DTA intro-
the previous section. A transmission schedule is duces a set of transformation rules (Zadorozhny
either an elementary transmission, or a composi- et al., 2004) that can be used to generate more
tion of elementary transmissions using one of the complex schedules. Figure 5 shows an example
operations of the DTA. The basic DTA includes of a complete schedule that includes all elemen-
three operations that combine two transmission tary transmissions of the query tree. Figure 5 also
schedules A and B: shows the initial DTA specification reflecting
basic constraints of the query tree. The initial
1. o(A,B): This is a strict order operation, that specification consists of a set of elementary trans-
is, A must be executed before B. missions reflecting the tree topology imposed by
2. a(A,B): This is an overlap operation, that is, the query semantics, as well as order and overlap
A and B can be executed concurrently. operations over the elementary transmissions.
3. c(A,B): This is a non-strict order operation, Figure 6 shows query routing trees from Figure
that is, either A executes before B, or vice 4 with four previously positioned nodes s0, s1,
versa. s2, s3 and three different positions of a mobile
facilitator m. Note how the re-positioning of the
For an example of the DTA operations consider facilitator is reflected in the initial DTA specifica-
the query tree in Figure 5. It shows some DTA tions is1, is2 and is3.
specifications that reflect basic constraints of Out of the many possible query routing trees
the query tree. For instance, operation o(n4~n2, and transmission schedules the optimizer should
n2~n1) specifies that transmission n2~n1 occurs select an option with an acceptable query response
after n4~n2 is completed. This constraint reflects a time and overall energy consumption. This is a
part of the query tree topology. Operation c(n2~n1, multi-objective optimization (MOP) problem
n3~n1) specifies that there is an order between (Miettinen, 1999). In general, the MOP aims at

34
Location-Based Performance Tuning in Mobile Sensor Networks

Figure 5. Example of DTA specifications

Figure 6. Impact of mobility on DTA specification

minimizing values of several objective functions Figure 6 reports on Pareto fronts for a simple
f1,…fn under a given set of constraints. To choose two-hop query tree of 8 nodes with some data
between different vectors of the optimization aggregation/filtering at intermediate nodes. For
objectives, the optimizer utilizes the concept of example, filtering factor 0.2 means that 20% of
Pareto optimality (Miettinen, 1999). Informally, data delivered to an intermediate node will be
an objective vector is said to be Pareto optimal forwarded to the base station. A major observation
(also called Pareto front) if all other feasible vec- here is an increase of variability in both time and
tors in the objective space have a higher value for energy consumption with a decrease of the facili-
at least one of the objective functions, or else have tator filtering factor. This means that in general,
the same value for all objectives. Among all Pa- the optimizer can benefit from higher filtering
reto optimal solutions, the optimizer should chose factors (the lower filtering factor reduces more
one using an application-dependent utility func- input data). However, there is a considerable risk
tion. The optimizer should evaluate time and for the optimizer to behave as badly as in the case
energy gains/losses and make a preference con- of a high filtering factor. The optimizer explores
sidering the relative importance of time and en- related time/energy tradeoffs maximizing benefits
ergy in the context of a specific query. and avoiding risks of selecting bad schedules.
The assumption is that the energy source used for

35
Location-Based Performance Tuning in Mobile Sensor Networks

Figure 7. Actual Pareto fronts explored by optimizer

moving the platform is separate from the sensor Other approaches can also be adopted to
batteries. Considering both energy and time to optimize data exchange over a wireless link in
move is an interesting issue that should be explored mobile environments. For example, Particle sys-
separately and more work in this area is required. tems utilize nondestructive bit wise arbitration
Note that the algebraic optimization may be for channel access that considerably reduces the
expensive due to its combinatorial nature. The collisions rates (Decker et al., 2005). Several
number of alternative query trees and schedules distributed time slot scheduling algorithms for
grow at least exponentially with the number collision-free communications were implemented
of sensor nodes and elementary transmissions as an extension of basic Time Division Multiple
participating in the query. In order to handle the Access (TDMA) protocols. Ammar and Stevens
optimization complexity, the optimizer should use (1991) proposed a distributed TDMA protocol with
scalable techniques, such as utilize heuristic-based a goal to permit mobile sensors to move and then
pruning that eliminates suboptimal alternatives reallocate themselves a time slot without involv-
and randomized algorithms (Ioannidis and Kang, ing the entire network. Ali et al. (2002) proposed
1990). a distributed and adaptive TDMA algorithm for
multi-hop mobile networks. One concern with this
design is that dynamic topology changes may lead
CONCLUSION to frequent exchanges of control packets that could
consume bandwidth and energy resources. In gen-
Mobile Sensor Networks include a large collec- eral, the distributed TDMA scheduling schemes
tion of collaborating mobile sensor devices. The have considerable control message overhead for
mobile nodes can be deployed in conjunction building data delivery schedules.
with stationary sensor nodes to perform mission
critical surveillance and monitoring tasks. Such
applications require efficient collaboration be- REFERENCES
tween mobile sensors, which imply intensive data
exchange over wireless links. Meanwhile, resource Ali, F., Appani, P., Hammond, J., Mehta, V., Non-
constraints of MSNs make it difficult to utilize eaker, D., & Russell, H. (2002). Distributed and
them for advanced data-intensive monitoring Adaptive TDMA Algorithms for Multiple-Hop
tasks. In this article we considered location-based Mobile Networks, Proceedings of MILCOM,
approach for performance tuning that significantly 546-551.
facilitates this challenge.

36
Location-Based Performance Tuning in Mobile Sensor Networks

Ammar, M., & Stevens, D. (1991) A Distributed Sibley, G., Rahimi, M., & Sukhatme, G. (2002).
TDMA Rescheduling Procedures for Mobile Robomote: A Tiny Mobile Robot Platform for
Packet Radio Networks, Proceedings of IEEE Large-Scale Ad Hoc Sensor Networks. Proceed-
International Conference on Communications ings of the Intl. Conference on Robotics and
(ICC), 1609-1613. Automation, 1143-1148.
Bererton, C., Navarro-Serment, L., Grabowski, Zadorozhny, V., Chrysanthis, P., & Krishnamurthy,
R., Paredis, C., & Khosla, P. (2000). Millibots: P. (2004). A Framework for Extending the Synergy
Small distributed robots for surveillance and map- between MAC Layer and Query Optimization in
ping. Proceedings of Government Microcircuit Sensor Networks. Proceedings of DMSN VLDB
Applications Conference, 162-167. Workshop, 68-77.
Bergbreiter, S., & Pister, K. S. (2003) CotsBots: an Zadorozhny, V., Sharma, D., Krishnamurthy, P., &
Off-the-Shelf Platform for Distributed Robotics. Labrinidis, A. (2005). Tuning query performance
Proceedings of in the 2004 IEEE/RSJ International in sensor databases. Proceedings of MDM. 247-
Conference on Intelligent Robots and Systems, 251.
1632-1637.
Zheng, J., & Lee, M. (2004). Will IEEE 802.15.4
Decker, C., Krohn, A., Beigl, M., & Zimmer, make ubiquitous networking a reality? a discussion
T. (2005). The Particle Computer System. Pro- on a potential low power, low bit rate standard.
ceedings of the ACM/IEEE Fourth International IEEE Communications Magazine, 42(6), 140–146.
Conference on Information Processing in Sensor doi:10.1109/MCOM.2004.1304251
Networks (IPSN05), 443-448.
Ioannidis, Y., & Kang, Y. (1990). Randomized
KEY TERMS AND DEFINITIONS
algorithms for optimizing large join queries.
Proceedings of ACM SIGMOD, 312-321. Collision Domain: The union of the trans-
Kurose, J. F., & Ross, K. W. (2005). Computer mission ranges of two communicating wireless
Networking: A Top-down Approach. Addison- sensors.
Wesley. 712 p. Collision-Free Concurrency: A quality
measure of a query routing tree in a mobile sen-
Miettinen, K. (1999). Nonlinear Multiobjective sor network reflecting the number of concurrent
Optimization Kluwer Academic Publisher, 298 transmission pairs that can be performed within
Scerri, P., Pynadath, D., Johnson, L., Rosenbloom, the query routing tree without risking packet
P., Si, M., Schurr, N., & Tambe, M. (2003). A collisions.
Prototype Infrastructure for Distributed Robot Data Transmission Algebra (DTA): An alge-
Agent Person Teams. Proceedings of 2nd Int. Joint braic formalism consisting of a set of operations
Conference on Autonomous Agents and Multiagent that take transmissions between wireless sensors
Systems, 433-440. as input and produce a schedule of transmissions
as their result. MSN query optimizer utilizes DTA
Scerri, P., Xu, Y., Liao, E., Lai, J., & Sycara, K. to select a query routing tree and transmission
(2004). Scaling Teamwork to Very Large Teams. schedules with an acceptable query response time
Proceedings of AAMAS, 888-895. and overall energy consumption.
Mobile Facilitator: A mobile sensor device
that facilitates data delivery in a mobile sensor

37
Location-Based Performance Tuning in Mobile Sensor Networks

network acting as an intermediate node rather domain transmit packets at the same time. When
than a data acquisition wireless sensor. collision occurs packets are typically corrupted
Mobile Sensor Device: A mobile platform and discarded.
that functions as a single mobile node carrying a Query Routing Tree: A tree-like data delivery
wireless sensor in a mobile sensor network. pattern generated by a query in mobile sensor
Mobile Sensor Network (MSN): A collection networks.
of collaborating mobile sensor devices deployed Wireless Sensor: A small resource-con-
in conjunction with stationary sensor nodes to strained device with wireless communication
perform surveillance and monitoring tasks. capability, processing power and environment
Packet Collision: A phenomena that occurs sensing equipment.
once any two or more nodes in the same collision

This work was previously published in Handbook of Research on Geoinformatics, edited by Hassan A. Karimi, pp. 260-268,
copyright 2009 by Information Science Reference (an imprint of IGI Global).

38
39

Chapter 5
Routing
Kevin M. Curtin
George Mason University, USA

ABSTRACT
Routing is the act of selecting a course of travel. Routing problems are one of the most prominent and
persistent problems in geoinformatics. This large research area has a strong theoretical foundation
with ties to operations research and management science. There are a wide variety of routing models
to fit many different application areas, including shortest path problems, vehicle routing problems, and
the traveling salesman problem, among many others. There are also a range of optimal and heuristic
solution procedures for solving instances of those problems. Research is ongoing to expand the types of
routing problems that can be solved, and the environments within which they can be applied.

INTRODUCTION tion networks. Routing is the most fundamental


logistical operation for virtually all transportation
Routing is the act of selecting a course of travel. and communications applications.
This process is undertaken by nearly every active As in the examples above, routing is frequently
person every day. The route from home to school seen as a practical effort to accomplish some goal.
or work is chosen by commuters. The selection Its importance to geoinformatics, however, lies in
of stops one will make for shopping and other the nature of routing as a general problem. Trans-
commercial activities and the paths between those portation, communications, or utility systems can
stops is a routing activity. Package delivery ser- all be modeled as networks—connected sets of
vices plan routes for their trucks in such a way edges and vertices—and the properties of networks
that packages are delivered within specified time can be examined in the context of the mathematical
windows. School buses are assigned routes that discipline of graph theory. Routing procedures can
will pick up and deliver children in an efficient be performed on any network dataset, regardless of
manner. Less tangible objects such as telephone the intended application. This chapter will discuss
calls or data packets are routed across informa- the formulation of routing problems including
shortest path problems, and will review in detail
DOI: 10.4018/978-1-4666-2038-4.ch005 general vehicle routing problems and the travel-

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Routing

ing salesman problem. Solution procedures for spanning tree problem, the Steiner tree problem,
routing problems are discussed and future trends the Traveling Salesman Problem, and the vehicle
in routing research are outlined. routing problem, among many others (Magnanti
& Wong, 1984). The modeling of these problems
frequently takes the form of integer programming
BACKGROUND models. Such models define an objective and a set
of constraints. Solution procedures are applied that
Generally, a routing procedure is based on an require decisions to be made that generate a route
objective—or goal—for the route, and a set of that optimizes the objective while respecting the
constraints regarding the route’s properties. By constraints. Given the limited space in this forum,
far the most common objective for routing prob- the following sections will focus on the modeling
lems is to minimize cost. Cost can be measured of two significant routing problems in an effort
in many different ways, but is frequently defined to demonstrate the characteristics of the general
as some function of distance, time, or difficulty class. Vehicle Routing Problems are presented in
in traversing the network. Thus the problem of order to discuss the range of possible objectives
locating the least cost or shortest path between for routing problems, and the Traveling Salesman
two points across a network is the most common Problem is presented to demonstrate the formula-
routing problem. It is also a problem for which tion of the objectives and constraints.
there are several extremely efficient algorithms
that can determine the optimal solution. The most Vehicle Routing Problems
widely cited algorithm that solves the least cost
path problem on directed graphs with non-negative Vehicle Routing Problems (VRPs) are those that
weights was developed by Edsgar Dijkstra (1959), seek to find a route or routes across a network
and an even more efficient version of this algo- for the delivery of goods or for the provision of
rithm—the two-tree algorithm—exists (Dantzig, transport services. From their earliest incarnations
1960). Alternative algorithms have been presented VRPs have been formulated as distance or cost
that will solve this problem where negative weights minimization problems (Clarke & Wright, 1964;
may exist (Bellman, 1958), where all the short- Dantzig & Ramser, 1959). This overwhelming
est paths from each node to every other node are bias persists to this day. Nine out of ten research
determined (Dantzig, 1966; Floyd, 1962), and articles regarding route design in the context of
where not only the shortest path but also the 2nd, transit routing written between 1967 and 1998 and
3rd, 4th, or kth shortest path must be found (Evans reviewed by Chien and Yang (2000) employed
& Minieka, 1992). a total cost minimization objective. When the
route is intended as a physical transport route,
the cost objective is nearly always formulated
NETWORK DESIGN PROBLEMS as a generalized measure of operator costs (List,
1990), user costs (Dubois et al., 1979; Silman et
The shortest path problem is just one of a class al., 1974), or both operator and user costs (Ceder,
of related routing problems that can be described 2001; Chien et al., 2001; Lampkin & Saalmans,
as network design problems. Network design 1967; Newell, 1979; Wang & Po, 2001).
problems require that some combination of the The few exceptions include a model that maxi-
elements of a network (edges and vertices) be cho- mizes consumer surplus (Hasselström, 1981), a
sen in order to provide a route (or routes) through model that seeks to maximize the number of public
the network. This group includes the minimal transport passengers (van Nes et al., 1988), a model

40
Routing

that seeks equity among users (Bowerman et al., and ending up at the initial starting location. The
1995), a model that seeks to minimize transfers TSP has its origins in the Knight’s Tour problem
while encouraging route directness and demand first formally identified by L. Euler and A. T.
coverage (Zhao & Gan, 2003), and a model that Vandermonde in the mid-1700s. In the 1800s, the
seeks to maximize the service provided to the problem was identified as an element of graph
population with access to the route (Curtin & theory and was studied by the Irish mathemati-
Biba, 2006). VRPs for transport services can cian, Sir William Rowan Hamilton. The problem
be designed to either determine single optimal was named the Hamiltonian cycle problem in his
routes, or a system of routes (Ceder & Wilson, honor (Hoffman A. J. & Wolfe P., 1985).
1986; Chakroborty & Dwivedi, 2002; List, 1990; The first known mention of the TSP under that
Silman et al., 1974) name appeared in a German manual published
A substantial subset of the literature posits that in 1832, and this was followed by four applied
routing problems are not captured well by any appearances of the problem in the late 1800s and
single optimization objective, but rather multiple early 20th century (Cook, 2001). The mathemati-
objectives should be considered (Current & Marsh, cian and economist Karl Menger publicized the
1993). Among the proposed multi-objective TSP in the 1920s in Vienna (Applegate D., 1998),
models are those that tradeoff maximal covering then introduced it in the United States at Harvard
of demand against minimizing cost (Current & University as a visiting lecturer, where the prob-
Pirkul, 1994; Current et al., 1984, 1985; Current lem was discussed with Hassler Whitney who at
& Schilling, 1989), those that seek to both mini- that time was doing his Ph.D. research in graph
mize cost and maximize accessibility in terms of theory. In 1932, the problem was introduced at
distance traveled (Current et al., 1987; Current & Princeton University by Whitney, where A. W.
Schilling, 1994), and those that tradeoff access Tucker and Merrill Flood discussed the problem
with service efficiency (Murray, 2003; Murray in the context of Flood’s school-bus routing study
& Wu, 2003). in New Jersey (Schrijver, 2004). Flood went on to
Regardless of the objective that is deemed popularize the TSP at the RAND Corporation in
appropriate for a routing application, the prob- Santa Monica, California in late 1940s. In 1956
lem will frequently be posited in the form of a Flood mentioned a number of connections of the
structured mathematical model. In the next section TSP with the Hamiltonian paths and cycles in
the Traveling Salesman Problem is presented to graphs (Flood, 1956). Since that time the TSP
demonstrate how such models are formulated. has been considered one of the classic models in
combinatorial optimization, and is used as a test
The Traveling Salesman Problem case for virtually all advancements in solution
procedures.
The Traveling Salesman Problem (TSP) is argu- There are many mathematical formulations for
ably the most prominent problem in combinatorial the TSP, with a variety of constraints that enforce
optimization. The simple way in which the prob- the requirements of the problem. Since this is not
lem is defined in combination with its notorious the appropriate forum for reviewing all of the po-
difficulty has stimulated many efforts to find an tential formulations, the formulation attributed to
efficient solution procedure. The TSP is a classic Vajda (Vajda, 1961) has been chosen in order to
routing problem in which a hypothetical salesman demonstrate how such a formulation is specified.
must find the most efficient sequence of destina- The following notation is used:
tions in his territory, stopping only once at each,

41
Routing

n = the number of cities to be visited; In addition to the above constraints the deci-
i and j = indices of cities that can take integer sion variables are constrained to be integer values
values from 1 to n in the range of 0 to 1: 0 ≤ x ijt ≤ 1.
t = the time period, or step in the route between Like any routing problem structured as an inte-
the cities ger program, in order to solve the TSP a procedure
xijt = 1 if the edge of the network from i to j is must be employed that allows decisions to be made
used in step t of the route, and 0 otherwise regarding the values of the decision variables. The
dij = the distance or cost from city i to city j choice of a solution procedure depends in part on
the difficulty of the routing problem and the size of
The objective function is to minimize the sum the problem instance being solved. The following
of all costs (distances) of all of the selected ele- section describes the combinatorial complexity
ments of the tour: of routing problems and the solution procedures
that can be used to solve them.
n n n

∑ ∑ ∑d x ij ijt

i =1 j =i t =1
SOLUTION PROCEDURES
The tour is subject to the following constraints: FOR ROUTING PROBLEMS
For all values of t, some arc must be traversed,
hence The TSP and most VRPs are considered to be in a
class of problems that are highly combinatorially
complex. There are, for example, (n – 1)! possible
∑∑x ijt
= 1 for all t.
tours for the TSP. Therefore, as the number of
i j
cities to visit, n, grows, the number of possible
tours grows very rapidly. So rapidly, in fact, that
For all cities, there is just one other city which
even small instances of these problems cannot
is being reached from it, at some time, hence
be solved by enumeration (the inspection of all
possible combinations).
∑∑x ijt
= 1 for all i. If this is the case these integer programming
j t
problems may be solved optimally using a ver-
sion of the simplex method to generate fractional
For all cities, there is some other city from optimal solutions from the linear programming
which it is being reached, at some time, hence relaxation of the integer program, followed by
a branch and bound search procedure to identify
∑∑x ijt
= 1 for all j. integer optimal solutions. A variety of reformula-
i t
tion techniques, preprocessing routines, and con-
straint elimination methods can assist in solving
When a city is reached at time t, it must be these problems.
left at time t + 1, in order to exclude disconnected However, these problems grow so quickly
subtours that would otherwise meet all of the above that it may be impractical or impossible to obtain
constraints. These subtour elimination constraints optimal solutions for large problem instances
are formulated as: even with specialized solution procedures. When
this is the case, alternate—though not guaranteed
∑x ijt
= ∑x jkt +1
for all j and t. optimal—solution procedures can be employed.
i k

42
Routing

These include formal heuristic (or approximate) traveler. More frequently, routing procedures are
methods to quickly find good routes (Bowerman et being conducted while conditions are constantly
al., 1995; Chien & Yang, 2000; Fan & Machemehl, subject to change (such as traffic conditions). The
2004; van Nes et al., 1988; Zhao & Gan, 2003), issue of routing in these dynamic environments
heuristics based on genetic algorithms (Chien et is a growing research area.
al., 2001; Tom & Mohan, 2003) or other proce-
dures with a stochastic element, heuristics that
incorporate expert user input in the process (Baaj CONCLUSION
& Mahmassani, 1995; Ceder & Wilson, 1986;
Lampkin & Saalmans, 1967; Shih et al., 1998), Routing problems are one of the most promi-
and heuristics that are entirely based on expert user nent and persistent problems in geoinformatics.
experience. The last of these is the most widely This large research area has a strong theoretical
used and perhaps the most important technique and foundation with ties to operations research and
is sometimes referred to as manual route planning management science. There are a wide variety of
(Dubois et al., 1979; Moorthy, 1997). The manual routing models to fit many different application
method can result in routes that are extremely areas, and there are a range of optimal and heuristic
near optimal, and the ability of such experts to solution procedures for solving instances of those
quickly react to customer demands and complaints problems. Research is ongoing to expand the types
is a valuable asset (Newell, 1979). However, the of routing problems that can be solved, and the
solutions determined heuristically in this way are environments within which they can be applied.
likely to be suboptimal and without an optimal
solution process there is no way to determine
how far from optimal the manual solutions are. REFERENCES
Lastly, there has recently been increased interest
in the use of simulation methods, such as agent Applegate D., B. R., Chvtal V., & Cook W.. (1998).
based modeling and cellular automaton models On the Solution of Traveling Salesman Problems.
to generate optimal routes (Jiang et al., 2003). Documenta Mathematica, 645-656.
Baaj, M. H., & Mahmassani, H. S. (1995). Hybrid
route generation heuristic algorithm for the design
FUTURE TRENDS IN of transit networks. Transportation Research
ROUTING RESEARCH Part C, Emerging Technologies, 3(1), 31–50.
doi:10.1016/0968-090X(94)00011-S
Since routing is such a fundamental activity, and
the problems can be so difficult to solve, there is Bellman, R. (1958). On a routing problem. Quar-
an ongoing need to improve the modeling of the terly of Applied Mathematics, 16(1), 87–90.
problems themselves and the procedures used to
Bowerman, R., Hall, B., & Calamai, P. (1995). A
solve them. In addition to these long term goals,
multi-objective optimization approach to urban
routing researchers are finding ways to take advan-
school bus routing: formulation and solution
tage of the flood of data regarding networks and
method. Transportation Research Part A, Policy
locations of objects and activities in order to use
and Practice, 29(2), 107–123. doi:10.1016/0965-
these data to identify optimal routes. Of particular
8564(94)E0006-U
interest is the modeling of “best” routes for trip
planning that do not necessarily minimize cost, but
rather maximize some function of benefit for the

43
Routing

Ceder, A. (2001). Operational objective functions Current, J. R., ReVelle, C. S., & Cohon, J. L.
in designing public transport routes. Journal of (1987). The median shortest path problem: a
Advanced Transportation, 35(2), 125–144. multiobjective approach to analyze cost vs.
accessibility in the design of transportation net-
Ceder, A., & Wilson, N. (1986). Bus network
works. Transportation Science, 21(3), 188–197.
design. Transportation Research Part B: Meth-
doi:10.1287/trsc.21.3.188
odological, 20B(4), 331–344. doi:10.1016/0191-
2615(86)90047-0 Current, J. R., & Marsh, M. (1993). Multiobjec-
tive transportation network design and routing
Chakroborty, P., & Dwivedi, T. (2002). Optimal
problems: Taxonomy and annotation. Euro-
route network design for transit systems using
pean Journal of Operational Research, 65, 4–19.
genetic algorithms. Engineering Optimization,
doi:10.1016/0377-2217(93)90140-I
34(1), 83–100. doi:10.1080/03052150210909
Current, J. R., & Pirkul, H. (1994). Efficient
Chien, S., & Yang, Z. W. (2000). Optimal feeder
algorithms for solving the shortest covering path
bus routes on irregular street networks. Journal
problem. Transportation Science, 28(4), 317–327.
of Advanced Transportation, 34(2), 213–248.
doi:10.1287/trsc.28.4.317
Chien, S., Yang, Z. W., & Hou, E. (2001). Genetic
Current, J. R., & Schilling, D. A. (1989). The
algorithm approach for transit route planning and
covering salesman problem. Transportation Sci-
design. Journal of Transportation Engineering,
ence, 23(3), 208–213. doi:10.1287/trsc.23.3.208
127(3), 200–207. doi:10.1061/(ASCE)0733-
947X(2001)127:3(200) Current, J. R., & Schilling, D. A. (1994). The
median tour and maximal covering tour prob-
Clarke, G., & Wright, J. W. (1964). Scheduling
lems: Formulations and heuristics. European
of vehicles from a central depot to a number of
Journal of Operational Research, 73, 114–126.
delivery points. Operations Research, 12(4),
doi:10.1016/0377-2217(94)90149-X
568–581. doi:10.1287/opre.12.4.568
Curtin, K. M., & Biba, S. (2006). The Transit Route
Cook, W. J. (2001). The Traveling Salesman
Arc-Node Service Maximization Problem. Euro-
Problem Paper presented at the Conference Name|.
pean Journal of Operations Research, In Review.
Retrieved Access Date| from URL|.
Dantzig, G. B. (1960). On the shortest route
Current, J. R., ReVelle, C. S., & Cohon, J. L.
through a network. Management Science, 6,
(1984). The application of location models to the
187–190. doi:10.1287/mnsc.6.2.187
multiobjective design of transportation networks.
Regional Science Review, 14, 1–23. Dantzig, G. B. (1966). All Shortest Routes in
a Graph. Paper presented at the International
Current, J. R., ReVelle, C. S., & Cohon, J. L.
Symposium, Rome.
(1985). The maximum covering / shortest path
problem: A multiobjective network design and Dantzig, G. B., & Ramser, J. H. (1959). The truck
routing formulation. European Journal of Opera- dispatching problem. Management Science, 6(1),
tional Research, 21, 189–199. doi:10.1016/0377- 80–91. doi:10.1287/mnsc.6.1.80
2217(85)90030-X
Dijkstra, E. W. (1959). A note on two problems in
connexion with graphs. Numerische Mathematik,
1, 269–271. doi:10.1007/BF01386390

44
Routing

Dubois, D., Bel, G., & Llibre, M. (1979). A set Magnanti, T. L., & Wong, R. T. (1984). Network
of methods in transportation network synthesis design and transportation planning: models and
and analysis. The Journal of the Operational algorithms. Transportation Science, 18(1), 1–55.
Research Society, 30(9), 797–808. doi:10.1287/trsc.18.1.1
Evans, J. R., & Minieka, E. (1992). Optimization Moorthy, N. V. R. (1997). Planning of integrated
Algorithms for Networks and Graphs (2nd ed.). transit network for bus and LRT. Journal of Ad-
New York: Marcel Dekker, Inc. vanced Transportation, 31(3), 283–309.
Fan, W., & Machemehl, R. B. (2004). Optimal Murray, A. T. (2003). A coverage model for im-
Transit Route Network Design Problem: Algo- proving public transit system accessibility and
rithms, Implementations, and Numerical Results expanding access. Annals of Operations Research,
(No. SWUTC/04/167244-1). Austin, TX: Center 123, 143–156. doi:10.1023/A:1026123329433
for Transportation Research.
Murray, A. T., & Wu, X. (2003). Accessibility
Flood, M. M. (1956). The traveling-salesman tradefoffs in public transit planning. Journal of
problem. The Journal of the Operational Research Geographical Systems, 5, 93–107. doi:10.1007/
Society, 4, 61–75. s101090300105
Floyd, R. W. (1962). Algorithm 97, shortest Newell, G. F. (1979). Some issues relating to the
path. Communications of the ACM, 5(6), 345. optimal design of bus routes. Transportation
doi:10.1145/367766.368168 Science, 13(1), 20–35. doi:10.1287/trsc.13.1.20
Hasselström, D. (1981). Public Transportation Schrijver, A. (2004). On the history of combina-
Planning: Mathematical Programming Approach. torial optimization. from http://homepages.cwi.
University of Gothenburg, Gothenburg, Sweden. nl/~lex/files/histco.pdf
Hoffman, A. J., & Wolfe, P. (1985). History. In Shih, M., Mahmassani, H. S., & Baaj, M. H.
The Traveling Salesman Problem - A Guided (1998). Planning and design model for transit
Tour of combinatorial Optimization (pp. 1-17). route networks with coordinated operations.
Chichester: John Wiley & Sons. Transportation Research Record, 1623, 16–23.
doi:10.3141/1623-03
Jiang, R., Hu, M. B., Jia, B., & Wu, Q. S. (2003).
Realistic bus route model considering the capacity Silman, L. A., Barzily, Z., & Passy, U. (1974).
of the bus. The European Physical Journal B, Planning the route system for urban buses.
34(3), 367–372. doi:10.1140/epjb/e2003-00233-1 Computers & Operations Research, 1, 201–211.
doi:10.1016/0305-0548(74)90046-X
Lampkin, W., & Saalmans, P. D. (1967). The de-
sign of routes, service frequencies, and schedules Tom, V. M., & Mohan, S. (2003). Transit route
for a municipal bus undertaking: a case study. network design using frequency coded genetic
Operations Research, 18(4), 375–397. algorithm. Journal of Transportation Engineer-
ing, 129(2), 186–195. doi:10.1061/(ASCE)0733-
List, G. F. (1990). Toward optimal sketch-level
947X(2003)129:2(186)
transit service plans. Transportation Research
Part B: Methodological, 24(5), 325–344. Vajda, S. (1961). Mathematical Programming.
doi:10.1016/0191-2615(90)90007-L Reading: Addison-Wesley Publishing Company.

45
Routing

van Nes, R., Hamerslag, R., & Immers, B. H. Heuristics: Procedures for quickly finding
(1988). Design of public transport networks. good alternate—though not guaranteed optimal—
Transportation Research Record, 1202, 74–83. solutions to routing problems.
Network: A connected set of edges and ver-
Wang, J. J., & Po, K. (2001). Bus routing strate-
tices.
gies in a transit market: A case study of Hong
Network Design Problems: A set of combi-
Kong. Journal of Advanced Transportation,
natorially complex network analysis problems
35(3), 259–288.
where routes across or flows through the network
Zhao, F., & Gan, A. (2003). Optimization of Transit must be determined.
Network to Minimize Transfers (No. BD015- Routing: The act of selecting a course of travel.
02). Miami: Lehman Center for Transportation Shortest Path Problem: The routing prob-
Research. lem of finding the shortest—or least cost—path
through a network.
Traveling Salesman Problem: The most
KEY TERMS AND DEFINITIONS prominent problem in combinatorial optimization,
defined as the routing problem where a hypotheti-
Graph Theory: The mathematical discipline cal salesman must find the most efficient sequence
related to the properties of networks. of destinations in his territory, stopping only once
at each, and end up at the initial starting location.

This work was previously published in Handbook of Research on Geoinformatics, edited by Hassan A. Karimi, pp. 246-253,
copyright 2009 by Information Science Reference (an imprint of IGI Global).

46
47

Chapter 6
Web-Based Geospatial Services:
Implementing Interoperability
Specifications

Iftikhar U. Sikder
Cleveland State University, USA

Aryya Gangopadhyay
University of Maryland - Baltimore County, USA

Nikhil V. Shampur
Cleveland State University, USA

ABSTRACT
This chapter characterizes the requirements of Geographic Information Systems (GIS) middleware and
its components for dynamic registering and discovering of spatial services specifically for collaborative
modeling in environmental planning. The chapter explores the role of Web services with respect to imple-
mentation standard and protocols and identifies implementation features for exposing distributed GIS
business logic and components via Web services. In particular, the chapter illustrates applications of the
interoperability specifications of Open GIS Consortium’s (OGC) Web Mapping Service and (WMS), Web
Processing Standards (WPS) with respect to implementation feature. The chapter demonstrates a proto-
type implementation of collaborative environmental decision support systems (GEO-ELCA- Exploratory
Land Use Change Assessment) where Web service-enabled middleware adds core functionality to a Web
mapping service. The application demonstrates how individual workspace-based namespaces can be used
to perform Web mapping functionality (such as spatial analysis in visualization) through the integration
of environmental simulation models to explore collective planning scenario. Built on OGC compliant
connector and supports WMS and WPS, the system includes interactive supports for geospatial data
query, mapping services and visualization tools for multi-user transactions.

DOI: 10.4018/978-1-4666-2038-4.ch006

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Web-Based Geospatial Services

INTRODUCTION between several geographically separate datas-


ets and perform varying degree of analysis and
Both public and private enterprises have recently geo-processing would inevitably require service
produced a surge of interest in Web applications oriented architecture (SOA) of core GIS which
for Geographic Information Systems (GIS). In would define the use of services to support the
recent years service oriented middleware has requirements of software users. In this context,
emerged as an essential ingredient in distributed GIServices may be defined as a self-contained,
systems (Alonso, 2004; Chang & Park, 2006; stateless spatial processing function which ac-
Chatterjee & Webber, 2004). This has triggered cepts one or more requests and returns one or
a new wave of enthusiasm in composition of more responses through a well-defined, standard
complex services in a meaningful way involv- interface. By having such services distributed
ing not only traditional alphanumeric data but all over the Internet and accessible in a uniform
complex geographic data and services (Jones & standard manner, it is possible to envision the
Taylor, 2004; Sikder & Gangopadhyay, 2003, integration of several spatial services (chaining of
2004). In particular, collaborative and groupware services) to provide higher levels of functionality
researches are being directed towards develop- to existing services (Peng & Tsou, 2003). For
ing reusable generic model and procedures, example, a typical GIS query “Find the nearest
which can be made to communicate each other Japanese restaurant along the highway” could
in any distributed system in a heterogeneous possibly be answered by chaining Web services
environment. Integrating Web service with such as geocoding points of interest, integrating
collaborative Geographic Information Systems transport networks, dynamic segmentation of
(GIS) has triggered new wave of researchers network, providing routing network, cartographic
who are composing dynamic services involving map rendering, and text-to-voice conversion.
complex geospatial objects and models (Balram This paper explores the role of Web services
& Dragic´evic, 2006). The growing need for a and their implementation standards and protocols
service oriented middleware for GIS is especially and characterizes features for distributed GIS for
realized in three main contexts: (1) to access GIS modeling collaborative spatial processes with
data from anywhere (2) to disseminate spatial respect to the current interoperability standards
information of analysis and exploration of spa- and specifications. In particular the paper the
tial patterns and relationships between disparate identifies the interoperability requirements of
GIS datasets and (3) to allow GIS modeling/ OGC’s (Open GIS Consortium) Web Mapping
processing tools and services to be downloaded Service and (WMS), Web Processing Standards
or uploaded over the internet by remote users to (WPS) with respect to implementation features.
work interactively by using existing Web brows- The rest of the paper is organized as follows:
ers rather than installing proprietary GIS software section 2 discusses the framework of service
locally on their machines (Peng & Tsou, 2003). integration in distributed GIS; section 3 discusses
The recent trend of geospatial computing is a implementation standard for interoperability-
gradual shift of traditional desktop GIS towards based distributed GIS; section 4 characterizes the
distributed GIS (also referred to as GIServices). essential features of distributed GIS components;
With technology moving at such a fast pace, and finally section 5 illustrates a prototype imple-
the services expected by a GIS user (including mentation of a system for geospatial resource
mobile ones) are quite demanding. The growing integration in environmental planning.
demand for users’ need to view relationships

48
Web-Based Geospatial Services

Progress in Distributed Spatial services the corresponding change is automati-


Services: Related Works cally reflected in all other organizations sharing
the same resource. Moreover, middleware-based
Collaborative GIS have been used in many access through a broker relies on a standard defini-
planning problems for solving semi-structured tion of “interfaces.” Geo-processing services can
or loosely structured decision problems in en- become very cumbersome in the absence of such
vironmental planning (Angelides & Angelides, interfaces. From the decision support point of
2000; Balram & Dragic´evic, 2006; Balram et view, having data access at the client’s end with-
al.,2003; Kingston et al., Carver, 2000) The Web out robust geo-processing capabilities amounts to
GIS implementation area mainly includes envi- little help. In a broker-based solution, the client
ronmental planning (ISikder & Gangopadhyay, has to pull massive amounts of data at his/her end
2002; Tuchyna, 2006), data dissemination (Hu, and manage it locally. Such approaches assume
1999; Schuurman & Leszczynski, 2006), commu- the client’s explicit ability to manipulate server
nity planning (Al-Kodmany, 2000; MacEachren connections and invoke remote objects. Thus, a
et al., 2006; Rao et al., 2007). While researchers frequent spatial process, such as a spatial join
continue to argue for an integration and structuring between data from two different servers, needs
of collaborative mapping and visualization tech- to be coordinated at the client’s end. Such object
nologies into spatial decision making (Armstrong, level manipulations of spatial processes often fail
1994; Balram & Dragic´evic, 2006; Jankowski & to provide a high-level view to the application
Nyerges, 2001; MacEachren, 2001; Nyerges & developer. Paradoxically, in a spatial decision
Jankowski, 2001), a Web-based GIS framework support system the user or decision maker’s view
designed to integrate stakeholders into the plan- on spatial features or geometry needs to be real-
ning process has yet to be realized. One of the ized at a higher level of abstraction while at the
major impediments to developing GIServices for same time maintaining the transparency of system
collaborative modeling is the lack of interoper- processes (Anderson & Moreno-Sanchez, 2003;
able component technologies. Heterogeneity of Rao et al., 2007; Tait, 2005).
geo-spatial systems has plagued GIS since its Unlike current component technologies,
inception (Goodchild et al., 1992; Stoimenov & however, Web Services do not use object model-
Djordjevic-Kajan, 2005; Worboys & Deen, 1991). specific protocols such as DCOM, RMI, or IIOP
Different agencies had built many different geo- that require specific, homogeneous infrastructures
graphic data models and systems, following their on both the client and service machines. While
native organizational interest and problem domain implementations tightly coupled to specific com-
(Egenhofer & Herring, 1991). The benefit of col- ponent technologies are perfectly acceptable in a
lective learning has not yet been fully realized, controlled environment, they become impractical
due to a lack of mechanism for reusable service on the Web. As the set of participants in an inte-
and models in participatory systems. grated business process changes and as technology
As a precursor to Web services, CORBA and changes over time, it becomes very difficult to
Microsoft’s DCOM or Java-based RMI were often guarantee a single, unified infrastructure among
used for distributed access and query of spatial all participants. Web Services take a different
data, sometimes integrated with DSS (Eldrandaly, approach; they communicate using ubiquitous
2006). However, being “tightly coupled” with na- Web protocols and data formats, such as HTTP
tive data structure, the broker-based services are and XML. Any system supporting these Web
unable to make sure that whenever an organization standards will be able to support Web Services
makes changes in their native data structure or (Peng & Tsou, 2003).

49
Web-Based Geospatial Services

In static web mapping the web client is a simple to include GIS content and capabilities, Intranet
web browser with the capability of handling HTML applications, or ArcGIS—on demand. ESRI’s
and Web Forms. A Web Form is a HTML Page ArcExplorer viewer is designed to support Web
with data entry fields for user input. The user services provided through its proprietary ArcIMS
inputs are collected and sent by the browser to server software. Microsoft’s MapPoint offers a
the server in a HTTP message. The web server street-mapping program that includes some de-
receives the message but cannot respond to it. mographic overlays. Users can buy the package
Since it does not understand any requests other or access MapPoint Web Services, which can be
than for HTML or other MIME-type documents, utilized to integrate other applications: location-
it passes the request to a back-end program. The based services, GPS car navigation, and decision
back end program is a traditional server side ap- support systems. MapInfo supports both OpenGIS
plication that does the actual processing. In the and the World Wide Web Consortium’s Web
case of web mapping, these back-end programs services standards, including SOAP, WSDL and
include map servers and DBMS servers. The web UDDI, in products such as its miAware software
server then returns the results to the client. Here, (MapInfo, 2007) (mobile location services plat-
the web server becomes middleware, connecting form for MapInfo) for developing location-based
the web client and the back end applications(Peng services. MapServer, an open environment, sup-
& Tsou, 2003). ports the creation of maps at the client’s side by
Web mapping of GIS applications has evolved serving as a map engine to provide database and
from the concept of simple static web publishing spatial content through the client’s browser. It also
of maps on a web page as static map images in supports many OGC’s specifications like WMS,
appropriate graphic formats (GIF, JPEG, PNG, non-transactional WFS, WCS, WMC, SLD, and
PDF etc). The client (web browser) makes re- GML (MapServer, 2007).
quests using standard HTTP constructs such as The concept of dynamic distributed computing
GET and POST to a web server, which returns paradigm calls for a metaphorical counterpart of
the requested map image file in appropriate file the so called LEGO blocks that can be interlocked
format. However, an effective Web mapping and stacked to create complex geospatial services
requires more than just making static making (Tsou & Buttenfield, 2002). The GIService and
maps; it requires processing queries and doing modules like LEGOs can be rearranged and
some limited spatial analysis on the server before restacked in a different configuration do a dif-
presenting the output in a specialized cartographic ferent task (Zipf & Jost, 2006). In Web services,
format on the standard web browser. Historically, a series of protocols such as eXtensible Markup
the Xerox PARC Map Viewer developed in 1994 Language (XML); Simple Object Access Proto-
was the earliest attempt to distribute GIS using col (SOAP); Web Service Description Language
HTTP server and CGI program(Putz, 1994). This (WSDL); and Universal Description, Discovery,
was followed by the development of an on-line and Integration (UDDI) - provides standards for
GIService of GRASSLinks prototype that mim- communication and collaborative processing ca-
icked traditional GISystem functions, such as map pacity among Web-service compliant architecture.
browsing and buffering overlay (Huse, 1995). A key advantage is that various GIS layers can
The Alexandria Digital Library offered a “digi- be dynamically queried and integrated while still
tal library” metaphor for publishing multimedia maintaining independence in a distributed envi-
content including georeferenced data. Among the ronment. One of the promises of Web Services
commercially available Web service-enabled ap- in the GIS world is up-to-date information since
plications, ESRI’s ArcWeb Services offer a way the data is stored with the service provider and

50
Web-Based Geospatial Services

expectedly kept up-to-date when accessed by the Implementing Interoperability


GIS web service consumer. From organizational Standards in Distributed GIS
point of view, this may be very appealing, for
local governments (such as counties and other A suite of emerging Web services standards
organizations) can still independently collect and from the Open GIS Consortium Inc. (OGC) is
manage data locally and integrate information and facilitating the transition of historically standalone
services using Web services. A client, for example, GIServices by enabling interoperability. Web
a transportation company, could directly access a services can put relevant GIS applications on the
local government’s base map without maintaining end user’s desktop by embedding them within
its own. At the same time the client can update familiar applications. The most intensively used
local government’s data from its own record. As standards include Web Map Service, Web Feature
far as data interoperability is concerned, extended Service and the XML-based Geography Markup
collaboration and partnerships using Web Services Language (GML)(OGC, 2007; Peng & Zhang,
could provide opportunity to open interfaces 2004; Siyuan, Griffiths, & Paton, 2007); they al-
and communication mechanisms for distributed low applications to access distributed spatial data
computing. Web services extend the use of GIS across the Internet to any OGC-enabled repository.
by making the integration with other applications
easier. Despite the promise of GIS Web services, Interoperability Requirements of
interoperability among GIS programs and data OGC’s WMS, WPS Standards
sources is far from seamless. Vendor adoption
of GIS Web services is still a work in progress. Although interactive mapping programs are
Interactive Web mapping demands more interac- popular, they suffer from common problems
tions between the user and the client interface and such as poor performance and limited functions.
more client-size processing and functionalities However the biggest problem is that they are
than the static web mapping applications. Also, often proprietary and not necessarily interoper-
CGI extensions such as Servlets, ASP, etc. are used able. Different web mapping programs were
to mitigate shortcomings of CGI. Most current developed in different database frameworks and
interactive Web CGI programs are based on this use different technologies. Integrating and sharing
model (dynamic viewer with CGI extensions). information among web mapping programs are
Interactive viewers include 1) DHTML viewer – arduous tasks. Furthermore, migrating technology
makes static HTML pages dynamic using client from one platform to another is very difficult.
side scripting (VBscript, javascript), DOM and Open GIS Consortium (OGC) has been making
CSS; 2) Java Applets – Executable java code efforts to develop a set of standards to guide the
downloaded from the server and executed on development of web mapping programs so that
the client at runtime. Applets can be integrated they can be interoperable. OGC developed Web
inside the web browser; and 3) ActiveX Con- Map Server (WMS) implementation information
trols – Modular pieces of software that perform specifications that are the first effort towards stan-
tasks and communicate with other programs and dardizing the implementations of web mapping
modules over the Internet via OLE. programs (Peng & Tsou, 2003).

51
Web-Based Geospatial Services

Web Map Server (WMS) Web Computer Graphics Metafile (WebCGM)


Implementation Specifications formats.
Web Map Service operations can be invoked
A Web Map Service (WMS) produces maps of using a standard web browser by submitting re-
spatially referenced data dynamically from geo- quests in the form of Uniform Resource Locators
graphic information. A “map” is defined to be a (URLs). The content of such URLs depends on
portrayal of geographic information as a digital which operation is requested. In particular, when
image file suitable for display on a computer a client requests a map, the URL indicates what
screen. The WMS specification stipulates that information is to be shown on the map, what
a mapping service should be able to at least 1) portion of the Earth is to be mapped, the desired
produce a map (as a picture, series of graphical coordinate reference system, and the output image
elements, feature data); 2) answer basic queries; width and height. When two or more maps are
and 3) inform other programs about its capabili- produced with the same geographic parameters and
ties (what maps it can produce and which can be output size, the results can be accurately overlaid
queried further)(OGC). OGC initially came up to produce a composite map. The use of image
with thin, medium and thick clients in its specifica- formats that support transparent backgrounds
tion, but abandoned them for they were inherently (e.g., GIF or PNG) allows underlying maps to
ambiguous in classifying clients this way. Instead be visible. Furthermore, individual maps can be
they use the kind of information presented at to requested from different servers. The Web Map
web client to categorize web mapping services Service thus enables the creation of a network of
into three cases: distributed map servers from which clients can
build customized maps. A basic WMS classifies
1. Picture case: Here a client’s request is an- its geographic information holdings into “Layers”
swered by a picture of the map and offers a finite number of predefined “Styles”
2. Graphic element case: Here the web cli- in which to display those layers.
ent receives a set of graphic elements (e.g.,
Scalable Vector Graphics (SVG)). The Requirements for a WMS over HTTP
picture element case may be considered as
a subset of graphic element case. The International Standards defines the implemen-
3. Data or feature case: Here geographic tation of the Web Map Service on a distributed
feature data is sent from the server to the computing platform (DCP) comprising Internet
client. hosts that support the Hypertext Transfer Pro-
tocol (HTTP). Thus the online resource of each
Currently the picture case is the most popular operation supported by a server is an HTTP URL
framework adopted by the GIS industry. It provides (Uniform Resource Locator). The URL may be
only for limited map display functions and user different for each operation, or the same, at the
interactions. Along with progress of web mapping discretion of the service provider. Each URL is
and information technologies, the data case and implementation-dependent, with only the query
the graphic element case have become slightly portion comprising the service request itself as
more popular. WMS-produced maps are generally defined by the International Standard. The HTTP
rendered in a pictorial format such as PNG, GIF or supports two request methods: GET and POST.
JPEG, or occasionally as vector-based graphical One or both of these methods may be offered by
elements in Scalable Vector Graphics (SVG) or a server, and the use of the Online Resource URL
differs in each case. Support for the GET method

52
Web-Based Geospatial Services

is mandatory; support for the POST method is Web Processing Service


optional. A Web Map Service shall support the
“GET” method of the HTTP protocol while the Web processing service exposes pre-programmed
Web Map Service may support the “POST” method calculations for geospatial data to the Internet. The
of the HTTP protocol. Web Processing Service (WPS) interface specifies
WPS operations that can be requested by a client
Operations on Web Map Services and performed by a WPS server. Those operations
are as follows:
The three operations defined for a Web Map
Service are as follows: • GetCapabilities: This operation allows a
client to request and receive back service
1. GetCapabilities: The purpose of the manda- metadata (or Capabilities) documents that
tory GetCapabilities operation is to obtain describe the abilities of the specific server
service metadata, which is a machine read- implementation.
able (and human-readable) description of the • DescribeProcess: This operation allows a
server’s information content and acceptable client to acquire more specific information
request parameter values. about an Execute operation provided by
2. GetMap: The GetMap operation returns a the WPS, including the input parameters
map. Upon receiving a GetMap request, a and formats, and similarly the outputs.
WMS shall either satisfy the request or issue • Execute: This operation allows a client to
a service exception. run a specified process of the WPS with
3. GetFeatureInfo: GetFeatureInfo is an qualified input parameters and values.
optional operation. The GetFeatureInfo
operation is designed to provide clients of a Implementing GetCapabilites and
WMS with more information about features DescribeProcess through WSDL
in the pictures of maps that were returned by
previous Map requests. The canonical use These operations have many similarities to other
case for GetFeatureInfo is that a user sees OGC Web Services such as Web Mapping Service
the response of a Map request and chooses (WMS), Web Feature Service (WFS), etc. The
a coordinate (i, j) on that map from which GetCapabilites and DescribeProcess are imple-
to obtain more information. The basic op- mented using web services through WSDL which
eration provides the ability for a client to is an XML format for describing Web Services.
specify which pixel is being asked about, WSDL describes the public interface to the web
which layer(s) should be investigated, and service using XML-based service description.
what format the information should be It specifies how to communicate using the web
returned in. Since WMS protocol is state- service, namely the protocol bindings and message
less, the GetFeatureInfo request indicates formats required to interact with the web services
to the WMS what map the user is viewing listed in its directory. The supported operations
by including most of the original GetMap and messages are described abstractly, and then
request parameters. The actual semantics of bound to a concrete network protocol and mes-
how a WMS decides what to return more sage format. WSDL is often used in combination
information about or what exactly to return with SOAP and XML Schema to provide web
are left up to the WMS provider. services over the Internet (Chang & Park, 2006).
A client (program) connecting to a web service

53
Web-Based Geospatial Services

can read the WSDL to determine what functions There already exists a wealth of business
are available on the server. Any special data types logic that has been implemented for providing
used are embedded in the WSDL file in the form various web mapping services. Exposing existing
of XML Schema. The client can then use SOAP GIS business functionality (geospatial analysis,
to actually call one of the functions listed in the processing, etc.) via web services is an essential
WSDL. This amounts to achieving the same ob- aspect. Web services provide standard interfaces
jectives as OGC WPS operations, GetCapabilities via WSDL needed for a distributed GIS architec-
and DescribeProcess. ture. Server side components can be implemented
using any distributed computing framework such
Characterizations of Components as J2EE or.NET. However to provide the neces-
in a Distributed GIService sary standard interface, the functionality must be
exposed via web services.
What follows is the characterization of the com- Figure 1 shows the various components
ponent of distributed mapping services of GIS typically found in the architecture of a distributed
with respect to dynamically distributed GIS. GIS provider. It consists of a web server as the
We have seen that that although web server and front end and interfaces with the client. The web
map server perform important tasks, they cannot server communicates with the application server
directly communicate since each talk different and other related middleware, all of which are
languages. Web servers use HTTP and HTML responsible for the business login that fulfills the
while map servers have different query structures user’s request. The map server is responsible for
and formats, so they rely on in-between programs generating the requested maps. The data server is
to translate between them. Therefore, for a robust the data store where the geographic data is housed.
distributed GIservice-oriented middleware should To provide for higher scalability, we use multiple
include the following things: map servers and multiple web servers. Since
many map servers and data servers are involved,
• Establishing connections between web we should also include catalog services, data
server and map server repositories and load balance services. A catalog
• Translating user requests from the web service is used to keep track of what functions
server and passing them to the map server each map server can provide. Data repository is
for processing a registration service that keeps track of the data
• Translating output from map server to out- types and location of data sets; it also manages
put expected by the web server (such as multiple data servers by filtering the right data
HTML) which will be forwarded to the set from the appropriate data server.
web browser

In addition, GIS middleware performs value GEO-ELCA: COLLABORATIVE


added service such as GISERVICE ARCHITECTURE

• Managing concurrent requests, load In this section we illustrate a prototype imple-


balancing mentation of a Web-based spatial decision support
• Managing state, transaction management system for collaborative planning of urban land use
• Security change evaluation. GEO-ELCA allows the various
features of GIS services on the Web. The system
allows dynamic selection of a feature type (i.e.,

54
Web-Based Geospatial Services

Figure 1. Exposing component functionality via Web service

polygon – from land use theme) interactively, so a complex emergent environment. These parties
that a user can change attribute items and identify (henceforth termed as “agent, “decision maker”,
a feature property. A user can initiate a change in or “stakeholder” interchangeably) must have an
land use type by graphically selecting a polygon. integrated data access from heterogeneous sources
The server side application processes the request to integrate with transparent high performance
and makes necessary updates in the database computing resources to compose decision models
to reflect the corresponding the changes of the dynamically. However, in real life situations, it is
pollutant coefficients. Every request to change often difficult to achieve the stakeholders’ views
in land use category results in a recalculation of or effective patterns of social interactions in the
the mass export of pollutants and corresponding planning process. For example, decisions on how
statistics. The processed result is sent back to current land use should be changed depend on
the Web server and then to the client side. The legal, environmental, regulatory constraints as
user decision profile is then input to a simulation well as biases and preferences of different group
model to estimate the yearly pollution load. The or institutions. In particular, understanding urban
system integrates a simulation model commonly land use dynamics involves considerations of the
used in urban hydrology--the so-called “Simple complex behavior of individual decision makers
Method”(Schueler, 1999) for estimating exports and the interaction of local and regional institutions
of various pollutants runoff from different land in multiple scales. Moreover, such decisions are
uses. The output is then visualized as pollutant inherently spatial in nature because the change
distribution in terms of different classification in a particular parcel may have direct or indirect
schemes (e.g., standard deviation, plain break, consequences to the neighboring parcels.
quantile, etc.) with a modified map legend. For example, EPA’s Brownfields development
program involves the expansion and redevelop-
Multi-User Context in Distributed GIS ment of urban areas that may be complicated by
the potential presence of a hazardous substance,
In view of a decision support framework, col- pollutant, or contaminant (EPA, 2007). While
laborative modeling presupposes multiple parties cleaning up and reinvesting in these properties
with different perspectives working together in takes development pressures off of undevel-

55
Web-Based Geospatial Services

oped, open land and improves the environment, review the planning scenario of other users. The
evaluating a candidate property to determine if system logs individual users’ preferences which
it meets the criteria for redevelopment is inher- can be used as input into mediating algorithm
ently a collective decision process. For instance, (e.g., genetic algorithm) to resolve conflicts among
an individual landowner may act from his or her user preferences of land use choice and thereby
individual interest; however, in the long run the optimize pollution scenarios.
overall land use scenario may be undesirable to
everyone. Moreover, changes in land use may Distributed Mapping Services
create concern for environmental impact in the
surrounding region. While land-use changes are GEO-ELCA offers cartographic representation
often identified as a major driving force of ecologi- with a specialized visualization feature aided with
cal changes, with the conversion of land use from dynamic legend rendering and symbol manipu-
one category to another there is an overall change lation. The GEO-ELCA’s cartographic features
of hydrological characteristics resulting from the include the following:
changes of impervious areas. Consequently, there
is an increase of volume and peak flow of pos- • A visualization of pollution potential
sible increase in the concentration of pollutants, map based on different themes (e.g., total
which potentially could deteriorate the environ- Nitrogen, total BOD etc.)
ment. Hence, a centralized planning process is • Multiple representations and visualizations
essentially inadequate to reflect group dynamics. of the same data based on different statisti-
Such group-individual dilemmas make it ideally cal classification schemes (Equal Interval,
suited for collaborative planning in a distributed Standard Deviation, etc.)
environment, particularly in the seamless integra- • Visualization of a user-specified color
tion of Web-services in accessing geospatial data ramp of customized legend
and models for environmental planning. • Enhanced navigational options for spatial
The resulting pollution map can be visualized overlay of multiple geographic layers
with multiple theme overlay (Figure 2). A user can

Figure 2. User specific visualization to change polygon attributes in GEO-ELCA

56
Web-Based Geospatial Services

Figure 2 shows a typical example where users Operation (changeLandUsageType) messages


graphically select a polygon and choose a land which are XML encoded and enclosed in SOAP
use category from the pop up list (upon right 1.1 messages (see Figure 3). The construction of
click) to simulate the “what if” kind of pollution SOAP messages is done using client side script-
scenario. The simulation model estimates the ing such as JavaScript, VBScript and can also
pollution potential as a result of land use change. makes use of advanced features such as AJAX
For example, users can visualize the pollution (Sayar, Pierce, & Fox, 2006). Built on top of the
characteristics of total Nitrogen or total BOD, ESRI’s ArcIMS, the system uses ArcXML (native
etc., of the selected polygon. The changes made XML encoding of spatial object) to communicate
in the database by the user (user-1) are completely between the custom middleware and the Web
invisible to other users (e.g., user-2). User-2 has mapping server. The services offered by the Web
an option to concurrently perform the similar Mapping Server are similar to the requirements
operation independently without being effected of OGC’s implementation specification for a web
by User-1’s. The result of the simulation will map service (WMS). The only difference is that
be different if different options are chosen. The the communication language is ArcXML whereas
visualization and cartographic representations the OGC international standard defines a more
can be locally customized by users. In each case, generic XML format. The communication with
the corresponding legends and color rendering client and Web server involves ArcXML in both
services are implemented accordingly. directions, whereas OGC specifies that only re-
sponse parameters need to be an XML document
Service Integration in GEO-ELCA while request parameters may be in formats such
as Key Value Protocol (KVP) encoding-- com-
GEO-ELCA’s main feature provides users with monly used in HTTP GET and POST. It should be
a meaningful composition of services to perform noted that a bidirectional mapping of request and
an exploratory analysis and assessment of the response parameters from OGC format to ArcXML
consequence of users’ decisions in environmen- format is quite simple and can be achieved by a
tal planning and compares the results with the middleware solutions. Table 1. shows ArcXML
collective decision. The system is built on an equivalent of OGC WMS operation.
OGC-compliant connector that supports access The following example illustrates how the
to WMS and WPS. services rendered by the customized middleware
The client sends WMS Request (GetMap, uses ArcXML to map directly to the operations
GetCapabilities, GetFeatures) and WPS Execute needed to conform to the OGC standard for a

Figure 3. Overview of service integration mechanism of GEO-ELCA

57
Web-Based Geospatial Services

Table 1. ArcXML equivalent of OGC Web mapping


AYERS=polygon&STYLES=None&SRS=None&BB
service operations
OX=418755.237974071,4569670.72466017,
OGC WMS Operation ArcXML Request Operation 468691.884383251,4608998.26066327&WID
GetCapabilities GET_SERVICE_INFO
TH=786&HEIGHT=400&FORMAT=image/gif
GetMap GET_IMAGE
GetFeatures GET_FEATURES
This is mapped in GEO-ELCA according to the
corresponding request to the web mapping server
which is in ArcXML. GET_IMAGE Request sent
to the web map server (in ArcXML) looks like
WMS (and a WPS). The operation offered by the
what follows:
WPS which is implemented by our middleware
are as follows:
<?xml version=”1.0” encoding=”UTF-8”
?>
void changeLandUsageType(string: use-
<ARCXML version=”1.1”>
rID, long: minX, long: minY, long:
<REQUEST>
maxX, long: maxY, int: newLandType)
<GET_IMAGE>
<PROPERTIES>
The operation allows users to change the land
<ENVE-
type graphically underlying GIS data. The input
LOPE minx=”418755.237974071”
agreements are as follows:
miny=”4569670.72466017”
maxx=”468691.884383251
userID - Identifier of the user (to

perform data manipulation on user
maxy=”4608998.26066327” />
specific data)
<IMAGESIZE
minX, minY, maxX, maxY – bounding co-
height=”400” width=”786” />
ordinates of land whose land type is
to change
newLandType – Identifier of class of
<LAYERS>
Land Type to which the land must be
<LAYER
changed.
name=”polygon” id=”0” />
<LAYERS>
For example, a simple request for change of
<LEG-
land use is encoded using KVP encoding
END title=”Legend” font=”Arial”
width=”170” height=”300”
http://localhost/service?user=tester
autoextend=”true” backgroundcol-
&minX=23.45&minY=24.34&maxX=45.67&ma
or=”255,255,255” />
xY=67.8
</PROPERTIES>
OGC Complaint GET MAP Request (Encod-
<LAYER type=”acetate”
ed using KVP Key-Value Pair and send
name=”theNorthArrow”>
using HTTP-GET)
<OBJECT
http://localhost:8080/poly_Project/sw
units=”pixel”>
itchUser?VERSION=1.1&REQUEST=GetMap&L
<NORTHAR-
ROW type=”4” size=”15” coords=”20
35” shadow=”32,32,32” angle=”0”

58
Web-Based Geospatial Services

antialiasing=”True” /> and have the specified spatial reference system,


</OBJECT> bounding box, size, and format. Every GET_IM-
</LAYER> AGE request to the web map server returns the
<LAYER type=”acetate” URL of the generated file which has a unique
name=”theScaleBar”> filename. The URL of the image returned by the
<OBJECT web map server complies with the expected OGC
units=”pixel”> Complaint GET_MAP Response.
<SCALE-
BAR screenlength=”157” coords=”471 ‘Virtual’ Private User Workspace
3” mode=”geodesic” fontcolor=”0,0,0” for Model Preference
fontstyle=”Regular” width=”5” />
</OBJECT> In this section we examine the database design
</LAYER> which allows the WPS operation to be performed
</GET_IMAGE> and also allows for collaborative modeling us-
</REQUEST> ing a single underlying GIS data set. In order
</ARCXML> to generate multi-use profiles for collaborative
modeling, a relational model was developed
GET_IMAGE Response received from Map providing each user a ”virtual private” (note: can
Server: you say “virtually private”?) workspace. Each
user is associated with a native profile database,
<?xml version=”1.0” while in fact the actual visualization is rendered
encoding=”UTF-8”?> from a single dataset. The GIS layer includes
<ARCXML version=”1.1”> shapefile--a non-topological data structure com-
<RESPONSE> monly used to represent GIS data in polygon
<IMAGE> features. Every polygon has an associated area,
<ENVE- perimeter, and land usage class (represented by
LOPE minx=”405084.257055615” class ID and described in human-readable form as
miny=”4569670.72466017” a Class Label field). We demonstrate two use case
maxx=”482362.865301707” scenarios of the collaborative modeling. In case
maxy=”4608998.26066327” /> I, the user performs default services that involve
<OUTPUT url=”http:// normal backup, replacement, map generation and
maestro/output/poly_project_MAE- restoration of data (Figure 4). In case II the user
STRO47223362.jpg” /> specifies a land change operation, so after the
<LEGEND url=”http:// backup, the user’s table is updated to reflect the
maestro/output/poly_project_MAE- land change request and then the same replace,
STRO47223363.jpg” /> map generation and restore operations follow,
</IMAGE> similar to case I. The sequences are as follows:
</RESPONSE>
</ARCXML> 1. The operations begin with a request from
the client sent to the web server (not shown
This is mapped to an OGC complaint GET_ in diagram).
MAP response. The response to a valid GetMap 2. The middleware receives the request, backs
request must be a map of the georeferenced up the master data (backing up only the re-
information layer requested, in the desired style,

59
Web-Based Geospatial Services

Figure 4. Collaborative feature of GEO-ELCA in the distributed GIS systems

quired amount of data) into a backup data polygon values prior to switching master data
store. contents with user data contents. The changes
3. The middleware then updates the spatial made by the users are reflected on their respec-
layer with user specific data. tive data table and hence changes can be tracked.
4. Then the middleware forwards the request for We note that the response to a changeLandType
map to the web map server after converting request is a Map Image response showing the
the request to a format understandable by the visualization Map reflecting changed land type
web map server. In GEO-ELCA ArcXML on map (see Figure 6).
is the language of communication with Figure 5 illustrates the sequence of operations
ArcIMS. when the user performs a land use change. The
5. The web map server queries the data store sequences are as follows:
to retrieve the data it needs to generate the
map 1. The operations begin with a request from
6. The records returned consist of current user’s the client sent to the web server (not shown
data. in diagram).
7. A map image is generated using the updated 2. The middleware receives the request, and
(current user’s) data and the map image is then it updates the specific user’s data.
returned to the middleware solution. 3. It then backs up the master data (backing
8. The master data is restored to original values up only the required amount of data) into a
by updating records from backup data store backup data store.
9. The middleware solution returns the map 4. The middleware then updates the spatial
image (or URL) of the newly generated user layer with user specific data.
specific Map. 5. Then the middleware forwards the request for
a map to the web map server after converting
A change of land type request issued involves the request to a format understandable by the
a number steps when compared to GetFeatures web map server. In our case ArcXML is the
or GetMap requests. The additional step that is language of communication with ArcIMS.
performed by the middleware are steps 3 and 4
updates the user’s table to reflect the change in

60
Web-Based Geospatial Services

Figure 6. Sequence diagram for changing land use category by collaborative editing of polygon attributes

6. The web map Server queries the data store 9. The master data is restored to original values
to retrieve the data it needs to generate the by updating records from backup data store.
map 10. The middleware solution returns the map
7. The records returned consist of current user’s image (or URL) of the newly generated user
data. specific Map.
8. A map image is generated using the updated
(current user’s) data and the map image is
returned to the middleware solution.

Figure 5. Using Case Scenario II – User requests to edit a map to change polygon land type

61
Web-Based Geospatial Services

CONCLUSION individual decisions). Each user’s planning profile


can finally be compared for further modification.
This article presents a conceptual and prototypical The collaborative model espoused here is geared
implementation of a Web service-based geospatial towards archiving different opinions regarding
resource integration scheme. Since Web services visualization and negotiation. We have also noted
avoid problems of tightly coupled distributed that before GIservices can be wrapped with SOAP
object techniques and exposes an application pro- API, the generic geo-processing model formalism
gramming interface over the Web, it holds promise is needed to link models to the domain specific
for distributed resource sharing and collabora- application. A formal description of the spatial data
tive environmental planning. By wrapping and component and process model will allow better
dynamically integrating remote geo-processing interoperation among heterogeneous systems.
services from multiple sources, one can develop Future extension of GEO-ELCA will include an
an emergent collaborative system using interop- ontology-driven semantic layer for high level
erable standards of XML and SOAP (Chang & communication with geo-processing services.
Park, 2006). We introduced architecture for Web
service-based collaborative modeling applications
for environmental planning. The prototype system REFERENCES
is based on the conversion of the OGC compat-
ible GIS services to Web Services for flexible Al-Kodmany, K. (2000). Using Web-based tech-
collaboration. The prototype architecture offers nologies and geographic information systems in
functionalities to integrate disparate spatial data community planning. Journal of Urban Technol-
processing services into meaningful compositions ogy, 7, 1–30. doi:10.1080/713684108
to perform complex exploratory analysis such as Alonso, G. (2004). Web services: concepts, ar-
the assessment of the environmental consequences chitectures and applications. Berlin: Springer.
of individual decision makers’ choices and the
collective behavior that results.The significant Anderson, G., & Moreno-Sanchez, R. (2003).
advantage of the architecture is that it provides a Building Web-based spatial information solu-
means for self-describing services in which the tions around open specifications and open source
end user (any WMS/WPS client) can invoke a software. Transactions in GIS, 7(4), 447–466.
set of services (e.g., catalog services of spatial doi:10.1111/1467-9671.00158
map). Specifically the system makes use of OGC’s
Angelides, M., & Angelides, M. C. (2000). Using
GetCapabilities, GetMap and GetFeaturesInfo
multimedia database information systems over
to create a user-specific “on-the-fly” map layer
the internet for enhancing the planning process
from existing spatial data layers. The composition
for dealing with the built heritage. International
GIS components allow development of collab-
Journal of Information Management, 20, 349–367.
orative applications (e.g., dynamic user specific
doi:10.1016/S0268-4012(00)00028-1
plan profiling and customization of cartographic
visualization and rendering). In terms of collab- Armstrong, M. (1994). Requirements for the de-
orative decision-making, the added advantage velopment of GIS-based group decision support
is that community based geo-spatial vocabulary systems. Journal of the American Society for Infor-
and the corresponding modeling semantics can be mation Science American Society for Information
communicated effectively (e.g., the consequence Science, 45(9), 669–677. doi:10.1002/(SICI)1097-
of user decision can be simulated graphically to 4571(199410)45:9<669::AID-ASI4>3.0.CO;2-P
estimate the yearly pollution load as a result of

62
Web-Based Geospatial Services

Balram, S., & Dragic’evic, S. (2006). Modeling Hu, S. (1999). Integrated multimedia approach
collaborative GIS processes using soft systems to the utilization of an Everglades vegetation
theory, UML and object oriented design. Transac- database. Photogrammetric Engineering and
tions in GIS, 10(2), 199–218. doi:10.1111/j.1467- Remote Sensing, 65(2), 193–198.
9671.2006.00253.x
Huse, S. M. (1995). GRASSLinks: A new model
Balram, S., Dragicevic, S., & Meredith, T. (2003). for spatial information access for environmental
Achieving effectiveness in stakeholder participa- planning. Retrieved from http://www.regis.berke-
tion using the GIS-based collaborative spatial ley.edu/sue/phd/
Delphi methodology. Journal of Environmen-
Jankowski, P., & Nyerges, T. (2001). Geo-
tal Assessment Policy and Management, 5(3),
graphic information systems for group decision
365–339. doi:10.1142/S1464333203001413
making:Towards a participatory geographic in-
Chang, Y.-S., & Park, H.-D. (2006). XML Web formation science. New York: Taylor and Francis.
service-based development model for Internet
Jones, M., & Taylor, G. (2004). Data integration
GIS applications. International Journal of Geo-
issues for a farm decision support system. Trans-
graphical Information Science, 20(4), 371–399.
actions in GIS, 8(4), 459–477. doi:10.1111/j.1467-
doi:10.1080/13658810600607857
9671.2004.00196.x
Chatterjee, S., & Webber, J. (2004). Developing
Kingston, R., Carver, S., Evans, A., & Turton,
enterprise Web services: an architect’s guide. Up-
I. (2000). Web-based public participation geo-
per Saddle River, NJ: Prentice Hall PTR.
graphical information systems: An aid to local
Egenhofer, M., & Herring, J. R. (1991). High- environmental decision-making. Computers,
level spatial data structures for GIS. In D. J. Environment and Urban Systems, 24(2), 109–125.
Maguire, M. F. Goodchild & D. W. Rhind (Eds.), doi:10.1016/S0198-9715(99)00049-6
Geographical information systems: principles
MacEachren, A. M. (2001). Cartography and
and applications (Vol. 1, pp. 227-237). London:
GIS: Extending collaborative tools to support
Longman Scientific Publications.
virtual teams. Progress in Human Geography,
Eldrandaly, K. A. (2006). A COM-based expert 25, 431–444. doi:10.1191/030913201680191763
system for selecting the suitable map projection
MacEachren, A. M., Pike, W., Yu, C., Brewer, I.,
in ArcGIS. Expert Systems with Applications,
Gahegan, M., & Weaver, S. D. (2006). Building
31(1), 94–100. doi:10.1016/j.eswa.2005.09.008
a geocollaboratory: Supporting human-environ-
EPA. (2007). Brownfields cleanup and redevelop- ment regional observatory (HERO) collaborative
ment. Retrieved January 2007, from http://www. science activities. Computers, Environment and
epa.gov/brownfields/ Urban Systems, 30(2), 201–225. doi:10.1016/j.
compenvurbsys.2005.10.005
Goodchild, M. F., Hanning, R., & Wise, S. (1992).
Integrating GIS and spatial data analysis: problems MapInfo. (2007). miAware2.0 documentation
and possibilities. International Journal of Geo- library. Retrieved May 2007, from http://refer-
graphical Information Systems, 6(5), 407–423. ence.mapinfo.com/common/docs/mapxtend-dev-
doi:10.1080/02693799208901923 web-none-eng/
MapServer. (2007). MapServer. Retrieved May
2007, from http://mapserver.gis.umn.edu/

63
Web-Based Geospatial Services

Nyerges, T., & Jankowski, P. (2001). Geographic Schuurman, N., & Leszczynski, A. (2006). Ontol-
Information Systems for Group Decision Making. ogy-based metadata. Transactions in GIS, 10(5),
London: Taylor & Francis. 709–726. doi:10.1111/j.1467-9671.2006.01024.x
OGC. (2007). OpenGIS. Geography markup Sikder, I., & Gangopadhyay, A. (2002). Design and
language (GML) implementation specification. Implementation of a Web-based collaborative spa-
Retrieved May 2007, from http://opengis.net/gml/ tial decision support system: Organizational and
managerial implications. Information Resources
OGC. Web map service interface (No. 03-109r1):
Management Journal, 15(4), 33–47.
OGC.
Sikder, I. U., & Gangopadhyay, A. (2003). Dis-
Peng, Z.-R., & Tsou, M.-H. (2003). Internet GIS:
tributed data warehouse for go-spatial services.
Distributed geographic information services for
In G. Grant (Ed.), ERP & data warehouse in
the Internet and wireless networks. Wiley.
organizations: Issues and challenges (pp. 132-
Peng, Z.-R., & Zhang, C. (2004). The roles of 145). IRM Press.
geography markup language (GML), scalable
Sikder, I. U., & Gangopadhyay, A. (2004). Col-
vector graphics (SVG), and Web feature service
laborative decision making in Web-based GIS.
(WFS) specifications in the development of
In M. Khosrow-Pour (Ed.), Advanced topics in
Internet geographic information systems (GIS).
information resources management (Vol. 3, pp.
Journal of Geographical Systems, 6(2), 95–116.
147-162). Idea Group Publishing.
doi:10.1007/s10109-004-0129-0
Siyuan, F., Griffiths, T., & Paton, N. W. (2007).
Putz, S. (1994). Interactive information services
GML for Representing data from spatio-historical
using World-Wide Web hypertext. Retrieved 2006
databases: A case study. Transactions in GIS, 11(2),
from http://www2.parc.com/istl/projects/www94/
233–253. doi:10.1111/j.1467-9671.2007.01043.x
mapviewer.html
Stoimenov, L., & Djordjevic-Kajan, S. (2005).
Rao, M., Fan, G., Thomas, J., Cherian, G.,
An architecture for interoperable GIS use in a
Chudiwale, V., & Awawdeh, M. (2007). A web-
local community environment. Computers &
based GIS decision support system for manag-
Geosciences, 31(2), 211–220. doi:10.1016/j.
ing and planning USDA’s conservation reserve
cageo.2004.09.017
program (CRP). Environmental Modelling &
Software, 22(9), 1270–1280. doi:10.1016/j.env- Tait, M. G. (2005). Implementing geoportals:
soft.2006.08.003 applications of distributed GIS. Computers,
Environment and Urban Systems, 29(1), 33–47.
Sayar, A., Pierce, M., & Fox, G. (2006). Integrat-
ing AJAX approach into GIS visualization Web Tsou, M. H., & Buttenfield, B. (2002). A dynamic
services. In Telecommunications, 2006. AICT- architecture for distributing geographic informa-
ICIW ‘06. International Conference on Internet tion services. Transactions in GIS, 6(4), 355–381.
and Web Applications and Services/Advanced doi:10.1111/1467-9671.00118
International (pp. 169-169).
Tuchyna, M. (2006). Establishment of spatial data
Schueler, T. (1999). Microbes and urban water- infrastructure within the environmental sector
sheds. Watershed Protection Techniques, 3(1), in Slovak Republic. Environmental Modelling
551–596. & Software, 21(11), 1572–1578. doi:10.1016/j.
envsoft.2006.05.014

64
Web-Based Geospatial Services

Worboys, M., & Deen, S. M. (1991). Seman- Zipf, A., & Jost, M. (2006). Implementing adaptive
tic heterogeneity in distributed geographic mobile GI services based on ontologies: Examples
databases. SIGMOD Record, 20(4), 30–34. from pedestrian navigation support. Computers,
doi:10.1145/141356.141366 Environment and Urban Systems, 30(6), 784–798.
doi:10.1016/j.compenvurbsys.2006.02.005

This work was previously published in Web Engineering Advancements and Trends: Building New Dimensions of Information
Technology, edited by Ghazi I. Alkhatib and David C. Rine, pp. 281-297, copyright 2010 by Information Science Reference
(an imprint of IGI Global).

65
66

Chapter 7
A Brief Cartography of
Smart Cameras:
Proactive Surveillance and Control

Fernanda Bruno
Universidade Federal do Rio de Janeiro, Brazil

ABSTRACT
This chapter carries out a brief cartography of the so-called “intelligent” video surveillance systems.
These systems are programmed to accomplish real time automated detection of situations considered
irregular and/or suspicious in specific environments, in order to predict and prevent undesirable events.
Three aspects of the smart cameras are focused in this cartography. First, the author explores its regime
of visibility and note how it prioritizes the capture of irregularities in the body’s movements in urban
space. Second, the author shows how the type of monitoring and profiling of bodies and behaviors in
these systems generally acts at the visible, surface and infra-individual level of human conduct. Finally,
he analyzes the temporality of smart cameras, especially in its proactive dimension that intends to
foresee and intervene, in real time, in future events. The analysis of these three aspects of the intelligent
video surveillance identifies and highlights discourses, processes and operations that are common to
the exercising of power and surveillance in contemporary societies – more specifically, those which are
included in the realm of control devices.

INTRODUCTION activity patterns exist in the monitored area, and


then have the system look for atypical patterns that
We care what you do, not who you are. We aim to may signal a person of interest—perhaps some-
analyze and model the behavior patterns of people one engaging in nefarious behavior or a person
and vehicles moving through the scene, rather than in need of help.” (James W. Davis, a computer-
attempting to determine the identity of people... science and engineering professor at Ohio State
We are trying to automatically learn what typical University, describing the construction project of
smart-camera networks that automatically track
DOI: 10.4018/978-1-4666-2038-4.ch007 suspect behavior, 2009:30)

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
A Brief Cartography of Smart Cameras

It is not uncommon for the more enlightened surveillance screen so that intervention can take
statements with regard to control mechanisms to place in time to impede the possible lethal leap
be stated by the people who design them, rather of a potential suicide victim. This same device
than by social critics. Part of what we will deal can also automatically pinpoint the following
with in this chapter has already been revealed, on the screen: an object that was left in the sta-
albeit in a condensed and controversial way, by tion, individuals or groups of people behaving
the words of the inventor of a camera-based se- suspiciously, bodies moving against the flow, or
curity system. This system is capable of tracking any situation previously categorized as needing
suspect or risky behavior by integrating multiple to be highlighted in the machine’s and/or camera
cameras which have high-resolution panoramic operators’ field of attention. In technical terms:
vision, geographic information systems (GIS),
and “machine learning methods. This enables the Intelligent visual surveillance systems deal with
computer to perform the kind of visual recognition the real-time monitoring of persistent and transient
that seems effortless for humans”1. According to objects within a specific environment. The primary
the researcher: aims of these systems are to provide an automatic
interpretation of scenes and to understand and
We envision our research being applied to com- predict the actions and interactions of the observed
mon urban surveillance tasks. To date, we have objects based on the information acquired by sen-
been focusing on the monitoring and analysis of sors (Velastin & Remagnino, 2006: 1).
pedestrian movement and activity...Other related
applications we are considering include persistent Computational vision, pattern analysis, ar-
tracking of vehicles through complex downtown tificial intelligence, and data management are
environments and analyzing the traffic flows to some of the subjects and techniques involved in
look for atypical patterns. (Davis, 2009: 30) the construction of such systems. They rely on
applications in the most varied sectors: safety,
Such a system is part of a new “generation” of administration, transportation, commerce, and
so-called “smart” video surveillance2 that heralds entertainment, among others. The tasks they can
ways of monitoring behavior in an automated perform, and which are always in real time, are
fashion. In most cases, the intention is for such also varied: detection, classification, and track-
cameras to recognize and differentiate regular ing of objects, people, behavior, movements, and
patterns (which are considered to be safe) of con- human activities; classification and interpreta-
duct and occupation of spaces from irregular ones tion of human behavior; detection of changes in
(which are categorized as suspect, dangerous, or the context or the pattern of how individuals or
simply non-functional). These “smart cameras” crowds occupy spaces; identification, etc. This
or “intelligent video surveillance” consist of soft- broad range of functions and applications is vis-
ware3 that can be added on to cameras in order to ible in products currently on the market of smart
filter or read images according to algorithms and video surveillance. ADVISOR (Annotated Digital
point out individuals, objects, and/or attitudes that Video for Surveillance and Optimised Retrieval),
should be the focus of attention of the “scene”. for example, which focuses on surveillance in the
This is carried out in accordance with pre-defined area of public transportation, includes automatic
applications in the system. For example, a body incident detection, content-based annotation of
stopped for a certain period of time very near video recordings, and individual and crowd
the safety line beside the tracks at a subway sta- behavior pattern analysis. ISCAPS (Integrated
tion should be automatically pointed out on the Surveillance of Crowded Areas for Public Secu-

67
A Brief Cartography of Smart Cameras

rity) develops biometric techniques for identifying operate by inspecting and monitoring what you
people in populous areas. BEHAVE, DETER, and do or what you are doing when in a determined
PRISMATICA focus on the analysis and detec- location, rather than simply where you are.
tion of suspect behavior in public places, while In this chapter, we will carry out a brief cartog-
CAVIAR (Context Aware Vision using Image- raphy of these socio-technical systems. Mimick-
based Active Recognition) is aimed at analyzing ing the devices in question, which visualize their
and recognizing objects, contexts, and situations “territories” by sweeping over surfaces – without
for the purposes of urban surveillance and to allow going into greater depth, we will map their most
for the planning of marketing strategies. prominent traits in three domains. Initially, we
What is at stake? Or better yet, what is it that will deal with the regime of visibility present in
these smart cameras bring up for discussion? Most these surveillance devices, as well as their policies
certainly, the question is not to simply describe and orderings of that which is visible, especially
this technological innovation and its effectiveness. with regard to perception and attention. We will
Rather, it is a question of analyzing these devices note that such a regime focuses on picking up ir-
as socio-technical systems (Latour, 1993 and 1999; regularities in the algorithmic visualization of the
Lianos, 2001) that modernize a number of aspects, movements of bodies and their general patterns of
discourses, and processes specific to surveillance conduct. Next, we will attempt to show that this
and the control of bodies and behaviors in the regime of visibility and surveillance goes hand in
produced and protected spaces of contemporary hand with a specific type of power and knowledge
societies. In the reference chapter of this section, over individuals’ bodies that generally acts at the
David Lyon (2010) discusses how our mundane, infra-individual, visible, and surface level of the
everyday mobility has become more and more action and the conduct in order to incite, guide, or
transparent and traceable by way of technologies contain these actions and conducts. Finally, we will
that are, themselves, mobile and locational (cell focus on the device’s temporality, demonstrating
phones, GPS, RFID labels). Knowing “‘where that there is surveillance in force in this device
you are’ matters increasingly from the point of that is intended to be proactive and which ideally
view of contemporary regimes of governance, coordinates the real time of the vision and the
whether of commerce, policing or government” action in the present with a memory or archive
(Lyon, 2010: 4-5). With one single movement, of indexes and patterns of conduct that project a
the potentialities of these technologies multiply future needing to be controlled.
for the surveillance - “as observing, monitoring, The perspective that guides the overview of
tracing” - of citizens, consumers, employees, etc. these three aspects of smart video surveillance
Such potentialities do not only represent threats systems identifies and highlights discourses,
to the privacy of individuals; they also articulate processes, and operations that are characteristic of
together with ways of governing that act in fluid the way power and surveillance are exercised in
and virtual spaces through “coded information contemporary societies – more specifically, those
to monitor, predict and direct the behaviour of which are included in the realm of control devices
individuals...” (Yar, 2003: 257 apud Lyon, 2010). (Deleuze, 1992; Lianos, 2001; Rose, 2000). As a
Intelligent video surveillance systems make up strategy of analysis, we will compare two models
these ways of governing. Within the realm of image of power in which surveillance and visibility play
analysis and perception and attention automation, a central role: discipline, as Foucault (1995) un-
these systems perform functions that are similar derstood and analyzed it; and control, for which
to the technologies of observation, monitoring, Deleuze (1992) traces more general lines and to
and tracing mentioned by Lyon. However, they which other analyses are added that have greater

68
A Brief Cartography of Smart Cameras

or lesser affinities (Rose, 2000; Lianos, 2001) VISIBILITY, PERCEPTION,


with this author. Such a comparison should not be AND ATTENTION
understood according to principles of exclusion
or historic overcoming. Although we recognize What does a smart camera see or perceive? Or
the strong implications of each of these models better yet, what regime of visibility is put to work
for their eras, there are currently zones of com- by this video surveillance that claims to be intel-
monality between them. Discipline is not lodged ligent? One of the most recurring arguments in
in a past that has been completely left behind, just research and industries that design such artifacts
as control is not the current, exclusive model of is that they surpass the perceptive and attentional
power. However, the zones of commonality do limitations present in monitoring carried out by
not minimize the strong transformational process conventional video surveillance systems. Human
taking place, which involves the emergence of attention is excessively limited when dealing
new surveillance processes. In the analysis of with the volume and monotony of surveillance
intelligent video surveillance, we will privilege images. After only 20 minutes, the attention of
the aspects that attest to its affinities with control most individuals falls to a level below that which
and contrasts with discipline. We will note that is acceptable for an efficient monitoring activity
elements which were already indispensable to (Hampapur et alli, 2003 apud Gouaillier & Fleu-
disciplinary power – such as the knowledge of rant, 2009). A guard is not capable of attentively
interiority and the processes of interiorization, observing from nine to twelve cameras for more
the detailed analysis of each individual, and the than fifteen minutes. Some studies show that the
careful production of their identities – certainly proportion of the number of guards in relation to
do not disappear. However, these elements are no the number of screens in some cases can actually
longer at the forefront of the operations of power vary from 1:4 to 1:78 (Hearing et alli, 2008 apud
and knowledge in most of these devices. Gouaillier & Fleurant, 2009).
It should be noted that the cartography traced Moreover, according to such research and
out here is as brief as it is incomplete. We not only industries, this is one of the main reasons for the
leave out a series of aspects, but also remain on the inefficiency of video surveillance in providing
surface of the processes analyzed. We do not go safety – cameras are more effective at producing
into the necessary depth that such topics deserve after-the-fact proof than at preventing incidents
in order to be understood in their complexity. or crimes. Therefore, adding an “intelligent” or
However, the device in question is well-suited for cognitive layer to the cameras has the intention
the type of analysis that is intended to be carried of automating perception and attention in a way
out, considering the fact that tasks which humans that points out only the indexes of threat or danger
are not able to fully accomplish are intended to or any other situation that is worth highlighting
be delegated to the device. It thus embodies part in the images, in accordance with the defined
of the ideals of surveillance and control that are purpose. The system should automatically dif-
present in the socio-technical systems of our time. ferentiate and recognize that which is meaningful
Moreover, these ideals are made especially explicit in a scene and that which is irrelevant, that which
because they deal with automated systems that is irregular and that which is regular. Herein lies
need to be programmed. Hence, our map serves its “intelligence”. In other words, it not only
as a first reading guide of these automated and deals with seeing or capturing, transmitting, and
“intelligent” surveillance devices. archiving images, but also with “interpreting”,
in accordance with pre-defined categories, that
which is “viewed” in a scene. The system thus

69
A Brief Cartography of Smart Cameras

incorporates cognitive functions – more specifi- tention, which is the case of video surveillance
cally perceptive and attentional functions – in an operators as well as contemporary societies in
automated and pre-programmed way. general, there is a need to filter out that which
Let us take DETER (Detection of Events for is relevant. In these systems, only that which is
Threat Evaluation and Recognition) as an example. irregular or unusual is worth seeing. The flowing
Its focus is to detect and report unusual patterns of regularity of bodies in movement remains in the
pedestrian and vehicle movement in external areas. attentional and perceptual background. However,
For this system to work, it must recognize and it should be added that irregularity is an index
differentiate the objects in the scene (pedestrians of threat or suspicion – and that is why it should
and vehicles), as well as the usual and unusual occupy the foreground. Moreover, the irregularity
patterns of movement of each object, in order to in question is no longer confused with the focal
recognize threats. According to a research report points of the disciplinary regimes of visibility,
(Morellas; Pavlidis; Tsiamyrtzis, 2003), DETER which read an abnormal soul through behavioral
has been successful in, for example, detecting the deviation. Here, behavioral deviation is an index
following patterns of movement that the system of imminent risk and undesired or threatening
considered to be potentially alarming in certain action. The psychological motives or traits that
external areas, such as parking lots: overspeeding underlie the action are of little importance.
vehicles, overrun of pedestrian pavement, running This task of catching irregularities or even
pedestrians, multi-vehicle attack (vehicles enter- foreseeing them is already present in conventional
ing simultaneously from different entries in the video surveillance and in many other forms of
parking lot and converging on a common parking human or mechanical inspection. In this case,
lot lane) and M pattern (this refers to pedestrians however, paying attention to that which is irregular
following a serpentine course. It is more often is already incorporated or programmed into the
than not a suspicious pattern associated with viewing device itself. Thus, the type of ordering
vehicle break-ins). Upon detecting such potential of that which is visible becomes more explicit
threats, the system sounds an alarm in the guard’s in this model of surveillance. In this ordering,
attentional field, thus allowing for intervention at there is a particular way of observing everyday
the moment of or before the predicted incident. activities. We know to what extent we can make
Through this delegation of perception and at- surveillance models correspond to regimes of
tention (Latour, 1994), not only are technical and visibility of everyday activities. We also know
administrative parameters of efficiency defined that disciplinary technologies make an entire field
and reiterated; regimes of visibility, surveillance, of everyday gestures and actions, which used to
and control of the space and of human behavior remain far from the investments of power, visible:
are as well. A regime of visibility does not consist activities; gestures; exercises; the following of
so much of what is seen. Rather, it consists of that rules, regulations, and schedules. The panoptic
which makes what is seen possible. Machines, scale of disciplinary surveillance is spoken of a
practices, discourses, and rules that are linked to great deal. A whole political anatomy of detail,
the formation of knowledge and games of power which extends to the lay world and the ordinary
(Foucault, 1995) participate in these conditions individual that which was already important to
of visibility (Deleuze, 1992). Intelligent video theology and asceticism, is relegated to a second
surveillance systems make evident that which level:
is currently defined as needing to be visible and
noteworthy in the field of surveillance. In the The mystique of the everyday is joined here with
context of an excess of images and a lack of at- the discipline of the minute... The meticulousness

70
A Brief Cartography of Smart Cameras

of the regulations, the fussiness of the inspections, term “silhouette” in order to designate the patterns
the supervision of the smallest fragment of life that correspond to certain body movements.
and of the body will soon provide, in the context This scrutinization of that which is visible is
of the school, the barracks, the hospital or the intimately linked to its normative effects. Such
workshop, a laicized content, an economic or cameras are not so much for introducing or estab-
technical rationality for this mystical calculus lishing normality in the heart of deviant popula-
of the infinitesimal and the infinite (Foucault, tions (as in the case of some panoptic institutions).
1995: 140). Rather, they are for capturing or catching a break
in the flowing order. It is worth pointing out that
The meticulous observation of detail makes a order and regularity are maintained. However, their
series of small things, which used to be ignored occurrence is not because of an internalization
by the great moral and judicial systems because of values that guide an identity or biographical
of their relative indifference, noticeable in the project, as was predicted by normalizing, disciplin-
sense of becoming meaningful and visible: de- ary jurisdictions and their regimes of visibility.
lays, absences, interruption of tasks, negligence, These cameras have a “formal,” “pragmatic,” and
rude gestures, immodest gestures, indecencies. “utilitarian” normative effect in which “seeming
This optic scale, which was at the same time normal” is more decisive than “being normal.”
both very detailed and all-encompassing, also Normality is, above all, a surface effect, an adher-
depended on the constitution of meticulous files ence to protocols of action and the use of managed
for each individual and the sharp observation of spaces, and a belonging to a particular “ethos”
exams, in which the overlapping of the relations – the supermarket or bank customer, the airline
of power and knowledge assumed “all its vis- passenger, the subway user, etc. (Cf. Lianos, op.
ible brilliance” (Idem, p.190). The daily exam cit., p.126). Deviance, as we have already stated,
rituals in disciplinary institutions correspond to is an index of threat or suspicion. However, the
the individual entering the field of human and focus is on the action more than on the agent.
“clinical” sciences, through which he or she “may What is being dealt with here is not seeing spe-
be described, judged, measured, compared with cifically or even seeing through. Rather, it has
others, in his very individuality” (Idem, p.191). to do with recognizing in an automated way and
The Intelligent video surveillance systems of seeing into the future (making behavioral patterns
today differ from this disciplinary optic scale. correspond to patterns of pixels in the image). It
They deal with an observation that tracks bodies is a view that has foresight and also allows for
and spaces with a sweeping motion. This obser- action in real time, thus frustrating or averting
vation is not really concerned with the minutiae that which was predicted. In the following topics
of everyday activities. It also does not concern we will explore the emphasis given to the surface
itself with an inventory and objectivization of of the bodies and their correlated categorization
the individual differences that allow one to see of monitored behaviors in greater detail, as well
an interiority – through gestures, activities, and as the temporality of this proactive surveillance.
performances – that is no less rich in details. These
systems deal with an algorithmic view that pays
less attention to details and depth than to superfi- MOBILE BODIES, ACTION,
cial patterns and movements of bodies, which are AND INFORMATION
understood more in their general contours than in
their individuality. Some software even uses the The body has always been of interest to power.
As a vast tradition of studies on the topic reveals,

71
A Brief Cartography of Smart Cameras

there are many ways for power to be interested aspects of the current forms of distance control
by bodies: the body of habits and customs (Elias, and surveillance (Lianos & Douglas, 2000). As
1969); of sexuality (Brown, 1988; Foucault, has been noted, bodies are inspected in their daily
1976); of life, health, and illness (Canguilhem, mobility and movement without interrupting the
2009; Foucault, 1963; Rose, 2006); of work; of flow of displacement in urban spaces, except in the
care with daily cleanliness and beauty (Vigarello, event of suspicion. Disciplinary devices needed to
1987 and 2004), among others. As was seen in the stop and confine the bodies or register them in a
previous topic, there are also many ways for power predefined system of activities in order to observe
to “gaze upon” the body, to make it available for them, know them, and normalize them. Video
visibility, and to make it an object of perception, surveillance systems, however, are interested in
attention, description, knowledge, classification, the flow of moving bodies within the course of
record (Crary, 1992 and 2001; Sekula, 1986). their lives and daily actions. In this sense, they
We have recently seen a recurring diagnosis are part of a set of contemporary technologies that
arise in post-industrial societies with regard to makes it possible to track the everyday mobility
the investments of power over the body. On the and displacement of individuals in physical space
one hand, we are living an intensification of the (Wood and Graham, 2006; Lyon, 2010) and in
most varied forms of biopower (Foucault, 2004; cyberspace (Andrejevic, 2004). As Lyon states
Agamben, 1998) – in other words, of the different in the reference chapter of this section, “tracing
entries of the body and of life into the dominion parolees by satellite would have sounded like
of the explicit calculations of power, from bio- science fiction, and posting invitations by text
technologies to biosocialities (Rabinow, 1992) and message to potential customers passing near to
to the penetration of power in multiple spheres Starbucks like a marketer’s fantasy. All these
of social, cognitive, affective, genetic life, etc. things now happen.” (Lyon, op.cit., p.2-3).
(Negri & Hardt, 2000; Lazzarato, 2004). At the Nevertheless, the monitoring of mobility is
same time, it has been certified that the body, in its not recent. The demarcation of borders and ter-
materiality and physical force, interests power less ritorial protections, the control of migrations,
and less. Power is more interested in the body’s and the circulation of people, goods, diseases,
dimensions that are informational, immaterial, etc., remind us of just how old the crossovers
cognitive, etc. (Lazzarato, 2004). Indeed, we see are between surveillance and mobility (Salter &
distinct forms of relationship between body and Zureik, 2005; Foucault, 2007). On the one hand,
power that coexist in contemporary societies. This surveillance practices and techniques have his-
can also be seen in surveillance technologies, torically acted as mobility containment. On the
which invest in biometric and genetic data, as well other hand, and especially since the constitution
as in various ways of monitoring bodies and their of modern cities and states, surveillance devices
activities from a distance – be it in physical space do not simply run counter to mobility. Rather,
or in informational space. Social and affective these devices accompany the different forms of
dimensions are not left out of these investments, mobility and order them, not only in the sense of
given that more and more of these surveillance containing or immobilizing them, but also in the
devices exist, not only in control and security sense of capitalizing on them and potentializing
circuits, but also in entertainment, sociability, them for their service. This is especially apparent
show circuits, and, above all, in communication in the urbanization processes that began in the
technologies and networks. 18th century, in which surveillance and security
In intelligent video surveillance, the monitor- devices (the constitution of police forces, health
ing of bodies in spaces reflects some important and population administration policies, etc.)

72
A Brief Cartography of Smart Cameras

sought not only to establish limits, borders, and 199). The active user and consumer are the privi-
localizations, but also to permit and guarantee the leged subjects of contemporary control. In such
circulation of people, goods, etc. (Foucault, 2007). a scenario, control is often carried out by being
This dual relationship between surveillance and mixed with the very contexts of consumption and
mobility has become more acute and more com- use of goods and services and with the very systems
plex in the contemporary world (Lyon, 2002 and and technologies of training and coordinating of
2009). As has already been seen, the observation individual choices, thus providing protocols for
of mobile bodies by smart cameras is not simply action and contexts for interaction. In a word,
to know where you are (Lyon, 2010), but rather technologies of empowerment of individual choice
to know what you are doing when in a determined and technologies of control blend more and more
place. As researcher James W. Davis pointed out into contemporary managed spaces (Rose, 2000;
at the beginning of this chapter, it is not always Lianos & Douglas, 2000).
“you” that is important. Although some systems In a parallel manner, a whole culture and rheto-
potentialize identification procedures, the first ric of risk and security (Garland, 2001) permeates
observational focus of many of them is not on the ordering of contemporary socio-technical
the identity or the identification of the individuals spaces, bodies, and environments. It is not enough
but on their actions and behavior. It remains to be to just incite the action. It is also necessary to
understood in what way this automated and “intel- foresee and avoid all behavior that represents risk
ligent” visual monitoring of bodies and behaviors or danger. This focus on the action and behavioral
is associated with the formation of knowledge and patterns in order to construct algorithms capable
power over individuals. of foreseeing events is recurrent in other control
Disciplinary systems certainly made use devices in the fields of health, crime, and labor.
of careful analyses of behaviors, actions, and Such devices act more on the illness than on those
performances. However, all of this was to be who are ill (Rose, 2006), on the crime rather than
accompanied by a technology of the soul (which the criminal (Garland, 2001), on the performance
was correlated to the political investments in the rather than the training (Ehrenberg, 1999).
body) where knowing and producing, examining Thus, the functions of management, control,
and normalizing, measuring and reforming were and security overlap more and more in contem-
sought in one single movement. This incorporeal porary socio-technical environments. From shop-
but very real subjectivity was, as we know, deci- ping centers to subways and electronic Internet
sive in circulating and articulating the effects of sites, comfortable and safe conditions for actions
a certain type of power and knowledge that made and choices are created, as are mechanisms for
up the disciplinary apparatus and its regime of predicting and preventing actions considered to
visibility. According to Lianos (2001), in a highly be suspicious or risky.
institutionalized and media-influenced world, All of these elements are in some way present
the mechanisms of control organize themselves in intelligent video surveillance systems. They
around the action itself, instead of focusing on also aim to be diverse in their application by
the supports of a socially-filtered conscience monitoring and intervening in behaviors and ways
that sustains the action. In a model of control and of occupying spaces with the most varied ends:
government that uses simulation and incitation consumption, commerce, management, security.
more than interdiction and coercion in order to Among their various applications are: counting
accomplish and obtain results, the investments and clients; analyzing consumption habits, patterns of
calculations of power grant privilege to autonomy, movement, and transportation; identifying intru-
initiative, surpassing, and continual choice (Rose, sions in controlled or prohibited zones or areas of

73
A Brief Cartography of Smart Cameras

risk; detecting abandoned objects and suspicious By tracking the users – their mood, pattern of
behavior; monitoring and analyzing populational work, focus of attention, interests, and so on
or crowd flows (crowd density estimation, face – these interfaces acquire information about
detection and recognition in crowds, and crowd the users, which they then use to automatically
behavior monitoring). The diverse nature of smart perform the tasks for them. The close connection
cameras is explicit in the advertising materials between surveillance/monitoring and assistance/
of certain manufacturers: “Whether the need is augmentation is one of the key characteristics
real-time security and surveillance or gathering of the high-tech society (Manovich, 2006: 222).
business intelligence, ObjectVideo is the key to
enabling truly intelligent solutions” (http://www. In this way, intelligent video surveillance pro-
objectvideo.com/solutions/). duces a typology or taxonomy of the daily move-
In the great majority of cases, what is being ment of bodies in spaces, codifying their patterns
dealt with is the monitoring of routine behaviors of conduct and allowing for the calculating and
in order to extract patterns with regard to the planning of actions on these bodies. It is part of
daily gestures of bodies located in their ordinary the capture model proposed by Agre (1994) and
environments so as to incite the bodies to act in revisited by Kitchin & Dodge (2006):
a certain way (consumption, movement, atten-
tion) or to recognize and impede irregular and/ The capture model acknowledges that the mecha-
or suspect patterns of conduct. With its differing nisms by which information is gathered is increas-
visual and algorithmic layers, a single system ingly an integral part of the same system that they
monitors, analyzes, categorizes, and intervenes seek to monitor and regulate (e.g., a computer
in the behaviors. This process of categorization operation system logs its own use by an individual)
of bodies, movements, and routine gestures is and that these mechanisms in turn re–define and
intimately linked to the ordering and codifying re–configure that system (e.g. change workplace
of urban spaces, which in turn are converted practices), quite often in real–time. Agre (1994)
into “automated socio-technical environments” argues that these modes of informational capture
(Lianos & Douglas, 2000). is possible because a grammar of action (for-
Smart cameras and their sensors possess a malized rules) has been imposed on a system. A
type of power and knowledge over bodies that grammar of action is a means of systematically
transforms their actions within a physical space representing aspects of the world, an organized
into information (Manovich, 2006). This informa- language for modeling human behaviors (Kitchin
tion permits the extraction of regular and irregular & Dodge, 2006: 6).
conduct patterns and thus makes interventions,
planning, and calculations on the ordering of But of what does this typology consist? In what
bodies and their flows possible. Intelligent video way is it similar to or different from the modern
surveillance, in a way, goes hand in hand with techniques of categorizing corporal data? If we
the so-called intelligent environments that add compare this typology to the known historical
informational layers to spaces. This therefore methods of classifying corporal characteristics
broadens the possibilities of interacting with that also used vision technologies, such as pho-
the bodies present and allows them to be called tography, we notice that modernity’s inventories
upon in a contextual manner by the surrounding of corporal patterns were focused on specific traits
environment itself. In the words of Manovich, and details. We need only remember the catalogs
of ear types designed by Bertillon (Sekula, 1986).
As in that time, we are now dealing with the exer-

74
A Brief Cartography of Smart Cameras

cise of political and scientific power over bodies the body-image is the index of something that lies
through a classification and recording system that beneath or that comes before; in the other, it is an
converts the image into convincing information index of an action that is yet to come.
and permits the analysis of indications and the The differences between these indexing mod-
recognition of corporal patterns, their norms, els and their respective temporalities are better
and their deviances. However, in the place of understood if we consider the proactive nature of
the rows of ears, little fingers, and forearms, intelligent video surveillance, as will be discussed
the taxonomies of intelligent video surveillance in the next topic.
depend on algorithms that “represent” patterns
of conduct and movement of bodies in a given
context. Instead of categorizing specific traits REAL TIME, PREDICTION,
and fragmented corporal details, the emphasis is AND PROACTIVITY
on contexts of the broadest gestures and action
of the bodies. These contexts are then converted The last trait in relief in our cartography is the
into information and detected by smart camera temporality of smart cameras. In the descriptions
sensors and algorithms. and advertisements of these devices, the main
Modern taxonomies were also focused on the emphasis is given to the “real time” of the moni-
erudite and meticulous eyes of specialists and toring, detecting, and intervening in the contexts
authorities who were to combine instruments and under surveillance. In other words, in addition to
knowledge, such as photography, physiognomic transmitting and observing the image in real time,
description, anthropometry, and statistics. The which is already present in video surveillance
categorizations of smart cameras, in their turn, are controlled by human operators, the intelligent
to be recognized in an automated fashion by the systems operate an automated and “instant”
machine itself, which depends on computational analysis of the scene under observation. This
vision, pattern analysis, artificial intelligence, and allows for any intervention to occur at the very
data management. Another significant difference moment in which the incident is taking place. In
is the main purpose of each of the systems. The theory, possible delays or lapses are eliminated,
visual diagnosis of a criminal personality hidden be they because of human perception or attention
within bodily details and measurements (present, or because of action at the opportune moment. It
above all, in the composite portraits of Galton) should be noted that real time here is not that of the
and the identification and recognition of specific observation, detection, or action taken as isolated
criminals (the first objective of Bertillon’s records) elements. Rather, it is that of the almost immediate
in the heart of an anonymous mass that suddenly or instantaneous connection between these three
appeared in the confused territory of cities oc- normally diachronic, separate “moments,” with
cupied the forefront of concerns and purposes of the interval between them being reduced to the
the modern typology of bodies, their measure- extreme. The image, which already has a real time
ments, and images. In the typologies with which of observation, now has a real time of reaction as
we are dealing here, as has already been seen, we well. In order to understand the reach or extent
are no longer dealing with diagnosing a criminal of this continuously alert and active present, it is
personality through bodily images and details. necessary to understand the past or the memory,
Rather, we are concerned with foreseeing – on which serves as a parameter, as well as the future,
the surface of the image and in the movements of which is projected in the present.
the bodies in a given context – a possible crime The memory of the device is not simply, as
or incident that is about to happen. In one case, in “conventional” video surveillance, its ability

75
A Brief Cartography of Smart Cameras

to register. Smart cameras are equipped with an lance. It does not have to do with the index of the
algorithmic memory, which consists of a repertoire past, nor is it solely the index of the present and
of patterns that the camera should recognize as the force of evidence that is characteristic of the
regular – and thus remain indifferent – or as ir- real time of the transmission and observation of
regular – and therefore intervene. In this way, the the image. It also mainly deals with the index of
memory serves as a parameter or a framework for an occurrence which is yet to come. An extreme
the analysis of the scene being monitored. If we example of this proactivity is provided by the
consider the device’s whole sequence of elabora- FAST (Future Attributable Screening Technol-
tion, this “intelligent” memory, which serves as a ogy4) software, which uses sensors and starts at a
guide for the camera’s “glance” and its perceptive distance of around two meters in order to capture
and attentional filters, is constituted through the a series of body signals of individuals, such as
observation and archiving of an immense mass of skin temperature, blood-flow patterns, perspira-
images submitted for analysis in order to extract tion, and heart and breathing rates, which serve
patterns. These archives can be elaborated in or- as indications of a hostile act yet to come.
der to build an intelligent system, or they can be This index and this “yet to come” have a curi-
put together by reusing and analyzing previous ous, paradoxical, disturbing meaning, however.
records, which in large part do not have any use The index of the past has the facticity and relative
or end. The gigantic and humanly unsearchable stability of “something happened” (Barthes, 2006),
archives of video surveillance images become and the index of the present has the supposed live
a useful memory, according to the creators of referential that “something is” or “something is
intelligent video surveillance. As researchers happening.” The index of the future, however, is
and manufacturers Frost & Sullivan state, video uncertain by nature. In this case, it is also “unveri-
analytics “is the next step to video surveillance fiable”, given that the proactive aspect should im-
where the intelligence of the system is utilized pede or avert that which the body-image indicates
to enable the end user to identify key trends and is about to happen. The uncertainty and the many
significant changes in recorded images. The end possible ramifications of what could happen in the
user will be alerted when there is an unusual event. course of time are therefore reduced. However, this
Video analytics is the key to help end users with reduction does not take place because of accuracy
the abundance of recorded information.” (http:// in the prediction. Rather, it occurs through the
www.frost.com/prod/servlet/svcg.pag/AEID) effectiveness or performance of the anticipation.
The greatest promise of this memory or of We will never know if the body that stops for too
these intelligent archives, as can already be noted, long at the edge of the subway tracks would have
is not to save and recover the past. Rather, it is jumped to its death or not because the future is
to anticipate and avoid undesirable future events, anticipated. In this case, the future is anticipated
above all when surveillance is for the purpose of in order to be averted. Even though it is uncertain
security. We are talking about a record of patterns by nature, it becomes certain at the moment that
that is to serve as an index of the future. Interven- it is foreseen, and yet it cannot be carried out.
tion in real time is, in this case, an action in the The same is true for the repertoire of suspicious
present in order to avoid the occurrence of another behaviors that serves as an index of crimes or
action in the future. It is for this reason that we infractions that are yet to occur or that perhaps
suggest discussing proactive video surveillance will not be carried out. They become suspect at
here, which should come before the foreseen action the moment they are predicted, thus transforming
and, in this case, avert it. We now return to the the body-images into potential criminals.
type of indexing present in this model of surveil-

76
A Brief Cartography of Smart Cameras

The future projects itself onto the present and Barthes, R. (2006). La Chambre Claire: Note Sur
thus limits the stock of possibilities that belong LA Photographie. Paris: Gallimard.
to the present. When smart cameras are focused
Brown, P. (1988). The Body and Society. New
on optimizing the use of space, consumption, or
York: Columbia University Press.
the efficiency of services, this colonization of the
present by anticipating the future is less ostensible. Canguilhem, G. (2009). Le normal et le
Instead of averting an undesired action, there is pathologique. Paris: PUF.
an attempt to incite desirable behavior in specific
Crary, J. (1992). Techniques of the observer.
contexts. At any rate, and above all in the case of
Cambridge, MA: MIT Press.
proactive surveillance with the purpose of security,
the “birth of the management of probability as Crary, J. (2001). Suspensions of Perception.
a possible reality as valid as the present reality Cambridge, MA: MIT Press.
places the problematic of power in a qualitatively
Davis, J. W. (2009). Research Report Article: Eye
different context; it frees power from the limit
in the Sky. OSU. The Alumni Magazine, (May/
of a reference to the occurrence and allows it to
June): 30.
make that which has yet to exist (and which will
perhaps never exist) into a ‘happening of power’”5 Davis, J. W. (n.d.). James W. Davis Home Page,
(Lianos, 2001: 135). from http://www.cse.ohio-state.edu/~jwdavis/
Smart cameras, through the combining of real
Deleuze, G. (1992). Postscript on the Societies
time, foresight, and proactivity, produce a type of
of Control. October, 59, Winter, 3-7.
power over bodies, images, and times that is guided
by an indexing of suspicion and the proactivity Ehrenberg, A. (1999). Le culte de la performance.
of surveillance. They therefore come to augment Paris: Hachette.
the already abundant assortment of predictive
and proactive surveillance devices, such as data Elias, N. (1969). The Civilizing Process: Vol. I.
monitoring and profiling, in the most diverse fields: The History of Manners. Oxford: Blackwell.
consumption, health, security, efficiency, etc. Video Eye in the Sky. (2009). Research Report Article,
surveillance surpasses the dominion of the proof of May/June, 30. Retrieved from www.ohiostatea-
that which happened and promises to proactively lumni.org
intervene in that which has not yet occurred.
Foucault, M. (1963). Naissance de la clinique.
Paris: PUF.
REFERENCES Foucault, M. (1976). Histoire de la sexualité I - La
Volonté de savoir. Paris: Gallimard.
Agamben, G. (1998). Homo Sacer: Sovereign
Power and Bare Life. Stanford, CA: Stanford Foucault, M. (1995). Discipline & Punish: The
University Press. Birth of the Prison. New York: Vintage Books.

Agre, P. E. (1994). Surveillance and capture: two Foucault, M. (2004). Naissance de la biopolitique.
models of privacy. The Information Society, 10(2), Paris: Gallimard/Seuil.
101–127. doi:10.1080/01972243.1994.9960162 Foucault, M. (2007). Security, Territory, Popula-
Andrejevic, M. (2004). Monitored mobility in the tion: Lectures at the Collège de France 1977-78.
era of mass customization. Space and Culture, London: Palgrave Macmillan. Garland, D. (2001).
6(2), 132–150. doi:10.1177/1206331203251256 The Culture of Control. Chicago: University of
Chicago Press.

77
A Brief Cartography of Smart Cameras

Gouaillier, V., & Fleurant, A.-E. (2009). La vi- Lyon, D. (2002). Surveillance Studies: under-
déosurveillance intelligente: promesses et défis: standing visibility, mobility and the phenetic fix.
rapport de veille technologique et commerciale. Surveillance & Society, 1(1), 1–7.
Montréal: Centre de recherche informatique de
Lyon, D. (2010). Why where you are matters:
Montreal (CRIM)/ Technopôle Défense et Sé-
mundane mobilities, transparent technologies,
curité, 2009.
and digital discrimination. In Firmino, R., Duarte,
Hampapur, A., Brown, L., Connell, J., Pankanti, S., F., & Ultramari, C. (Eds.), ICTs for Mobile and
Senior, A., & Tian, Y. (2003). Smart Surveillance: Ubiquitous Urban Infrastructures: Surveillance,
Applications, Technologies and Implications. Locative Media and Global Networks. Hershey,
In Joint Conference of the Fourth International PA: IGI Global.
Conference on Information, Communications &
Manovich, L. (2006). The poetics of augmented
Signal Processing and Fourth Pacific-Rim Con-
space. Visual Communication, 5(2), 219–240.
ference on Multimedia, 2 (pp. 1133- 1138). New
doi:10.1177/1470357206065527
York: December.
Morellas, V., Pavlidis, I., & Tsiamyrtzis, P. (2003).
Hardt, M., & Negri, A. (2000). Empire. Cambridge,
DETER: Detection of events for threat evaluation
MA: Harvard University Press.
and recognition. Machine Vision and Applications,
Hearing, N., Venetianer, P. L., & Lipton, A. (2008). 15(1), 29–45. doi:10.1007/s00138-003-0121-6
The evolution of video surveillance: an over-
Rabinow, P. (1992). Artificiality and enlighten-
view. Machine Vision and Applications, 19(5-6),
ment: from sociobiology to biosociality. In Crary,
279–290. doi:10.1007/s00138-008-0152-0
J., & Kwinter, S. (Eds.), Incorporations (pp.
Kitchin, R., & Dodge, M. (2006). Software and 234–252). New York: Zone Books.
the mundane management of air travel. First
Rose, N. (2000). Power of Freedom: Reframing
Monday, 11(Nr. SI-7).
Political Thought. Cambridge, UK: Cambridge
Latour, B. (1993). We Have Never Been Modern. University Press.
Cambridge, MA: Harvard University Press.
Rose, N. (2006). Politics of life itself: biomedicine,
Latour, B. (1994). On technical mediation: power and subjectivity in the twenty-first century.
philosophy, sociology, genealogy. Common Princeton, NJ: Princeton University Press.
Knowledge, 2(3).
Sekula, A. (1986).The Body and the Archive.
Latour, B. (1999). On recalling ANT. In Law, J., October, 39, 3-64.
& Hassard, J. (Eds.), Actor Network Theory and
Velastin, S. A., & Remagnino, P. (Eds.). (2006).
After. Oxford: Blackwell.
Intelligent distributed video surveillance systems.
Lazzarato, M. (2004). Les revolutions du capital- London: Institution of Engineering and Technol-
isme. Paris: Les Empêcheurs de Penser en Rond. ogy.
Lianos, M. (2001). Le nouveau contrôle social. Vigarello, G. (1987). Le Propre et le Sale.
Paris: L’Harmattan. L’hygiène du corps depuis le Moyen Age. Paris:
Seuil.
Lianos, M., & Douglas, M. (2000). Dangerization
and the end of deviance. The British Journal of Vigarello, G. (2004). Histoire de la beauté, le
Criminology, 40, 261–278. corps et l’art d’embellir de la renaissance à nos
jours. Paris: Seuil.

78
A Brief Cartography of Smart Cameras

Wood, D., & Graham, S. (2006). Permeable bound- Dodge, M., & Kitchin, R. (2005). Codes of life:
aries in the software-sorted society: surveillance Identification codes and the machine–readable
and the differentiation of mobility. In Sheller, M., world. Environment and Planning. D, Society &
& Urry, J. (Eds.), Mobile Technologies of the City Space, 23(6), 851–881. doi:10.1068/d378t
(pp. 177–191). London: Routledge.
Dodge, M., & Kitchin, R. (2007). Outlines of a
world coming into existence: pervasive comput-
ing and the ethics of forgetting. Environment and
ADDITIONAL READING
Planning. B, Planning & Design, 34(3), 431–445.
Agre, P. E. (2001). Changing Places: Contexts doi:10.1068/b32041t
of Awareness in Computing. Human-Com- Dodge, M., & Kitchin, R. (2009). Software, ob-
puter Interaction, 16, 177–192. doi:10.1207/ jects and home space. Environment & Planning
S15327051HCI16234_04 A, 41(6), 1344–1365. doi:10.1068/a4138
Andrejevic, M. (2003). Tracing space: moni- Galloway, A. (2004). Intimations of ev-
tored mobility in the era of mass customiza- eryday life: Ubiquitous computing and the
tion. Space and Culture, 6(2), 132–150. city. Cultural Studies, 18(2), 384–408.
doi:10.1177/1206331203251256 doi:10.1080/0950238042000201572
Andrejevic, M. (2005). Nothing comes between Graham, S. (2005). Software-sorted geographies.
me and my CPU: smart clothes and “ubiquitous” Progress in Human Geography, 29(5), 562–580.
computing. Theory, Culture & Society, 22(3), doi:10.1191/0309132505ph568oa
101–119. doi:10.1177/0263276405053723
Greenfield, A. (2006). Everyware: The Dawning
Beer, D. (2007). Thoughtful territories: imagining Age of Ubiquitous Computing. Berkeley, CA:
the thinking power of things and spaces. City, 11(2), Peachpit Press.
229–238. doi:10.1080/13604810701395845
Law, J., & Hassard, J. (Eds.). (1999). Actor-
Bogard, W. (1996). The Simulation of Surveil- Network Theory and After. Oxford: Blackwell.
lance: Hypercontrol in Telematic Societies. New
York, NY: Cambridge University Press. Lessig, L. (1999). Code. New York: Basic Books.

Cole, S. A. (2001). Suspect Identities: A History Lovink, G., & Gerritzen, M. (Eds.). (2002). Mobile
of Fingerprinting and Criminal Identification. Minded. Corte Madera, CA: Gingko Press.
Cambridge: Harvard University Press. Lyon, D. (Ed.). (2003). Surveillance as social
Crang, M., & Graham, S. (2007). Sentient cities sorting: Privacy, risk, and digital discrimination.
ambient intelligence and the politics of urban space. London: Routledge.
Information Communication and Society, 10(6), Lyon, D. (2003). Surveillance After September
789–817. doi:10.1080/13691180701750991 11. Cambridge, UK: Polity Press.
Dodge, M., & Kitchin, R. (2005). Code and the McCue, C. (2005). Datamining and predictive
transduction of space. Annals of the Association analytics: battlespace awareness for the war on ter-
of American Geographers. Association of Ameri- ror. Defense Intelligence Journal, 13(1/2), 47–63.
can Geographers, 95(1), 162–180. doi:10.1111/
j.1467-8306.2005.00454.x

79
A Brief Cartography of Smart Cameras

McCullough, M. (2004). Digital Ground: Archi- objects based on the information acquired by sen-
tecture, Pervasive Computing and Environmental sors (Reference: Velastin & Remagnino, 2006: 1).
Knowing. Boston, MA: MIT Press. Proactive Surveillance: Procedures and/or
technologies that seek to anticipate danger or risk
Mitchell, W. (2004). ME++: The Cyborg Self and
in order to act preventively to avoid the events
the Networked City. Cambridge, MA: MIT Press.
anticipated to take effect.
Monahan, T. (forthcoming). Dreams of control at a
distance: Gender, surveillance, and social control.
Cultural Studies - Critical Methodologies, 8(4). ENDNOTES
Shields, R. (2004). Visualicity. Visual Culture in 1
For a technical description of the project, Cf.
Britain, 5(1), 23–36. Karthik Sankaranarayanan; James W. Davis
Virilio, P. (1994). The Vision Machine. Bloom- A Fast Linear Registration Framework for
ington: Indiana University Press. Multi-Camera GIS Coordination. IEEE In-
ternational Conference on Advanced Video
Wright, D., Gutwirth, S., Friedewald, M., Vildjiou- and Signal Based Surveillance, September
naite, E., & Punie, Y. (Eds.). (2008). Safeguards in 2008.
a World of Ambient Intelligence. London: Springer. 2
Video surveillance systems are usually di-
vided into three categories or generations:
operator-controlled video surveillance, basic
KEY TERMS AND DEFINITIONS automated video surveillance, and smart
video surveillance.
Control: In this paper, the term control is 3
Such software, usually called video analytics,
used in the sense proposed by Gilles Deleuze: a
seeks to automatically analyze and identify
system of power characteristic of contemporary
specific objects, behavior or attitudes in the
societies, which differs from the system of disci-
video sequences. The software transforms
plinary power, as conceived by Michel Foucault
the video into data that will be transmitted
in modern societies. In addition to Gilles Deleuze,
or archived to allow the video surveillance
this text uses the contributions of Michalis Lianos
system to act, be it by activating a mobile
on social control in Automated Socio-Technical
camera in order to obtain more precise data
Environments.
of the scene or by simply sending an alert
Human Behaviour Monitoring: Regular,
signal to the system’s operators.
systematic and focused monitoring of human 4
Software that is part of the Project Hostile
behavior in order to control, influence or prevent
Intent, developed by The Human Factors
specific actions and/or events.
Division of America’s Department of Home-
Intelligent Video Surveillance: Intelligent
land Security (DHS). Cited in the article
visual surveillance systems are defined as real-
“If looks could kill”, The Economist print
time monitoring of persistent and transient objects
edition, Oct 23rd 2008.
within a specific environment. The main objec- 5
In the original, in French, “événement de
tives of these systems are to provide an automatic
pouvoir” (Lianos, 2001, p. 135).
interpretation of scenes and to understand and
predict the actions and interactions of the observed

This work was previously published in ICTs for Mobile and Ubiquitous Urban Infrastructures: Surveillance, Locative Media
and Global Networks, edited by Rodrigo J. Firmino, Fabio Duarte and Clovis Ultramari, pp. 257-271, copyright 2011 by
Information Science Reference (an imprint of IGI Global).

80
81

Chapter 8
GIS, Grid Computing
and RFID in Healthcare
Information Supply Chain:
A Case for Infectious Disaster Management

Yenming J. Chen
National Kaohsiung First University of Science and Technology, Taiwan

EXECUTIVE SUMMARY
Several healthcare disasters have occurred in the past decade, and their occurrence has become more
frequent recently due to one natural catastrophe after another. The medical application requirement for
such a disaster management system includes effective, reliable, and coordinated responses to disease
and injury, accurate surveillance of area hospitals, and efficient management of clinical and research
information. Based on the application requirements, this case study describes a grid-based system in a
health information supply chain that monitors and detects national infectious events using geographi-
cal information system (GIS), radio-frequency identification (RFID), and grid computing technology.
This system is fault-tolerant, highly secure, flexible, and extensible, thus making it capable of operation
in case of a national catastrophe. It has a low cost of deployment and is designed for large-scale and
quick responses. Owing to the grid-based nature of the network, no central server or data centre needs
to be built. To reinforce the responsiveness of the national health information supply chain, this case
study proposes a practical, tracking-based, spatially-aware, steady, and flexible architecture, based on
GIS and RFID, for developing successful infectious disaster management plans to tackle technical is-
sues. The architecture achieves a common understanding of spatial data and processes. Therefore, the
system can efficiently and effectively share, compare, and federate—yet integrate—most local health
information providers and results in more informed planning and better outcome.

DOI: 10.4018/978-1-4666-2038-4.ch008

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
GIS, Grid Computing and RFID in Healthcare Information Supply Chain

ORGANIZATION BACKGROUND sector many a time and proven itself in resolving


complex, systemic issues (Yousefi & Lauridsen,
The outbreak of severe acute respiratory syndrome 1998; Flessa, 1999; Ritchie-Dunham & Galv´an,
(SARS) in Southeast Asia in 2003 had a serious 1999; Lane, Monefeldt & Rosenhead, 2000).
impact and proved disastrous for the entire society, This case study tries to reinforce the respon-
healthcare practitioners, healthcare institutions siveness of the national health information supply
and public works personnel worldwide (Esswein chain with a practical, tracking-based, spatially-
et al., 2004). In recent years, the numerous natural aware, steady-to-use, and flexible architecture
disasters have always entailed a massive preva- for developing successful infectious disaster
lence of contagious diseases. For example, after a management plan and implementing GIS and
devastating earthquake in Haiti in 2010, there was RFID to tackle technical issues. The proposed
reported an outbreak of a cholera-like disease that architecture will achieve a common understanding
had killed hundreds of people. In order to better of spatial data and processes, which will enable
manage such disasters in the future, much atten- this system to efficiently and effectively share,
tion was given to interoperability of a nationwide compare, and federate yet integrate most of the
health information network (Chau & Yip, 2003). local health information providers and results for
Inherent in the massive collection of data, there is more informed planning and better outcomes.
expected to be trade-offs between quick response
and accuracy that arises with mass surveillance
systems. The application requirements for such SETTING THE STAGE
a system include an effective and coordinated
response to disease and injury, accurate surveil- Early intervention is a key to stop a massive out-
lance of area hospitals, and efficient management break of diseases and new technologies can help
of clinical and research information. in this. A mass epidemic outbreak caused by a
A nationwide health information network or natural disaster drew much attention recently. In
supply chain can be extremely complex, since response to such events, bodies related to biomedi-
it needs to integrate geographically-distributed cal, public health, defense, as also intelligence
healthcare providers and other units with distinct communities, are developing new approaches
functions and mutual dependencies. In this case for real-time disease surveillance in an effort to
study there has been discussed an enhanced health augment existing public health surveillance sys-
information network that dealt with natural disaster tems. The term ‘syndromic surveillance’ refers
caused by mass epidemic outbreak. to methods relying on detection of clinical case
Due to the dynamic nature of propagation of features that are discernible before confirmed
epidemics, many researches investigate, follow- diagnoses are made (Forslundet al., 2004). In
ing various dynamic approaches, the causes and particular, even before a laboratory confirms an
consequent behaviour patterns in the outbreak of infectious disease, sick people may show certain
infectious diseases (Forys, 2002; Scheffer et al., behavioural patterns, symptoms or signs, or there
2001). System dynamics modelling allows the inte- may be certain laboratory findings, that can be
gration of multiple political, environmental, social tracked through a variety of data sources. New
and structural variables into a single model. It also information infrastructure and methods to support
analyses the behaviour of all the variables in the timely detection and monitoring, including the
system, allowing policies to be tested repeatedly discipline of syndromic surveillance, are evolving
(Forrester, 1961). The system dynamics model- rapidly (Homer et al., 2004; Hoard et al., 2005).
ling methodology has been applied to the health

82
GIS, Grid Computing and RFID in Healthcare Information Supply Chain

CASE DESCRIPTION individual victims). In addition, RF-tagged victims


or resources may be located via GIS to produce a
Technology Concerns real-time map of the entire population (Yu & Lim,
2004). RFID technologies are found to be more
The applications of spatial data analysis and min- reliable and durable than bar codes in different
ing are widespread. But in a syndromic surveil- application and functionality (Li et al., 2004).
lance system, geographical data have not been Grid technologies have been widely adopted in
associated with finding local clusters of diseases. scientific and technical computing (Foster, 2004).
Environmental health studies often rely on GIS Grid technologies and infrastructures support the
software to map areas of potential exposure and sharing and coordinated use of diverse resources
locate where people live in relation to these areas. in dynamic, distributed virtual organizations
An overview of geographical health issues has (Chervensak, 2001). The creation of virtual com-
been given by Boulos (2004, 2005). Information puting systems, from geographically distributed
on long-term entomological and epidemiological components operated by distinct organizations
surveillance conducted in an infected area should with differing policies, is sufficiently integrated
be manipulated and presented using spatial analy- to a health information supply chain. A grid may
sis tools available in GIS. This is important to be explained as a set of heterogeneous systems
predict impending epidemics in order to use limited federated over a wide-area network (Foster et al.,
resources in a cost-effective and efficient manner 2001). In contrast to the general Internet, such
and to control any epidemic of infectious diseases. systems are usually interconnected using special
high-speed, wide-area networks in order to get the
Technology Components bandwidth required for their applications.
While the notion of grid computing is simple
A geographical information system (GIS) can enough, the practical realization of grids poses a
be applied to develop maps, presenting spatial number of challenges. Key issues that need to be
distribution of infected areas, infected popula- dealt with are security, heterogeneity, reliability,
tions and infected factors which are important for application composition, scheduling, and resource
spatial and temporal dynamics of transmission of management (Buyya, 2002). We try to elaborate
infectious diseases (Ramljaket al., 2003). A few cross-platform federation without sacrificing
studies (Reiskind et al., 2001) discussed the use flexibility, scalability, reliability and extensibility.
of GIS and spatial analysis for spread of dengue
around the world. Management and
Radio frequency (RF) tags also may be attached Organizational Concerns
to responding personnel, vehicles, equipment or
supplies. An RFID system represents a major There are several concerns involved in the man-
advantage over handwritten or bar-code-based agement of infectious disease control. A suitable
identification systems, because it supports the infectious disaster control and management sys-
simultaneous collection of information from tem has to be versatile. Disasters are commonly
thousands of RF-tagged people or objects and defined as emergencies of severity and magnitude
does not depend on line-of-sight contact between resulting in deaths, injuries, illness, and/or prop-
the receiver and the tag. Moreover, some of erty damage that cannot be effectively managed
these systems enable data stored on the RF tags by the application of routine procedures or re-
to be updated or expanded (e.g., permitting the sources (Landesman,2001). A properly-designed
revision of triage scores on RF tags attached to emergency management system involves a set of

83
GIS, Grid Computing and RFID in Healthcare Information Supply Chain

arrangements, procedures, resources, personnel nity, with the aim of early detection and treatment.
and relations that can reduce the impact of hazards, An effective syndromic surveillance system can
emergencies and disasters (Abrahams, 2001). provide timely information to detect outbreak and
Natural disasters such as floods, earthquakes adverse public health events by aiding in earlier
and hurricanes pose a greater risk to populations intervention (Luckett & Grossenbacher, 2003).
than other disaster events (Landesman, 2001). The requirement for availability and effec-
Since each disaster has different characteristics, tiveness of syndromic surveillance data locally,
it is important for emergency planners to take an regionally, and nationally argues for a common
adjusted approach to disaster preparedness. An infrastructure that would enable use at all levels
enhanced health information network dealing of decision making. This argues very strongly for
with natural disasters caused by mass epidemic a common communication vehicle for syndromic
outbreak is discussed in this case study. The Severe surveillance. The resulting network of surveillance
Acute Respiratory Syndrome (SARS) epidemic, systems needs to be able to be queried for infor-
from November 2002 to June 2003, came with mation about what they are seeing both locally
much public attention and left swiftly, resulting and regionally, and also beyond. This requires a
in more than 8,000 probable cases worldwide and significant level of interoperability and data inte-
774 deaths (WHO, 2003b). gration, including a common functional behaviour.
The World Health Organization (WHO) praised The emergence of electronic healthcare data
the role of GPHIN (Global Public Health Intel- systems has resulted in the ability to monitor
ligence Network) for early detection of SARS. health data almost in real time, presenting a new
GPHIN provided some of the earliest alerts to opportunity to monitor the health status of the
the November outbreak in China (WHO, 2003a). community. Implementing syndromic surveillance
A part of WHO’s Global Outbreak Alert and based on automated acquisition of clinical data
Response Network, it was developed and is oper- requires both the development of secure, reliable
ated by Health Canada’s Centre for Emergency information systems and the use of those systems
Preparedness and Response. It is essentially an in public health practice. The rapid collection
Internet crawler specializing in detecting news and dissemination of information, the sharing of
articles indicating unusual events relevant to that information and a comprehensive reporting
public health. The network continually scans mechanism could be thought of as permitting
more than 400 international sources for news of earlier intervention in outbreaks. Being more sen-
any outbreaks of 31 communicable diseases, as sitive to underlying changes in the data, outbreak
well as articles about natural disasters and drug- signals could be detected sooner, thus mitigating
resistant pathogens, rather than relying on official the effects of large scale disease outbreak.
reports from government sources. As highly contagious diseases pose many
challenges, having efficient monitoring devices
for patient-tracking is essential to the success of
CURRENT CHALLENGES early intervention in epidemic surveillance. Wire-
FACING THE ORGANIZATION less communication may support the widespread
and instantaneous tracking of victims or resources
Rapid detection of outbreaks caused by natural through barcode or RFID systems. It may eliminate
disasters requires immediate response to minimize manual data entry and introduce the potential for
health issues and other social impacts. Public automated processes to increase productivity even
health departments routinely conduct surveillance under difficult working environments.
of naturally occurring diseases within the commu-

84
GIS, Grid Computing and RFID in Healthcare Information Supply Chain

SOLUTIONS AND authentic, physician-generated information that is


RECOMMENDATIONS physically located in different clinical information
systems (Brailsford et al., 2004). Grid computing
An integrated information infrastructure with a makes extreme demands on distributed program-
nationwide health information network would ming because they are typically large-scale, and
provide the means to access all available clinical they exploit wide-ranging networks consisting
information, at the corporate, regional, national or of a variety of protocols and systems that may
even international level, and to meet challenges span organizational boundaries. A description
posed by patient mobility and the fact that an indi- of one kind of health information supply chain
vidual’s health data may reside at many geographi- architecture based on grid technologies is follows.
cally dispersed information systems (Forslund & The national health supply chain architecture
Kilman, 2000). In addition to providing spatially is shown in Figure 1. There are four federated
distributed information, the system may be a valu- services in this architecture. Each federated service
able tool in basic and clinical research, medical acts as a gatekeeper to actual information provid-
decision making, epidemiology, evidence-based ers. Federated syndromic surveillance service pro-
medicine, and in formulating public health policy. vides outbreak detection and mitigation services.
Such a system is fundamentally about bringing Federated GIS atlas service provides map services.
timely health-related information to, and aiding Federated health record service provides medical
communication among, those making health deci- records. Federated patient tracking repository
sions for themselves, their families, their patients service act as a huge data warehouse containing
and their communities (White, 1994; Tsiknakis, massive data of RFID recordings.
Katehakis & Orphanoudakis, 2002). The grid client shown in Figure 1 need not be
Very large-scale information systems are an end-user system, but could be an agent acting
typically deployed as centralized or distributed on its behalf, and there may be many such clients
architectures. In a centralized architecture, a single or agents, acting independently with no central
server is responsible for processing all user requests. control over the components shown. To make use
Centralized architectures simplify administration of resources, a client first uses sources of informa-
and coordination of a service, and they often require tion located within service brokers to discover those
fewer resources. However, centralized architectures resources needed for execution of a task. Multiple
also have several disadvantages. A centralized sources may need to be consulted to locate all the
server represents a bottleneck in processing that resources needed for a computation. Assuming the
can delay response time. It may represent a single discovery and allocation steps were successful, the
point-of-failure, and for adversaries, a single client then sends the input data and executables,
point-of-attack, without which the entire service receiving a reference to the execution in return.
fails. Finally, even if the server is resourceful and These actions may be accomplished in several
available, the routing function provided by the stages or as one consolidated action, depending on
communications layer may fail to provide a path the nature and complexity of the task. As resourc-
from some clients to the server. es are allocated, the resource manager may need
Grid federation is gaining more and more to update the information in the registry to enable
attention during the past few years in a variety reasonable bids for resource allocation from other
of R&and (research and development) efforts. clients. Lastly, the client monitors execution of the
Any federated approach towards the creation of a task using the reference it previously received. It
health information network environment should be can retrieve the results, or be sent status of the task
capable of providing uniform ways for accessing as it progresses.

85
GIS, Grid Computing and RFID in Healthcare Information Supply Chain

Figure 1. National health information supply chain architecture

A federated service provides access to static and but it itself does not receive registration requests.
dynamic information of resources. There are three The local information maintained by a GRIS is
major components within this federated service: updated when requested, and cached for a period
Grid Resource Information Service (GRIS), Grid of time known as the time-to-live. If no request
Index Information Service (GIIS) and Lightweight for the information is received by it, the informa-
Directory Access Protocol (LDAP) Provider. tion would time out and be deleted. If a later request
Figure 2 represents the conceptual view intercon- for the information is received, the system would
nection of the federated service. As illustrated, the call the relevant information providers to retrieve
resource information is obtained by the informa- the latest information.
tion provider and is passed on to a GRIS. The A GIIS is a repository that contains indexes
GRIS registers its local information with a GIIS, of resource information registered by a GRIS and
which, in its turn, registers with another GIIS, other GIISs. It can be seen as a grid-wide infor-
and so on. Federated service clients can get the mation server. It has a hierarchical mechanism,
resource information directly from a GRIS (for structurally like DNS, and each GIIS has its own
local resources) and/or a GIIS (for grid-wide re- name. This means client users can specify the
sources). The federated services use LDAP, which name of a GIIS node to search for information.
provides decentralized maintenance of resource The local health information providers translate
information. This information contains the objects the properties and status of local resources to the
managed by federated services, which represent format defined in the schema and configuration
components resources, bothstatic and dynamic. files. In order to add local health information re-
A GRIS is the repository of local resource sources to a GRIS of federated services, specific
information derived from information providers. information providers need to be created in a
It is able to register its information with a GIIS, predefined format.

86
GIS, Grid Computing and RFID in Healthcare Information Supply Chain

Figure 2. Components for a federated service provider

The broker discovers resources, establishes a list of operations to be performed (Figure 3).
their cost and capability, and then prepares a To process a job, the broker dispatcher deploys
schedule to map requesting jobs to resources. It its agent on that resource, who executes a list of
first identifies a service broker and assigns a job commands specified in the job’s task specification.
to it. A job has a task specification that specifies A typical task specification contains necessary

Figure 3. Components for a service broker

87
GIS, Grid Computing and RFID in Healthcare Information Supply Chain

commands and medical records from one of the Buyya, R. (2002). Economic-based distributed
agent resources, execution of the commands, and resource management and scheduling for grid
finally copying results back to the user. It can computing. Doctoral dissertation, Monash Uni-
also contain special commands for accessing the versity, Australia.
records from the remote database. The data broker
Chau, P., & Yip, P. (2003). Monitoring the severe
looks at the replica catalogue for a list of sites
acute respiratory syndrome epidemic and assess-
providing federated services, checks the status of
ing effectiveness of interventions in Hong Kong
those sites and selects a suitable site. The infec-
special administrative region. Journal of Epide-
tious distribution fetch command can then request
miology and Community Health, 57, 766–774.
the federated service provider for a patient record
doi:10.1136/jech.57.10.766
and sent to outbreak detection federation. They
should be deployable in forms flexible enough Chervenak, A., Foster, I., Kesselman, C., Salisbury,
to meet the needs of incidents of various scopes C., & Tuecke, S. (2001). The data grid: Towards
and complexities, and provide useful and reliable an architecture for the distributed management
information. Moreover, technologies, shared data and analysis of large scientific. Journal of Net-
and terminology are standardized from a variety work and Computer Applications, 23, 187–200.
of agencies. doi:10.1006/jnca.2000.0110
Esswein, E., Kiefer, M., Wallingford, K., Burr, G.,
Lee, L.-H., & Wang, J.-D. (2004). Environmen-
REFERENCES
tal and occupational health response to SARS,
Abrahams, J. (2001). Disaster management in Taiwan, 2003. Emerging Infectious Disease.
Australia: The national emergency management Retrieved from http://www.cdc.gov/ncidod/EID/
system. Emergency Medicine, 13(2), 165–173. vol10no7/03-0728.htm
doi:10.1046/j.1442-2026.2001.00205.x Flessa, S. (1999). Decision support for malaria
Boulos, M. N. K. (2004). Towards evidence-based, control programmes: A system dynamics model.
GIS-driven national spatial health information in- Health Care Management Science, 2(3), 181–191.
frastructure and surveillance services in the United doi:10.1023/A:1019044013467
Kingdom. International Journal of Health Geo- Forrester, J. (1961). Industrial dynamics. Cam-
graphics, 3(1), 1–50. doi:10.1186/1476-072X-3-1 bridge, MA: MIT Press.
Boulos, M. N. K. (2005). Research protocol: EB- Forslund, D., Joyce, E., Burr, T., Picard, R., Wok-
GIS4HEALTH UK—Foundation evidence-based oun, D., & Umland, E. (2004). Setting standards for
and ontology-based framework of modular, reus- improved syndromic surveillance. Engineering
able models for UK/NHS health and healthcare in Medicine and Biology Magazine, 23(1), 65–70.
GIS applications. International Journal of Health doi:10.1109/MEMB.2004.1297176
Geographics, 4(2), 1–14.
Forslund, D. W., & Kilman, D. G. (2000). The
Brailsford, S., Lattimer, V., Tarnaras, P., & Turn- impact of the global, extensible electronic health
bull, J. (2004). Emergency and on-demand health record (pp. 3–13).
care: Modelling a large complex system. The
Journal of the Operational Research Society, Forys, U. (2002). Marchuk’s model of immune
55(1), 34–42. doi:10.1057/palgrave.jors.2601667 system dynamics with application to tumour
growth. Journal of Theoretical Medicine, 4(1),
85–93. doi:10.1080/10273660290052151

88
GIS, Grid Computing and RFID in Healthcare Information Supply Chain

Foster, I., & Kesselman, C. (Eds.). (2004). The Luckett, S., & Grossenbacher, K. (2003). A critical
Grid: Blueprint for a new computating infrastruc- systems intervention to improve the implementa-
ture (2nd ed.). Morgan Kaufmann. tion of a district health system in KwaZulu–Natal.
Systems Research, 20, 147–162. doi:10.1002/
Foster, I., Kesselman, C., & Tuecke, S. (2001).
sres.535
The anatomy of the grid: Enabling scalable virtual
organizations. International Journal of High Ramljak, D., Puksec, J., Huljenic, D., Koncar, M.,
Performance Computing Applications, 15(3), & Simic, D. (2003). Building enterprise informa-
200–222. doi:10.1177/109434200101500302 tion system using model driven architecture on
J2EE platform. In 7th International Conference
Hoard, M., Homer, J., Manley, W., Furbee, P.,
on Telecommunications (pp. 521–526).
Haque, A., & Helmkamp, J. (2005). Systems
modeling in support of evidence-based disaster Reiskind, M., Baisly, K., Calampa, C., Sharp, T.,
planning for rural areas. International Journal Watts, D., & Wilson, M. (2001). Epidemiologi-
of Hygiene and Environmental Health, 208(1-2), cal and ecological characteristics of past dengue
117–125. doi:10.1016/j.ijheh.2005.01.011 virus infection in Santa Clara Peru. Tropical
Medicine, 6(3), 212–215. doi:10.1046/j.1365-
Homer, J., Hirsch, G., Minniti, M., & Pierson, M.
3156.2001.00703.x
(2004). Models for collaboration: How system
dynamics helped a community organize cost- Ritchie-Dunham, J. L., & Galv’an, J. F. M.
effective care for chronic illness. System Dynam- (1999). Evaluating epidemic intervention
ics Review, 20(3), 199–222. doi:10.1002/sdr.295 policies with systems thinking: A case study
of Dengue fever in Mexico. System Dynam-
Landesman, L. Y. (2001). Public health manage-
ics Review, 15(2), 119–138. doi:10.1002/
ment of disasters: The practice guide. Washington,
(SICI)1099-1727(199922)15:2<119::AID-
DC: American Public Health Association.
SDR163>3.0.CO;2-G
Lane, D., Monefeldt, C., & Rosenhead, J. (2000).
Scheffer, M., Carptenter, S., Foley, J. A., Folke,
Looking in the wrong place for healthcare im-
C., & Walker, B. (2001). Catastrophic shifts
provements: A system dynamics study of an ac-
in ecosystems. Nature, 413(6856), 591–595.
cident and emergency department. The Journal
doi:10.1038/35098000
of the Operational Research Society, 51, 518–531.
Tsiknakis, M., Katehakis, D. G., & Orphanou-
Li, C.-J., Liu, L., Chen, S.-Z., Wu, C. C., Huang,
dakis, S. C. (2002). An open, component-based
C.-H., & Chen, X.-M. (2004). Mobile healthcare
information infrastructure for integrated health
service system using RFID. In Proceedings of
information networks. International Journal
the 2004 IEEE International Conference on Net-
of Medical Informatics, 68, 3–26. doi:10.1016/
working, Sensing and Control (pp. 1014–1019).
S1386-5056(02)00060-6
Taipei, Taiwan.
White, S. (1994). Total quality management in
Lipsitch, M., & Bergstrom, C. T. (2004). Invited
the public sector: Issues for implementation.
commentary: Real-time tracking of control mea-
Health Information Management Journal, 24(3),
sures for emerging infections. American Journal
115–121.
of Epidemiology, 160(6), 517–519. doi:10.1093/
aje/kwh256

89
GIS, Grid Computing and RFID in Healthcare Information Supply Chain

WHO. (2003a). Severe acute respiratory syndrome Yousefi, S., & Lauridsen, J. (1998). Socio-spatial
(SARS): Status of the outbreak and lessons for the formalism and nonlinear interaction in the dif-
future. Retrieved June 2005 from http://www.who. fusion of epidemics in regional space. Technical
int/csr/media/sars wha.pdf Report ersa98p406, European Regional Science
Association. Retrieved from http://ideas.repec.
WHO. (2003b). Summary of probable SARS cases
org/p/wiw/wiwrsa/ersa98p406.html
with onset of illness from 1 November 2002 to
31 July 2003, September, 2003. Retrieved June Yu, H., & Lim, E.-P. (2004). LTAM: A location-
2005 from http://www.who.int/csr/sars/country/ temporal authorization model. In M. P. Willem
table2003 09 23/en Jonker (Ed.), Secure Data Management: VLDB
2004 Workshop, SDM 2004, LNCS 3178, (pp.
172–176). Springer-Verlag GmbH.

This work was previously published in Cases on Supply Chain and Distribution Management: Issues and Principles, edited by
Miti Garg and Sumeet Gupta, pp. 256-268, copyright 2012 by Business Science Reference (an imprint of IGI Global).

90
91

Chapter 9
On Modeling and Analysis
of Multidimensional
Geographic Databases
Sandro Bimonte
Laboratoire d’Informatique en Images et Systèmes d’information, France

ABSTRACT
Data warehouse and OLAP systems are tools to support decision-making. Geographic information
systems (GISs) allow memorizing, analyzing and visualizing geographic data. In order to exploit the
complex nature of geographic data, a new kind of decision support system has been developed: spatial
OLAP (SOLAP). Spatial OLAP redefines main OLAP concepts: dimension, measure and multidimen-
sional operators. SOLAP systems integrate OLAP and GIS functionalities into a unique interactive and
flexible framework. Several research tools have been proposed to explore and the analyze spatio-multi-
dimensional databases. This chapter presents a panorama of SOLAP models and an analytical review
of research SOLAP tools. Moreover, the authors describe their Web-based system: GeWOlap. GeWOlap
is an OLAP-GIS integrated solution implementing drill and cut spatio-multidimensional operators, and
it supports some new spatio-multidimensional operators which change dynamically the structure of the
spatial hypercube thanks to spatial analysis operators.

INTRODUCTION OLAP tools provide the ability to interactively


explore multidimensional data presenting detailed
A Data Warehouse is a centralized repository and aggregated data. The results of analyses are
of data acquired from external data sources and the basis of strategic business decisions.
organized following a multidimensional model It has been estimated that about 80% of data
(Inmon, 1996) in order to be analyzed by On- in databases contains geo-referenced informa-
Line Analytical Processing (OLAP) applications. tion (Franklin, 1992). Geographic Information
Systems (GISs) allow memorizing, analyzing and
DOI: 10.4018/978-1-4666-2038-4.ch009 visualizing geographic data.

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
On Modeling and Analysis of Multidimensional Geographic Databases

OLAP systems reduce geographic data to and we present spatio-multidimensional and GIS
simple textual data. Therefore, they do not pres- operators using a study case concerning pollution
ent any cartographic representation of geographic in French cities. Main outcome and limits of our
data. On the contrary, maps are fundamental for approach as regards to existing SOLAP tools are
the spatial decision making process because they detailed, also. Finally, future research directions
stimulate user’s cognitive process, and reveal hid- in spatio-multidimensional visualization and
den precious geospatial information. Therefore, interaction are discussed.
some solutions, called Spatial OLAP, which
integrate GIS cartographic visualization and
interaction functionalities into OLAP systems, BACKGROUND
have been developed. Spatial OLAP (SOLAP) is
“a visual platform built especially to support rapid Data Warehouse and OLAP Systems
and easy spatio-temporal analysis and exploration
of data following a multidimensional approach Data warehouse and OLAP systems are business
comprised of aggregation levels available in carto- intelligence tools intended to support multidimen-
graphic displays as well as in tabular and diagram sional analysis of huge datasets. Data are modelled
displays” (Bédard, 1997). Spatial OLAP systems according to the multidimensional model, which
integrate advanced OLAP and GIS functionalities is based on the concepts of dimensions and facts
(Rivest et al., 2005; Kouba et al., 2000). They (Inmon, 1996). Dimensions represent analysis
visualize measures on maps at different spatial axes. They are organized in hierarchies’ schemas.
granularities revealing relations between facts and Facts, described by numerical values (measures),
dimensions (Bédard, et al., 2001). Moreover, maps are subjects of analysis. Measures are analyzed
allow triggering spatio-multidimensional opera- at different granularities corresponding to dimen-
tors through simple mouse clicks, also. Different sion hierarchies’ levels, and they are aggregated
SOLAP models have been proposed. They address by means of SQL aggregation functions. The
various aspects of geographic information allow- instance of a multidimensional model is the hy-
ing to model different spatio-multidimensional percube. It is a set of cells representing measures
applications. SOLAP applications can address at all combinations of dimensions’ levels. OLAP
several and different domains: environmental operators permit to navigate into the hypercube.
studies, marketing, archaeology, epidemiology, Most common operators are drill and cut operators.
etc.. SOLAP models define the concepts of spa- Drill operators (i.e., Roll-Up and Drill-Down) let
tial/geographic dimension, spatial/geographic navigating into dimension hierarchies aggregat-
measure, and spatio-multidimensional operators. ing measures. Cut operators (i.e. Slice and Dice)
The integration of spatial data into multi- permit to reduce the analysis space, by selecting
dimensional models and systems rises several a sub-set of dimensions members.
theoretical and implementation issues. Therefore, Usually, data warehouse and OLAP systems
in this chapter, we introduce main OLAP and are based on a three-tier architecture. The first tier
GIS concepts. Then, a detailed review of SO- is the data warehouse, where data, coming from
LAP models, architectures and research SOLAP external heterogeneous sources, are uniformed
tools is presented. The chapter describes our and stored according to the multidimensional
Web-based prototype for the analysis of spatio- model. The second tier is the OLAP Server. It
multidimensional databases (GeWOlap) (Bimonte implements OLAP operators and pre-computes a
et al., 2006; Bimonte et al., 2007a, Bimonte et al., set of multidimensional queries to grant effective
2007b). We describe main architectural features, query response times. The OLAP server imple-

92
On Modeling and Analysis of Multidimensional Geographic Databases

ments other advanced functionalities also, such Layers are stored in GIS using Spatial DBMSs,
as control accesses, multidimensional calculation which extend classical DBMS with spatial types
engine, etc. Finally, the OLAP client allows visual- (i.e. point, line, etc.) and spatial functions (i.e.
izing and formulating OLAP queries in a unique, overlap, etc.). As an example, the spatial table
interactive and user-friendly visual interface representing French departments, and a spatial
composed of graphic displays and pivot tables. query selecting departments with an area greater
than 5000 Km2, using Oracle’s Spatial SQL, are
Geographic Information Systems shown on Figure 1a and Figure 1b, respectively. A
tuple creation and its cartographic representation
A Geographic Information System encompasses a are shown in Figure 1c and Figure 1d, respectively.
set of tools for organizing, memorizing, analyzing GIS’s geographic data can be analyzed by
and visualizing geographic information (Longley means of spatial analysis methods. They allow
et al., 2001). estimating, predicting and understanding spatial
Geographic information is the representation data and phenomena by means of (spatial) statis-
of geo-referenced real phenomena. It is described tic (i.e. centroid, etc.), transformation (i.e. buffer,
by a spatial component (the shape and the posi- etc.) and querying techniques. These methods are
tion on earth surface) and a set of alphanumerical supported by visual representation of maps’ fea-
attributes and (spatial) relationships. tures. Indeed, visualization plays a central role
Main GIS functionality is the integration for the spatial analysis process. Map feature’s
and memorization of geographic data. Data are geometries are represented using a two dimen-
organized in layers. A layer represents a set of sional space. Alphanumeric attributes are visual-
geographic data of the same type (i.e. roads, ized with graphic displays (i.e. bars, pies charts,
buildings, departments, etc.). Each element of the etc.), coded using visual variables (size, value,
layer (geographic object or feature) is described colour, shape, orientation and grain) (Bertin &
by geometry and some alphanumeric attributes. Bonin, 1992). GISs make possible to change map’s

Figure 1. Spatial SQL a) Spatial table creation b) Spatial query c) Geographic object creation d) Tabular
and cartographic representations

93
On Modeling and Analysis of Multidimensional Geographic Databases

scale, reduce visualized features, access to alpha- & Zimányi (2005) define a spatial dimension as a
numeric information using the map, etc. Some- set of spatial hierarchies. A “spatial hierarchy” is
times, maps provide also an interface to advanced a hierarchy with at least one level with the spatial
(visual) analysis techniques. Users can explore attribute (spatial level). Topological intersection or
alphanumeric data, perform complex analysis inclusion relationships exist between members of
methods, and obtain personalized and complex different spatial levels. Bimonte (2008) proposes
visual representations of data by simply interact- the concept of “Geographic Dimension” enrich-
ing with map’s features (interactive maps). Inter- ing spatial dimensions with “Map Generalization
active maps are the basis of geovisualization Hierarchies”. “Map Generalization Hierarchy”
systems (MacEachren, et al., 2004). Such systems represents geographic information at different
integrate scientific visualization and image scales or according secondary themes where the
analysis techniques, and GIS tools into an interac- members of a level are the result of map generaliza-
tive, flexible and user-friendly framework in order tion operators applied to the members of the lower
to explore and analyze spatial data. Interactive level. Map generalization is the process used to
capabilities are mandatory for spatial decision- derive data and maps for secondary scales and/or
making process (MacEachren & Kraak, 1997). themes, preserving a good and clear representation
focused on the goal of the map (Weibel & Dutton,
2001). Map generalization provides a simpli-
SPATIO-MULTIDIMENSIONAL fied vision of the spatial phenomenon enriching
DATABASES: MODELS AND TOOLS spatio-multidimensional analysis capabilities and
improving SOLAP clients’ visualization.
Spatio-Multidimensional Models An example of SOLAP application using a spa-
tial dimension is a study for pollution supervision
The more natural and common manner to integrate in French cities. This multidimensional application
spatial information into multidimensional models, presents three dimensions: “Time”, “Pollutants”,
is to use it as a dimension. As defined in Bédard et and “Location” (spatial dimension), and a nu-
al. (2001), a spatial dimension can be “spatial non merical fact, “Pollution”. This fact is depicted by
geometric” (i.e. with text only members), “spatial three measures giving minimum, maximum and
geometric” (i.e. with members with a cartographic average pollution values (see Figure 2) (Bimonte
representation) or “mixed spatial” (i.e. combining et al., 2007a). This multidimensional application
cartographic and textual members). Malinowsky answers questions like “What are the average,

Figure 2. Multidimensional model with spatial dimension

94
On Modeling and Analysis of Multidimensional Geographic Databases

min and max values per month, and pollutant for “City” level of application of Figure 2 is now used
departments with population above 2M?” as spatial measure. The spatial measure is aggre-
A very different way to introduce spatial in- gated using the topological union. Pollution values
formation in data warehouses is using it as an grouped by 5mg/l are used as analysis dimension.
analysis subject, i.e. as a fact. Different definitions This model analyzes polluted French cities ac-
of the spatial measure can be found in literature: cording to time, pollutants and pollution values.
a collection of geometries (spatial objects) (Ste- In this model, the user should be able to deduce
fanovic et al., 2000; Rivest et al. 2001), geometries information about the influence of geographical
or numerical values resulting from spatial (i.e. location of cities in the pollution problem.
topological and metric) operators (Malinowsky Descriptive attributes of geographic data could
& Zimányi, 2004), and/or spatial members be useful to the spatio-multidimensional deci-
(Marchand et al., 2003). Spatial aggregations (i.e. sional process. Thus, Bimonte (2008) introduces
union, intersection, etc.) replace SQL SUM, MIN, the concept of “Geographic Measure”. A “Geo-
MAX, AVG, and COUNT functions. Maps, then, graphic Measure” is a geographic object described
are the cells of the hypercube. by alphanumeric and spatial attributes. Moreover,
Let us take the spatio-multidimensional model it could belong to one or many hierarchy schemas.
given in Figure 3a. The spatial attribute of the

Figure 3. Spatio-multidimensional models a) Spatial measure, b) Geographic measure

95
On Modeling and Analysis of Multidimensional Geographic Databases

This establishes a complete symmetry of geo- kind of operators which change the structure of
graphic measures and geographic dimensions. the spatial hypercube through the introduction of
Replacing the spatial measure of the multidi- new spatial members into geographic dimension
mensional application in the previous example thanks to spatial analysis operators.
with the geographic measure representing cities
(see Figure 3b), it should be possible to answer Spatial OLAP Tools: A Panorama
queries like: “What cities, their population, and
their socio-economic types, are polluted by CO2 Bédard et al., (2005) identify three typologies
per month?” Indeed, a city is a geographic object of Spatial OLAP tools (OLAP dominant, GIS
described by geometry and two alphanumeric dominant and OLAP-GIS integrated), according
attributes: population and socio-economic type. to their OLAP and GIS functionalities. Some
Note that a (spatial) aggregation function is ap- prototypes have been implemented. They allow
plied to each (spatial) attribute of the geographic an effective spatio-multidimensional analysis of
measure (i.e. topological union for geometry, spatial data warehouses thanks to advanced OLAP
list for name, sum for population and a ratio for and GIS functionalities. However, an overall set
socio-economic type). of mandatory functionalities has not been defined,
Spatio-multidimensional operators extend yet. Indeed, the following SOLAP research tools’
drill and cut OLAP operators. “Spatial Roll-up” panorama reveals that spatial data mining tools,
and “Spatial Drill-down” authorize to navigate spatial analysis methods and geovisualization
into spatial dimensions by the simple interaction techniques should enrich spatio-multidimensional
with the map component of the SOLAP user- operators.
interface (Rivest et al., 2005). “Spatial Slice”
makes possible cutting the spatial hypercube by GIS Dominant Solutions
selecting directly spatial members through SO-
LAP user-interface (Rivest et al., 2005), using GIS dominant solutions simulate OLAP server
spatial/alphanumeric predicates (Sampaio et al., thanks to a relational database modelled using
2006) or spatial analysis operators such as buffer the star or the snow-flake schema. As shown
(Scotch & Parmanto, 2006). Exploiting the sym- in Bédard et al. (2005), these solutions provide
metrical representation of geographic dimensions all GIS functionalities: storage, analysis and
and measures, Bimonte (2008) proposes two visualization. However, since they lack of an
operators, “Permute” and “Measure navigation”. OLAP Server, they do not implement advanced
“Permute” allows exchanging dimension and OLAP functionalities, such as derived measures,
geographic measure. This operator dynamically complex hierarchies, etc.. Consequently, GIS
modifies the structure of the spatial hypercube. dominant solutions limit spatio-multidimensional
“Measure navigation” allows navigating into analysis capabilities. To best of our knowledge,
the geographic measure’s hierarchy, changing the only GIS dominant solution is CommonGis
granularity of the measure on the fly. For instance, (Voss et al., 2004). It is a geovisualization system
since cities belong to departments (Figure 3b), extended to support multidimensional databases.
“Measure navigation” operator permits to analyze It provides multi-criteria functionalities, and spa-
polluted French departments (instead of cities) tial analysis and visualization techniques for the
along time and pollutants dimensions. Moreover, analysis of spatio-temporal data, using temporal
to make the spatio-multidimensional paradigm series. CommonGIS has been adapted to analy-
more flexible and being closer to spatial analysis sis of spatio-multidimensional datasets, where
process, Bimonte et al. (2007b) propose a new spatial information is used as analysis axes. The

96
On Modeling and Analysis of Multidimensional Geographic Databases

user interface is flexible and interactive. It offers cartographic component and a text zone to define
spatio-multidimensional drill and cut operators, spatio-multidimensional queries.
and some advanced geovisualization techniques. Finally Shekhar et al. (2001) and Han et al.
(1997) introduce data mining techniques into
OLAP Dominant Solutions OLAP systems, and Pourabbas & Rafanelli (2002)
use visual languages. In particular, Shekhar et
OLAP dominant solutions provide all advanced al., (2001) develop the cube operator extending
multidimensional analysis functionalities thanks aggregation of numerical data to spatial data. The
to an OLAP system. On the contrary, GIS func- Web-based prototype is based on this operator, and
tionalities are reduced to simple cartographic it is especially conceived for the observation and
visualization and selection of geographic objects the discovery of spatio-temporal trends. In Pourab-
(Bédard et al., 2005). OLAP dominant solutions bas & Rafanelli (2002) a visual language is used
can be grouped into two classes: tools using static to query spatial data warehouses. This approach
maps (Stolte et al., 2003; Colonnese et al., 2005; allows the user to formulate slice operations using
Mohraz, 2000) and tools using interactive maps an iconic language.
(Silva et al., 2006; Sampaio et al., 2006; Shek-
har et al., 2001; Han et al., 1997; Pourabbas & OLAP-GIS Integrated Solutions
Rafanelli, 2002).
Polaris system (Stolte et al., 2003) allows the OLAP-GIS integrated solutions combine GIS
visualization of alphanumeric measures, using and OLAP functionalities. GIS analysis and vi-
non-interactive maps incorporated into the cells sualization functionalities enrich and complete
of the pivot table. PostGeOLAP (Colonnese et al., OLAP navigation and visualization functionalities,
2005) is an open source SOLAP tool supporting allowing a real and effective analysis of spatial
numerical measures and spatial dimensions. It data warehouses. Some OLAP-GIS integrated
provides a set of methods to create spatio-mul- solutions have been developed (Rivest et al., 2005;
tidimensional databases and materialize spatial Scotch & Parmanto, 2006; Matias & Moura-Pires,
views. In Mohraz (2000) the OLAP system SAP’s 2005; Escribano et al., 2007).
Business Information Warehouse is integrated These solutions store spatial multidimensional
with a cartographic visualization tool. data using a Spatial DBMS. Spatial data warehous-
Silva et al. (2006) present a Web-based SO- es allow modelling complex spatial hierarchies,
LAP solution, whose principal feature is the use topological intersection relationships, and spatial/
of geographic Web services for the definition of geographic measures. Star and snowflake schemas
GeoMDQL. GeoMDQL is a new query language are modified to handle spatial data (Stefanovic et
for spatial data warehouses. It extends the OLAP al., 2000; Malinowsky & Zimányi, 2007).
language of Microsoft (MDX) with spatial types. (S)OLAP Servers implement spatio-multidi-
The prototype is based on OLAP Server Mondrian, mensional operators explicitly taking into account
which has been modified to handle GeoMDQL spatial members’ geometry to define cutting
queries, and OLAP client JPivot, which is coupled spatial predicates and to perform drill operators.
with an interactive map. In Sampaio et al. (2006), They allow defining spatial members, hierarchies,
the authors describe a Web-based SOLAP sys- measures and derived measures.
tem which handles spatial measures and allows SOLAP clients improve OLAP clients thanks
querying spatial data warehouses using drill to cartographic visualization and interaction
and cut operators on spatial dimensions. This functionalities. They combine and synchronize
solution presents a Web interface composed of a tabular and graphic visualization components with

97
On Modeling and Analysis of Multidimensional Geographic Databases

interactive maps (Rivest et al., 2005). SOLAP In the following paragraph we describe our
operators are available through the simple inter- OLAP-GIS integrated solution.
action with maps. Each spatial level corresponds
to a cartographic layer. Performing (S)OLAP op- GeWOlap: Spatio-
erators on the pivot table and/or on the map will Multidimensional Analysis Tool
result in the calculation and displaying of a new
clickable map (interactive map), merging differ- GeWOlap is a Web-based system for exploration
ent layers according to the different granularities and analysis of spatio-multidimensional databases
selected for the spatial members, and displaying (Bimonte et al., 2006; Bimonte et al., 2007a,
aggregated and/or detailed measures with visual Bimonte et al., 2007b). It provides advanced (S)
variables and/or graphic displays (i.e. bars, pies, OLAP and GIS functionalities through a flexible
etc.) (Rivest et al., 2005). interactive and user-friendly interface. GeWOlap
For instance, considering the application of is based on three tier architecture.
Figure 2, one should be able to show and to inter- The spatial data warehouse tier is implemented
act with a map displaying average and minimum using Oracle for its native support for spatial data
values for selected months and pollutants at the and its object-relational capabilities.
granularity of cities for one department, and at OLAP server Mondrian provides an easily
department level for the rest of the map. customizable, full featured (i.e. aggregate tables,
In order to take advantage from human visual user-defined functions, complex hierarchies,
perception capabilities, interactive maps should MDX support, etc.) OLAP server. It is important
be coupled with advanced geovisualization tech- to notice that no spatial extension has been inte-
niques such as MultiMaps, Space-Time Cubes, etc. grated into Mondrian. Spatial data is represented
(MacEachren et al., 2004). Finally, GIS operators using simple textual labels. In order to overcome
are available through the cartographic component. this limitation, GeWOlap handles spatial data in
Indeed, decision makers should be able to custom- the spatial data warehouse and client tiers, and it
ize their maps, add layers, query the spatial data provides a mapping mechanism between spatial
warehouse, use spatial analysis tools, etc. data and their textual representation in the OLAP
Rivest et al. (2005) present a SOLAP tool Server.
which permits to enrich the elements of the spa- The client tier is implemented using the OLAP
tial data warehouse with multimedia documents client JPivot to provide JSP pages, and a Java applet
such as photos, videos, etc.. SOVAT (Scotch & developed using MapXtreme Java (MapXtreme)
Parmanto, 2006) extends SOLAP functionalities to support map visualization/interaction and GIS
with (spatial) data mining tools (i.e. clustering, functionalities. The client tier allows a visual rep-
etc.). It provides also statistic and spatial analysis resentation of spatio-multidimensional structures
methods. The SOLAP tool described in Matias [(geographic) dimensions and measures] through
& Moura-Pires (2005) allows overlapping two pivot table, graphic displays and interactive map.
hypercubes using their spatial dimensions. PIET Figure 4 displays the visual interface of Ge-
(Escribano et al., 2007) is a Web-based SOLAP WOlap for the application of Figure 2. The pivot
solution providing a new query processing method. table represents pollution values for Ile de France
Spatial members are first organized into open region and for its departments (Essone, Hautes de
convex polygons, and then the overlay of these Seine, etc.). The cartographic component shows a
polygons is used to answer to multidimensional thematic map representing departments’ pollution
queries using user-defined window queries. values using pie charts.

98
On Modeling and Analysis of Multidimensional Geographic Databases

Figure 4. GeWOlap’s user interface

Spatio-multidimensional and GIS operators The user can cut the hypercube by using the
are accessible through the simple interaction with “Cube Navigator” tool provided by JPivot. The
the pivot table and the interactive map of the cli- Cube Navigator provides a tree representation of
ent using only few mouse clicks. dimension members which can be used to custom-
In particular, GeWOlap provides a set of ize pivot table axes and select a sub-set of mem-
drill operators which are available through the bers. Moreover, GeWOlap extends OLAP cut
interaction with the pivot table and the map: operators by introducing two new cut operators:
“roll-up drill-down replace”, “drill-down posi- “Slice Predicate” and “Slice Position” (Bimonte
tion”, “expand-all”, “drill-through” (Bimonte et et al., 2007b). Thanks to “Slice Predicate”, the
al., 2007a). user can select spatial members by directly click-
For instance, let us suppose that the user wishes ing on the interactive map. “Slice Predicate” allows
to “see” the measures for departments of the Ile cutting the hypercube through Spatial SQL queries.
de France region. By pointing the mouse on that Let us suppose the user is interested in pollu-
region she/he can apply the drill-down position tion values for departments crossed by from the
operator (Figure 5a). As a result, the pivot table Seine river. Then, thanks to the Slice Predicate’s
displays average pollution values for the Ile de wizard she/he defines its spatial query:
France region, and for other departments (Figure
5b). Several synchronization problems rise from select * from MEASURES_VIEW, RIVERS
the topological inclusion relationships between where (MEASURES_VIEW.NAME = ‘Essone’)
spatial members of different levels, and the num- OR… OR (MEASURES_VIEW.NAME = ‘Yve-
ber of measure values that must be displayed. For lines’) AND SDO_GEOM.RELATE(MEASURES_
example, unlike the pivot table, the map cannot VIEW.GEOM, ‘ANYINTERACT’,RIVERS.
display at the same time a region and its depart- GEOM, 0.5)= ‘TRUE’ AND RIVERS.NAME =
ments, and visualize the pollution values for each ‘Seine’
pollutant granting a good cartographic readability.

99
On Modeling and Analysis of Multidimensional Geographic Databases

Figure 5. Spatial Drill-down position, a) Input: Ile de France region, b) Output: Departments of Ile de
France

As a result, the pivot table and the map show important limit, as the spatial analysis process is
only the departments satisfying the spatial predi- iterative and flexible, or, in other words, the user
cate. can change and transform spatial data all along the
Cut and drill operators allow to explore the spatial decisional process. GeWOlap fits buffer,
spatial hypercube using, directly or not, the spa- overlay and dissolve spatial analysis operators to
tial attribute of the spatial members. The spatial the multidimensional paradigm. Indeed, GeWOlap
hypercube does not change its structure. As under- provides three new spatio-multidimensional
lined in Bimonte, et al. (2007b), it represents an operators which dynamically create new spatial

100
On Modeling and Analysis of Multidimensional Geographic Databases

Figure 6. Overlay operator

members thanks to spatial analysis operators and details about aggregation and visualization of
calculate their associated measures using MDX geographic measures, and the implementation of
formula and/or Java user-defined functions. The “Permute” operator can be found in Bimonte et
“Dissolve” operator merges adjacent spatial al. (2006) and Bimonte (2007).
members having an alphanumeric attribute with In addition to spatio-multidimensional operators,
the same value. This attribute is chosen by the GeWOlap provides also pure GIS functionalities:
user through the Dissolve’s wizard. The “Buffer” “Zoom in/out”, “Pan”, “Retrieve”, “Map print”,
operator (Bimonte, et al., 2007b) creates a buffer “Map export”, “Rule tool” and “Control Layer”.
region around one spatial member selected by “Pan” allows moving the map using the mouse,
the user through a mouse click on the interactive “Map print” permits to print the map and “Map ex-
map. The distance of the region buffer is chosen port” permits to save map in JPG and PDF formats.
by the user thanks to the Buffer’s wizard. The “Rule Tool” is a metric tool that calculates distance
GIS overlay operator creates a new layer whose between two points selected by the user. Finally,
features are obtained using the intersection op- “Control Layer” provides different functionalities.
erator on two input layers as shown in Figure 6. It allows customizing the visual representation of
GeWOlap adapts this operator to create n new visual variables: colour, size, etc., backgrounding
spatial members (Bimonte, et al., 2007b). the spatio-multidimensional application by adding
We present here an example of the “Dissolve” raster and/or vector layers, and querying the spatial
operator as it is representative of this class of data warehouse using Spatial SQL.
operators. An example is shown in Figure 7. Start- In conclusion, GeWOlap is a full-featured
ing from the query represented in Figure 7a, which OLAP-GIS integrated solution, which supports
shows pollution average values for Ile de France’s geographic dimension and numerical measures,
departments, the analyst chooses the department and implements drill and cut multidimensional
type attribute. Since “Essone” and “Seine-et- operators. Moreover, GeWOlap enriches exist-
Marne” are adjacent and their type is “Commer- ing SOLAP tools functionalities supporting
cial”, then they are merged into one new region. measures defined as geographic objects (geo-
A new spatial member (“Essone-Seine-et-Marne”) graphic measures) and implementing three new
is created, and its measure is calculated using a spatio-multidimensional operators which change
weighted average on the surface (Figure 7b). the structure of the spatial hypercube thanks to
Previous examples show spatio-multidimen- spatial analysis operators. However, some limits
sional operators applied to geographic dimensions remain to overcome: the integration of ad-hoc
with numerical measures. Similarly, GeWOlap indexing techniques (Papadias, et al., 2002) to
supports geographic measures. In this case, pivot improve the performance of our system, and the
table’s cells contain identifiers of geographic implementation of mechanisms for automatically
objects and the interactive map shows geograph- detecting and computing spatial materialized
ic measure dynamically chosen by the user. More views (Colonese et al., 2005).

101
On Modeling and Analysis of Multidimensional Geographic Databases

Figure 7. Dissolve operator, a) Input: Ile de France’s departments, b) Output: Departments of Ile de
France and the new spatial member Essone-Seine-et-Marne

FUTURE RESEARCH DIRECTIONS Map generalization is mandatory for SOLAP


systems (Bimonte, 2008). On the other hand,
Spatial OLAP technologies have reached matu- map generalization implies the reformulation of
rity. They allow an effective multidimensional classical spatial hierarchies by introducing multi-
analysis of spatial data warehouses. However, association relationships and taking into account
the introduction of complex spatial data raises imprecise measures. (Un)Fortunately, visualiza-
several problems. tion of multidimensional imprecise (spatial) data
is an open issue (Pang, 2008).

102
On Modeling and Analysis of Multidimensional Geographic Databases

Spatial OLAP models and tools are based on sentation of the spatial dimension (point, line or
the vector representation of spatial data. Field polygon). Moreover, GIS users are usually specific
data, which represent spatial data as a regular grid knowledge domain decision makers. GIS takes
whose cells are associated with alphanumeric at- into account their profiles and preferences in order
tributes, could be used for spatio-multidimensional to provide well-suited cartographic visualization
analysis. Indeed, Ahmed & Miquel (2005) provide (Vangenot, 2001). Thus, the ambition is to define
a continuous representation of spatial dimensions, a method to automatically find out the most ap-
but the introduction of field data as analysis subject propriate cartographic representation of SOLAP
remains unexplored. Aggregation of field data queries results. The visual variables (size, colours,
could be supported by Map Algebra operators. Map etc.) and the graphic representation (i.e. bar, pie,
Algebra defines a set of operations on field data etc.) used to represent measures on maps can be
(local, focal and zone operators) (Tomlin, 1990). automatically deduced thanks to SOLAP query
Adaptation of Map Algebra to multidimensional patterns taking into account number, type and
data structures, definition of pre-aggregation and current representation of dimensions, measures
visualization/interaction techniques for continu- types, aggregation functions involved in the query
ous measures are challenges to overcome for an and user profile and/or preferences (Bellatreche,
effective SOLAP tool supporting field data. et al., 2005).
The integration of trajectories data into data
warehouses raises several problems because
classical multidimensional models are based on CONCLUSION
discrete facts and dimensions and they do not
take into account spatial predicates. This problem Spatial OLAP refers to the introduction of spatial
has been investigated by some works (Wan et al., data into data warehouse and OLAP systems.
2007; Orlando et al., 2007) in the last years. How- SOLAP enhances decision analysis capabilities
ever, the definition of a SOLAP client to visually of OLAP systems allowing exploiting the com-
query and analyze trajectory data warehouses is plex nature of geographic information. SOLAP
an unexplored important challenge. redefines main OLAP concepts. It defines spatial/
Bertolotto et al. (2007) affirm that visual geographic dimensions as dimensions with spa-
analysis of spatial data mining results is improved tial attributes, spatial measures as a collection of
by exploiting the third dimension of spatial data spatial objects or the result of spatial operators,
through an interactive 3D geovisualization system. and geographic measures as geographic objects
The integration of advanced geovisualization belonging to hierarchy schemas. SOLAP extends
techniques within OLAP clients in order to support multidimensional navigation operators defining
multidimensional 3D spatial data is an interesting spatial drill and cutting operators which allow
research direction. navigating into spatial/geographic dimensions and
Finally, several semiology problems have to cutting the hypercube thanks to spatial and non-
be solved for the correct and relevant visualiza- spatial predicates. Other spatio-multidimensional
tion of measures. Measures can be displayed with operators permit to change the structure of the spa-
labels but it is sometimes worthwhile to use more tial hypercube thanks to spatial analysis operators,
expressive, significant visual components. The to permute dimensions and geographic measures,
way measures will be displayed on the map must and to navigate in the hierarchy of the geographic
depend on several criteria: nature of the measure measure. SOLAP tools are based on the integration
(quantitative or qualitative measure), number of OLAP and GIS functionalities. In this work, we
of measures to be displayed, and current repre- have provided an analytical overview of SOLAP

103
On Modeling and Analysis of Multidimensional Geographic Databases

tools grouping them according to their architecture Bertin, J., & Bonin, S. (1992). La graphique et
approach. We described our Web-based prototype le traitement graphique de l’information. Paris:
(GeWOlap) for the multidimensional analysis of Flammarion.
huge spatial datasets. GeWOlap combines SOLAP
Bertolotto, M., Di Martino, S., Ferrucci, F.,
and GIS operators into a unique interactive and
& Kechadi, T. (2007). Towards a frame-
user-friendly framework. GeWOlap supports geo-
work for mining and analyzing spatio-tem-
graphic dimensions and geographic measures, and
poral datasets. International Journal of Geo-
it introduces some new spatio-multidimensional
graphical Information Science, 21(8), 1–12.
operators which allow to change the structure
doi:10.1080/13658810701349052
of the hypercube thanks to dissolve, buffer and
overlay GIS operators. Bimonte, S. (2007). Vers l’intégration de
l’information géographique dans les entrepôts de
données et l’analyse en ligne: De la modélisation
REFERENCES à la visualization. Unpublished doctoral disserta-
tion, INSA Lyon, France.
Ahmed, T., & Miquel, M. (2005). Multidimen-
sional structures dedicated to continuous spatio- Bimonte, S. (2008). Des entrepôts de données,
temporal phenomena. In M. Jackson, D. Nelson, l’analyse en ligne et l’information géographique.
& S. Stirk (Eds.), Proceedings of the 22nd British Journal of Decision Systems.
National Conference on Databases (Vol. 3567, Bimonte, S., Tchounikine, A., & Miquel, M.
pp. 29-40). Berlin, Germany: Springer. (2007). Spatial OLAP: Open issues and a Web
Bédard, Y. (1997). Spatial OLAP. In Proceedings based prototype. In M. Wachowicz & L. Bodum
of the 2nd Forum annuel sur la R-D, Géomatique (Eds.), Proceedings of the 10th AGILE Interna-
VI: Un monde accessible, Montréal, Canada. tional Conference on Geographic Information
Science.
Bédard, Y., Merrett, T., & Han, J. (2001). Funda-
ments of spatial data warehousing for geographic Bimonte, S., Tchounikine, A., Miquel, M., &
knowledge discovery. In H. Miller & J. Han (Eds.), Laurini, R. (2007). Vers l’intégration de l’analyse
Geographic data mining and knowledge discovery spatiale et multidimensionnelle. In M. Batton-Hu-
(pp. 53-73). London: Taylor & Francis. bert, T. Joliveau, & S. Lardon (Eds.), Proceedings
of the Colloque International de GEOmatique et
Bédard, Y., Proulx, M., & Rivest, S. (2005). Enrich- d’Analyse Spatiale.
issement du OLAP pour l’analyse géographique:
Exemples de réalisation et différentes possibilités Bimonte, S., Wehrle, P., Tchounikine, A., &
technologiques. In Revue des Nouvelles Technolo- Miquel, M. (2006). GeWOlap: A Web based spa-
gies de l’Information, Entrepôts de données et tial OLAP proposal. In R. Meersman, Z. Tari, &
l’Analyse en ligne (pp. 1-20). P. Herrero (Eds.), Proceedings of the Workshop
on Semantic-Based Geographical Information
Bellatreche, L., Giacometti, A., Laurent, D., & Systems (Vol. 4278, pp. 1596-1605). Berlin,
Mouloudi, H. (2005). A personalization frame- Germany: Springer.
work for OLAP queries. In I. Y. Song & Y. Trujillo
(Eds.), Proceedings of the ACM 8th International Colonese, G., Manhaes, R., Montenegro, S., Car-
Workshop on Data Warehousing and OLAP (pp. valho, R., & Tanaka, A. (2005). PostGeoOlap: an
9-18). New York: AMC Press. open-source tool for decision support. In Proceed-
ings of the 2nd Simpósio Brasileiro de Sistemas
de Informação.

104
On Modeling and Analysis of Multidimensional Geographic Databases

Escribano, A., Gomez, L., Kuijpers, B., & Vais- Malinowsky, E., & Zimányi, E. (2005). Spatial
man, A. (2007). Piet: A GIS-OLAP implementa- hierarchies and topological relationships in Spa-
tion. In I. Y. Song & T. B. Pedersen (Eds.), Pro- tialMultiDimER model. In M. Jackson, D. Nelson,
ceedings of the ACM 10th International Workshop & S. Stirk (Eds.), Proceedings of the 22nd British
on Data Warehousing and OLAP (pp. 73-80). New National Conference on Databases (Vol. 3567,
York: ACM Press. pp. 17-28). Berlin, Germany: Springer.
Franklin, C. (1992). An introduction to geographic Malinowsky, E., & Zimányi, E. (2007). Logical
information systems: Linking maps to databases. representation of a conceptual model for spatial
Database, 15(2), 13–21. data warehouses. Geoinformatica.
Han, J., Kopersky, K., & Stefanovic, N. (1997). Marchand, P., Brisebois, A., Bédad, Y., & Edwards,
GeoMiner: A system prototype for spatial data G. (2003). Implementation and evaluation of a
mining. In J. Peckham (Ed.), Proceedings of the hypercube-based method for spatio-temporal ex-
ACM SIGMOD International Conference on ploration and analysis. Journal of the International
Management of Data (pp. 553-556). New York: Society of Photogrammetry and Remote Sensing,
ACM Press. 59(1), 6–20. doi:10.1016/j.isprsjprs.2003.12.002
Inmon, W. H. (1996). Building the data warehouse Matias, R., & Moura-Pires, J. (2005). Spatial
(2nd ed.). New York: John Wiley & Sons. on-line analytical processing: A tool to analyze
the emission of pollutants in industrial installa-
Kouba, Z., Matousek, K., & Miksovsky, P. (2000).
tions. In C. Bento, A. Cardoso, & G. Dias (Eds.),
On data warehouse and GIS integration. In M.
Proceedings of the 5th International Workshop on
Ibrahim, J. Kung, & N. Revell (Eds.), Proceedings
Extraction of Knowledge from Databases and
of the 11th International Conference on Database
Warehouses.
and Expert Systems Applications (Vol. 1873, pp.
604-613). London: Springer. Mohraz, K. (2000). Geographical analysis in SAP
business information warehouse. In K. Li, K.
Longley, P., Goodchild, M., Maguire, D., & Rhind,
Makki, N. Pissinou, & S. Ravada (Eds.), Proceed-
D. (2001). Geographic information systems and
ings of the 8th ACM Symposium on Advances in
science. New York: John Wiley & Sons.
Geographic Information Systems (pp. 191-193).
MacEachren, A., Gahegan, M., Pike, W., Brewer, Washington: ACM Press.
I., Cai, G., Lengerich, E., & Hardisty, F. (2004).
Orlando, S., Orsini, R., Raffaetà, A., Roncato, A.,
Geovisualization for knowledge construction and
& Silvestri, C. (2007). Spatio-temporal aggrega-
decision support. IEEE Computer graphics and
tions in trajectory data warehouses. In I. Y. Song,
application, 24(1) 13-17.
J. Eder, & T. M. Nguyen (Eds.), Proceedings of the
MacEachren, A., & Kraak, M. (1997). Exploratory 9th International Conference Data Warehousing
cartographic visualization: Advancing the agenda. and Knowledge Discovery (Vol. 4654, pp. 66-77).
Computers & Geosciences, 23(4), 335–343. Berlin, Germany: Springer.
doi:10.1016/S0098-3004(97)00018-6
Pang, A. (2008). Visualizing uncertainty in
Malinowsky, E., & Zimányi, E. (2004). Represent- natural hazards. In A. Bostrom, S. French, & S.
ing spatiality in a conceptual multidimensional Gottlieb (Eds.), Risk assessment, modeling and
model. In D. Pfoser, I. Cruz, & M. Ronthaler decision support: Strategic directions series: Risk,
(Eds.), Proceedings of the 12th ACM International governance and society (pp. 261-294). Berlin,
Workshop on Geographic Information Systems Germany: Springer.
(pp. 12-22). New York: ACM Press.

105
On Modeling and Analysis of Multidimensional Geographic Databases

Papadias, D., Tao, Y., Kalnis, P., & Zhang, J. Silva, J., Times, V., & Salgado, A. (2006). An open
(2002). Indexing spatio-temporal data ware- source and Web based framework for geographic
houses. In Proceedings of the 18th International and multidimensional processing. In H. Had-
Conference on Data Engineering (pp. 166-175). dad (Ed.), Proceedings of the ACM Symposium
Los Alamitos, CA, USA: IEEE Computer Society. on Applied Computing (pp. 63-67). New York:
ACM Press.
Pourabbas, E., & Rafanelli, M. (2002). A pictorial
query language for querying geographic databases Stefanovic, N., Han, J., & Kopersky, K. (2000).
using positional and OLAP operators. SIGMOD Object-based selective materialization for efficient
Record, 31(2), 22–27. doi:10.1145/565117.565121 implementation of spatial data cubes. IEEE Trans-
actions on Knowledge and Data Engineering,
Rivest, S., Bédard, Y., & Marcand, P. (2001). To-
12(6), 938–958. doi:10.1109/69.895803
wards better support for spatial decision-making:
Defining the characteristics of spatial on-line Stolte, C., Tang, D., & Hanrahan, P. (2003). Multi-
analytical processing. Journal of the Canadian scale visualization using data cubes. IEEE Trans-
Institute of Geomatics, 55(4), 539–555. actions on Visualization and Computer Graphics,
9(2), 176–187. doi:10.1109/TVCG.2003.1196005
Rivest, S., Bédard, Y., Proulx, M., Nadeaum, M.,
Hubert, F., & Pastor, J. (2005). SOLAP: Merging Tomlin, D. (1990). Geographic information sys-
business intelligence with geospatial technology tems and cartographic modeling. Upper Saddle
for interactive spatio-temporal exploration and River, NJ: Prentice Hall.
analysis of data. Journal of International Society
Vangenot, C. (2001). Supporting decision-making
for Photogrammetry and Remote Sensing, 60(1),
with alternative data representations. Journal of
17–33. doi:10.1016/j.isprsjprs.2005.10.002
Geographic Information and Decision Analysis,
Sampaio, M., Sousa, A., & Baptista, C. (2006). 5(2), 66–82.
Towards a logical multidimensional model for
Voss, A., Hernandez, V., Voss, H., & Scheider, S.
spatial data warehousing and OLAP. In Y. Song &
(2004). Interactive Visual exploration of multidi-
P. Vassiliadis (Eds.), Proceedings of the 9th ACM
mensional data: Requirements for CommonGIS
International Workshop on Data Warehousing
with OLAP. In Proceedings of the 15th Interna-
and OLAP (pp. 83-90). New York: ACM Press.
tional Workshop on Database and Expert Systems
Scotch, M., & Parmanto, B. (2006). Develop- Applications (pp. 883-887). Los Alamitos, CA:
ment of SOVAT: A numerical-spatial decision IEEE Computer Society.
support system for community health assess-
Wan, T., Zeitouni, K., & Meng, X. (2007). An
ment research. International Journal of Medical
OLAP system for network-constrained moving
Informatics, 75(10-11), 771–784. doi:10.1016/j.
objects. In Y. Cho, R. Wainwright, H. Haddad, S.
ijmedinf.2005.10.008
Shin, & Y. Koo (Eds.), Proceedings of the ACM
Shekar, S., Lu, C., Tan, X., Chawla, S., & Vat- symposium on Applied computing (pp. 13-18).
savai, R. (2001). Map cube: A visualization tool New York: ACM Press.
for spatial data warehouses. In H. Miller & J. Han
Weibel, R., & Dutton, G. (2001). Generalizing
(Eds.), Geographic data mining and knowledge
spatial data and dealing with multiple representa-
discovery (pp. 74-109). London: Taylor & Francis.
tions. In P. Longley, M. Goodchild, D. Maguire,
& D. Rhind (Eds.), Geographic Information
systems and science (pp. 125-155). New York:
John Wiley & Sons.

106
On Modeling and Analysis of Multidimensional Geographic Databases

KEY TERMS AND DEFINITIONS Spatial Decision Support System: Interac-


tive, computer-based system designed to support
Geographic Information System: Informa- a user or group of users in achieving a higher
tion system for capturing, storing, analyzing, effectiveness of decision making while solving
managing and presenting data that are spatially a semi-structured spatial problem.
referenced. Spatial OLAP: Visual platform built espe-
Multidimensional Model: Conceptual model cially to support rapid and easy spatio-temporal
for the multidimensional analysis of huge datasets, analysis and exploration of data following a mul-
based on the concepts of dimensions, facts and tidimensional approach comprised of aggregation
measures. levels available in cartographic displays as well
Spatial Data Warehouse: Subject-oriented, as in tabular and diagram displays.
non volatile, time variant and integrated reposi-
tory of spatial data that is designed to facilitate
reporting and spatial analysis.

This work was previously published in Data Warehousing Design and Advanced Engineering Applications: Methods for Com-
plex Construction, edited by Ladjel Bellatreche, pp. 96-112, copyright 2010 by Information Science Reference (an imprint of
IGI Global).

107
108

Chapter 10
Technology and the
Multipolar Global Economy:
Implications for European Competitiveness

Steven McGuire
Aberystwyth University, UK

ABSTRACT
The European Union has engaged in several attempts to increase the level of innovation in member
states – largely without success. This chapter argues that a combination of global and regional issues
explain this failure. What was once a process dominated by Europe and America, innovation is “going
global,” with many countries now developing national innovation systems. European firms are conduct-
ing more of their research in emerging markets, and in Europe, firms seem unable to raise their share of
overall research and development spending. The Commission, for its part, lacks the necessary regulatory
and legislative powers to forge a new regional innovation system and is instead reliant on a policy mix
of overarching objectives, some directed funding, and comparative analysis of member state policies.

INTRODUCTION based capitalist system remains the best way to


spur economic growth and raise living standards.
The globalization of production that developed in However, in the United States and the European
the latter parts of the last century is now augmented Union there is increasing anxiety that the disper-
by the globalization of innovation. This process sion of innovative capabilities represents, if not a
is at once welcome and potentially destabilizing. zero-sum competition between economies, then
It is welcome in the sense that it represents a at least an episode of relative economic decline.
victory for the western liberal economic tradi- While the Kindle is sold by Amazon (an Ameri-
tion advocated by the United States and Europe: can company) and marketed as an example of
that a comparatively open, predictable and rule- American ingenuity, the product itself could not
be manufactured in the US. This is not because of
DOI: 10.4018/978-1-4666-2038-4.ch010

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Technology and the Multipolar Global Economy

cost; rather, the problem is that the US economy no devoted money to research, rather than leave new
longer has the technological capacity to produce product development to opportunistic acquisition.
the item (Munro, 2010). Aside from the ink the key American universities were also well funded by
manufactured elements of the Kindle rely on Asian a variety of sources – federal and state govern-
manufacturing firms. For now, the US dominates ment, students and philanthropy – and played a
the design process, but extensive academic work crucial role in the ascent of the US to dominance
on innovation suggests that design and production of innovation in the 20th century. Both Europe
are, in the medium term, tightly interwoven. The and America adopted what has been described as
tacit knowledge acquired in the production process the ‘linear’ model, which presumed a relatively
eventually underpins new innovations at the design straightforward process of discovery, develop-
phase. Europe shares many of the anxieties seen ment and sales, with many held within the firm,
in the United States. The European Commission or within its control (Chesbrough, 2003). As the
has made several high-profile efforts to address the name suggests, the model envisaged a more or
perceived decline in European competitiveness. less linear relationship between basic research
The Lisbon Agenda, launched in the early 2000’s conducted at universities, leading to applied work
to much fanfare, was effectively abandoned late in at corporate research and development centres
the last decade and the Commission attempted to leading to successful commercialisation.
relaunch a European drive to increase the region’s So successful were Europe and America that
position in high technology industries through follower states developed their own innovation
the Competitiveness and Innovation Framework systems, borrowing and adapting European and
Program (CIP). The programme had three ele- American practice as needed. Japan of the 1980s
ments: first, support for entrepreneurs; second, a was the most successful follower state but the
set of policies designed to generate greater use of new millennium has seen the field of top-quality
information technology in European firms; and, innovating nations become much more crowded.
finally, an initiative to promote innovation in China is merely the most obvious gainer, with
sustainable energy supplies. In 2010 a new effort, Singapore, India and Taiwan other states that have
Europe 2020, was launched, featuring a broader developed top quality innovation systems. Just as
assault on the perceived failings of Europe in the production was globalised in the last century, it
areas of innovation, new business creation, tertiary seems likely that the 21st century will see the true
education and job creation. globalization of knowledge, as many economies
Innovation is often envisaged as the lone in- develop the capabilities to innovate at the very
ventor working in his laboratory, hoping for that frontier of a variety of technologies – and are
crucial breakthrough. Though serendipity can be simultaneously dependent on knowledge cre-
important, the key to the success of Europe and ated elsewhere. This ‘networked’ technological
American innovation was that both actors cre- system poses challenges to state policies oriented
ated systems, comprised of firms, government around keeping innovative activities confined to
and other stakeholders such as universities and the home base.
research institutes. European universities – par- In both Europe and America, this development
ticularly German institutions – led the world in has been met with apprehension and even hostility
the developing institutions devoted to scientific in some quarters. This is because the high-skill,
research, and linked with German firms, who ‘knowledge’ economy was supposed to be a safe
were among the first to develop in-house R&D haven for developed economies faced with the
facilities (Dosi, et al, 2005:21). American firms loss of manufacturing jobs. Public policy in both
were also early adopters of a model where firms economies has focused on the need to enhance the

109
Technology and the Multipolar Global Economy

innovative capacity of the economy, via improved American standards. However the trajectory
education, government support for research and and pace of China’s advance, to say nothing of
development and increased use of innovation by renewed efforts in Singapore and Malaysia and
firms for competitive advantage. The unstated as- the expansion of India is impressive. The notion,
sumption was that emerging economies like China, popularized by numerous politicians, that Europe
though they were good at cheap manufacturing, and America could find an ‘economic haven’
would not be able to duplicate European and among high-skilled, high technology sectors is
American efforts in technology intensive sectors. proving gravely mistaken.
Though there is evidence that a significant Exactly why this diffusion is happening is
skills premium remains, the assumptions about not widely considered in the literature. A 2007
the security of even graduate-level, managerial European Commission document suggested that
and technical are proving misguided. The key this process has resulted from ‘the growing S&T
distinction insofar as jobs are concerned is not [science and technology] capacity of China and
between knowledge workers and manual labour, India and the expanding global knowledge pro-
but between jobs that can be traded (even if elec- duction chains inside MNEs’ (2007: 45). Athreye
tronically) and those that are non-tradable (Blinder, and Cantwell (2008) similarly highlight the role of
2005). To the extent that emerging economies internationalized supply chains and the diffusion
develop talents in tradable, highly skilled goods of more knowledge-intensive functions within
and services, they will compete directly with these supply networks. However, this explanation
European and American scientists, engineers, as merely highlights the process; why did developed
well as management consultants and accountants. state firms feel able to diffuse these functions in
China is dramatically increasing its innovative the first place? The political economy literature
capacity, with increased research and develop- of the late 1990s and 2000s, far from envisaging
ing spending actually increasing faster than the the diffusion of innovation, fretted that the inno-
overall rate of economic growth (OECD, 2006a). vation ladder was being pulled up from develop-
Though a note of caution must always apply to ing states. A particular target for criticism in this
Chinese statistics, China’s overall effort in R&D, regard was the WTO’s Trade Related Aspects of
gross expenditure on research and development Intellectual Property (TRIPs) (Sell, 1996). The
(GERD) is, at 1.31 per cent of GDP, not far off world envisaged in this literature was one where
the European average (Zhou and Leydesdorff, developing states remained trapped in low skilled,
2006: 84). China is now the 5th largest patent- low value-added activities. Why this expectation
ing nation on Earth, though with a share barely has been confounded, not least in China but also
a quarter of Japan and America’s (WTO, 2006). India, Brazil and other emerging states, needs
China has put particular emphasis on nanotechnol- consideration.
ogy, thus emulating several states in viewing this
technology as a key driver of future innovative Multilateral Trade, R&D, and
activity. China is now second only to the US in the Diffusion of Innovation
terms of single country production of published
scientific research on nanotechnology (Zhou and The globalization of innovation is now clearly
Leydesdorff, 2006: 94). The overall quality of underway, though like other aspects of global-
China’s innovation infrastructure can be called into ization the density and reach of this process is
question. Though money is pouring into Chinese uneven. Just as Richard Florida’s (2006) work on
universities, laboratory equipment and academic economic growth has identified patterns of both
training remain relatively poor by European and flatness and spikiness in the global economy, so too

110
Technology and the Multipolar Global Economy

for research and development. This globalization 2002). As both Evenett and Kang note, WTO
has also been accompanied by an increase in the disciplines on R&D subsidies are comparatively
number of states actively engaged in technology weak, whereas intellectual property provisions
policies in support of emergent industries. These under TRIPs are comparatively strong (Evenett,
policies involve an array of research and develop- 2009; Kang 2000). This partly explained by the
ment initiatives designed to both tempt foreign desire of developed world (or OECD) firms
firms to locate R&D facilities in the locality and pressing their governments to gain multilateral
to enhance the innovative capabilities of domestic protection for their existing stock of intellectual
firms. Trade liberalization under the WTO has property. TRIPs has been controversial because
facilitated this, but not in a straightforward man- of the unfairness implicit in the protection of
ner. Trade liberalization has two potential effects developed world knowledge, thus restricting the
on firm incentives to undertake R&D: first, it can ability of developing states to catch up. Recent
create positive incentives to invest, as liberaliza- experience has, however, called this into question,
tion expands the market (scale effect) but: second, as Asian countries have demonstrated an ability
it can dampen incentives if new competitors are to develop a range of emergent high technology
created (competition effect) (Teteryaknikova, sectors. Forero-Pineda (2006), for example, argued
2008). As with any other subsidy, research and that TRIPs has increased incentives to pursue
development incentives risk being captured by niche technology policies in several developing
organized producer interests (Brou et al., 2009), states. Similarly, Hu and Matthews (2008) note
but they can also address the market failures as- that innovative activity in China soared after that
sociated with risky investments, and in so doing be country’s accession to the WTO in 2001 (Figure
welfare-enhancing. Thus, multilateral disciplines 1). They argue that foreign firms had, in effect,
on subsidies need to accommodate two demands been waiting until China joined the WTO before
that are in tension with each other. Unsurprizingly investing in higher value-added activities there.
then the Subsidies and Countervailing Measures The SCM Code, in contrast to TRIPs, has not
(SCM) agreement in the WTO is comparatively proved problematic for emerging economies, aside
complex and, when first negotiated, sought to from a few high-profile cases such as Embraer’s
accommodate a range of permissible subsidies subsidization by the Brazilian government (Gold-
– as well as outlawing export subsidy (McGuire, stein and McGuire, 2004). The language of the

Figure 1. China’s patenting activity in the USPTO, 1991-2005 (Source: Hu and Matthews, (2008))

111
Technology and the Multipolar Global Economy

SCM – what Evenett refers to as its incomplete- Jean-Jacques Servan-Schreiber articulated


ness – offers considerable scope for states to European fears about American domination of the
develop national programs under the aegis of global economy over forty years ago, yet his work
environmental technologies or regional eco- speaks to current anxieties about Europe’s ability
nomic development. During the Uruguay Round to compete. Writing in the 1960s, he observed the
negotiations, the United States succeeded in per- fantastic growth of US multinationals and feared
suading other states of the merits of a specific that Europe’s future was to be a mere branch
research and development exemption (ie green plant for American multinationals, who had com-
light) under the SCM code (GSI, no date). The pletely supplanted European firms across sectors
recent economic crisis has merely added to the (Servan-Schreiber, 1968). Indeed, for pessimists,
view that the SCM is not a significant obstacle to the European Union has missed not one but two
governments seeking to promote or protect in- major waves of innovation: the 1980s innovation
dustries. As Falke (2010) notes, the US bail-out of computers and related industries; and the 1990s
for the automobile sector was the most costly service-based innovation that built on the earlier
subsidy in US corporate history. Europe did not wave. The figures on comparative expenditure on
retaliate, in large part because of several Euro- R&D paint a depressing picture. Overall European
pean states went ahead and introduced similar Union expenditure on research has remained stable
subsidy measures designed to preserve their auto at just under 2 per cent of GDP for several years, in
sectors. The European Union essentially sus- spite of the Lisbon Agenda and a number of other
pended its anti-subsidy enforcement activities, EU-level initiatives designed to improve this per-
fearing a political storm should the Commission formance. Though expenditures were also stable
be seen as sacrificing member state economies to in the US, America spends considerably more on
WTO-inspired regulations (Evenett, 2009). The innovation (2.6 per cent of GDP) and has done so
wide scope of the Subsidies Code has been a boon for many years (Eurostat, 2008). In an updated
to nations like India, which have been able to analysis performed in the 2010, the Commission
pursue national innovation policies in key areas noted that both the United States and Japan had
such as pharmaceuticals and biotechnology (Ra- maintained their lead over the European Union
mani, 2002). in a range of innovation indicators, most notably
in the area of business-level R&D expenditure
European Innovation Policy1 (European Commission, 2011).
Of course, a note of caution is in order: some
European innovation policy is complicated by European states are very competitive in the new
the simple fact that the EU is not a state, but technology and service economies. Sweden and
a regional economic integration project, and Finland, for example, regularly feature among
so policymaking is that much more complex. the top in any ranking of competitive economies.
Moreover, in contrast to other areas of policy, the At one level then Europe’s ‘problem’ is one of
Commission has relatively few instruments at its uneven performance, where lagging member
disposal, and so relies to a much greater extent on states tend to create the perception that all EU
the willingness of member states to cooperate with members have innovation gaps. In virtually no
initiatives. The result has been a comparatively other policy area is there as wide a dispersion
disappointing performance, particularly in light in performance between the best and worst EU
of the success of technology programs in several member state as in innovation policy (Archibugi
emerging economies such as Korea and China. and Coco 2005: 443).
Nonetheless, the relative failure is not new.

112
Technology and the Multipolar Global Economy

The EU lags both in total economic resources ous sectors. These firms, usually created by the
devoted to innovation, and particular in business- government-sponsored merger of smaller firms,
level research and development expenditure. The would enjoy a dominant position in a given na-
percentage of R&D funded by business in the EU- tional market and thus reap economies of scale
27 is 55%, compared with 64% in the US and 75% benefits. However, the protection afforded these
in Japan (European Commission, 2007). Its share firms gave the wrong series of incentives; the
of world patenting appears to have stalled. Though policy produced inefficient, non-innovative firms
the situation is improving, European firms show a which were unable to cope with the dynamism of
lower propensity to use IT to increase productivity Japanese or American competitors. Outside the
than American firms. European universities are aerospace sector (and perhaps nuclear power) the
poorly funded by American standards, though in government-led national champion policies were
fact they employ a higher percentage of research- notable for failure rather than success. Policy fail-
ers (37%) than the United States 15% (European ure highlighted the need to expose European firms
Commission, 2007). A National Science Founda- to greater competition; this could be achieved in
tion study of immigrant scientists and engineers the first instance by creating a proper European
noted at strong belief among European emigrants market. The Single Market Programme was the
that the US offered better research infrastructure liberal response to the poor competitiveness of
and job opportunities (NSF, 2007a). As a study for European firms. Creation of the unified market
the European Commission noted, the problem is would allow European firms to reap the same scale
‘that Europe has structural lags in top level science economies that the American market provided to
and innovative performance vis-à-vis the US…’ US firms decades earlier. For reasons which are
(Dosi et al. 2005:1, emphasis added). On a range not yet clear, Europe’s innovative capacity did
of university-related indicators of innovation, not accelerate after the SMP: indeed as indicated
such as the numbers of PhDs awarded and citation above, it deteriorated.
rates of scientific publications, the United States
has maintained it lead (European Commission, The Failure of the Lisbon Strategy
2011). The problem is not simply one of com-
mercialization – where excellent science work is The Lisbon Summit in 2000 committed EU
left underexploited by firms – but rather one of member states and the Commission to making
weakness in both basic research and its applica- Europe, ‘the most competitive and dynamic
tion to the creation of goods and services (Dosi et. knowledge-based economy in the world by 2010’.
al, 2006). Though European universities account The declaration was essentially an admission that
for a significant percentage (about 22 percent) of efforts in the 1990s had either failed or needed
all research and development expenditures – and to be intensified and refocused. Two measures of
produce significant numbers of technicians and economic performance were cited by European
engineers – this does not translate into innovative policymakers in defense of new strategies: the
success for firms (OECD, 2006b: 209). deterioration of per capita GDP growth relative
Europe has been here before, with the superior to the US; and the stagnation of European pro-
performance of other economies provoking con- ductivity. European per capita GDP relative to
cern about European competitiveness. European the United States climbed steadily in the years
policymakers fastened on the scale advantages following the Second World War, peaking in the
offered by the large American market, and the 1970s at about three quarters of US levels. The
role of an interventionist state in Japan, to argue trend, if anything, has been downward and in
for the creation of national champions in vari- 2006, the gap was a worrying large 32 per cent

113
Technology and the Multipolar Global Economy

(Fassbender, 2007). Some eight years after the contrast, some Europeans see in the competitive-
launch of the Lisbon strategy, R&D intensity in the ness agenda an effort to unpick the social market
European Union has stagnated at approximately protections that citizens have enjoyed for decades.
1.84% of GDP (EU, 2009) http://eur-lex.europa. It does not follow that the need for creative de-
eu/LexUriServ/LexUriServ.do?uri=COM:2009: struction requires a small state and low taxes; in-
0558:FIN:EN:PDF#page=2. deed, government activity and public expenditure
The second problem, in a sense, gave rise to the might be necessary for innovation in many fields.
first. EU labor productivity growth hardly moved There is thus dispute over the orientation of in-
during the 1990s, whilst in the same period, the US novation policy. Japan and the United States are
was undergoing a so-called productivity miracle both highly innovative and successful economies
(Daveri, 2002). Exactly why Europe lagged behind with considerable numbers of competitive firms.
has not been adequately answered, though various Yet, both operate very different mechanisms for
attempts had been made. One suggestion was that generating this innovation. Even within Europe,
the still incomplete Single Market process was to Nordic countries are reasonably successful at rec-
blame; as long as firms could exist in relatively onciling high taxation with appropriate incentives
uncompetitive national markets, they lacked the for firms to invest and innovate. Some European
incentive to increase their investment in produc- states, including the UK, Ireland and Hungary,
tivity enhancing processes and assets (McGuire, are very reliant on foreign firms for research and
2006). Another explanation drew on social and development expenditures, whilst others, such as
cultural attributes, with the observation that Eu- Germany, retain significant national capabilities
ropeans produce less than Americans because of (OECD, 2006b). In short, very real and impor-
a rational decision to devote more time to leisure tant elements of variation of technology policies
(Fassbender, 2007). A more recent study has like- exist, and European policymakers have found it
wise suggested that, at least in certain parts of the difficult to articulate a clear and cohesive strategy
EU, demand for innovative products and services for Europe in the face of conflicting evidence
remains low, with entrepreneurs thus unwilling and societal pressures. In a clear admission that
to risk entering the market (EU Yearbook, 2009). Europe is not a ‘single innovation space’ the Com-
One of the aims of the CIP programme is to ad- mission dramatically revised its methodology for
dress this by making early stage finance more analyzing member state performance in 2010.
available, thus overcoming what appears to be a The result of the Innovation Union Scoreboard
market failure. was that there are four distinct groups or clusters
There is a view that the demands of innova- of countries within the EU. Denmark, Finland,
tion policy in the 21st century are increasingly Germany and Sweden are ‘Innovation Leaders’
in conflict with entrenched political and social with internationally competitive innovation sys-
practices in the European Union. Throughout the tems and performance. The next group, Innovation
world, innovation policies emphasize the process Followers, have strong innovative capabilities
of ‘creative destruction’ where older firms and but without the outstanding performance of the
industries are swept and as labor and capital are leading group. The next two groups, ‘Moderate
reallocated to new activity (Schumpeter, 1942). Innovators’ and ’Modest Innovators’ are mainly
Advocates of the Schumpeterian perspective see small and newly-acceded member states without
the globalization as the very essence of creative significant innovative capabilities (European
destruction. Competitiveness policies ought to Commission, 2011)
assist in the process by encouraging new indus- One difficulty the EU has is the multilayered
tries and allowing other sectors to wither away. In nature of governance, which makes the develop-

114
Technology and the Multipolar Global Economy

ment and implementation of innovation policies Whilst retaining some of the key performance
difficult and contested. In the innovation policy indicators of the Lisbon process, including a
literature, the EU proves troublesome for analysts. Union-wide target for R&S spending, the 2020
The EU is distinct in that it is not a national sys- process concentrates more on enabling processes
tem of innovation, yet neither is it a fully-fledged such as human capital formation and on employ-
regional system (Kuhlmann, 2001). In innovation ment generally. A more sector-based approach
management, as with other areas, EU policies to innovation is also taken, with the Commis-
and institutions must be understood as a complex sion seeking to capitalize on existing European
negotiated order among national, supranational strengths in low-carbon energy technologies
and local actors (Smith and Elgstrom, 2000). The (European Commission 2011b)
Commission’s involvement has been governed
by the principle of subsidiarity: that the role of The Direction of European R&D
Brussels was to augment and catalyze efforts at
the regional or nation state level (Kuhlmann, Evolutionary perspectives on innovation draw
2001:963). In sum, in marked contrast to issue attention to the uncertainty surrounding innova-
areas like trade and competition policies, the tion (McGuire, 2006). The innovative process is
Commission enjoys relatively little power or au- uncertain (that does not mean it has no organi-
tonomy. The Commission lacks effective tools to zational logic) and characterized by incremental
force through policy changes, even if agreement improvements and developments that are, from
could be generated on the nature of the problem time to time, interrupted by dramatic, discontinu-
and the response. This has left the Commission ous change. Schumpeter referred to a ‘process of
in the position of advocating networking and industrial mutation…that incessantly revolution-
mobility among research institutes and firms, not izes the economic structure from within.’ (Drejer,
necessarily because networking and mobility do 2004: 556). Schumpeter’s early interest in indi-
much to aid innovation, but because such limited viduals as inventors and entrepreneurs gave way
policy tools are about the only ones that the Com- in time to a recognition that creative destruction
mission has (Dosi, et al. 2006). could – and was – carried out in larger organiza-
EU R&D policy thus continues to operate at tions. Mowery and Rosenberg (1989) described
the margins of influence. This limits its ability to the American economy of the late 19th and early
develop and enforce policy directions and instead 20th centuries as a period where, in a somewhat
makes it prone to backsliding. The Kok Report paradoxical way, innovation was made routine by
observed that European states cannot develop solu- institutionalizing activities underpinning inven-
tions in isolation – and only co-ordinated action tion. These included the spread of secondary and,
will benefit Europe. Yet in a 2007 working paper, later, higher education, as well as the creation
Commission officials conceded that member states of in-house research and development units in
focus their innovation strategies on domestic American firms. Thus, the innovative process
concerns and work most closely with domestic may be uncertain and non-linear, but it is not
constituents. UNICE bemoaned the inability of random. Government policies should be able to
member states to meet their Lisbon commitments, provide incentives (where needed) to encourage
noting that the temptation to use public funds for the appropriate conditions for innovation.
objectives other than enhancing competitiveness If innovation is as non-linear as evolution-
seemed hard to resist (UNICE, 2006). ary perspectives suggest, then targeted national
In 2010, the Commission announced the re- champion policies are unwise for a different
placement for the Lisbon Strategy: Europe 2020. reason: since a state cannot be certain where and

115
Technology and the Multipolar Global Economy

how innovations will arise, the best policy is to R&D expenditure (Potocnik, 2007). An additional
encourage innovative activity across a range of obstacle was the tension between allocating re-
sectors. Moreover, a key benefit of innovation lies search money on the basis of the excellence of
in its diffusion through the economy to medium the proposed research, and awarding money to
and low technology sectors. This diffusion ought less technologically capable partners in poorer
to make less technologically advanced sectors member states. Plans for the European Research
more productive (Navarro, 2003: 12; Patel 2007). Council to centralize and co-ordinate long term
Thus, targeted policy that comes at the expense basic research were developed and accepted in
of basic, blue-skies research seems problematic 2006. In a departure from previous practice, ‘sci-
(Brusconi and Guena, 2003; Dosi, Llaverena and entific merit’ would be the overriding criterion for
Labini, 2005) Some evidence in support of an awarding funding (European Commission, 2004c:
emphasis on supporting wide ranging research 21). There is a persistent tension between cohe-
comes from a comparison of the early develop- sion and innovation underpinnings of EU R&D
ment of biotechnology in the United States and policy (Sharp, 1998). The former implied the need
Germany. Both countries invested considerable to direct R&D support to poorer member states
sums in this new technology, with German gov- even if this was unlikely to produce cutting-edge
ernment support dating from the 1960s. Yet, by research; the latter suggested that the best ideas
1997, the United States biotechnology sector was needed to win out in a process of competition if
larger by a factor of ten (Giesecke, 2000: 207). EU science was to progress. By 2004, the Kok
The crucial factor is that diffuse US support for Report, a high-level review of EU innovation
basic research allowed the organic development policy chaired by former Dutch prime minister
of the sector. German efforts, by contrast, were Wim Kok, offered indications that the latter view
hampered by the dominant pharmaceutical firms, was prevailing (European Commission 2004c).
which resisted greater competition and weak This plan was generally welcomed in member
incentives in the university system to transform states with strong university systems that could be
basic research into patentable innovations (ibid). expected to win the greatest share of money, but
By and large the Commission’s pronounce- other actors preferred to retain a ‘developmental’
ments on the trajectory of research and develop- aspect to EU-level research policies.
ment policy reflect an evolutionary economics All that said governments and the Commis-
perspective, stressing the need for basic research. sion are only partly responsible for the problem:
Central to the Lisbon Agenda is raising the amount European businesses invest less in innovation
spent on R&D by Member States to 3 per cent than competitors. Business R&D investment –
of GDP, though as some academics note, most of particularly in the major European economies
this increase is to come from industry (Archibugi – remains low compared to the United States
and Coco, 2005). By 2004, there were several and Japan. In Europe, businesses conduct 64 per
states spending at this level, notably the Nordic cent of R&D expenditures, whereas in Asia and
states, Germany and France. The Commission’s the US the figure is 70 per cent (NSF 2007d).
own funding streams, principally the Framework European firms are more reliant on public funds
Programs and Structural Funds, were also tilted for innovative activities – and this leaves them
toward basic research and designed to catalyze more vulnerable to political processes that favor
innovation. However, these efforts are small in expenditure in other areas. European firms make
comparative terms: the € 10.5 billion of Structural less use of technicians and scientists than American
Funds earmarked for innovation from 2000-2006 or Japanese companies. Corporate research staffs
were roughly equivalent to Sweden’s annual represent 80 per cent of all researchers in the US,

116
Technology and the Multipolar Global Economy

while the comparable figure in the EU (25) is 49 States and Japan of about $7 billion (European
per cent (OECD, 2006b: 36). What is the problem Commission 2007). Though innovative activities
with lagging business investment? In the United are diffusing throughout the globe, Europe’s rela-
States, high-technology industries, which are the tive attractiveness to foreign firms is declining.
main spenders on research and development, are
a larger part of the economy (European Commis- A Global Innovation Space?
sion 2007c). For some, the problem remains a lack
of pump-priming of the small and medium sized For most of the past 100 years, innovation was
business sector, the sector seen as a key driver of dominated by the Triad of Europe, the United
new technologies. The Kok Report, called for the States and Japan. The application of science and
creation of new tax incentives for SMEs to engage technology to economic activity first arose in
in research and development (2004c). By 2010, German and American firms early in the 20th
there is precious little evidence that this had any century and was part of the set of characteristics
effect (European Commission 2011a) that allowed the transatlantic economy to domi-
Another fear, however, is that EU-based firms nate the global system for most of the past 200
are expending less of their own funds because years. Though governments devote considerable
they are spending more money in foreign markets. resources to capturing the benefits of innovation
The diffusion of innovative activities has been for their own firms through a variety of policies,
noted in the academic literature but in recent in fact innovative activity does tend to diffuse
years has made an appearance in public debates. across borders. Firms not only create their own
As countries like India and China develop, there intellectual property, but also scan the competitive
is a strong business case for relocating research environment for knowledge that might yield a com-
and development activities closer to market. Ger- petitive advantage. Firms were diversifying their
man multinationals illustrate the trend in R&D innovation activities some years before it became
offshoring. By 2007, nearly 20 per cent of all commonplace to talk about the globalization of
German corporate R&D was conducted outside research. American firms invested abroad earlier
of Germany, more than twice the figure of two than many foreign competitors, with much of this
decades earlier (DB Research, 2011). Though investment going to Europe. For much of the past
most of this offshoring went to companies located twenty years, the ‘globalization’of innovation was,
in neighboring EU states or the United States, an in fact, the regionalization of research and devel-
unmistakable trend in using emerging countries opment among the US, the EU and Japan. These
was also notable. Fully 29 per cent of German three actors accounted for the bulk of global R&D
patents filed during 2008-2008 had at least one and when the research was conducted outside the
emerging market co-filer (DB Research, figure home country, it was located in one of the other
5). The assumption of the linear model of innova- two economies. In 2002, US firms placed 61 per
tion – that innovative activities are centralized the cent of their foreign research and development
core and product produced in the peripheries – is investment in the European Union, worth about
being severely undermined by corporate activity. $15 billion (OECD, 2006b:123). Over half of
As the EU notes, the US has increased its share this investment went to two sectors: automobiles
of foreign affiliates’’ spending on R&D, to a large and pharmaceuticals. The United States was a net
extent by taking a greater share of research from exporter of research and development activities
Europe (EU, 2005: 36). Between 1997-2003, to Europe until the turn of the millennium, when
the EU ran an ‘R&D deficit’ – where net R&D European firms expanded their activities in the US.
outflows exceeded inflows – with the United Trade in ideas, represented by flows of royalties,

117
Technology and the Multipolar Global Economy

fees and license income from industrial processes, this had dipped to 58 per cent (NSF, 2007b, Figure
was remarkably balanced, with the US generally 22). Though American firms did increase their
running only a modest surplus on its trade with R&D expenditures in Europe during the 1998-
Europe (NSF, 2007d). 2002, the rate of growth (4.8%) was dwarfed by
The US was, by the early 2000s, the favored the rate of growth of American R&D investment
location for foreign OECD-based multinationals, in Asia (28.6%) (NSF, 2007b). It is not hard to
many keen to capitalize on US leadership of the in- understand the interest of American firms in this
formation technology sector. The US attracted the regard. China has produced a stunning increase
bulk of all foreign R&D activity conducted in the in its innovative capacity, with R&D intensity
OECD (42 per cent). During the 1980s and 1990s (research spending as a percentage of GDP) ris-
was the degree to which foreign firms began to ing from 0.6 per cent in 1995 to 1.3 per cent in
locate their research facilities in the United States, 2006. China’s pool of researchers grew 77 per cent
perhaps to take advantage of basic research done at to 926,000 in the same period (OECD, 2006a).
US universities. The percentage of patents granted Smaller developing states are also accounting
to foreign firms was about 45 per cent by the mid- for more innovative activity, with the share of
1990s (Mowery, 1998). Access to the American global patenting accounted for by Brazil, China,
science base was particularly important for sectors India and South Africa increased from 0.15 per
like pharmaceuticals, with European firms com- cent in 1991 to 0.58 per cent in 2004 (OECD,
ing to rely on their US operations for much of the 2006a). A small part of the total to be sure, but
basic research work (Brusoni and Geuna, 2003). a dramatic increase and an indication of creative
Indeed, fully half of EU research and develop- talent that has not gone unnoticed by European
ment investment in the US was concentrated in and American multinationals. What is good for
chemicals and pharmaceutical sectors (OECD, firms, and indeed the overall economy, may not
2006b: 123). Indeed, in the pharmaceutical sector, be good for individual European and American
it is not an exaggeration to describe US-EU links workers however, hence the keen political interest
as a fully-fledged transatlantic innovation area. in the globalization and relocation of research and
Such corporate cooperation was underpinned by development activities. Innovation policy may yet
broad congruence between the EU and the US on come to have a political dynamic like that in trade
various regulatory matters, not least contract law policy: the diffuse benefits of open trade attract
and intellectual property. little attention compared to the concentrated losses
Innovation is now even more globalised and, in industries. Consumers gain from innovation,
as outlined above, Europe and America are both wherever it is created, but skilled workers are
experiencing a relative decline in their dominance not as mobile as their jobs and fear losing out to
of the area. China has embarked on a remarkably competitors. Both Europe and America are strug-
successful process of technological upgrading, gling to devise policy responses that accommodate
allow the country to act not merely as a host for these conflicting pressures.
foreign R&D activity but to generate significant
amounts of indigenously-created research (Fu and
Gong 2011). It is notable that Europe is becom- CONCLUSION
ing relatively less important as an R&D hub for
American firms, which are increasing the amount Europe is again in the midst of a crisis of confi-
of research work in Asia. In 1995, Europe ac- dence about its ability to compete in knowledge-
counted for 70 per cent off all US firms’ overseas intensive sectors of the globalized economy. Some
research and development expenditures: by 2002, perspective needs to be kept: Germany and the

118
Technology and the Multipolar Global Economy

Nordic states enjoy strong innovation systems, and Chesbrough, H. (2003). The era of open innova-
the UK enjoys a significant exporter of ideas and tion. MIT Sloan Review, 44(3), 35–41.
basic research. However, Europe’s market share
Daveri, F. (2002). The new economy in Europe,
in high technology industries is under pressure,
1992-2001. Oxford Review of Economic Policy,
and the continent is generally over-reliant on
18(3), 345–362. doi:10.1093/oxrep/18.3.345
a comparatively small set of very competitive
companies to maintain this market share. There Deutsche Bank. (2011). International division
is a paucity of smaller firms with a significant of labor in R&D: Research follows production.
R&D presence. Deutsche Bank Economics Paper 82, February
European angst about this situation may yet 2011.
turn to something more troublesome. European
Dosi, G., Llerena, P., & Labini, M. S. (2005).
and American expectations and experiences were
Evaluating and comparing the innovation per-
crucial in shaping the international innovative
formance of the United States and the European
environment that allowed states such as Korea,
Union. TrendChart Policy Workshop.
China and India to develop significant research
and development programs. Thus far, charges Drejer, I. (2004). Identifying innovation in sur-
of neomercantilism remain muted, but that may veys of services: A Schumpeterian perspective.
change. Germany and Spain, for example, are Research Policy, 33, 551–562. doi:10.1016/j.
two European states with a significant stake in respol.2003.07.004
both solar and wind energy generation. Should
European Commission. (2007). The European
the perception grow that China has acted unfairly
research area: New perspectives. Commission
in developing its own industries, the resultant
Staff Working Document SEC (2007) 412/2.
trade conflict could undermine an already fragile
multilateral system. European Commission. (2009). Annual report
on research and development. Retrieved May 3,
2010, from http://eur-lex.europa.eu/LexUriServ/
REFERENCES LexUriServ.do?uri=COM:2009:0558:FIN:EN:P
DF#page=2
Archibugi, D., & Coco, A. (2005). Is Europe
becoming the most dynamic knowledge econ- European Commission. (2011a). Innovation union
omy in the world? Journal of Common Market scoreboard (IUS) 2010. Brussels, Belgium: Di-
Studies, 43(3), 433–459. doi:10.1111/j.0021- rectorate for Enterprise and Industry.
9886.2005.00564.x European Commission. (2011b). Europe 2020.
Athreye, S., & Cantwell, J. (2007). Creating Retrieved July 24, 2011, from http://ec.europa.
competition? Globalization and the creation of eu/europe2020/index_en.htm
new technology producers. Research Policy, 36, Evenett, S. (2009). What can be learned from
209–226. doi:10.1016/j.respol.2006.11.002 crisis-era protectionism? An initial assessment.
Brou, D., Campanella, E., & Ruta, M. (2009, 25 Business and Politics, 11(3), article 4.
November). The value of domestic subsidy rules Falke, A. (2010). Much ado about nothing:
in trade agreements. (Staff Working Paper ESRD- Obama’s trade policy after year one. Unpublished
2009-12). World Trade Organization. manuscript, Freidrich-Alexander Universitiet,
Erlangen-Nuermberg, January 2010.

119
Technology and the Multipolar Global Economy

Fassbender, H. (2007). Europe’s productivity McGuire, S. (2002). Between pragmatism and


challenge. The McKinsey Quarterly, 2. principle. The International Trade Journal, 16(3).
Florida, R. (2002). The rise of the creative class. McGuire, S. (2006). No more Euro-champions:
New York, NY: Basic Books. The interaction of EU industrial and trade poli-
cies. Journal of European Public Policy, 13(6),
Forero-Pineda, C. (2006). The impact of stronger
887–905. doi:10.1080/13501760600838573
intellectual property rights on science and tech-
nology in developing countries. Research Policy, McGuire, S., & Smith, M. (2008). The European
35, 808–824. doi:10.1016/j.respol.2006.04.003 Union and the United States: Competition and
convergence in the global arena. Basingstoke,
Fu, X., & Gong, Y. (2011). Indigenous and
UK: Palgrave.
foreign innovation efforts and drivers of techno-
logical upgrading: Evidence from China. World Munro, M. (2010). Amazon’s Kindle: Symbol of
Development, 39(7), 1213–1225. doi:10.1016/j. America’s decline? The Brookings Institution,
worlddev.2010.05.010 February 25 201. Retrieved July 24, 2011, from
http://www.brookings.edu/opinions/2010/0225_
Giesecke, S. (2000). The contrasting roles of gov-
tech_muro.aspx
ernment in the development of biotechnology in-
dustry in the US and Germany. Research Policy, 29, Ramani, S. (2002). Who’s interested in biotech?
205–223. doi:10.1016/S0048-7333(99)00061-X R&D strategies, knowledge base and market sales
of Indian biotechnology firms. Research Policy,
Global Subsidies Initiative. (n.d.). The WTO and
31, 381–398. doi:10.1016/S0048-7333(01)00115-9
subsidies. International Institute for Sustainable
Development. Retrieved May 17, 2010, from Schumpeter, J. (1942). Capitalism, socialism and
http://www.globalsubsidies.org/en/media-portal/ democracy. New York, NY: Knopf Publishers.
the-wto-and-subsidies
Smith, M., & Elgstrom, O. (2000). Introduction:
Goldstein, A., & McGuire, S. (2004). The political Negotiation and policy-making in the European
economy of strategic trade policy and the Brazil- Union - Processes, system and order. Journal
Canada subsidies saga. World Economy, 27(4), of European Public Policy, 7(5), 673–683.
541–566. doi:10.1111/j.0378-5920.2004.00613.x doi:10.1080/13501760010014894
Hart, D. (2001). Antitrust and technological in- Teteryaknikova, M. (2008). R&D in the net-
novation in the US: Ideas, institutions, decisions work of international trade: Multilateral versus
and impacts, 1890-2000. Research Policy, 30(6), regional trade agreements. Staff Working Paper
923–936. doi:10.1016/S0048-7333(00)00165-7 ESRD-2009-03, World Trade Organization, 10
November 2008.
Hu, M.-H., & Matthews, J. (2008). China’s na-
tional innovative capacity. Research Policy, 37, UNICE. (2007, 16 November). UNICE position
1465–1479. doi:10.1016/j.respol.2008.07.003 paper on cohesion policy and competitiveness.
Brussels, Belgium: UNICE. Retrieved January 7,
Kang, M. (2000). Trade policy mix under the
2007, from http://www.unice.org/DocShareNo-
WTO: Protection of TRIPs and R&D subsidies.
Frame/Common/GetFileURL.asp?FileURL=F_4
KIEP Working Paper 00-11, Korea Institute for
International Economic Policy, December 2000.

120
Technology and the Multipolar Global Economy

Zhou, P., & Leydesdorff, L. (2006). The emer- ENDNOTES


gence of China as a leading nation in science.
Research Policy, 35, 83–104. doi:10.1016/j.
1
This section draws on McGuire and Smith,
respol.2005.08.006 2008

This work was previously published in Disruptive Technologies, Innovation and Global Redesign: Emerging Implications, edited
by Ndubuisi Ekekwe and Nazrul Islam, pp. 331-344, copyright 2012 by Information Science Reference (an imprint of IGI Global).

121
122

Chapter 11
Coupling GPS and GIS
Mahbubur R. Meenar
Temple University, USA

John A. Sorrentino
Temple University, USA

Sharmin Yesmin
Temple University, USA

ABSTRACT
Since the 1990s, the integration of GPS and GIS has become more and more popular and an industry
standard in the GIS community worldwide. The increasing availability and affordability of mobile GIS
and GPS, along with greater data accuracy and interoperability, will only ensure steady growth of this
practice in the future. This chapter provides a brief background of GPS technology and its use in GIS,
and then elaborates on the integration techniques of both technologies within their limitations. It also
highlights data processing, transfer, and maintenance issues and future trends of this integration.

INTRODUCTION a certain time, a series of places over time, or a


place with no regard to time, GPS can measure it,
The use of the Global Positioning System (GPS) locate it, track it.” (Steede-Terry, 2000).
as a method of collecting locational data for Geo- Although GIS was available in the market
graphic Information Systems (GIS) is increasing in the 1970s, and GPS in the 1980s, it was only
in popularity in the GIS community. GIS data is in the mid-1990s that people started using GPS
dynamic – it changes over time, and GPS is an coupled to GIS. The GPS technology and its
effective way to track those changes (Steede-Terry, analogs (Global Navigation Satellite System or
2000). According to Environmental Systems GLONASS in Russia and the proposed Galileo
Research Institute (ESRI) president Jack Dan- system in Europe) have proven to be the most
germond, GPS is “uniquely suited to integration cost-effective, fastest, and most accurate meth-
with GIS. Whether the object of concern is moving ods of providing location information (Longley
or not, whether concern is for a certain place at et. al, 2005; Trimble, 2002; Taylor et. al, 2001).
Organizations that maintain GIS databases – be
DOI: 10.4018/978-1-4666-2038-4.ch011 they local governments or oil companies – can

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Coupling GPS and GIS

easily and accurately inventory either stationary The GPS Technology


or moving things and add those locations to their
databases (Imran et. al, 2006; Steede-Terry, 2000). The GPS data can be collected from a constellation
Some common applications of coupling GPS and of active satellites which continuously transmit
GIS are surveying, crime mapping, animal track- coded signals to receivers and receive correctional
ing, traffic management, emergency management, data from monitoring stations. GPS receivers
road construction, and vehicle navigation. process the signals to compute latitude, longitude,
and altitude of an object on earth (Giaglis, 2005;
Kennedy, 2002).
BACKGROUND A method, known as triangulation, is used
to calculate the position of any feature with the
Need for GPS Data in GIS known distances from three fixed locations (Le-
tham, 2001). However, a discrepancy between
When people try to find out where on earth they satellite and receiver timing of just 1/100th of
are located, they rely on either absolute coordi- a second could make for a misreading of 1,860
nates with latitude and longitude information or miles (Steede-Terry, 2000). Therefore, a signal
relative coordinates where location information is from a fourth satellite is needed to synchronize
expressed with the help of another location (Ken- the time between the satellites and the receivers
nedy, 2002). GIS maps can be created or corrected (Maantay and Ziegler, 2006; Longley et. al, 2005;
from the features entered in the field using a GPS Letham, 2001). To address this fact, the satellites
receiver (Maantay and Ziegler, 2006). Thus people have been deployed in a pattern that has each one
can know their actual positions on earth and then passing over a monitoring station every twelve
compare their locations in relation to other objects hours, with at least four visible in the sky all the
represented in a GIS map (Thurston et. al, 2003; times (Steede-Terry, 2000).
Kennedy, 2002). The United States Navigation Satellite Timing
GIS uses mainly two types of datasets: (a) and Ranging GPS (NAVSTAR-GPS) constella-
primary, which is created by the user; and (b) tion has 24 satellites with 3 spares orbiting the
secondary, which is collected or purchased from earth at an altitude of about 12,600 miles (USNO
somewhere else. In GIS, primary data can be NAVSTAR GPS, 2006; Longley et. al, 2005;
created by drawing any feature based on given Steede-Terry, 2000). The GLONASS consists of
dimensions, by digitizing ortho-photos, and by 21 satellites in 3 orbital planes, with 3 on-orbit
analyzing survey, remote sensing, and GPS data. spares (Space and Tech, 2005). The proposed sys-
Using GPS, primary data can be collected accu- tem GALILEO will be based on a constellation of
rately and quickly with a common reference system 30 satellites and ground stations (Europa, 2005).
without any drawing or digitizing operation. Once The NAVSTAR-GPS has three basic segments:
the primary data is created, it can be distributed (1) the space segment, which consists of the satel-
to others and be used as secondary data. Before lites; (2) the control segment, which is a network
using GPS as a primary data collection tool for of earth-based tracking stations; and (3) the user
GIS, the users need to understand the GPS tech- segment, which represents the receivers that pick
nology and its limitations. up signals from the satellites, process the signal
data, and compute the receiver’s location, height,
and time (Maantay and Ziegler, 2006; Lange and
Gilbert, 2005).

123
Coupling GPS and GIS

Data Limitations and Accuracy Level now use Carrier Phase Tracking, an application
of DGPS, which returns positional accuracies
Besides the timing discrepancies between the down to as little as 10 centimeters (Maantay and
satellites and the receivers, some other elements Ziegler, 2006; Lange and Gilbert, 2005).
that reduce the accuracy of GPS data are orbit
errors, system errors, the earth’s atmosphere, and
receiver noise (Trimble, 2002; Ramadan, 1998). INTEGRATION OF GPS AND GIS
With better attention to interoperability between
the GPS units, hardware, and software, some of The coupling of GPS and GIS can be explained
these errors can be minimized before the data are by the following examples:
used in GIS (Thurston et. al, 2003; Kennedy, 2002).
Using a differential correction process, the • A field crew can use a GPS receiver to
receivers can correct such errors. The Differential enter the location of a power line pole in
GPS (DGPS) uses two receivers, one stationary need of repair; show it as a point on a map
and one roving. The stationary one, known as displayed on a personal digital assistant
the base station, is placed at a precisely known (PDA) using software such as ArcPad from
geographic point, and the roving one is carried by ESRI; enter attributes of the pole; and fi-
the surveyor (Maantay and Ziegler, 2006; Imran nally transmit this information to a central
et. al, 2006; Thurston et. al, 2003; Kennedy, 2002; database (Maantay and Ziegler, 2006).
Taylor et. al, 2001; Steede-Terry, 2000). The base • A researcher may conduct a groundwater
station sends differential correction signals to the contamination study by collecting the co-
moving receiver. ordinates and other attributes of the wells
Prior to 2000, the GPS signal data that was using a GPS; converting the data to GIS;
available for free did not deliver horizontal posi- measuring the water samples taken from
tional accuracies better than 100 meters. Data with the wells; and evaluating the water quality
high degree of accuracy was only available to U.S. parameters (Nas and Berktay, 2006).
government agencies and to some universities.
After the U.S. Department of Defense removed There are many ways to integrate GPS data in
the restriction in May 2000, the positional accu- GIS, ranging from creating new GIS features in
racy of free satellite signal data increased to 15 the field, transferring data from GPS receivers to
meters (Maantay and Ziegler, 2006). In September GIS, and conducting spatial analysis in the field
2002, this accuracy was further increased to 1 to (Harrington, 2000a). More specifically, the GPS-
2 meters horizontally and 2 to 3 meters vertically GIS integration can be done based on the following
using a Federal Aviation Administration funded three categories – data-focused integration, posi-
system known as Wide Area Augmentation Sys- tion-focused integration, and technology-focused
tem (WAAS). WAAS is available to the public integration (Harrington, 2000a). In data-focused
throughout most of the continental United States integration, the GPS system collects and stores
(Maantay and Ziegler, 2006). data, and then later, transfers data to a GIS. Again,
Depending on the receiver system, the DGPS data from GIS can be uploaded to GPS for update
can deliver positional accuracies of 1 meter or and maintenance. The position-focused integration
less and is used where high accuracy data is re- consists of a complete GPS receiver that supplies
quired (Maantay and Ziegler, 2006; Longley et. a control application and a field device application
al, 2005; Lange and Gilbert, 2005; Taylor et. al, operating on the same device or separate devices.
2001). For example, the surveying professionals In the technology-focused integration, there is

124
Coupling GPS and GIS

no need for a separate application of a device to The scale and datum of the base map are also
control the GPS receiver; the control is archived important. For example, a large-scale base map
from any third party software (Harrington, 2000a). should be used as a reference in a site specific
Figure 1 provides an example of a schematic project in order to avoid data inaccuracy. While
workflow process of the GPS-GIS integration by collecting GPS data in an existing GIS, the datum
using Trimble and ArcGIS software. In short, the designation, the projection and coordinate system
integration of GPS and GIS is primarily focused designation, and the measurement units must be
on three areas - data acquisition, data processing identical (Kennedy, 2002; Steede-Terry, 2000). It
and transfer, and data maintenance. is recommended that all data should be collected
and displayed in the most up-to-date datum avail-
Data Acquisition able (Lange and Gilbert, 2005).
The user may create a data dictionary with
Before collecting any data, the user needs to de- the list of features and attributes to be recorded
termine what types of GPS techniques and tools before going to the field or on-spot. If it is created
will be required for a particular accuracy require- beforehand, the table is then transferred into the
ment and budget. The user needs to develop or GPS data collection system. Before going to the
collect a GIS base data layer with correct spatial field, the user also needs to find out whether the
reference to which all new generated data will be locations that will be targeted for data collection
referenced (Lange and Gilbert, 2005). are free from obstructions. The receivers need a
clear view of the sky and signals from at least
four satellites in order to make reliable position
Figure 1. Example workflow process of GPS-GIS
measurement (Lange and Gilbert, 2005; Giaglis,
integration
2005). In the field, the user will check satellite
availability and follow the manuals to configure
GPS receivers before starting data collection.
GIS uses point, line, and polygon features, and
the data collection methods for these features are
different from one another. A point feature (e.g.,
an electricity transmission pole) requires the user
to remain stationary at the location and capture
the information using a GPS device. For a line
feature (e.g., a road), the user needs to record the
positions periodically as s/he moves along the
feature in the real world. To capture a polygon
feature (e.g., a parking lot) information, the posi-
tions of the recorder are connected in order to form
a polygon and the last position always connects
back to the first one. The user has to decide what
types of features need to be created for a GIS map.
In a small scale map, a university campus can be
shown as a point, whereas in a detailed map, even
a drain outlet can be shown as a polygon.
GPS coordinates can be displayed in real time
in some GIS software such as ESRI ArcPad, In-

125
Coupling GPS and GIS

tergraph Intelliwhere, and Terra Nova Map IT. In Data Processing and Transfer
the age of mobile GIS, users can go on a field trip,
collect GPS data, edit, manipulate, and visualize Once the data are collected, they can be down-
those data, all in the field. While GPS and GIS loaded, post-processed, and exported to GIS
are linked, the GPS receiver can be treated as the format from the field computer to the office
cursor of a digitizer. It is linked to the GIS through computer. Where real-time signals are needed but
a software module similar to a digitizer control- cannot be received, the post-processing techniques
ler where data are saved into a GIS filing system can be applied to re-process the GPS positions.
(Ramadan, 1998; UN Statistics Division, 2004). Using this technique, the feature positions can
In real-time GPS/GIS integration, data may be be differentially corrected to the highest level of
collected and stored immediately for future use in accuracy. The users who integrate GPS data into
a mapping application, or data may be discarded their own applications need to consider how and
after use in a navigation or tracking application when they should apply differential corrections.
(Thurston et. al, 2003). Real-time processing allows recording and cor-
For example, Map IT is a new GIS software recting a location in seconds or less, but is usually
designed for digital mapping and GPS data capture less accurate. Post-processing allows the surveyor
with a tablet PC. The software connects a tablet pc recording a location as much time as s/he likes,
to a GPS antenna via a USB port. While conduct- and then differentially corrects each location back
ing the field work, the user may use the software in the office. This technique is used in mapping
to: (a) display the current ground position on the or surveying (Steede-Terry, 2000; Thurston et.
tablet PC’s map display in real time; (b) create new al, 2003). Instead of relying on real-time DGPS
features and add coordinates and other attributes; alone, the users should enable their applications
(c) edit or post-process the data in real time; and to record raw GPS data and allow post-processing
(d) automatically link all activity recorded in the techniques to be used either solely or in conjunc-
field (including photographs, notes, spreadsheets, tion with real-time DGPS (Harrington, 2000b).
and drawings) to the respective geographic posi- Most GPS receiver manufacturers have their
tions (Donatis and Bruciatelli, 2006). own data file format. GPS data is stored in a
Although the integration of GIS and GPS can receiver in its own format and later can be trans-
in general increase accuracy and decrease project lated to various GIS formats (Lange and Gilbert,
costs and completion time, it can also create new 2005; Ramadan, 1998). Data can be transferred
problems, including creation of inaccurate data in a couple of ways. One simple way is collecting
points and missing data points (Imran et. al, 2006). coordinates and attributes in a comma delimited
Sometimes a handheld GPS navigator may not file from the GPS device storage. The other more
be able to acquire a lock on available satellites preferable way is converting the data from GPS
because of natural conditions like dense forest storage to the user-specific database interchange
canopies, or human-made structures like tall format using a data translation program (Lange
buildings or other obstacles (Lange and Gilbert, and Gilbert, 2005). Such a program allows the
2005; Thurston et. al, 2003). Data collection with user to (1) generate metadata; (2) transform the
GPS also might get affected by any equipment coordinates to the projection, coordinate system,
malfunction in the field. and datum of the user’s choice; and (3) translate
GPS data into customized formats that the GPS
manufacturers could never have anticipated
(Lange and Gilbert, 2005).

126
Coupling GPS and GIS

A number of file interchange protocols are tivities which will further increase the availability
available to exchange data between different of accurate data to general users.
brands and types of receivers. One widely used New applications of GIS-GPS integration are
interchange protocol is the Receiver Independent constantly becoming popular and widespread. The
Exchange Format (RINEX), which is supported latest developments in GPS technology should
by most satellite data processing software (Yan, encourage more use of such integration in the
2006). Another commonly used interface standard future. Reduction in cost and personnel training
is a standard released by the National Marine time of using GPS technology with high data ac-
Electronics Association (NMEA). Most GPS curacy will eventually provide a cost-effective
receivers support this protocol and can output means of verifying and updating real time GIS
NMEA messages, which are available in ASCII mapping in the field (Maantay and Ziegler, 2006;
format (Yan, 2006). UN Statistics Division, 2004).

Data Maintenance
CONCLUSION
For data revisions or data maintenance, GIS data
is transferred back to the field computer and can In today’s market, the mobile GIS and GPS devices
be verified or updated in the field. The user can are available with greater accuracy at a reduced
relocate features via navigation, verify the position cost. The data transfer process from GPS to GIS
and attribute features, and navigate to locations has become faster and easier. GIS software is get-
to collect new attribute data. The user may select ting more powerful and user friendly, and GPS
features and examine them in the field, modify devices are increasingly getting more accurate
attributes, and even collect new features if desired. and affordable. The integration of GIS and GPS
Using receivers such as Trimble, any feature that has been already proven to be very influential in
has been added or updated is automatically marked spatial data management, and it will have steady
to determine which data needs to go back to GIS growth in the future.
(Trimble, 2002).

REFERENCES
FUTURE TRENDS
Donatis, M., & Bruciatelli, L. (2006). Map it:
The future trends of GIS-GPS integration will be the GIS software for field mapping with tablet
focused on data accuracy, interoperability, and PC. Computers & Geosciences, 32(5), 673–680.
affordability. In order to make the WAAS level of doi:10.1016/j.cageo.2005.09.003
precision available to users worldwide, the Unites Europa web site. http://www.eu.int/comm/dgs/
States is working on international agreements to energy_transport/galileo /index_en.htm, accessed
share similar technologies available in other parts on December 12, 2005
of the world, namely Japan’s Multi-Functional
Satellite Augmentation System (MSAS) and Giaglis, G. (2005). Mobile Location Services. In
Europe’s Euro Geostationary Navigation Overlay M. Khosrow-Pour (Ed.), Encyclopedia of Infor-
Service (EGNOS) (Maantay and Ziegler, 2006). mation Science and Technology, 4, 1973-1977.
In addition, the European satellite positioning Pennsylvania: Idea Group Reference.
system, Galileo, will be dedicated to civilian ac-

127
Coupling GPS and GIS

Harrington, A. (2000a). GIS and GPS: Technolo- Space and Tech web site. http://www.spacean-
gies that Work Well Together. Proceedings in the dtech.com/spacedata/constellations/glonass_con-
ESRI User Conference, San Diego, California. sum.shtml, accessed on December 12, 2005
Harrington, A. (2000b). GPS/GIS Integration: Steede-Terry, K. (2000). Integrating GIS and
What Can You Do When Real-Time DGPS Doesn’t the Global Positioning System. California: ESRI
Work? GeoWorld, 13(4). Available online at http:// Press.
www.geoplace.com/gw/2000/0400/0400int.asp,
Taylor, G., Steup, D., Car, A., Blewitt, G., &
accessed on August 25, 2006.
Corbett, S. (2001). Road reduction filtering for
Imran, M., Hassan, Y., & Patterson, D. (2006). GPS-GIS navigation. Transactions in GIS, 5(3),
GPS-GIS-based procedure for tracking vehicle 193–207. doi:10.1111/1467-9671.00077
path on horizontal alignments. Computer-Aided
Thurston, J., Poiker, T., & Moore, J. (2003). In-
Civil and Infrastructure Engineering, 21(5),
tegrated Geospatial Technologies – A Guide to
383–394. doi:10.1111/j.1467-8667.2006.00444.x
GPS, GIS, and Data Logging. New Jersey: John
Kennedy, M. (2002). The Global Positioning Wiley & Sons, Inc.
System and GIS: An Introduction. New York:
Trimble Navigation Limited. (2002). TerraSync
Taylor and Francis.
Software – Trimble’s Productive Data Collection
Lange, A., & Gilbert, C. (2005). Using GPS for and Maintenance Tool for Quality GIS Data.
GIS Data Capture. In Geographic Information California: Trimble Navigation Limited.
Systems: Principles, Techniques, Management,
UN Statistics Division. (2004). Integration of GPS,
and Applications (pp. 467-476). NJ: John Wiley
Digital Imagery and GIS with Census Mapping.
& Sons, Inc.
New York: United Nations Secretariat.
Letham, L. (2001). GPS made easy. Washington.
USNO NAVSTAR GPS web site. http://tycho.
The Mountaineers, 5(12), 183–186.
usno.navy.mil/gpsinfo.html, accessed on August
Longley, P., Goodchild, M., Maguire, D., & Rhind, 26, 2006.
D. (2005). Geographic Information Systems and
Yan, T. (2006). GNSS Data Protocols: Choice and
Science. New Jersey: John Wiley & Sons, Inc.
Implementation. Proceedings in the International
(pp. 122-123, 172-173).
Global Navigation Satellite Systems Society IG-
Maantay, J., & Ziegler, J. (2006). GIS for the Urban NSS Symposium, Australia.
Environment. California: ESRI Press, 306-307.
Nas, B., & Berktay, A. (2006). Groundwater
KEY TERMS AND DEFINITIONS
contamination by nitrates in the city of Konya,
(Turkey): a GIS perspective. Journal of Environ- Coordinate System: A reference framework
mental Management, 79(1), 30–37. doi:10.1016/j. used to define the positions of points in space in
jenvman.2005.05.010 either two or three dimensions.
Ramadan, K. (1998). The Use of GPS for GIS Datum: The reference specifications of a
Applications. Proceedings in the Geographic measurement system, usually a system of coor-
Information Systems: Information Infrastructures dinate positions on a surface or heights above or
and Interoperability for the 21st Century Informa- below a surface.
tion Society, Czech Republic.

128
Coupling GPS and GIS

Differential GPS (DGPS): The DGPS is Projection: A method requiring a systematic


used to correct GPS signal data errors, using two mathematical transformation by which the curved
receivers, one stationary (placed at a precisely surface of the earth is portrayed on a flat surface.
known geographic point) and one roving (carried Scale: The ratio between a distance or area
by the surveyor). The stationary receiver sends on a map and the corresponding distance or area
differential correction signals to the roving one. on the ground, commonly expressed as a fraction
GPS Segment: GPS consists of three seg- or ratio.
ments: (i) space segment – the GPS satellites, (ii) Wide Area Augmentation System: (WAAS):
user segment – the GPS handheld navigator, and The WAAS is a system that can increase the GPS
(iii) ground control segment – the GPS monitor- signal data accuracy to 1 to 2 meters horizontally
ing stations. and 2 to 3 meters vertically.

This work was previously published in Handbook of Research on Geoinformatics, edited by Hassan A. Karimi, pp. 277-284,
copyright 2009 by Information Science Reference (an imprint of IGI Global).

129
130

Chapter 12
Using Geospatial Techniques
to Analyze Landscape
Factors Controlling Ionic
Composition of Arctic Lakes,
Toolik Lake Region, Alaska
Prasad A. Pathak
University of North Carolina at Greensboro, USA

Stephen Whalen
University of North Carolina at Chapel Hill, USA

ABSTRACT
The impacts of climate change on landscapes in arctic Alaska are evident in terms of permafrost melting,
frequent thermokarst activity, and the occurrence of more broadleaf vegetation. These changes may alter
natural biogeochemical cycles of ions along with major nutrients and affect ionic compositions of lakes,
as they are connected with the landscapes. However, the nature of the connectivity between lakes and
landscapes in this region is not yet explored. The authors propose that geospatial analysis of landscape
properties along with observed lake ion concentrations will enable an understanding of the currently
existing landscape controls over ion inputs into the lakes. For the watersheds of 41 lakes in the Arctic
Foothills region of Alaska, spatial properties of natural vegetation communities expressed in terms of
percentage, shape complexity, and patch density metrics were derived using satellite data. Regression
analyses were performed for concentration of ions as well as conductivity in lake water where the spa-
tial metrics along with lake physical properties, lake order, and glacial till age categories were used
as predicting variables in the regression. Landscape metrics for major land covers i.e., Percentage of
Moist Acidic Tundra (MAT) and Moist Non-acidic Tundra (MNT) were the major predicting variables
for concentration of several ions.

DOI: 10.4018/978-1-4666-2038-4.ch012

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Using Geospatial Techniques to Analyze Landscape Factors Controlling Ionic Composition

INTRODUCTION zone interactions (Gergel et al., 1999; Griffith et


al., 2002).In contrast, King et al. (2005) argued
The ionic composition of lakes plays a significant that if remote sensing data is converted into spa-
role in controlling their trophic structure. Anions tial and structural information about land cover
and cations are important for many aquatic pri- researchers will be able to understand indirectly
mary producers as they control various cellular root zone interactions and weathering processes.
processes. Further, they are crucial to maintain the Such spatial and/or structural information, when
pH of surface waters, creating favorable condi- expressed in terms of landscape metrics, allows
tions for several prokaryotes and plants to grow quantitative analyses for lake-landscape interac-
(Smith, 1995). tions (O’Neill et al., 1997).
In closed drainage basins, the ionic composi- For example, Stewart et al. (2001) studied
tion is controlled by the evaporation rate and pre- Proportion and Fragmentation metrics related to
cipitation (Lesack & Melack, 1991). Specifically riparian vegetation. The authors considered ripar-
within low arctic watersheds, it has been observed ian zones as regions of high water fluctuations and
that transport of Na+ and K+ was governed by correlated landscape metrics for the riparian zones
precipitation whereas Ca2+ and Mg2+ concentra- with the trophic structure of lakes. Similarly, Patch
tions in lakes were related to chemical weathering Density of a land cover is another such metric
(Cornwell, 1992). For lakes in the coastal region, useful for investigating water quality (Johnson
atmospheric inputs were considered to be im- et al., 1997). Patch Shape Complexity metrics,
portant. However, weathering of parent material mostly used in wildlife studies (Forman & Gordon,
(i.e., soil and rock) is considered to be the major 1986), are also adopted in limnological studies
source of ions for lakes (Wetzel, 2001). Weather- to understand growth stages of land covers. For
ing processes are affected by several landscape instance, prominent land covers within basins
factors, with land use and land cover being the will occupy most of the area and exhibit simple
most prominent (Drever, 1994). shapes. On the other hand, emerging land covers
Likens et al. (1977) and Drever (1994) em- will display more complex shapes. It is likely that
phasized that natural vegetation aids chemical the dominant land cover and the emerging land
weathering of parental material via natural biota cover may exhibit different root zone properties
in the root zone. In particular, the mineralization and influence export of ions into water bodies ac-
and immobilization processes carried out by these cordingly. Using the same principle, Kearns et al.
microorganisms are crucial for recycling ions (2005) have demonstrated that shape complexity
(Frost et al., 2006; Xiao & Ji, 2007; Canham et metrics can be applied to understand landscape
al., 2004). It has been further explained that these controls over ionic composition of lakes.
processes along with soil moisture and hydrologi- The arctic lakes also should be studied care-
cal events determine influx of ions into surface fully for their ionic composition and associated
water bodies. Therefore, it is necessary to derive landscape controls, particularly when the climate
information about land covers to understand change is altering the dynamic between landscapes
their influence on water bodies. With the advent and lakes. Arctic landscapes are highly sensitive
of remote sensing technology, satellite datasets to the contemporary air temperature increase and
have been used to obtain such information at are therefore, experiencing severe alterations
various scales (Jensen, 2005). Limnological stud- (ACIA, 2005). Permafrost is rapidly melting due
ies, which have adopted remote sensing data, are to rising temperatures, which are also inducing
considered to be constrained only to above ground more frequent thermokarst activity. Both of these
information and not really inferred about root processes could release stored ions within soils

131
Using Geospatial Techniques to Analyze Landscape Factors Controlling Ionic Composition

into streams and lakes. Hence, the ion concentra- research. In this study, a more comprehensive ap-
tions of surface water may be altered in the near proach has been adopted to investigate currently
future due to terrestrial inputs (Keller et al., 2007). existing relationships between landscapes and
Previous studies quantified ions within particu- concentration of major ions in arctic lakes. We
lar vegetation communities (Marion et al., 1989; analyzed lake water chemistry data previously
Schimel et al., 1989; Giblin et al., 1991). These collected by our research group to evaluate rela-
studies indicated that soil moisture influences the tionships between common ions and vegetative
concentration of ions and vegetation communities characteristics of the surrounding watershed.
having higher amount of moisture in the soil of Opting for geospatial techniques, we proposed
their root zones exhibited higher leaf concentra- that spatial properties of different vegetation
tions of common ions. For example, moist soils communities derived from remote sensing data,
hosting broadleaf shrubs had higher potassium at the watershed level and within the vicinity of
contents compared to other vegetation types water channels, would be useful indicators of the
with lower soil moisture (Marion et al., 1989). delivery of common ions from landscapes to lakes.
Similarly, vegetation communities having deep The spatial knowledge of landscape factors was
root structures were found to have more access derived from a satellite image and other second-
to ions and soil micronutrients. Therefore, these ary datasets. Age of the till surfaces forming the
communities were thought to be sinks for ions, watersheds and order of lakes (Riera et al., 2000)
prohibiting release to surfaces waters (Schimel along with certain physical properties of the lakes
et al., 1996). were also included in the analyses as predicting
Only recently, a more comprehensive study landscape factors.
about geochemical weathering and its effects on Establishment of a set of landscape factors for
soil and streams has been carried out by Keller et arctic watersheds with respect to the concentra-
al. (2007). In the Arctic Foothills region of Alaska, tions of common ions was the primary aim of our
the authors compared the relative concentrations analysis. With adaptation of a geospatial approach
of Ca2+, K+, and P within certain streams, soils, we aimed to create a broadly applicable methodol-
permafrost layer and bed rocks. They concluded ogy so that other arctic regions could be analyzed
that permafrost has relatively higher concentra- in the similar fashion.
tions of these elements compared to active mineral
soil layer. By comparing the same bed rock mate-
rial on glacial tills of different ages, the authors METHODS
confirmed that differences in concentration of
these elements among the till surfaces was a Study Area
function of age. They concluded that weathering
plays a more important role than parent material. The area under consideration was the Toolik Lake
Moreover, they predicted that with climate change, region (68o 38’ N/ 149o 36’ W), situated in the
active soils will receive an increased amount of Arctic Foothills Province of Alaska, just north
the elements stored in permafrost leading to higher of the Brooks Range (Figure 1). Regionally, lake
concentrations in the streams. formation is by ice scour, thermokarst activity and
It is evident that these studies adopted a non- melting of ice blocks in glacial drift and outwash
spatial approach; however, they were helpful in plains (kettle lakes). The average local relief of
identifying important landscape factors and pro- the area is 750 m. Hillocks, exposed barren areas,
pose the underlying mechanism of lake-landscape and moraines characterize the entire landscape.
interactions, which is the focus of the current Water tracks, streams, and rivers, dissect the area.

132
Using Geospatial Techniques to Analyze Landscape Factors Controlling Ionic Composition

This region has experienced multiple glaciations 2005 is used for this research. The image is
in the mid-Pleistocene and late Pleistocene eras. provided after merging the original panchro-
Study lakes are located in both the Sagavanirktok matic and multispectral images obtained at
and Toolik areas and therefore sited on glacial the same time. The image has a pixel size
drifts of varying age. Age of Itkillik I advance of 5 meters X 5 meters and it has 3 spectral
obtained by carbon dating is 53,000 B.P. Follow- bands: Band1 (0.50 – 0.59 μm), Band2
ing that, the Itkillik II advance ranged from 25 (0.61 – 0.68 μm) and Band3 (0.79 – 0.89
to 11.5 ka. The Sagavanirktok River glaciation μm). The SPOT Corporation did not provide
is from Mid-Pleistocene time period, having an any specifications about pre-processing
estimated age ranging from 780,000 to 125,000 and merging technique used to derive the
years B.P. (Hamilton, 1986). Age of these glacial product. Traditionally such merged products
drifts has played an important role with respect are used only for visual inspection, and they
to soil pH, with older drifts showing more acidic are not used for digital classification as they
soils (Walker et al., 1994). do not conserve original spectral proper-
ties. However, new techniques similar to
Data wavelet based fusion and High Pass filter
fusion retain original spectral properties of
Spatial Datasets the multispectral image used in the merging
process, allowing digital classification of
a. A SPOT (Satellite Pour L’Observation de la the end product (Gangkofner et al., 2008).
Terre; 5 HRVIR) image acquired on July 25th, This study is based on the merged satellite

Figure 1. Location map of the study area and SPOT 5(2005) image snapshot (False Color image with
green, red, near infra-red bands) with lake groups under consideration

133
Using Geospatial Techniques to Analyze Landscape Factors Controlling Ionic Composition

image so the nature of spectral properties inorganic carbon [DIC; predominately HCO3-])
of major land covers was verified in the as well as specific conductance.
ERDAS Imagine using the Spectral Profile
tool as well as NDVI ranges for land covers Derivation of Landscape Factors
compared with earlier studies. The image
encompassed approximately a 60 x 60 square Landscape factors were derived using following
kilometer area surrounding the study region. steps:
A Universal Transverse Mercator (UTM)
projection, zone 6, is applied to geometrically 1. Watershed Delineation: Hydrology tools
correct the image (Figure 1). and Conditional tools in the Spatial Analyst
b. Digital Elevation Model (DEM): A 5 meter extension of ArcGIS 9.3 were used to derive
X 5 meter DEM of the study area is used to watersheds. For each lake an approximate
delineate individual watersheds for each lake portion of the entire DEM was initially se-
using hydrology tools in ArcGIS. The DEM lected. Using the portion of DEM as input,
data was collected by Intermap Technologies Flow direction and Flow Accumulation
using an airborne Star 3i X-Band radar sys- surfaces were derived. Based on the Flow
tem. The horizontal accuracy of the data was Accumulation surface and the SPOT image,
2.5 meter RMSE with a 1 centimeter verti- pour points for individual watersheds were
cal resolution and 3 meters RMSE (Nolan, determined. The pour point and the respec-
2003). tive Flow Direction surface were used to
c. Lake properties: Lake Order depicting delineate the watershed for every lake.
their landscape position was derived using 2. Digital Classification: These watersheds
topographic maps of the area. The Strahler were used to divide the SPOT image. The im-
stream order of the outlet streams for lakes ages obtained for each watershed then were
was followed to assign the order to each utilized for calculation of the Normalized
lake. Lake area, perimeter, and shoreline Difference Vegetation Index (NDVI) layer,
development factors were derived from which represents vigor of vegetation but it
satellite data. is also characteristic for different vegetation

Non-Spatial Datasets
Table 1. Lakes and sampling dates
Maximum lake depth, light attenuation coef- Lake # Sampling Date
ficients within the water column, euphotic zone 115, 116, 117, 118 29 June 2001
depth (depth of penetration of 1% surface irradi- 119, 120, 121, 122 7 July 2001
ance), and till surface age category were observed 127, 128, 129, 130 8 July 2002
during field campaigns. 131, 132, 133, 134 10 July 2002
135, 136, 137, 138 12 July 2002
Concentrations of Ions
139, 140, 141, 142 28 June 2003
143, 144, 145, 146 3 July 2003
Forty one lakes (Table 1) were sampled over three
147, 148, 149, 150 14 July 2003
different thaw seasons (2001-2003) to determine
151, 152, 153, 154 24 July 2003
concentrations of common anions and of cations
E4 28 July 2003
(Ca2+, Mg2+, Na+, K+, SO42-, Cl- and dissolved
123, 124, 125, 126 4 Aug 2003

134
Using Geospatial Techniques to Analyze Landscape Factors Controlling Ionic Composition

communities (Jensen, 2005). The layer of Snowbed complexes in Stine et al. (2010) were
NDVI was added to spectral bands of each adopted and modified for this study. Using these
watershed prior to classification with the criteria, rules were developed in the Knowledge
aim of improving the accuracy of the digital Engineer available with ERDAS Imagine (Ap-
classification. pendix B, Figure 5). The resultant Snowbed
complex layers were merged with the initially
The scheme for major land cover types for classified image of each watershed. During the
this study was a combination of classification summer season of 2008, six watersheds (Table 3)
systems used by Walker et al. (1994) and field were visited to collect ground truth information
modifications done by Stine and Ray (Table 2). at the points generated by a stratified random
The main categories were as follows. method. The data was used to assess accuracy of
The ISODATA clustering method for unsuper- classification of these watersheds. The average
vised classification was run on the NDVI combined accuracy obtained was approximately (88.6%),
image of each watershed. Initially 100 spectral which was above the suggested minimum accu-
classes were derived and finally merged to achieve racy level for ecological studies (Congalton et al.,
the desired land cover categories. The Snowbed 1991). Lake GTH 153 and the nearby 3 watersheds
complex category was derived after initial unsu- were affected by shadows of clouds. Even though
pervised classification. A knowledge-based clas- the shadow effect reduced accuracy for this group
sification approach was used for identifying of watersheds, classifying at watershed level
snowbeds (Appendix B, Figure 5). As field con- provided much better control to minimize the
ditions described by Walker et al. (1994), Snow- effect.
beds occur over gentle north facing slopes. The
NDVI and spectral value ranges observed for

Table 2. Vegetation communities and species composition

Class Description
Barren complex Barren surfaces, sparsely vegetated, rocks covered with lichens
Moist Acidic Tussock Tundra (MAT) Eriophorum vaginatum, Carex bigelowii, Betula nana, Salix pulchra, Sphagnum spp.
complex
Moist Non-acidic Tussock Tundra Salix reticulate, Saxifraga oppositifolia, Cares bigelowii, Carex membranacea, Dryas integri-
complex folia, Ledum decumbens, Equisetum ravens
Shrub complex Betula spp., Salix spp., Sphagnum spp.
Riparian complex Eriophorum anguistifolium, Salix pulchra, Salix alaxensis, Salix richardisonii
Fen Complex Carex rariflora, Carex rotundata, and mosses like Sphagnum spp., Carex chordorrhiza, Carex
aquitilis, and Tomentypnum nitens
Heath complex Festuca altaica, Empetrum hermaphroditum, Loiseleuria procumbens, Dryas octopetala, Cas-
siope tetragona, Salix phlebophylla
Snowbed complex Cassiope tetragona, Salix rotundifolia, Arnica frigida
Mountain Meadow complex Carex biglowii, Cassiope tetragona, Salix chamissonis
Aquatic vegetation complex Similar to fen but on lake-fringes
Water Lakes, streams, and rivers
Cloud
Shadows

135
Using Geospatial Techniques to Analyze Landscape Factors Controlling Ionic Composition

Table 3. Overall accuracy values for six water-


Percentage
sheds visited during summer of 2008

Watershed Overall Accuracy (%) Kappa Once every watershed was completely classified,
GTH 120 95 0.92
an Area column was calculated. The same column
GTH 133 86.36 0.82
was used to calculate percentage of each land
GTH 135 82.75 0.78
cover within watersheds.
GTH 144 85.29 0.75
Patch Density
GTH 149 93.75 0.89
GTH 153 88.88 0.85
Patch Density (PD) is expressed in terms of number
of patches of a vegetation type per 100 hectares.
It was calculated at the class level. To analyze if
Derivation of Buffer Zones Along Water
overall spatial distribution properties of any par-
Channels
ticular land cover class would be controlling ion
inputs to lakes, Patch Density was incorporated
To extract land covers within the proximity of
in this research:
water channels, it was necessary to first derive
the water channels. Using Con tool in the Hydrol-
ni
ogy toolset of ArcGIS, water channels for each PD =
A
(10000)(100)
watershed were identified. A cell value of 500
for the Flow Accumulation surface was used for
the process. where, ni represents the number of patches and A
The value was determined based on visual is total area of the landscape. This value is multi-
inspection of the Flow Accumulation surface and plied by 10,000 and 100 to convert to hectares.)
observed tracks on the SPOT image. Based on our
field experience regarding soil moisture around Edge Density
water tracks, a buffer distance of 20 meters along
the channels was utilized to extract land covers. Edge Density (ED) is the number of edges of the
These land covers were assumed to experience given class per unit area (meters per hectare). The
higher soil moisture and therefore a greater prob- greater the Edge Density, the more fragmented
ability of ion exchange within their root zones the class under consideration. Similar to Patch
(Giblin et al., 1991). Density, the Edge Density index indicates the
extent of fragmentation for a land cover class
Fragstats/ArcGIS within the given watershed.

m
As explained in Table 3, landscape metrics for
proportion, fragmentation, and shape complexity ED =
∑ e
k =1 ik
(10, 000)
A
were calculated using Fragstats 3.3. Thematic
maps of the watersheds were used as inputs for
this process, where Spatial Analyst extension of Here, eik represents sum of total edge in meters.
ArcGIS was necessary for recognizing inputs in A is the area of the landscape in square meters and
IMG format. is multiplied by 10,000 to convert to hectares.

136
Using Geospatial Techniques to Analyze Landscape Factors Controlling Ionic Composition

Landscape Shape Index 2 ln (0.25 pij )


FRACT =
ln aij
Landscape Shape Index (LSI) depicts the total edge
of a particular class (if this index is calculated at
class level), divided by the minimum possible Here, pij represents patch perimeter and aij
edge length for that class. Overall it is a more represents patch area.
direct measure of disaggregation of the land cover Shape indices were included in the current re-
class than Patch Density or Edge Density index. search to obtain indirect information about active
soil depth and probable moisture content of the
E soil. For example, riparian complex zones would
LSI = occur only near major streams and have greater
min E
soil depth compared to other vegetation classes
(Giblin et al., 1995). Structural properties such
Shape Index (Mean)
as very small patches (Mean Patch Size index) of
riparian complex would indicate that only small
Mean Shape Index (MSI) measures the average
areas near streams have deeper active soil zone
patch shape, or the average perimeter-to-area ratio
and may either be source or sink of nutrients.
for a particular patch type (class) or for all patches
It is well established that many landscape
in the landscape. SHAPE = 1 when the patch is
metrics provide redundant information (Turner et
circular (vector) or square (raster) and increases
al., 1989). To identify highly correlated metrics,
without limit as the patch shape becomes more
we initially performed correlation analysis. It was
irregular.
found that the Edge Density metric was highly
correlated with the Patch Density metric; hence,
(0.25) p ij it was omitted from the main analysis. To suit
MSI =
aij the analysis, the shape complexity metric was
converted into percent change from 1, which is
the numerical expression for the simplest shape.
For raster format of inputs pij represents sum
of the patch perimeter, aij represents patch area,
0.25 is a constant to adjust for a square standard. ANALYSIS

Fractal Dimension Index (Mean) Various statistical methods such as Correlation


Analysis, Analysis of Variance (ANOVA), Prin-
Units: None. Range: 1 < FRACT < 2. ciple Component Analysis (PCA), and Multiple
Regression Analysis are used in similar limnologi-
A fractal dimension greater than 1 for a 2-dimen- cal studies (Johnson & Guage, 1997). Advanced
sional patch indicates a departure from Euclidean methods such as Path Analysis are also useful
geometry, i.e., an increase in shape complex- (Liu et al., 1997). However, regression analysis
ity. FRACT approaches 1 for shapes with very is the most widely used of the analytical meth-
simple perimeters such as circles or squares, and ods, because it is easy to interpret and replicate
approaches 2 for shapes with highly convoluted, (Webster et al., 1996).
plane-filling perimeters. In the current research, the backward elimi-
nation method of regression was adopted. The
landscape metrics along with non spatial attributes

137
Using Geospatial Techniques to Analyze Landscape Factors Controlling Ionic Composition

of lakes such as lake order, euphotic zone depth meter buffers of water channels. The standardized
and maximum depth were treated as predicting coefficient obtained for the variables (Prct_MNT
variables. Separate regression models were devel- [0.772], EZD [0.364], and PD_B_Shrub [0.329]),
oped for estimates of each major ion as well as indicated that Prct_MNT has more influence over
estimates of specific conductance. The standard- the specific conductance estimates compared to
ized regression coefficients obtained for individual the other variables.
predicting variables were used as indicators of Approximately 75% of the variance within the
their relative contributions to respective models. calcium estimates was attributed to Percentage of
MNT complex, the Euphotic Zone Depth, and the
Landscape Shape Index of MM complex. Accord-
RESULTS ing to the standardized coefficients for the predict-
ing variables, the contributions from Prct_MNT
The statistically significant models obtained at (0.571) and EZD (0.496) were roughly equal, but
the confidence interval of 95% are represented in LSI_MM (0.28) had a lesser contribution.
Table 4. Appendix A (Table 6) provides explana- The model developed for magnesium was able
tion about the acronyms utilized in these models. to explain 74% of variance within its estimates
The regression model for specific conductance using Percentage of MAT complex, presence
estimates explained 80% of the variability in the of the Till Age 2 category, and Euphotic Zone
data using the landscape metrics percentage of Depth. We obtained standardized coefficients
MNT complex, Euphotic Zone Depth, and Patch for three of the predicting variables (Prct_MAT
Density of Shrub complex observed within 20 [-0.546], Till Age Category 2 [0.3120], and EZD

Table 4. Landscape metrics

Type Metrics Level


Proportion Percentage Watershed, Buffer
Fragmentation Patch density, Edge density, Fractal Dimension, Patch size Watershed, Buffer
Shape Complexity Mean Shape Index, Landscape Shape Index Watershed, Buffer
Note: Basics concepts of landscape metrics were obtained from Fragstats documentation (McGarigal et al., 2002).

Table 5. Regression models for ions and conductivity

Dependent Variable Regression Model Obtained r2


Conductivity Conductivity = -29.426 + 2.246 (Prct_MNT) + 7.17 (EZD) + 0.195 (PD_B_Shrub) 0.80
Calcium Ca = -2.202 + 0.368 (Prct_MNT) + 2.13 (EZD) + 0.317 (LSI_MM)
2+
0.75
Magnesium Mg2+ = 2.415 – 0.037 (Prct_MAT) + 0.983 (Till Age 2) + 0.117 (EZD) 0.74
Sodium Na = 0.309 – 0.006 (Prct_MAT)
+
0.50
Potassium K = 0.212 – 0.003 (Prct_MAT) + 0.019 (EZD) + 0.114 (Till age 1)
+
0.63
Sulfate SO42- = 4.351 + 0.059 (PD_MM) – 0.407 (LSI_B_Fen) + 0.215 (LSI_ Shrub) 0.37
Chloride Cl = 0.186 + 0.042 (Prct_Heath) – 0.304 (Till age 2) + 0.001 (PD_ Fen) + 0.023 (MS_B_Fen)
-
0.60
+ 0.023 (MS_B_MM_Prct)
DIC DIC = 22.966 + 0.893 (Prct_MNT) + 27.86 (Till Age 2) – 5.169 (FD_B_Shrub) + 3.146 (EZD) 0.90
+ 0.596 (LSI_MM)

138
Using Geospatial Techniques to Analyze Landscape Factors Controlling Ionic Composition

[0.227]). As the contribution is estimated by the indicated that Prct_Heath contributed more than
value and not by its sign, Prct_MAT was the most other variables.
significant variable in the model and EZD was The regression model for DIC was obtained
the least significant. using Percentage of MNT, Till Age category 2,
Only 50% of its variance was explained by the Fractal Dimension of Shrub within buffer zones
regression model developed for sodium. The only of channels, Euphotic Zone Depth, and Landscape
landscape factor included in the regression model Shape Index of MM complex. These landscape
was the Percentage of MAT complex. metrics were responsible for 65% of total variance.
In contrast, more than 63% of variation with The standardized regression coefficients obtained
potassium estimates was explained by the regres- for these variables (Prct_MNT [0.493], Till Age
sion model. The Percentage of MAT complex, Category 2 [0.372], FD_B_Shrub_Prct [-0.241],
Euphotic Zone Depth, and the presence of till EZD [0.256], and LSI_MM [0.224]) indicated that
age 1 category were the predicting variables in the most significant contribution was displayed
this model. Similar to the models for sodium and by the Prct_ MNT.
magnesium, the amount of MAT complex within
the watershed was the most significant predicting
variable. The standardized coefficients obtained DISCUSSION
for the predicting variables in this model were
Prct_MAT (-0.511), EZD (0.393), and Till Age The MAT complex and the MNT complex are the
Category 1 (0.331). major land cover types within the Arctic Foothills
Among the anions included in this study, the re- Province of Alaska (Walker et al., 1994). It has
gression model for sulfate was developed based on been observed that the MAT complex is prevalent
Patch Density of MM complex, Landscape Shape over geologically relatively older tills whereas
Index of Fen complex present within buffer zone, as MNT complex prominently occupies younger
and Landscape Shape Index of Shrub complex. tills (Walker et al., 1994). According to Walker
This model was able to explain only 37% of the et al. (2009), the older glacial surfaces have ac-
total variance within sulfate estimates. The Land- cumulated wind-blown glacial silt leading to more
scape Shape Index metrics for Fen complex within peat formation with higher soil moisture. All these
buffer zone was the most significant contributor processes were responsible for introduction of
to this model with a standardized coefficient of Sphagnum moss leading to acidic soils on older
-0.512. For comparison, the standardized coef- tills. It also allowed development of well-drained
ficients for PD_MM and LSI_Shrub were 0.401 soil surfaces resulting in weathering of calcium
and 0.398, respectively. rich parent material, adding to acidity of the
Percentage of Heath, Till Age Category 2, Patch soils. Similarly, Keller et al. (2007) indicated that
Density of Fen complex, Mean Shape Index of younger tills have relatively higher concentra-
Fen complex occurring within buffer zones, and tions of calcium and other ions, which are more
percent change from a value of 1 in the Mean susceptible to physical weathering. The glacial
Shape Index of MM complex were the landscape geology maps of this region are available only at
factors found to be significant for the regression a 1:250,000 scale, which is coarse with respect to
model developed for chloride. While depicting the watershed level analysis in the current research
60% of the variance within chloride estimates, (Hamilton, 1980). Also, on the ground, we have
the standardized coefficients for predicting vari- observed that the MAT and MNT complexes in
ables (Prct_Heath (0.568), Till_age2 (-0.322), ArcGIS are often intermixed and co-occur on
PD_Fen (0.212), and MS_B_MM_Prct (0.364)), older as well as younger tills. However, they

139
Using Geospatial Techniques to Analyze Landscape Factors Controlling Ionic Composition

follow the generalized rule of till age and tundra repository of cations and anions, is near to the sur-
type dominance. face. Therefore, the MNT vegetation type, which
Hence, we suggest that landscape metrics is prominently observed over younger till surfaces,
related to MAT and MNT complex are more in- could be indicative of the possible release of ions
dicative of till surface properties than the actual from weathering and thawing of permafrost layer
contribution by the land covers. For example, during the growing season. This logic is also sup-
the positive relationship between calcium and ported by reportedly higher concentrations of Ca2+,
Prct_MNT indicates that more calcium could be Fe3+, K+, and Na+ over Sagavanirktok glaciation
released from the landscape dominated by the surfaces, which are relative younger than Itkillik
MNT complex. Thus, geologically young tills, III surface (Keller et al., 2007).
which are active ion stores, would release more Hence, we suggest that the positive relationship
calcium. Similarly, magnesium, potassium, and so- of MNT to ions such as calcium is more indicative
dium were negatively correlated with Prct_MAT, of the relative abundance of ions in the younger
suggesting that older tills which are dominated by till surfaces. In contrast, the negative correlation
MAT have already experienced weathering and exhibited by magnesium, potassium, and sodium
most of the ions have leached and not contribut- with Prct_MAT could be indicative of retention
ing towards contemporary lake water chemistry. of these ions by the MAT complex along with
MAT and MNT complexes are major land lower concentration of ions within active soil on
covers within the Arctic Foothills Province of older tills. This is illustrated by considering lakes
Alaska, with the MAT complex is prevalent over GTH 119 and GTH 153 which show 8% and 43%
glacially older tills and the MNT complex domi- MAT complex within their respective watersheds
nant over relatively younger tills (Walker et al., (Figure 2). The Na+ concentration in GTH 119 was
1994). Wind-blown glacial silt has allowed older 2.1 mg L-1, but was only 0.4 mg L-1 in GTH 153.
glacial tills to have a more active soil depth and Lake GTH 119 is located over young till (Itk II),
increased soil moisture. As a result, well-drained whereas the GTH 153 watershed is situated on
surfaces could be observed over older tills. These an older glacial till. Thus, the higher concentra-
tills, therefore, have been chemically weathered tion of Na+ within GTH 119 could be the result
and have lost calcium and other ions. At the same of weathering and release from the permafrost
time, older tills have an accumulation of higher layer, unlike GTH 153 where MAT would utilize
amounts of peat accompanied by increased growth all nutrients and ions for plant growth.
of Sphagnum moss rendering soils more acidic The broad leaf vegetation communities such
(Walker et al., 2009). These factors lead to the as Shrub complex and MM complex exhibited
establishment of the MAT complex over older their influence over ionic composition via either
till surfaces. The MAT complex has vegetation Patch Density or other shape metrics. Patch Den-
species with deep root structures and with rela- sity of Shrub complex within buffer zones was
tively higher productivity rates. Therefore, we positively correlated with specific conductance,
considered the presence of MAT as an indicator indicating that if a watershed has a high number
of ion contribution via breakdown of organic soil of Shrub patches, it would exhibit higher spe-
content and transportation of ions through surface cific conductance. Specific conductance of the
and sub-surface water to lakes rather than an index water is a representation of the overall ionic
of actual weathering. strength of the water, irrespective of the relative
In contrast, younger tills have a shallow ac- contributions of dissolved constituents. Hence,
tive soil layer and do not display well developed in this study the regression model developed for
drainage system. The permafrost layer, which is specific conductance was able to explain more

140
Using Geospatial Techniques to Analyze Landscape Factors Controlling Ionic Composition

Figure 2. GTH 119 and GTH 153 MAT showing patches

variance than any regression model developed Landscape Shape Index of Shrub complex and
for individual ions. sulfate estimates. According this relationship, as
Vegetation species found in the Shrub complex the patches of Shrub complex are becoming in-
have deeper root systems, pointing to the presence creasingly irregular in shape, more sulfate is
of a relatively greater soil depth (Giblin et al., observed in the lakes. Complexity of patches
1991). Sited within the buffer zones of seasonal would be higher when patches are either growing
streams, the Shrub complex would experience or subjected to fragmentation processes. Hence,
more water flux, aiding ion release from the root it could be inferred that complex patches would
zones to the nearby surface waters. To further il- enhance the release of sulfate ions from soils into
lustrate this concept, two representative watershed surface waters. We also observed a negative cor-
examples are provided in Figure 3. GTH 125 has relation between fractal dimensions of Shrub
a very few number of patches within buffer zones patches occupying buffer zones around water
of streams, therefore, having low patch density channels indicating that complex patches of Shrubs
(7.4 per 100 hectares). In the watershed of GTH within the buffer zones would act as sinks for DIC
138, however, more patches of Shrub complex being transported to lake waters.
were present within the buffers leading to higher Shape complexity of MM complex expressed
patch density (52.7 per 100 hectares). Therefore, in terms of Landscape Shape Index, exhibited a
for the GTH 125 watershed, the lower number of positive relationship with calcium and DIC. MM
patches may indicate a relatively lower contribu- complexes are responsible for physical weathering
tion of Shrub complex towards ion release from due to their root growth (Marion et al., 1989). This
root zones into streams and eventually into the weathering activity makes carbonates available
lake. The result could be the lower conductance for exchange from parent material. MM complex
observed for GTH 125 (93.5μS cm-1). In GTH and lichens are responsible for primary physical
138, however, the higher patch density could and biochemical weathering of parent material
have increased ion release from root zones of the on higher slopes (Fedina, 1978). The currently
Shrub complex into seasonal streams, leading to observed relationship may be suggesting us that
a specific conductance of 193.4μS cm-1 of the the changes in shape complexity of MM complex
lake water. patches will have greater impact on weathering
The same mechanism at watershed level could parent material and eventually concentration and
be responsible for positive correlation between transport of Ca2+ and DIC to lakes.

141
Using Geospatial Techniques to Analyze Landscape Factors Controlling Ionic Composition

Figure 3. GTH 125 and GTH 138 showing Shrub patches

To support our argument, the following two (higher Landscape Shape Index) within buffer
watersheds are helpful (Figure 4). The GTH 125 zones of water channels displayed a negative
watershed has a lower number of contiguous correlation with sulfate concentrations in the lakes,
patches of MM complex, hence a low LSI index indicating that complex patches of Fen commu-
(16.7). In contrast, the MM complex in the GTH nity within buffers may act as sinks for this anion.
118 watershed has been dissected by Shrub com- While studying carbon mineralization in the Fen
plex and other land covers leading to an increased complex within the Upper Peninsula of Michigan,
number of patches distributed over a large area Keller and Bridgham (2007) noted that sulfate
resulting in a higher LSI index of 33.1. As noted reduction is the most likely mechanism of or-
earlier, the contribution of MM complex in physi- ganic carbon mineralization in anaerobic condi-
cal weathering of parent material would be lower tions within vegetative units similar to the Fen
in GTH 125, resulting in lower carbonates and complex. This observation supports the proposed
lake waters show a DIC concentration of 65.9 role of Fen complex as sink for sulfate. W GTH
mg L-1. In comparison, the GTH 118 watershed 133 and similar watersheds showed a very few
shows a, relatively higher possibility of physical contiguous patches within buffers of streams
weathering by MM complex and a higher DIC (LSI_B_Fen 12.1), whereas some watersheds like
concentration of 81.2 mg L-1 in lake water. GTH 120 had many small patches distributed
We found that the Shape complexity of Fen within the buffer zones of streams (LSI_B_Fen
communities was related to sulfate and chloride 41.05). The measured sulfate concentration in
concentrations. More complex Fen patches

142
Using Geospatial Techniques to Analyze Landscape Factors Controlling Ionic Composition

Figure 4. GTH 125 and GTH 118 showing MM complex patches

lake GTH 133 was 20.6 mg L-1 compared with 2 cations, especially K+. However, we could not find
mg L-1 in lake GTH 120. any logical reason behind the negative correlation
In contrast, the positive relationship between between chlorides and Till Age Category 2.
Mean Shape Index of Fen complexes occurring in Euphotic Zone Depth exhibited a positive
buffers and chloride estimates indicates that more correlation with specific conductance, calcium,
complex patches of Fen act as a source of chloride magnesium, potassium, and DIC estimates. These
for Arctic Foothill lakes. Fen complexes are fed relationships were likely fortuitous because eu-
by ground water and/or permafrost melt (Cooper photic depth plays a direct role for photosynthetic
& Wolf, 2006), indicating that the water chemistry activity (Wetzel, 2001) rather than controlling
of Fen complexes is governed by parent material concentration of cations and anions in the lakes.
of the surface. There could also be a relationship Overall, landscape metrics were successfully
between pH and variations in chemical properties able to explain more than 50% of the variance for
of Fen complex (Walker et al., 1994). Detailed most of the ions. Usually the ionic composition of
investigation of chemical parameters within Rich lakes is directly controlled by geological settings
Fen and Poor Fen communities may help reveal and weathering phenomena. However, there was
the underlying mechanism of cations and anions no detailed spatial geological information avail-
transport to and from Fen communities. able for the study region. Landscape metrics,
Even though not the most significant predictor especially, Prct_MAT and Prct_MNT were able
by itself, till age played a major role in explaining to overcome this limitation and served as proxies
variance for some of the cations and anions in the of geological conditions or till surfaces. Shape
study. We observed that Till Age Category 2, which Complexity metrics were useful in inferring prob-
was assigned to the Itk2 surface, was positively able root zone conditions and were also useful in
correlated with magnesium and DIC estimates. inferring weathering connections with concen-
The younger category (Till Age Category 1) trations of ions within lakes. At the same time,
representing the Itk2 readvance was positively complexity of patches helped to understand the
correlated with potassium concentrations in the role of vegetation communities as either a source
lakes. These observations follow the findings by or sink of various ions. However, more specific
Keller et al. (2007), who stated that streams on studies are required to understand thoroughly the
younger surfaces have higher concentrations of landscape controls over ion concentration in arctic

143
Using Geospatial Techniques to Analyze Landscape Factors Controlling Ionic Composition

lakes. Our study could be considered a primary of Alaska. These landscape factors suggest that
initial effort in this regard. vegetation types within watersheds are a result
This research could be enhanced with availabil- of climatic conditions, dominant weathering
ity of detailed geological maps. Currently, avail- processes, and they influence the ion recycling
able maps for Toolik lake region are at 1:250,000 via interactions with soil moisture and root zone
scale and do not include lake groups such as GTH processes.
153 and surrounding lakes. At the same time, there In the near future, arctic lakes will experience a
are certain factors which might have impacted our changing influence of factors governing ion inputs
findings. For example, the lake chemistry data was due to climate change. It is expected that the natural
collected during seasons (Table 1) for various lake transformation of MNT into MAT with aging of
groups. As mentioned by Cornwell (1992), each tills will be accelerated with rising temperatures
season might have experienced different patterns (Walker et al., 2001). Also, there will be increase in
of precipitation and insolation, resulting in the the amount of broad leaf vegetation such as Shrub
observed ion concentrations. Moreover, all the complex in arctic watersheds (ACIA, 2005). The
lake chemistry data was collected only once during previous studies of Pienitz et al. (1997), Marion
the thaw season. Thus, the current research does et al. (1989), Oechel (1989), and Cornwell (1992)
not capture within season variation as attempted have highlighted various probable mechanisms
by Cornwell (1992) and Marion et al. (1989). The for transportation of nutrients from land surfaces
satellite data used for this research was from a into surface water bodies. With implementation of
different date than the lake water sample collec- geospatial technology and remote sensing datasets,
tion. However, with then available access to the we were able to establish a set landscape metrics
region along with funding and time limitations, useful and repeatable for other arctic watersheds.
our research represents a unique attempt to link This research could also be expanded to study
landscape factors with lake-ion chemistry. watersheds influenced by human activities. A
To further enhance the study, we propose to combination of human induced processes such as
include metrics related to only MAT, MNT, and deforestation and natural factors would make a
broad leaf vegetation categories as the current key set of predicting landscape factors for surface
results suggest their strong correlation with ionic water quality (Steedman, 2000).
compositions of lakes. Expanding our data set to
include additional lakes would also improve the
predictive power of regression models. Further, if ACKNOWLEDGMENT
additional data were available, it would be useful
to adopt more advanced analytical techniques such This research was funded by the National Science
as Classification And Regression Trees (CART). Foundation (Grant 051604).

CONCLUSION REFERENCES

This study shows that the spatial and structural ACIA. (2005). Arctic climate impact assessment:
properties of MAT complex, MNT complex, and Scientific report. Cambridge, UK: Cambridge
broad leaf vegetation such as Shrub complex University Press.
and MM complex are useful indicators of ionic
composition of lakes in the Arctic Foothills region

144
Using Geospatial Techniques to Analyze Landscape Factors Controlling Ionic Composition

Canham, C. D., Pace, M. L., Papaik, M. J., Gangkofner, U. G., Pradhan, P. S., & Holcomb,
Primack, A. G. B., Roy, K. M., & Maranger, R. D. W. (2008). Optimizing the high-pass filter
J. (2004). A spatially explicit watershed-scale addition technique for image fusion. Photogram-
analysis of dissolved organic carbon in Adirondack metric Engineering and Remote Sensing, 74(9),
Lakes. Ecological Applications, 14(3), 839–854. 1107–1118.
doi:10.1890/02-5271
Gergel, S. E., Turner, M. G., & Kratz, T. K. (1999).
Congalton, R. G. (1991). A review of assessing Dissolved organic carbon as an indicator of the
the accuracy of classifications of remotely sensed scale of influence on lakes and rivers. Ecological
data. Remote Sensing of Environment, 37, 35–46. Applications, 9(4), 1377–1390. doi:10.1890/1051-
doi:10.1016/0034-4257(91)90048-B 0761(1999)009[1377:DOCAAI]2.0.CO;2
Cooper, D. J., & Wolf, E. C. (2006). Fens of the Giblin, A. E., Nadelhoffer, K. J., Shaver, G. R.,
Sierra Nevada, California. Retrieved April 14, Laundre, J. A., & McKerrow, A. J. (1991). Biogeo-
2011, from http://www.rigelstuhmiller.com/evan/ chemical diversity along a riverside toposequence
CooperWolfSierraFensFinalReport2006.pdf in arctic Alaska. Ecological Monographs, 61,
415–436. doi:10.2307/2937049
Cornwell, J. C. (1992). Cation export from Alaskan
arctic watersheds. Hydrobiologia, 240, 15–22. Griffith, J. A., Martinko, E. A., Whistler, J. L., &
doi:10.1007/BF00013448 Price, K. P. (2002). Interrelationships among land-
scapes, NDVI, and stream water quality in the US
Drever, J. I. (1994). The effect of land plants on
central plains. Ecological Applications, 12, 1702–
weathering rates of silicate minerals. Geochi-
1718. doi:10.1890/1051-0761(2002)012[1702:IA
mica et Cosmochimica Acta, 58(10), 2325–2332.
LNAS]2.0.CO;2
doi:10.1016/0016-7037(94)90013-2
Hamilton, T. D. (1980). Surficial geologic map of
Fedina, A. E. (1978). The structure of landscape
the Killik River quadrangle, Alaska. 1:250,000.
altitudinal zonality on the northern slopes of the
Washington, DC: U.S. Geological Survey.
Greater Caucasus in the Baksan Basin, USSR.
Arctic and Alpine Research, 10(2), 437–429. Hamilton, T. D. (1986). Late Cenozoic glaciation
doi:10.2307/1550774 of the Central Brooks Range. In Hamilton, T. D.,
Reed, K. M., & Thorson, R. M. (Eds.), Glacia-
Forman, R. T. T., & Gordon, M. (1986). Landscape
tion in Alaska: The geologic record (pp. 9–49).
ecology. New York, NY: John Wiley & Sons.
Anchorage, AK: Alaska Geological Society.
Fragstats Conceptual Background. (n.d.). Land-
Jensen, J. R. (2005). Introductory digital image
scape pattern metrics. Retrieved December
processing: A remote sensing perspective (3rd ed.).
22, 2007, from http://www.umass.edu/landeco/
Upper Saddle River, NJ: Prentice Hall.
research/fragstats/documents/Conceptual%20
Background/Background%20TOC.htm Johnson, L. B., & Gauge, S. H. (1997). Landscape
approaches to the analysis of aquatic ecosystems.
Frost, P. C., Larson, J. H., Johnston, C. A.,
Freshwater Biology, 37, 113–132. doi:10.1046/
Young, K. C., Maurice, P. A., Lamberti, G. A., &
j.1365-2427.1997.00156.x
Bridgham, S. D. (2006). Landscape predictors
of stream dissolved organic matter concentration
and physicochemistry on a Lake Superior river
watershed. Aquatic Sciences, 68, 40–51.

145
Using Geospatial Techniques to Analyze Landscape Factors Controlling Ionic Composition

Johnson, L. B., Richards, C., Host, G. E., & Marion, G. M., Hastings, S. J., Oberbauer, S.
Arthur, J. W. (1997). Landscape influence on F., & Oechel, W. C. (1989). Soil-plant element
water chemistry in Midwestern stream ecosys- relationships in a tundra ecosystem. Holarctic
tems. Freshwater Biology, 37(1), 193–218. Ecology, 12, 296–303.
doi:10.1046/j.1365-2427.1997.d01-539.x
McGarigal, K., Cushman, S. A., Neel, M. C., &
Kearns, F. R., Kelly, N. M., Carter, J. L., & Resh, Ene, E. (2002). FRAGSTATS: Spatial pattern
V. H. (2005). A method for the use of landscape analysis program for categorical maps. Retrieved
metrics in freshwater research and management. April 11, 2011, from http://www.umass.edu/lan-
Landscape Ecology, 20, 113–115. doi:10.1007/ deco/research/fragstats/fragstats.html
s10980-004-2261-0
Nolan, M. (2003). Watershed-scale analyses of
Keller, J. K., & Bridgham, S. D. (2007). Path- an arctic drainage using a new high resolution
ways of anaerobic carbon cycling across an digital elevation model. Retrieved April 14, 2011,
ombrotrophic-minerotrophic peatland gradient. from http://www.drmattnolan.org/olderprojects/
Limnology and Oceanography, 52(1), 96–107. kuparuk/drmatt_kup_agu.htm
doi:10.4319/lo.2007.52.1.0096
O’Neill, R. V., Hunsekar, C. T., Jones, K. B.,
Keller, K., Blum, J., & Kling, G. W. (2007). Geo- Riiters, K. H., Wickham, J. D., & Schwartz, P.
chemistry of soils and streams on surfaces of vary- M. (1997). Monitoring environmental quality at
ing ages in arctic Alaska. Arctic, Antarctic, and the landscape scale. Bioscience, 47(5), 513–519.
Alpine Research, 39, 84–98. doi:10.1657/1523- doi:10.2307/1313119
0430(2007)39[84:GOSASO]2.0.CO;2
Pienitz, R., Smol, J. P., & Lean, D. R. S. (1997).
King, R. S., Baker, M. E., Whigham, D. F., Weller, Physical and chemical limnology of 24 lakes
D. E., Jordan, T. E., Kazyak, P. F., & Hurd, M. located between Yellowknife and Contowoyto
K. (2005). Spatial considerations for linking Lake, Northwest Territories (Canada). Canadian
watershed land cover to ecological indicators in Journal of Fisheries and Aquatic Sciences, 54,
streams. Ecological Applications, 15, 137–153. 347–358. doi:10.1139/f96-275
doi:10.1890/04-0481
Riera, J. L., Magnuson, J. J., Kratz, T. K., & Web-
Lesack, L. F. W., & Melack, J. M. (1991). The ster, K. E. (2000). A geomorphic template for the
deposition, composition, and potential sources of analysis of lake districts applied to the Northern
major ionic solutes in rain of the central Amazon Highland Lake District, Wisconsin, USA. Fresh-
Basin. Water Resources Research, 27, 2953–2977. water Biology, 43, 301–318. doi:10.1046/j.1365-
doi:10.1029/91WR01946 2427.2000.00567.x
Likens, G. E., Bormann, F. H., Pierce, R. S., Eaton, Schimel, J. P., & Clein, J. S. (1996). Microbial
J. S., & Johnson, N. M. (1977). Biogeochemistry response to freeze-thaw cycles in tundra and
of forested ecosystem. New York, NY: Springer. taiga soils. Soil Biology & Biochemistry, 28,
doi:10.1007/978-1-4615-9993-7 1061–1066. doi:10.1016/0038-0717(96)00083-1
Liu, Z., Hallberg, G. R., & Malanson, G. P. (1997). Smith, R. J. (1995). Calcium and bacteria. Ad-
Structural equation modeling of dynamics of vances in Microbial Physiology, 37, 83–133.
nitrate contamination in ground water. Journal doi:10.1016/S0065-2911(08)60144-7
of the American Water Resources Association,
33(6), 1219–1225. doi:10.1111/j.1752-1688.1997.
tb03548.x

146
Using Geospatial Techniques to Analyze Landscape Factors Controlling Ionic Composition

Steedman, R. J. (2000). Effects of experimental Walker, D. A., Hamilton, T. D., Munger, C. A.,
clearcut logging on water quality in three small & Maier, H. A. (2009). Glacial history and long-
boreal forest lake trout (Salvelinus namaycush) term ecology in the Toolik Lake region. In Arctic
lakes. Canadian Journal of Fisheries and Aquatic LTER (Ch. 3). Retrieved April 14, 2011, from
Sciences, 57, 92–96. doi:10.1139/f00-119 http://ecosystems.mbl.edu/StaffWeb/jhobbie/
arcbook/chapter3_environmental_settings.pdf,
Stewart, J. S., Wang, L. Z., Lyons, J., Hor-
draft chapter
watich, J. A., & Bannerman, R. (2001). Influ-
ences of watershed, riparian-corridor, and Walker, M. D., Walker, D. A., & Auerbach, N. A.
reach-scale characteristics on aquatic biota in (1994). Plant communities of a tussock tundra
agricultural watersheds. Journal of the American landscape in the Brooks Range Foothills, Alaska.
Water Resources Association, 37, 1475–1487. Journal of Vegetation Science, 5, 843–866.
doi:10.1111/j.1752-1688.2001.tb03654.x doi:10.2307/3236198
Stine, R., Chaudhuri, D., Ray, P., Pathak, P., & Webster, K. E., Kratz, T. K., Bowser, C. J., Mag-
Hall-Brown, M. (2010). Comparing image clas- nuson, J. J., & Rose, W. J. (1996). The influence
sification techniques for Arctic tundra land cover, of landscape position on lake chemical responses
Toolik Lake, Alaska. GI Science and Remote Sens- to drought in northern Wisconsin. Limnology and
ing, 47(1), 78–98. doi:10.2747/1548-1603.47.1.78 Oceanography, 41(5), 977–984. doi:10.4319/
lo.1996.41.5.0977
Turner, M. G. (1989). Landscape ecology: The ef-
fect of pattern on process. Annual Review of Ecol- Wetzel, R. G. (2001). Limnology: Lake and river
ogy and Systematics, 20, 171–197. doi:10.1146/ ecosystems (3rd ed.). New York, NY: Academic
annurev.es.20.110189.001131 Press.
Walker, D. A., Bockheim, J. G., Chapin, F. S. III, Xiao, H., & Ji, W. (2007). Relating landscape char-
Eugster, W., Nelson, F. E., & Ping, C. L. (2001). acteristics to non-point source pollution in mine
Calcium-rich tundra, wildlife, and the “Mam- waster-located watersheds using geospatial tech-
moth Steppe.”. Quaternary Science Reviews, 20, niques. Journal of Environmental Management,
149–163. doi:10.1016/S0277-3791(00)00126-8 82, 111–119. doi:10.1016/j.jenvman.2005.12.009

147
Using Geospatial Techniques to Analyze Landscape Factors Controlling Ionic Composition

APPENDIX A

Table 6. Acronyms for landscape factors

Landscape Metrics Vegetation Community Acronym


Aquatic Vegetation Complex Prct_AV
Moist Acidic Tundra Complex Prct_MAT
Moist Non-Acidic Tundra Complex Prct_MNT
Shrub Complex Prct_Shrub
Fen Complex Prct_Fen
Percentage
Heath Complex Prct_Heath
Mountain Meadow Complex Prct_MM
Riparian Complex Prct_Rip
Snowbed Complex Prct_SB
Barren/Thermokarst Prct_BT

Aquatic Vegetation Complex PD_AV


Moist Acidic Tundra Complex PD_MAT
Moist Non-Acidic Tundra Complex PD_MNT
Shrub Complex PD_Shrub
Fen Complex PD_Fen
Patch Density
Heath Complex PD_Heath
Mountain Meadow Complex PD_MM
Riparian Complex PD_Rip
Snowbed Complex PD_SB
Barren/Thermokarst PD_BT

Aquatic Vegetation Complex MS_AV


Moist Acidic Tundra Complex MS_MAT
Moist Non-Acidic Tundra Complex MS_MNT
Shrub Complex MS_Shrub

Mean Shape Index Fen Complex MS_Fen


(Percent change) Heath Complex MS_Heath
Mountain Meadow Complex MS_MM
Riparian Complex MS_Rip
Snowbed Complex MS_SB
Barren/Thermokarst MS_BT

continued on following page

148
Using Geospatial Techniques to Analyze Landscape Factors Controlling Ionic Composition

Table 6. Continued

Landscape Metrics Vegetation Community Acronym


Aquatic Vegetation Complex LSI_AV
Moist Acidic Tundra Complex LSI_MAT
Moist Non-Acidic Tundra Complex LSI_MNT
Shrub Complex LSI_Shrub
Fen Complex LSI_Fen
Landscape Shape Index
Heath Complex LSI_Heath
Mountain Meadow complex LSI_MM
Riparian Complex LSI_Rip
Snowbed Complex LSI_SB
Barren/Thermokarst LSI_BT

Aquatic Vegetation Complex FD_AV


Moist Acidic Tundra Complex FD_MAT
Moist Non-Acidic Tundra Complex FD_MNT
Shrub Complex FD_Shrub
Fen Complex FD_Fen
Fractal Dimension (Percent Change)
Heath Complex FD_Heath
Mountain Meadow Complex FD_MM
Riparian Complex FD_Rip
Snowbed Complex FD_SB
Barren/Thermokarst FD_BT

Aquatic Vegetation Complex MP_AV


Moist Acidic Tundra Complex MP_MAT
Moist Non-Acidic Tundra Complex MP_MNT
Shrub Complex MP_Shrub
Fen Complex MP_Fen
Mean Patch Size
Heath Complex MP_Heath
Mountain Meadow Complex MP_MM
Riparian Complex MP_Rip
Snowbed Complex MP_SB
Barren/Thermokarst MP_BT

Euphotic Zone Depth EZD


Lake Order Lake Order
Maximum Depth Max_Depth
Shoreline Development Factor SDF
Note: When these indices were calculated at buffer level, a suffix of “_B_” was added in their acronyms.

149
Using Geospatial Techniques to Analyze Landscape Factors Controlling Ionic Composition

APPENDIX B

Figure 5. Expert classification rule used for snowbed complex

This work was previously published in the International Journal of Applied Geospatial Research (IJAGR), Volume 3, Issue 3,
edited by Donald Patrick Albert, pp. 37-57, copyright 2012 by IGI Publishing (an imprint of IGI Global).

150
151

Chapter 13
Mobile Geographic
Information Systems
Yang Li
University of East London, UK

Allan J. Brimicombe
University of East London, UK

ABSTRACT
This chapter introduces the concept of Mobile Geographical Information Systems (Mobile GIS) as an
evolution of conventional GIS to being available on wireless mobile devices such as smart phones. The
evolution of the technology and its applications are charted in this chapter. The main elements of Mobile
GIS are then discussed. This focuses on: GIS servers; wireless mobile telecommunication networks;
wireless mobile devices; location-awareness technology; and gateway services. This is followed by a
discussion of the main features in terms of the services and usage of Mobile GIS: mobility; real-time
connectivity; location-awareness; broadened usage. Mobile Geographical Information Systems are an
important facilitating technology for Location-Based Services (LBS). A range of applications of Mobile
GIS for smart phones are described. The chapter closes with a discussion of the prospects and challenges
for Mobile GIS. Challenges derive from four broad areas: limitations that derive from the technologies
being used; areas of GIScience that still need to be adequately researched; users; and business models
for a sustainable presence.

INTRODUCTION With the fast development and increasingly wide


usage of smart phones, Mobile GIS have become
Mobile Geographic Information Systems (Mobile embedded in many Location–Based Services
GIS) can be viewed as an outcome of the conver- (LBS) applications, particularly in navigation
gence of wireless mobile technologies and GIS functions, which are provided via smart phones.
to have real-time and mobility characteristics. Mobile GIS can also be considered as a core part
of LBS. Mobile GIS integrates GIS and wireless
DOI: 10.4018/978-1-4666-2038-4.ch013 mobile communication technologies to offer a

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Mobile Geographic Information Systems

real-time capability of interacting with external a discussion of the challenges and research issues
software and remotely accessing and managing raised by Mobile GIS, such as the influence of
data and information. fast development of mobile technologies.
In this Chapter, the development of Mobile GIS
from conventional GIS is presented in the context
of GIS functionalities and mobile technologies and FROM CONVENTIONAL
smart phones. The principles and main features GEOGRAPHIC INFORMATION
of Mobile GIS are addressed in a systematic way, SYSTEMS (GIS) TO MOBILE GIS
covering topics regarding its real-time connectiv-
ity, mobility and location-awareness. Mobile GIS In mid-1960s, with the prospect of handling and
takes advantage of mobile technologies (e.g. wire- analysing spatial data digitally, the beginning of
less networks, mobile devices, smart phones) to geographic information systems (GIS) had been
offer the possibility of exchanging and analysing explored in both professional and academic areas,
spatial information in a real geographic world in as shown in the time line in Figure 1. In Canada,
real-time. Mobile GIS also offers the potential for a prototype of GIS was started for the Canada
users to employ GIS in more mobile and diverse Land Inventory which aimed to map existing
situations and for a range of applications through land uses and analyse land capability for forestry,
smart phones. Mobile GIS are further presented agriculture, wildlife and recreation (Tomlinson,
in the context of Location-Based Services (LBS). 1984). The output of this early stage of GIS was
LBS have been made possible due to the matura- regarded the cost-effective way for mapping the
tion and convergence of a range of heterogeneous whole land area of Canada at the time. This sys-
technologies, such as mobile phones, the Internet tem became fully operational in 1971. In 1964,
and the Web, global positioning system (GPS) and in the Laboratory for Computer Graphics and
GIS. LBS are defined as the delivery of spatial Spatial Analysis, Harvard Graduate School of
data and information services where the content Design, USA, the first digital mapping software,
of those services is tailored to the current or some named SYMAP, was created in 1964. SYMAP
projected location and context of a mobile user used line printers to produce primitive maps to
(Brimicombe & Li, 2009). LBS have been con- visualise landscape themes of human and physical
sidered by many to have evolved professional GIS phenomena for identifying spatial similarities and
applications to more public-centric services and groupings (McHaffie, 2000). These maps were
thus promoting greater ubiquity for GIS. Later in represented as coarse lines of equally spaced
this Chapter, a wide range of Mobile GIS applica- characters and symbols. Evolving from SYMAP,
tions via smart phones are covered, including areas the GRID package with command-line user in-
such as on-site data collection and monitoring; terface and ODYSSEY as a line-based (vector)
navigation with real-time updates; wayfinding prototype were developed as GIS software in the
for individuals to orientate and discover routes; Laboratory in 1970s. Also in mid-1960s, the US
real-time tracking of contacts; mobile commerce Bureau of Census aimed to produce digital maps
(m-commerce) with location-awareness; user- of street blocks and census tracts for supporting
solicited information for a range of business and 1971 census. This introduced explicit topology
social purposes; coordinating emergency and into the data structure. These early developments
maintenance responses to accidents and essential set the foundation of GIS, such as raster (grid) and
services; location-based artistic expression in the vector (line) data structures, the use of database
community, mobile gaming where the players and management for spatial data, the use of data lay-
actions are location–based. The chapter ends with ers (treating each theme as a layer of data), and

152
Mobile Geographic Information Systems

Figure 1. Time line of the development of mobile GIS and associated technologies

the use of topology in handling objects and their Data entry can be either the importing of existing
spatial relationships (topological relationships of digital data set(s), or digitising of secondary
points, lines and polygons). sources (such as existing paper-based maps).
The basic range of technological functionality Spatial and attribute data from different sources
associated with GIS can be generally summed up can be integrated in a GIS database on the basis
as: data entry and editing; transformation; query; of geography. Data records can be geocoded, that
interpolation, cartographic processing, map alge- is have x,y co-ordinates attached to them, through
bra and thematic mapping (Brimicombe, 2010). the use of geographic base files used for address-

153
Mobile Geographic Information Systems

matching and postcode-matching. Data can also Geographic Information Systems (GIS) are
be edited and created within GIS. Transformation often regarded as software; however, the data
in GIS mainly includes data conversion between aspect of GIS is equally important as the software
vector and raster, between different map projec- and GIS also engage people and organisations
tions, and between coordinate systems. Gener- (Brimicombe and Li, 2009) in the production of
alisation can also be viewed as an important form spatial data and the use of software. GIS involve
of a transformation in GIS, such as attribute re- data collection and integration, processing, ana-
classification and spatial aggregation. Examples lysing and presenting in order to communicate
of spatial aggregation can be re-representing analyses of physical and social phenomena that can
clustered points as zones, or joining a number of assist in decision-making and problem-solving.
small zones into a larger one. It can also be sim- The problems and issues analysed, being both
plifying lines or dissolving dividing lines to reduce scientific and practical, are overwhelmingly geo-
the complexity of objects. Query is an important graphical in nature and focus on location. Thus
functionality in GIS. Query can be searching for GIS was traditionally presented as a technology
spatial objects according to specific attributes, or to support science and problem solving; it has
conversely viewing the attributes of selected come to be more than this (Longley, Goodchild,
objects or regions. Query also includes calculat- Maguire, & Rhind, 2005). During the 1990s GIS
ing the area of objects and distances between emerged as a research discipline in its own right,
them, as well as generating a range of statistics often termed GIScience, to study substantive
of both the spatial objects and their attributes. questions arising out of the use of GIS tools and
Interpolation in GIS includes point to area (such spatial data. Longley et al. (2005) discuss some
as Theissen polygons), point to field (such as us- of different views on GIS such as: a professional
ing computational techniques for contouring), user view of GIS as a tool and an analysis engine
area to point (such as centroid of a polygon), area to examine data and real patterns relationships
to field (contouring from centroid), and area to and anomalies; view from decision makers and
area (such as zone transformation). Another func- planners of GIS as a computerised tool to solve
tionality of GIS is cartographic processing, geographic problems; a general public view of
mainly on vector layers, which consists of overlay GIS as a container of maps in digital form.
and buffering. Overlay is fusing a number of lay- Starting in the early 1990s, new opportunities
ers into a new layer containing rebuilt topology became available to access and use maps on the
and attributes of each contributing layer often Internet. These quickly evolved to incorporate
reclassifying using Boolean operators. Buffering basic GIS functionality of pan, zoom, and map
can be performed on points, lines or polygons. customisation to be followed later by wayfinding
Map algebra in GIS is mainly for raster data. tools for route selection. Specific products to de-
Raster layers can be added, subtracted, multiplied, liver server side Internet GIS through a browser
divided, and can also be weighted and compared interface started to be available by in the mid to
for maximum and minimum values. Thematic late 1990s, as seen in Figure 1. This represented a
mapping in GIS is for visualising data, with func- fundamental shift whereby different levels of GIS
tions such as choosing layer combinations, class functionality and data sources could be accessed
intervals, as well as colours, patterns, symbols, and used online by remote users. Users could
and layout. Producing a good thematic map is a have a choice of client side devices from desktop
very important aspect in GIS as it is the prime computers, laptops or personal digital assistants
means of communicating data and the outcomes (PDA). GIS utilised through an Internet connection
of analyses. are considered as Internet GIS, a phrase which

154
Mobile Geographic Information Systems

Figure 2. Basic elements of Internet GIS

was in use by the late 1990s (Peng, 1999). There GIS software will be activated and relevant data
have been different terms to describe Internet sets opened; the request is processed using GIS
GIS, such as Web-based GIS (Doyle, Dodge, & functionality and the results sent back to the user,
Smith, 1998), on-line GIS, distributed geographic usually as image files. This is the basic principle
information (Plewe, 1997). Figure 2 illustrates the of how the Internet GIS operate. There is a range
main concept of Internet GIS. On the client side, of technologies associated with the handling user
users send requests for GIS functions through a requests from browsers, which can be from desk-
Web browser via an Internet connection to the top computers, laptops, hand-held devices to smart
server side where GIS software and data reside. phones; for parsing user requests to the GIS soft-
Requests have to be processed at server, and the ware; and for data exchange between Web and
results or applets are then sent back to users via GIS software.
the Internet. There are four basic parts in Internet Apart from the professional usage of Internet
GIS, which are the user (the client), and the map GIS such as for the use of mapping and data col-
server, data server and Web server which reside lection in the field, popular applications accessed
in the application server (Peng & Tsou, 2003). by the public are Web-based mapping sites such
Two different approaches can be adopted in as Google maps (http://maps.google.com) which
principle for Internet GIS, known as the server- through their API allow other users to embed maps
based approach and the client-based approach; into their own Web pages. Access to and use of
most commonly implementations are server- these sites from smart phones is in many ways
based. In general, the ‘client’ is the side where very similar to the way these sites are accessed
requests are made whereas the ‘server’ is the side from a PC or laptop as they take no account of
where requests are processed (Peng & Tsou, 2003). whether the user is static (at home or in an office)
Internet GIS using a server-based approach has or on the move.
GIS software residing at the server site. Users The advantages of server-based Internet GIS
will access and employ GIS functionalities via are mainly on its characteristics of centralisa-
the Internet from their local computers or de- tion. GIS hardware and software are managed
vices. The Web browser is used as a graphic user and updated centrally, which can not only be
interface at the client side. The data processing maintained more easily than when decentralised,
and analysis are performed by GIS software in- but also allow users to have consistent access to
stalled at server side. At the client side, users send more up-to-dated functionalities. The server can
their requests through the Internet to the server. also have much more computing power and data
When the request is received at the server side, storage than mobile devices for example and are

155
Mobile Geographic Information Systems

able to process complex operations more quickly. functionality. Furthermore, distributed client/
However, Internet GIS based on this approach server approach is being developed and employed
can face the challenge when a large number of which can offer more benefit. There will be some
users send requests to a server simultaneously. discussion in next section.
Since for all user requests the functionality has With the development of mobile telecommu-
to be preformed at the server side instead of at nication network into its third generation (3G) in
the user (client) side, the system can slow down 2000s, the network offers the capability of both
or even crash in responding to large numbers of voice and data communication with high speed
user requests. data transmission of 144Kpbs and latterly up to
Another approach of Internet GIS is client- 8Mbps with the prospect of higher speeds. It can
based. It provides users at the client side the ability support mobile Internet access and multimedia
to run GIS data processing and analysis locally. applications. Mobile devices, particularly mobile
GIS functionalities as users require are down- phones (cellular phones), have seen rapid devel-
loaded from the server onto user’s local devices opment from the early 1990s onwards. Mobile
via the Internet, which can include different level phones have become one of the most widely used
of GIS functions, required data and models. Small electronic devices, largely viewed as a necessity
executable application applets can be created to of daily life by many people. Smart phones have
carry out different tasks such as managing spatial the capability of always-on data connection, which
data, handling maps, performing basic analysis. offers access to mobile Internet with aims of ‘any
Such applets can be Java applets, ActiveX con- time, any place and anywhere’ usage both for
trols, or other suitable plug-ins for mobile devices. voice and data services. There is a convergence
These application applets and plug-ins can be in mobile device development, which means the
downloaded at runtime to users’ local computers boundary is becoming blurred between types of
or their mobile devices. The client-based approach data-oriented handheld devices (such as tablets
Internet GIS have the advantage of reducing the and PDA) and mobile phones. Smart phones can
server side burden of running GIS and data traffic be viewed as one such product. A further techno-
on the Internet. On the other hand, there will be less logical development is the miniaturisation of GPS
centralised GIS function and data management. receivers and accelerometers to small chips so
Furthermore, more sophisticated and advanced that they can be integrated within mobile devices
GIS functionalities may not be possible for users making them location-aware. These technologi-
to access and perform on client-based Internet GIS cal developments (shown in Figure 1) and high
due to lack of power and data storage. levels of ownership of mobile devices provide
The approach taken for Internet GIS implemen- opportunities for the rise of wireless Mobile GIS.
tation, therefore, depends on the criteria of appli-
cations, user volume, telecommunication network
capability, and so on. In principle, the Internet GIS THE PRINCIPLES AND
with server-based approach are more appropriate FEATURES OF MOBILE GIS
to users with certain demands for advanced GIS
analysis and modelling functions. The volume of Wireless Mobile GIS are often shortened to wire-
such users accessing any one particular site is less less GIS or mobile GIS. In this chapter, we use
likely to be very high. Employing client-based the term: Mobile GIS. Mobile GIS have resulted
approach, Internet GIS would be more suitable from the convergence of wireless mobile tech-
to applications with large user volume, but not nologies, GIS and GPS, offering users real-time
necessary requiring professional advanced GIS access while on the move using devices that are

156
Mobile Geographic Information Systems

themselves location-aware. This should not be conventional GIS. As discussed in the last section,
confused with the early usage of the term ‘mobile Internet GIS adopt a client/server approach to
GIS’ which were mainly referred to those GIS used delivery of GIS services to users. In Mobile GIS,
for working on site, in the field, through pocket when based on client/server approach, a distrib-
PCs or PC tablets. Those GIS systems could also uted component framework is often employed for
be connected to GPS to collect and manage spatial users to access GIS. This concept of distributed
data, but most operate as stand-alone GIS. Mobile component framework is different from conven-
GIS using wireless technologies enable the real- tional client/server structure. Distributed compo-
time connection. Thus, mobile devices (such as nent technology allows clients to access hetero-
smart phones) can synchronise their local data geneous servers dynamically (Tsou, 2004).
with the database of the GIS on the server, and Utilising such technology, the applications on
access and manage data remotely in real-time. either client or server side are broken up into a
Mobile GIS offer the capability of accessing range of components that can be interoperable
external GIS functionality and applications for across networks, operating systems, applications,
users on the move. tools, languages and hardware. Distributed com-
Numerous components are associated with ponents can be structured as reusable components
Mobile GIS. Here we focus on the following key which are able to manage complex requests from
elements of Mobile GIS as illustrated in Figure 3: client side. There are a range of benefits in adopt-
GIS servers; wireless mobile telecommunication ing a distributed component approach in Mobile
networks (connectivity); mobile handheld devices GIS. One of them is the flexibility of client and
(such as smart phones); location-awareness tech- server roles in as much as a server can become a
nology; gateway services. client to another server or a client can act as
GIS servers provide GIS functionalities, data server depending on the distribution and func-
and applications. The core of GIS, such as GIS tionality of components. Another advantage of
data structure and basic functionalities, follow using distributed component in GIS is that more
the same principles of conventional GIS. How- complex requests from the client can be managed
ever, Mobile GIS are not stand-alone as are with sequence priority to offer more efficiency.

Figure 3. Key elements in mobile GIS

157
Mobile Geographic Information Systems

This is can be beneficial when a server is busy dynamic and efficient client/server interaction.
when dealing with large numbers of client side Another advantage is Java’s ability of cross plat-
requests. Peng and Tsou (2003) discuss in detail form performance. There is also Java’s dynamic
the advantages of distributed components in the binding feature with downloadable Java applets.
Internet GIS. Three major component technologies However, one of the disadvantages of the Java
used in the component-based application develop- platform is that in comparison with machine level
ment have been widely applied in developing programming with binary coding, running Java
component-based GIS (Wu, Zhang, & Goddard, programs is much slower due to its byte coding
2004). They are: Common Object Request Broker nature. Also there is no standardised distributed-
Architecture (CORBA) specification from Object object infrastructure in using Java. The reason we
Management Group (OMG); Distributed Com- put Java as an example is that Java is one of the
ponent Object Model (DCOM) from Microsoft technologies that has been successfully deployed
Corporation; and Java Remote Method Invocation for Internet GIS, and is still evolving.
(RMI) from Sun Microsystem Inc. A distributed For those architectures that rely on networks
component framework can be applied in Mobile for communication between the server and the
GIS to achieve these benefits. client, client/server framework and distributed
Distributed Mobile GIS components can be component framework are interrelated. Internet
used to provide applications across networks. GIS use Web browser as their user interface.
Take the example of one of the popular distrib- Distributed component applications can use Web
uted component technologies used for mobile browser as the user interface, but also could use
devices – the Java platform. Java platform is their own GUI. Different protocols may be used for
able to offer support for distributed components distributed components in Mobile GIS other than
in Mobile GIS, particularly Java Micro Edition the HTTP protocol which is mostly used in Internet
(Java ME). Java ME emphases applications used GIS. In general, GIS with distributed components
in mobile handheld devices, whilst Java SE (Java do not have all of the system components in the
Standard Edition) is for more general client side same physical location. These components could
applications. Java SE has been used for many be the processing, the database, the rendering or
Internet GIS applications and Internet mapping. the user interface. Mobile GIS adopt a distributed
On the other hand, Java Enterprise Edition (Java component framework to support their services.
EE) focuses on server side applications of GIS, The second key element in Mobile GIS is
such as building distributed components that can the mobile telecommunication network, which
be reused for various services. Java ME, with provides wireless connectivity between Mobile
its focus on components for wireless handheld GIS users and the network where the main GIS
devices, can be used to build applications for software, data and applications reside. In the early
Mobile GIS. The major components of Java ME 1980s, the first generation (1G) analogue mobile
consist of mobile information device profiles, network was in operation to provide voice traffic.
connected device configuration and connected Since then the mobile communication networks
limited device configurations, aiming to provide have developed through the second generation
utility programmes, security and connectivity for (2G) in 1990s with digital techniques and data
small mobile devices communication ability although limited, to the
There are a number of advantages to the Java third generation (3G). 3G networks provide high
platform. One is that it can offer a dynamic com- speed data transmission, which can support fast
ponent framework of Java applets for client side mobile Internet access and multimedia applica-
and servlets for server side, which enables a more tions. Only at this phrase of development in mo-

158
Mobile Geographic Information Systems

bile wireless communication networks, Mobile of maps), processing power, battery consumption
GIS can be realistic and deliver services. Such and so on. Different types of Mobile GIS applica-
networks enable users to access and use GIS, data tions would emphasise different requirements of
and services in mobile situation and in real-time. mobile devices. Applications could be navigation,
However, geographical data are usual result in finding nearest point of interest, real-time games,
large file sizes, therefore needing high bandwidth to real-time data collection. Some might require
and data transfer speeds to deliver. The bandwidth more fast and powerful processing capability and
and data speed of the network have direct impact screen quality, while others require more accurate
on services provided by Mobile GIS. location-awareness technology.
Another element of Mobile GIS is mobile Location-awareness technologies can be
handheld devices. A mobile handheld device is viewed as a key element in Mobile GIS. Conven-
also known as mobile device, handheld device, or tional GIS is usually operated without concerning
handheld. It is a pocket-sized computing device where users are. It is similar for accessing Internet
with wireless connection, typically having a display GIS. Mobile GIS deliver services and applications
screen with touch input and/or a miniature key- to users’ mobile handheld devices, often related
board. Laptops are not viewed as mobile devices to the user’s mobile situation and location. A
due to their size. Mobile handheld devices have typical application example can be providing a
developed over time, including handheld PDAs, user on the move the required route to a certain
Ultra-mobile PCs, Tablet PCs and smart phones. destination based on where they are at that mo-
In order to access Mobile GIS applications and ment. The current location of the user is essential
services, the type of mobile device needs to be more for this service. There are a range of positioning
sophisticated in its connectivity, and computing technologies that can be employed in mobile de-
capability in comparison to basic mobile phones vices, such as Global navigation Satellite Systems
focused on voice and text message communication (GNSS) including Global Positioning System
or conventional handheld computers with limited (GPS), mobile network positioning technologies,
connection. Smart phones belong to this type of wireless and RFID location technologies, sensor
mobile handheld device, which is the emphasis of positioning, visual-aided positioning and hybrid
this book. A smart phone can be regarded as a hand- indoor positioning. These positioning technologies
held mobile device that represents the convergence are discussed fully in various Chapters in this book.
of mobile phones with handheld computers. Such In order to enable mobile handheld devices
mobile devices have the functions of mobile phones (e.g. smart phones) to access GIS servers, mid-
with wireless telecommunication connection, and dleware is required to link them up. This piece
operating systems to install and run advanced ap- of middleware is the gateway service, which is
plications. Furthermore, operating system software another element of Mobile GIS. Gateway services
offers a platform to develop applications. As dis- provide the capability of translating user requests
cussed early in the section, Java ME is one such from mobile devices to GIS servers and format-
platform. More complex applications can also be ting the output back to users at mobile device
installed and run based on a specific platform in side. Gateway services also perform tasks such
a smart phone. Mobile handheld devices are con- as protocol conversion.
tinuously evolving and converging. Applications Having discussed a number of key elements in
of Mobile GIS need to be designed and developed Mobile GIS, we now can look into the main fea-
with characteristics of mobile handheld devices tures in terms of the services and usage of Mobile
in mind, such as relatively small screen size (GIS GIS. These features are mobility; real-time con-
visualisation usually associated with the display nectivity; location-awareness; broadened usage.

159
Mobile Geographic Information Systems

Mobility is one important feature of Mobile physical surrounding, current situation, and their
GIS. Mobile GIS has the ability to operate and personal preferences and requirements.
deliver functionalities, data and services without Location-awareness is another important fea-
necessarily requiring a fixed location or wired ture of Mobile GIS, which differentiate Mobile
connection. Mobility feature can also be seen as GIS from conversional and Internet GIS. Mobile
portability from one aspect. Users can access GIS devices are position-enabled, such as GPS-enabled
via their handheld devices while on the move. The smart phones; therefore the location of a mobile
mobility feature of Mobile GIS has three charac- device and their user can be identified. A range
teristics, which are: mobile connectivity provided of positioning technologies (as discussed previ-
by wireless telecommunication networks; mobile ously and elsewhere in this book) is employed for
handheld devices (e.g. smart phones) used to ac- mobile devices, such as smart phones, to enable
cess GIS functions and applications; GIS with them to be location-aware. Using such location-
distributed component framework where com- aware mobile devices, more spatially-relevant
ponents could be the processing, the database or GIS applications and services can be available
the user interface at different physical locations to users according to their location. For example,
(servers) to deliver mobile services. Mobile GIS information can be received for fixed points of in-
enable data can be captured, stored, and man- terest such as nearest restaurant, petrol station and
aged in remote locations and have the advantage hospitals, or for mobile subjects of interest such
of real-time updating without storing duplicated as the whereabouts of friends and children. Such
data. Data can be available from remote systems devices also enable users’ location to be sent back
to mobile users. Information can be uploaded and to servers, which allows Mobile GIS to manage
downloaded between mobile devices and remote and process real-time data and provide up-to-date
systems. The feature of mobility brings GIS out of applications and services more relevant to users.
fixed locations into the field and for people on the Mobile GIS have broadened GIS user groups.
move. Use of GIS in mobile situations brings more Conventional GIS focus more on professional users,
emphasis on the interaction between the human, both in physical geography and in studying social
mobile device and the environment in human- phenomena to assist decision making and problem
computer interaction (HCI) research (Li, 2006). solving. GIS have been and continue to be utilised
Another main feature of Mobile GIS is the for analysing spatial data, discovering patterns,
real-time connectivity. With this feature, Mo- relationships and anomalies to support science
bile GIS offer the possibility of exchanging and and problem solving. On the other hand, Mobile
analysing geographical information in real-time GIS also provide functionalities and applications
in a real geographic world. The interactive ser- to the wider public - services relevant to people’s
vices can be provided with real-time refresh and daily lives. GIS functionality would be accessed
update. This allows more complex applications and used by people having no GIS professional
such as augmented reality where new data will knowledge. These services can range from: viewing
be delivered as location is changed. It also opens maps from users’ mobile phones, finding nearest
door to applications of push-services, such as facilities, wayfinding, delivering location-related
proximity advertising. For example, warning mes- messages, playing mobile games. Such a wide
sages can be sent to users who are in such areas range of user-centred services necessitates more
where disruption and danger are or might occur. emphasis on the interaction between users, GIS
The context of users can be taken into account and the surrounding environment, which poses
in delivery GIS services, which including user’s challenges in Mobile GIS research. This will be
discussed further in the last section of this Chapter.

160
Mobile Geographic Information Systems

MOBILE GIS FOR LOCATION- services provided by LBS. There are two main
BASED SERVICES (LBS) elements in delivering LBS services: pull and
push. ‘Pull’ element in LBS generally employs
Mobile GIS have brought mobility, real-time user location obtained via his/her mobile device
connectivity and location-awareness into GIS, to offer services whereby the user can pull down
and also have broadened GIS usage and user (access and retrieve) data and information relevant
groups. Mobile GIS provide tools and services to his/her current location and situation. ‘Pull’
not only for skilled professionals but also for a element in LBS can be seen in applications such
wider public to use in their working and social as navigation and wayfinding, searching for near-
lives. Geographical data and information can thus est point of interest. For the ‘Push’ element in
be accessed, extracted and displayed on user’s LBS, individual’s mobile location is used to push
mobile devices (e.g. smart phones) in real-time (send) relevant information to users either as pre-
while on the move. Mobile GIS can be seen as the subscribed requests or independently of a query.
technology that facilitates the creation of mobile For example, information can be delivered to
services. This leads us into the area of Location- users according to their current locations for all
Based Services (LBS). kinds of business and social purposes. This may
Location-Based Services (LBS) are defined take the form of proximity advertising and offers
as “the delivery of data and information services of discounted goods, but could also be weather
where the content of those services is tailored forecast, traffic conditions, delays to trains and
to the current or some projected location and flights, warning messages for potential hazards.
context of a mobile user” (Brimicombe and Li, There is also the element of ‘tracking’, which is
2009). LBS can be viewed as the outcome of the monitoring the location of mobile devices con-
convergence of a range of technologies, as dem- tinuously. ‘Tracking’ could be used to provide
onstrated in Figure 4. GIS provides geographical services such as monitoring or managing vehicle
science, technologies and data for LBS. New fleets, or in entertainment such as location-based
Information and Communication Technologies games. Furthermore, location related information
(NICTs) have evolved from ICTs with features of and services provided by LBS are expected to be
mobility and location-awareness. NICTs include updated according to user’s current situation and
a range of technologies associated with wireless needs, and to be delivered in real-time. It is es-
mobile telecommunication networks, other wire-
less networks, wireless handheld mobile devices
(e.g. smart phones) and positioning technologies Figure 4. Location-based services: An aspect of
that integrated into mobile devices. NICTs also the technological convergence
have the important feature of having wide usage,
near ubiquity, in people’s lives. The convergence
of GIS, the Internet and NICTs has made LBS
possible. We also can see that LBS are tightly
coupled with Mobile GIS. LBS are therefore able
to offer data and information services strongly
related to user location in mobile situations, in
real-time wirelessly.
The focus of LBS is on delivering location
based data and information services that are mo-
bile user centred. Location plays a major role in

161
Mobile Geographic Information Systems

sential that services provide real-time, up-to-date Information and services provided by LBS
information pertinent to mobile users. can increase their commercial potential when
In order for LBS to provide needed information harnessing Mobile GIS. At the same time, LBS
and services to mobile users, context is important also bring emphasis on a range of research ar-
aspect to be considered. The context in LBS mainly eas in GIS and Mobile GIS. These have been
refers to the current situation of a user that could discussed in detail in the book “Location-Based
be relevant to tailoring location-based informa- Services and Geo-Information Engineering” by
tion. Context can be information associated to Brimicombe and Li (2009). Some of these key
user personal situations that might be used in areas are issues concerning the availability of
setting up services particularly delivered to these data, locating users and how to contextualise the
users. For example, large sized text or maps, or user, their environment and the technology for
louder ring tone could be extra features along delivering geographical information services.
with the information provided to those users in As discussed, determining the location of a user
need. Context can also be surrounding situation at to whom services are delivered is essential for
certain location, such as noisy places, unfamiliar tailoring information in LBS. The location can
areas. Other context may be time of day, speed be the current position of a user or some future
of travelling. All these context will have direct projected location. A number of positioning tech-
effects in tailoring location-based information and nologies are available to determine the location
services for mobile users. This will be discussed of a mobile device, as discussed elsewhere in this
further later in this section. book. However, challenges still remain to obtain
Location-based geographical data and informa- user location with sufficient accuracy to enable
tion provided as services by LBS can take many location-based information to be provided with
forms. One of the efficient ways to deliver geograph- adequate detail and thus ensure utility. The level
ical information is using maps, and is sometimes of sufficient accuracy also depends on the nature
narrowly viewed as what LBS are. However, LBS of services required. For example, the level of
should not be simply seen as delivery of maps to positioning accuracy required is much higher for
users’ mobile devices. ‘Location’ in LBS concerns responding say, to a car accident than in providing
more the geographical position of a user as the key weather information for a user location. In addi-
to tailoring information rather than the content of tion to determining user location, contextualising
any response to a query. Response to a query can be the user and associated situation is important for
just text or images, but such information has been tailoring location-based services to mobile users.
tailored to the user, who makes the query, accord- Context of user’s geographical location and sur-
ing to his/her mobile location and situation. For rounding environment could all need to be taken
example, if a user queries the closing time of a car into account. Take a wayfinding example. A route
park where he/she is, the response sent back need will be given to a user according to his/her loca-
only be a simple text message. However, without tion and requirement. The context such as time
requiring the user to describe or state exact location of the day could directly influence the utility of
of the car park, LBS should be able to deliver the the information provided as being either as the
right information in real-time by determining the shortest route or the safest route at night. Informa-
location through the user’s location-aware mobile tion associated with users and their preferences
device. From this simple scenario, we can see that is another aspect of context in providing tailored
in adapting information and services according to geographical information. Technology is a further
location, GIS and the use of spatial data are fun- source of context, being the type of mobile device
damental aspects of LBS. being used and its capabilities, quality of wireless

162
Mobile Geographic Information Systems

connection, or the availability of data resources mobile operating systems (or smart phone plat-
at the time of user request. forms) which are Google’s Android, Microsoft’s
The key research areas in LBS and Mobile GIS Phone 7, and Apple’s iOS. Some applications
include dealing with the spatial query, communi- have recently extended to BlackBerry. While
cating the response and interoperability. Dealing individual users can use smart phone applications
with spatial query and communicating responses for their daily life such as finding nearest points
in Mobile GIS and LBS have their characteristics of interest, organizations in many sectors can use
as processing and providing information in real- them to increase operational efficiency such as
time, and providing geographical related services improving workflow.
to the wide public who should not be required The range of applications of Mobile GIS for
to have GIS skills. The interested reader can smart phones are summarised here. Some of them
refer to the book “Location-Based Services and are still potential applications, which are expected
Geo-Information Engineering” for more in-depth to emerge as Mobile GIS mature in the foresee-
analysis of these areas. As LBS are the result of able future. Others applications have already been
the convergence of a range of heterogeneous developed and are being used to a certain degree.
technologies, interoperability is one of important Relevant examples are presented under each
research areas in GIS, Mobile GIS and LBS. Le- category to demonstrate different development
gal/social issues and business models are issues procedures and application types.
increasingly raised as GIS and LBS have wider
use in society, which will be referred to again in Applications Mainly for Accessing
the last section of this Chapter. and Using GIS Functionality
through Smart Phones

APPLICATIONS OF MOBILE Basic GIS software functionalities can be imple-


GIS FOR SMART PHONES mented on smart phones through Mobile GIS
applications, which are often extended to include
New opportunities for Mobile GIS applications some location-based features, such as accessing
and services are rapidly growing with the rise of maps and data relevant to user location, collecting
smart phones. A 2010 comScore study indicates location related data in real-time, real-time report-
14% of smart phone users (33.5 million) use mobile ing of events with spatial tags. Such applications
mapping applications, a 44% increase from 2009. provide users with smart phones access to basic
This number is expected to grow further through GIS services in real-time while on the move.
2011-12 (http://www.vce.com, viewed on August Here are some examples of this type of Mobile
2011). The availability of smart phones and the GIS application. One of them is ArcGIS Apps for
widespread use of GIS consumer products on smart Smart Phones developed by ESRI, which allows
phones have consequently aroused a substantial users to navigate maps, collect and report data, and
anticipation of Mobile GIS applications for smart perform GIS analysis via a smart phone. Such ap-
phones (Renner et al., 2011). plications offer opportunities for smart phone users
However, Mobile GIS services remain in their to view maps and capture information related to
early stage. There is potential and also challenges their current situation, access content from ArcGIS
for mobile software development, as new hardware Online or use functionalities from enterprise GIS,
is being released and various support platforms and develop custom applications for user needs.
co-exist. At present, most mobile GIS applications These apps could broaden GIS usage to a much
and APIs for smart phones are built on three main wider public. Another example is applications

163
Mobile Geographic Information Systems

built using open source software that are increas- in the U.S. through iPhone, iPad, or iPod Touch.
ingly available. It becomes practicably feasible This application provides information which can
to build a specific single application on different be used to evaluate an area on-site in real-time and
mobile platforms. For instance, a new Flex/AIR assist businesses to understand an area in terms
GIS application for the smart phone BlackBerry of its geo-demographics. The knowledge of an
has recently developed by WebMapSolutions us- area can be: types of people living in the location,
ing Web-based Flex code (www.webmapsolution. including a brief demographic and market data
com, viewed on August 2011). Building Mobile summary of a location (e.g. age, gender, socio-
GIS functionality for smart phones has also been economics, and spending habits); the difference
aimed for Apple’s iOS platform using OpenScales between people living in this location from other
Flex sdk’s, ArcGIS Flex API and OpenLayers. areas by comparing demographic and market data
Also in 2008, a solution for accessing GIS for two locations or one location vs. the entire
through BlackBerry smart phones started to be U.S. national level data. Such applications may
designed (http://www.3-gis.com, viewed on Au- assist businesses to compare and choose prefer-
gust 2011). With the introduction of an application able locations for their services based on needs
named Buzz for BlackBerry smart phones, users are and criteria. My Place History is another free
able to view and query GIS data with their smart app from ESRI’s Geomedicine for Apple’s iOS
phones on the move. Developed to employ the platform which links public health information
ESRI ArcGIS Server technology, Buzz is aimed at with user environmental experience. This appli-
services provided to industries such as the utility, cation provides services to assist users to better
telecommunications and emergency management understand their local environment in terms of
firms; but also could be extended to other markets its effect on their health. U.S. street addresses
with a geo-database. Currently, Buzz can run on are used to create and manage a personal place
the BlackBerry® Pearl™ series, BlackBerry® history. My Place History allows users to access
Bold™ and BlackBerry® Curve™ series smart geographically relevant health information, such
phones utilizing the AT&T, Sprint, T-Mobile and as one’s proximity to environmental hazards or
Verizon Wireless telecommunication networks. exposures. The application of My Place History
can also link user’s personal place history with a
Mobile Applications for Public number of USA governmental databases such as
and Business Services Dartmouth Atlas of Health Care, Toxic Release
Inventory (TRI) of the United States Environ-
In the early stage of mobile GIS applications on mental Protection Agency, Haz-Map, and U.S.
smart phones, there is no clear boundary between National Library of Medicine.
public and business services. However, increasing Another example is a location-based service
numbers of applications are expected for various developed for companies and state and local
specialised aspects of public and business ser- governments by GCS Research, combining the
vices in the future, as mobile technology itself is functionality of Mobile GIS with the GPS-based
evolving. Some of examples are given below to sensor suite of an Android-enabled smart phone
illustrate mobile GIS applications used for public (www.gcs-research.com, viewed on August 2011).
and business services. This is a thin-client application, aiming to be a
One example application is the free BAO (Busi- practical, cost-effective business tool that enables
ness Analyst Online) for Apple’s iOS platform mobile users to access geographic information
developed by ESRI, aiming to give access to key and geospatial functionality via their handheld
demographic and market facts about any location device. The Montana Parcel App, built by GCS

164
Mobile Geographic Information Systems

Research for this purpose, allows smart phone monitoring and management as well as those ac-
users to remotely access information of real estate tivities carried out in the field such as identifying
in Montana. Users can use the phone’s built-in key variables contributing to a problem, taking
GPS to get their current location in the state. They measurements, and inputting and handling data
also can zoom into a mapping interface to select residing locally or remotely. For a variety of
an area of interest by “double-tapping” the touch fieldwork operations, there has historically been a
screen. In real-time, the application can retrieve lag between collecting data on-site and analysing
location-specific property information for the se- data back at the office, also in exchanging relevant
lected site from two Web-enabled databases. The data for analysis and decision making. Mobile
first database is a parcel boundary file maintained GIS applications delivered via wireless networks
as part of the Montana cadastral database. This and mobile handheld devices (e.g. smart phones)
cadastral information is managed in an ArcSDE offer field operatives the opportunity to do on-
geodatabase. The smart phone application can site data collection and monitoring while having
trace the property boundaries in red on the phone’s real-time connection with remote sites (or head
map display. The second database is the State of offices) to exchange data, to carry out analysis to
Montana CAMA (Computer-Assisted Mass Ap- support decision making or to take action rapidly.
praisal) System where it retrieves property details Furthermore, using mobile devices for fieldwork
such as owner name, parcel legal description, area has a number of extra advantages such as light
size, assessed value, and zoning code. Such an in weight and low in cost. Nevertheless there are
application can create a mobile mash-up that uses also a number of disadvantages to using a mobile
the Android mobile operation system to deliver device in the field such as its comparatively low
mobile services. CPU power for processing data and graphics,
The advanced functionality of mobile operat- small screen size and issues on its compatibility
ing systems have made it possible to bridge the when used alongside other equipment.
gap between the enterprise business GIS database One example is a Mobile GIS application
and the mobile phone, as we can see in the case aimed at providing and improving GIS ability for
of the GCS Research application. Cost-effective, field surveying work. SuperGIS Mobile Engine 3
location-based business tools can be created with is a Software Development Kit (SDK) developed
this technology. Users can download applications by SuperGeo (www.supergeotek.com, viewed
such as Montana Parcel App for free on their on August 2011), which can be used to build
smart phones. Such mobile applications and other Mobile GIS applications to meet demands for
Mobile GIS tools can be transferred to different effectively improving the surveyor’s productivity.
smart phone operating systems. The development Additional GIS functionality can also be created
of location-based services, which converges a using SuperGIS Mobile Engine, and then embed
range of technologies, can provide a wide range into the existing mobile application to provide
of Mobile GIS applications for smart phones. Mobile GIS functionality to users via their mobile
devices. It is aimed for user to develop Mobile
Applications for on-Site GIS applications through SuperGIS Mobile Engine
Data Collecting, Monitoring, built-in controls, read-to-use features, to achieve
and Management standard GIS functionality in mobile situations.
These mobile applications include map display,
Real-time communication and analysis play im- GPS positioning, file editing, coordinate system
portant roles to support efficient decision making. settings to meet user needs in the field. They can
On-site work can comprise of data collection, be built for mobile devices supporting Micro-

165
Mobile Geographic Information Systems

soft®.NET Compact Framework, such as smart with other business and organization using real-
phone, Pocket PC and Tablet PC, and thus reduce time on-site information.
the incompatibility problem that often occurs in With a mobile traffic alert application de-
hardware and software. veloped in Ryerson University, Canada, road
Mobile GIS applications, similar to this, often geometry data can be extracted using GPS data
aim to offer users working on-site with capabili- received from smart phones (Niu, Li, & Poursaeid,
ties such as: collecting spatial data; editing and 2011). Based on road centrelines, this application
updating data locally and remotely; accessing can be employed to assist capturing road network
maps to obtain up-to-date and relevant spatial data; data. The current conventional methods to obtain
having positioning ability such as GPS; querying such data, either by surveying or by using remote
and managing data to assist their fieldwork and sensing data, are time consuming and expensive.
decision making. There is a range of examples of In this application example, GPS data are crowd-
such application. One is using mobile handheld sourced through smart phones as an increasingly
device for on-site monitoring of environmental popular way of collecting data that can be used
processes (Nurminen, Kruijff, & Veas, 2011). to provide real-time traffic information, identify
Environmental parameters of certain areas are traffic patterns, and predicting traffic congestion.
observed continuously and measured regularly. The same type of data can be very useful for cost-
Spatial data as well as related data such as images effective, fast updating of road network databases
are communicated and exchanged between people due to its rich spatial and temporal coverage and
working on-site and decision maker who are in high data volume.
different locations. Others include field map-
ping to update spatial data on-site with attribute Applications for Navigation
information, building and maintaining asset lists and Wayfinding
with location and attributes, and real-time incident
recording into spatial database. A range of mobile applications are particularly
Another example for this category application designed for navigation and wayfinding using
is BlackBerry Business Solutions (us.blackberry. mobile devices such as smart phones. Navigation
com/business/needs/field/, viewed on August applications are mainly developed for assisting
2011), which consists of a combination of ap- motorists to identify appropriately optimized
plications, services and peripherals, including routes. For example, an optimised route can be
printers, signature capture, GPS, digital forms found and sent as a map to users’ mobile devices
and barcode scanners. This application is designed when a vehicle is to be driven from one starting
to provide mobile customer services. It provides point to one or more user-specified destination.
mobile field access to details to support cases and Furthermore, subscription services for real-time
customer activities which will raise support issues traffic updates can be delivered to users’ smart
in real-time. This mobile application can be used phones according to the user’s current location.
to provide services such as updating service tick- Mobile GIS applications that provide wayfind-
ets from different locations, triggering the billing ing services are more focused on the means by
process and setting up feedback from the field. which individuals orientate themselves, discover,
Similar applications could provide services such and explore surrounding or certain areas of interest.
as checking availability of products at different Individual users with their mobile devices could
locations according to user’s current position and use this type of application to discover points
placing orders, communicating and collaboration of interest related to their current locations, to
find routes to certain destinations or places such

166
Mobile Geographic Information Systems

as tourist attractions, restaurants, and theatres. • Artistic expression in the community


Modes of transport and other spatially located that might include digital graffiti, location-
objects, landmarks and points of interest can based story lines and discovery trails.
also be provided using location-aware mobile • Mobile gaming where the players and ac-
devices. Such applications intend to deliver us- tions are location-based.
ers a service in real-time according to their need • Social networks where location-aware
while on the move. applications could be implemented with
wireless social network tools, such as twit-
Other Mobile GIS Applications ter. Users would share their location, and
on Smart Phones what is on their mind through updating
their status, attaching a location to their
A range of other applications can be identified, thoughts and activities. The updated status,
many of which have been prototyped and in de- with the location graphically depicted by a
velopment. These applications, along with those map, would be available on the screen of
applications for navigation and wayfinding, are smart phones.
more closely related to LBS applications:
The future trends of Mobile GIS applica-
• Real-time tracking: such applications tions for smart phones will be interwoven with
can be real-time tracking of vehicle fleets, location-based services, as Mobile GIS, smart
business associates, social contacts (e.g. phones, wireless telecommunication networks
to know when friends are nearby) or one’s and related technologies are further developed.
family members (e.g. tracking children Mobile GIS enable more people without GIS
home by parents who are still at work). professional skills to engage with GIS-based ser-
• Mobile commerce: this would incor- vices in a more ubiquitous way. Smart phones are
porate both transactions made by indi- expected to be used by more people on the move
viduals on the move via a NICT and re- as a powerful and readily available thin client for
ceiving (pushed) alerts (e.g. advertising, various applications rather than just as phones.
notification of discounts) to opportunities More advanced real-time/on-site functionality
for transactions that are location-specific of Mobile GIS will become available on smart
(such as at a nearby shopping mall). phones due to the advancement of new informa-
• User-solicited information: this type of tion and communication technologies (NICTs).
applications can be for all kinds of busi- Therefore, the range of Mobile GIS application
ness and social purpose, such as weather will continue grow.
forecast, traffic conditions, delays to trains
and flights, warning messages for potential
threats, film showing and ticket availabil- PROSPECTS AND CHALLENGES
ity, menus and local maps. OF MOBILE GIS
• Location-based tariffs: such as differen-
tials in road pricing, pay-as-you-go car in- Mobile GIS have come a long way in a short time.
surance and similar schemes. They have been facilitated by the rapid develop-
• Coordinating emergency and mainte- ment of mobile telecommunication networks to
nance responses to accidents, interrup- deliver higher bandwidths, the rapidly evolving
tions of essential services and disasters. design of mobile devices, and the convergence of
heterogeneous technologies into single devices

167
Mobile Geographic Information Systems

that are location-aware. As these technologies The rapid development in mobile telecom-
further develop, so too will the possibilities for munication network and mobile device technolo-
Mobile GIS. However, Mobile GIS are technology gies will bring even faster data transfer speeds,
dependent and any shortcomings in the technol- better connectivity and more advanced devices.
ogy will have knock-on effects for Mobile GIS This bodes well for Mobile GIS. However, such
in the implementation and delivery applications developments and changes in these industries
(Brimicombe and Li, 2009). Above all, by align- also raise issues concerning the reliability and
ing GIS with mobility and mobile devices there consistency of network infrastructure and devices
is the opportunity for engineering GI applications which Mobile GIS rely on to build, implement and
that make GIS part of ubiquitous technologies and deliver applications. For example, the release of
at last achieve an embedded mainstream status. the iPhone revolutionised smart phone technolo-
The GIS community has long thought of itself gies and required application designers to quickly
as focusing in two areas: systems and science. Once gear up to this new class of device. The design of
the systems had adequately matured, the science the devices (such as screen size and resolution,
took off with research focusing on the substantive keyboard/keypad, memory and optional additional
questions posed by the use of the systems in solv- memory) is likely to be driven by the more popu-
ing real-world problems. The research agenda is lar uses such as texting, social networking and
far from complete, particularly as new questions tweeting and may remain sub-optimal for Mobile
arise from the pace of technological development, GIS. For example, use of gestures across the face
the increasing quantity, diversity and granularity of the screen is likely to make query definition
of data, and the complexity of analyses that are more difficult when compared to using a keypad,
entailed. Nevertheless, GIScience is reaching a but a pop-up keypad may obscure too much of
maturity with a body of scientific and application the maps being viewed. There is also the issue
knowledge. Engineering is the systematic applica- of battery life, that is, the length of time between
tion of scientific research to successfully and de- charges that allows a device to remain serviceable
pendably solve real world problems (Brimicombe, on the move without needing additional batteries.
2010). With the convergence of technologies shown Relatively short battery life has been an issue with
in Figure 4 and the potential for designing applica- smart phones because of the power demands made
tions that are widely used, we see the opportunity for upon them. Continuity of connectivity and there-
progressing to GI Engineering, that is, “the design fore of service has always been a problem with
of dependably engineered solutions to society’s use wireless mobile devices due to blocked signals
of geographical information” (Brimicombe & Li, in tunnels, underground systems, inside building
2009). Examples already on the market are in-car complexes and so on. Hand over from one cell to
satellite navigation systems (SatNavs), similar ap- another, particular at speed, can cause the service
plications on smart phones, and niche applications to drop. Additionally, GPS in a smart phone for
of location-based services (LBS). location-awareness will not work and may even
Whilst the prospects for Mobile GIS are good give seriously erroneous results where a clear view
with plenty of opportunities, there are challenges of three or more satellites is not available. Whilst
along the road to ubiquity. These challenges assisted GPS through the telecommunications
derive from four broad areas: limitations, even network helps overcome some of the effects, it
threats that derive from the technologies being is not a complete solution. Location-awareness
used; areas of GIScience that still need to be ad- may have to rely upon a number of technologies
equately researched; users; and business models including GPS, A-GPS, accelerometers (which are
for a sustainable presence.

168
Mobile Geographic Information Systems

now available as a chip), and RFID and Bluetooth particularly suited to achieving interoper-
for locating through beacons. ability in heterogeneous computational
Mobile GIS raise a number of research areas environments and would thus be a sound
for GIScience arising from the technology, data structure for Mobile GIS.
requirements and the nature of applications. Im- • Scalability and response times: For
portant areas are: ubiquitous applications of Mobile GIS,
the technology will need to be scalable to
• Distributed component systems: This potentially millions of users without no-
will be a fundamental departure for GIS ticeable degradation of service. GIS-type
software which has tended to be mono- queries are typically long transactions, that
lithic, that is, a large software package is, querying spatial data takes much longer
resident on a single PC or server. The than typical database queries of nonspatial
smart phone and other mobile devices are data. GIS queries tend to have both high
comparatively thin clients with relatively CPU cost and high I/O cost. Thus current
low storage and processing power. An ad- approaches to GIS software are unlikely to
vanced desktop GIS package is unlikely be sufficiently scalable. Whilst open and
to fit on a smart phone and software for interoperable middleware will be impor-
mobile devices that are available on the tant, a large part of the solution rests in the
market are usually a sub-set of the func- type of interoperable component systems
tionality geared to specific tasks. In order discussed above.
to overcome such limitations Brimicombe • Data: There has been a data explosion
and Li (2009) have proposed a distributed over the last decade of unprecedented
component approach to GIS using agent proportions, an increasing proportion of
technologies. A software agent can be de- which is spatial data. Many areas of eco-
fined as an autonomous, problem-solving, nomic endeavour have become data driven
encapsulated entity in a dynamic and open and the focus on high resolution local data
environment (Woolridge, 1997). The fun- has driven new services. Focusing in on lo-
damental difference between a convention- cal areas leads to an exponential rise in the
al software program and an agent is that amount of data required to satisfy informa-
the former will run once and then terminate tional needs (such as from Google Earth
until instructed to run again, whereas the to Google Street View). Zooming in also
latter can sense the environment in which requires data to be timelier with shorter up-
it is embedded and act on it autonomously date cycles. Whilst the technology for this
in pursuit of its goals (Franklin & Gaesser, exists, the costs can be high. But whilst
1997). The advantage of this type of struc- conventionally GIS have been deployed
ture would be that an interface agent would for map data and their attributes, the main
reside on the thin client mobile device and mass markets will be for such data to be in-
would arrange for each task (such as lo- tegrated into a multimedia approach to in-
cating the relevant data, conducting some formation delivery either through on-the-
form of analysis and displaying the result) fly mash-ups or by incorporating into GIS
to be carried out by other agent-based ser- non-traditional data that nevertheless have
vices residing on servers on the Internet. spatial tags (e.g. tweets). This will add an
Functionality and data would be on an order of complexity to GIS.
on-demand basis. Agent technologies are

169
Mobile Geographic Information Systems

• Cognition: This is central to how users the perceived benefits of a service must outweigh
of Mobile GIS are going to able to inter- the cost or effort in obtaining the service. This
act with the technology in relation to their requires business models that promote the utility
changing surroundings. Traditionally stud- whilst being able to cover the costs of sustaining
ies of spatial cognition have focused on how the service (Tsalgatidou & Pitoura, 2001). Many
users conceptualise, understand and use users of Internet and mobile services expect them
map products. But in most interactive ap- for free and consequently must put up with pushed
plications employing Mobile GIS, the user advertising and other inducements which will
initiates the query, the framing of which will cover the cost for the service. This type of busi-
be based on spatial reasoning derived from ness model is likely to be restrictive for services
their perception and understanding of their based on Mobile GIS because unless the volume
surroundings, desired goals and a range of of users is very high, the click rate on advertise-
contextual variables. Spatial queries will ments is unlikely to supply revenue that would
need to be framed as naïve natural language cover set-up and running costs. Simple applications
or gesture-based requests instead of more such as accessing Google Maps are free, but are
formal database queries. The formulation no different from mobile Internet. More complex
of the query is all important in obtaining applications that provide analytical functions and
the desired information and deriving utility tailoring based on location-awareness are likely to
from the application. be subscription services. The challenge for these
services is to provide high levels of utility that are
Some important considerations of the user have error free in order to be sustained. Services based
just been touched on. The design of the application on Mobile GIS are only at the beginning – this will
should provide for a compelling user experience be an interesting space to watch.
and most smart phones provide the functionality
and programming environments to achieve that. But
designers need to also consider the vulnerabilities CONCLUSION
that the use of smart phones can impart to users.
In the UK there have already had to be public Mobile GIS is the successful convergence of mo-
service announcements on the potentially danger- bile communication and GIS technologies. The
ous consequences of using mobile devices while driving force of growth of Mobile GIS is the rising
crossing the road. This and similar vulnerabilities demand for LBS applications from users of mobile
might be an important for Mobile GIS applications devices, where smart phone users have played a
for wayfinding. As a way of alerting the GI com- key role. Wireless mobile devices, particular smart
munity to this issue, Brimicombe (in press) has phones, offer the possibility of querying, analysing
analysed the crime wave in the theft of portable and visualising spatial information on-site and in
in-car satellite navigation devices (SatNavs) once real-time. Mobile GIS provide core functionality in
they became mass consumer products at the end of the context of LBS, while LBS bring Mobile GIS
2005. Many purchasers of these products (the first to various aspects of daily life. The research and
mass consumer products to include GIS software) development of Mobile GIS are still at an early
will have been unaware of the vulnerability to stage, but great potential has been demonstrated
theft that these devices would impart to the owner. for a range of applications. With further progress
Beyond gimmickry and status must-haves, Mobile in wireless mobile technologies and GIS, Mobile
GIS applications will need to establish markets GIS will certainly make a substantial contribution
for services that provide utility to users. That is, in the realisation of ubiquitous GIS.

170
Mobile Geographic Information Systems

REFERENCES McHaffie, P. (2000). Surfaces: Tacit knowl-


edge, formal language, and metaphor at the
Brimicombe, A. J. (2010). GIS, environmental Harvard Lab for computer graphics and spa-
modeling and engineering (2nd ed.). Boca Raton, tial analysis. International Journal of Geo-
FL: CRC Press. graphical Information Science, 14, 755–773.
Brimicombe, A. J. (2012). Did GIS start a crime doi:10.1080/136588100750022778
wave? SatNav theft and its implications for Niu, Z., Li, S. N., & Poursaeid, N. (2011). Road
geo-information engineering. The Professional extraction using smart phones GPS. 2nd Interna-
Geographer, 64. tional Conference and Exhibition on Computing
Brimicombe, A. J., & Li, C. (2009). Location- for Geospatial Research and Applications. Wash-
based services and geo-information engineering. ington DC. Retrieved on August 15, 2011, from
Chichester, UK: Wiley. http://www.com-geo.org/

Brimicombe, A. J., Li, Y., Al-Zakwani, A., & Li, Nurminen, A., Kruijff, E., & Veas, E. (2011).
C. (2009). Agent-based distributed component HYDROSYS - A mixed reality platform for
services in spatial modelling. In Gervasi, O. (Eds.), on-site visualization of environmental data. In
Computational Science and its Applications - Proceedings of the 10th International Symposium
ICCSA 2009 (pp. 300–312). Berlin, Germany: on Web and Wireless Geographical Information
Springer. doi:10.1007/978-3-642-02454-2_21 Systems (pp. 159 - 175). Springer Lecture Notes
in Computer Science.
Doyle, S., Dodge, M., & Smith, A. (1998). The
potential of web based mapping and virtual reality Peng, Z. R. (1999). An assessment framework for
technologies for modelling urban environments. the development of Internet GIS. Environment &
Computers, Environment and Urban Systems, 22, Planning B, 26, 111–132. doi:10.1068/b260117
137–155. doi:10.1016/S0198-9715(98)00014-3 Peng, Z. R., & Tsou, M. H. (2003). Internet GIS:
Franklin, S., & Graesser, A. (1997). Is it an agent, Distributed geographic information services for
or just a program? A taxonomy for autonomous the internet and wireless networks. Hoboken,
agents. In Jennings, N. R., Wooldridge, M. J., & NJ: Wiley.
Müller, J. P. (Eds.), ECAI-WS 1996 and ATAL Plewe, B. (1997). GIS online: Information re-
1996, LNCS (Vol. 1193, pp. 21–35). Heidelberg, trieval, mapping, and the internet. Santa Fe, NM:
Germany: Springer. doi:10.1007/BFb0013570 OnWord Press.
Li, C. (2006). User preference, information Renner, R., Moran, M., Hemani, Z., Thomas, E.,
transactions and location-based services: A study Pio, H. S., & Vargas, A. (2011). A comparison of
of urban pedestrian wayfinding. Computers, mobile GIS development options on smart phone
Environment and Urban Systems, 30, 726–740. platforms. 2nd International Conference and Ex-
doi:10.1016/j.compenvurbsys.2006.02.008 hibition on Computing for Geospatial Research
Longley, P. A., Goodchild, M. F., Maguire, D. J., and Applications. Washington DC. Retrieved on
& Rhind, D. W. (2005). Geographical informa- August 15, 2011, from http://www.com-geo.org/
tion systems and science (2nd ed.). Chichester, Tomlinson, R. F. (1984). Geographic informa-
UK: Wiley. tion systems – A new frontier. The Operational
Geographer, 5, 31–35.

171
Mobile Geographic Information Systems

Tsalgatidou, A., & Pitoura, E. (2001). Business Comer, J. C., & Wikle, T. A. (2008). World-
models and transactions in mobile electronic wide diffusion of cellular telephone. The
commerce: Requirements and properties. Com- Professional Geographer, 60, 252–269.
puter Networks, 37, 221–236. doi:10.1016/S1389- doi:10.1080/00330120701836303
1286(01)00216-X
Drummond, J., Joao, E., & Billen, R. (2007). Cur-
Tsou, M. H. (2004, July). Internet GIS: Today’s rent and future trends in dynamic and mobile GIS.
realities and a vision of the future. GIS Develop- In Drummond, J., Billen, R., Joao, E., & Forrest,
ment. http://www.gisdevelopment.net/magazine/ D. (Eds.), Dynamic and mobile GIS: Investigating
years/2004/july/ changes in space and time (pp. 289–300). Boca
Raton, FL: CRC Press.
Wooldridge, M. (1997). Agent-based software
engineering. IEEE Proceedings in Software Dueker, K. J., & Kjerne, D. (1989). Multi-purpose
Engineering, 114, 26–37. doi:10.1049/ip- Cadastre: Terms and definitions. Falls Church,
sen:19971026 VA: American Society of Photogrammetry and
Remote Sensing and American Congress of Sur-
Wu, X., Zhang, S., & Goddard, S. (2004). Devel-
veying & Mapping.
opment of a component-based GIS using GRASS.
In Proceedings of the FOSS/GRASS Users Confer- Goodchild, M., Yuan, M., & Cova, T. (2007).
ence (pp. 11 - 20). Bangkok, Thailand. Towards a general theory of geographic rep-
resentation in GIS. International Journal of
Geographical Information Science, 21, 239–260.
ADDITIONAL READING doi:10.1080/13658810600965271

Braun, P. (2003). Primer on wireless GIS. Park Heywood, I., Cornelius, S., & Carver, S. (2011).
Ridge, IL: The Urban and Regional Information An introduction to geographical information
Systems Association. systems (4th ed.). Harlow, UK: Pearson.

Brimicombe, A. J. (2008). Location-based services Kaasinen, E. (2002). User needs for location-aware
and GIS. In Wilson, J. P., & Fotheringham, A. S. mobile services. In Personal and Ubiquitous
(Eds.), Handbook of geographical information Computing (pp. 70–79). London, UK: Springer.
science (pp. 581–595). Oxford, UK: Balckwell. Karimi, H. A., & Hammad, A. (Eds.). (2004).
Brimicombe, A. J. (2009). GIS, environmental Telegeoinformatics: Location-based computing
modeling and engineering (2nd ed.). Boca Raton, and services. Boca Raton, FL: CRC Press.
FL: CRC Press. doi:10.1201/9781439808726 Li, C. (2006). User preference, information
Brimicombe, A. J., & Li, C. (2009). Location- transaction and location-based services: A study
based services and geo-information engineering. of urban pedestrian wayfinding. Computers,
Chichester, UK: Wiley. Environment and Urban Systems, 30, 726–740.
doi:10.1016/j.compenvurbsys.2006.02.008
Brimicombe, A. J., & Li, Y. (2006). Mobile
space-time envelopes for location-based services. Li, C., & Maguire, D. (2003). The handheld
Transaction in GIS, 10, 5–23. doi:10.1111/j.1467- revolution: towards ubiquitous GIS. In Longley,
9671.2006.00241.x P., & Batty, M. (Eds.), Advanced spatial analysis:
The CASA book of GIS (pp. 193–210). Redlands,
CA: ESRI Press.

172
Mobile Geographic Information Systems

Li, C., & Willis, K. (2006). Modelling context aware Walker, M., Turnbull, R., & Sim, N. (2007). Fu-
interaction for wayfinding using mobile devices. ture mobile devices – An overview of emerging
In [Espoo, Finland.]. Proceedings of the Mobile- device trends, and the impact on future converged
HCI, 06, 97–100. doi:10.1145/1152215.1152235 services. BT Technology Journal, 25(2), 120–125.
doi:10.1007/s10550-007-0035-3
Mountain, D., & Raper, J. (2007). Mixed reality
interfaces for mobile information systems. Aslib Wilson, J. P., & Fotheringham, A. S. (Eds.). (2008).
Proceedings: New Information Perspectives, 59, The handbook of geographical information sci-
422–436. doi:10.1108/00012530710817618 ence. Oxford, UK: Blackwell.
Peng, Z. R., & Tsou, M. H. (2003). Internet GIS: Zipf, A., & Jost, M. (2006). Implementing adap-
Distributed geographic information services for tive mobile GIS services based on ontologies:
the internet and wireless networks. Hoboken, Examples from pedestrian navigation support.
NJ: Wiley. Computers, Environment and Urban Systems,
30, 779–784.
Raptis, D., Tselios, N., & Avouris, N. (2005).
Context-based design of mobile applications for
museum: A survey of existing practices. In [Sal-
KEY TERMS AND DEFINITIONS
zburg, Austria.]. Proceedings of MobileHCI, 05,
153–160. doi:10.1145/1085777.1085803 Challenges of Mobile GIS: Research chal-
Reichenbacher, T., & Tollner, D. (2003). Design lenges raised in Mobile GIS area including
of an adaptive mobile geovisualization services. limitations derived from the technologies being
In Garter, G. (Ed.), LBS and tele cartography used, areas of GIScience that still need to be ad-
(pp. 17–23). Geowissenschaftliche Mitteilungen. equately researched, users, and business models
for a sustainable presence.
Schmalsteig, D., & Reitmayr, G. (2007). The Geographical Information Systems (GIS):
world as a user interface: augmented reality for Also known as the research discipline of Geo-
ubiquitous computing. LBS and tele cartography graphical Information Science. GIS can be defined
(pp. 369–391). Springer. as a system of hardware, software, data, people,
Shiode, N., Li, C., & Batty, M. (2004). The impact organisations and institutional arrangements for
and penetration of location-based services. In colleting, storing, analysing, visualising and dis-
Karimi, H. A., & Hammad, A. (Eds.), Telegeoin- seminating spatial information about areas of the
formatics: Location-based computing and services earth (Dueker and Kjerne, 1989; Brimicombe,
(pp. 349–366). Boca Raton, FL: CRC Press. 2010).
GI-Engineering: The design of dependably
Tsou, M. H., & Buttenfield, B. P. (2002). A dynamic engineered solutions to society’s use of geographi-
architecture for distributed geographic informa- cal information.
tion services. Transactions in GIS, 6, 355–381. Location-Based Services (LBS): The delivery
doi:10.1111/1467-9671.00118 of data and information services where the content
Wagtendonk, A. J., & de Jeu, A. M. (2007). of those services is tailored to the current or some
Sensible field computing: Evaluating the use projected location and context of a mobile user.
of mobile GIS methods in scientific fieldwork. Mobile Geographical Information Systems
Photogrammetric Engineering and Remote Sens- (Mobile GIS): With features of mobility, real-time
ing, 73, 651–662. connectivity and location-awareness, broader
user audience.

173
Mobile Geographic Information Systems

Mobile GIS Applications: A range of soft- Wireless GIS: An alternative term for Mobile
ware applications with characteristics of wireless GIS.
communication, mobility and GIS functionality.

This work was previously published in Ubiquitous Positioning and Mobile Location-Based Services in Smart Phones, edited
by Ruizhi Chen, pp. 230-253, copyright 2012 by Information Science Reference (an imprint of IGI Global).

174
175

Chapter 14
A Review of Geospatial
Information Technology for
Natural Disaster Management
in Developing Countries
Sam Herold
University of Ottawa, Canada

Michael C. Sawada
University of Ottawa, Canada

ABSTRACT
Disasters are deadly and destructive events, particularly in developing countries where economic, social,
political and cultural factors increase natural hazard vulnerability. The recent devastation of the Haiti
earthquake on January 12th, 2010 was a prime example of the human toll a natural disaster can take in
developing regions of the world. There is an imminent need to improve natural disaster management
capacity in developing countries to reduce disaster impacts. Given that disasters are spatial phenom-
enon, the application of geospatial information technology (GIT) is essential to the natural disaster
management process. However, in developing countries there are numerous barriers to the effective
use of GIT, especially at the local level, including limited financial and human resources and a lack
of critical spatial data required to support GIT use to improve disaster management related decision
making processes. The results of a thorough literature review suggests that currently available free and
open source GIT (FOS GIT) offers great potential to overcome some of these barriers. Thus, disaster
management practitioners in developing countries could harness this potential in an attempt to reduce
hazard vulnerability and improve disaster management capacity. The use of FOS GIT significantly
reduces software costs and can help build local level GIT knowledge/technical skills that are required
for successful GIT implementation.

DOI: 10.4018/978-1-4666-2038-4.ch014

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

INTRODUCTION disaster management – the process of mitigation,


preparation, response and recovery. The field of
Although the United Nations designated the disaster management has greatly benefited from
1990’s as the International Decade for Natural recent advancements in computers and related
Disaster Reduction (IDNDR), there was a global information technologies. Geospatial informa-
failure to reduce natural disaster impacts during tion technologies (GIT), including geographic
that time (IFRC, 2001). Ultimately contributing information systems (GIS), remote sensing (RS),
to this trend are environmental degradation, rapid global positioning systems (GPS) and Internet
urbanization and social marginalization (McEn- GIS (IGIS) are currently being employed in a
tire, 1999), particularly in developing countries. variety of ways to support all phases of disaster
The increasing number of disasters suggests that management. Since “each phase is geographi-
vulnerability to natural hazards is also rising and cally related to where people, places, and things
so equates to changing the geography of risk. are spatially located” (Gunes & Kovel 2000, p.
By way of elaboration, more people are living 136), the entire disaster management process can
in low-lying coastal zones, seismically hazard- be significantly enhanced through the effective
ous areas and concentrated urban environments use of GIT (Goodchild, 2006). “Even though
(Amendola et al., 2008; Briceño, 2004; Burton et the natural processes (e.g., floods, earthquakes,
al., 1993; El-Masri & Tipple, 2002). Vulnerable landslides, etc.) that generate disasters might be
populations will be at increased risk, for example, fundamentally different, the techniques to assess
as the geography and magnitude of hydrometero- and mitigate risk, evaluate preparedness, and assist
logical hazards that are historically associated with response have much in common and can share and
some of the greatest disasters (Kondratyev et al., benefit from advances in geographic information
2002) change with global climate (Smith, 2004; science (GIScience) (e.g., data acquisition and
IPCC, 2007). Defining the geography of risk is integration; issues of data ownership, access,
of a major concern in general and in particular in and liability; and interoperability).” (Radke et
developing countries, “where disasters jeopardize al. 2000, p. 15). We propose that currently avail-
important social development goals such as ad- able free and open source software (FOSS) can
dressing poverty, ensuring adequate food, water, fulfill many GIT requirements needed to improve
and sanitation, and protecting the environment,” disaster management capacity at the local level.
(Levy 2005, p.375). Because natural disasters While FOSS can create additional challenges
have the greatest overall impact in developing compared to commercial solutions (Camara &
countries (Alexander, 1995; Bui et al., 2000; Onsrud, 2004), with a clear understanding of the
IFRC, 2001), this is where geospatial information barriers and benefits of FOSS from a developing
technologies (GIT) have the greatest potential to world perspective, FOSS is a capable and effec-
mitigate causalities. tive alternative.
The purpose of this paper is to examine the In the first section of this review, we begin
use of GIT for natural disaster management, by laying a set of brief contextual explanations
with an emphasis on how these technologies, in and definitions of natural hazards and disasters
particular free and open source GIS (FOS GIT) while emphasizing their spatial components, and
can be effectively utilized at the local level in then describe some of the factors that differenti-
developing countries. Although natural disasters ate developed and developing countries from a
cannot entirely be prevented, disaster losses (in- natural disaster vulnerability standpoint. Included
cluding human, environmental and infrastructure/ in this section is a more detailed examination of
personal property) can be minimized with effective the most commonly cited GIT implementation

176
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

barriers faced by developing countries. This is large region. Earthquakes occur with little warn-
followed by a brief review of the phases that ing and last only a few seconds to minutes, while
comprise the disaster management cycle. In the a drought may build up over a period of months
next section, which forms the bulk of this paper, over large regions and last even longer. Thus, a
we review the extensive literature that describes distinction can be made between ‘rapid-onset’
and explores the many uses of GIT in the field of natural hazards such as floods and earthquakes,
natural disaster management. We then examine or slower ‘creeping crises’ hazards like drought or
GIT-based FOSS, highlighting its potential as well disease (De Paratesi, 1989). However, Coppock
as limitations in terms of its ability to fulfill disas- (1995) points out that those slowly developing
ter management related requirements. Finally, we hazards have more in common with natural re-
discuss and describe our vision of how FOSS can source management, at least from a mitigation
greatly improve the ability of local level disaster perspective
managers to implement GIT, and thus improve McEntire (2001) describes “natural disasters
overall disaster management capacity and reduce as the disruptive and/or deadly and destructive
vulnerability to natural hazards. outcome of triggering agents when they interact
with, and are exacerbated by, various forms of
vulnerability,” (p. 190). Simply put, when a haz-
BACKGROUND ard intersects the zone of human use there is the
risk of disaster. The number of people affected
Natural Hazards and by disasters resulting from natural hazard events
Disasters: An Overview of rapid-onset is increasing (Oloruntoba, 2005).
In particular, hydrometeorological related natural
To provide context for the pursuant issues, a brief disasters (e.g., floods, landslides/avalanches, for-
overview of the relation between a natural hazard est/scrub fires, wind storms and waves/surges)
and a natural disaster is in order. Natural hazards have more than doubled since 1996 and caused
vary greatly in terms of frequency, duration, over 90 percent of deaths from natural disasters
scale, impact, etc., and these differences partly during the 1990’s (IFRC, 2001). Herein, our
determine the spatial data and technology needed main focus is on the role of GIT for mitigation
to effectively mitigate, prepare for, respond to and of rapid-onset natural hazards that cause the risk
recover from potential disasters that may result of natural disaster.
from their occurrence. Large-scale disasters represent a complex
Simply put, natural hazards are unpredictable and multidisciplinary problem for local disaster
acts of nature, characterized by extremes in physi- managers and related organizations, as well as In-
cal processes (Zerger & Smith, 2003). Examples ternational humanitarian/aid organizations. While
of natural hazards include earthquakes, tsunamis, many natural disasters are characterized by short
hurricanes, typhoons, droughts, wildfires, tropical reaction/response times, overwhelming damage
storms and floods. “The fundamental determinants to property and infrastructure, and a strain on the
of natural hazards are location, timing, magnitude resources of the affected community, those less
and frequency,” (Alexander 2000, p. 7). The frequent large-scale natural disasters are much
spatial scale and duration of natural hazards can more deadly. Among the largest-scale globally
vary greatly, which is important from a GIT per- predominant natural disasters with 50,000 vic-
spective, and in particular from the perspective of tims or more, three hazard types can be singled
data requirements. Landslides, for example, have out: earthquakes, tropical cyclones (with coastal
a local impact, whereas major floods can affect a inundation), and river floods (Kondratyev et al.,

177
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

2002). These types of natural hazards have caused ity rather than of the physical magnitude of the
the worst calamities both in the 20th century hazard (Hewitt, 1995; Quarantelli, 1998; Smith,
and the entire history of humanity, namely: the 2001). In this sense, “natural disasters can more
1970 flood and cyclone in Bangladesh (300,000 accurately be seen as social phenomenon, where
victims), the 1976 earthquake in China (242,000 the overall damage due to natural hazards is the
victims), and the 1931 flood in China (140,000 result of both natural events that act as ‘triggers’,
victims). The 2004 earthquake that occurred off and a series of societal factors,” (Weichselgartner,
the coast of Sumatra, Indonesia, and resulting tsu- 2001). Conceptualizing vulnerability in terms of
nami that devastated many countries surrounding societal factors as opposed to biophysical fac-
the Indian Ocean also ranks among the deadliest tors helps to explain the “wide range of human
natural disasters in history, with 283,000 reported impact that can result from events of comparable
fatalities (Lay et al., 2005). magnitude,” (Steiniger & Bocher, 2009, p. 1346).
For example, Montoya and Masser (2005) point
Disaster Vulnerability: Developed out that the “1988 Spitak earthquake in Armenia
vs. Developing Countries (former USSR) and the 1989 Loma Prieta earth-
quake in California were of similar magnitude
Although natural hazards pose a considerable and affected populations of comparable size;
threat to all countries, historically, developing however, the Armenian event killed 25,000 people
countries have been disproportionately affected whereas the California earthquake killed 63,” (p.
(Briceño, 2004). “About 95% of deaths caused by 495). Hurricanes “Hugo in 1989 and Andrew in
natural hazards occur in developing countries,” 1992 caused less than 50 deaths each in the U.S.
(Bui et al., 2000), and the loss of GNP due to On the other hand, cyclones in Bangladesh,” (p.
disasters is 20 times that of developed nations 432) killed over half a million people in 1970 and
(Alexander, 1995). These alarming statistics can- another 140,000 in 1991 (Bui et al., 2000). Clearly,
not be attributed to a greater frequency of natural the extent of human (or societal) vulnerability is
hazards in the developing world, but can be partly a major factor that contributes to determining the
explained by differences in natural hazard vulner- scale of a disaster, but why are people in develop-
ability. The concept of vulnerability is complex ing countries so vulnerable?
– it depends on a number of parameters – and The vulnerability of a geographic “area is
varies depending on the research orientation and determined by its natural and human-made envi-
perspective (for example, see Cutter, 1996; Cutter ronmental conditions, climatic patterns, and its
et al., 2000, 2003; McEntire, 2001; Morrow, 1999; political, social and economic ability to withstand
Weichselgartner, 2001). Thus, there is no single and respond to natural hazard events,” (Jayara-
accepted way to assess natural hazard vulnerability man et al. 1997, p. 294). Hazards tend to be
(Simpson & Human, 2008). However, regardless more destructive in developing countries where
of how it is conceptualized or assessed, history political, social and economic instability lead to
shows that developing countries are more vulner- poor organizational infrastructure and no adaptive
able to natural hazards than developed countries, capacity as compared to most developed countries.
and as a result experience a greater number of Henderson (2004), in examining the pervasive risk
natural disasters. of natural disaster faced by developing countries,
Although location and proximity to natural emphasizes that “conditions of poverty, poor
hazards is certainly an important factor, many housing, lack of information about disaster risk,
researchers propose that the scale of disaster poor telecommunications, and inadequate physical
impact is more a function of human vulnerabil- infrastructure frequently exacerbate natural disas-

178
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

ters,” (p. 107). Insufficient disaster management resistance and resilience,”(p. 190). Risk is created
resources, the absence of enforced laws and the because of proximity to potential hazards, which
shortage of trained experts all increase the diffi- varies spatially; “resistance refers to the ability
culty in coping with natural disasters (Guinau et of buildings and the infrastructure to resist the
al., 2005). For example, the high cost and skills strain exerted by natural hazards,” (McEntire
required for the application of GIS technology has 2001, p. 191) which also varies from one area to
hindered its utilization for disaster management another depending on building design, building
in developing countries (Rudyanto et al., 2001). codes and the materials used, among other factors.
The lack of ability to effectively utilize relevant Susceptibility, “which is a product of social, politi-
technology is a societal factor that contributes to cal, economic and cultural forces that determine
increasing disaster vulnerability. the proneness of groups and individuals to be
Developed countries have a greater awareness adversely affected by disasters,”(McEntire 2001,
and understanding of the importance of natural p. 191) can also be analyzed in spatial terms. For
disaster management. Bui et al. (2000) point out example, characteristics such as age, gender, race
that “they invest more in mitigation and preven- and socioeconomic status are generally accepted to
tion, and have more resources available to enforce be influential components of social vulnerability
legislation that may help reduce human vulnerabil- (Cutter et al., 2003), and these characteristics have
ity,” (p. 432). In developed countries, “insurance a high degree of spatial variability, particularly
absorbs more than half of the economic losses from within large urban areas. Such characteristics are
natural disasters, in contrast, less than 2% of the typically captured in a census, and this informa-
losses are insured in developing countries,” (Free- tion can be analyzed and mapped at varying scales
man & Pflug 2003, p. 601). However, as a general using GIT, such as GIS.
trend, in both developed and developing nations, Considering that assessing and understand-
“disaster management becomes a more pressing ing hazard vulnerability is a key component of
concern only after the disaster has struck,” (Cur- disaster management, and that vulnerability varies
rion et al., 2007). Perhaps the devastating impact spatially, it is well suited to be analyzed using
of recent large-scale disasters, such as the Indian GIT. Therefore, GIT must play a key role in any
Ocean tsunami (2004), Hurricane Katrina (2005), comprehensive and effective disaster management
and the Haiti earthquake (2010) will encourage strategy, yet there are dramatic differences in GIT
disaster managers in both the developed and de- use between developed and developing countries.
veloping world to focus more on mitigation and While each developing country is unique, there
preparation, rather than response and recovery. exists a common set of well-identified barriers
There are many factors that contribute to that often limit the use of such technology, thus
determining disaster vulnerability, and it is not increasing vulnerability to natural disaster. These
our intention to discuss them all, but rather to common barriers include:
demonstrate that the concept is pervasive within
the disaster management literature, particularly 1. A lack of financial resources
from a developing countries perspective. In ad-
dition, since many of the factors that determine Generally, using geospatial technology is
vulnerability vary spatially, vulnerability is well costly: requiring a substantial investment in com-
suited to be assessed using GIT. For example, puter software/hardware, spatial data, and human
McEntire (2001) considers vulnerability to be resources (education and training). In many cases
“the dependent component of disaster that is disaster management responsibilities and duties
determined by the degree of risk, susceptibility, are decentralized to local governments “without

179
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

being accompanied by the necessary funds,” sored/funded technology development programs,


(Montoya & Masser 2005, p. 494). Considering rather than through the gradual building of local
that local governments have to provide many GIT capacity (Britton, 2000). “Local workforce
essential “services (e.g., health, education and development and capacity building projects are
infrastructure services such as water, electricity essential for,” (Mohamed & Plante 2002, p. 2286)
and sewage, to name a few),” (Montoya & Masser, sustained GIT implementation and maintenance
2005, p. 495) they are “often not predisposed to that does not rely on external support or funding.
purchasing off-the-shelf commercial GIS software
due to other demands for funding,” (Renyi & 3. Institutional/political instability
Nan, 2002, p. 1). It is difficult to justify spend-
ing limited budgets on technology to enhance Disaster management is dependent on the
disaster mitigation and preparation when basic “functional and effective operation of institutions,
needs have priority. In addition, software has no whether formal or informal, and at the local level
benefits without the ability to purchase or collect where it matters most,” (Pande, 2006, p. 426).
data. Thus, limited finances are a major barrier The effectiveness of GIT and the ability to con-
and if GIT is going to play a role in local level vert information into action also depends on the
natural disaster management financially feasible existence of supporting organizations, which are
solutions such as FOSS can help. generally lacking in developing countries (Cop-
pock, 1995). Political barriers have also been cited
2. A lack of local expertise/knowledge (human/ as obstacles to local level GIT use (Montoya &
technical resources) Masser, 2005). High resolution census data, for
example, contains a wealth of information that
Effective use of GIT requires considerable could be utilized in a variety of GIT-based natural
technical knowledge and skills, obtained through disaster management related scenarios but this
education and training, as well as practical experi- data is often unavailable in developing countries
ence. Mohamed and Plante (2002) emphasize “the for various reasons. Governments in developing
lack of local expertise and capacity to operate and countries can be unstable, and political terms
maintain a GIS,” (p. 2286) in developing countries can be short. According to Murgia et al. (2002)
and general awareness is strongly lacking (Murgia “politicians have their own circle of supporters
et al., 2002). In contrast, the well established GIT who will be appointed in public institutions, and
communities built up over time in developed thus political instability is expanded to institutional
countries are an important resource for individual instability. Considering institutional instability,”
practitioners to acquire new knowledge, methods (p. 6). Ramasubramanian (1999) suggests that GIS
or techniques, discuss ideas and/or obtain support. implementation in developing countries would
In some developing countries, practitioners may be more successful in the hands of “empowered
be physically separated by hundreds of kilometres individuals and groups rather than organisational
and be without Internet or telephone (Britton, structures,” (p. 359).
2000). As such, local disaster management or-
ganizations would have trouble in trying to find 4. A lack of spatial data
local personnel who could effectively manage the
various GIT components/aspects of a holistic natu- “Accurate and comprehensive spatial data play
ral disaster management strategy. When GIT has a critical role in all phases of disaster manage-
been implemented at the local level in developing ment,” and are required for effective GIT use, yet in
countries, it is often the result of externally spon- many developing countries reliable spatial data are

180
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

a scarce resource (Brodnig & Mayer-Schönberger countries, which in effect, further increases their
2000, p. 3). Murgia et al. (2002) suggest that part vulnerability to natural disasters. However, recent
of the problem are “weak national data providers growth in the domain of free and open source soft-
such as mapping agencies, census organizations ware (FOSS), and in particular GIT-based FOSS,
and cadastres, and that many projects that have provides new opportunities that were previously
been initiated are stand-alone and lack continuity unavailable. For example, the cost alone of com-
and consistent funding or are not embedded in mercial GIT software (such as GIS) can prevent
stable institutions or are politically misused,” (p. it from being used at the local level in developing
6). In addition, governments in developing coun- countries (Ranyi & Nan, 2002); with FOSS, the
tries tend to have a very conservative approach issue of cost is eliminated. For this reason, and
in terms of data and information management others that will be described later in this paper,
(Shrestha, 1994). “For example, among the most FOSS is increasingly being recognized within the
common obstacles to be found in Asia are official disaster management literature as a good fit for
restrictions on geospatial data for security reasons developing countries.
as well as the rigidity and compartmentalization of
government bureaucracies, which consider certain Disaster Management
types of information as their property,” (Brodnig and the Role of GIT
& Mayer-Schönberger 2000, p. 3). In Costa Rica,
for example, access to census tract level data is Disaster management can be understood as a cycle
restricted (Montoya & Masser, 2005). Additional (Figure 1) which includes an effort to “mitigate
factors that contribute to the lack of spatial data against, prepare for, respond to, and recover from
include: (1) the remoteness of many areas; (2) the a disaster,” (Montoya 2002, p. 1). The core com-
lack of technical capacities to collect and assess ponents of this cycle are also commonly referred
bio-physical and socio-economic data; and (3) to within the emergency management literature
competing priorities in the fledging economies (e.g., Cutter, 2003), and often in conjunction with
(Brodnig & Mayer-Schönberger, 2000; Shrestha, natural disasters (Cova, 1999). Thus, emergency
1994). Montoya and Masser (2005) emphasize the and disaster management are closely linked; after
need “to identify or develop cost-effective data
collection methods for producing spatially refer- Figure 1. Disaster management cycle (Adapted
enced information in developing countries. The from Alexander, 2000)
role of spatial data in all areas of natural disaster
management cannot be underestimated; in fact,
Mansourian et al. (2004) propose the development
of a spatial data infrastructure (SDI) as a frame-
work to facilitate disaster management,” (p. 495).
Aside from the barriers to GIT implementation
just described, Ramasubramanian (1999) also
identifies cultural and language barriers in terms
of GIS implementation in developing countries
(for more on specific GIS implementation issues
in developing countries see Ramasubramanian,
1999). Given the aforementioned barriers, disas-
ter managers in developing countries are unable
to utilize GIT in the same ways as developed

181
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

all, a disaster clearly constitutes an emergency, but disseminate information, are essential in all areas
in this paper it will be referred to as the disaster of natural disaster management. Because GIS is a
management cycle. spatial decision support tool it is invaluable when
Mitigation refers to efforts that aim to eliminate effectively used in a field like disaster manage-
or reduce the risk to humans and/or property caused ment that deals with critical spatial decisions
by natural or man-made hazards (e.g., risk assess- (Cova, 1999). For instance, GIT provides the
ment, insurance, engineering standards, land use basis for estimating and mapping risk, planning
management, public education, etc.). Preparedness evacuation routes, determining suitable areas for
refers to activities “necessary to the extent that shelters, identifying disaster victims, and assign-
mitigation measures have not, or cannot, prevent ing resources during recovery (Goodchild, 2006).
disasters,” (Johnson 2000, p. 2). This involves It follows then, that GIT must play a key role in
developing operational capabilities for responding any comprehensive and effective disaster manage-
to a sudden disastrous situation. In this phase ment strategy. Figure 2 incorporates GIT within
“governments, organizations, and individuals the disaster management cycle to emphasize the
develop plans to save lives and minimize potential central role it plays during all phases.
disaster damage,” (Johnson 2000, p. 2). This in- Disaster management is complex, and involves
cludes emergency planning, training exercises, the participation and collaboration of many insti-
implementing hazard warning systems, evacuation tutions/organizations/agencies operating at inter-
procedures and stock-piling of critical supplies. national, national, regional and local levels. Such
In addition, “preparedness measures also seek to institutions/organizations/agencies use a range of
enhance disaster response operations,” (Johnson GIT depending on their specific information re-
2000, p. 2). Response refers to those actions taken quirements, financial resources and technical
just prior to, during, and immediately after a di- capabilities. While a strong national disaster man-
saster that save lives, reduce property damage or agement initiative is important, Henderson (2004)
improve recovery. The most important aspect of emphasizes that disaster management capabilities
this phase involves providing emergency assistance should be decentralized to the regional or local
for victims (e.g., search and rescue, emergency level given the wide variations in demographic,
shelters, medical care and food/water). Disaster socioeconomic, cultural and infrastructural condi-
responders “also seek to stabilize the situation and tions within regions and local areas of a nation.
reduce the probability of secondary damage,” Regional and local authorities involved in disaster
(ERSI 2006, p. 4). Recovery (often used in con- management should ideally have the capacity to
junction with the word ‘relief’) includes those effectively utilize GIT, yet even in developed
activities that (1) restore vital life support systems countries limited financial resources can prevent
and (2) return the area/population to a pre-disaster such utilization within this administrative level
state. The former can be seen as part of the short- (Laben, 2002). Theoretically, this approach is
term recovery plan while the latter as part of the increasingly being recognized to be more effective
long-term, which may continue for a number of than the centralized approach, but as Montoya amd
years after a disaster. Reconstruction is closely Masser (2005) note, local authorities are often not
linked with mitigation, and is undertaken in ways provided with the necessary financial resources
that aim to reduce vulnerability and improve pre- to develop and implement effective policies and
paredness, thus disaster management occurs in a plans, which should incorporate GIT. Laben (2002)
cyclical fashion as depicted in Figure 1. emphasizes that it is extremely important to have
Geospatial information technology (GIT), and GIT alternatives available that can be used by all
its ability to acquire, interpret, analyze, map and

182
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

Figure 2. A disaster management cycle incorporating GIT (adapted from Hussain et al., 2005)

levels of the emergency and disaster management Geographic Information


communities. Systems (GIS)

A GIS is an “organized collection of computer


GEOSPATIAL INFORMATION hardware, software, geographic data, and person-
TECHNOLOGY AND nel designed to efficiently capture, store, up-date,
DISASTER MANAGEMENT manipulate, analyze, and display all forms of
geographically referenced information” (ESRI,
Geospatial Information Technology is used for 1993, pp. 1-2). A GIS is a key component of any
many disaster management functions, includ- effective and comprehensive disaster management
ing hazard and risk assessment (Dewan, Kabir, strategy, and are used to display, integrate, map,
Islam, Kumamoto, & Nishigaki, 2007; Ramli et analyse, and model data and information derived
al., 2005), vulnerability assessment (Cutter et al., from satellites, and other spatial data sources
2000; Kienberger & Steinbruch, 2005; Weichsel- (Kumar et al., 1999). GIS function primarily as
gartner, 2001), vehicle dispatch and supply routing a support tool to help answer essential questions
(Dong, 2005), damage assessment (Chen et al., and make informed, timely and appropriate deci-
2005; Rivereau, 1995; Zhang et al., 2002) and sions that can help save lives. Perhaps the greatest
resource mobilization (Goodchild, 2006), among strength of GIS are their ability to integrate a wide
many other essential tasks. In this section we ex- range of data types, including geographic, social,
amine the use of GIS, remote sensing and Internet economic, and political data into a single system
GIS in the area of natural disaster management. (Dash, 1997).
However, to utilize GIS not only requires the
software, but also hardware, data and trained per-
sonnel. Thus, a GIS capability involves not only
the software itself, but the necessary spatial and

183
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

descriptive (attribute) data, computer hardware the quality of which is determined by locational
and personnel who can effectively utilize it (Dash, precision, the characteristics of the attribute data,
1997; Montoya, 2002). As a result, the use of GIS and by the extent to which standards are adopted
for disaster management varies widely between that allow for data transfer (Coppock, 1995). The
individual countries, and within disaster related importance of good quality spatial data for use
organizations and institutions at different levels with GIS to assist disaster management cannot
of government (Laben, 2002). These differences be underestimated. There are several types of
in GIS capability are clearly reflected in the wide spatial information that are useful for disaster
range of literature that examines and describes management related decision making – the type,
the use of GIS for natural disaster management. scale and complexity of such data depend on the
This section of the paper will review the GIS type, scale and stage of the disaster (De La Ville
and disaster management literature in order to et al., 2002), as well as on the GIT capabilities
identify examples of GIS capabilities (software, of the implementing institution/organization. As
data, hardware, and level of expertise required) an example, Table 1 describes some of the vari-
that are feasible for implementation in developing ous types of spatial data/information applicable
countries. These are all important considerations for disaster management that can be utilized in a
that will be emphasised and discussed further. GIS environment.
Since there are distinct differences between Over the years, developments in spatial data
using a GIS for pre- or post-disaster manage- collection and use have led to the concept of
ment activities, this section has been divided in spatial data infrastructure (SDI). Musinquzi et al.
half, one-half that deals with pre-disaster GIS (2004) suggest that an ideal SDI includes “tech-
use and another that examines post-disaster use. nologies, policies, standards, and human re-
However, before these sections are presented it is sources to acquire, process, store, distribute, and
first necessary to discuss the importance of spatial improve utilization of geo-spatial information”
data, since it is the primary input for GIS and is (p. 790). Generally speaking, the “success of
required for their use. disaster management largely depends on the avail-
ability, dissemination and effective use of infor-
The Importance of Spatial Data mation,” (p. 3) much of which is spatial (Ven-
katachary et al., 2002). Accordingly, then, there
The success in utilizing GIS depends largely is a relationship between GIS, SDI and the extent
on the availability of spatially-referenced data;

Table 1. Crucial spatial data for disaster management (adapted from Gunes & Kovel, 2000)

Data/Information Type Description


Disaster forecast Information concerning the extent of a particular hazard or disaster
Vulnerability analysis Information on critical facilities (hospitals, schools, shelters, police and fire facilities, dams, trauma centers,
industrial facilities, etc.);
Information regarding human vulnerability (age, gender, socioeconomic status, etc.)
Damage assessment Data/imagery of the actual impact of a hazard
Resource inventory Location information regarding supplies, equipment, vehicles, or other material resources
Infrastructure Shows transportation networks (roads, railroads, bridges, traffic control points, and evacuation routes) as
well as complete utility grids (electric, gas, water, and sewer)
Mass care/shelter status Monitors the movement of people to and from government or voluntary agency shelters by providing infor-
mation on capacity, availability, supplies, and suitability to victims’ needs

184
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

to which spatial information is utilized for disas- The Internet has also drastically increased
ter management related purposes. access to spatial data, as numerous websites
Most developed countries are at various stages now offer free access to a wide range of data
of developing national SDIs, and their success (e.g., political/administrative boundaries, roads,
can be linked to high levels of technology, avail- hydrology, digital elevation, land cover, etc.).
ability of funds, trained personnel and political However, “despite the rapid growth in available
support and stability (Musinguzi et al., 2004). spatial data there has been little attention paid”
Many of these factors that contribute to success- to aspects of its quality, including “currentness,
ful national SDI development are lacking in most lineage, locational accuracy, completeness, and
developing countries, and as a result, there are vast overall usefulness,” (Engler & Hall 2007, p. 345).
differences in spatial data quality and quantity This is unfortunate, because the quality of spatial
between developed and developing countries. As data is particularly important when it is used for
well, without a national SDI there is no effective disaster management, when lives are potentially
mechanism for coordinating data collection ef- at risk. Although satellite imagery is an essential
forts or for sharing spatial data among agencies type of spatial data for disaster management, and
or departments. This often results in duplication is abundantly available online, its discussion is
of data and wastage of human/financial resources. reserved for subsequent sections, and spatial data
Limitations in accessing and sharing spatial data in the present context refers to vector data only.
are particularly problematic for disaster manag- Scale can also limit the usefulness of online
ers, whose spatial data needs typically cut across spatial data. Currently, data available for develop-
departmental/agency boundaries (Rego, 2001). ing countries is most often at the national scale
However, some developing countries are making and derived from small-scale map sources. Such
progress, including Sri Lanka, for example, who data is inappropriate for regional or local level
just recently (2008) initiated a national SDI (http:// use for a variety of reasons, including locational
www.survey-dept.slt.lk/). Their policy statement inaccuracy and a lack of attribute information.
makes specific reference to disaster management, Even if the scale of the data is appropriate, some
and how a SDI is required to reduce existing data may require additional time investments to
inefficiencies. clean, correct, or add the relevant attribute data
While there are distinct differences between before they can be effectively utilized. In addi-
developed and developing countries in terms of tion, problems can arise when trying to combine
the quality and quantity of currently available data from disparate sources due to variations in
spatial data, efforts are being made to improve coordinate systems and map projections. Engler
the current situation. Nebert (2004) points out and Hall (2007) emphasize that, in many cases, it
that “many national, regional, and international may be better to create or purchase the required
programs and projects are working to improve data elsewhere, as there may be too much work
access to available spatial data, promote its reuse, to be done or uncertainty associated with using
and ensure that additional investment in spatial data freely available from the Internet.
information collection and management results Spatial data are essential in all areas of disaster
in an ever-growing, readily available and useable management, yet good quality spatial data may
pool of spatial information” (p. 6). Although such be hard to come by, especially at the local level in
initiatives may not be part of a formal SDI policy, developing countries. To improve and increase the
they undoubtedly help the global GIS community use of GIS for disaster management will require a
and others who may use spatial data. substantial effort in data collection, one that can
be facilitated with an effective SDI policy. Rego

185
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

(2001) stresses that the development of spatial of floors and whether previous hazard damage
databases should be “built bottom up from the had occurred were among the attribute data col-
lowest administrative unit in country (e.g., the lected. The resulting database, when combined
sub-district or district). The district databases with historical data on previous disasters, such
would then feed into the state/provincial database as flood depth, was used to generate a variety of
and then into the national database,” (p. 2). A vulnerability and risk maps. The database and
bottom-up approach to spatial database produc- maps that can be generated serve as a basis for
tion is especially relevant for local-level GIT use future development and planning that takes into
in developing countries. account both biophysical and human vulner-
abilities. This study was conducted for the city
Pre-Disaster of Turrialba, Costa Rica, which has a population
just over 30,000. Data collection and analytical
A GIS has much to offer in the pre-disaster man- methods are relatively simple to repeat, making
agement phases of mitigation and preparedness. this an excellent example of the type of GIS use
To effectively mitigate and prepare for a disaster that is practical for local level disaster managers
requires not only detailed knowledge/informa- in developing countries.
tion about the “expected frequency, character, Cutter et al. (2000) present a county level
and magnitude of hazardous events in an area, “hazards-of-place” (Cutter, 1996) based GIS
but also the vulnerability of the people, build- method for assessing hazard vulnerability in spatial
ings, infrastructure and economic activities in terms. Using twelve environmental threats and
a potentially dangerous area,” (Van Westen & eight social indicators (e.g., age, race/ethnicity,
Hofstee 2001, p. 442). This information forms income levels, gender, building quality, etc.) their
the cornerstone of preparedness planning and study demonstrates how a GIS can be used to
helps determine appropriate mitigation strate- integrate both biophysical and social factors that
gies. A GIS allows for the synthesis and analysis contribute to hazard vulnerability. Social charac-
of such data/information to help determine risk teristics of the population are available from most
levels, assess vulnerability, model scenarios, plan national censuses, and census data is often used
evacuation routes, determine resource require- in a GIS to map human vulnerability to various
ments, and create a variety of useful information natural hazards. The research methodology and
products to aid decision-making. Some specific conceptualization of hazard vulnerability provides
examples of GIS use in the pre-disaster phases are a template for others to follow, and helps fill a “void
particularly relevant for developing countries and in the literature on spatial analytical approaches to
so we elaborate upon these as cases of effective vulnerability,”(Cutter 1996, p. 530) assessment. If
GIT implementation in pre-disaster management. vulnerability to natural hazards can be identified,
Van Westen and Hofstee (2001) present a appropriate steps can then be taken to reduce the
case study that used GIS to create a spatial data- social and economic impacts of potential disasters.
base of buildings, land parcels, roads and other However, the authors acknowledge that imple-
infrastructure by digitizing features from aerial menting this approach at the “local level may
photographs. Subsequently, a field team of in- pose challenges in terms of availability of funds
vestigators collected specific attribute data for for training and data acquisition,” (Cutter 1996,
each digitized land parcel and building, using a p. 530). Furthermore, results are based on a data
pre-determined checklist. For example, land use intensive methodology, and while this approach
of the parcel (residential, industrial, commercial, is possible in developed countries, it is unlikely
etc.), building material, building age, number feasible in most developing counties due to a

186
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

lack of detailed spatial data, in addition to other ing a relatively simple procedure that involved
GIS implementation issues previously described. assigning a weighted score to each data type (to
However, it seems likely that this methodology represent varying significance) and the use of
could be simplified and still produce useful hazard GIS overlay functionality, the authors were able
vulnerability information. to create maps depicting flood hazard potential
Guinau et al. (2005) used a GIS to create a haz- and flood risk zones. The data collection methods,
ard susceptibility map to help mitigate landslide analytical techniques and general approach used
risk. The methodology applied focused solely on in this study demonstrate innovative and adaptive
the biophysical factors that cause landslides. The use of GIT in the context of a city-scale hazard
authors digitized present and past landslides from assessment for developing countries.
aerial photographs to create a landslide inventory, Uncontrolled and informal housing develop-
which was then “overlaid on terrain data, such as ment is a recurring problem in cities in developing
lithology, slope, soil characteristics and land use” countries (Thomson & Hardin, 2000), and such
analysis of terrain conditions in areas affected by development practices contribute to increasing
landslides made it possible to determine zones natural hazard vulnerability. To address this prob-
with similar characteristics,” (p. 320) and through lem, Thomson and Hardin (2000) used GIS and
further analysis the delineation of low, medium satellite imagery (LANDSAT TM) to “identify
and high susceptibility zones. Perhaps most im- potential low income housing sites in the eastern
portant about this study is that it demonstrates portion of the Bangkok Metropolitan Area, where
a relatively simple GIS-based methodology to flood risk is a concern. Location, infrastructure,
assess landslide vulnerability that is feasible for land cover, and natural environment factors were
developing countries. used to assess the characteristics appropriate for
Dewan, Islam, et al. (2007) integrated GIS and public housing sites. Due to a shortage of spatial
remote sensing techniques to analyze the flood data, all GIS coverage required were derived
hazard and risk levels in Dhaka, Bangladesh. A from satellite imagery or digitized from map
major impediment was unavailable digital geospa- sources, including land use, land parcels, roads
tial data, and as a result, a number of the required and drainage,” (p. 102). To derive land cover,
data layers had to be created. “Flood-affected for example, a combination of techniques was
frequency and flood depth were estimated from used, including unsupervised (to identify spectral
multi-date Synthetic Aperture Radar (SAR) data,” clustering) and supervised classification based on
(Dewan et al. 2007a, p. 1602) (from RADARSAT), local knowledge, and the use of aerial photographs
based on previous flood events. Land-cover was for verification. Major roads were digitized on-
generated using a combination of approaches and screen in the GIS from the satellite imagery, and
data sources, including on-screen digitization of the remaining data were generated using a variety
features from high resolution remotely sensed of low-tech methods, including visual image
imagery, recently produced topographic maps, interpretation (and digitization) and field work.
field surveys and the use of hand-held GPS units. Using basic GIS functions, such as overlay and
A geomorphic map was also developed using a buffering, a final map was produced that identi-
1999 LANDSAT TM image in conjunction with fied “sites possessing appropriate criteria for
available paper maps and field observations. El- low-income housing: non-built, greater than 25
evation data in the form of a DEM was obtained hectares, within 1 km of a road, and least prone
from the Institute of Water Modeling (IWM), to flooding,”(Thomson & Hardin 2000, p. 98).
Bangladesh. All the vector data layers were then The map can be effectively utilized at large and
converted to raster at identical resolutions. Us- medium scales (e.g., 1:20,000 – 1:50,000), and the

187
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

analysis methods and techniques are considered having a database of the most critical and useful
feasible for cities in developing countries, in terms spatial data before a disaster strikes is essential.
of available data and required skill sets. A better It allows for quick updates of the data to reflect
understanding of suitable locations for potential ground conditions and reduces the time it takes
housing sites could help guide development in to produce critical information products required
ways that reduce natural hazard vulnerability. by disaster responders and related emergency
Knowledge of the spatial characteristics response personnel.
of hazards and vulnerability before a disaster Following the initial response period, and once
strikes enables disaster management authorities the situation has stabilized, a GIS can be used to
and emergency workers to identify areas that are analyze disaster impacts and help plan the reha-
likely to be most affected, and thus efficiently bilitation process in a way that reduces potential
focus their resources. Such knowledge is critical vulnerabilities. Development of this fashion is
to the development of effective mitigation and termed ‘invulnerable development’ (McEntire,
preparation strategies that will ultimately help 2001). The following few examples demonstrate
reduce the devastating impacts that can result the value and necessity of GIS during the post-
from natural disasters. disaster management phases.
Impact analysis is one fundamental use of GIS
Post-Disaster in the post-disaster phase. Impact analysis can
assist response efforts by identifying those areas
Disaster responders, emergency personnel, aid most in need, and can help guide reconstruction
workers and anyone involved in the response and efforts in a way that will minimize the potential
relief effort need timely and up-to-date informa- for future disasters; for example, through im-
tion, such as the extent of damage and location of proved land use planning that takes into account
potential victims, the location of critical facilities local hazard vulnerabilities. In conjunction with
(e.g., shelters, hospitals, air strips, etc.), available IKONOS panchromatic images, De La Ville et
resources (e.g., food, water, blankets, medical al. (2002) used GIS to evaluate the distribution
supplies, etc.), infrastructure conditions (e.g., of landslide erosion scars and their effects on
damaged roads/bridges/utility lines, etc.), and several urban areas situated among six mountain
evacuation or supply drop off points. Much of catchments in Venezuela. The GIS was used to
this information is spatial and is thus well suited analyze and map the distribution of scars and
to be compiled and analyzed in a GIS and then deposition zones, and to study the factors (e.g.,
disseminated as maps. However, using GIS post- slope, geology, land cover, etc.) that contributed
disaster is different and poses greater challenges to causing the various types of mass movements.
than using it pre-disaster, since time becomes the All the information derived from this study was
critical factor (Goodchild, 2006). Immediately used by the corresponding government agencies
following a disaster, information must be quickly as the basis for the preparation of reconstruction
collected, analyzed and assembled into useful plans for the affected areas.
information products that can assist response Following the Indian Ocean tsunami of 2004,
efforts. Thus, to the extent possible, GIS data and after evoking the International Charter on
should be collected and/or analysed in advance of Space and Major Disasters to acquire pre- and
an event, rather than having to put it all together post-disaster high resolution satellite imagery,
during the aftermath (ESRI, 2006). Bottlenecks Magsud et al. (2005) used a GIS for rapid damage
in information flow and dissemination can mean assessment of buildings in Galle, Sri Lanka. The
the difference between life and death. Therefore, GIS was used primarily to combine different data

188
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

types and to support a visual analysis of building sheltering, and infrastructure repairs. However, a
damage. QuickBird multi-spectral imagery and a large number of spatial data layers are required
1:5000 scale vector layer of buildings (obtained for planning and coordinating such operations,
from the Survey Department) were overlaid in a and without them the value and usefulness of a
GIS to accurately identify construction that existed GIS decreases substantially, possibly to a point
prior to the disaster. In addition, a ground survey where it may no longer be required. Therefore,
of 81 buildings was undertaken that identified the to some extent, the usefulness of GIS following a
level of damage following the disaster (“complete- disaster hinges upon the existence of the required
ly destroyed, partially damaged mainly inside, framework datasets like roads, critical facilities,
partially collapsed with roof intact, and slightly and population density.
damaged,”(Magsud et al. 2005, p. 1)). GPS was Finally, Zerger and Smith (2003) emphasize
used in conjunction with ground photographs to that “the suitability of GIS for planning verses
accurately record the location and building dam- real-time applications is quite different. Results
age, respectively. This allowed for a comparison from a test scenario indicated that the utility of
between the damage level in the photographs and GIS for real-time decision making is questionable
satellite imagery. Results indicated that heavily owing to a number of practical and implementa-
damaged buildings can easily be identified, but tion impediments,”(p. 124) including a lack of
partial damage, particularly if the roof was still training “and the need for temporal resolution
intact, was difficult to determine from the satellite rather than spatial detail,” (p. 124). Using GIS
imagery alone. The mapped results clearly show for pre-disaster management functions, such as
the location of destroyed buildings, which is valu- vulnerability assessment or evacuation route plan-
able information that could be used to prioritize ning, poses fewer challenges than trying to use the
response operations. With an adequate amount technology post-disaster, when time is critical and
of technically trained personnel, the authors’ ground conditions may be constantly changing.
propose that a “near real time damage assess- Nevertheless, assuming there is adequate data and
ment could be possible,” (Magsud et al. 2005, p. personnel to effectively utilize GIS, few would
2). The analytical methods and data inputs used question its ability to assist disaster response and
in this study are relatively simple, yet effective, recovery operations.
and are therefore considered appropriate/suitable
for disaster managers in developing countries Remote Sensing
given that suitable data availability and delivery
methods are in place. The potential of remote sensing (RS) to provide
In general, GIS are well suited to provide valu- critical earth observation information for disaster
able information to assist disaster response opera- management (e.g., hazard assessment, disaster
tions. For example, GIS has been used to determine mitigation, preparedness, response and recovery)
the extent of a disaster and estimate damage (Ranyi has repeatedly been emphasized (e.g., Becking,
& Nan, 2002), organize resource inventories and 2004; Chen et al., 2005; Jayaraman et al., 1997;
their geographic distribution (Hussain et al., 2005), Mansor et al., 2004; Rivereau, 1995), and is
monitor shelter/refugee camp status and the state reflected in over 400 scientific articles between
of transportation infrastructure (Gunes & Koval, 1972 and 1998 (Showalter, 2001). For instance,
2000) and integrate disparate spatial data sources remotely sensed imagery can answer questions
that may be required to guide response (Amdahl, such as: “what did/does the area look like pre-/
2001). As such, GIS can help with search and res- post-disaster? RS data can show the land cover
cue, providing medical services, debris removal, and topographic features in an area and can il-

189
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

lustrate infrastructure and population density,” the study of natural hazards. DEMs are a critical
(Becking 2004, p. 1). Showalter (2001) provides data input for assessing landslide susceptibility
an in-depth review of the use of RS in hazard and (Guinau et al., 2007), delineating flood risk po-
disaster research, and “concludes that the tech- tential (Dewan, Kabir et al., 2007), flood hazard
nique is primarily used to detect, identify, map, mapping (Dewan, Islam et al., 2007; Sanyal &
survey and monitor existing hazards and/or their Lu, 2003) and for a variety of coastal hazard (e.g.,
effects;” secondary goals of RS focus “on damage tsunami, storm surges, etc.) and disaster assess-
assessment, improved planning, or the provision ment purposes (for example, see Chen et al. 2005).
of data for disaster management functions,” (p. DEMs are also commonly used to derive new
22). Simonovic (2002), in examining the repeat data that is required for specific types of disaster
frequency, spatial resolution, and types of sensors management related analysis or visualization. The
on-board, provides insight into the suitability of types of new datasets that can be generated include,
specific satellites to different natural disasters. but are not limited to: slope, aspect, contour lines,
Multispectral scanners (optical sensors) and radar flow direction, flow accumulation, watersheds,
collection systems are probably the two most solar insolation, viewsheds, hillshade visualiza-
widely recognized RS capabilities that are used tions, and many others. The most important factors
in support of disaster management (GDIN, 1997). that determine the suitability of a DEM for any
The applicability of remote sensing for disaster particular disaster management related application
management is perhaps best exemplified in the are the spatial resolution and vertical accuracy.
case of flooding, and many researchers address The potential of remotely sensed data to as-
its use for this disaster type. Satellite imagery sist disaster management is very clear, however,
can be used to assess the extent of past flood there are some limitations and potential obstacles,
events (Dewan, Islam et al., 2007) and aid in including image resolution, repeat frequency (tem-
the development of flood hazard potential maps. poral resolution) and the suitability of particular
Zhang et al. (2002) and Jayaraman et al. (1997) sensors. Perhaps the most significant obstacle
highlight the potential of RS technology to drasti- facing the use of satellite imagery for a number
cally assist flood response and relief operations of disaster management requirements is low pixel
by providing inundation mapping and damage resolution. In large urban centers, satellite image
assessment. Disasters resulting from floods are resolution can be less than 1 meter, but in more
a logical choice for RS analysis because: (1) remote areas and many parts of the developing
floods generally cover large areas, and thus oc- world, only 15-30 meter Landsat or ASTER im-
cur at spatial scales much larger than the spatial agery are publicly available (Nourbakhsh et al.,
resolution of most satellite imagery, and (2) water 2006). Low image resolution can limit the range
has a unique spectral reflectance which makes it of potential uses of RS data, including using it
clearly discernable from other ground features as a source to create digital datasets of basic, yet
(Showalter, 2001). In contrast, earthquakes, for fundamental information for disaster management.
example, may cause significant damage to build- For example, Rüdenauer and Schmitz (2005) used
ings and infrastructure, but without high resolution supervised classification techniques and found that
imagery or change detection capabilities it can be Landsat 7 ETM+ data (30m resolution) could be
difficult or impossible to identify. used to identify and separate out settlements, but
Remotely sensed digital elevation data (often that the detection of linear features such as roads,
termed DEM (digital elevation model) or DTM railways and waterways was unsatisfying. Inte-
(digital terrain model)) are a digital representation grating the panchromatic band (15 m resolution)
of surface topography and are frequently used in into the analysis failed to significantly improve

190
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

results. Thus, while Landsat imagery can be Recognizing the importance of high resolu-
used to identify settlements it cannot be used as tion satellite imagery in disaster management,
a source to accurately derive all spatial features many major space agencies and satellite operators
on the ground. signed the International Charter on Space and
Repeat frequency is also an important con- Major Disasters by the year 2001. This multilat-
sideration, as time becomes a critical factor in eral agreement stems from the fact that no single
post-disaster use of RS imagery, some satellites operator or satellite can match the data-related
may not be available within an appropriate/suit- challenges of natural disaster management. “The
able time frame (San Miguel-Ayanz et al., 2000). International Charter aims at providing a unified
For example, repeat frequency can vary from 12 system of space data acquisition and delivery to
hours (NOAA) to 35 days (ERS) (Simonovic, those affected by natural or man-made disasters,”
2002). In addition, Cutter (2003) points out that (Space Agencies 2005, p. 1). The charter has been
the “pre- and post-processing time for remote successfully evoked on numerous occasions by
sensing images may negate their use in immedi- countries all around the world – from Ecuador to
ate response activities,” (p. 442). Even if they India to Russia – providing indispensable high
are able to pass over a disaster affected region, resolution imagery for no fee. For example, in
cloud cover and the time of day (day vs. night) the aftermath of the Indian Ocean tsunami several
can constrain the use of optical sensors for image agencies and private companies provided RS im-
acquisition (Coppock, 1995). Therefore, the type agery for response and relief work (ESRI, 2006).
of sensor onboard a particular satellite as well as the DigitalGlobe provided 60-centimeter QuickBird
atmospheric conditions will partly determine the images, which were the highest resolution com-
suitability of satellites for a given disaster related mercial satellite images available. Although
requirement. Even partial cloud cover can be a developing countries cannot send satellites into
significant obstacle that limits the use of optical orbit, opportunities do exist for acquiring high
imagery. In cloudy conditions satellites employing resolution remotely sensed imagery for disaster
radar systems are required, but they too have their management purposes. However, Magsud et
limitations (Simonovic, 2002) and are at times not al. (2005) “point out that once satellite data are
suitable for acquiring the type of information that received by the coordinating office,” (p. 8) it is
may be required. Shadows can also be a limiting then up to the authorized user to effectively use
factor in both optical and radar imagery. Magsud the data – there are few mechanisms in place to
et al. (2005) found that shadows prevented the help countries in need of support to use the data.
visual identification of damaged buildings using In developing countries this can pose a problem
high resolution QuickBird imagery following the because potential data users are not fully aware
Indian Ocean tsunami of 2004. The same authors of the capabilities of GIS and remote sensing, and
also note that intact roof tops can be deceiving, as they require additional assistance in order to put
initially certain buildings appeared to be damage the data to good use. Moreover, the International
free, but upon ground truthing the building was Charter does not provide free imagery for pre-
essentially hollowed out, with the roof top and disaster GIT operations.
support beams being the only remnant. Accord- The Haitian earthquake of January 2010 was an
ingly, caution must be used when drawing initial example of the International Charter on Space and
conclusions based on visual imagery analysis, Major Disasters being invoked and imagery being
and ground truthing should be considered before acquired from satellites including Japan’s ALOS,
any significant actions are taken. CNES’s Spot-5, the US WorldView and QuickBird
platforms, Canada’s RADARSAT-2 and the ESA’s

191
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

ERS-2 and Envisat sensors. Moreover, the delivery significant amount of incremental knowledge
of these datasets was facilitated by Google (http:// gained about the critical nature of geospatial data
www.google.com/relief/haitiearthquake/geoeye. needs in a time of disaster and this recognition has
html) and GeoEye Inc. (www.geoeye.com) to le- led to a greater degree of cooperation in terms of
verage Google’s web-presence and bandwidth to data availability, timeliness and mode of delivery
provide 100s of GBs of high resolution GeoEye 1 for different practitioner communities.
and IKONOS images. In addition, many Google In sum, RS technology acts as a tool to gather
pages like the Google Earth Library (http://www. spatial information that can support many disaster
gelib.com/haiti-earthquake.htm) offered critical management functions, especially when combined
infrastructure data for direct viewing in Google with other spatial data (points, lines and polygons)
Earth formatted KML. The wide availability of in a GIS. Satellite imagery is critical for natural
post-disaster KML datasets allowed for rapid disaster response in remote and inaccessible
assessments by non-GIS experts, thereby, and it regions, or regions where primary (or basemap)
is too soon to tell, increasing the effectiveness of geospatial data are non-existent or difficult to
relief efforts. Overall, the geospatial response to collect. In such regions, satellite imagery often
the Haitian disaster was quite rapid when compared offers the best means of obtaining the necessary
to the 2004 Indian Ocean tsunami, largely due to information (Kerle & Oppenheimer, 2002). This
the need for rapid geospatial assessments during is particularly true in many developing countries
disasters. By way of illustration, in “December which tend to lack digital spatial data infrastruc-
2006, the United Nations General Assembly es- tures (Brodnig & Mayer-Schönberger, 2000;
tablished the United Nations Platform for Space- ESRI, 2006).
based Information for Disaster Management and
Emergency Response - UN-SPIDER,” (http:// Internet GIS
www.un-spider.org/haiti). A visit to UN-SPIDER
provides a plethora of metadata and links to di- Internet GIS (IGIS), the integration of GIS and
saster mapping for Haiti. Numerous other private the Internet, has quickly evolved over the last
organizations like ESRI Inc. (http://www.esri. decade and has achieved significant recognition
com/haiti/) created a central earthquake response within the disaster and emergency management
in support of Haiti, providing maps and mapping community for two main reasons. First, “GIS
support for the relief phase. By contrast, the first provides a way to centralize and visually display
web-based mapping and delivery platform for critical information,” (ESRI 2006, p. 4) relevant
the 2004 Indian Ocean tsunami was the Tsunami to a disaster, since “most of the data requirements
Disaster Mapping Portal launched on January are of a spatial nature and can be located on map,”
10th, 2005, and was led by DM Solutions Group, (Johnson 2000, p. 3). Second, the Internet provides
collaborators including researchers and scientists an ideal medium for multiple users with a range
from the Laboratory for Applied Geomatics and of backgrounds and skill sets to access spatial
GIS Science (LAGGISS), University of Ottawa, information and mapping capabilities. Radke et
Asian Institute of Technology (AIT), Pathum al. (2000) emphasize that “data acquisition and
Thani, Thailand, Chulalongkorn University, integration may be the single largest contribu-
Bangkok, Thailand, and Media Center, Osaka tion area needed for emergency and disaster
City University, Osaka, Japan. By comparison, response,”(p. 15) – IGIS can help address both
there were well over 45 map-based websites or these issues. During the response phase, access
data delivery sites 14 days after the devastation to pertinent spatial data/information is among
in Haiti. Our point here is that there has been a the most essential requirements (Amdahl, 2001;

192
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

Jayaraman et al., 1997). Disaster managers, lo- although some offer basic GIS functions such as
cal authorities, aid workers and the public need address matching, proximity searches, and route
up-to-date data/information to enable quick and planning like Google Maps for example. However,
effective decision making in the immediate af- more recent advancements are allowing for the
termath of a disaster. Organizations such as the development of Internet distributed GIS services
Global Disaster Information Network (GDIN) with the capabilities to “interact with multiple
“provide evidence of the importance and value of and heterogeneous systems and servers that sup-
disaster-related information, as well as the need to port more advanced GIS functions,” (Dragićević
be able to obtain and share it effectively,”(http:// 2004, p. 11). Such capabilities are reflected in the
www.gdin.org). rapidly developing standards for web-based GIS
The integration of GIS and the Internet first services (http://www.ogc.org).
began in the early 1990’s (Plewe, 1997), and has The characteristics and capabilities of IGIS
grown rapidly since that time. This growth is offer considerable potential in the field of disaster
partly attributable to advancements in computer management, particularly during the response
and information technology and the building of phase when access to spatial information is a
spatial data infrastructure (SDI) worldwide (Yang key requirement (Radke et al., 2000). Disaster
et al., 2005). It is therefore pertinent in this review impacts can significantly alter the landscape
to briefly define and describe the capabilities (natural and built environments), and often vary
of Internet GIS, discuss the implications of this across organizational, sociological, political, and
relatively new technology on the field of disaster geographic boundaries. This is especially true fol-
management, and examine its potential from a lowing large-scale disasters. Information about the
developing countries perspective. disaster zone needs to be disseminated to many
In this emerging field of Internet GIS, there is stake-holders involved in the response, at local,
no general agreement on the terms used to describe regional, national and international levels, both
GIS-based programs on the Internet. They are com- public and private (GDIN, 1997). Much of this
monly referred to as Internet GIS, web-based GIS, information is spatial, such as the location of the
distributed GIS and On-line GIS, among others. most devastated towns or villages, the location of
Peng and Tsou (2003) help to better differentiate supply drop-off points and the extent of in-tact
between these terms and classify the various types transportation networks. IGIS are well suited to
of applications. The term ‘Internet GIS’ (IGIS) fulfill these information requirements since (1)
will be used henceforth herein and is defined “as they allow for the integration of various spatial
network-based geographic information services datasets and (2) they can be accessed from any
that can utilize wired or wireless Internet protocols location with an Internet connection.
to access geographic information, spatial analysis The often ad-hock nature of disaster response
tools and GIS Web services,” (Peng & Tsou 2003, involves people and organizations (local to inter-
p. 3). It is worth noting here that although the term national) that require spatial information relevant
GIS is included with “Internet”, IGIS focus mainly to their own logistical operations. Comfort (2000)
on displaying geographic information (in map reports that during disaster response there are
form) as well as data dissemination tasks but tend often dynamic and spontaneous actions taken by
to lack comprehensive GIS capabilities common responding organizations and individual groups
to most desktop software. Kraak (2004) points out of people. These types of efforts are encouraged,
that most IGIS applications currently in use are but could be more effective and better coordinated
limited to (interactive) mapping (with zoom, pan, with an improved spatial awareness of the disaster
measure distances, identify spatial features, etc.), zone, achieved through the use of appropriate

193
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

spatial information and maps accessed using of organizations through distributed servers/
IGIS-based websites. Becking (2004) states that sources. An IGIS-based mapping system that is
obtaining geographic awareness is one of the intuitive, user-friendly and easy to access could
most important steps in effectively understand- help provide geographic awareness to disaster
ing a disastrous situation, and being able to make responders, many of whom have limited or no
appropriate decisions. Traditionally, geographic GIS experience but could certainly benefit from
information has been distributed using paper maps, access to spatial information and maps. Andre and
which are costly to produce and difficult to update Smith (2003) note that disaster responders often
and distribute to all parties involved. require simple cartographic products (for example,
“Experience has shown that a top-down ap- the location of impassable roads identified on a
proach to data sharing is not entirely effective,” road network map) as opposed to products derived
(Radke et al. 2000, p. 23), “as disaster respond- from advanced spatial analysis.
ers must gain access to a number of department Experience to date indicates that IGIS has
managers and organizations, their unique maps played a key role in the collection and dissemi-
and data,”(Johnson 2000, p.3). Knowing which nation of spatial information during the disaster
data is where and how to effectively access it can response and recovery of recent natural disasters,
be a significant barrier. Furthermore, this type of including the Indian Ocean tsunami (2004) and
approach often results in duplication of efforts. Hurricane Katrina (2005). However, the success/
For example, following the Indian Ocean tsunami, failure of IGIS in that regard must be interpreted
many agencies created damage maps at almost the within the context of the very young IGIS field.
same time (ESRI, 2006). An alternative option With that caveat in mind, following these natural
is to utilize current IGIS architectures to create disasters IGIS sites were established that integrated
a disaster information system with wide acces- a large number of spatial datasets, including
sibility, which allows multiple users to access coastlines, satellite imagery, damage maps, trans-
relevant spatial data/maps provided by a variety portation networks, population centers and other

Figure 3. Examples of two IGIS-based natural disaster map viewers: a) South East Asia and Indian
Ocean Tsunami Response Map Viewer (produced by DM Solutions Group, http://www.dmsolutions.ca/
showcase/), and b) Hurricane Katrina Disaster Viewer (produced by ESRI Inc., Redlands California, USA)

194
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

data intended to provide a visual overview of the GEOSPATIAL INFORMATION


regions impacted by the disasters. Figure 3 shows TECHNOLOGY SOFTWARE
images of the main interface of two such sites.
The user friendly map based information We have reviewed many uses and barriers of geo-
systems created using IGIS can increase access spatial information technology (GIT) to natural
to relevant spatial information required during disaster management. However, there has been
disaster response. However, their design, develop- no discussion regarding specific aspects of GIT
ment and implementation require substantial GIS/ software, including the difference between com-
technical knowledge and significant human and mercial and free and open source (FOS) software
financial resources. As demonstrated earlier, in (FOSS), and the implications of these differences
many developing countries each of these require- for adoption within developing countries. Turning
ments represents a challenge, and could prevent our attention to that issue, we now discuss the
them from creating IGIS for disaster response. In main differences between commercial/proprietary
fact, the two known IGIS sites created to assist software and FOSS, and explore the significance
with the Indian Ocean tsunami response were not of these differences from a developing country
created by any developing country that was af- perspective. We also briefly discuss the FOS
fected by the disaster; one was created by a Ca- GIT software domain. Although there are some
nadian software development company (DM well documented drawbacks to using FOSS, and
Solutions Group – http://www.dmsolutions.ca) challenges to overcome, the potential benefits that
and the other by the Pacific Disaster Center, which can be gained far outweigh any downside, and
is based out of Hawaii. Both have significant GIT thus FOSS should be considered a viable model
resources and capabilities, and were able to es- (Fitzgerald, 2004; Hoe, 2006).
tablish these IGIS sites in a relatively quick pe-
riod of time. This is an important point, as it is Free and Open Source
unlikely that GIT practitioners in developing Software (FOSS)
countries have the expertise required to effec-
tively develop an IGIS site in a reasonable time While some consider “free” and “open source”
frame. In addition, Internet connections in many software to be something close but not identical,
parts of the developing world can be slow and they are similar enough that for the purposes of
patchy (Nourbakhsh et al., 2006), which reduces this paper they are considered the same. We can
the potential for accessing information using IGIS lump them together because for the general soft-
sites. ware user the difference is negligible, although
In general, the ability to develop an IGIS for philosophical and legal differences do exist
disaster response requires excellent GIT skills and (Rajani et al., 2003; Steiniger & Bocher, 2009).
a large amount of spatial data, which may exist at Although there are a number of criteria that a soft-
differing scales and in varying formats. Currently, ware package must meet to be considered FOSS,
the GIT requirements and skills associated with three essential features capture the essence of the
creating IGIS are beyond those of disaster manag- semiofficial “Open Source Definition” (http://
ers in developing countries, and we suggest that www.opensource.org/docs/osd):
trying to implement IGIS may not be the best use
of their limited resources. • “The source code must be distributed with
the software or otherwise made available
for no more than the cost of distribution.

195
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

• Anyone may redistribute the software for enhancing its existing functionality or by adding
free, without royalties or licensing fees to new functions. In contrast, proprietary software
the author. indicates ownership, and has its source code closed
• Anyone can modify the software or derive – it cannot be viewed, modified or redistributed, as
other software from it, and then redistrib- is stated in the EULA (End User License Agree-
ute the modified software under the same ment) that you must agree to prior to installation.
terms,” (Weber 2004, p. 5). Free, as in price, is perhaps the most significant
characteristic of FOSS that differentiates it from
Licensing agreements such as the General commercial software, which is inherently sold
Public License (GPL) define the rights users have for profit. Another characteristic of FOSS is that
over the software product (http://www.gnu.org/ it can be accessed (downloaded) from anywhere
licenses/gpl.html). Wu and Lin (2002) as well as with an Internet connection; this is typically not
Cook and Horobin (2006) provide further infor- true of commercial/proprietary software. Table
mation about the various FOSS licensing models. 2 outlines some of the key differences between
What makes FOSS different than proprietary proprietary and FOS software products, in terms of
or commercial software is that the source code advantages and disadvantages. With certain types
is “free.” “In this context free means freedom, of software, including GIT software, there are also
not necessarily zero price,”(Weber, 2004, p. 5) considerable differences in terms of functionality
although almost all are available at no cost. Free and ease of use.
software implies that the user has the freedom to The FOSS movement is continually gaining
run, view, copy, modify, and distribute a piece momentum, but has already garnered significant
of software, irrespective of financial limitations. attention in some areas. For example, “Apache
Thus, users can improve the source code, by either dominates the web server market, as over 65% of

Table 2. Key differences between proprietary and FOS software (modified after Steiniger & Bocher 2009)

Proprietary Software Free and Open Source Software


Advantages warranty of developing company on product no licence fees
(holds for every computer) unrestricted use (e.g., no limits for the number of installa-
components should work together tions)
well documented software no update enforcement
regular release times for new versions. support of open standards
regular service packs support usually available from several providers
customization at the API level customisation at source code level
platform independent
Disadvantages software price and maintenance fees installation know-how necessary in many cases
training costs training costs
maintenance fees tied to specific licensed com- interoperability issues between FOSS
panies, software options and time period quality (but self-correcting if actively used)
customised development can be difficult due to no responsible authority
available resources of vendors support can lack for some packages
support only as long as software company
exists
Some limitations on out-of-box functional-
ity where vendor partner’s are necessary for
upgraded functionality
reliance - retraining costs when software
versions change (e.g., ESRI’s ArcView3.2 to
ArcGIS 8.x) or data models change at vendor’s
whim or development cycle

196
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

all active web sites use it,” (Weber 2004, p. 6). custom development industry that can be utilized
Ramsey (2007) points out that its success can be in such circumstances.
linked to a powerful user community that is com- Another concern about FOSS relates to its
mitted to maintaining the Apache platform. Cor- long-term survival. Fitzgerald (2004) reports
porate mainstays such as IBM and HP, government that “studies of Freshmeat.net and Sourceforge.
agencies and academic contributors are all part net (two popular FOSS development websites)
of the Apache community. Other FOSS projects revealed that most projects have only one or two
that have become very popular include office developers, and that follow up studies reported no
suites such as OpenOffice, database systems like change in version number or size of code base for
MySQL, the Mozilla web browser, and the Linux many listed projects several months later,” (p. 95).
operating system. As of February 2009, there were This sort of vague analysis of the FOSS domain
over 230,000 registered projects and over can be deceiving, since there are so many FOSS
2,000,000 registered users at Sourceforge.net, the projects it is not surprising that most have only
world’s largest open source software development one or two developers, and that new versions are
website. In fact, most users can find an application not coming out regularly, especially if the par-
that will exactly meet their specific needs (Wu & ticular package is sufficient for its purpose. That
Lin, 2001). According to Wheeler (2007), who is the nature of the FOSS model, as it encourages
provides quantitative data, FOSS “reliability, individuals or small teams to develop and share
performance, scalability, security and total cost software. It is up to potential users of FOSS to
of ownership are at least as good as or better than examine individual products, consider potential
its proprietary competition, and under certain advantages and disadvantages, and choose what
circumstances are superior,”(Anderson & More- best fits their needs. Câmara and Onsrud (2004)
no-Sanchez 2003, p. 448). This evidence suggests examined FOSS GIT and identified many differ-
that FOSS can, and already is, rivaling certain ences that exist – in terms of support, maturity and
commercial software domains. functionality – between products led by a single
Although a strong case can be formed for the individual, products produced by small teams
FOSS model, it also has some often cited disad- and corporate led products. They conclude that
vantages compared to commercial software. For corporate led projects tend to be of better quality,
starters, software support and technical assistance at least from these three standpoints.
is a major issue facing the adoption of FOSS.
However, FOSS proponents are quick to point Free and Open Source Geospatial
out that support may be found within the FOSS Information Technology Software
community, in the form of user groups and “ar-
chives of past queries and answers available on the Turning our attention now to the specific do-
Internet,” (Ramli et al. 2005, p. 522). Installation main of FOS GIT software, there is reason for
and user documentation is also common among optimism. As opposed to questionable long-term
the more mature FOSS products (Steiniger & survival, Steiniger and Bocher (2009) emphasize
Bocher, 2009). Furthermore, “organizations that the increasing interest and development of FOS
deploy FOSS freely offer advice to one another, GIT products. They point out in their overview
sharing insights and lessons learned,” (Fitzgerald of FOS GIS (e.g., gvSIG, Quantum GIS, SAGA,
2004, p. 93). As a result, solutions for many typi- uDig, GRASS, etc.) that:
cal problems can often be found at no cost. When
they cannot, there is a growing FOSS support and a. Four out of ten desktop projects examined
receive governmental funding support;

197
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

b. There is an increase in the download rate of Resources Analysis Support System), MapServer
FOS GIS software; and and OSSIM (Open Source Software Image Map).
c. There is an increasing number of use cases Other programs focus on more specific tasks, such
of FOS GIS. as data management, format processing, geosta-
tistical analysis and data visualization. Currently,
Furthermore, Ramsey (2007) points out that the freegis.org website contains information and
“existing products are now entering a phase of links to over 300 FOS GIT projects. Ramsey
rapid refinement and enhancement… (FOS) soft- (2007) provides an excellent review of some of
ware can provide a feature-complete alternative the more mature projects within the FOS GIT
to proprietary software in most system designs.” software domain, categorized by development/
Some commercial software manufacturers are implementation languages, such as C, Java and.
even starting to back FOS GIT initiatives, which Net. For a more in-depth overview of current
is encouraging especially from a user support and FOS desktop GIS, in terms of organizations,
longevity perspective. In late 2005 the software software groups and functionality, see Steiniger
industry giant Autodesk, in association with the and Bocher (2009).
MapServer community and DM Solutions Group,
announced that it would support and promote open FOSS and Developing Countries
source web mapping (a form of IGIS) through the
creation of the MapServer Foundation (Schutz- From the perspective of developing countries
berg, 2005). “The Foundation is expected to pro- the FOSS model is a particularly good fit, for
vide a stable infrastructure for the now extended reasons that include: cost, freedom, accessibility,
MapServer family’s code base and its growing customizability, compatibility, capacity develop-
community,” (Schutzberg, 2005, p. 1). ment and reducing the overall so-called ‘digital
The FOS GIT software community is steadily divide’ (Hoe, 2006). The lower cost of FOSS is
growing, and since 2006 has been spearheaded by definitely the most significant factor that makes
the Open Source Geospatial Foundation (OSGeo) it attractive to developing countries (Rajani et al.,
(http://www.osgeo.org). “Their mission is to sup- 2003), with the absence of licensing fees seen
port and promote the collaborative development of as a major benefit (Hoe, 2006). The software
open geospatial technologies and data,” (OSGeo that comprises the commercial GIT domain is
2008, p. 1). The OSGeo hosts an increasing number particularly expensive. Table 3 provides a few
of software projects, publishes the OSGeo journal, examples of some popular commercial GIS and
founded an education and curriculum committee, remote sensing software applications and their
and presents the annual Free and Open Source general price ranges.
Software for Geospatial (FOSS4G) international Even the most basic and required proprietary
conference (Steiniger & Bocher, 2009). software necessary for many computing needs,
The comprehensive list of links to FOS GIT such as Windows XP and MS Office, can be too
related software projects/products available expensive to purchase in a developing country
at opensourcegis.org and freegis.org provides (Roets et al., 2007). With that in mind, it is not
evidence of an active development community. surprising that the highest software piracy rates
Some notable GIT programs, which offer a range occur in developing countries such as Vietnam,
of functionality from simple data/map viewing to China and Indonesia, with piracy rates of 97, 94
more advanced spatial analysis and Internet GIS and 89 percent, respectively (Rajani et al., 2003).
capabilities, include: Quantum GIS, DIVA-GIS, In such countries is not unusual to have new
OpenEV, uDig, gvSIG, GRASS (Geographic computers come with a range of pre-installed

198
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

Table 3. Approximate costs of selected commercial GIS/RS software (the wide price ranges reflect pos-
sible variations in license type, software options [software extensions], purchasing agreements, etc.)

Application Price range (USD) Software


High end GIS/RS $10,000 to $50,000 ArcGIS (ArcINFO)
Low end GIS/RS $1,000 to $3,000 Genasys
ENVI
ERDAS Imagine
ER-Mapper
MapInfo
ArcGIS (ArcView)
ArcView extensions
ERDAS Imagine Essential

pirated proprietary software (Weerawarana & services (Cook & Horobin, 2006). This will, in
Weeratunge, 2004). This evidence demonstrates turn, provide more jobs in the FOSS industry,
that users in developing countries most often attract trained professionals, and help stimulate
don’t, and perhaps more importantly, cannot afford the local economy.
to pay for computer software. Although piracy is The freedom to customize the source code is
very common, it actually devalues the economic also important from a developing country perspec-
benefits of FOSS by falsely reducing the price of tive. Krakowski (2006) points out that proprietary
proprietary software (Weerawarana & Weera- software developed within the Western world is
tunge, 2004). The zero cost of FOSS should be designed to fit well with Western culture. As a
considered a major advantage over proprietary result, it may not be well-suited for use in other
software in the case of developing countries, and regions of the world, where local customs and
could also help reduce their reliance, and inad- practices may be quite different. With FOSS,
vertent support and encouragement, of the illegal applications can be tailored to meet the specific
software market. needs of the users, and take into account cultural
The freedom to access and study the source variations and social practices that deviate from
code is another fundamental advantage of using industrialized societies. Similarly, “the majority of
FOSS. As a result, the choice to use FOSS is people in developing countries do not understand
not only a software choice, but also a means of English, yet proprietary software is often only
acquiring knowledge about the software itself. In available in English,” (Hoe 2006, (p. 6); Nonogaki
developing countries this is important for “capacity et al., 2004). Clearly, interface and documentation
building of the local population, and can help them language represents a major obstacle to users who
better understand and deploy new technologies do not understand English; however, FOSS can be
successfully,” (Hoe 2006, p. 31). Mohamed and adapted to meet local language needs and in doing
Plante (2002) emphasize that “local workforce so helps to tackle the digital divide. For example,
development and capacity building are critical Ubuntu is an African adjusted Linux distribu-
for system maintenance and operation over the tion that attempts to resolve language problems
long run,” (p. 2286). In addition, using FOSS (Krakowski, 2006). GNOME 2.22 – the default
can contribute to the development of the local desktop environment of Ubuntu – offers support
information and communications technologies for 46 languages. Similarly, multi-language capa-
(ICT) industry (of which GIT is a part of), as sup- bilities have been incorporated in recent versions
port, development and maintenance contracts can of GRASS and MapServer (Nonogaki et al., 2004;
be provided by local businesses that offer FOSS Raghavan et al., 2006) and gvSIG is available

199
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

in more than ten languages. Additional efforts tries, in terms of advantages and disadvantages, is
need to focus on translation of online help files, diverse and abundant. The preceding discussion
manuals and so on. is meant to provide only a brief overview of some
Another advantage of FOSS is that they usually well acknowledged and commonly discussed
do not require the newest or best computer hard- advantages that FOSS provide and elaborate on
ware in order to function efficiently and correctly. those specific to GIT. We acknowledge that there
In contrast, and especially in the GIT software are many more advantages to FOSS in general for
domain, proprietary companies are continually the developing world that are beyond the scope of
releasing new versions that demand the newest and this paper to discuss and so recommend reviews
most advanced hardware. In developing countries by Rajani et al. (2003) and Weerawarana and
most users do not have access to powerful, high- Weeratunge (2004), or Cook and Horobin (2006)
end computers with the most current hardware who examine FOSS from a public administration
available (Caldeweyher et al., 2006). Therefore, (government) perspective.
even if they could purchase sophisticated GIT
software they would also require computers with
sufficient RAM and CPU power to take advantage DISCUSSION
of the advanced capabilities that are offered in
most commercial GIT packages. Considering that disasters are spatial phenomenon,
A couple of final remarks about the potential it would be reasonable to assume that GIT can
of FOSS for developing countries relate to acces- be effectively employed throughout all phases
sibility and unlimited installations. First, FOSS to assist the disaster management process. While
can be downloaded (accessed) from anywhere this is true in the case of developed countries,
with an Internet connection; and where the In- the same cannot be said for developing coun-
ternet is unavailable FOSS can be delivered on tries. The use of GIT for disaster management in
a CD or provided via a data storage device such developing countries is limited by a number of
as a USB or external hard drive. This makes well documented barriers, including: a lack of
computer software accessible to potential users financial resources (Renyi & Nan, 2002), a lack
who would have absolutely no ability to purchase of local GIT expertise/knowledge (Mohamed &
or otherwise obtain such software. Second, the Plante, 2002), institutional/political instability
freedom to install FOSS on as many computers (Pande, 2006), and a shortfall of spatial data
as is desired is another fundamental benefit not (ESRI, 2006; Murgia et al., 2002). The degree to
offered by proprietary software. If a single use which each of these GIT implementation barriers
license is purchased from a proprietary company limits the use of GIT varies between developing
only one computer can legally run the software, countries, and within each country depending on
and multi-use licenses are expensive and con- the administrative level.
sidered unaffordable for most software users in We have examined the literature and discussed
developing countries, even governments (Roets various aspects of research on natural hazards and
et al., 2007). This limits the accessibility of the disaster management. Developing countries are
software, preventing it from being installed and highly vulnerable to natural disasters, a vulner-
used on more than one computer. In an office en- ability that could be addressed partially through
vironment, for example, this is likely to represent GIT. However, there are numerous barriers to
a significant problem. GIT implementation in the developing world.
In sum, the literature that addresses issues re- We reviewed the application of GIT in natural
lated to FOSS and its adoption in developing coun- disaster management and focused on describing

200
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

the significant potential for GIS, RS and IGIS is now multi-national) history it has evolved into
to assist the disaster management phases. We a sophisticated and powerful raster/vector GIS,
discussed the concept of FOSS, and argued that “used for geospatial data management and analy-
they have reached a stage a maturity that makes sis, image processing, graphics/map production,
FOSS a viable alternative model over commercial spatial modeling, visualization and much more,”
software, especially in developing countries. The (http://grass.itc.it/intro/general.php). Although
FOS GIT software domain is steadily growing GRASS is very powerful, its main drawback is
and there are already many mature projects that its unfriendly user environment, as it functions
currently offer a range of functionality (Raghavan primarily through command line operations which
et al., 2006; Steiniger & Bocher, 2009). can be a challenge for non-expert users (Ramli et
Given the continued growth of FOS GIT, we al., 2005; Steiniger & Bocher, 2009).
propose that its current availability offers increas- Furthermore, a key word search of ‘hazard’
ing potential to the local level disaster management and ‘disaster’, independently, of all presentation
practitioner community in developing countries. titles from the FOSS4G 2006 and 2007 http://
In this section we discuss why and how FOS www.foss4g2006(2007).org/) conference yielded
GIT can, and should, be implemented at the local only three that contained ‘hazard’ and one that
level in developing countries in order to improve contained ‘disaster.’ This is surprising given the
disaster management capacity, and propose future increasing frequency of natural disasters and the
research needs in this area. critical role that GIT plays within the various
The cosmopolitan existence of FOSS is a rela- management phases. However, although very few
tively new phenomenon, as are the applications presentation titles include the word ‘hazard’ and/
that are emerging. Currently, within the literature or ‘disaster’, it is likely that many of the topics
that addresses FOS GIT, there appears to be very discussed do contain recent research or software
little that examines its use in the specific research development relevant to the study and/or manage-
domain of natural hazards and disaster manage- ment of natural hazards and disasters. For ex-
ment, and this finding may be related to the fact ample, addressing how FOS GIS software can
that the entire FOS GIT domain is quite new contribute to SDI development is relevant to di-
compared with proprietary GIT software. Table saster management, since the required spatial data
4, containing search results from the SCOPUS are often lacking, although an explicit connection
database, provides quantitative evidence of the may not necessarily be made. Coppock (1995),
difficulty in finding literature that applies FOS in a study of GIS and natural hazards research,
GIT in the field of natural disaster management makes an analogous point, explaining that GIS
in developing countries. Although the concepts of and natural hazards are multidisciplinary fields,
GIT have existed for decades, and are now reflected and that much of what has been written appears
in a large commercial GIT software market, those in the literature of those disciplines.
concepts have only recently been transformed into While a review of the application of FOS GIT
capabilities within FOS GIT, with the exception in the area of natural hazards and disaster manage-
being GRASS. However, GRASS was originally ment is desirable, the lack of available literature
developed by the U.S. Army Corp of Engineers in (Table 4) makes this task nearly impossible.
the early 1980s and “is currently used in academic However, when you remove the FOSS component,
and commercial settings around the world, and by there is an abundance of literature that utilizes
numerous government departments (e.g., USGS proprietary GIT software, as was demonstrated
and NASA) (http://grass.itc.it/intro/general.php). in the previous section on GIT and disaster man-
As a result of its long use and development (which agement, and in Table 4. More common from the

201
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

Table 4. Literature themes search results from Scopus (http://www.info.scopus.com)

LITERATURE ADVANCED SEARCH PHRASES USED # OF RE-


THEMES (in: title-abstract-keyword) SULTS
Natural Hazards and
Disasters
((“disaster*” OR “natural disaster*” OR “natural hazard*” 254,109
OR “environment* hazard*” OR “flood*” OR “earth- 459, 881
quake*” OR “storm*” OR “hurricane*” OR “tsunami*” OR
“landslide*”))
+ “hazard*” (which includes hazards that are not of natural
origin)
Natural Hazards and Disasters ((“disaster*” OR “natural disaster*” OR “natural hazard*” 54,283
+ OR “environment* hazard*” OR “flood*” OR “earth-
Disaster Management quake*” OR “storm*” OR “hurricane*” OR “tsunami*”
OR “landslide*”) AND (“disaster* manage*” OR “hazard*
manage*” OR “mitigation” OR “prepare*” OR “respon*”
OR “relief”))
Natural Hazards and Disasters ((“disaster*” OR “natural disaster*” OR “natural hazard*” 4,994
+ OR “environment* hazard*” OR “flood*” OR “earth-
Disaster Management quake*” OR “storm*” OR “hurricane*” OR “tsunami*” OR
+ “landslide*”) AND (“disaster* manage*” OR “hazard* man-
Geospatial Information Technology (GIT) age*” OR “mitigation” OR “prepare*” OR “respon*” OR
“relief”) AND (“geo* info* sys*” OR “GIS” OR “geoinfo*”
OR “3S” OR “geospatial” OR “geo* info* tech*” OR “re-
mote sens*” OR “RS” OR “earth observation*” OR “EO”
OR “satellite” OR “spatial*”))
Natural Hazards and Disasters ((“disaster*” OR “natural disaster*” OR “natural hazard*” 769
+ OR “environment* hazard*” OR “flood*” OR “earth-
Disaster Management quake*” OR “storm*” OR “hurricane*” OR “tsunami*” OR
+ “landslide*”) AND (“disaster* manage*” OR “hazard* man-
Geospatial Information Technology (GIT) age*” OR “mitigation” OR “prepare*” OR “respon*” OR
+ “relief”) AND (“geo* info* sys*” OR “GIS” OR “geoinfo*”
Developing Countries OR “3S” OR “geospatial” OR “geo* info* tech*” OR “re-
mote sens*” OR “RS” OR “earth observation*” OR “EO”
OR “satellite” OR “spatial*”) AND (“develop* countr*” OR
“develop* region*” OR “develop* nation*” OR “transition*
countr*” OR “third world” OR “less develop*” OR “East*
Europe” OR “Africa” OR “Asia” OR “South America” OR
“Central America” OR “China” OR “India” OR “Pakistan”
OR “Indonesia” OR “Malaysia” OR “Thailand” OR “Ban-
gladesh”))
continued on following page

FOSS perspective is literature that emphasizes the believe that more and more examples of successful
great potential it offers in areas such as disaster FOSS implementation will undoubtedly emerge
management, without going into any more detail in the coming years. This belief is based on the
about specific applications or examples of suc- increasing interest in FOSS and overall growth of
cessful usage. Thus, further research is required the user and development communities, through
regarding the current capabilities of FOS GIT, and initiatives such the OSGeo and websites like
more specifically, how they can be used by the opensourcegis.org and freegis.org.
disaster management community in developing Among the broad set of Millennium Develop-
countries. Again, we emphasize that sufficient ment Goals that the United Nations established
time has not elapsed to allow for the creation of in 2000, one stands out: “Make available the
a literature base in this research area, yet strongly benefits of new technologies, especially informa-

202
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

Table 4. Continued

Natural Hazards and Disasters ((“disaster*” OR “natural disaster*” OR “natural hazard*” 15


+ OR “environment* hazard*” OR “flood*” OR “earth-
Disaster Management quake*” OR “storm*” OR “hurricane*” OR “tsunami*” OR
+ “landslide*”) AND (“disaster* manage*” OR “hazard* man-
Geospatial Information Technology (GIT) age*” OR “mitigation” OR “prepare*” OR “respon*” OR
+ “relief”) AND (“geo* info* sys*” OR “GIS” OR “geoinfo*”
Developing Countries OR “3S” OR “geospatial” OR “geo* info* tech*” OR “re-
+ mote sens*” OR “RS” OR “earth observation*” OR “EO”
Free and Open Source Software (FOSS) OR “satellite” OR “spatial*”) AND (“develop* countr*” OR
“develop* region*” OR “develop* nation*” OR “transition*
countr*” OR “third world” OR “less develop*” OR “East*
Europe” OR “Africa” OR “Asia” OR “South America” OR
“Central America” OR “China” OR “India” OR “Pakistan”
OR “Indonesia” OR “Malaysia” OR “Thailand” OR “Ban-
gladesh”) AND (“open source” OR “OS” OR “open source
software” OR “OSS” OR “free software” OR “FS” OR “free
and open source” OR “FOS” OR “free and open source
software” OR “FOSS” OR “F/OSS” OR “free/libre and open
source software” OR “FLOSS” OR “open source software/
free software” OR “OSS/FS” OR “FOSS4G” OR “GRASS”
OR “gvSIG” OR “Quantum” OR “SAGA” OR “uDig” OR
“ILWIS” OR “JUMP” OR “DIVA” OR “KOSMO”))

tion and communications technologies” (http:// for novice users, but recent developments in the
www.un.org/millenniumgoals). GIT are within availability of packaged solutions, such as FW-
the domain of ICT, yet peppered throughout this tools (see maptools.org), and commercial support
current review are a number of barriers that can have helped reduce implementation obstacles. In
limit GIT implementation in developing countries. addition, many FOS GIT software products are
We propose that FOSS can reduce some of these available as a single executable file, and once
barriers, and in doing so, can also help in achiev- downloaded, are installed in the same manner as
ing the development goals established by the UN. proprietary software (e.g., point and click). Albeit,
Primarily, FOSS eliminates the need to pur- some software versions designed for Linux or
chase expensive software licenses and makes GIT other FOSS operating systems do require specific
accessible to anyone, including local level disaster installation instructions and a certain amount of
managers operating on very small budgets. This computer knowledge. Furthermore, that FOSS
point is of fundamental importance when consid- can be an effective educational/training tool is an
ering that in many cases “disaster management idea that is widely acknowledged, given that they
responsibilities and duties are decentralized to can be accessed at no cost and experimented with
local governments without being accompanied by in whatever way users see fit. Local workforce
sufficient funding,” (Montoya & Masser 2005, p. development and capacity building projects are
494). Second, there exists a lack of local expertise essential for sustained GIT implementation and
and knowledge that is required to implement and maintenance that does not rely on external sup-
maintain the use of GIT. FOS GIT software could port or funding.
be used to learn about the technology itself, and In general, disaster management is dependent
the potential ways in which it could be used for on the “functional and effective operation of
natural disaster management purposes. Ragha- institutions, whether formal or informal, and at
van et al. (2006) emphasize that previous FOS the local level where it matters most,” (Pande,
GIT technology transfer was an arduous task 2006, p. 426). However, government instability

203
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

is common in developing countries, which leads Many of the barriers to successful GIT imple-
to institutional instability (Murgia et al., 2002), mentation in developing countries can potentially
which can result in a lack of commitment to ad- be addressed with FOSS. Currently, there are many
dress disaster management requirements, such as basic FOS GIT software available and others are
providing adequate technology and training. As a undergoing rapid development (Raghavan et al.,
result of this instability, Ramasubramanian (1999) 2006). However, there still remains a need to iden-
suggests that GIS implementation in developing tify particular software products, establish feasible
countries is likely to be more successful if it re- methodologies and workflows, and in general
lies on the capacity of “empowered individuals document specific examples of FOSS usage. While
and groups rather than solely on organizational few would argue against the potential of FOSS,
structures,” (p. 359) such as public institutions. there exists a research need to clarify the role of
To some extent, FOS GIT could be harnessed to FOS GIT during each of the phases of the disaster
empower such groups, and allow them to benefit management cycle. Regardless, a lack of attention
from technology without having to rely only on to the aforementioned GIT implementation barri-
institutional initiatives and financial support. ers “in any decision on whether to adopt GIT will
Lastly, and perhaps most importantly, FOS significantly increase the risk of system failures,
GIT offers considerable potential to help build a matter of some significance in any environment
spatial data infrastructures required to fulfill basic characterized by a lack of financial, technical and
disaster management mitigation and preparation human resources,” (Sliuzas 1999, p. 3).
planning, and following a disaster, speed response This review suggests that the successful ap-
and recovery efforts. It has already been shown plication of GIT in natural disaster management
that many essential disaster management require- can require significant quality spatial data, GIT
ments hinge upon the existence of required spatial knowledge and technical skills, and/or advanced
datasets, such as critical facilities, transportation functionality. Each of these requirements consti-
infrastructure and population distribution (loca- tutes significant obstacles from a developing coun-
tion of towns and villages), yet many of these try perspective, especially at any administrative
key datasets are missing in developing countries. level other than national. Another noteworthy fact
Especially for “assessing the situation in more is that the use of proprietary GIT typically requires
remote regions, which are frequently only reached significant computing resources. This represents a
days or even weeks after an event, a detailed huge obstacle in developing countries, which often
inventory of settlements and infrastructure may do not have access to modern (i.e., less than 2-3
save many lives and speed up recovery,” (Kerle years old) computer hardware (Caldeweyher et al.
& Oppenheimer 2002, p. 142). Montoya and 2006, p. 885). GIS users in developing countries
Masser (2005) emphasize the need to identify or are likely unable to take advantage of advanced
develop “cost-effective data collection methods spatial analysis and data visualization features
for producing spatially referenced information prevalent in current proprietary GIS software. In
in developing countries,”(p. 495). Mansourian et comparison, FOS GIS software generally require
al. (2004) propose the development of a SDI as less processing power, and are more suitable for
a framework to facilitate disaster management, users who do not have modern computers. Thus,
and many currently available, and user friendly, while advanced GIT usage is feasible in developed
desktop FOS GIS (Steiniger & Bocher, 2009) countries and within academic/research environ-
have the necessary functionality to assist the SDI ments (including some in developing countries),
development process. for the typical disaster management practitioner

204
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

in developing countries this type of high-end GIT developed countries adopt the technologies and
software usage is unachievable (Cutter, 2003). methods of the more advanced countries (Britton,
Although we have argued that currently avail- 2000). However, this approach is increasingly
able FOS GIT have great potential, we do acknowl- being recognized as ineffective (Brewer et al.,
edge that proprietary GIT software, at this point, 2005; Heeks, 2002). Although GIT implemen-
offer superior overall functionality and provide tation successes are often reported in “popular
advanced analytical and visualization capabilities magazines or trade journals, their sustainabil-
that are beyond what current FOS GIT software can ity is seldom documented, and failures are not
provide, with GRASS being a possible exception. reported or glossed over to avoid criticism,”
However, GRASS is notoriously difficult to use (Ramasubramanian 1999, p. 360). Heeks (2002)
and requires an advanced GIT skill set, and as a explains that the “introduction of GIS in develop-
result, it is not suitable for use at the local level ing countries has been problematic, as they are
in developing countries where such expertise is seen to incorporate a number of assumptions and
in short supply, and where an inability to success- requirements that derive from Western rational-
fully use the software may result in frustration ism,” (p. 107). One way to possibly improve the
and discouragement. To avoid such a result, we success of technology transfer initiatives is through
suggest that simpler FOS GIT software could be participatory approaches. Within the GIT domain,
implemented and utilized more successfully, and GIS has garnered considerable attention from a
at the same time provide sufficient functionality participatory standpoint and thus will be used as
to improve current disaster management GIT an example. In terms of developing countries,
requirements at the local level. In addition, com- Abbot et al. (1998) define PGIS as “an attempt
bining a number of FOS GIT software together to utilise GIS technology in the context of the
can extend the functionality in the long term. GIT needs and capabilities of communities that will
usage must not be over ambitious in the early be involved with, and affected by, development
stages and should focus on the core requirements projects and programmes.” A central objective
needed to improve existing disaster management of the participatory approach is to capture local
capabilities. Osti et al. (2008) make a similar knowledge and combine it with more traditional
point, and suggest that “low-tech systems, which spatial information, and to facilitate GIS produc-
are less cost intensive and can be handled by the tion and use which are community-based. Creating
local population,” (p. 105) are of high interest in GIT environments that pull in new ideas, and pos-
the context of developing countries. sibly new spatial methods and techniques, rather
Within the hazard and disaster management than having them pushed in from the outside,
literature that addresses developing countries appears to be a more effective solution (Britton,
and GIT implementation issues there are a couple 2000). Developing countries could benefit from
additional topics that deserve mentioning: the par- international development programs that focus
ticipatory approach (e.g., PGIS) and the concept of on capacity building in the area of FOSS-based
traditional/indigenous environmental knowledge. participatory GIT implementation. Knowledge-
Both of these topics are also partly embedded able practitioners in developing countries could
within the technology transfer literature, and then further build local capacity, especially with
although not within the scope of this paper, we FOSS, which have been shown to be a good fit
will briefly touch on these two topics since FOSS for developing countries. If appropriately utilized,
has a potentially large role to play. Rambaldi et al. (2006) emphasize that participa-
With technology transfer there is a tendency tory approaches could exert “profound impacts on
to assume that development occurs when less

205
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

community empowerment, innovation and social In the past, there has been unwillingness to
change,” (p. 1). use ‘non-scientist’ or ‘indigenous knowledge’ as
Kienberger and Steinbruch (2005) applied data. However, there is growing recognition of
a PGIS approach in Mozambique, in which the the importance of indigenous knowledge, and the
main objectives were to “acquire spatial data of need to better understand how it can be combined
the communities through GPS surveys,”(p. 5) to with Western knowledge to reduce natural hazard
better visualize certain aspects of the community vulnerability (Mercer et al., 2007; Pande, 2006).
to assist “disaster risk committees, and to identify Local communities in developing countries rely
secure locations in the case of floods,” (p. 5). Re- heavily on their environment; “they depend on the
sults indicate that the participatory approach was subtleties of ecological cycles and patterns, and
suitable “for the collection of spatial information have accumulated a body of wisdom commonly
and to capture the perception of the local people referred to as ‘traditional environmental knowledge’
regarding disasters,”(Kienberger & Steinbruch, (TEK),” (Brodnig & Mayer-Schönberger 2000, p.
2005, p. 5). A follow up of this study revealed that 3). “TEK is in essence a geographical information
in Ssome communities, and without outside/donor system derived from and embedded in the close
influence, maps were also used at meetings to dis- relationship of local people with their land and
cuss issues with a spatial relevance,”(Kienberger natural resources...with members of the community
& Steinbruch, 2005, p. 5). serving as repositories for different types and cat-
Generally speaking, participatory approaches egories of data…” (Brodnig & Mayer-Schönberger,
can help build local capacity to understand and 2000). Raghavan et al. (2006) propose that FOS
implement GIT to improve disaster management GIT could help local practitioners incorporate
capabilities. Using a participatory approach local spatial knowledge, and thus integrate TEK with
knowledge about natural hazards – their loca- technology use. Such an approach could prove
tion and frequency – can be ascertained from valuable for natural disaster management, since
community members and represented spatially local knowledge of environmental hazards and
within a GIS. For example, using a GPS coor- traditional coping strategies could be very useful.
dinates of the location of previous floods can be
captured, along with elevation data. Within the
GIS the flood point locations can be analyzed in CONCLUSION
conjunction with elevation data (e.g., a DEM or
contour lines), and potential flood scenarios can be As the human population continues to grow, and
considered and prepared for accordingly. Comb- considering recent evidence of climate change that
ing locally derived hazard knowledge with other might exacerbate meteorological related natural
spatial data such as transportation infrastructure, hazards in particular, there is reason for concern
critical facilities and towns/villages within a GIS that natural disasters may occur in the future with
environment will help the local population to be increasing frequency and consequence. As a result,
more prepared for and help mitigate potential there is an immediate need to utilize available
disaster impacts. Ultimately, the goal is to reduce technology to reduce natural hazard vulnerability
disaster vulnerability, and this can be achieved with and in general to be more prepared to effectively
a better spatial understanding of the relationship respond to disasters when they occur.
between natural hazards and the elements at risk This article has reviewed and examined the
(e.g., people, infrastructure, etc.) (Montoya & use of geospatial information technology (GIT) in
Masser, 2005). the field of natural disaster management, with an
emphasis on developing countries where natural

206
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

hazard vulnerability is high and disaster impacts financial resources is currently a very significant
can be particularly devastating. The ability of GIT GIT implementation barrier. Recent growth and
to acquire, interpret, analyze, map and disseminate development in the FOSS-based GIT domain has
information, are essential during all phases of the resulted in the emergence of many mature, very
natural disaster management cycle – the process capable and user-friendly software products that
of mitigation, preparation, response and recovery. in some cases offer functionality that is compa-
Since disasters are spatial phenomenon there is a rable with commercial GIT software. As a result,
strong relationship between disaster management we propose that FOSS-based GIT products can
requirements and the spatial information and be used to improve the use of GIT in developing
decision support capabilities offered by GIT. In countries, especially at the local level. Specifi-
this context, GIT includes geographic informa- cally, FOSS-based GIT provide opportunities to
tion systems, remote sensing, global positioning improve local GIT knowledge and related skill
systems and Internet GIS. Among numerous other sets required for their effective application in the
tasks, GIT provides a basis for planning mitigation field of natural disaster management. Additionally,
strategies, hazard and risk assessment/mapping, the current capabilities of FOSS-based GIT allow
vulnerability assessment, vehicle dispatch and for the development of spatial data infrastructures
supply routing, damage assessment, and response (SDI) that are required by the disaster manage-
resource mobilization. Consequently, by using ment practitioner community to successfully
GIT, it is possible to identify and mitigate risk, be implement GIT, for purposes such as hazard and
better prepared for and respond to disasters, and risk assessment. Considering the lack of available
recover from disasters. However, we have also spatial data in developing countries, which is a
identified and described a number of GIT imple- GIT implementation barrier, the development of
mentation barriers that are especially relevant SDIs will represent a major step forward. How-
to the application of GIT at local administrative ever, future research is required that identifies
levels, where a strong disaster management ini- specific FOSS-based GIT and functionality that
tiative is required. These barriers include, but are can be effectively implemented at the local level
not limited to: (1) a lack of financial resources, in developing countries to improve overall natural
(2) a lack of spatial data, (3) political/institutional disaster management capacity.
instability and (4) a lack of local GIT knowledge/
expertise. Thus, until at least some of these bar-
riers are overcome the level of ability for GIT to ACKNOWLEDGMENT
improve overall disaster management capacity at
the local level will remain low. During the course of this research, the authors
Many researchers have highlighted the par- appreciate the discussions with Mr. Chandeep
ticular opportunity that free and open source Corea, head of the GIS department of the Sri Lanka
software (FOSS) now provides for developing Wildlife conservation Society (SLWCS). The
countries that were previously unavailable. At- critical input of Dr. Barry Wellar was essential in
tractive characteristics of FOSS from a develop- developing this paper. The authors also acknowl-
ing country perspective include: cost, freedom, edge financial support to Sam Herold in the form
accessibility, customizability, compatibility, and of a postgraduate scholarship from the Natural
software/technical capacity development oppor- Sciences and Engineering Research Council of
tunities. The cost aspect is especially significant Canada. Further financial support was provided by
from a GIT perspective, as proprietary/commer- the Canadian Foundation for Innovation, Ontario
cial GIT software are expensive and a lack of Innovation Trust and the University of Ottawa.

207
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

REFERENCES Britton, J. M. R. (2000). GIS Capacity Build-


ing in the Pacific Island Countries: Facing the
Abbot, J., Chambers, R., Dunn, C., Harris, T., realities of technology, resources, geography and
de Merode, E., & Porter, G. (1998). Participa- cultural difference. Cartographica, 37(4), 7–19.
tory GIS: opportunity or oxymoron? PLA Notes doi:10.3138/B267-8663-4071-2375
International Institute for Environment and De-
velopment, 33, 24–34. Brodnig, G., & Mayer-Schönberger, V. (2000).
Bridging the gap: The role of spatial informa-
Alexander, D. E. (1995). A survey of the field of tion technologies in the integration of traditional
natural hazards and disaster studies. In Carrara, A., environmental knowledge and western science.
& Guzzetti, F. (Eds.), Geographical information Electronic Journal on Information Systems in
systems in assessing natural hazards (pp. 1–19). Developing Countries, 1(1), 1–15.
Amsterdam, Netherlands: Kluwer Academic.
Bui, T., Cho, S., Sankaran, S., & Sovereign,
Amdahl, G. (2001). Disaster response: GIS for M. (2000). A framework for designing a
public safety. Redlands, CA: ESRI. global information network for multinational
Amendola, A., Linnerooth-Bayer, J., Okada, N., humanitarian assistance/disaster relief. In-
& Shi, P. (2008). Towards integrated disaster risk formation Systems Frontiers, 1(4), 427–442.
management: case studies and trends in Asia. doi:10.1023/A:1010074210709
Natural Hazards, 44(2), 163–168. doi:10.1007/ Burton, I., Kates, R. W., & White, G. F. (1993).
s11069-007-9152-z The environment as hazard (2nd ed.). New York,
Anderson, G., & Moreno-Sanchez, R.., (2003). NY: Guilford Press.
Building web-based spatial information solu- Caldeweyher, D., Zhang, J., & Pham, B. (2006).
tions around open specifications and open source OpenCIS—Open Source GIS-based web commu-
software. Transactions in GIS, 7 (4), 447–466. nity information system. International Journal
Becking, I. (2004, September 6-10). How space of Geographical Information Systems, 20(8),
can better support emergency management and 885–898. doi:10.1080/13658810600711378
critical infrastructure protection. In Proceedings of Câmara, G., & Onsrud, H. (2004). Open-source
the Envisat & ERS Symposium, Salzburg, Austria. geographic information systems software: Myths
Brewer, E., Demmer, M., Du, B., Ho, M., Kam, M., and realities. In Proceedings of the International
& Nedevschi, S. (2005). The case for technology Symposium on Open Access and the Public Do-
in developing regions. IEEE Computer Society, main in Digital Data and Information for Science
38(6), 25–38. doi:10.1109/MC.2005.204 (pp. 127-133).

Briceño, S. (2004). Building disaster-resilient Chen, P., Liew, S. C., & Kwoh, L. K. (2005, July
communities: The road to the second World 25-29). Tsunami damage assessment using high
Conference on Disaster Reduction, Kobe, Hyogo, resolution satellite imagery: A case study of Aceh,
Japan. Natural Resources Forum, 28, 234–236. Indonesia. In Proceedings of the IEEE Interna-
doi:10.1111/j.1477-8947.2004.00093.x tional Symposium on Geoscience and Remote
Sensing (Vol. 2, pp. 1405-1408).

208
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

Cook, I., & Horobin, G. (2006). Implementing De La Ville, N., Diaz, A. C., & Ramirez, D.
eGovernment without promoting dependence: (2002). Remote sensing and GIS technologies
Open source software in developing countries as tools to support sustainable management of
in Southeast Asia. Public Administration and areas devastated by landslides. Environment,
Development, 26, 279–289. doi:10.1002/pad.403 Development and Sustainability, 4, 221–229.
doi:10.1023/A:1020835932757
Coppock, T. J. (1995). GIS and natural hazards: an
overview from a GIS perspective. In Carrara, A., De Paratesi, S. R. (1989). Hazards and disasters:
& Guzzetti, F. (Eds.), Geographical information concepts and challenges. In Barrett, E. C., &
systems in assessing natural hazards (pp. 21–34). Brown, K. A. (Eds.), Remote sensing for hazard
Amsterdam, The Netherlands: Kluwer Academic. monitoring and disaster assessment: marine and
coastal applications in the Mediterranean region
Cova, T. J. (1999). GIS in emergency management.
(pp. 1–17). Philadelphia, PA: Gordon and Breach.
In Longley, M. A., Goodchild, M. F., Maguire, D.
J., & Rhind, D. W. (Eds.), Geographical informa- Dewan, A. M., Islam, M. M., Kumamoto, T., &
tion systems: Principles, techniques, applications, Nishigaki, M. (2007). Evaluating flood hazard
and management (pp. 845–858). New York, NY: for land-use planning in greater Dhaka of Ban-
John Wiley & Sons. gladesh using remote sensing and GIS techniques.
Water Resources Management, 21, 1601–1612.
Currion, P., De Silva, C., & Van De Walle, B.
doi:10.1007/s11269-006-9116-1
(2007). Open source software for disaster man-
agement. Communications of the ACM, 50(3), Dewan, A. M., Kabir, M. D. H., Islam, M. M., Ku-
61–65. doi:10.1145/1226736.1226768 mamoto, T., & Nishigaki, M. (2007). Delineating
flood risk areas in Greater Dhaka of Bangladesh
Cutter, S. L. (1996). Vulnerability to environ-
using geoinformatics. Georisk, 1(4), 190–201.
mental hazards. Progress in Human Geography,
20, 529–539. doi:10.1177/030913259602000407 Dong, P. (2005). Development of a GIS/GPS-
based emergency response system. Geomatica,
Cutter, S. L. (2003). GI science, disasters, and
59(4), 427–433.
emergency management. Transactions in GIS,
7(4), 439–445. doi:10.1111/1467-9671.00157 El-Masri, S., & Tipple, G. (2002). Natural disaster,
mitigation and sustainability: The case of devel-
Cutter, S. L., Boruff, B. J., & Shirley, W. L.
oping countries. International Planning Studies,
(2003). Social vulnerability to environmental haz-
7(2), 157–175. doi:10.1080/13563470220132236
ards. Social Science Quarterly, 84(2), 242–261.
doi:10.1111/1540-6237.8402002 Engler, N. J., & Hall, G. B. (2007). The Internet,
spatial data globalization and data use: The case
Cutter, S. L., Mitchell, J. T., & Scott, M. S. (2000).
of Tibet. The Information Society, 23, 345–359.
Revealing the vulnerability of people and places: A
doi:10.1080/01972240701572822
case study of Georgetown County, South Carolina.
Annals of the Association of American Geogra- Environmental System Research Institute (ESRI).
phers. Association of American Geographers, 90, (1993). Understanding GIS: The ARC/INFO
713–737. doi:10.1111/0004-5608.00219 Method. Redlands, CA: Author.

209
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

Environmental Systems Research Institute (ESRI). Hoe, N. S. (2006). Breaking barriers: The potential
(2006). GIS and emergency management in Indian of free and open source software for sustainable
Ocean earthquake/tsunami disaster. Retrieved human development. A Compilation of Case
from http://www.esri.com/library/whitepapers/ Studies from Across the World. Retrieved from
pdfs/gis-and-emergency-mgmt.pdf http://www.apdip.net/publications/ict4d/Break-
ingBarriers.pdf
Fitzgerald, B. (2004). A critical look at open source.
IEEE Computer, 37(7), 92–94. doi:10.1109/ Hussain, M., Arsalan, M. H., Siddiqi, K., Nas-
MC.2004.38 eem, B., & Rabab, U. (2005, June 9-11). Emerg-
ing geo-information technologies for natural
Freeman, P. K., & Pflug, G. C. (2003). Infrastruc-
disaster management in Pakistan: An overview.
ture in developing and transitional countries: Risk
In Proceedings of the 2nd International Confer-
and protection. Risk Analysis, 23(3), 601–609.
ence on Recent Advances in Space Technolo-
doi:10.1111/1539-6924.00340
gies. Retrieved from http://ieeexplore.ieee.org/
Goodchild, M. F. (2006). GIS and disasters: Plan- iel5/10107/32390/01512618.pdf?tp=&arnumbe
ning for catastrophe. Computers, Environment and r=1512618&isnumber=32390
Urban Systems, 3(30), 227–229. doi:10.1016/j.
International Federation of Red Cross and Red
compenvurbsys.2005.10.004
Crescent Societies (IFRC). (2001). World disasters
Guinau, M., Pallàs, R., & Vilaplana, J. M. (2005). report. Geneva, Switzerland: Author.
A feasible methodology for landslide susceptibility
International Panel on Climate Change (IPCC).
assessment in developing countries: A case-study
(2007). Climate change 2007: The Physical Sci-
of NW Nicaragua after Hurricane Mitch. Engi-
ence Basis, Contribution of Working Group I to the
neering Geology, 80, 316–327. doi:10.1016/j.
Fourth Assessment Report of the IPCC. Retrieved
enggeo.2005.07.001
from http://www.ipcc.ch/ipccreports/ar4-wg1.htm
Gunes, A. E., & Koval, J. P. (2000). Using
Jayaraman, V., Chandrasekhar, M. G., & Rao, U.
GIS in emergency management operations.
R. (1997). Managing the natural disasters from
Journal of Urban Planning and Development,
space technology inputs. Acta Astronautica, 40(2),
126(3), 136–149. doi:10.1061/(ASCE)0733-
291–325. doi:10.1016/S0094-5765(97)00101-X
9488(2000)126:3(136)
Johnson, R. (2000). GIS technology for disaster
Heeks, R. (2002). Information systems and de-
and emergency management. Redlands, CA:
veloping countries: Failure, success, and local
ESRI.
improvisations. The Information Society, 18,
101–112. doi:10.1080/01972240290075039 Karimi, H. A., & Blais, J. A. R. (1996). Cur-
rent and future directions in GISs. Computers,
Henderson, L. J. (2004). Emergency and disaster:
Environment and Urban Systems, 20(2), 85–97.
Pervasive risk and public bureaucracy in developing
doi:10.1016/S0198-9715(96)00002-6
nations. Public Organization Review, 4(2), 103–
119. doi:10.1023/B:PORJ.0000031624.46153.b2 Kerle, N., & Oppenheimer, C. (2002). Satel-
lite remote sensing as a tool in Lahar disaster
Hewitt, K. (1995). Excluded perspectives in the
management. Disasters, 26(2), 140–160.
social construction of disaster. International
doi:10.1111/1467-7717.00197
Journal of Mass Emergencies and Disasters,
13(3), 317–339.

210
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

Kienberger, S., & Steinbruch, F. (2005, September Magsud, M., Sann Oo, K., & Rajapaksha, J. (2005,
7-10). P-GIS and disaster risk management: As- November 7-11). Tsunami disaster damage detec-
sessing vulnerability with P-GIS methods – Ex- tion and assessment using high resolution satellite
perience from Buzi, Mozambiqu. In Proceedings data, GIS and GPS – Case study in Sri Lanka. In
of the International Conference on Participatory Proceedings of the Asian Conference on Remote
Spatial Information Management and Commu- Sensing, Hanoi, Vietnam.
nication: Mapping for Change, Kenya, Nairobi.
Mansor, S., Shariah, M. A., Billa, L., Se-
Kondratyev, K. Y., Grigoryev, A. A., & Varotsos, tiawan, I., & Jabar, F. (2004). Spatial technol-
C. A. (2002). Environmental disasters: Anthro- ogy for natural risk management. Disaster
pogenic and natural. UK: Praxis. Prevention and Management, 13(5), 364–373.
doi:10.1108/09653560410568480
Kraak, M.-J. (2004). The role of the map in a Web-
GIS environment. Journal of Geographical Sys- Mansourian, A., Rajabifard, A., Valadan Zoej, M.
tems, 6, 83–93. doi:10.1007/s10109-004-0127-2 J., & Williamson, I. (2004). Facilitating disaster
management using SDI. Journal of Geospatial
Kumar, V., Bugacov, A., Coutinho, M., & Neches,
Engineering, 6(1), 29–44.
R. (1999, November 5-6). Integrating geographic
information systems, spatial digital libraries and McEntire, D. A. (1999). Issues in disaster relief:
information spaces for conducting humanitarian progress, perpetual problems and prospective
assistance and disaster relief operations in urban solutions. Disaster Prevention and Management,
environments. In Proceedings of the 7th ACM 8(5), 351–361. doi:10.1108/09653569910298279
Symposium on Advances in Geographic Informa-
McEntire, D. A. (2001). Triggering agents, vulner-
tion Systems, Kansas City, MO.
abilities and disaster reduction: towards a holistic
Laben, C. (2002). Integration of remote sensing paradigm. Disaster Prevention and Management,
data and geographic information system technol- 10(3), 189–196. doi:10.1108/09653560110395359
ogy for emergency managers and their applications
Mercer, J., Dominey-Howes, D., Kelman, I.,
at the Pacific Disaster Center. Optical Engineer-
& Lloyd, K. (2007). The potential for combing
ing (Redondo Beach, Calif.), 41(9), 2129–2136.
indigenous and western knowledge in reducing
doi:10.1117/1.1501137
vulnerability to environmental hazards in small
Lay, T., Kanamori, H., Ammon, C. J., Nettles, island developing states. Environmental Hazards,
M., Ward, S. N., & Aster, R. C. (2005). The Great 7, 245–256. doi:10.1016/j.envhaz.2006.11.001
Sumatra-Andaman Earthquake of 26 December
Mohamed, M., & Plante, R. (2002). Remote
2004. Science, 308, 1127–1133. doi:10.1126/
sensing and Geographic Information Systems
science.1112250
(GIS) for developing countries. In Proceed-
Levy, J.K. (2005). Advances in decision support ings of the IEEE International Geoscience and
systems for flood disaster management: challenges Remote Sensing Symposium (Vol. 4, pp. 2285-
and opportunities. International Journal of Water 2287). Retrieved from http://ieeexplore.ieee.org/
Resources Development, 21(4), 593-612. iel5/7969/22039/01026520.pdf
Montoya, L. (2002, April 25-27). GIS and remote
sensing in urban disaster management. In Proceed-
ings of the 5th AGILE Conference on Geographic
Information Science, Balearic Islands, Spain.

211
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

Montoya, L., & Masser, I. (2005). Management Pande, R. K. (2006). Participation in practice and
of natural hazard risk in Cartago, Costa Rica. disaster management: experience of Uttaranchal
Habitat International, 29, 493–509. doi:10.1016/j. (India). Disaster Prevention and Management,
habitatint.2004.04.003 15(3), 425–428. doi:10.1108/09653560610669909
Morrow, B. H. (1999). Identifying and mapping Peng, Z. R., & Tsou, M. H. (2003). Internet GIS.
community vulnerability. Disasters, 23(1), 1–18. Hoboken, NJ: John Wiley & Sons.
doi:10.1111/1467-7717.00102
Plewe, B. (1997). GIS Online: Information Re-
Murgia, J., Amemiya, N., & Turkstra, J. (2002, trieval, Mapping, and the Internet. Albany, NY:
May 15-18). Local spatial data infrastructure, OnWord Press.
Trujillo, Peru. In Proceedings of the 6th Seminar
Quarantelli, E. L. (1998). What is disaster?
on GIS and Developing Countries. Retrieved from
Perspectives on the question. New York, NY:
http://www.gisdevelopment.net
Routledge.
Musinguzi, M., Bax, G., & Tickodri-Togboa, S.
Rabindra, O., Tanaka, S., & Tokioka, T. (2008).
S. (2004, June 7-9). Opportunities and challenges
Flood hazard mapping in developing coun-
for SDI development in developing countries – A
tries: problems and perspectives. Disaster
case study of Uganda. In Proceedings of the 12th
Prevention and Management, 17(1), 104–113.
International Conference on Geoinformatics –
doi:10.1108/09653560810855919
Geospatial Information Research: Bridging the
Pacific and Atlantic, Gävle, Sweden. Radke, J., Cova, T., Sheridan, M. F., Troy, A., Lan,
M., & Johnson, R. (2000). Application challenges
Nebert, D. (2004). Developing spatial data in-
for geographic information science: Implications
frastructures: The SDI cookbook v.2.0. Global
for research, education, and policy for emergency
spatial data infrastructure. Retrieved July 2005,
preparedness and response. URISA Journal,
from http://www.gsdi.org
12(2), 15–30.
Nonogaki, S., Van Anh, T., Masumoto, S.,
Raghavan, V., Masumoto, S., & Hastings, D.
Raghavan, V., Nemoto, T., & Mori, T. (2004).
(2006, November 9-11). Free and open source
Development of Vietnamese Version of GRASS
software for geoinformatics: Present status and
GIS. Geoinformatics, 15, 106–107.
future prospects. In Proceedings of the Interna-
Oloruntoba, R. (2005). A wave of destruction and tional Symposium on Geoinformatics for Spatial
the waves of relief: issues, challenges and strate- Infrastructure Development in Earth and Allied
gies. Disaster Prevention and Management, 14(4), Sciences, Ho Chi Minh, Vietnam. Retrieved from
506–521. doi:10.1108/09653560510618348 http://wgrass.media.osaka-cu.ac.jp/gisideas06/
viewpaper.php?id=166
OSGeo. (2008). About the Open Source Geospatial
Foundation. Retrieved from http://www.osgeo. Rajani, N., Rekola, J., & Mielonen, T. (2003).
org/content/foundation/about.html Free as in education: Significance of Free/Libre
Open Source Software for developing countries.
Osti, R., Tanaka, S., & Tokioka, T. (2008).
Retrieved from http://www.itu.int/wsis/docs/
Flood hazard mapping in developing coun-
background/themes/access/free_as_in_educa-
tries: problems and perspectives. Disaster
tion_niranjan.pdf
Prevention and Management, 17(1), 104–113.
doi:10.1108/09653560810855919

212
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

Ramasubramanian, L. (1999). GIS imple- Rivereau, J. C. (1995). Spot data applied to di-
mentation in developing countries: Learn- saster prevention and damage assessment. Acta
ing from organisational theory and reflective Astronautica, 35(7), 467–470. doi:10.1016/0094-
practice. Transactions in GIS, 3(4), 359–380. 5765(94)00281-P
doi:10.1111/1467-9671.00028
Roets, R., Minnaar, M., & Wright, K. (2007). Open
Rambaldi, G., Kwaku Kyem, P. A., McCall, Source: Towards successful systems development
M., & Weiner, D. (2006). Participatory spatial projects in developing countries. In Proceedings
information management and communication in of the 9th International Conference on Social Im-
developing countries. The Electronic Journal on plications of Computers in Developing Countries,
Information Systems in Developing Countries, São Paulo, Brazil. Retrieved from http://www.
25(1), 1–9. ifipwg94.org.br/fullpapers/R0061-1.pdf
Ramli, M. F., Sulaiman, W. N. A., Yusoff, M. K., Rüdenauer, H., & Schmitz, M. (2005, March
& Low, Y. Y. (2005). Open source geographical 14-16). Use of Landsat 7 ETM+ data as basic
resources analysis support system (GRASS) information for infrastructure planning. In Pro-
for landslide hazard assessment. Disaster ceedings of the ISPRS Joint Conference of the
Prevention and Management, 11(4), 522–532. 3rd International Symposium on Remote Sensing
doi:10.1108/09653560510618357 and Data Fusion Over Urban Areas, and the 5th
International Symposium on Remote Sensing of
Ramsey, P. (2007). The state of open source GIS.
Urban Areas, Tempe, AZ. Retrieved from http://
In Proceedings of the Global Conference on
www.isprs.org/commission8/workshop_urban/
Free and Open Source Software for Geospatial,
ruedenauer.pdf
Vancouver, BC, Canada. Retrieved from http://
www.refractions.net/expertise/whitepapers/open- San Miguel-Ayanz, J., Vogt, J., De Roo, A., &
sourcesurvey/ survey-open-source-2007-12.pdf Schmuck, G. (2000). Natural hazards monitoring:
Forest fires, droughts and floods – the examples
Rautela, P., & Pande, R. K. (2005). Implications of
of European Pilot Projects. Surveys in Geophys-
ignoring the old disaster management plans: Les-
ics, 21, 191–305. doi:10.1023/A:1006750412500
sons learnt from the Amparav tragety of 23 Septem-
ber 2004 in the Nainital district of Uttaranchal (In- Sanyal, J., & Lu, X. X. (2003, October 13-15).
dia). Disaster Prevention and Management, 14(3), Application of GIS in flood hazard mapping: a
388–394. doi:10.1108/09653560510605045 case study of Gangetic West Bengal, India. In
Proceedings of the Map Asia Conference, Kuala
Rego, A. J. (2001, March 21-23). National disaster
Lumpur, Malaysia. Retrieved from http://www.
management information systems & networks:
gisdevelopment.net/application/natural_hazards/
An Asian overview. In Proceedings of the Global
floods/pdf/ma03138.pdf
Disaster Information Network Conference, Can-
berra, Australia. Retrieved from http://www.adpc. Schutzberg, A. (2005). MapServer Community,
net/infores/adpc-documents/paperatgdin01.pdf Autodesk Announce MapServer Foundation. Di-
rections Magazine. Retrieved from http://www.
Renyi, L., & Nan, L. (2002). Flood area and dam-
directionsmag.com/article.php?article_id=2037
age estimation in Zhejiang, China. Journal of En-
vironmental Management, 66, 1–8. doi:10.1006/ Showalter, P. S. (1999). Remote sensing’s
jema.2002.0544 use in disaster research: A review. Disaster
Prevention and Management, 10(1), 21–29.
doi:10.1108/09653560110381796

213
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

Shrestha, S. (1994). Environmental issues in Thomson, C. N., & Hardin, P. (2000). Remote
Asia and the pacific - a perspective towards the sensing/GIS integration to identify potential
information in decision making. In Proceed- low-income housing sites. Cities (London,
ings of the International Conference on Expert England), 17(2), 97–109. doi:10.1016/S0264-
Systems for Development, Bangkok, Thailand. 2751(00)00005-6
Retrieved from http://ieeexplore.ieee.org/
United Nations. (UN). (2006). The millennium
iel2/973/7458/00302314.pdf?tp=&isnumber=&
development goals report. Retrieved from http://
arnumber=302314
mdgs.un.org/unsd/mdg/Resources/Static/Prod-
Simonovic, S. P. (2002). Role of remote sensing ucts/Progress2006/MDGReport2006.pdf
in disaster management (Paper Series No. 21).
Van Westen, C. J., & Hofstee, P. (2001). The role
London, ON, Canada: University of Western On-
of remote sensing and GIS in risk mapping and
tario, Institute for Catastrophic Loss Reduction.
damage assessment for disasters in urban areas.
Simpson, D. M., & Human, R. J. (2008). Large- Enschede, The Netherlands: International Institute
scale vulnerability assessments for natural hazards. for Aerospace Survey and Earth Sciences (ITC).
Natural Hazards, 47(2), 143–155. doi:10.1007/ Retrieved from http://www.dkkv.org/forum2001/
s11069-007-9202-6 Datei61.pdf
Sliuzas, R. (1999, March 11-12). Research is- Venkatachary, K.V., Manikiam, B. and Srivas-
sues for the adoption of geographic information tava, S.K. 2001. Harnessing information and
technology for urban planning and management technology for disaster management. Retrieved
in developing countries. In Proceedings of the from http://www.aiaa.org/indiaus2004/Disaster-
International Workshop on Concepts and Para- management.pdf
digms of Urban Management in the Context of
Weber, S. (2004). The success of open source.
Developing Countries, Venice, Italy. Retrieved
Cambridge, MA: Harvard University Press.
from http://www.naerus.net/sat/workshops/1999/
paper_sliuzas.html Weerawarana, S., & Weeratunge, J. (2004). Open
source in developing countries. Stockholm,
Smith, K. (2004). Environmental hazards: As-
Swedish International Development Cooperation
sessing risk and reducing disaster (4th ed.). New
(Sida) Agency. Retrieved from http://www.sida.
York, NY: Routledge.
se/publications
Space Agencies. (2005). International charter on
Weichselgartner, J. (2001). Disaster mitigation:
space and major disasters. Retrieved from http://
The concept of vulnerability revisited. Disaster
www.disasterscharter.org/main_e.html
Prevention and Management, 10(2), 85–94.
Steiniger, S., & Bocher, E. (2009). An overview doi:10.1108/09653560110388609
on current free and open source desktop GIS de-
Wheeler, D. A. (2007). Why Open Source Soft-
velopments. International Journal of Geographi-
ware/Free Software (OSS/FS, FLOSS, or FOSS)?
cal Information Science, 23(10), 1345–1370.
Look at the Numbers! Retrieved from http://www.
doi:10.1080/13658810802634956
dwheeler.com/oss_fs_why.html

214
A Review of Geospatial Information Technology for Natural Disaster Management in Developing Countries

Wu, M., & Lin, Y. (2001). Open source software Zerger, A., & Smith, D. I. (2003). Impediments to
development: An overview. IEEE Computer, using GIS for real-time disaster decision support.
33-38. Computers, Environment and Urban Systems, 27,
123–141. doi:10.1016/S0198-9715(01)00021-7

This work was previously published in the International Journal of Applied Geospatial Research (IJAGR), Volume 3, Issue 2,
edited by Donald Patrick Albert, pp. 24-62, copyright 2012 by IGI Publishing (an imprint of IGI Global).

215
216

Chapter 15
Virtual Environments for
Geospatial Applications
Magesh Chandramouli
Purdue University, USA

Bo Huang
Chinese University of Hong Kong, China

ABSTRACT
This article explores the application of virtual environments to 3D geospatial visualization and explora-
tion. VR worlds provide powerful functionalities for model generation and animation and are indeed a
valuable tool for geospatial visualization. Subsequently, related issues such as the constraints in pro-
gressive terrain rendering, geographic data modeling, photo-realism in virtual worlds, and the system
performance with relatively larger files are discussed. Nevertheless, to accomplish the desired results
and to attain a higher level of functionality, a good level of experience in VR programming and the ju-
risprudence to choose the appropriate tool are necessary. Although a standalone VR application is not
capable of a higher level of interaction, using the SCRIPT nodes and the External Authoring Interface
additional functionalities can be integrated. Intended for use over the internet with a VR browser, such
virtual environments serve not only as a visualization tool, but also a powerful medium for geospatial
data exploration.

INTRODUCTION animation. This chapter discusses related issues


such as the constraints in progressive terrain ren-
This chapter explores the application of virtual dering, geographic data modeling, photo-realism
environments to 3D geospatial visualization, ani- in virtual worlds, and the system performance
mation, and interaction. The authors describe the with relatively larger files. VR worlds provide
design and implementation of some 3D models, powerful functionalities for model generation
which offer a good level of user-interaction and and animation and are indeed a valuable tool
for geospatial visualization. Nevertheless, to ac-
DOI: 10.4018/978-1-4666-2038-4.ch015 complish the desired results and to attain a higher

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Virtual Environments for Geospatial Applications

level of functionality, a good level of experience work, “The Ultimate Display”. Such immersive
in VR programming practices is mandatory. Even virtual environments can serve as handy tools for
though a standalone VR application is not capable scientists and researchers that handle enormous
of a higher level of interaction, using the SCRIPT data volumes. By and large, visualization enables
nodes, JavaScript can be embedded in the program large quantities of information to be presented in
to provide additional functionalities. a form that is comprehensible to a wide range of
users (Colin Ware, 2000).

GEO-VIRTUAL ENVIRONMENTS: 3D Geospatial Data Visualization:


EVOLUTION OVER THE YEARS Tools and Techniques

Since the 1960s and 70s, the past several decades Geospatial analysis and research require that the
have witnessed the ‘information revolution’, data be in the 3D form. Geospatial data is inher-
particularly in the domain of spatial information ently three dimensional in nature since every
technology, propelled by the advancements in data spatial element has its own position or location in
acquisition techniques. The evolution of diverse space (latitude, longitude, and altitude). A gamut
digital processing and image generation tech- of applications involving geospatial analysis such
niques over the decades along with the parallel de- as environmental process simulation, infrastruc-
velopments in Geographical Information Systems ture applications, landscape design, geological
GIS and remote sensing have resulted in colossal applications, etc. necessitates three-dimensional
volumes of digital spatial data. In order to make exploration and visualization. Traditionally, op-
the utmost use of this collected data, they must be erations such as town or country planning relied
presented in the form of comprehensible informa- heavily on drawings and these were eventu-
tion. Geospatial data is increasingly being used ally supplemented with Computer Aided Design
for addressing issues involving environmental (CAD) drawings. However, one major handicap
and urban planning, design, and decision-making with these forms of data is that they try to sym-
within a wide range of disciplines ranging from bolize 3D entities in 2D format. Albeit these may
urban landscape management to various other offer a bird’s eye view of the place being studied,
applications. As geospatial data complexity in- such representations depicting 3D data using two
creases, besides the standard rasters, Triangulated dimensions are incomplete and cannot replace
Irregular Networks (TINs) and Digital Elevation a 3D view. For instance, landscape and urban
Models (DEMs), which are used for data explora- modeling architecture applications of today are far
tion, additional tools such as photo-realistic models more complex and advanced tools are inevitable
are needed to provide advanced visualization and to provide the required level of sophistication.
query functionalities. Three-dimensional visual- Several techniques have been tried and imple-
ization is a proven technique for exploring geo- mented for visualizing 3D geospatial data. This
spatial data (Bonham-Carter, 1994). In the work paper delineates some of the notable tools and
on urban modeling, Shiode (2001) explains the techniques that are employed in 3D geospatial
development of 3D models and their role within data visualization and briefly elaborates the basic
the domain of spatial information database and principles underlying the generation of static and
remote sensing technology. The origins of concept dynamic virtual environments.
of spatial immersion can be dated back to 1965 A plethora of commercial software is available
when Ivan Sutherland (1965) made known the for a wide range of purposes such as terrain or
ideas of immersion in virtual space in his influential building generation, photogrammetric measure-

217
Virtual Environments for Geospatial Applications

ments, fly-through simulations, etc. Typically, to manually sift through these huge amounts of
commercial software are designed for specific data. Using visualization techniques, data can
application requirements with the objective of be presented in a much more organized manner
saving time and costs. Even though such software that facilitates understanding the information that
have been reasonably successful in accomplish- may not otherwise be apparent. By proper use of
ing desired tasks, they are not capable of being visualization tools, the same area can be viewed
extended to applications outside their intended at different scales, i.e., a small area in detail or
domain and therefore, are inherently limited. In a bird’s eye view of a larger area. In order to see
order to overcome such limitations and to attain the overall landscape of a whole country we need
greater flexibility and efficiency, some research- to view the entire country at a glance. However,
ers prefer the 3D virtual environments using a the advantage of modern visualization is that
variety of powerful programming languages. such visualizations are not mere depictions of
Even though, initially, building the 3D worlds scenes, but also interactive environments capable
from scratch might be time-consuming process, of animating the scenes, and simulating phenom-
with time and experience, using programming ena. Another kind of information that needs to be
languages to model becomes easier. Moreover, discussed in this context is associated informa-
the tremendous functionalities and flexibility of- tion. In order to understand these links among the
fered by different programming languages make various components of a system, tools that can
the final 3D virtual environments worth the ef- reveal the various concurrent processes among the
fort. Some popular languages for writing code various sub-systems are vital. Solutions to many
to generate advanced 3D environments include complex problems can be found by understanding
VRML, GeoVRML, X3D and OpenGL. VRML the relationships among the system components.
is a specification designed to create interactive Urban planning authorities and town plan-
3D worlds that can be used on the World Wide ners face several problems such as managing
Web. GeoVRML can be considered a derivative water shortages, transportation problems, urban
of VRML because it is used to create geo-specific housing and land use problems, natural and man-
applications. Of late, research involving the de- made disasters, etc. Several of these problems
velopment of 3D geospatial applications has been are mutually dependant and trying to solve them
gaining increasing significance, and the inherent in isolation will never lead to a permanent or
complexity in the efforts to visualize geographical long-lasting solution. One of the foremost steps
phenomena has necessitated the amalgamation in solving these problems is to get a bird’s eye
of the aforementioned tools with conventional view of the problem scenario as a whole, while
programming languages owing to the powerful simultaneously concentrating on the minutiae
functionalities offered by the latter. (Figure 1). This kind of visualization is of im-
mense value to town and country planners and
urban infrastructure management in understanding
APPLICATIONS OF GEO- the link among the various components. Also, the
VIRTUAL ENVIRONMENTS influences on the ambient environment as a result
of the aforementioned project can be studied by
Visualization facilitates not only presenting infor- means of the virtual settings. The visual impact
mation, but also enables seeing and understanding of new buildings and surroundings on each other
hidden information among datasets. As mentioned can be vividly seen on the screen.
in the previous section, huge volumes of data are Two main classes of models are discussed here,
available today and it is practically impossible namely static and dynamic models. The static 3D

218
Virtual Environments for Geospatial Applications

Figure 1. Geo-virtual environments for visualizing interlinked components of urban infrastructure.


Clockwise from top-left a. A Sample urban landscape, b. Land parcel, c. Road network, d. parcel with
proposed building layout

Table 1. A summary of applications of Geo-virtual Static Geo-Virtual Environments


environments
Static virtual environments are those that are
• Geospatial visualization and animation
composed of objects that lack animation capabili-
• Urban planning and Infrastructure applications
ties. Simply stated, the depiction of a 3D model
• Environmental process simulations
of a building is a static representation, while 3D
• Resource management and conservation
illustrations of phenomena such as debris flow
• Hosting cyber cities online
or volcanic activity involves dynamic represen-
• Imparting training and demonstration purposes
tation. Researchers use different approaches to
• Simulating applications such as mining, seismic processes,
etc.
model or create the 3D virtual worlds and depict
the constituents of those worlds. One very effi-
cient method of describing real-world scenarios
is the use of a hierarchical scene tree structure.
models are used for planning and design pur-
Especially, several 3D modeling languages (X3D,
poses, while the dynamic models are used for
VRML, and GeoVRML) model real-world ob-
simulation and training purposes. Such immersive
jects as shapes with geometry and appearance.
models are of immense value in the planning and
All features such as buildings, roads, trees, and
decision-making processes involving terrain and
rivers can be designed and modeled as shapes
building databases. Such models serve as valuable
which can be grouped together and transformed
tools in solving the previously mentioned urban
(translated or rotated).
infrastructure problems. The emphasis here is not
Objects in the real-world have positional
only on the utilization of virtual worlds for visu-
attributes and these are represented in the form
alization but also for simulation and animation.
of the x, y, and z coordinate points within the
Three-dimensional virtual environments are being
virtual world. The topological relationships
increasingly used for disaster mitigation and
among the constituent elements of a scene can
managing events such as debris flow, volcanic
be represented and the virtual representations can
eruption, seismic activities, etc.
be built to scale. The virtual world scene is built
within a virtual coordinate system, in which the
x and y axes are along the length and breadth of

219
Virtual Environments for Geospatial Applications

the computer monitor and the z axes extends in Simple objects such as cuboids might be composed
a direction perpendicular to the monitor. Hence, of just 6 faces, while complex objects might ne-
theoretically, an object that is at a distance of 10 cessitate a greater number of faces.
units along positive z-axis is in front of an object Photo-realistic environments can be built by
at 5 units along positive z-axis. Also, the notion the judicious use of textures and by scaling them
of the parent-child relationship (Figure 2) implies to accurately match the faces. A vast number of
that smaller objects can be grouped to form larger environmental and landscape applications gen-
ones or objects higher in the hierarchy and these erated using Virtual Reality (VR) or VRML are
are, in turn, grouped to form objects still higher available on the Web. Several researchers have
in the hierarchy. generated virtual environments for geospatial ap-
Typically, various modeling approaches use plications (e.g., Huang et al., 1999; Chandramouli
the notion of point, line, and polygon, as in GIS, et al., 2004), resource management (e.g., ref:
to build virtual worlds. All constituent elements SGILICGF, UW), 3D urban applications (e.g.,
of a 3D scene can be built using fundamental Shiode, 2001), etc. For some applications a mere
entities such as point, line, and polygon or faces. representative environment depicting the relative

Figure 2. a) Scene tree structure; b) Parent-child relationship

Figure 3. a) A point in 3D space; b) Vertices of a polygon; c) Polygon with 4 sides

220
Virtual Environments for Geospatial Applications

positioning of various elements would suffice, but of ‘reusable software objects’. For instance, a
others might necessitate photo-realistic rendering lamp object or a tree object that has been created
of the 3D scene. Figure 4a (left) shows a simple 3D for a particular project can be reused in another
environment showing a general layout of houses application. Just like using a math function from
within a community without much photo-realism. a C library, objects modeled within virtual envi-
On the other hand, Figure 4b (right) shows an ronments (Figure 5) as part of one project can be
advanced infrastructure application wherein the referenced and included as part of another ap-
finished product, or final environment, is required plication, thereby saving considerable program-
for viewing by planners in advance and hence the ming time and effort.
virtual world is depicted with a higher degree of
photo-realism. Dynamic Virtual Environments
One advantage of the hierarchical mode of
representation, in which the scene elements are As mentioned in the previous section, dynamic
modeled as individual objects that are grouped virtual representations refer to those 3D worlds in
using parent-child relationships, is that objects which the objects are incorporated with animation
can be built once and used any number of times capabilities. Quite frequently, research involving
either in the same virtual scene or in other vir- the study of real-world geospatial processes and
tual environments. This is known as the concept phenomena, for instance seismic activity or hydro-

Figure 4. a) Community planning with less photo-realism; b) Sample urban landscape with higher
degree of photorealism

Figure 5. Reusable software objects – can be generated once and used multiple times

221
Virtual Environments for Geospatial Applications

logical processes, necessitates that the virtual envi- sense user actions or events (Figure 6). In case of
ronments possess dynamic animation capabilities. simple actions such as the movement of an object
The inherent nature of such activities makes on upon impact, there is a ‘trigger’ or a stimulation,
site evaluation of such phenomena an extremely upon which something acts. In programming, the
hazardous task. Moreover, such disastrous events ‘triggering action’ is referred to as an event. Upon
might be triggered by a variety of factors, both the occurrence of a particular action, an object
known and unknown, and hence, predicting their or group of objects (node) behaves accordingly.
occurrence accurately still remains a challenging Events may not always be explicit or need not
task. Hence, 3D geospatial environments offer an only be ‘physical movement’. In programming
efficient means of modeling such phenomena and context, even the reaching of an instant of time,
studying them at the comfort of an office desktop. say 11:59 pm, is considered an event. Hence, an
Virtual reality serves as an extremely potent and object or a group of objects (Parent node) can be
flexible tool for applications involving simulation programmed to ‘sense’ the particular instant of
and animation since it has functionalities capable time and behave in a particular fashion upon
of being extended to achieve desired results. reaching that moment. The word ‘sense’ has been
Modeling languages such as GeoVRML, VRML, highlighted as it is inevitable that an object or a
and X3D provide additional functionalities known group of object recognize a change of condition
as Scripts that can be programmed and executed (state). Typically, there are three kinds of sensors
to create simple animations. However, these 3D that are used in order to sense time changes, cur-
modeling languages lack advanced program- sor movement over the screen or objects, or the
ming functions to achieve curvilinear motion or field of view from a particular position within the
more sophisticated forms of animation. In order virtual world. These are respectively called time
to incorporate dynamic behavior in the virtual sensors, touch sensors, and visibility sensors in
world objects and replicate complex animation the context of virtual worlds (Figure 7). Upon
patterns, an existing programming language with reaching a particular instant of time, or when the
advanced functionalities, for instance Java or mouse cursor is pointed on a particular object, or
JavaScript, can be used in conjunction with the when the viewer is within the virtual world is at
3D modeling language. Typically, the objects in a a specific position, the objects in the scene can
parent-child relationship within a virtual scene are be programmed to behave in a desired fashion.
grouped into structures called nodes. In order to This behavior or response of objects might be a
achieve animation, these nodes must be made to

Figure 6. Flow of events within a virtual world wherein objects are grouped I the form of nodes

222
Virtual Environments for Geospatial Applications

Figure 7. Diagram illustrating the sensing of actions of events by the objects and the corresponding
movement based on the flow of events (From Chandramouli et al., 2004, 2006)

translation along the x, y, or z axis or a rotatory flow events, seismic activities and the responses
motion, or a combination of both. of structures to such events.
Based on the above principles of animation,
dynamic geo-virtual environments can be gener- Critical Issues in the Design and
ated to simulate various processes or geospatial Implementation of Geospatial Virtual
phenomena. Environmental processes and natural Environments
disasters can be studied in detail by generating
dynamic virtual worlds. By programming the This section concisely discusses some limitations
properties and behavior of the constituent objects, and constraints typically experienced in several
the 3D virtual environments can be used to study virtual world generations as well. One noteworthy
various occurrences which are not easy to be issue is that in visualizing real-world scenarios,
observed in real time, e.g., Figure 8 describes there is an inevitable trade-off amid performance

Figure 8. a) 3D virtual environment depicting geospatial processes (1 picture of a series) such as land-
scape change over time etc.; b) 3D virtual environment depicting water flow in a reservoir

223
Virtual Environments for Geospatial Applications

and resolution. Exploiting the complete capabili- from the data repository and the web server ac-
ties of virtual environments over the Web continues cesses the visualization server for 3D information.
to pose problems. As the number of objects in This framework efficiently handles requests for
a virtual environment increases, online hosting visualizing dynamic processes and based on the
becomes an issue as spontaneous rendering of client requests, the web server provides the ap-
numerous objects is no easy task. Scenes with a propriate information in the conventional HTML
greater number of polygons decelerate the system or 3D VRML format.
and make the interactivity poor. Several factors
need to be considered during visualization such
as the type and volume of data to be visualized, DISCUSSION AND CONCLUSION
memory constraints, and system performance.
Figure 9 presents a summary of the significant Over the past several decades, information presen-
issues concerning geo-virtual environments. In tation has inspired the development of several new
their work on information visualization, Robertson tools and techniques. The information revolution
et al. (1993) have presented a terse compilation has resulted in vast amounts of data that are far
of the important issues. too complex, both in quality and quantity, to be
In his work on dynamic and interactive web- handled by conventional tools and techniques. Re-
based visualizations, Huang and Lin (1999, 2001, cent technological advances in the realm of remote
and 2002) discuss in detail some of these concerns sensing have dramatically increased the amount
and also address some critical issues concerning of geospatial data available. Virtual environments
online hosting of interactive visualizations. The are an efficient means of visualizing voluminous
Java-3D based hybrid method that Huang and Lin geospatial data and are efficient in elucidating the
(1999, 2001) propose offers a standard framework intricate patterns as well as hidden and associated
for visualizing dynamic environmental processes. information. Such virtual environments facilitate
Figure 10 illustrates a 3-tier configuration that understanding of the complex relationships among
Huang and Lin (1999) proposed in GeoVR. The the various components of a multi-level scenario.
visualization server that is interlinked to the spa- This paper discussed the design and imple-
tial database accesses the geospatial information mentation of virtual worlds that can be used to

Figure 9. Table showing a summary of critical issues in designing and implementing 3D virtual worlds

224
Virtual Environments for Geospatial Applications

Figure 10. Online hosting of interactive visualization (From Huang et al.,1999)

generate both static representations depicting Boyd, D. S., Lansdown, J., & Huxor, A. (1996).
real-world settings and dynamic representa- The Design of Virtual Environments. SIMA.
tions that can simulate geospatial processes and
Chandramouli, M., Huang, B., Yin Chou, T., Kun
environmental phenomena. The paper discussed
Chung, L., & Wu, Q. (2006). Design and Imple-
the generation of such geo-virtual environments
mentation of Virtual Environments for planning
with examples and provided explanations as to
and Building Sustainable Railway Transit Systems,
how such geo-visualization applications facilitate
COMPRAIL July 2006, Prague.
understanding of various geospatial phenomena
and environmental processes. The fundamental Chandramouli, M., Lan-Kun, C., Tien-Yin, C., &
principles underlying the generation of virtual vChing-Yi, K. (2004). Design and Implementation
worlds, both static and dynamic, were elaborated of Virtual Environments for Visualizing 3D Geo-
and the common issues involved in the generation spatial Data. TGIS Conference, Oct 28-29 2004.
of such 3D virtual worlds were discussed. Further-
Colin, W. (2000). Information Visualization:
more, the issues related to the online hosting of
Perception for Design. Morgan Kaufmann Series
such virtual environments were tersely delineated
in Interactive Technologies GeoVRML, (www.
and possible solutions to frequently encountered
geovrml.org)
problems were provided.
Huang, B., Jiang, B., & Lin, H. (2001). An inte-
gration of GIS, virtual reality and the Internet for
REFERENCES spatial data exploration. International Journal of
GIS, 15(5), 439–456.
Ames, L. A., Nadeau, D. R., & Moreland, J. L.
(1996). VRML 2.0 Sourcebook. Huang, B., & Lin, H. (1999). GeoVR: A Web-
based tool for virtual reality presentation from
Bonham-Carter, G. F. (1994). Geographic Infor- 2D GIS data. Computers & Geosciences, 25(10),
mation Systems for Geoscientists: Modeling with 1167–1175. doi:10.1016/S0098-3004(99)00073-4
GIS. Pergemon: Oxford (p. 398).

225
Virtual Environments for Geospatial Applications

Huang, B., & Lin, H. (2002). A Java/CGI approach KEY TERMS AND DEFINITIONS
to developing a geographic virtual reality toolkit
on the Internet. Computers & Geosciences, 28(1), Immersion: A Sense of being present within
13–19. doi:10.1016/S0098-3004(01)00008-5 the virtual world and a ‘sense’ being able to visu-
alize objects by being amidst their surroundings
Karel, C., & Jiri, Z. (n/d). Using VRML for creat- and navigating through the world.
ing interactive demonstrations of physical models. Node: An entity within the hierarchical scene
Department of computer science and Engineering. structure that represents a group of objects.
Czech Technical University. OpenSource: Source code or computer soft-
Robertson, G., Card, S., & Mackinlay, J. D. (1993). ware that is freely offered and is available to the
Information visualization using 3d interactive public for building software applications.
animation. Communications of the ACM, 36, Scene-Hierarchy: The organization of the
57–71. doi:10.1145/255950.153577 elements of a 3D virtual scene into successive
levels, in such a way that the object under which
SGILICGF. UW Sea Grant Institute and Land other objects are grouped is called the parent and
Information and Computer Graphics Facility the grouped objects are called its children. When
http://coastal.lic.wisc.edu/visualization/Visual- a parent object is transformed, the children are
ization.htm also transformed.
Shiode, N. (2001). 3D urban models: recent SCRIPT: Program scripts that are used to
developments in the digital modeling of urban perform calculations and return values to the
environments in three-dimensions. GeoJournal, calling programs.
52(3), 263–269. doi:10.1023/A:1014276309416 Transformation: Operations such as transla-
tion, rotation, or scaling involving objects in a
Sutherland, I. E. (1965). The ultimate display. In virtual environment.
the proceedings of the IFIPS Congress, 2, 506- Virtual Reality: A three-dimensional visual
508. New York City, NY. immersive setting that facilitates user to navigate
within the scene and perform operations in real
time.

This work was previously published in Handbook of Research on Geoinformatics, edited by Hassan A. Karimi, pp. 320-331,
copyright 2009 by Information Science Reference (an imprint of IGI Global).

226
Section 2
Development and Design
Methodologies

This section provides in-depth coverage of conceptual architecture frameworks to provide the reader with a com-
prehensive understanding of the emerging developments within the field of Geographic Information Systems.
Research fundamentals imperative to the understanding of developmental processes within Geographic Informa-
tion Systems are offered. From broad examinations to specific discussions on methodology, the research found
within this section spans the discipline while offering detailed, specific discussions. From basic designs to abstract
development, these chapters serve to expand the reaches of development and design technologies within the Geo-
graphic Information Systems community. This section includes 15 contributions from researchers throughout the
world on the topic of Geographic Information Systems.
228

Chapter 16
GIS Technique for
Territorial Analysis:
Spatial MultiCriteria Decision Analysis

Francesco Riccioli
University of Florence, Italy

Toufic El Asmar
University of Florence, Italy

ABSTRACT
Territorial analysis is related to the complexity of an area where human activities and environmental
characteristics are the main development factors. Human activities have a direct (e.g. farm activities)
and indirect (e.g. urban development/sprawl) influence on a territory and its environment, which gives an
important function to such activities when strategies for territorial planning have to be defined. Several
actors are involved in the assessment of a territory, which is a complicated feature of decision making
to be achieved with the use of complex data-analysis process. The evolution of personal computer and
specific software for the analysis of spatial data is giving important contribution to “Territorial Planning
Strategies.” Spatial Decision Support System is an example of how problems occurred during a deci-
sion making process can be solved using methodologies that combine “Multicriteria Approach” and
“Geographic Information System.” One of these methodologies is represented by “Spatial Multicriteria
Decision Analysis.”

DECISION SUPPORT FOR ing the last century anthropic expansion and the
TERRITORIAL ANALYSIS consequent urban sprawl produced a great loss of
natural resources therefore “Territorial Planning
It is possible to explain the complexity of a terri- Strategies” became an important tool to attain
tory as a mosaic of interactions between human decision makers targets. Many variables are
activities and its surrounding environment. Dur- concerned in land-use changes, and many factors
(human, political, economic) are involved in the
DOI: 10.4018/978-1-4666-2038-4.ch016 urbanization growth, thus through the application

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
GIS Technique for Territorial Analysis

of a “decision support analysis” is an important and quantity of information available to decision


and complicated strategy for a correct territorial makers. In general, two types of information
planning. are used in the spatial decision-making process:
Territorial planning is based on decision: a geographical information and information related
decision is a choice between alternatives. The to decision maker’s preferences. The amount
alternatives may represent different courses of and type of information required depend on the
action, different hypotheses about the character nature of a particulate decision problem. Several
of a feature, different classifications, and so on. characteristics of spatial decision problems can be
With rapid increases in population and con- singled out. Spatial problems typically involve a
tinuing expectations of growth in the standard large number of alternatives that are evaluated on
of living, pressures on natural resource use have the basis of multiple criteria. Some of the criteria
become intense. For the resource manager, the task may be qualitative while others may be quantita-
of effective resource allocation has thus become tive. There may be more than one decision maker
especially difficult. Clear choices are few and the and they may have different preferences with
increasing use of more marginal lands puts one respect to the relative importance of the evaluation
face-to-face with a broad range of uncertainties. criteria and the decision consequences. The spatial
Add to this a very dynamic environment subject problem may involve uncertainty and imprecision
to substantial and complex impacts from human that make it difficult to predict the outcome of
intervention, and one has the ingredients for a particular decisions. Given these characteristics,
decision making process that is dominated by many real-world spatial decision problems give
uncertainty and consequent risk for the decision rise to Geographical Information System (GIS)
maker. based multicriteria decision making.
Uncertainty is inevitable in the decision mak- Decision Theory is concerned with the logic by
ing process. Uncertainty includes any known or which one arrives at a choice between alternatives.
unknown error, ambiguity or variation in both the What those alternatives are varies from problem
database and the decision rule. Thus, uncertainty to problem. They might be alternative actions,
may arise from such elements as measurement alternative hypotheses about a phenomenon, alter-
error, inherent variability, instability, conceptual native objects to include in a set, and so on. In the
ambiguity, over-abstraction, or simple ignorance context of GIS it is useful to distinguish between
of important model parameters. policy decisions and resource allocation decisions.
Managing the environment implies dealing The latter involves decisions that directly affect
with dynamic systems that are only partly under- the utilization of resources (e.g., land) while the
stood, with multiple interests and multiple actors, former is only intended to influence the decision
with long-term implications that range from the behavior of others who will in turn make resource
local to the global scale. “It has become more commitments. “Resource allocation decisions are
and more difficult to see the world around us in also prime candidates for analysis with a GIS.
a unidimensional way and to use only a single Indeed, land evaluation and allocation is one
criterion when judging what we see. We always of the most fundamental activities of resource
compare, rank, and order the objects of our choice development” (FAO, 1976).
with respect to criteria of choice. But only in a One of several GIS application tools is based
very simple, straightforward, or routine situation on territorial analysis with the integration of a
can we assume that a single criterion of choice decision support methodology that is the “Spatial
will be fully satisfactory” (Zeleny, 1982). Suc- Decision Support System” (SDSS).
cessful decision making depends on the quality

229
GIS Technique for Territorial Analysis

SDSS provides complex data elaborations are “far from maturity” and “continued changes
that are useful for planning process, in fact as in computer hardware and software technology
referred by Malczewski (2004): “An DSS aims are fundamentally altering the way that decision
to improve the effectiveness of decision making makers, stakeholders, policy makers, and analysts
by incorporating judgments and computer-based interact with computers.”
programs within the decision making process. This chapter is organised in two parts or under
The system should support a variety of decision – chapters. In the first part a general overview
making styles that may be present in a particular of Multicriteria methodology and Geographi-
problem solving process. The purpose of such cal Information System is described, whilst the
a system is to support a user in achieving better second part will consider “Spatial Multicriteria
decision.” Decision Analysis” with the presentation of a
Spatial Decision Support Systems are com- specific case study.
monly based on 4 utilities that are: data input,
data storage, data analysis and data output. All of
these utilities are focused on the support of deci- MULTICRITERIA ANALYSIS AND
sion making procedures. Evidently GIS play an GEOGRAPHICAL INFORMATION
important role in an SDSS process with the aim SYSTEM: A BRIEF INTRODUCTION
to store and elaborate quantitative and qualitative
data or information. GIS help users to resolve their Multicriteria Decision Analysis (MCDM) is a
decision making problems through a participa- methodology commonly used to support decision
tory approach in order to incorporate decision makers that have to face numerous and conflict-
making analytical techniques into it. GIS-based ing evaluations. MCDM aims at highlighting
approaches to territorial analysis are mainly rep- these conflicts and deriving a way to come to a
resented by Artificial Intelligence methods like compromise in a transparent process. MCDM is
neural network models or cellular automata and based on two processes: Multi-Objective Decision
Spatial Multicriteria Decision analysis. Previous Making (MODM) and Multi-Attribute Decision
applications.”..can help in modeling and describ- Making (MADM).
ing complex systems for inference and decision MODM studies decision-making problems
making” nevertheless “the major limitation of in which decisional space is continuous, while
these methods is their “black box” style of analyz- MADM focus on problems with discrete decisional
ing spatial problems” (Malczewski, 2004). spaces. Both MODM and MADM problems are
Spatial Multicriteria Decision analysis com- assessed by a single decision-maker or group of
bine Multicriteria Analysis and Geographical several decision-makers, that are distinguished in
Information System with the aim of converting the number of involved persons within a decision
input data (spatial and aspatial data) in output analysis process (goals, preferences and beliefs).
data (resultant decision). Conventional MCDM The decisions are classified in deterministic,
techniques have largely been aspatial in the sense probabilistic and fuzzy decisions. In the case of
that they assume a spatial homogeneity within deterministic decision process, required data and
the study area. This assumption is unrealistic in information are well known. During probabilistic
many decision situations because the evaluation analysis decisional problems made under uncer-
criteria vary across space. Consequently, there is a tainty are deal randomly. Fuzzy decision analysis
need for an explicit representation of geographical consider the inherent imprecision of information
dimension in MCDM. As Ascough et al. (2002) involved in decision making.
referred, Spatial Multicriteria Decision Analysis

230
GIS Technique for Territorial Analysis

Broadly speaking MCDM is one of the most In literature we find several definitions of
well known branch of decision-making and can Multicriteria Analysis, Figure 1 shows Pitz and
be classified as a decision support method which McKillip, (1984) scheme (one goal, one decision
aim at seeking the best solution1. maker, three objectives).
In general, MCDM problems involves six The columns consist of a level representing
components: the decision makers, their preferences and evalu-
ation criteria. These elements are organized in a
1. A goal or a set of goals the decision maker hierarchical structure. The most general level is
attempts to achieve; represented by goal and at this level a desired end
2. The decision maker or group of decision state resulting from decision-making activity is
makers involved in the decision-making specified. The decision require analysis of the
process along with their preferences with values of person affected by the decision, who
respect to evaluation criteria (consist of a are often characterized by unique preferences with
single person or a group of people, such as respect to the relative importance of criteria on
government or corporate organizations) ; the basis of which alternative decisions are
3. A set of evaluation criteria (called also evaluated. The preferences are typically opera-
objectives and/or attributes) on the basis of tionalized in terms of weights assigned to the
which the decision makers evaluate alterna- evaluation criteria.
tive courses of action; The rows of MCDM matrix represent decision
4. The set of decision alternatives (the decision alternatives. All decision are made in some kind of
or action variables); environmental context and therefore involve many
5. The set of uncontrollable variables or states factors beyond the control of the decision maker.
of nature; The uncontrollable factors reflect the degree of
6. The set of outcomes or consequences as- uncertainty about decision-making processes
sociated with each alternative-attribute pair. and outcomes (consequences). Therefore, for
each decision alternative there is a set of possible
outcomes: the decision outcomes depend on the

Figure 1. MCDM matrix

231
GIS Technique for Territorial Analysis

set of attributes for evaluating alternatives, so a overall assessment of each alternative being ap-
particular intersection of a row and column is a praised.
specific (decision) outcome associated with a Reference literature (Nijkamp 1976, Narisim-
particular alternative and attribute. The decision han 1980, Romero and Amador 1987) indicates
problem requires that the set of outcomes are or- several methods for matrix solution that are based
dered so that the best alternative can be identified. on Multi-Objective Programming, Goal Program-
MCDM techniques commonly apply numerical ming and Compromise Programming.
analysis to a performance matrix in two stages: These approaches are often referred as “com-
scoring and weighting. pensatory concept”: since low scores of one
criterion may be compensated by high scores
• Scoring: the expected consequences of of another. The most common way to combine
each alternative are assigned a numerical scores on criteria, and relevant weights between
score on a strength of preference scale for criteria, is to calculate a simple weighted average
each alternative for each criterion. More of scores. Use of such weighted averages depends
preferred alternatives score higher on the on the assumption of mutual independence of
scale, and less preferred alternatives score preferences. This means that the judged strength
lower. In practice, scales extending from 0 of preference for an alternative on one criterion
to 100 are often used, where 0 represents a will be independent of its judged strength of
real or hypothetical least preferred alterna- preference on another.
tive, and 100 is associated with a real or An initial step can be to see if any of the al-
hypothetical most preferred alternative. ternatives are dominated by others. Dominance
All alternatives considered in the MCDM occurs when one alternative performs at least as
would then fall between 0 and 1002. well as another on all criteria and strictly better
• Weighting: numerical weights are as- than the other on at least one criterion. In principle,
signed to define, for each criterion, the one alternative might dominate all others, but in
relative valuations of a shift between the practice this is unlikely. When it does occur, it is
top and bottom of the chosen scale. A key helpful to ask if there is some advantage of the
feature of MCDM is its emphasis on the dominated alternative that is not represented by
judgement of the decision making team, in the criteria; this may reveal new criteria that have
establishing objectives and criteria, esti- been overlooked. Dominance is more likely just
mating relative importance weights and, to to enable the decision-making team to eliminate
some extent, in judging the contribution of dominated alternatives from further consideration.
each alternative to each performance cri- Once any dominance analysis has been concluded,
terion. The subjectivity that pervades this the next stage is for the decision-making team to
can be a matter of concern. Its foundation, determine whether trade-offs between different
in principle, is the decision makers’ own criteria are acceptable, so that good performance
choices of objectives, criteria, weights and on one criterion can in principle compensate for
assessments of achieving the objectives, weaker performance on another. Most public deci-
although ‘objective’ data such as observed sions admit such trade-offs, but there may be some
prices can also be included. circumstances, perhaps where ethical issues are
central, where trade-offs of this type are not accept-
Mathematical routines, which may be written able. If it is not acceptable to consider trade-offs
into computer programmes, then combine these between criteria, then there are a limited number of
two components (score and weight) to give an non-compensatory MCDM techniques available.

232
GIS Technique for Territorial Analysis

Where compensation is acceptable, most MCDM where: L2 = Euclidean distance of alternative from
methods involve implicit or explicit aggregation ideal solution (p = 2); xi1 xi2= Cartesian coordinates
of each alternative’s performance across all the of alternative i
criteria to form an overall assessment of each
alternative, on the basis of which the set of alter-  n 
natives can be compared. The principal difference L1 =  ∑ x i 1 − x i 2  (2)
 i =1 
between the main families of MCDM methods is  
the way in which this aggregation is done.
In general the aim of matrix solution is fo- where: L1 = Manhattan distance of alternative
cused on the ideal solution choice, which does from ideal solution (p = 1); xi1 xi2= Cartesian
not coincide with the maximum values that can coordinates of alternative i
be obtained by Mathematical programming
viewpoint. The ideal solution definition is based  n 
1
p p
on Pareto efficiency concept “Given a set of al- L∞ = lim  ∑ x i 1 − x i 2  (3)
 i =1 
ternative allocations of goods or outcomes for a  
set of individuals, a change from one allocation
to another that can make at least one individual with lim p → ∞where: L∞ = Infinite metrics
better off without making any other individual distance of alternative from ideal solution (p =
worse off” (Pareto, 1906). ∞); xi1 xi2= Cartesian coordinates of alternative i
The matrix solution implies a compromise Formula 1 and 2 are considered compensatory
solution that is represented by the alternative with methods because low scores on one criterion may
minor distance to the ideal point (solution). So, be compensated by high scores on another. For-
given a goal and n objectives, the alternative with mula 3 is considered no compensatory method that
minor distance to ideal solution will be considered does not permit trade-offs between criteria (i.e. low
the best alternative. It is possible to quantify the values for an alternative on one criterion cannot
distance of n alternative from ideal solution using be offset by better values on another criterion).
Cartesian representation of each alternative: the Using different solution methods can produce
distance is based on p parameter3. different results. “At this point one may reasonably
A parallel can be found in the study of geom- claim that there may never be a single MCDM
etry. Simple principles like the shortest distance method that can ensure that the derived ranking
between two points is a straight line (Formula 1) of the alternatives is the correct one. The prob-
are combined using the rules of logic to prove lem of determining the best alternative is an ill
theorems that are not obvious, like the Pythagorean defined one by its own nature. It is hard to know
principle, that the square of the hypotenuse equals the best solution even under perfect knowledge
the sum of the squares of the other two sides. of input data. What can be done, however, is to
Another two way for calculating the distance analyze the behaviour of numerical methods un-
from alternative and ideal point, are showed in der certain evaluative criteria that are motivated
Formula 2 and 3. by widely accepted notions of logical stability”
(Triantaphyllou 2000).
1
 n 22
L2 =  ∑ x i 1 − x i 2  (1)
 i =1 
 

233
GIS Technique for Territorial Analysis

GEOGRAPHICAL Each layer is created by alphanumeric data


INFORMATION SYSTEM and it is possible to elaborate it using several
operations: measurement, reclassification, buffer-
Geographic Information System (GIS)4 is based ing operations that are the fundamental utilities
on a set of hardware, software an human skills of a GIS.
that have the aim of processing, analysing and Measurement utility enables the calculation
creating graphical representation of a database of points, lines, polygons or pixels to estimate
that is associated to a territory. areas, volumes, distance or other. Reclassification
Since 1960 GIS were performed in Canada and function transform the attribute data associated
United States for military purposes, later (since with a specific “geographic primitives” or pixels.
1980), thanks to personal computer diffusion, they Buffering operation gives specific values to an area
become available for private users. surrounding a geographical primitives or pixels.
It is possible to divide a Geographic Informa- To make map making easy, a GIS system
tion System in three processing methods: data should include a variety of standard map compo-
acquisition, data analysis and data results. nents that can be readily added to a layout. These
Data acquisition is performed by different type5 include map grids, scale bars, legends, annotation
of photo or cartographic and scanned maps, land text, and a means of mixing georeferenced and
use rendering. Each acquired item is linked to a ungeoreferenced groups (north arrows, company
numeric database. This database correspond to a logos) to complete the map. Each of these map
numeric representation of different spatial items components should be easily customizable; for
like land use, slope degree, road and cities location, example, with map grids you should be able to
etc. Database can be composed by categorical control the size and color of the text and lines, the
data (nominal data with no inherent ordering like grid spacing, the components of the grid, and so
land use types, country names or ordinal data with on. Theme mapping is a powerful means to look
inherent order like road class or stream class) or at quantitative trends and classes of attributes as-
numerical data (like temperature degree or rainfall sociated with vector and raster elements. Theme
measurement). Spatial items can have different maps provide a ready means for visualization
representations like unbounded areas (land use), of this information without the need for you to
bounded areas (cities, country boundaries), net- design the scripts that identify the classes and
works (roads or transmission lines) etc. assign meaningful styles. Theme mapping treats
All spatial items are related to a specific pro- the values in a designated field statistically and
jection, scale and resolution. Specific projection assigns styles to the classes identified according
is the related to Earth representation through X,Y to the parameters you specify.
coordinates on a 2D map surface; the scale is One very important feature of an advanced GIS
the ratio of distance on a 2D map respect to the system is that it continues to evolve and incorpo-
equivalent distance in real world, while resolu- rate new technologies. In territorial planning and
tion is the dimension of the smallest spatial item decision making strategies previous techniques are
which can be recognized. increased by introducing decision support tools
There are two different layer representations: for instance statistical, mathematical modeling
the vector layer and the raster layer. Vector layer and map algebra overlay tools.
is based on lines, points and polygons for items Statistical tools are mainly performed by Mul-
representation, and raster layer are pixel grid that tivariate analysis and time series analysis. Former
represent items in a map. is based on Cluster analysis and discriminant
analysis that “are used to explore the similarities

234
GIS Technique for Territorial Analysis

between objects and to define groups of objects responding territorial entity value of layer
by considering simultaneously all variables that B is calculated;
have been measured” (Griffith and Amrhein 1991; • Minimum: the result is the minimum val-
Mather, 1991). Time series analysis “is a set of ue between territorial entity value of layer
observations measured at successive points in time A and corresponding territorial entity value
or over successive periods of time” (Malcewski, of layer B is calculated.
1999). Mathematical modeling tools are mainly
performed by the optimization approach and the Logical operators:
simulation approach. The first approach is based
on finding the best solution of a well defined The logical operators are based on Boolean
management problem. There are several type of algebra and performed on binary variables (0,1
optimization process, like linear programming, or true, false):
integer programming, dynamic programming
etc. Simulation approach is “a methodology for • Intersection: logical AND (algebraic
performing experiments using a model of the multiplication);
real world system” (Mather, 1991). Map Algebra • Union: logical OR (algebraic addiction);
operations perform overlay process of different • Complement logical NOT (inversion).
layers (each geographical primitives or pixels in
a specific layer is expressed by numbers): the Fuzzy operators6:
final layer is the aggregation of some layers using
different type of operators. A list of most typical • Intersection: membership value of x to
operators is showed below. both territorial entity values of layer A and
B:
Algebraic operators:
μA (x) AND μB (x) (MIN[μA (x), μB (x)]);
• Addiction: territorial entity value of lay-
er A is added to territorial entity value of • Union: membership value of x to territo-
layer B; rial entity value of layer A or territorial en-
• Subtraction: territorial entity value of tity value of layer B:
layer A is subtracted from territorial entity
value of layer B; μA (x) OR μB (x) = (MAX[μA (x), μB (x)]);
• Multiplication: territorial entity value of
layer A is multiplied by territorial entity In conclusion, GIS has had an enormous impact
value of layer B; on virtually every field that manages and analyzes
• Division: territorial entity value of layer spatially distributed data. Investing in GIS requires
A is divided by territorial entity value of more than an investment in hardware and software.
layer B; Indeed, in many instances this is the least issue
• Average: the average value of territorial of concern. Most would also recognize that a
entity value of layer A and correspond- substantial investment needs to be placed in the
ing territorial entity value of layer B is development of the database. However, one of the
calculated; least recognized yet most important investments
• Maximum: the maximum value between is in the analysts who will use the system. The
territorial entity value of layer A and cor- system and the analyst cannot be separated, one
is simply an extension of the other. In addition,

235
GIS Technique for Territorial Analysis

the process of incorporating GIS capabilities into rules) define a relationship between the input
an institution requires an investment in long-term maps and the output map” (Malczewski, 2004).
and organization-wide education and training. Multiobjective and multiattribute decision
In many ways, learning GIS involves learning making represent two mainly Spatial Multicriteria
to think, learning to think about patterns, about Decision Analysis methods. First one is mathemat-
space and about processes that act in space. Most ical programming model oriented method “where
procedures have many more general applications the alternatives must be generated” (Keeney R.L.,
and can be used in many novel and innovative ways Raiffa H., 1976). Second one is mathematical
programming data oriented method where the
alternatives is given explicitly: this method is
SPATIAL MULTICRITERIA used with the aim of solving our case study that
DECISION ANALYSIS will be illustrated in six boxes during this section.
The structure of Spatial MultiCriteria Decision
Decisions may be characterized as single- or Analysis is divided into the 5 following defini-
multi-objective in nature, based on either a tion points:
single criterion or multiple criteria. While one
is occasionally concerned with single criterion 1. decisional problem;
problems, most problems approached with a GIS 2. evaluation criteria;
are multicriteria in nature. For example, we might 3. constrains and alternatives;
wish to identify areas of concern for soil erosion 4. decisional rules;
on the basis of slope, land-use, soil type and the 5. territorial analysis results.
like. In these instances, our concern lies with how
to combine these criteria to arrive at a composite It is possible to combine previous points and
decision. As a consequence, the first major area MultiCriteria matrix: the result is showed in
of concern in GIS with regard to Decision Theory Figure 2.
is MultiCriteria Evaluation.
“Earlier, the multi-criteria land suitability was Decisional Problem Definition
assessed more non-spatially, assuming the spatial
homogeneity over the area under consideration. Decisional problem definition is prevalently
This, however, is unrealistic in cases like land suit- related to territorial planning objectives that are
ability studies, where decisions are made using represented by the difference between the ideal
criteria which vary across in space” (Malczewski and the existing situation: this difference is the
1999). Spatial MultiCriteria Decision Support can core of decision makers targets.
be used to overcome previous limit. It is an SDSS Frequently (in literature) territorial analysis is
methodology that combines Multicriteria analysis based on the evaluation of three main aspects of
and Geographical Information System. Starting a generic territory that represent the interaction
from available resources (input data), this meth- between human activities and environment:
odology uses classic overlay operations through
specific decisional process (decision rules) with • economic aspects;
the aim of achieving Decision Makers goals (out- • social aspects;
put data). “GIS-based-MCDM can be thought of • natural aspects.
as a process that combines and transforms spatial
and aspatial data (input) into a resultant decision Territorial planning can be focus on suitable
(output). The MCDM procedures (or decision areas for urban development or suitable land use

236
GIS Technique for Territorial Analysis

Figure 2. Spatial Multicriteria Decision Analysis structure

for some agricultural or forest activities. “Land- Evaluation Criteria Definition


use planning is sometimes misunderstood as
being a process where planners tell people what The second decisional process phase is based
to do. Land-use planning means the systematic on definition of evaluation criteria (Box 2). In
assessment of physical, social and economic GIS analysis Evaluation criteria are qualitative
factors in such a way as to encourage and assist or quantitative factors used to define the area
land users in selecting options that increase their characteristics. They are represented by criterion
productivity, are sustainable and meet the needs maps (raster or vectorial layers) like slope degree
of society” (FAO, 1993). layer, basin roads layer, etc. Each criterion map
Generally numeric data are used for decisional can be obtained through fundamental GIS func-
problem quantification and some objectives are tionalities (i.e. measurement, reclassification,
used for the solution of a decisional problem (Box buffering operations).
1). “An objective is a statement about the desired Frequently criteria are represented by in-
state of geographical system under consideration. creasing or decreasing values in a qualitative or
It indicates the directions of improvement of one quantitative scales. Former is used in order to
or more criteria. In the context of decision analy- identify numbers values with indirect mathemati-
sis, the criteria can be thought of as indicators of cal correlation. For example they can be used for
future outcomes (outcome measures) rather than land use classification (i.e number 1 is related to
past achievements” (Starr and Zeleny, 1977). conifer forest, number 2 is related to croplands
and number 3 is related to vineyards. In this case
Box 1. Decisional problem definition (Goal) all numbers have the same importance value).
Latter is used in order to identify numbers values
Definition of suitable areas for forest activities (harvesting). with direct mathematical correlation: for example
Goal is related to following objectives: they can be used for forest production calculation
1. to minimize the costs of forest harvesting
2. to maximize the forest profit. (i.e. Canada coniferous forest produce n m3 per

237
GIS Technique for Territorial Analysis

hectare per year, while Italian coniferous forest Another normalization methodology fre-
produce n-1 m3 per hectare per year. In this case quently used under uncertainty scenarios is based
n m3 value is higher than n-1 m3 value). on Fuzzy logic (Zadeh, 1965). “The inability
During this phase is important to compare of the normal decision making methods to ad-
different evaluation criteria: the problem is rep- dress the imprecision and uncertainty paved the
resented by different unit of measure (i.e. it is path for the fuzzy decision making techniques.
impossible to related m3 and kilometres per hour Goals, constraints and consequences are known
or hectares and litres). imprecisely in much of the real world decision-
A standard measure scales must to be used making processes and in such a situation fuzzy
with the aim of confrontation of evaluation crite- set theory becomes functional” (Bellman R. E.,
ria. A normalization process can be used for this Zadeh L. A., 1970).
purpose. Following Formula (4-7) show different In Fuzzy logic theory an element x (criteria)
normalization methodologies. has membership value in real unit interval [0,1]
related to an y set. Membership values in vaguely
v jmax − v j defined sets are an extension of classic number
vn = (4) binary range {0,1}. An element x value close to
v jmax − v jmin
1 is considered like an high membership value
in defined set, while an element x value close to
where: vnj = normalized value of criteria j; vjmax = 0 is considered like a low membership value in
maximum value of criteria j; vj = value of criteria defined set. Formula 8 shows a membership value
j; vjmin = minimum value of criteria j function in a generic set. The function derived set
is called fuzzy set.
v j − v jmin
v nj = (5) F = {(x, μ(x))|x X} (8)
v jmax − v jmin

where: F = fuzzy set; X = generic set; x = criteria


where: vnj = normalized value of criteria j; vj =
max
value; μ(x) = membership value of criteria
maximum value of criteria j; vj = value of criteria Most common fuzzy normalizations are related
j; vjmin = minimum value of criteria j to linear, j-shaped and sigmoidal normalization
functions. The y axes represent membership value
vj of criteria μ(x) real unit interval [0,1] while a, b,
v nj = (6) c and d represent inflection points (called also
v jmax
control points) of the curve for increasing function
normalization, decreasing function normalization
where: vnj = normalized value of criteria j; vjmax = and symmetrical function normalization.
maximum value of criteria j; vj = value of criteria j Figure 3 represent an example of member-
ship value of criteria μ(x) to high slope soils set.
v jmin The high slope soils set is between 50% (slope
v nj = 1 − (7) percentage maximum membership value) and 5%
vj
(slope percentage minimum membership value).

where: vnj = normalized value of criteria j; vjmin = μ(x) =1: it identifies high slope percentage soils
minimum value of criteria j; vj = value of criteria j (50% slope percentage)

238
GIS Technique for Territorial Analysis

Figure 3. Membership value of criteria (linear function)

μ(x) = 0,7: it identifies medium-high percentage Constrains and Alternatives


slope soils (35% slope degree) Definition
μ(x) = 0: it identifies almost flat soils (5% slope
degree) “Constrains are limitations imposed by nature or
by human beings that do not permit certain actions
to be taken” (Keeney, 1980). The nature constrains
are usually related to orographic characteristics
Box 2. Evaluation criteria definition
while human constraints are related to planning
Suitable areas for forest harvesting are identified by areas with strategies, budget constraints, etc. Constrains are
low labour costs (minimize costs of forest harvesting objec- generally valued only by binary values (0 or 1)
tive) and areas with high forest production (maximize forest
profit). So evaluation criteria are based on following layers:
of their outcomes. They are often represented by
1. basin roads layer Boolean maps (0,1).
2. forest production layer
Alternatives represent the choices of stakehold-
First layer is used with the aim of minimize costs objective,
while second layer is used with purpose of maximize the profit ers and they are valued through numeric values
objective. A Fuzzy linear normalization process is used using of outcomes that compose themselves. In GIS
following parameters
1. basin road layer-linear decreasing- control point 0, 7 where structure each alternative is represented by pixels
7 is the maximum distance from roads expressed in kilometres (in raster models) or geographical primitives (in
2. forest production layer-linear increasing-control point 0, 324
where 324 is the maximum forest production expressed in m3/ vector models). Geographical primitives can be
ha grouped into two types of alternatives: simple and
Unit of measures of each criteria (kilometres and cubic metres
per hectare) are transformed in 0-2557 range values. Areas complex. Former are represented by single type of
close to roads will have normalized values close to 255 (0 object like points, lines and polygons, while latter
Kilometres), and good forest production areas will have nor-
malized values close to 255 (324 m3/ha).
are represented by interaction between objects like

239
GIS Technique for Territorial Analysis

Box 3. Constrains and alternatives definition


how alternatives are to be compared using this
Constrains (Figure 4): suitable areas must to be covered by index (Box 4).
forest and they must to be included into case study boundar- Broadly speaking evaluation criteria are ag-
ies (dark coloured pixels show forest areas into case study
boundaries).
gregated with multicriteria operators for choose
Alternatives are represented by pixel values of criterion raster the best alternative with the aim of the objectives
maps where dark coloured pixels show best areas location for
definition.
forest production criteria (Figure 5) and dark coloured pixels
show areas close to roads (Figure 6). In GIS models it is possible to do it using
overlay process with compensatory, no compensa-
tory operators. During recent years also partially
line-line, line-polygon, point-line etc. Given objec- compensatory operators are used in territorial
tive n, the stakeholders decision focus on the best analysis.
location finding (alternative). Alternatives values Weighted Linear Combination (WLC) is typi-
are related to outcomes value that can be binary cal compensatory operator through which alterna-
values (0 or 1), discrete values (finite values) or tive value is defined as the average of each criteria
continuous values (infinite values). Thematic value multiplied by the weights value (Formula 9).
maps often represent them (Box 3). It is important the role of criteria weights:
they are assigned to given criteria to indicate its
Decision Rules relative importance according to the preferences
of decision makers. They are called importance
The procedure by which criteria are selected and weights.
combined to arrive at a particular evaluation, and
n
by which evaluations are compared and acted
upon, is known as a decision rule. Decision rules ∑c ij
⋅ pi
j =1
Vj = (9)
typically contain procedures for combining criteria n
into a single composite index and a statement of

Figure 4. Forest areas map

240
GIS Technique for Territorial Analysis

Figure 5. Forest production map

where: Vj = alternative j value; cij = criteria i alternative value and a low alternative value hides
value relating to alternative j; pi = weight value an high alternative value back.
of criteria i (∑pi = 1); n = total criteria number Minimum operator (Min) is a no compensatory
This is a compensatory method because an method through which alternative value is defined
high alternative value hides (compensates) a low

Figure 6. Distance from roads map

241
GIS Technique for Territorial Analysis

by minimum value of each criteria multiplied by where: Vj = alternative j; n = total criteria number;
the weights value (Formula 10). Lij = order weights of criteria i and alternative j (L
≤ n, if L = 1 MIN operator, L close to n = compen-
Vj = Min (cij) ∙ pi (10) satory method, L close to 0 = no compensatory
method); pi = weight value of criteria i (∑pi = 1)
where: Vj = alternative j value; cij = criteria i OWA is almost identical to that of WLC, with
value relating to alternative j; pi = weight value the exception that a second set of weights appears.
of criteria i (∑pi = 1) This second set of weights, the order weights,
Maximum operator (MAX) is also a no com- controls the manner in which the weighted factors
pensatory method through which alternative value are aggregated (Eastman and Jiang, 1996; Yager,
is defined by maximum value of each criteria 1988). Indeed, WLC turns out to be just one vari-
multiplied by the weights value (Formula 11). ant of the OWA technique. To introduce the OWA
technique, let’s first review WLC in terms of two
Vj = MAX (cij) ∙ pi (11) new concepts: tradeoff and risk.
About tradeoff: Factor weights are weights
where: Vj = alternative j value; cij = criteria i that apply to specific factors, i.e., all the pixels
value relating to alternative j; pi = weight value of a particular factor image receive the same fac-
of criteria i (∑pi = 1) tor weight. They indicate the relative degree of
Formula 10 and 11 show no compensatory importance each factor plays in determining the
methods because the final criteria value is based suitability for an objective. In the case of WLC
on highest alternative value (MAX operator) or the weight given to each factor also determines
lowest alternative value (Min operator). how it will tradeoff relative to other factors. For
Ordered Weighted Average (OWA) (Eastman example, a factor with a high factor weight can
and Jiang, 1996). This method offers a complete tradeoff or compensate for poor scores on other
spectrum of decision strategies along the primary factors, even if the unweighted suitability score
dimensions of degree of tradeoff involved and for that highly-weighted factor is not particularly
degree of risk in the solution. good. In contrast, a factor with a high suit- ability
Ordered Weighted Average is a partially com- score but a small factor weight can only weakly
pensatory operator and it can be considerate the compensate for poor scores on other factors. The
intermediate solution between previous opera- factor weights determine how factors tradeoff but,
tors (Formula 12). This is a weighted sum with as described below, order weights determine the
ordered evaluation criteria. Thus in addition to overall level of tradeoff allowed.
the criterion weights (importance weights), order About risk: Boolean approaches are extreme
weights are used. The order weights are associated functions that result either in very risk-averse
with criteria values on location by location basis. solutions when the AND operator is used or in risk-
They are assigned to a location’s criteria values taking solutions when the OR operator is used.53
in decreasing order without considering which In the former, a high aggregate suitability score
criteria value comes from. for a given location (pixel) is only possible if all
factors have high scores. In the latter, a high score
n
in any factor will yield a high aggregate score,
∑L ij
even if all the other factors have very low scores.
Vj = i =1
∙ pi (12)
n The AND operation may be usefully described
as the minimum, since the minimum score for
any pixel determines the final aggregate score.

242
GIS Technique for Territorial Analysis

Similarly, the OR operation may be called the criteria and alternatives values can have the same
maximum, since the maximum score for any pixel importance. No compensatory operators are fre-
determines the final aggregate score. The AND quently used for valuation of projects possibly
solution is risk-averse because we can be sure damaging to the environment. The lowest alterna-
that the score for every factor is at least as good tive value can be represent a dangerous factor that
as the final aggregate score. The OR solution is is more important than others. In this case it is
risk-taking because the final aggregate score only significant to underline the value of dangerous
tells us about the suitability score for the single alternative in order to avoid the wrong choice. An
most suitable factor. example is showed in Box 6.
The WLC approach is an averaging technique
that softens the hard decisions of the Boolean ap- Territorial Analysis Results
proach and avoids the extremes. In fact, given a
continuum of risk from minimum to maximum, Each Criterion map is aggregated by decisional
WLC falls exactly in the middle; it is neither risk- rules (operators): the result is a final map that
averse nor risk-taking. represents an easily understanding method for all
About order weights, tradeoff and risk: The public or private stakeholders that can choose the
use of order weights allows for aggregation solu- correct territorial planning strategies (Box 5, Box
tions that fall anywhere along the risk continuum 6). The end of a decision making process is not
between AND and OR. Order weights are quite only focus on the resolution of a specific decisional
different from factor weights. They do not apply problem, but it is represent a recommendation
to any specific factor. Rather, they are applied for future actions. A large number of alternatives
on a pixel-by-pixel basis to factor scores as de- are evaluated on the basis of multiple criteria and
termined by their rank ordering across factors at then the best solution (best alternatives or ranking
each location (pixel). Order weight 1 is assigned of alternatives) is defined. Spatial Multicriteria
to the lowest-ranked factor for that pixel (i.e., the Decision Analysis is planned with the aim to help
factor with the lowest score), order weight 2 to the decision maker during planning process: it
the next higher-ranked factor for that pixel, and so performs complex multiple criteria evaluation
forth. Thus, it is possible that a single order weight and at same time it includes the preferences of
could be applied to pixels from any of the various decision maker.
factors depending upon their relative rank order.
Using different operators the process can
produce different values. From operative point of Box 4. Decision rules
view compensatory operators are used for envi-
ronmental improvement projects. In this case all A compensatory operator (WLC) is used for criterion maps
aggregation (Figure 8).

Figure 7. OWA scheme

243
GIS Technique for Territorial Analysis

Figure 8. Flowchart of WLC aggregation

Box 5. Territorial analysis results (cartographic


an important phase that need support tools in
layout)
order to considerate many decisional variables.
The territorial analysis results are showed by final map (Figure The decisions can be categorized in decision
9) where dark coloured pixels represent more suitable areas for involving stochastic information and imprecise
forest activities (areas with high forest production and close to
roads).
information. So it is possible to divide decisions
in probabilistic and fuzzy group. Recently both
stochastic (probabilistic) and imprecise (fuzzy)
FUTURE RESEARCH DIRECTIONS methodologies was included in GIS software in
order to resolve decision problems under uncer-
The main goal is to realise a correct territorial tainty, such as territorial characteristics. The future
planning strategy through the prediction of future research directions move to a full integration of
territorial scenarios. Decision processes become previous statistical methodologies and automatic
techniques aiming to the extraction of the involved

Figure 9. Suitable areas for forest activities

244
GIS Technique for Territorial Analysis

Box 6. Territorial analysis results (numeric layout)

According to Formula 9 (Weighted Linear Combination) the following examples provide 2 different suitable areas definitions.
Case study area 1
1. Criteria values
Forest production 324 m3/ha (maximum production is 324 m3/ha)
Distance from roads = 6 kilometres (maximum distance is 7 kilometres)
2. Criteria values after normalization
Forest production = 255
Distance from roads = 15
3. Suitable area value for forest activities using WLC aggregation
WLC = (255*0,58+15*0,5)/2 = 67,5
Case study area 2
1. Criteria values
Forest production 122 m3/ha (maximum production is 324 m3/ha)
Distance from roads = 3 kilometres (maximum distance is 7 kilometres)
2. Criteria values after normalization
Forest production = 120
Distance from roads = 100
3. Suitable area value for forest activities using WLC aggregation
WLC = (120*0,5+100*0,5)/2 = 55
The first example area is far from roads but has the highest forest production, while the second area has medium values for all criteria.
Using WLC method the lowest alternative value (case study area 1 distance from road) is hidden by high production values, so the first
area is more suitable for forest activities than second one.
Same territorial analysis can have different results using different operators. The previous example is also performed using Min operator.
Case study area 1
1. Criteria values
Forest production 324 m3/ha (maximum production is 324 m3/ha)
Distance from roads = 6 kilometres (maximum distance is 7 kilometres)
2. Criteria values after normalization
Forest production = 255
Distance from roads = 15
3. Suitable area value for forest activities using Min aggregation
Min = Min (255*0,5; 15*0,5) = 7.5
Case study area 2
1. Criteria values
Forest production 122 m3/ha (maximum production is 324 m3/ha)
Distance from roads = 3 kilometres (maximum distance is 7 kilometres)
2. Criteria values after normalization
Forest production = 120
Distance from roads = 100
3. Suitable area value for forest activities using Min aggregation
Min = Min (120*0,5; 100*0,5) = 50
No compensatory operators focus on the lowest alternatives value. In this case the lowest alternative value (case study area 1 distance
from road) is not hidden by high values of other alternatives, so the second area is more suitable for forest activities than first one.

decisional rules in territorial planning strategies and are implemented in automatic learning tools.
considering temporal dimension. “GIS have not Markov Chains, Bayesian networks, Cellular
fully realized their potential as systems to sup- Automata are used in order to predict future ter-
port and facilitate spatial modeling processes. ritorial scenarios. They use decisional rules with
They continue to handle the temporal dimension the aim of create transitional rules for modelling
very poorly” (Wagner, 1997). For such purposes spatial territorial scenarios.
sophisticated techniques are performed: decisional So the integration of GIS and statistical
rules are extracted using Discrete Choice models, techniques will be the new frontier of territorial
Neural Network System or other non linear models analysis. “Model integration is a deep scientific

245
GIS Technique for Territorial Analysis

problem but even so a pragmatic multi- criteria personal computer are used to resolve the first
multi-objective problem. Model integration re- problem, while Multicriteria rules applied to Fuzzy
quires dealing with end-use aspects: what is ap- logic are used to represent imprecise, incomplete
propriate to be integrated in view of the intended and vague information. In conclusion this chapter
use, the scientific aspect: what can and cannot be focus on a decision support methodology where
integrated on scientific grounds; and the techni- the final results are thematic maps that represent
cal aspects: how will the integrated model be friendly-understanding method for territorial
assembled and run. Yet, and despite the fact that analysis and an important tool open to public or
the terms ‘integral model’ and ‘integrated model’ private stakeholders who can choose the proper
are used all over, there are very few recipes or territorial planning strategy.
procedures for model integration available from
the scientific literature. Thus, model integration
seems more an art than a science at this moment” REFERENCES
(Engelen et all, 2002).
Amrhein, C., Griffith, D., & Desloges, J. R. (1991).
Statistical analysis for geographers. Prentice Hall
CONCLUSION publisher.
Arrow, K. J. (1950). A difficulty in the concept of
A correct territorial planning must consider the social welfare. The Journal of Political Economy,
increasing of anthropic pressure and the protection 58(4), 328–346. doi:10.1086/256963
of territorial resources: the protection of a territory
is an important key issue, where it is significant Ascough., et al. (2002). Multicriteria spatial deci-
the respect of rules for a correct exploitation of sion support systems: Overview, applications, and
natural resources. future research directions. In A. E. Rizzoli & A. J.
Spatial Decision Support System carry out Jakeman (Eds.), Proceedings of the 1st Biennial
several procedures when analysing complex de- Meeting of the iEMSs.
cisional problems. “The basic strategy is to divide
Bellman, R. E., & Zadeh, L. A. (1970). Decision-
the decision problem into small, understandable
making in a fuzzy environment. Management
parts; analyze each part; and integrate the parts in a
Science, 17(4), B141–B164. doi:10.1287/
logical manner to produce a meaningful solution”
mnsc.17.4.B141
(Malczewski, 1999). One of SDSS procedures is
based on Spatial Multicriteria Decision Analysis Burrough, P. A. (1989). Fuzzy mathemati-
that combines Multicriteria analysis and Geo- cal methods for soil survey and land evalua-
graphical Information System. Given a decisional tion. Journal of Soil Science, 40, 477–492.
problem, some objectives are defined and are ana- doi:10.1111/j.1365-2389.1989.tb01290.x
lyzed through criteria, alternatives and constrains
Eastman, J. R., & Jiang, H. (1996). Fuzzy measures
definition. Thus a spatial analysis is provided by
in multi-criteria evaluation. Proceedings, Second
using multicriteria operators in order to choose
International Symposium on Spatial Accuracy
best alternatives. This phase is characterized by
Assessment in Natural Resources and Environ-
two main problems related to the complexity of
mental Studies, (pp. 527-534). May 21-23, Fort
data analysis and to the difficulty of working under
Collins, Colorado.
uncertain scenarios: GIS techniques executed by

246
GIS Technique for Territorial Analysis

Engelen, G., White, R., van der Meulen, M., Romero, C., & Amador, F. (1987). Multiple ob-
& Hahn, B. (2002). Sustainable developments jectives in agricultural planning: A compromise
of islands: A policy support framework for the programming application. American Journal
integrated assessment of socio-economic and of Agricultural Economics, 69(1), 78–86.
environmental development. In Hsiao, H.-H. M., doi:10.2307/1241308
Liu, C.-H., & Tsai, H.-M. (Eds.), Sustainable
Starr, M. K., & Zeleny, M. (1977). Multiple cri-
development for island societies: Taiwan and the
teria decision making. North-Holland Pub. Co.
world (pp. 251–287). Taipei, Taiwan: Asia-Pacific
Research Program, Academia Sinica and SARCS Triantaphyllou, E. (2000). Multi-criteria decision
Secretariat. making methods: A comparative study. Kluwer
Academic Publishers.
FAO. (1976). A framework for land evaluation.
Soils Bulletin, 32. Rome, Italy: Food and Agricul- Wagner, D. F. (1997). Cellular automata and geo-
tural Organization of the United Nations. graphic information systems. Environment and
Planning B, 24, 219–234. doi:10.1068/b240219
FAO. (1993). Guidelines for land-use planning.
FAO Development Series No. 1. ISBN 92-5- Watson, S. R., & Buende, D. M. (1987). Deci-
103282-3, ISSN 1020-0819 sion synthesis. The principles and practice of
decision analysis. Cambridge, UK: Cambridge
Keeney, R. L. (1980). Siting energy facilities. New
University Press.
York, NY: Academic Press.
Yager, R. (1988). On ordered weighted averaging
Keeney, R. L., & Raiffa, H. (1976). Decision
aggregation operators in multicriteria decision
with multiples objectives: Preferences and value
making. IEEE Transactions on Systems, Man, and
tradeoffs. New York, NY: Wiley.
Cybernetics, 8(1), 183–190. doi:10.1109/21.87068
Malczewski, J. (1999). GIS and multicriteria deci-
Yager, R. R. (1987). Fuzzy sets and application:
sion analysis. John Winley & Sons, Inc.
Selected papers by L.A. Zadeh. New York, NY:
Malczewski, J. (2004). GIS–based land-use suit- John Wileys & Sons.
ability analysis: A critical overview. Elsevier.
Yu, P. L. (1973). A class of solutions for group
Mather, P. M. (1991). Computer applications in decision problems. Management Science, 19(8).
geography. Chichester, UK: John Wiley & Sons. doi:10.1287/mnsc.19.8.936
Narisimhan, R. (1980). Goal programming Zadeh, L. (1965). Fuzzy sets and systems. In Fox,
in a fuzzy environment. Decision Sciences, J. (Ed.), System theory. Brooklyn, NY.
11(2), 325–336. doi:10.1111/j.1540-5915.1980.
tb01142.x
ADDITIONAL READING
Nijkamp, P. (1976). Multi-objective programming
models: new ways in regional decision making. Bernetti I., Fagarazzi C., (2002). L’impiego dei
Regional Science and Urban Economics, 6, modelli multicriteriali geografici nella pianificazi-
253–274. doi:10.1016/0166-0462(76)90002-8 one territoriale. Aestimum dicembre 41, pp 1-26.
Pareto, V. (1906). Manuale di economia politica. Chen, S. J., & Hwang, C. (1992). Fuzzy multiple
attribute decision making. Springer-Verlang,
Berlino.

247
GIS Technique for Territorial Analysis

Chirici G., (1998). I GIS nella pianificazione e KEY TERMS AND DEFINITIONS
gestione forestale: l’esempio del Comune di M.te
Armelio. Genio Rurale n°7/8, pp 19-25. Decision Rules: Territorial planning choices
driven by Stakeholders decisions that shape a
Ferrucci D., (1999). Applicazione di un GIS per specific territory.
il confronto e l’integrazione di cartografie di uso Geographical Information System (GIS):
del suolo. Genio Rurale n°5, pp 55-63. Software for acquisition, storage and analysis of
Funtowicz, S. O., & Ravetz, J. R. (1990). Uncer- spatial data in order to represent real world items
tainty and quality in science for policy. Dordrecht: with vector o raster structure.
Kluwer. Multicriteria analysis: (MCDM): Decision
support method based on objectives, alternatives,
Gisotti G.,Bruschi S., (1990). Valutare l’ambiente. criteria definition with the aim of solving a specific
La nuova Italia Scientifica editore, cap 1,7,9. decisional problem (goal).
Nijkamp, P., & Rietveldt, P. (1996). Multiple Ob- Raster Structure: Representation of real
jective Decision Analysis in regional economic. world items with grid formed by pixels (minimum
Amsterdam, North Holland: Handbook of regional grind unit).
and urban economics. Spatial Decision Support System: (SDSS):
Computer-based systems with the aim of solving
Nuti F., (2001). La valutazione economica delle spatial decision problems.
decisioni pubbliche. Giappichelli editore. Spatial Multicriteria Decision Analysis:
Riccioli, F. (2009). Una proposta metodologica Decision support tool used in territorial analysis
per lo studio della multifunzionalità: l’analisi based on integration of Geographical Information
MultiCriteriale Geografica. Economia e diritto System and Multicriteria analysis.
agroambientale n.3 (pp. 99–118). Firenze Uni- Territorial Planning: Assessment process
versity Press. driven by Decision Makers objectives with the
aim of economics, social and environmental
Sugeno, M. E., & Yasukawa, M. (1993). A fuzzy- characteristics analysis of a specific territory.
logic-based approach to qualitative modeling. Vector Structure: Representation of real
IEEE Transactions on Fuzzy Systems, 1, 7–31. world items with geographical primitives (points,
doi:10.1109/TFUZZ.1993.390281 lines, polygons).
Yu, P. L. (1973). A class of solutions for group
decision problems. Management Science, 19(8).
doi:10.1287/mnsc.19.8.936 ENDNOTES

Zeleny, M. (1974). A concept of Compromise 1


Best solution meaning is based on Arrow’s
solutions and method of the displaced ideal -. impossibility theorem (Arrow, 1950)
Computers & Operations Research, 1(4). 2
The measures used in performance matrices
of this kind are also qualitative descriptions,
Zeleny, M. (1982). Multiple Criteria Decision
or natural units.
Making. New York: McGraw Hill. 3
p is the metrical parameter used for distance
Zimmermann, H. J. (1987). Fuzzy sets, decision from ideal measurement
making and expert system. Boston: Kluwer Ac- 4
The word GIS is considered both plural and
cademic Publisher. singular during text
5
Aerial photo, satellite photo, etc.

248
GIS Technique for Territorial Analysis

6
Based on fuzzy theory (see Spatial Multi- 7
The range scale 0-255 allow an higher cal-
Criteria Decision Analysis section for more culation speed than 0-1 range scale from
details) computer performance point of view
8
Same weights values for each criteria are
considered

This work was previously published in ural and Environmental Informatics, Governance and Management: Emerging Research
Applications, edited by Zacharoula Andreopoulou, Basil Manos, Nico Polman and Davide Viaggi, pp. 425-445, copyright 2011
by Information Science Reference (an imprint of IGI Global).

249
250

Chapter 17
Cognitively Ergonomic
Route Directions
Alexander Klippel
University of Melbourne, Australia

Kai-Florian Richter
Universität Bremen, Germany

Stefan Hansen
Spatial/Information Systems Ltd./LISAsoft, Australia

ABSTRACT
This contribution provides an overview of elements of cognitively ergonomic route directions. Cognitive
ergonomics, in general, seeks to identify characteristics of cognitive information processing and to formal-
ize these characteristics such that they can be used to improve information systems. For route directions,
an increasing number of behavioral studies have, for example, pointed to the following characteristics:
the use of landmarks, changing levels of granularity, the qualitative description of spatial relations.
The authors detail these aspects and additionally introduce formal approaches that incorporate them
to automatically provide route directions that adhere to principles of cognitive ergonomics.

COGNITIVE ASPECTS OF ROUTE aspects of route directions has increased. Next to


DIRECTIONS the general aspects of how to provide route direc-
tions and how to identify principles that allow us
Route directions fascinate researchers in several to define what makes route directions cognitively
fields. Since the 70s linguists and cognitive scien- ergonomic, technical aspects of navigation support
tists have used verbal route directions as a window systems have become an additional focus. The
to cognition to learn about cognitive processes question required from the latter perspective is
that reflect structuring principles of environmen- part of a broader approach that aims to formally
tal knowledge (e.g., Klein, 1978). Over the last characterize the meaning (semantics) of spatial
decade, the number of publications on various relations. In other words, if we want to bridge the
gap between information systems and behavioral
DOI: 10.4018/978-1-4666-2038-4.ch017

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Cognitively Ergonomic Route Directions

analysis we have to answer how we perform the tems detailed formal characterizations of route
transition from data to knowledge. knowledge are required. The approaches discussed
Several key elements can be identified based below are a representative vocabulary that allows
on psychological and linguistic literature on route for the characterization of mental conceptualiza-
directions that are pertinent for cognitively ergo- tion processes reflecting the results from behav-
nomic route directions (Denis, 1997; Lovelace, ioral studies (see Table 1). In this sense we can
Hegarty, & Montello, 1999; Tversky & Lee, refer to them as Ontologies of Route Knowledge
1999). These comprise the conceptualization of (Chandrasekaran, Josephson, & Benjamins, 1999;
directions at decision points, the spatial chunking Gruber, 1993). In Guarino’s terminology these
of route direction elements to obtain hierarchies approaches would most likely be called domain
and to change the level of granularity, the role ontologies (Guarino, 1998).
of landmarks, the communication in different One of the earliest approaches is the TOUR
modalities, the traveling in different modes, and model by Kuipers (Kuipers, 1978) that later devel-
aspects of personalization (see Table 1). Most oped into the Spatial Semantic Hierarchy (SSH)
research on routes and route directions deals with (Kuipers, 2000). Kuipers and his collaborators
navigation in urban structures such as street net- developed this approach to add the qualitative-
works. The results discussed in this article focus ness that can be found in the organization of a
on this domain. cognitive agent’s spatial knowledge to approaches
in robotics. The latter classically relied more on
quantitative spatial descriptions. The SSH al-
APPROACHES TO REPRESENTING lows for modeling cognitive representations of
ROUTE KNOWLEDGE space as well as for building a framework for
robot navigation, i.e. qualitative and quantita-
Behavioral studies have substantiated key ele- tive aspects are combined. The SSH especially
ments of cognitively ergonomic route directions. reflects the aspect of hierarchical organization of
To implement these aspects in information sys- spatial knowledge by providing different levels of
information representation: the sensory, control,
causal, topological, and metrical level. Ontological
Table 1. Cognitive ergonomics of route directions
characterizations are developed for each level to
• are qualitative, not quantitative, match human cognitive processes.
• allow for different levels of granularity and organize spatial The Route Graph model (Werner, Krieg-
knowledge hierarchically, Brückner, & Herrmann, 2000) describes key
• reflect cognitive conceptualizations of directions at decision elements for route based navigation. Similar to
points,
the SSH, it allows representing knowledge on
• chunk route direction elements into larger units to reduce
cognitive load,
different levels of granularity. However, it is
much more abstract and does not provide any
• use landmarks to:
processes for acquiring this knowledge. It is
° disambiguate spatial situations,
intended to provide a formalism expressing key
° anchor turning actions,
notions of route knowledge independent of a
° and to confirm that the right actions have been taken,
particular implementation, agent, or domain. Its
• present information in multimodal communication systems
allowing for an interplay of language and graphics, but re- focus is on a sound formal specification of basic
specting for the underlying conceptual structure, elements and operations, like the transition from
    • allow for an adaptation to the user’s familiarity with an en- route knowledge to survey knowledge by merging
vironment, as well as personal styles and different languages.
routes into a graph-like structure.

251
Cognitively Ergonomic Route Directions

A linguistically grounded approach with the a single route, i.e., actions to perform to reach
aim to generate verbal route directions is the the destination, while the latter provides concepts
CORAL project by Dale and coworkers (e.g., referring to the complete transportation network.
Dale, Geldof, & Prost, 2005). One of the central An industry approach for formalizing route
aspects of their approach is the organization of knowledge can be found in Part 6: Navigation
parts of a route into meaningful units, a process Service of the OpenLS specification. The Open-
they call segmentation. Instead of providing GIS Location Services (OpenLS) Implementation
turn-by-turn directions, this approach allows for Specification (Mabrouk, 2005) describes an open
a small number of instructions that capture the platform for location-based application services,
most important aspects of a route. The employed the so called GeoMobility Server (GMS) proposed
modeling language is called Route Planning by the Open Geospatial Consortium (OGC). It
Markup Language (RPML). offers a framework for the interoperable use of
Formalisms that model route knowledge on mobile devices, services and location-related data.
the conceptual level can be found in the theory of The Navigation Service described in Part 6 of
wayfinding choremes (Klippel, Tappe, Kulik, & the OpenLS specification provides the accessing
Lee, 2005) and context-specific route directions client, amongst other services, with preprocessed
(Richter & Klippel, 2005). These approaches data that is required for the generation of route
model route knowledge modality-independent directions. Based on XML specifications, it defines
on the conceptual level. The wayfinding choreme a data structure that allows clients to generate their
theory employs conceptual primitives—as the own route directions which may accord more to a
result of conceptualization processes of a cogni- user’s preferences. The used data model structures
tive agent incorporating functional as well as the route in maneuvers (descriptions combining
geometrical environmental aspects—to define a turn at a decision point and proceeding on the
basic as well as super-ordinate valid expressions following route segment) and enhances them with
on different levels of granularity. The approach additional information about route elements.
to context-specific route directions builds on
this theory. A systematics of route direction ele-
ments determines which, and how, entities may CORE ASPECTS OF COGNITIVELY
be referred to in route directions. Accordingly, ERGONOMIC ROUTE DIRECTIONS
abstract relational specifications are inferred by
optimization processes that adapt route directions In the following, three aspects that are at the core
to environmental characteristics and inherent of cognitively ergonomic route directions will be
route properties. discussed in greater detail: cognitively adequate
Human wayfinding, however, may not be direction concepts, the use of landmarks, and
restricted to a single mode of transportation. A spatial chunking to obtain hierarchies and change
typical example is public transport, where travelers the level of granularity.
frequently switch between pedestrian movement
and passive transportation (trains, buses, etc.). Conceptualization of Directions
Timpf (2002) analyzed route directions for multi- at Decision Points
modal wayfinding and developed two different
ontologies of route knowledge: one representing The specification of direction changes is the
knowledge from the perspective of the traveler and most pertinent information in route directions.
one taking the perspective of the transportation While current route information systems heavily
system. The former focuses on movement along rely on street names to identify the proper direc-

252
Cognitively Ergonomic Route Directions

tion to take, behavioral research (Tom & Denis, of a specific route, a salient intersection can be
2003) has shown that from a cognitive perspec- used to organize spatial knowledge. This aspect
tive, street names are not the preferred means to has not yet gained much attention.
reorient oneself. People rather rely on landmarks
(as discussed in the next section) and appropri- Enriching Route Directions with
ate direction concepts. On the most basic level Landmarks
we have to specify the correspondence between
a direction change (in terms of the angle) and a Analyzing human route directions shows how
direction concept. For example, which sector is prominently landmarks are used to structure
applicable to a concept like “turn right”? On a the respective spatial knowledge, to give the
more elaborate level, we have to specify alter- instructed the possibility to assure that they are
native direction concepts and detail their scope still following the correct route, and to anchor
of application. Figure 1 shows some examples required turning actions. Since landmarks seem
of how the same direction change can result in to be such an important part of human-generated
different direction concepts (and corresponding route directions their integration is pertinent for
verbalizations) depending, among other things, on automatically generating cognitively ergonomic
the spatial structure in which the change occurs. instructions.
We need this level of specificity for two reasons. Several classifications of landmarks and their
First, a qualitative but precise direction model characteristics have been discussed in the litera-
allows for verbally instantiating a situation model ture. One of the first assessments is presented by
(Zwaan & Radvansky, 1998) of the encountered Lynch (1960) who distinguishes Landmarks as one
intersections. Second, intersections can function of five elements that structure urban knowledge:
as landmarks. Just like classical examples of path, edges, districts, nodes, and landmarks. It
landmarks, such as the Eiffel Tower, in the context is commonly agreed that the landmark account

Figure 1. A change of a direction is associated with different conceptualizations according to the inter-
section at which it takes place. The ‘pure’ change may be linguistically characterized as take the second
exit at the roundabout (a). At intersection (b) it might change to the second right; at intersection (c) it
may change to fork right, and at (d) it becomes veer right.

253
Cognitively Ergonomic Route Directions

should comprise all five elements, as according violates findings of cognitive science, as for ex-
to Presson and Montello (1988) everything that ample formulated in Clark’s 007 Principle: “In
stands out of the background may serve as a general, evolved creatures will neither store nor
landmark. That is, given the right spatial context process information in costly ways when they
different features of an environment may serve can use the structure of the environment and their
as landmarks. Sorrows and Hirtle (1999) distin- operations upon it as a convenient stand-in for the
guish three characteristics important for making information-processing operations concerned.
an object a landmark: its visual, semantic, and That is, know only as much as you need to know
structural characteristics. Additionally, landmarks to get the job done.” (Clark, 1989, p. 64)
can be categorized according to their cognitive Structuring route descriptions by subsuming
function within route directions, their geometry, instructions gives users a coarse overview over a
and their spatial relation to the route. Humans route, which is easier to perceive and quite often
conceptualize landmarks either as point-like, sufficient for successful wayfinding, especially
linear, or area-like entities. However, these con- if the user is familiar with the environment. Of
ceptualizations do not necessarily correspond to course, the subsumed information still has to be
the geometric characteristics of objects but reflect accessible in case the user needs it (or, as discus-
the schematization processes cognitive agents sions on positioning technologies in this volume
apply (Herskovits, 1986). A detailed description show, the user may simply re-query a new route
of the different roles of landmarks is necessary from his new position). This may either be pos-
to allow for their integration in an automatic gen- sible by zoom-in operations, i.e., by accessing the
eration process. For example, a simple, yet as of next, more detailed level of the hierarchy, or by
today unexplored way to enrich route directions (mental) inference processes. Such inferences, for
with landmarks is to include references to salient example, extract from an instruction like “turn left
intersections, like T-intersections or roundabouts, at the dead end” information on which action to
which are easy to identify automatically. This also perform at all intersections before the dead end,
reflects the direction concepts humans employ namely to continue straight (e.g., Duckham &
with such structures (see also Figure 1). Kulik, 2003). The following cognitive strategies
for spatial chunking are discussed in the litera-
Spatial Chunking: Hierarchies ture (Dale et al., 2005; Klippel, Tappe, & Habel,
and Levels of Granularity 2003): numerical chunking, structure chunking,
landmark chunking, and chunking using the street
The hierarchical organization of spatial infor- level hierarchy.
mation and flexibly changing between levels
of granularity are omnipresent in the cognitive
organization of spatial knowledge (Hobbs, 1985; THE MULTIMODAL PRESENTATION
Kuipers, 2000). Chunking elementary wayfinding OF ROUTE KNOWLEDGE
actions (such as turns at intersections) in order
to impose a hierarchical structure and to change The multimodal communication of spatial in-
the level of granularity reflects not only cogni- formation is a core aspect of human cognition:
tive conceptualization processes but organizes linguistic expressions, graphical representations
route knowledge in a cognitively ergonomic such as sketch maps, and gestures are channels
way. Especially users who are familiar with an along which humans naturally communicate (Ovi-
environment can profit from such an approach. att, 2003). Each representational medium—each
In general, providing a user with too much detail channel—has advantages in specific contexts but

254
Cognitively Ergonomic Route Directions

may fail in other situations (Kray, Laakso, Elting, SUMMARY


& Coors, 2003). For example, natural language
expressions are inherently underspecified: a term In the last decades, research on route directions in
like turn right is applicable to a range of different linguistics and cognitive science revealed many
turning angles at an intersection and therefore underlying principles and processes of human
may be sufficient in many situations. Figure 2, route direction production and comprehension,
however, shows a situation that requires a complex and, thus, provides us with an understanding of
explanation if a description is provided in linguis- what constitutes cognitively ergonomic route
tic terms. In this case, a graphic representation directions. However, this understanding has to
is more suitable to communicate the situation at be formally specified to be implemented in in-
hand. Communication channels also differ with formation systems for wayfinding assistance, like
respect to their suitability in the identification of internet route-planners. In essence, three cognitive
landmarks. A salient object at an intersection might principles need to be implemented in wayfind-
be visually easily identifiable and recognisable, ing assistance systems to generate cognitively
but hard to describe linguistically. An expression ergonomic route directions: adequate direction
like follow the road to the dead end on the other concepts, the enrichment of route directions with
hand, may chunk a large part within a route lin- landmarks, and spatial chunking which allows for
guistically and therefore, communicate the spatial a hierarchical structuring of route knowledge and
situation more efficiently if the dead end is a long representations on different levels of granularity.
way away and hard to depict on a small screen. To this end, we need a thorough understanding of
The communication of route information, which direction concept humans apply in which
whether visually, linguistically, or in any other situation, a detailed ontology of the different kinds
modality, has to follow the same guidelines as of landmarks and the role they may take in route
established for the structuring of route knowledge. directions, as well as formal characterizations
Cluttering any communication process has shown that model hierarchical structures and guide the
to violate cognitive ergonomics and to slow down changes of granularity.
information processing. This confinement to
sparseness has been shown for visual route direc-
tions, for example, by Agrawala and Stolte(2000), REFERENCES
who based their route direction tool on results
obtained from sketch maps (Tversky & Lee, 1999). Agrawala, M., & Stolte, C. (2000). A design and
implementation for effective computer-generated
route maps. In AAAI Symposium on Smart Graph-
Figure 2. Complex intersection ics, March 2000. Stanford.
Chandrasekaran, B., Josephson, J. R., & Ben-
jamins, V. R. (1999). What are ontologies, and
why do we need them? IEEE Intelligent Systems
and Their Applications, 14(1), 20–26.
Clark, A. (1989). Microcognition: Philosophy,
cognitive science, and parallel distributed pro-
cessing. Cambridge, MA: MIT Press.

255
Cognitively Ergonomic Route Directions

Dale, R., Geldof, S., & Prost, J.-P. (2005). Using Klippel, A., Tappe, T., Kulik, L., & Lee, P. U.
natural language generation in automatic route (2005). Wayfinding choremes - A language for
description. Journal of Research and practice in modeling conceptual route knowledge. Journal of
Information Technology, 37(1), 89-105. Visual Languages and Computing, 16(4), 311–329.
doi:10.1016/j.jvlc.2004.11.004
Denis, M. (1997). The description of routes: A
cognitive approach to the production of spatial Kray, C., Laakso, K., Elting, C., & Coors, V.
discourse. Cahiers de Psychologie Cognitive, (2003). Presenting route instructions on mobile
16, 409–458. devices. Paper presented at the IUI’03, January
12-15, 2003, Miami, Florida, USA.
Duckham, M., & Kulik, L. (2003). “Simples”
paths: Automated route selection for navigation. Kuipers, B. (1978). Modelling spatial knowledge.
In W. Kuhn, M. Worboys, & S. Timpf (Eds.), Cognitive Science, 2(2), 129–154.
Spatial information theory: Foundations of
Kuipers, B. (2000). The spatial semantic hierarchy.
geographic information science. Conference on
Artificial Intelligence, 119, 191–233. doi:10.1016/
Spatial Information Theory (COSIT) 2003. (pp.
S0004-3702(00)00017-5
182-199). Berlin: Springer.
Lovelace, K., Hegarty, M., & Montello, D. R.
Gruber, T. R. (1993). A translation approach to
(1999). Elements of good route directions in
portable ontologies. Knowledge Acquisition, 5(2),
familiar and unfamiliar environments. In C.
199–220. doi:10.1006/knac.1993.1008
Freksa & D. M. Mark (Eds.), Spatial information
Guarino, N. (1998). Formal ontology and infor- theory. Cognitive and computational foundations
mation systems. In N. Guarino (Ed.), Formal of geographic information science (pp. 65-82).
Ontology in Information Systems. Proceedings of Belin: Springer.
FOIS’98, Trento, Italy, 6-8 June 1998. (pp. 3-15).
Lynch, K. (1960). The image of the city. Cam-
Amsterdam: IOS Press.
bridge, MA: MIT Press.
Herskovits, A. (1986). Language and Spatial
Mabrouk, M. (2005). OpenGis Location Services
Cognition: An Interdisciplinary Study of the
(OpenLS): Core Services. OGC Implementa-
Representation of the Prepositions in English.
tion Specification 05-016 Version 1.1 Open Gis
UK: Cambridge University Press.
Consortium Inc.
Hobbs, J. R. (1985). Granularity. In A. Joshi (Ed.),
Oviatt, S. L. (2003). Multimodal interfaces. In J.
Proceedings of the 9th International Joint Confer-
Jacko & A. Sears (Eds.), The Human-Computer
ence on Artificial Intelligence. Los Angeles, CA
Interaction Handbook: Fundamentals, Evolving
(pp. 432-435). San Francisco: Morgan Kaufmann.
Technologies and Emerging Applications. (pp.
Klein, W. (1978). Wegauskuenfte. Zeitschrift für 286-304). Mahwah, NJ: Lawrence Erlbaum.
Literaturwissenschaft und Linguistik, 33, 9–57.
Presson, C. C., & Montello, D. R. (1988). Points of
Klippel, A., Tappe, T., & Habel, C. (2003). Pic- reference in spatial cognition: Stalking the elusive
torial representations of routes: Chunking route landmark. The British Journal of Developmental
segments during comprehension. In C. Freksa, W. Psychology, 6, 378–381.
Brauer, C. Habel & K. F. Wender (Eds.), Spatial
Cognition III. Routes and navigation, human
memory and learning, spatial representation and
spatial learning. (pp. 11-33). Berlin: Springer.

256
Cognitively Ergonomic Route Directions

Richter, K.-F., & Klippel, A. (2005). A model for Zwaan, R. A., & Radvansky, G. A. (1998).
context-specific route directions. In C. Freksa, Situation models in language comprehension
M. Knauff & B. Krieg-Brueckner (Eds.), Spatial and memory. Psychological Bulletin, 123(2),
Cognition IV. Reasoning, Action, and Interaction: 162–185. doi:10.1037/0033-2909.123.2.162
International Conference Spatial Cognition 2004,
Frauenchiemsee, Germany, October 11-13, 2004,
Revised Selected Papers (pp. 58-78). Berlin: KEY TERMS AND DEFINITIONS
Springer.
Granularity: Here, it refers to the detail in
Sorrows, M., & Hirtle, S. C. (1999). The nature route directions; from coarse levels for general
of landmarks for real and electronic spaces. In C. planning to finer levels to provide context-specific
Freksa & D. M. Mark (Eds.), Spatial information information, for example at decision points.
theory. Cognitive and computational foundations Landmark: Any entity in the environment
of geographic information science (pp. 37-50). that sticks out from the background.
Berlin: Springer. OpenLS: Specification of an open platform
Timpf, S. (2002). Ontologies of wayfinding: A for location-based services defining their core
traveler’s perspective. Networks and Spatial functionality (directory service, gateway service,
Environments, 2, 3–33. location utility service, presentation service, route
service).
Tom, A., & Denis, M. (2003). Referring to land- Personalization: Adaptation of information
mark or street iniformation in route directions: presentation and interaction with a device / soft-
What difference does it make? In W. Kuhn, M. ware to the needs and preferences of a specific,
Worboys & S. Timpf (Eds.), Spatial information individual user.
theory. Foundations of geogrpahic information Route Directions: A set of instructions that
science. International conference, COSIT 2003, allow a wayfinder in known or unknown envi-
Kartause Ittingen, Switzerland, September 2003. ronments to follow a route from a start point to
(pp. 362-374). Berlin: Springer. a destination.
Tversky, B., & Lee, P. U. (1999). Pictorial and Spatial Semantic Hierarchy (SSH): A com-
verbal tools for conveying routes. In C. Freksa & putational model defining acquisition and repre-
D. M. Mark (Eds.), Spatial information theory. sentation of spatial knowledge on different levels
Cognitive and computational foundations of geo- of abstraction ranging from sensory information
graphic information science (pp. 51-64). Berlin: to topological knowledge.
Springer. Wayfinding: The cognitive conceptual activity
of planning and finding ones way.
Werner, S., Krieg-Brückner, B., & Herrmann, Wayfinding Choremes: Mental conceptu-
T. (2000). Modeling navigational knowledge by alizations of functional wayfinding and route
route graphs. In C. Freksa, W. Brauer, C. Habel direction elements.
& K. F. Wender (Eds.), Spatial cognition II.
Integrating abstract theories, empirical studies,
formal methods, and practical applications. (pp.
295-316). Berlin: Springer.

This work was previously published in Handbook of Research on Geoinformatics, edited by Hassan A. Karimi, pp. 230-238,
copyright 2009 by Information Science Reference (an imprint of IGI Global).

257
258

Chapter 18
Design and Implementation
Approaches for Location-Based,
Tourism-Related Services
George Kakaletris
University of Athens, Greece

Dimitris Varoutas
University of Athens, Greece

Dimitris Katsianis
University of Athens, Greece

Thomas Sphicopoulos
University of Athens, Greece

ABSTRACT
The globally observed recession of mobile services market has pushed mobile network operators into
looking for opportunities to provide value added services on top of their high cost infrastructures. Recent
advances in mobile positioning technologies enable services that make use of the mobile user location
information, offering intuitive, attractive applications to the potential customer. Mobile tourism services
are among the primary options to be considered by service providers for this new market. This chapter
presents the key concepts, capabilities, and considerations of infrastructures and applications targeted
to the mobile tourist, covering data and content delivery, positioning, systems’ interactions, platforms,
protocols, security, and privacy as well as business modelling aspects.

INTRODUCTION traditionally considered as being only suited to


wired applications. Due to their versatility, wire-
During the last decade of the 20th century, wire- less telecommunications systems have become a
less data networks have invaded everyday life widespread standard, leading to hardware price
and have gradually started taking over areas drops and radical quality increases. Today there ex-
ist a bunch of technologies that allow the delivery
of information to mobile or wireless devices and
DOI: 10.4018/978-1-4666-2038-4.ch018

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Design and Implementation Approaches for Location-Based, Tourism-Related Services

their users, all presenting different characteristics This chapter presents the technological con-
in performance/ quality, autonomy and cost. These cepts associated with the provision of location-
technological advances accompanied by the reach aware tourism-related services under a service-
of the saturation level (Ellinger, Barras, & Jackel, oriented approach capable of supporting open
2002; Gruber, 2005; Gruber & Verboven, 2001) value chains and to lead financially viable open
in the mobile telephony market pushed hardware and powerful communication systems. The rest
vendors and network and service providers into of the chapter is organised as follows: The “Back-
looking for new business opportunities. The needs ground” section presents the technological and
of tourism-related information provision and business background of location-based services;
services were amongst the first to be considered the “Technology Overview” section gets into
for new applications in the field of communica- details of the technological aspects and issues
tion devices. raised in the domains of positioning and data/
In traditional fixed systems, the location of content delivery, which are fundamental elements
a terminal and its user was a part of its identity of the examined class of services; the section on
and remained constant for a long period during its “Mobile Tourism Services” captures the specific
lifetime. In this new mobility era, this observation needs and opportunities in the specific application
no longer holds: the physical position of the user area and presents issues and considerations with
might be highly variable, introducing a whole respect to integrating the various parts into an open
new range of issues and opportunities to be taken system capable of delivering such services. In the
into account. The use of intelligent systems that “Conclusion,” technology and market conclusions
exploit the positional information of the client, and trends are presented. Finally, due to the large
accompanied by the ability to provide feedback number of acronyms and the frequency of their
over a wireless medium, can lead to the provision appearance, a table of acronyms is provided at
of innovative highly intuitive services that were not the end of the chapter in order to ease reading
available in the near past (Grajski & Kirk, 2003; through it (see Appendix).
Kakaletris, Varoutas, Katsianis, Sphicopoulos, &
Kouvas, 2004; Rao & Minakakis, 2003; Staab &
Werthner, 2002; Yilin, 2000). BACKGROUND
But, although mobile telephony networks offer
maximum mobility, they are not the only means The application of the above-mentioned technolo-
for providing location-based services (LBS) for gies and concepts in tourism gave birth to the
tourism. Local fixed wireless networks in their ubiquitous tourism1 concept (OTC, 2003), which
various forms are another of the modern and refers to the existence and access of tourism related
popular technologies facilitating relevant services. services at any place, any time. Although tourism-
In addition to telecommunication systems and related services are mostly related to content
from a technological perspective, there are a wide provision, more applications can be identified.
range of other systems such as global positioning In its entirety, content provision for e-tourism
system (GPS) (Dana, 1994; ETSI, 2006; GAR- covers a large number of thematic areas: culture,
MIN, n.d.), or ID tags (Bohn & Mattern, 2004; urgencies, transportation, events, and so on. Thus,
Tarumi, Morishita, & Kambayashi, 2000) which content might be both temporally and spatially
might have a significant role in the development labelled (LoVEUS, 2002; M-Guide, 2002). In
and deployment of e-tourism applications based addition, information seeking and avalanche-like
on location information. content provision might guide the user to areas
quite outside her/his initial focus areas.

259
Design and Implementation Approaches for Location-Based, Tourism-Related Services

The information technology (IT), the Internet lows providers and users to make the most out of
and the mobile telecommunications revolutions the existing infrastructures, but also encourages
of the last decades of the 20th century made it usage and drives expectations for the next gen-
possible for enterprises to enable massive access eration of mobile networks (Varoutas, Katsianis,
to their applications and data. Users are able to Sphicopoulos, Stordahl, & Welling, 2006). To
access applications and information through a provide such services, the integration of various
variety of integrated “channels” including the In- components and base-services is required, which
ternet, mobile telephony, and voice interfaces and breaks the current status of most MNOs that have
thus bring forward the concept of multi-channel traditionally been formed as almost monolithic
architectures. Consequently, multi-channel con- self-contained service(s) providers.
tent delivery and media-independent publishing This need for integration of various market
have emerged in order to address the demand for stakeholders in complex business models aim-
personalised content that can adapt to the end-user ing for the provision of high quality services has
device capabilities. Devices, such as PDAs, cel- been indicated not only by mobile market analysts
lular phones, smartphones, and television set-top but also by information systems architects. The
boxes, introduced the need for additional channels service-oriented approach (Brown, Johnston,
for publishing content. The approach of maintain- & Kelly 2003; Colan, 2004), a whole new IT
ing independent content sets per channel proved perspective which is rushing into the industry,
to be highly inefficient in terms of maintenance, underlies the concepts and offers the guidelines
until the wide adoption of eXtensible Markup that render possible such complex collaboration
Language (XML) and related technologies, schemes. In the LBS domain, mobile positioning
such as eXtensible Stylesheet Language / XSL protocol and mobile location protocol (Ericsson;
Transformation (XSL/XSLT), offered a standard OMA, 2002) already exercise concepts in-line
solution to this challenge. with current service-oriented architectures (SOA)
Technology is not the sole reason behind the common practices. Nevertheless, the design of
emergence of the ubiquitous tourism concept. services, such as location-based ones, will always
The existing 2/2.5G2 mobile market has reached have to face domain specific challenges concern-
saturation as analysts have predicted, but its ef- ing technical, economical or even ethical and
fects have only been acknowledged lately, due to social factors of the service application (Daoud
the high expectations of the emerging 3G3 mar- & Mohan 2002).
kets. The costs of licensing (Andersson, Hulten, Nowadays it is possible and desirable to build
& Valiente, 2005; Katsianis, Welling, Ylonen, open systems that can support the delivery of
Varoutas, Sphicopoulos, Elnegaard, et al.,2001; tourism-related location-dependent content to an
Yan 2004) and deployment of 3G networks led end-user on top of the technological and business
mobile network operators4 (MNOs) into a global background already described, allowing:
recession era and a global pessimism for their
adoption which actually reflects user attitudes • Seamless interoperability of systems and
towards the new standard. In order to confront content provided by several market stake-
that, business opportunities based on existing holders towards providing a large range of
mobile and wireless networks have been further high-quality location-based content deliv-
investigated (Katsianis et al., 2001; Varoutas, ery services, through standards and loosely
Katsianis, Sphicopoulos, Loizillon, Kalhagen, coupled elements
& Stordahl, et al., 2003). The provision of value
added services over 2.5/3G networks not only al-

260
Design and Implementation Approaches for Location-Based, Tourism-Related Services

• Exploitation of state-of-the-art and future derived from exactly the same content that would
technology in positioning, mobile devices, drive traditional applications (Web sites, printed
and network infrastructures elements, etc.). The driving force behind these
• Compliance with requirements and stan- is modern software platforms and system archi-
dards for personalisation and quality of tectures that facilitate the creation of the various
service (QoS) nodes of a complex structure of collaborating
• Low-cost implementation and upgrade service elements.
roadmap from 2/2.5G to 3G and other
current and future mobile and wireless Wireless and Mobile Data Services
networks
• Guarantees of privacy Since the last decade of the 20th century and the
beginning of the 21st, the mobile user has come
As already mentioned, provision of tourism- to enjoy the provision of many technologies and
related content can be shown that covers a large services that were hard to even imagine several
portion of the information that is usually delivered years before (Lin & Chlamtac, 2001). Besides
through location-based services. A number of stud- voice, some of the most common ones are:
ies already exist that focus on various aspects of
technologies, architectures and business models • Information services (News, Directories,
of this area (Devine & Holmquist, 2001; EMILY, Weather, Athletics, Financial, etc.)
2002; M-Guide, 2002). This chapter presents the • Entertainment (Chat & Flirt Services,
design aspects of such services in a generic way, Guess who, Alerts, Horoscope, ringtones,
capturing the needs of many location-dependent etc.)
services since it assumes a highly heterogeneous • Communication tools (SMS, MMS, e-
network infrastructure leveraged by the Internet mail, instant messaging, etc.)
protocol (IP) layer. In this way, dealing with
the details of mobile or other wireless network Apart from these common services, a series
infrastructures is avoided yet interoperability of other, more complex ones are being offered to
and integration issues are been identified and the user like navigation, local news, SMS vote,
investigated. microbilling, and so forth. Enabling these services
is achieved through various means the most im-
portant of which being the Web, SMS, and MMS.
TECHNOLOGY OVERVIEW These higher-level information exchange media
are based on lower level communication channels
In the following sections the technologies involved offered by an infrastructure provider. The most
in the provision of mobile tourism services are important ones are briefly described below:
introduced. Connectivity, which essentially allows
delivering data to a device, and positioning, which • Global system for mobile telecommu-
is the ability to locate a device and consequently nications (GSM) refers to 2nd generation
its user in space, are the fundamental enabling mobile telephony networks (Mouly &
technologies for the provision of location-based Pautet, 1995), which although digital, was
services. Assuming these, tourism related infor- designed with voice communications in
mation could be delivered to devices capable mind, thus giving data rates of 9.6kbis/s,
of presenting it (e.g., mobile phones), with a which is rather slow for multimedia ap-
multitude of options (quality, depth, size, etc.), plications. Additionally data transfer is

261
Design and Implementation Approaches for Location-Based, Tourism-Related Services

not packet switched thus not optimised for GSM circuit switched data, which supports
computer type communications requiring the allocation of one user per channel per
circuits to be allocated even if no data are time slot and allows multiple channels to
exchanged. Since its initial appearance, be virtually merged thus offering higher
several enhancements where proposed as data rates (Halonen et al., 2003; Korhonen
side-by-side technologies that enable high- et al., 2001). However the allocation of
er performance data transfers (Korhonen, multiple channels raises the connection
Aalto, Gurtov, & Lamanen, 2001). cost of the end-user, rendering the service
• General packet radio services (GPRS) is rather inappropriate when compared to
a wireless communication protocol based other modern techniques.
on the same modulation as GSM, designed • Universal mobile telecommunications
to be provided as a complementary medium system (UMTS) utilises WCDMA (wide-
to facilitate data transfers over GSM net- band CDMA) over a 5MHz bandwidth
works. It is packet-based and delivers data thus allows speeds the increase of mobile
rates of approximately 40kbps5 (Korhonen network speed in order to allow high-
et al., 2001; Pahlavan & Krishnamurthy, speed transfers. UMTS is one of the five
2002; Patil, 2003; Tisal 2001). It supports types of 3G radio interfaces specified in
continuous connection to the Internet for the ITU6’s IMT-2000 recommendation. It
mobile equipment users. Since GPRS ra- allows various classes of service, ranging
dio resources are utilised only when devic- from more than 100kbps for a fast mov-
es have data to exchange, its end-user cost ing user up to a 2Mbps for a fixed client
is lower in both terms of money and power “lab” speed (3GPP, 2002; Lin & Chlamtac,
consumption. Packet switching allows 2001; UMTS, n.d.).
more users to be simultaneously connected • High-speed downlink packet access
to the Internet, yet performance drops on (HSDPA) is deployed as an upgrade to
high load and no strict guarantees can be UMTS networks and captures the obser-
given. vation that most end-user high-bandwidth
• Enhanced data rates for global evolu- demanding applications require one-way
tion (EDGE) facilitates high-speed mobile high-speed communications, downstream
data transfer over which can reach a peak (i.e., towards the end-user). On-demand
rate of 384kbps and is aimed to mobile net- video, TV and data downloading are some
work operators that might not be able to applications that expose such a transfer
obtain UMTS (further information is pro- pattern thus can benefit quite significantly
vided below) spectrum but would not like from the speed offered by HSDPA which is
to be left out the modern high speed data up to 3.5 times faster compared to the max-
services (Halonen, Romero, & Melero, imum rate of 14Mbps of today’s 3G UMTS
2003; Rysavi 2005). Even higher speeds (Holma & Toskala, 2004; Kaaranen, 2005;
may be available in good radio conditions. Rysavy, 2005).
EDGE provides the same benefits of GPRS • Wireless fidelity (WiFi) is a term that
(e.g., packet switching, always connected) in general refers to the 802.11 family of
however by using a different modulation wireless network protocols (Muller, 2003;
schematic achieves much higher speeds. Smith, 2003). A variety of protocols (e.g.,
• High speed circuit switched data 802.11b, 802.11g) that operate in 2.4GHz
(HSCSD) overcomes the limitation of and 5GHz bands, being the most popular

262
Design and Implementation Approaches for Location-Based, Tourism-Related Services

ones, belong to this family, offering nomi- or a building, such as a museum or a conference
nal speeds of up to 108MBps (802.11. center). See Table 1.
Super-g). WiFi networks are fixed, local- One of the main aspects of location-based
area, wireless networks thus do not of- services is positioning, that is, the estimation of
fer the mobility capabilities provided by the user position, a topic to be covered in detailed
mobile networks. Additionally, although sub-sequent sections. Depending on the service
they can operate without an infrastructure class, the mobile equipment and the infrastructure,
(e.g., on a computer-to-computer way), different approaches and accuracy levels can be
yet another (usually fixed) connection to applied (Dao, Rizos, & Wang, 2002; Northstream,
the Internet is required in order to obtain 2001). In this chapter, the focus is on global scale
worldwide access. services, but the elements and concepts presented
• Bluetooth (IEEE 802.15) is a short-range, are not restricted to this particular class of ser-
relatively low performance communica- vices (Kakaletris et al., 2004). Most location-based
tions protocol. It is designed so that it al- services can be categorised into four main busi-
lows low power consumption and it is very ness applications:
simple to implement so that it can be easily
adopted by “dummy” devices such as head- • Tracking services (with two sub catego-
phones, computer mice, keyboards,and ries): emergency services and fleet man-
so forth. (Bluetooth.org, 2001; Miller & agement services. In the case of emergency
Bisdikian, 2002; Morrow, 2002; Muller, services (such as the E-911), the network
2001; Xiao & Pan, 2005). has the ability to locate persons who are
in danger or missing (with special inter-
Location-Based Services ests for kids) and give them the necessary
protection (Reed, Krizman, Woerner, &
Location-based services are an entire class of Rappaport, 1998). Emergency services are
mobile services that utilise positional (mostly geo- first priority for USA (FCC, 2001) and US
spatial) information of the mobile user in order to companies focus on these class services.
provide intuitive, easy access to content and tools. Road assistance and tracking of stolen
The term covers a quite large group of services equipment (cars, boats, etc.) are other sim-
since it can range from global scale services to ilar services (Fritsch & Scherner, 2005).
highly spatially constrained ones (within a room Fleet management services cover scenar-

Table 1. Typical wireless and mobile data networks

Technology Typical Performance7 End-user relative cost Scope Compatibility


GSM 9.6Kbps High Global Very high
GPRS 40Kbps Medium Global High
HSCSD 64Kbps Very High Global Low
EDGE 115Kbps Medium Global Low
UMTS 220Kbps Medium Global Medium (in deployment)
HSDPA 750kbps Medium Global Low
WiFi 11Mbps 8
Low Local/indoors High
Bluetooth 500Kbps Low Indoors High

263
Design and Implementation Approaches for Location-Based, Tourism-Related Services

ios such as radio-taxi coordination, trans- Positioning


portations, delivery, and so on, and in the
general case, require high accuracy (Feng, Location-based tourist information requires posi-
Zhu, Mukai, & Watanabe, 2005; Iwasaki, tioning of the mobile user with a variable accuracy
Kawaguchi, & Inagaki, 2005). (Kakaletris et al., 2004; Yilin, 2000). Yet this does
• Information services: In this category of not imply that automated acquisition of positional
services content relative to the location of information is always required or desired in order
the user is provided to him/her. However in to consume such a service, as when checking
global scale services the focus is not on the available information by forehand, not being
accuracy of user’s position acquisition but physically present in some area. It is obvious that
rather on the content and the way it is pre- locating the user requires that one is equipped
sented (Taylor & Ryan, 1995). Local news, with a module that has some kind of connection
cultural information, events highlighting, to a fixed infrastructure (Figure 1). However the
or even advertising are some of the appli- elements of this composition might vary:
cations of this category. Such services may
be provided near sightseeing or within mu- • Equipment can be a PDA, a 2G or 3G mo-
seums (Zimmermann, Lorenz, & Specht, bile phone, a personal computer, an ID tag,
2003). In high granularity services (e.g., an IR Scanner, a GPS receiver, and so on.
within museums) positioning has to be • Connection to the infrastructure can
accurate and in many cases highly sophis- be unidirectional or bidirectional utiliz-
ticated since it might even need 3-dimen- ing systems such as WiFi, GSM, GPRS,
sional location of the user and directional UMTS, satellite antenna/receiver, IR re-
clues (Pateli, Giaglis, & Spinellis, 2005). ceiver/transmitter, and so on.
• Fun and entertainment: Player position- • Infrastructure can be mobile or fixed net-
aware games are a new opportunity for work such as a satellite system, a mobile
service providers and the first flavors are telephony/data network (GSM, GPRS,
already out. Despite the criticism, chat and UMTS, etc.), a set of WiFi or Bluetooth
flirt is another very popular type of ser- access points, installed RF tags, fixed IR
vice. In this area, location-aware systems scanners, and so on.
will have the opportunity to refine partner
matches within a certain distance (Gratsias, Positioning can be categorised under two
Frentzos, Delis, & Theodoridis, 2005; main classes:
Karagiozidis, Markoulidakis, Velentzas,
& Kauranne, 2003; Lee, Prabhu, & Park, • Active: The client is the only one respon-
2002). sible for the computational load of the po-
• Billing: Billing also can adopt location- sition estimation.
aware schemes. Creating attractive options • Passive: The client (user/mobile equip-
such as allowing users to exercise cheaper ment) is being located without adding any
communications when in certain hot spots logic to positioning by itself. Infrastructure
(such as home, etc.) is a possible scenar- obtains simple to extract information from
io of location-based billing (Gratsias et client and calculates the position.
al., 2005; Koutsopoulou, Panagiotakis, &
Alonistioti, 2005). Hybrid methods also are very popular, allow-
ing for high accuracy and availability systems.

264
Design and Implementation Approaches for Location-Based, Tourism-Related Services

Figure 1. GPS operation

Exploitation of the positional information also The GPS


can vary quite significantly:
GPS positioning (ETSI, 2006; GARMIN, n.d.) is
• In self-contained systems, such as GPS based on a network of 24 earth orbiting satellites.
enabled PDAs, the content might be al- It was originally designed and implemented to
ready present on the mobile equipment cover the needs of the US military forces, however
and a local piece of software acts on them, since the 1980’s it has been used for wide spectrum
thus no further connection to the outside of applications ranging from civil engineering
world is required. The range of location- to recreational systems. Communication is one-
based services to be supported in this case way; consequently clients only receive signals,
is limited, navigation being the most popu- thus guaranteeing privacy. A GPS client/device
lar one. receives signals from several satellites with at
• In always-connected systems a medium least three satellites needed for 2-dimensional
for exchanging information with the in- coordinate estimation (latitude, longitude) while
frastructure and usually the world (the four are required for three-dimensional position-
Internet) is required. This can be done in ing (latitude, longitude, and altitude). Signals
order to acquire the position, or exchange are time-tagged, enabling the client to calculate
information that will allow position esti- distance from each satellite using the send/receive
mation, or access the core service/content. time difference. Accurate position estimation
requires combining the aforementioned distances
The rest of this section presents the details of for multiple satellites. Bookkeeping of coordinates
some of the most important positioning methods, is a way to calculate the speed and direction of
their requirements, and their characteristics. There moving GPS devices, a facility provided by almost
are a number of ways for obtaining user’s position all modern receivers. In such systems end-user
and the following sections describe some of the devices are quite small and due to low power
current applied automated methods: consumption tend to be highly autonomous.

265
Design and Implementation Approaches for Location-Based, Tourism-Related Services

Civilian GPS satellite signal is low power network, it is raising serious privacy and security
(i.e., less than 50 watts in 1575.42 MHz). It pen- concerns, which can be overlooked for emergency
etrates glass and plastic, but cannot go through purposes but not without risk of misuse.
metal, ground or concrete, effectively preventing GSM positioning is always available in some
indoors GPS usage. Dense urban use also can be form, as long as network coverage is provided.
problematic in some cases. Additionally, the GPS However depending on the network infrastruc-
signal is vulnerable to signal travel time errors ture and method utilised its accuracy might vary
that lead to false distance estimation. Reflection quite significantly ranging from 100m to more
of signal on solid objects as well as other orbital/ than 500m (even several kilometers) (Caffery &
stratospheric phenomena can result to wrong time Stuber, 1998b; Drane, Macnaughtan, & Scott,
estimations. Utilizing more satellites is a way to 1998). Although due to this low level of accuracy
deal with such problems. GSM positioning is of little use for high accuracy
One of the strongest points of GPS however is demanding application, it is accompanied by a
accuracy, which can range from 1 to 15 meters for bidirectional communication channel (voice or
civilian systems. This fact, accompanied by the data) thus enabling interactive applications. Its
simplicity of the relevant devices and the avail- network-side activation, whenever applicable,
ability of the service which can be utilised at no makes it ideal for some special emergency and
cost (apart from equipment) makes it capable of tracking cases.
driving major successful commercial applications,
such as navigation for tourists. As already men- • Cell ID is a location technology that
tioned, GPS is controlled by the U.S. Department utilises the well-known location of fixed
of Defence, nevertheless it is expected that the network elements, that is, the base station
European Galileo system will be competing with transceivers (BTS), to identify the mobile
it by 2008 (Di Fazio, Mocci, Rossini, D’Angelo, equipment location (Figure 2). It can be
Lorelli, & Jarosh, 2004; El-Rabbany, 2002; Prasad easily combined with timing advance (TA)
& Ruggieri, 2005). GPS has to be accompanied in GSM networks and round trip time9
by a supplementary network in order to drive (RTT) information in WCDMA networks
interactive on line systems where satellite cover- in order to improve accuracy. TA is a tech-
age is unavailable. Pseudo-GPS systems emulate nique that utilises the timing advance in-
the existence of satellites for indoors use without formation applied by the GSM network to
requiring any additional equipment, yet they are determine the approximate distance a MS
not widely adopted solutions (Schmid, Neubauer, is from a base station.
Ehm, Weigel, Lemke, Heinrichs, 2005). • Enhanced-observed time difference
(E-OTD) is a more complex method for
GSM-Positioning calculating device position (Caffery &
Stuber 1998a, 1998b). It requires loca-
GSM positioning (ETSI, 2006; Mao & Douligeris, tion measurement units (LMUs), a type
2000; Spirito, 2001; Zhao, 2002) is a facility of device used to provide precise timing
potentially provided by GSM mobile network information for asynchronous networks.
operators. Its operation is based on the fact that Although this method can increase accu-
there is always some type of raw information on the racy, it also increases infrastructure cost
location of a certain GSM mobile device in order and it still suffers from issues with network
for the network to be able to deliver information density in rural areas. E-OTD and time of
to the user. Since it can be originated from the arrival (TOA) methods are very similar to

266
Design and Implementation Approaches for Location-Based, Tourism-Related Services

Figure 2. Cell ID/Cell ID + TA Figure 3. E-OTD operation

constrained areas (e.g., building blocks, etc.).10 It is


the primary candidate technology for big buildings
GPS positioning already described. The like airports, museums, market places and other
location of mobile devices is calculated us- sites where radio interference is not an issue and
ing the signaling time from two or more it should generally be accurate enough to guide a
stations (see Figure 3). visitor through the coverage area. Positioning is
very similar to GPS and GSM positioning, thus it
Typically GSM-positioning would not always uses the time difference between the signals from
be enabled, since this requires significant re- some known positions. This information can be
sources from the operator’s side. In the typical corrected and improved with site calibration, a
usage scenario, the user has to ask for locating method which also can be utilised in some GSM
him/her, either through an SMS or through a re- positioning cases but not without significant dif-
quest over the standard Internet protocol (IP). Due ficulties. In the WLAN positioning case, informa-
to its privacy risk, GSM positioning is usually tion about the physical structure of the target area
surrounded by rules of activation and bureau- (walls, obstacles, etc.) is of crucial information
cratic procedures/agreements, which can be even in order to obtain reliable results.
harder to overcome in cross-MNO cases unless According to systems’ vendors, WLAN posi-
a standard user-driven mechanism is utilised. tioning can be quite accurate (one meter indoors
Unfortunately despite the existence of specifica- according to Ekahau, 2006) yet site calibration is
tions, in mid 2001 decade there is still no standard definitely required for such accuracy. Its cost is
for GSM positioning that is widely adopted by quite reasonable since infrastructure and equip-
operators. A final restriction on its use is its cost, ment are quite cheap and it is well suited for in-
which can vary quite vastly depending on the teractive on-line applications since data network
level of consumption of the service. coverage is implied.

WLAN Other Methods

Wireless local area network (WLAN) position- Despite its peculiarity considered in the context
ing (Wang, Jia, Lee, & Li, 2003) is a local type of the e-services described above, observation ac-
of positioning usable within range of WLAN hot companied by a traditional map (i.e., paper-print-
spots, which is mostly restricted indoors or in quite ed) also could be a useful method of information

267
Design and Implementation Approaches for Location-Based, Tourism-Related Services

dissemination for tourists even in the electronic sitioning (i.e., by blending of different location
age. This does not suffer from service unavail- technologies) utilizing GSM and WLAN meth-
ability, privacy, mobility, and autonomy issues, ods could be optimal. In the rest of this chapter
and its price is usually low. Using an electronic “hybrid” will refer to combining GPS with other
map instead of a printed one would add in detail location techniques.
and in some cases reduce in size while posing There are several whitepapers, guidelines,
some restrictions on autonomy, however if not studies, and surveys showing the details of each
accompanied by a positioning system it would positioning approach (Caffery & Stuber, 1998b;
still require observation and manual operation in Hightower & Borriello, 2001), however a higher
order to consolidate other information. level evaluation of the ones applicable to tourism
Positioning through mapping addresses to as it appeals to mobile network operators will be
geographical coordinates is very similar to using introduced below. Other types of systems like
maps since it requires observation and manual personnel/patient tracking in hospitals (Dao et
utilisation of information. However the location is al., 2002; Douglas, 2004), information delivery
obtained through lookups in appropriate databases, in museums (Sparacino, 2002) or emergency
such as geographical information systems11 (GIS). related ones (FCC, 2001; GSM, 2002) generally
It is quite usable in urban areas and roadsides require more sophisticated means for delivering
of rural areas and it can be both electronic and their services. See Table 2.
traditional (maps with indices). When designing an open infrastructure then,
There are several other methods, like local sen- positioning should not have any strong relation
sor networks, ID tags based on infrared or radio to the implementation of services for the mobile
frequencies (IR or RF), gyroscopes, and statistical traveler. Realisations of different services should
models (Hightower & Borriello 2001; Nellen, leave methods of getting positioning information
Bronnimann, Held, & Sennhauser, 2004; Spratt open, allowing the end-user to choose among
2003). Out of these one could expect low-cost them: GPS, GSM and map outdoors, WLAN and
gyroscopes to be the most attractive mechanisms GSM inside. Services should be prepared to re-
for mobile travelers in the future (whenever it ceive coordinates as bunches of numbers in the
becomes possible to build lightweight, low-cost three-dimentional + direction space accompanied
gyroscopes) since they could drive the most by definition of coordinate-standard and accu-
fascinating classes of applications. For example racy.
taking into account the direction of sight of the Such an approach would leave all possibilities
user could even further enhance the simplicity of open for the user. Also this method would keep
interaction with a service, especially in complex service providers’ monetary costs down, because
(with respect to location and quantity of content) service provider can leave all costs of positioning
environments such as museums or exhibitions. to be decided by the traveler.
Further issues and opportunities can be found in When building services for the mobile traveler,
the section on “Mobile Tourism Services.” designers should ideally therefore forget the actual
positioning method and provide a service in a way
Summary that it would be able to cope with any degree of
accuracy or availability of positioning information.
Each method presented has advantages and Obtaining the position would then be a task to be
shortcomings. It seems that GPS is currently the carried out by external parties such as equipment
primary means of reliable positioning when global manufacturers or infrastructure providers, both of
positioning comes into play, however hybrid po-

268
Design and Implementation Approaches for Location-Based, Tourism-Related Services

Table 2. LBS summary of performance, implementation, and cost trends (Sources: EMILY, 2002; High-
tower & Borriello, 2001; Moureu, 2000; SnapTrack, 2001)

Metric CELL-ID + TA E-OTD GPS A-GPS Hybrids


100m 100m 10m 10m 1m
Accuracy
10km 500m undefined 100m 100m
Kick-off time Low Low High Very high Very high
Dimensions 2 2 3 2/3 2/3 (+direction)
Mobile Equipment Require-
None Low High High High
ments
Applicability Good Poor High Medium Good
Infrastructure Cost None High None None-to-high High
Urban / Dense
Best area applicability Dense urban Suburban Suburban, urban Outdoors, Indoors
urban
Coverage High High Partial High High
Overall Quality Poor Medium High High Excellent
Compatibility Medium Poor Full Medium Poor

them working on low details of the software that mobile location center (GMLC) systems emerged
manages the “networking” hardware. in order to label the entities of the infrastructure
that where used in order to extract positioning
Positioning Standards information and estimate end-user locations. The
mobile location protocol (MLP) is an application
An attempt to standardise positioning has been protocol developed by the location interoperabil-
carried out during the past few years. Solutions ity forum (LIF) that specifies the messages to be
and best practices have been proposed by vendors, exchanged over standard Internet technologies
associations and individual researchers (Adams, in order to obtain coordinates from a positioning
Ashwell, & Baxter, 2003; GSM, 2002; Rao & center. It is focusing on network based position-
Minakakis, 2003). As a consequence the terms ing and has its origin in proprietary commercial
mobile positioning center (MPC) and gateway protocols. OpenLS (OpenGIS location services

Figure 4. Indicative LBS technologies quality per area (accuracy, availability, calculation time)12

269
Design and Implementation Approaches for Location-Based, Tourism-Related Services

[OGS]) defines the interfaces to an open location- Delivering content of location-based services
based services platform, such as the GeoMobility to the end-user is not as simple as it seems at first
server. Utilizing XML for location services (XSL) sight. There is a multitude of devices that content
defines the data exchanged in the interactions can be delivered to and a number of media through
with the geographic information system (GIS) for which it can be transferred, all of them presenting
various classes of services that capture facilities different capabilities and shortcomings. Typical
such as routing (navigation), position extraction, examples are SMS, MMS, and WAP:
lookup (directory), map drawing (presentation),
and so on. • SMS market penetration is significantly
Popular, yet proprietary protocols such mobile higher than the penetration of WAP. The
positioning protocol (MPP) (Ericsson) also can majority of the location-based systems im-
be an option since many existing systems sup- plemented by MNOs, use the SMS for ex-
port these. changing data. However presentation and
interaction facilities are severely limited
Content Technologies over this transport medium; furthermore
SMS can be extremely expensive per byte
One of the major aspects of mobile tourism ser- of data exchanged
vices is the delivery of content to the end-user. • MMS significantly increases the quality
This in the general case involves identifying the of presentation and reduces the price per
content to be displayed, (optionally) moving it byte of exchanged data. Yet its penetration
to the device14 and presenting it. In the following is significantly lower than SMS, it suffers
paragraphs focus is given on the case that content the same interaction restrictions and finally
is passed to the device upon request, that is, does its cost can be significant over SMS for
not reside pre-installed on the device. simple text information exchange

Figure 5. LBS technologies services and accuracy13

270
Design and Implementation Approaches for Location-Based, Tourism-Related Services

• WAP adds to presentation and interactivity can support. Today, four categories of terminals
quite significantly. However its penetration can be found in market:
is quite limited and it is not expected to be
widely adopted unless significant conver- • Standard GSM mobile terminals
gence to Web technologies is achieved • GPRS/UMTS phones: There are two cat-
egories of GPRS/UMTS phones. The WAP
Research on service types show that the SMS/ based and the HTML based ones. The for-
MMS solution might be more appropriate, while mer category offers facilities for consum-
WAP/Web is the only way for others (Bennett, ing WAP information sources (WAP sites)
2001; Heijden & Taylor 2000). Taking into ac- while the latter one exchanges information
count that current end-user mobile terminals have in Web standard formats such as HTML,
limited displays and end-users are not accustomed JPEG, etc. The require a Web server, which
to consuming interactive services through the will host the mapping applications and will
mobile phone interface, SMS/MMS may be a be responsible for routing requests to ex-
commercially viable solution for a wide range of ternal systems.
location-based services. Special sub-sections of • PDAs/Smartphones: Smartphones are
tourism related services such as directory lookups latest generation mobile phones equipped
can be ideally served through simple SMS ex- with enhanced displays, more powerful
changes. processors and capabilities of carrying out
Positioning also is influenced by device ca- tasks that were formerly responsibility of
pabilities. The majority of mobile phones are not computer systems. PDAs are devices in
equipped with GPS devices, which could have a some extent similar to the smartphones,
significant impact on their autonomy, size, and stressing on presentation and application
weight since the cost of a GPS receiver is insignifi- capabilities rather than communications.
cant compared to the cost of the rest of the device. Latest generation devices are usually
Even PDAs or laptops do not usually come out of equipped with an HTML browser able to
the box accompanied by GPS receivers, making interface with standard Web sites. PDAs
it a requirement to purchase one separately. The offer built in or expandability options for
primary option is a Bluetooth GPS device that GPS, GSM, WiFi, and so on.
can be easily connected to portable computing • Laptops with communication facilities:
devices such as latest mobile phones, PDAs, or These offer all the advanced options of
laptops; however despite the small size of these computers in terms of computational and
devices this scenario requires that the user carry storage capacity, yet they have limited mo-
one extra component. bility features.

Mobile Terminals That Support LBS Multi-Channel Content Delivery

The development and full deployment of high Multi-channel content delivery is a very important
quality location-based services for tourists as- aspect of the content provision system. Infrastruc-
sumes the existence of terminals with extended ture and content should be suitably constructed so
capabilities in battery capacity, processing power, that they allow delivery of information through
main memory, application capabilities, size, and various media and means, be it SMS message
weight. Such devices may be categorised accord- exchanges, GPRS, or fixed networks connections,
ing to an incremental rating of the applications they computers/PDAs/phones, and so on. Nowadays

271
Design and Implementation Approaches for Location-Based, Tourism-Related Services

there is a whole suite of technologies and a large is a declaration that describes how to display a
number of systems that enable multi-channel document and XSL is a language for expressing
content delivery, under the author once—publish stylesheets for XML documents. By keeping
many approach, where the same source content content and its presentation information separate
is used for several targets ranging from printed in the XML document and the XSL stylesheet
media to mobile devices. respectively, display of the same original docu-
Nowadays the most widely applied solutions ment on different media and preferences sets is
for this approach are based on a set of standards and achieved by simply modifying the stylesheet.15
tools build around XML, a simple text language However modifying presentation might not be
developed by the World Wide Web Consortium enough in several cases; transformation of the
(W3C) that has its roots on SGML. XML can be source document might be required and this is
used with a wide range of applications, such as au- where XSLT comes into play. Although XSLT
thoring tools, content display engines, translation transformation was originally intended to perform
tools, and database applications (Anderson 2000; complex styling operations, like the generation
Bosak 1998; Bosak & Bray 1999). However, its of tables of contents and indexes, currently it is
major use in the context of mobile tourism services often used as a complete general-purpose XML
is for structuring the tourism related content in a transformation language. In that manner it is the
form that it becomes meaningful for a series of standard means for transforming the source XML
software elements and operations. More precisely, document (e.g., tourism content) into presentable
the various parts of a document are labeled in a forms such as HTML or WML16 pages, or other
special manner (tagged) so that can be extracted XML schemas or even into other more complex
by various applications to fulfill their own needs formats.
(Anderson, 2000). A relevant technology is the Apart the content structure, a number of fea-
well-known HTML which uses a somewhat similar tures are exploitable in the context of establishing
tagging mechanism yet it neither enforces structure advanced, reliable multi-channel content delivery
nor allows for application specific labeling that services, such as mobile tourism related ones:
would facilitate a customised use of the document.
HTML uses a fixed set of tags and describes only • Security, in order to authenticate users and
formatting instructions for the Web. On the other authorise access to sections of the system
hand XML is completely extensible since it does and potentially charge usage, required in
not have predefined tags, unless it explicitly con- order to safeguard mobile user’s privacy
forms to a schema. Various schemas exist in order and content/service providers resources
to capture industrial and domain specific needs • Session management, in order to drive
and as long as two systems comprehend the same stateful applications driven by a series of
schema they can use XML documents to exchange user actions, a feature required for almost
information. News Industry Text Format (NITF) every type of modern interactive Web
and News Markup Language (NewsML) are two application
examples of markup languages that could be of • Automated device/channel detection, in
some use to tourism related services, since they order to enable adaptation to device ca-
enable the sharing of syndicated news articles pabilities, with minimal user intervention
from alternative news providers. that drive better application experience
In addition to XML, the combination of XSL/ • Client and/or server form processing
XSLT is one of the fundamental parts of the and validation that allows meaningful in-
described content delivery chain. A stylesheet

272
Design and Implementation Approaches for Location-Based, Tourism-Related Services

teraction with the system while minimiz- to the multi-channel content delivery process.
ing message exchanges One can easily locate a series of software tools
• Content rendering that adapts to the de- that transform rich content into tailored formats
vice capabilities and user preferences mainly for adopting Web content to mobile devices
(personalisation) required for delivering or vice versa, or systems that offer some degree
content to a multitude of heterogeneous of support for multi-channel content delivery.
devices These systems mostly belong to one of the fol-
• Support for off-line operation of devic- lowing categories:
es, able to drive different application and
connectivity scenarios, especially useful in • Content management systems (CMS)
several forms of application in the mobile • Authoring and publishing tools
tourism area17 • Wireless application gateways (WAG)

Unfortunately, it can be shown that with today’s Platforms such as enterprise portals and e-
applied technologies, depending on the device/ commerce suites also might be offering some
channel, almost all requested features have to be degree of support for multi-channel content de-
achieved by a different set of technologies and livery, however they are considered of rather little
interactions each time. Approaching this problem use in the case of rich mobile tourism services by
through the Model-View-Controller design pattern themselves, since they are capturing this aspect
(Gamma, 1995) offers a potential design solution under the perspective of their specific application
to the challenge, by breaking an application into domain. Such facilities also can be offered by ap-
modules so the logic can be dealt separately from plication servers, however at a much lower level.
the interface. This allows developers to maintain Currently there is no product that provides a full
a single code base for all channels and face each solution that can take raw content and transform it
device/channel as a different view for a given automatically to multiple formats for any request-
transaction. The Model-View-Controller pattern ing device taking into account all the aspects of
can have as many views as desired and isolates the multi-channel content delivery referenced in the
request handler (aka the Controller) and enterprise previous section.
system (the Model) from any knowledge of the
view. Achieving the desired features (e.g., security, Content Types
session management, form validation, and content
rendering across all devices and channels) is case/ There are various types of content that can be
device/channel specific. delivered and displayed by end-user devices. Text
In the proposed multi-channel approach con- is the easiest to handle, however it is relatively
sidering the Web as the primary channel allows less attractive compared to the others due to its
an all-in-one solution for already established Web nature, which only uses vision through the chan-
content provision services. The value of a multi- nel reading. On the opposite side, rich multimedia
channel content delivery enabling platform is content such as Flash18 (Against the Clock [Firm],
that it provides to the hosting enterprise a single 2002; Kozak, 2002) is highly attractive, however
interaction model for all external systems or users its current support by mobile devices is not global,
and leverages security, session management, and yet it is expected to increase in the near future,
content rendering. as convergence of technologies dictates. Direct
Commercial, freeware and open source soft- execution of “binary” code on the devices through
ware markets offer a wide range of tools that relate industry standards also can be referenced, however

273
Design and Implementation Approaches for Location-Based, Tourism-Related Services

support is relatively limited and minimal or no of custom services will most probably demand
cross device compatibility is provided. explicit development of custom software elements.
In scenarios that several market players are
The Semantic Web involved for the provision of a rich service, such
as the case of mobile tourism ones, the traditional
The Semantic Web (W3C, 2001) comes into the component oriented architecture, which modular-
scene in order to cover, among others, many of the ises software in order to be reused among various
issues met in the multi-channel content delivery cases, needs to be enriched by concepts that allow
area. It is led by W3C and provides a whole frame- independence among the various stakeholders.
work so that data obtain a well defined meaning The concept of service-oriented architecture
and can be reused under different cases potentially (SOA) captures architectural style that aims to
leading to quite different end-user experiences and achieve loose coupling among interacting software
content compositions. The key-enabling concept is entities (Erl, 2004, 2005; Newcomer & Lomow,
the resource description framework that is heavily 2005). In this context a service is a unit of logic
dependent on XML contracts. The Semantic Web and data provided by the service provider that is
can be considered as the next step of the current assigned to provide the appropriate results for
World Wide Web (Berners-Lee & Hendler, 2001; its consumer. In this framework the provider and
Berners-Lee et al., 2001). consumer are both software agents that act behalf
of their owners. As a concept SOA is quite abstract
Software in Control and can be utilised both for B2B and B2C interac-
tions. Typically lots of protocols and operations
All of the above mentioned technologies are pro- fall under its meaning, however in mid-2001, more
vided through software elements which might be concrete forms of service oriented paradigms are
either proprietary, bound to a specific device and/ being applied, accompanied by relevant protocols
or case or generic enough to fit several scenarios and enabling platforms.
or even totally irrelevant services. Commercial or The model of publisher/subscribers, assisted
open source platforms do exist that can support by a directory service over a transport consisting
almost all of the stages of mobile tourist content mostly of XML and Internet based protocols is
delivery, with little or even no need for integra- the current roadmap for applying the SOA:
tion development.20 Nevertheless, provisioning

Table 3. LBS content types (Source: M-Guide, 2002)

Power
Content class Compatibility Features Size Sample technologies
requirements
Text Very high Poor Very Low Very low HTML, Plain text
Audio High Medium Low Medium MP3, MIDI
Image High Medium Low Medium JPEG, WBMP, GIF
Video Medium High High Very high MPEG, 3GP19 etc
Integrated interac-
Poor Very high Medium Very high Macromedia Flash
tive multimedia
Applets / ActiveX Compo-
Executable entities Very poor Very high Medium Very high
nents

274
Design and Implementation Approaches for Location-Based, Tourism-Related Services

• Service publishers (might) announce their However, due to its relatively recent appearance,
characteristics in a directory service (e.g., development on the device has somewhat limited
UDDI, 2004). options. For the time being, the most common
• Service consumers can either look in the platforms for mobile device development are the
directory service and locate the service two described bellow and come from commercial
that meets their requirements and obtain a software vendors, however they are both freely
reference to the access point of the service available to the development communities:
or they directly obtain this through other
media. • The micro edition of the Java 2 platform
• Consequently consumers use the access is the port of Java to portable devices that
point reference to directly communicate have minimal capabilities of processing
with the server. and storage. It is a runtime environment
• SOAP over XML over HTTP (i.e., TCP/ that provides on one hand a subset of the
IP) are the most common protocols for facilities offered by the typical desktop
consumer/provider interaction, yet mes- java engines, yet it provides microdevice-
sage details are service dependent. specific capabilities access, potentially as-
sisted by vendor specific libraries. J2ME is
Recent extensions of the initial concept of Web very common in latest 2G and 3G mobile
services for the SOA approach add state manage- phones as well as non PocketPC PDAs
ment, thus allowing a standardised way to access (SUN).
resources hosted by suppliers (Czajkowski, Foster, • The Microsoft.NET compact framework
Ferguson, Frey, Graham, & Snelling, 2004; Foster, is a subset of the Microsoft.NET frame-
Frey, Graham, Tuecke, Czajkowski, & Ferguson, work, the Microsoft technology for desk-
2004; Huber & Huber 2002; Snell, Tidwell, & top and Web application on the Windows
Kulchenko, 2002). Typically consumers have no platform. Although meant to be platform
internal knowledge of the operation of the service independent through standardisation
provider and a particular service in question. The (ECMA22), currently.NET compact frame-
provider might revise implementation, as well as work is only available for PocketPC and
other characteristics of a service without explicitly other flavours of the Windows platform.
notifying clients.21 Due to its requirements it is not suitable
One level below the overall software archi- for current low-end devices, yet its perfor-
tecture is the software technology, which in the mance is very promising in addressed de-
case of location-based services has two facets: vices (Microsoft).
infrastructure and client. Developing on the
infrastructure side is facilitated by numerous
technologies and platforms and is of little interest MOBILE TOURISM SERVICES
from the mobility perspective. All Web service
enabling technologies, assisted by technologies “Tourism-related” content refers to any content
to deliver reach content are quite applicable for that might be of interest to the visitor of a location
the purposes of tourism location-based services. (Beatty 2002; Kakaletris et al 2004; Karagiozidis
Dozens of open-source free or fully commercial et al 2003; Kim, Kim, Gautam, & Lee, 2005). This
platforms exist in the today’s software market, content, be it static (e.g., maps, routes), of low
making it available for anyone to adapt its in- refresh rate (e.g., scheduled or periodic events),
frastructure to their own needs and capabilities. highly dynamic (e.g., traffic), cultural (e.g., muse-

275
Design and Implementation Approaches for Location-Based, Tourism-Related Services

ums, exhibitions, local historical info), informative 2001; GSM, 2002; IST-REPOSIT, 2001; LIF-IOT,
(e.g., weather, local news) or commercial (e.g. 2002), and finally, market and end-user surveys
restaurants, commercial fun parks, etc.), has an (Beatty, 2002; Dao et al., 2002; Katsianis et al.,
increased degree of interest to the “consumer”, 2001; LoVEUS, 2002; M-Guide, 2002). This
especially when he/she is in particular locations. comprehensive study led to the identification of
The term “location” is a varying-“size” descriptor the following key points that require careful con-
which might range from the actual spot where one sideration when implementing a location-aware
is standing in a room, to just a rough approxima- service that aims to provide tourist information
tion of a country. In this context of course, the content:
term “tourist” is very loosely related to the typical
“tourist” and mostly fits the definition of “mobile • QoS in quantifiable technical terms (re-
equipment user”. It is obvious that a very large sponse time, throughput, availability, scal-
number of services related to content delivery fit ability, coverage, accuracy, etc.)
this definition, such as map delivery, archaeologi- • QoS in non-quantifiable terms (qual-
cal information delivery, events announcements, ity of content e.g coverage, depth, media,
emergency, or health services, transportation multilinguality/multiculturality, etc.)
information acquisition, even in-doors museum • Integration capabilities (relevant services
exhibition presentations. These are all forms of support and integration capabilities)
this type of location-based tourism-related content • Security (authentication, authorisation,
delivery service. privacy, etc.)
When designing an infrastructure able to sup- • Service related procedures (e.g., activa-
port such services, requirements that have to be tion / de-activation, billing, pricing, per-
met come from various sources: sonalisation, stakeholders interoperability,
etc.)
• Technical issues that have to be ex- • Service specific features (e.g., notifica-
ploited and addressed in order for it to be tion, positioning triggering)
applicable • Content related issues (e.g., ontology)
• Regulatory and social/ethical restric- • Present and emerging technology capa-
tions that have to be met in order for the bilities (positioning, presentation, mobile
supplied services to be publishable networks, etc.)
• End-user expectations and require-
ments that have to be considered in order Having carefully examined alternatives and the
to achieve successful commercialisation, potential impact of decisions on the above-men-
and so on. tioned key points, several interesting conclusions
can be identified as referenced below. Although
The requirements presented here arise from a technology enthusiasts would think otherwise,
careful study of the 2/2.5G and 3G mobile net- end-users are not generally amazed by high ac-
work technical specifications, location-based ser- curacy positioning mechanisms or very high data
vices related whitepapers and applied paradigms rate demanding services. The already referenced
(Laitinen, 2001; Ludden, 2000; Northstream, user surveys show that they rather prefer low cost
2001; Searle, 2001), modern IT system architec- intuitive services that will satisfy their expecta-
ture concepts (Varoutas, Katsianis, Sphicopoulos, tions, within reasonable quality limits. In the
Cerboni, Kalhagen, Stordahl, et al., 2002; Varou- same context, widely used equipment lacks high
tas et al., 2006), regulations & standards (FCC, interactivity capabilities (i.e., means of passing

276
Design and Implementation Approaches for Location-Based, Tourism-Related Services

information to the service and presenting content doubted demand it implies in terms of infrastruc-
to end-users). Although emerging devices are ture and equipment resources is not a problem.
expected to vastly improve this particular aspect, A typical example is a highly mobile user who is
average users are not enticed by such capabilities. accessing some location-based directory of con-
High accuracy in positioning is not always a tent, related to a specific position he/she has been
real concern either. For example delivering some some time earlier. While the user may be attracted
sort of transportation information or providing a to these particular sites of interest, mobility might
local directory service, requires only a rough ap- cause the delivery of quite different content in
proximation of the position. On the other hand, subsequent requests for information, which could
presenting information in an actual exhibit, which come to be quite irritating. Thus the user should
the visitor is looking at, requires not only precise have the possibility to allow either continuous or
position as to where one is located but also infor- on-request triggered positioning. A good practice
mation about the direction of sight, and even that would be to totally avoid automatic positioning
detail of information may not be adequate. At this instead of temporarily ignoring new position
point, the conclusion is that “less accuracy” does information, however certain applications could
not render LBS useless but rather restricts the require two positional indicators (“live” position
range of services that can be provided. Therefore, and “querying” position). The issue of delay also
although almost identical content might be avail- needs to be taken into consideration when locating
able for presentation to each end-user visiting a the user. In on-demand positioning schemes, the
museum or an open archaeological site, equipment system might introduce quite significant delays
differentiation (e.g., display capabilities, channel due to high load or computational issues which
usage, positioning mechanism, and capabilities), might render some applications rather unattractive.
can drive a completely different degree of content Finally, a very important observation that
exploitation. guides the proposed approach is network con-
Another issue is that some requirements, vergence; xxML/HTTP/TCP/IP “stacks” tend to
for example accurate positioning and enhanced provide a uniform roadmap for offering a very
content, come with some extra “cost” to the end large range of services to each-and-every consumer
user. Cost is not only in financial terms, which oriented device type.
are mostly obvious, positioning accuracy (e.g.,
through a GPS module) and advanced display System Concepts
capabilities for example, might have a severe
impact on device autonomy and size. Enhanced In the rest of this chapter various concepts of an
content (video, virtual reality representations, open architecture, capable of providing rich mobile
etc.) will require more bandwidth and even more tourism services, are pointed out. Although an
powerful equipment (having indirect impact on architectural approach is introduced, the intention
autonomy and size). is not to propose a single solution that could be
When to locate a user is yet another confusing identified as the best in one way or the other for
issue. User-requested positioning requires an extra the tourist; the purpose of the analysis to follow
step to be taken by the end user, who actually has is rather to position the enabling technologies
to request to be located or explicitly state his/her and identify the issues, shortcomings and strong
location at the moment. Alternatively, the user can points of various approaches. Depending on the
be continuously located (tracked) by the system. particular application one has to provide, differ-
Continuously positioning the mobile user is not ent roadmaps can be followed. An hypothetical
usually required, even in the case when the un- structure of an LBS provision system targeting

277
Design and Implementation Approaches for Location-Based, Tourism-Related Services

tourism content delivery is shown in Figure 6. In user through connection information, spa-
this scenario, the elements are separated in three tial information and potentially the user’s
conceptually different clusters: previous behavioural data. The network
might provide various means of submit-
• End-user devices, be it mobile or fixed ting information to the user, such as MMS/
ones. For simplicity, it will be considered SMS (GSM, 2003).
that mobile ones have some sort of posi- • Service infrastructure, which relates to
tioning capabilities either on their own the systems that delivers information and
(e.g., GPS) or network assisted/provided. facilities to the user utilizing as one of its
• Network infrastructure, which is either implied inputs, the positional information
a mobile network or a fixed wireless local of the user.
area network, or a combination of both. In
case of mobile equipment that lack posi- The mobile/wireless network details such as
tioning capabilities, it is necessary that the access points, MSCs, VLRs, HLRs and so forth
network provide the ability to locate the are almost totally hidden and only interactions of

Figure 6. Example of layout of LBS system (Based on Kakaletris et al., 2004)

278
Design and Implementation Approaches for Location-Based, Tourism-Related Services

higher-level components are being referenced. Ad- (WWW) interactions are performed through a
ditionally SOA constructs which obey to specific browser then this server would be a normal Web/
patterns are omitted (Brown et al., 2003; Colan, application server. A unifying approach can be that
2004; UDDI, 2004). The following sections high- both servers exist (even in the same system) in
light important structural and operational aspects the provider’s infrastructure and the Web server
of such a system. redirects incoming calls to the same SAP that the
Webservice consuming capable clients access.
Service Interactions This point can be responsible for a series of ac-
tions, which among others include:
In the presented architecture, the main element is
the “service” (i.e., to tourists). Under the modern • Authenticating end-user even beyond typi-
understanding of realizing a service-oriented cal username/password challenges by util-
architecture, services are implemented under the ising sophisticated network provided au-
WebServices paradigm, utilizing in almost 100 thentication mechanisms
percent of the cases the SOAP/XML - HTTP - TCP/ • Orchestrating service interactions in an
IP route. Thus messages are exchanged in a very application specific scenario when an end-
standardised manner which can be found to be user delivered service requires invocation
fully supported by vendors’ major server systems of a series of services
(content management systems, relational database • Providing facilities for dynamic service
management systems, runtime environments, composition allowing the end-users, de-
etc.), and mobile/client systems as stated by the vices or services to seek and locate suitable
leading technologies of the area (provided by Mi- content and service provided that match
crosoft, SUN, IBM, etc.). Under this architecture, their needs with regards to quality, cost,
the system elements such as services are loosely availability, and so on; delegating lookups
coupled and can be even dynamically composed to registries such as UDDI is performed at
upon interaction/request by using SOA enabling this level
systems such as the UDDI registry.23 Composi- • Providing a repository for open-schema
tion can be entirely left to the servicing side, thus persistent end-user profiles that can be se-
the service access point (SAP) and its delegates, lectively partially offered to subsequent
however exploring mobile equipment capabilities request handlers. Alternatively a distrib-
allows that this task also can be performed at the uted profile schema could be used however
client side through the same standard mechanisms. this would raise interoperability issues and
would increase the risk of compromising
Service Access Point user privacy. Updating profiles could be
triggered by users and other services to the
In the proposed scheme the concept of a service extent allowed by their role in the system
access point (SAP) is introduced, which is respon- • Providing a “service-side triggered” posi-
sible for accepting customer (i.e., tourist) requests tioning acquisition point, in case end-user
for a bunch of services. This is not required to equipment is incapable of autonomously
be part of an MNO infrastructure, but in the case handling positioning which is the main
that the full Webservice approach is followed this the case when SMS/MMS channels are
element is a Web service hosting server, accessed utilised. In this case requesting services
by the client devices through typical WS address- would address this element in order to ob-
ing constructs. In case typical World Wide Web

279
Design and Implementation Approaches for Location-Based, Tourism-Related Services

tain user position upon and special security It is desirable to hide as much as possible of
concerns have to be handled this heterogeneity from the ASP point of view. A
• Providing a mechanism for content lookup potential solution would be as to separate position-
in registered content providers acting as a ing into three distinct layers as follows:
content broker, so that content that meets
specific minimal structural requirements • One layer is concerned with the actual cal-
can be delivered to end users without ad- culation of the position. This layer is quite
ditional processing nodes different in each and every implementation,
• Maintaining session state on top of state- not only due to the various alternatives of
less protocols positioning mechanisms but also due to
• Acting as a front-end for non-IP network differentiation to details of the actual algo-
connected channels, such as SMS/MMS rithms and infrastructures. Very little can
ones, to coordinate requests and replies be- be done in order to unify this layer other
tween channel gateways and content/sub- than defining the appropriate interfaces
service providers • The second layer is the SAP positioning
• Providing presentation transformation lay- layer, which has to provide the actual end-
er to adapt content to client device/channel user position to the application layer. This
needs layer is separated in a server-side and a
client-side part (components to extract de-
Optimisation features as a caching of aggre- vice data where applicable) and its purpose
gated content are application specific ones that is to determine the actual position supplier
can be considered as part of the SAP. Advanced and extract information data, or provide a
value added services on top of positioning such means for manually identifying position
as movement prediction also can be considered • The top-level layer is the application lay-
however they should preferably be implemented er, which has to determine whether new
as a typical Web service to be accessed on demand. positioning is required, reuse of previous
information is adequate or actual position
Homogenisation of Positioning is known (e.g., within query), forward a
request to SAP layer and ultimately return
Examining the details of user positioning, be it location information back to the end-user
network driven, client driven, or even manual, (within response)
shows a significant heterogeneity not only in the
area of the actual position estimation, but also in The aforementioned layers might make use
the process of obtaining this information which of components (e.g., coordinates translators) or
is a rather quite important aspect of the system services (e.g., GIS). A subset of the previous in-
design. Even the relatively simple case of manual teractions, through eliminating the direct message
positioning might have various implementations: exchange of the positioning server and SAP could
manually supplying the coordinates of the loca- potentially minimize the risk of building privacy
tion or indirectly acquiring them out of location holes in the system.
information (e.g., road intersections, etc.). The
following graph shows various potential interac- Push and Pull Services
tions among equipment, software, and data that
could be utilised to drive user positioning. There are two ways one can consider delivery of
content to the end user. In both cases, it assumed

280
Design and Implementation Approaches for Location-Based, Tourism-Related Services

Figure 7. Positioning interactions (Based on Kakaletris et al., 2004)

that the user has posed a request for content and Over the typical HTTP/TCP/IP stack found
they are not entirely network driven, since this in advanced mobile equipment one can consume
would raise several regulatory issues. In each case such services by various mechanisms. The sim-
the user interaction pattern is somewhat different: plest approach would be by periodically polling
through a proprietary client for any kind of pushed
• A pull service is the one that delivers re- information to be presented to the user. Trigger-
quest to the user upon his/her explicit re- ing polling by identifying significant positional
quest within a reasonable interaction time; change also could be considered for appropriate
• A push service is one that submits infor- devices. This solution introduces network usage
mation to the user when special criteria even when no information is to be delivered to the
are met and there seems to be no explicit user, and is actually a mixture of pushing and con-
binding of a request to a response in the cepts services. Traditional mobile devices could
form of interaction. These criteria could be contacted through typical gateways (SMS/
be location, time or more advanced ones MMS) whenever network-monitoring procedures
like matched lookups for information and decide that interesting information is available.
so on. Push services on location criteria can however
become a potential scalability and performance
threat for an LBS infrastructure, because of the

281
Design and Implementation Approaches for Location-Based, Tourism-Related Services

computational and bandwidth demands required forces counter measures for avoidance of such
for scanning vast collections of data on each and situations. In the SMS/MMS channels case this
every end-user move. information is being incorporated in the state of
the gateway/SAP session that is being established
User Authentication and Identification to fulfil a client request. On the TCP/IP typical
channel the situation is a bit more complicated.
A very important part of the service is user identifi- Since the client is requesting position over an
cation, authentication, and authorisation (Gajparia, HTTP request, the MNO typically receives the
Mitchell, & Yeun, 2005; Soppera & Burbridge, request for locating the user by the SAP. Thus
2005). The SAP is proposed for taking over the some user identifier must be presented (other than
responsibility of authorizing end-users. Authen- the username). Fortunately, the SAP already has
tication is actually optional since it is considered the TCP/IP session information (source address/
that just identifying the user can be quite enough port) that is being used in order for the user to be
in several application scenarios, such as free-of- serviced. Typically end-user mobile equipment is
charge information delivery. Simple identification behind a NAT gateway and this address is not a
might be utilised for some personalisation facili- real one, thus the port number must also be used
ties. A basic scenario for authentication would be in order for the related MNO to locate the correct
based on a username/password challenge scheme, user. The SAP can select the correct MNO either
which will be verified against a user directory. Cli- by the internal directory service (the source ad-
ent certificates are a more advanced option. Using dress of the request directly or indirectly identifies
secure channels for performing authentication is the MNO) or by the end-user information record
required to avoid common security penetration being used. The MNO can therefore identify the
cases, yet not all devices might capable of all actual user through NAT layer information and
common types of secure information exchange. GPRS IP address assignment records.
SAP is the responsible for maintaining the user Nevertheless a highly secure approach could
directory where users (via their credentials) are be restricting positioning information exchange
mapped to unique identifiers and authorisation internally to the MNO infrastructure, by having
information. positioning servers to replying only to requests
Authentication is of crucial importance when raised internally by MNO subscribers, and more
positioning takes place under a network provided specifically only by the mobile device being
positioning mechanism. Acquiring end-user’s located. In this scenario, the client device acts as
location relying on simple username/password an autonomously positioned piece of equipment,
challenge opens a wide opportunity for service at least as this can be understood from the SAP
misuse. Taking advantage of the “spying effect,”24 point of view.
an unauthorised user can attach a typical mobile
device related to a specific user account on a Content Structure
person to be “watched,” and request positioning
information through a secondary device (e.g., fixed Limiting delivered content to “tourism” related
PC or other mobile equipment [ME]) on behalf categories might not be a significant restriction
of the above-mentioned account. If no network for the location-related content to be provided, but
session data are being used, location information certainly poses some requirements on the structure
will potentially be delivered to the “spy”. This this content must meet. There exist rich schemas
might be a well-accepted feature of some imple- for describing “tourism” content (TourML 2003),
mentations but the typical perspective of privacy which attempt to fully capture the correspond-

282
Design and Implementation Approaches for Location-Based, Tourism-Related Services

ing ontology. However from our point of view • The first level is internal CP transformation
forcing “over” structuring of content is usually a and renders original data into a form that
source of problems for the content producers and will comply with the SAP required mini-
providers. It is a strategic decision that content mal structure;
providers expose content compliant with a minimal • The second level of transformations is the
set of requirements. These requirements could one that will transform the content in a way
include position information (in various forms, that it will fully exploit the capabilities of
e.g., rectangles, series of points, etc.), language the end user device and user preferences.
identifiers, labels, multiple classifications in a two Such rendering can result into simple text
level hierarchy (though not a hard constraint) and fragments to be posted over SMS, MMS
a content rating scheme. Content classifiers are fragments, WML, HTML, DHTML pages,
open to extension by CPs and are being aggregated and even advanced SMIL25 presentations.
by the SAP. The first level is mainly proposed to
be a means to define end-user content selection Presentation transformation should not be
preferences, but a more sophisticated scheme can considered as a Web-only requirement; even if
make use of a free hierarchy. Content rating is being data are accessed through proprietary Web service
required in order for free of charge, simple or rich clients such transformations would be of use.
content to be differentiated by the SAP, in order
for it to be delivered to various clients, although Application Gateways
the actual content manipulation is to be performed
by the content provider (CP). Internally CPs might The assumed infrastructure of these models of ser-
realise a more complex structure, but the SAP is vice delivery is heavily based on the assumption of
concerned only about the above-mentioned tags. a homogeneous underlying network—on HTTP/
Usage of extra tags can be handled by additional TCP/IP layers. It is obvious that this is not a real-
user preferences, for which an extensible mecha- istic assumption especially when mobile networks
nism should be provided. come to play. In order to hide the diversity of the
underlying infrastructures, gateways are required
Presentation that allow for the translation of non-SAP native
requests to be fulfilled. These gateways receive
A final, but quite important issue is that eventually client requests and deliver them to the SAP in a
data must be presented to the end-user. Presenta- form they can be understood. At this point it is
tion can be realised by a wide series of equipment, obvious that in order for uniform interaction to be
which might range from high quality multimedia achievable by all CPs, translation of requests is
laptops to simple SMS capable mobile phones. being handled by a specific reception point of the
As already mentioned the XML/XSL/XSLT set gateway that analyses the content of the incoming
of standards and related tools provide a powerful message and forms the appropriate request to the
mechanism for implementing an author-once- SAP. When requests are being replied, the data
publish-many times scheme, capable of covering are being appropriately packaged and delivered
a wide range of end user preferences and equip- to the actual network operator infrastructure com-
ment. At this point, it can be identified that these ponents that will push it back to the requesting
are two level transformations in order to achieve equipment through the appropriate “channel.”
maximum content reusability: Typically channels that require gateways are the
SMS, EMS, MMS ones, while the functionality of

283
Design and Implementation Approaches for Location-Based, Tourism-Related Services

such gateways also is significant for implementa- operator. Despite its “tourism content delivery”
tion of alerting services. orientation, the infrastructure in this testbed can be
made capable of providing more complex services
Pricing/Billing with minimum impact on service providers, by
making use of state-of-the-art client equipment
Authorizing users is not only required for enabling features, such as Java, DHTML, and so on. In this
access to the services and content authentication regard, utilizing a directory service under a stan-
but also for charging and billing. When requests dard protocol, within the SAP, is mostly adequate
and responses are being exchanged among the for creating a totally uniform platform for in-doors
various parties, the SAP user identification is and outdoors LBS and quite heterogeneous po-
being utilised in order for the CP or application sitioning methods and capabilities (e.g., RFID,
service provider (ASP) to authorise and charge the wireless access point identification, etc.). Since
appropriate subscriber. Alternatively ASPs or CPs the motivation behind location-based services is
perform no user identification and deliver data on its potential beneficiary commercial exploitation,
behalf of the SAP, which is the actually charged a preliminary technoeconomic evaluation based
partner. This would cover the CP/ASP/SAP part of on hypothetical deployment and commercialisa-
the service usage. On the other hand the network tion scenarios has been attempted by the authors
operator will charge network usage, potentially and can be found in M-Guide (2002). The study
posing a premium on top of the traffic which is presents basic financial indicators that showed
being delivered to specific ASP/CPs/SAPs, and the viability of the proposed solution as well as
so on. Another scenario is that ASPs/CPs might a full description of the evaluation scenarios and
vary the charges depending on user location or results. The details of the methodology used in
user preferences (such as content quality, type, or technoeconomic evaluation can be found in IST-
location-based advertising allowance), thus al- TONIC (2000).
lowing for quite complex pricing/billing schemes. There are various aspects of the presented
approach where potential improvements will
lead to further benefits for the end-user and the
CONCLUSION content/network providers. These largely involve
standards identification and adoption, as well
Making use of standards and protocols (e.g., LIF, as refinements of current implementations. An
XML, and HTTP), avoiding proprietary solutions essential improvement would be to fully fit the
and limiting interoperability requirements to a design discussed here to the open service-oriented
minimum renders architecture capable of integrat- architecture initiative, thus providing dynamic
ing various stakeholders with limited cost and service discovery. Introducing the attributes and
implications. The technologies being proposed rules that will enable the creation of a universal
allow interoperability and redirection among directory service to support each and every rel-
various sub-service providers and the integration evant information and service lookup is a step
of services in order to provide a homogenous ap- that should be made.
plication to the end-user. Although business-to-customer (B2C) infor-
Although the proposed solution for tourists has mation interchange is sufficiently faced under
been tested26 on a single network operator else- our perspective, business-to-business (B2B) is
where, support for location-based services is not partially dealt with. B2B data exchange is limited
restricted to a single network provider, even when to areas where information is essential for servicing
positioning is performed by the mobile network a particular user request, for example, position-

284
Design and Implementation Approaches for Location-Based, Tourism-Related Services

ing, authentication, and so on. However B2B Against the Clock (Firm). (2002). Macromedia
collaboration can be further exploited in subjects flash 5: Animating for the Web. Upper Saddle
such as content modifications due to relocation, River, NJ: Prentice Hall.
caching, restructuring service organisation, and
Anderson, R. (2000). Professional XML. Birming-
so on. Further work on this area would potentially
ham, UK; Chicago: Wrox Press.
enhance scalability of “push services,” another
area that has not been fully exploited. Andersson, P., Hulten, S., & Valiente, P. (2005).
Work also needs to be done on roaming-user Beauty contest licensing lessons from the 3G
scenarios in order for the infrastructure to fully process in sweden. Telecommunications Policy,
comply with market requirements, since han- 29(8), 577–593. doi:10.1016/j.telpol.2005.06.005
dling and network positioning of roaming users
Beatty, C. (2002). Location-based services:
is currently performed inefficiently. Moreover, a
Navigation for the masses, at last! Journal
more detailed technoeconomic study is needed,
of Navigation, 55(2), 241–248. doi:10.1017/
based on specific deployment scenarios, taking
S0373463302001790
into consideration up-to-date technological and
financial data in order for a more precise profit/ Bennett, C. (2001). Practical WAP: Developing
loss prediction of relevant service deployment to applications for the wireless Web. Cambridge, UK;
be made possible. New York: Cambridge University Press.
Berners-Lee, T., & Hendler, J. (2001). Publish-
ing on the semantic Web: The coming Internet
ACKNOWLEDGMENT
revolution will profoundly affect scientific
information. Nature, 410(6832), 1023–1024.
A part of this work has been performed with the
doi:10.1038/35074206
support of the E-Content M-Guide Project, which
was partially funded by the European Union. The Berners-Lee, T., Hendler, J., & Lassila, O. (2001).
authors would like to acknowledge the support and The Semantic Web: A new form of Web content
contributions of their colleagues from M-Guide that is meaningful to computers will unleash a
project, namely Exodus SA, Vodafone-Panafon revolution of new possibilities. Scientific Ameri-
SA, University of Athens, Municipality of Athens, can, 284(5), 3.
Adaptia and Iternet. The authors would also like
Bluetooth.org. (2001). The bluetooth membership
to thank the two anonymous reviewers for the
site. Retrieved from http://www.bluetooth.org/
fruitful comments.
Bohn, J., & Mattern, F. (2004). Super-distributed
RFID tag infrastructures. Ambient Intelligence.
REFERENCES Proceedings, 3295, 1–12.

W3C. (2001). The Semantic Web. Retrieved from Bosak, J. (1998). Media-independent publish-
http://www.w3.org/2001/sw/ ing: Four myths about XML. Computer, 31(10),
120–122. doi:10.1109/2.722303
Adams, P. M., Ashwell, G. W. B., & Baxter, R.
(2003). Location-based services: An overview Bosak, J., & Bray, T. (1999). XML and the second-
of the standards. BT Technology Journal, 21(1), generation. Scientific American, 280(5), 89–93.
34–43. doi:10.1023/A:1022572210026

285
Design and Implementation Approaches for Location-Based, Tourism-Related Services

Brown, A., Johnston, S., & Kelly, K. (2003). Using Douglas, E. (2004). Indoor positioning system can
service-oriented architecture and component- track equipment and people. Anesthesiology News.
based development to build Web service applica-
Drane, C., Macnaughtan, M., & Scott, C. (1998).
tions: IBM Corporation.
Positioning GSM telephones. IEEE Communica-
Caffery, J., & Stuber, G. L. (1998a). Subscriber tions Magazine, 36(4), 46. doi:10.1109/35.667413
location in CDMA cellular networks. IEEE
Ekahau. (2006). Ekahau Positioning Engine
Transactions on Vehicular Technology, 47(2),
3.1—Ekahau Site Calibration. Retrieved from
406–416. doi:10.1109/25.669079
http://www.ekahau.com/
Caffery, J. J., & Stuber, G. L. (1998b). Overview
El-Rabbany, A. (2002). Introduction to GPS: The
of radiolocation in CDMA cellular systems.
global positioning system. Boston: Artech House.
IEEE Communications Magazine, 36(4), 38–45.
doi:10.1109/35.667411 Ellinger, F., Barras, D., & Jackel, H. (2002).
Mobile phone market study. TRLabs/IEEE Wire-
Colan, M. (2004). Service-oriented architecture
less 2002. Retrieved from http://www.ife.ee.ethz.
expands the vision of Web services: IBM Cor-
ch/~ellinger/Homepage/mp_rev.pdf
poration.
EMILY. (2002). Business models: EMILY (IST-
Czajkowski, K., Foster, I., Ferguson, D. F., Frey,
2000-26040) Project.
J., Graham, S., Snelling, D., et al. (2004). The
ws-resource framework. Retrieved from http:// Ericsson. Mobile positioning system / mobile
www.globus.org/wsrf/specs/ws-wsrf.pdf positioning protocol. Ericsson.
Dana, P. H. (1994). GPS overview. Department of Erl, T. (2004). Service-oriented architecture: A
Geography, University of Texas at Austin. field guide to integrating XML and Web services.
Upper Saddle River, NJ: Prentice Hall Professional
Dao, D., Rizos, C., & Wang, J. (2002). Location-
Technical Reference.
based services: Technical and business issues.
GPS Solutions, 6(3), 169. doi:10.1007/s10291- Erl, T. (2005). Service-oriented architecture:
002-0031-5 Concepts, technology, and design. Upper Saddle
River, NJ: Prentice Hall Professional Technical
Daoud, F., & Mohan, S. (2002). Strategies for
Reference.
provisioning and operating VHE services in
multi-access networks. IEEE Communications ETSI. (2006). ETSI telecom standards. Retrieved
Magazine, 40(1), 78. doi:10.1109/35.978053 from http://www.etsi.org/
Devine, A., & Holmquist, S. (2001) Mobile inter- Extensible markup language (XML). (n.d.). Re-
net content providers and their business models. trieved from http://www.w3.org/XML/
The Royal Institute of Technology, Sweden.
FCC. (2001). FCC wireless 911 requirements.
Di Fazio, A., Mocci, G., Rossini, E., D’Angelo, FCC.
G., Lorelli, A., & Jarosh, A. (2004). Gauss satellite
Feng, J., Zhu, Y. L., Mukai, N., & Watanabe,
solution for location-based services. International
T. (2005). Search on transportation network for
Journal of Satellite Communications and Network-
location-based service. Innovations in Applied
ing, 22(1), 55–85. doi:10.1002/sat.775
Artificial Intelligence, 3533, 657–666.

286
Design and Implementation Approaches for Location-Based, Tourism-Related Services

Foster, I., Frey, J., Graham, S., Tuecke, S., Cza- GSM. (2002). Location based services. Retrieved
jkowski, K., & Ferguson, D. F. (2004). Modeling from http://www.gsmworld.com/documents/lbs/
stateful resources with Web services. Retrieved se23.pdf
from http://devresource.hp.com/drc/specifica-
GSM. (2003). MMS interworking guidelines.
tions/wsrf/ModelingState-1-1.pdf
Retrieved from http://www.gsmworld.com/docu-
Fritsch, L., & Scherner, T. (2005). A multilaterally ments/ireg/ir52310.pdf
secure, privacy-friendly location-based service
Halonen, T., Romero, J., & Melero, J. (2003).
for disaster management and civil protection.
GSM, GPRS, and EDGE performance: Evolution
Networking ICN, 2(3421), 1130–1137.
towards 3G/UMTS (2nd ed.). Chichester, West
Gajparia, A. S., Mitchell, C. J., & Yeun, C. Y. Sussex, UK; Hoboken, NJ: J. Wiley.
(2005). Supporting user privacy in location based
Heijden, M. d., & Taylor, M. (2000). Understand-
services. IEICE Transactions on Communications.
ing wap: Wireless applications, devices, and
E (Norwalk, Conn.), 88B(7), 2837–2847.
services. Boston: Artech House.
Gamma, E. (1995). Design patterns: Elements of
Hightower, J., & Borriello, G. (2001). Location
reusable object-oriented software. Reading, MA:
systems for ubiquitous computing. Computer,
Addison-Wesley.
34(8), 57. doi:10.1109/2.940014
GARMIN. (n.d.). GPS guide for beginners. Re-
Holma, H., & Toskala, A. (2004). Wcdma for
trieved from http://www.garmin.com/aboutGPS/
UMTS: Radio access for third generation mobile
manual.html
communications (3rd ed.). Hoboken, NJ: J. Wiley
3GPP. (2002). 3rd Generation Partnership Proj- & Sons.
ect—3GPP. Retrieved from http://www.3gpp.org/
Huber, A. J., & Huber, J. F. (2002). UMTS and
Grajski, K. A., & Kirk, E. (2003). Towards a mo- mobile computing. Boston: Artech House.
bile multimedia age—location-based services: A
IST-REPOSIT. (2001). Real time dynamic band-
case study. Wireless Personal Communications,
width optimisation in satellite networks. IST. EU.
26(2-3), 105–116. doi:10.1023/A:1025518301575
IST-TONIC. (2000). Technoeconomics of IP
Gratsias, K., Frentzos, E., Delis, V., & Theodo-
optimised networks and services. IST. EU.
ridis, Y. (2005). Towards a taxonomy of location
based services. Web and Wireless Geographical Iwasaki, Y., Kawaguchi, N., & Inagaki, Y. (2005).
Information Systems. Proceedings, 3833, 19–30. Azim: Direction-based service system for both
indoors and outdoors. IEICE Transactions on
Gruber, H. (2005). The economics of mobile
Communications. E (Norwalk, Conn.), 88B(3),
telecommunications. New York: Cambridge
1034–1044.
University Press.
Kaaranen, H. (2005). UMTS networks: Architec-
Gruber, H., & Verboven, F. (2001). The diffusion
ture, mobility, and services (2nd ed.). Chichester,
of mobile telecommunications services in the
West Sussex, UK; Hoboken, NJ: J. Wiley & Sons.
european union. European Economic Review,
45(3), 588. doi:10.1016/S0014-2921(00)00068-4

287
Design and Implementation Approaches for Location-Based, Tourism-Related Services

Kakaletris, G., Varoutas, D., Katsianis, D., Sphi- LIF-IOT. (2002). Location services (LCS) inter-
copoulos, T., & Kouvas, G. (2004). Designing and operability test (IOT) specification in GSM. Loca-
implementing an open infrastructure for location- tion Interoperability Forum (LIF), Interoperability
based, tourism-related content delivery. Wireless Testing Group.
Personal Communications, 30(2-4), 153–165.
Lin, J. Y.-B., & Chlamtac, I. (2001). Wireless and
doi:10.1023/B:WIRE.0000049396.99006.0e
mobile network architectures. New York: Wiley.
Karagiozidis, M., Markoulidakis, Y., Velentzas, S.,
LoVEUS. (2002). User requirements of the
& Kauranne, T. (2003). Commercial use of mobile,
LoVEUS system (LoVEUS IST-2000-30155).
personalised location-based services. Journal of
the Communications Network, 2, 15–20. Ludden, B. (2000). Location technology. Voda-
fone.
Katsianis, D., Welling, I., Ylonen, M., Varoutas,
D., Sphicopoulos, T., & Elnegaard, N. K. (2001). M-Guide. (2002). User needs—market analysis
The financial perspective of the mobile networks (Deliverable D2.1).
in europe. [see also IEEE Wireless Communica-
Mao, Z., & Douligeris, C. (2000). A location-
tions]. IEEE Personal Communications, 8(6),
based mobility tracking scheme for pcs networks.
58–64. doi:10.1109/98.972169
Computer Communications, 23(18), 1729–1739.
Kim, J. W., Kim, C. S., Gautam, A., & Lee, Y. doi:10.1016/S0140-3664(00)00202-4
(2005). Location-based tour guide system using
Microsoft. (n.d.). Microsoft.Net compact frame-
mobile gis and Web crawling. Web and Wireless
work 2.0, Retrieved from http://msdn.microsoft.
Geographical Information Systems, 3428, 51–63.
com/mobility/netcf/
Korhonen, J., Aalto, O., Gurtov, A., & Lamanen, H.
Miller, B. A., & Bisdikian, C. (2002). Bluetooth
(2001). Measured performance of GSM, HSCSD
revealed (2nd ed.). Upper Saddle River, NJ: Pren-
and GPRS. Paper presented at the IEEE Interna-
tice Hall PTR.
tional Conference on Communications (ICC’01).
Morrow, R. (2002). Bluetooth operation and use.
Koutsopoulou, M., Panagiotakis, S., Alonistioti,
New York: McGraw-Hill.
A., & Kaloxylos, A. (2005). Customised billing
for location-based services. Distributed Applica- Mouly, M., & Pautet, M. B. (1995). Current evolu-
tions and Interoperable Systems, 3543, 27–37. tion of the GSM systems. IEEE Personal Com-
munications, 2(5), 9–19. doi:10.1109/98.468359
Kozak, B. (2002). Macromedia flash MX freehand
10: Advanced training from the source. Berkeley, Moureu L. (2000), Impact of location-based
CA: Macromedia Press. services on the mobility market. Alcatel Telecom-
munications Review 2nd Quarter 2000.
Laitinen, H. (2001). Cellular location techniques.
Paper presented at the MLW. Muller, N. J. (2001). Bluetooth demystified. New
York: McGraw-Hill.
Lee, Y., Prabhu, N., & Park, E. K. (2002). Middle-
ware for location based information services in mo- Muller, N. J. (2003). WiFi for the enterprise. New
bile environment. Developing and Infrastructure York: McGraw-Hill.
for Mobile and Wireless Systems, 2538, 173–182.
doi:10.1007/3-540-36257-6_16

288
Design and Implementation Approaches for Location-Based, Tourism-Related Services

Nellen, P. M., Bronnimann, R., Held, M., & Reed, J. H., Krizman, K. J., Woerner, B. D., &
Sennhauser, U. (2004). Long-term monitoring Rappaport, T. S. (1998). An overview of the chal-
of polarization-mode dispersion of aerial optical lenges and progress in meeting the e-911 require-
cables with respect to line availability. Journal ment for location service. IEEE Communications
of Lightwave Technology, 22(8), 1848–1855. Magazine, 36(4), 30–37. doi:10.1109/35.667410
doi:10.1109/JLT.2004.832423
Rysavy, P. (2005). Data capabilities: GPRS
Newcomer, E., & Lomow, G. (2005). Understand- to HSDPA and beyond. Retrieved from http://
ing soa with Web services. Upper Saddle River, www.3gamericas.org/pdfs/rysavy_data_
NJ: Addison-Wesley. sept2005.pdf
Northstream. (2001). Location based services: Schmid, A., Neubauer, A., Ehm, H., Weigel, R.,
Considerations and challenges. Retrieved from Lemke, N., & Heinrichs, G. (2005). Combined
http://www.northstream.se Galileo/GPS architecture for enhanced sensitivity
reception. AEÜ. International Journal of Elec-
OGS. (n.d.). Opengis location services, open
tronics and Communications, 59(5), 297–306.
geospatial consortium. Retrieved from http://
doi:10.1016/j.aeue.2005.05.013
www.opengeospatial.org/functional/?page=ols
Searle, M. (2001). Location base services. Paper
OMA. (2002). Mobile location protocol, location
presented at the InSig.
working group: Open Mobile Alliance. OTC.
(n.d.). Open tourism consortium. Retrieved from Smith, R. J. (2003). WiFi home networking. New
http://www.opentourism.org/ York: McGraw-Hill.
Pahlavan, K., & Krishnamurthy, P. (2002). Prin- SnapTrack. (2001). Location technologies for
ciples of wireless networks: A unified approach. GSM, GPRS and WCDMA networks. White paper.
Upper Saddle River, NJ: Prentice Hall PTR.
Snell, J., Tidwell, D., & Kulchenko, P. (2002).
Pateli, A. G., Giaglis, G. M., & Spinellis, D. D. Programming Web services with soap (1st ed.).
(2005). Trial evaluation of wireless info-com- Sebastopol, CA: O’Reilly & Associates.
munication and indoor location-based services
Soppera, A., & Burbridge, T. (2005). Wireless
in exhibition shows. Advances in Informatics.
identification: Privacy and security. BT Technol-
Proceedings, 3746, 199–210.
ogy Journal, 23(4), 54–64. doi:10.1007/s10550-
Patil, B. (2003). IP in wireless networks. Upper 006-0007-z
Saddle River, NJ: Prentice Hall Professional
Sparacino, F. (2002). The museum wearable:
Technical Reference.
Real-time sensor-driven understanding of visitors’
Prasad, R., & Ruggieri, M. (2005). Applied satellite interests for personalized visually-augmented
navigation using GPS, Galileo, and augmentation museum experiences. Paper presented at the
systems. Boston: Artech House. Museums and the Web.
Rao, B., & Minakakis, L. (2003). Evolution Spirito, M. A. (2001). On the accuracy of cellular
of mobile location-based services. Com- mobile station location estimation. IEEE Transac-
munications of the ACM, 46(12), 61–65. tions on Vehicular Technology, 50(3), 674–685.
doi:10.1145/953460.953490 doi:10.1109/25.933304

289
Design and Implementation Approaches for Location-Based, Tourism-Related Services

Spratt, M. (2003). An overview of positioning Varoutas, D., Katsianis, D., Sphicopoulos, T.,
by diffusion. Wireless Networks, 9(6), 565–574. Stordahl, K., & Welling, I. (2006). On the eco-
doi:10.1023/A:1025942800145 nomics of 3G mobile virtual network operators
(MVNOs). Wireless Personal Communications,
Staab, S., & Werthner, H. (2002). Intelligent
36(2), 129–142. doi:10.1007/s11277-006-0027-5
systems for tourism. IEEE Intelligent Systems,
17(6), 53–55. doi:10.1109/MIS.2002.1134362 Wang, Y., Jia, X., Lee, H. K., & Li, G. Y. (2003).
An indoors wireless positioning system based on
SUN. (n.d.). Java 2 platform, micro edition (j2me).
wireless local area network infrastructure. Paper
Retrieved from http://java.sun.com/j2me/
presented at the SatNav 2003, the 6th International
Tarumi, H., Morishita, K., & Kambayashi, Y. Symposium on Satellite Navigation Technology
(2000). Public applications of spacetag and Including Mobile Positioning & Location Ser-
their impacts. Digital Cities, 1765, 350–363. vices, Australia.
doi:10.1007/3-540-46422-0_28
Xiao, Y., & Pan, Y. (2005). Wireless lans and
Taylor, J. H., & Ryan, J. (1995). Museums and bluetooth. New York: Nova Science Publishers.
galleries on the internet. Internet Research-
Yan, X. (2004). 3G licensing in Hong Kong:
Electronic Networking Applications and Policy,
The debate. Telecommunications Policy, 28(2),
5(1), 80. doi:10.1108/10662249510084499
213–226. doi:10.1016/j.telpol.2003.12.006
Tisal, J. (2001). The GSM network: GPRS evolu-
Yilin, Z. (2000). Mobile phone location determina-
tion: One step towards UMTS (2nd ed.). Chichester;
tion and its impact on intelligent transportation sys-
New York: J. Wiley.
tems. IEEE Transactions on Intelligent Transpor-
Tour, M. L. (2003). TourML. Retrieved from http:// tation Systems, 1(1), 55. doi:10.1109/6979.869021
www.tourml.org/
Zhao, Y. L. (2002). Standardization of mobile
UDDI. (2004). Enabling service-oriented archi- phone positioning for 3G systems. IEEE Commu-
tecture, Retrieved from http://www.uddi.org/ nications Magazine, 40(7), 108–116. doi:10.1109/
MCOM.2002.1018015
UMTS. (n.d.). The UMTS forum. Retrieved from
http://www.umts-forum.org/ Zimmermann, A., Lorenz, A., & Specht, M.
(2003). User modeling in adaptive audio-aug-
Varoutas, D., Katsianis, D., Sphicopoulos, T.,
mented museum environments. User Modeling
Cerboni, A., Kalhagen, K. O., Stordahl, K., et
2003. Proceedings, 2702, 403–407.
al. (2002, June 16-19). 3G MVNOs financial
perspectives. Paper presented at the IST Mobile
and Wireless Telecommunications Summit, Thes-
ENDNOTES
saloniki, Greece.
Varoutas, D., Katsianis, D., Sphicopoulos, T.,
1
Being redefined for the electronic era.
Loizillon, F., Kalhagen, K. O., & Stordahl, K.
2
2nd generation (2G) of mobile telephony is
(2003). Business opportunities through UMTS— widely known as GSM, while the term 2.5G
WLAN networks. Annales Des Telecommuni- is used for the 2G networks with data and
cations-Annals of Telecommunications, 58(3-4), Internet access capabilities mainly based on
553–575. GPRS.
3
The 3rd generation of mobile telephony is
mainly known as UMTS.

290
Design and Implementation Approaches for Location-Based, Tourism-Related Services

4
Also labelled as “operator” when no ambigu- 17
In advance downloading of the content to the
ity is introduced. mobile device and consequently displaying
5
Speed might vary upon device capabilities, it to the tourist by utilizing combined user-
reception quality, network load, and so forth. interaction and equipment (such as a GPS
6
International Telecommunication Union device) provided information.
whose standardisation sector is ITU-T, 18
Flash technology (introduced by MacroMe-
known as CCITT up to 1993 (CCITT origi- dia Inc. in 1996) is mainly used for creating
nates back to 1865). interactive multimedia presentations on
7
Performance highly varies upon network the Web.
load, infrastructure specifications, reception 19
3GP is a format defined by 3GPP as a
and mobile equipment capabilities. simplified version of MPEG-4 Part 14 for
8
Nominal speed varies from 11 (802.11b) to multimedia objects exchange and playback
108MBps (Super-G). on 3G mobile phones.
9
The time required for a signal (or minimal 20
Development effort required to achieve
data element) to be send by the source, systems’ inter-working when no sufficient
transmitted back by the target and received configuration/customisation capabilities
back at the source. exist in the involved software entities.
10
Outdoors WLAN access is becoming more 21
Special concerns apply to this assumption.
and more popular since year 2005. 22
An initially European standardisation
11
Basic geographical information systems organisation, currently named as ECMA
are data repositories that offer storage and International.
retrieval facilities suited to the type of infor- 23
The UDDI registry is essentially a “yellow
mation in focus (geospatial) accompanied by pages” like service for locating Web services
appropriate management and visualisation capable of serving a particular request.
tools. 24
The term has been proposed by Antti Dam-
12
Source: Derived from Searle (2001) ski, Adaptia Ltd. in M-GUIDE Project (M-
13
Source: Derived from Searle (2001) Guide, 2002)
14
There are cases that content might be pre- 25
A markup language of similar technology
installed on a device. to HTML that allows display and playback
15
The XSL Formatting Objects (XSL-FO) of interactive multimedia content adding
vocabulary is designed in order to facilitate special features for synchronisation/presen-
content display on a wide variety of media tation, partially supported by some of the
such as screen, paper, or even voice. well-known Web browsers.
16
Mark-up language technologically similar 26
The solution has been developed and tested
with HTML for use with WAP and 2G/2.5G as part of the M-Guide Project, which has
Web-enabled mobile phones. been partially funded project by EU under
the eContent initiative, during 2002-2004.

291
Design and Implementation Approaches for Location-Based, Tourism-Related Services

APPENDIX: ACRONYMS

Acronym Description
2G 2nd generation mobile telephony (e.g., GSM)
2.5 G 2nd generation mobile telephony (e.g., GSM + GPRS)
3GPP 3G Partnership Project
A-GPS assisted GPS
AOA angle of arrival
ASP application service provider
B2B business to business
B2C business to client
BS base station
BTS base transceiver station
CCITT International Telegraph and Telephone Consultative Committee
CDMA code division multiple access
CMS content management system
COO cell of origin
CSS cascading stylesheet
DHTML Dynamic HTML
ECMA European Computer Manufacturers Association
EDGE enhanced data GSM environment
E-OTD enhanced observed time difference
GIS geographical information system
GMLC gateway mobile location center
GPRS general packer radio service
GPS global positioning system
GSM global system for mobile communication
HSCSD high speed circuit switched data
HSDPA high-speed downlink packet access
HTML Hypertext Markup Language
HTTP hypertext transfer protocol
ICT information and telecommunication technology
IEEE Institute of Electrical and Electronics Engineers
IP Internet protocol
IR infrared
IS information systems
IT information technology
ITU International Telecommunication Union
JPEG Joint Photographic Experts Group’
LAN local area network
LBS location-based services

continued on following page

292
Design and Implementation Approaches for Location-Based, Tourism-Related Services

Appendix: Acronyms continued

Acronym Description
LIF location interoperability forum
LMU location measurement unit
ME mobile equipment
MLP mobile location protocol
MMS multimedia messaging service
MNO mobile network operator
MPC mobile positioning center
MPEG Motion Picture Expert Group
MPP mobile positioning protocol
MSID mobile subscriber ID
NAT network address translation
OMA Open Mobile Alliance
OpenLS OpenGIS location services
PDA personal data assistant
QoS quality of service
RDF resource description framework
RF radio frequency
RTT round trip time
SAP service access point
SMS short message system
SOA service oriented architecture
SOAP simple object access protocol
TA timing advance
TCP transport control protocol
TDMA time division multiple access
TOA time of arrival
UDDI universal description, discovery, and integration
UMTS universal mobile telecommunications system
URI uniform resource identifiers
URL uniform resource locator
W3C World Wide Web Consortium
WAG wireless application gateway
WAN wide area network
WAP wireless access protocol
WCDMA wideband CDMA
WiFi wireless fidelity, any type of 802.11xx network
WLAN wireless LAN
WML Wireless Markup Language
WSRF Web services resource framework

continued on following page

293
Design and Implementation Approaches for Location-Based, Tourism-Related Services

Appendix: Acronyms continued

Acronym Description
XLS XML for location services
XML eXtensible Markup Language
XSL eXtensible Stylesheet Language
XSL-FO XSL formatting objects
XSLT XSL transformation

This work was previously published in Information and Communication Technologies in Support of the Tourism Industry, edited
by W. Pease, M. Rowe, and M. Cooper, pp. 114-161, copyright 2007 by IGI Publishing (an imprint of IGI Global).

294
295

Chapter 19
Methodical Spatial Database
Design with Topological
Polygon Structures
Jean Damascène Mazimpaka
National University of Rwanda, Rwanda

ABSTRACT
Spatial databases form the foundation for a Spatial Data Infrastructure (SDI). For this, a spatial database
should be methodically developed to accommodate its role in SDI. It is desirable to have an approach
to spatial database development that considers maintenance from the early stage of database design
and in a flexible way. Moreover, there is a lack of a mechanism to capture topological relations of spa-
tial objects during the design process. This paper presents an approach that integrates maintenance of
topological integrity constraints into the whole spatial database development cycle. The approach is
based on the concept of Abstract Data Types. A number of topological classes have been identified and
modelling primitives developed for them. Topological integrity constraints are embedded into mainte-
nance functions associated with the topological classes. A semi-automatic transformation process has
been developed following the principles of Model Driven Architecture to simplify the design process.

INTRODUCTION tem (DBMS) is a software system used to create


and manage databases. Today’s best approach to
With the current advances in Information and Com- managing huge amounts of data is to use a DBMS.
munication Technology, there has been a problem The field of Geographic Information Systems
shift from data availability to data maintenance. (GIS) makes no exception. However, in GIS the
A database is a large, structured and integrated problem becomes much more complex due to the
collection of data. A Database Management Sys- nature of spatial data that make a spatial database
(Tveite, 2001). A spatial database is a special type
of database which, in addition to containing con-
DOI: 10.4018/978-1-4666-2038-4.ch019

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Methodical Spatial Database Design with Topological Polygon Structures

ventional attribute values of the objects, contains currently existing approaches of data modelling in
data about their geographic location and shape. spatial databases. Next, we describe the proposed
In the field of spatial database development, a lot spatial database design methodology. To show the
of work has been done to capture the semantics applicability of the proposed approach, we then
of spatial objects such as their geographic loca- present an example of implementing the proposed
tion and shape (Shekhar et al., 1999). However, transformational design. Finally, we give some
less importance has been given to capturing their conclusions and the direction for future work.
spatial relations. For instance, spatial objects
such as provinces of a country can be modelled Related Work
as polygons and their geographic coordinates are
recorded to indicate their geographic location. The practice of data modeling has been adopted
However, their spatial relations that they cannot in the field of spatial database design because it
overlap and they form a contiguous area which is simplifies the task of database design (Worboys et
the country, are not modelled or it is done in an al., 1990). The data modeling approaches that have
inflexible way if done. These spatial relations will been widely used are Entity Relationship (ER) or
later form database integrity constraints that need Extended Entity Relationship (EER) and the Uni-
to be maintained. The spatial integrity constraints fied Modeling Language (UML). The extensibility
of the objects are among the most important prob- of UML has made it the best choice as a modeling
lems in modelling spatial databases (Shekhal et language for sophisticated applications, such as
al., 1999). These considerations highlight the need GIS dealing with complex objects and complex
for a mechanism to capture spatial relations, and relationships (Marcos et al., 2003). In the domain
an improved methodology for spatial database of GIS, Tveite (2001) suggests a technique of
design which considers the maintenance of spa- extending the EER and the UML approaches with
tial integrity constraints throughout the database geographical icons. In his work, Tveite proposed
development cycle. different geographical icons for different object
Egenhofer and Franzosa (1991) distinguish types (e.g., points, lines, regions, volumes). The
three categories of spatial relations; topological geometry of a spatial object is modeled by adding
relations, metric relations and relations concern- the appropriate geographical icon to a standard
ing the partial and total order of spatial objects. modelling primitive. In this approach different
Topological relations are spatial relations that object types are identified and their geometry is
are not affected by elastic transformations such modeled but different topological relationships
as rotation and scaling (Egenhofer et al., 1994). are not modeled. The consequence of this is that
This paper presents an improvement to existing if the translation of the model into the database
spatial database design methodology by including model is ever done, the spatial relationships are not
also topological relations in the conceptual model included and the design does not provide any sup-
of the database. Through a semi-automatic trans- port for maintaining spatial integrity constraints.
formation process, topological relationships are Wang and Reinhardt (2007) present a method to
carried at all design steps till the implementation extend geographic data modeling also taking into
where they become database integrity constraints account topological relations. This method uses
that are enforced using a set of maintenance func- a Constraint Decision Table (CDT) to store the
tions. The work presented in this paper focuses on topological integrity constraints. The problems of
collections of area features that display topological this approach are linked to the separation of the
dependencies. The paper is structured as follows; constraints from the UML conceptual schema and
after this introduction, we present an overview of the mastery required for defining the constraints.

296
Methodical Spatial Database Design with Topological Polygon Structures

The two elements (UML model and the Constraint PostgreSQL. Our work does not intend to replace
Decision Table) do not make a compact structure the topology modeling approach used in ArcGIS
and the propagation of updates on one element to products but to enhance the spatial data support
the other element needs to be handled explicitly in those popular DBMSs that are increasingly be-
and carefully. The model reader does not find ing adopted. The enhancement concerned here is
easily the connection between the components of on the maintenance of topological relationships.
the UML model and the constraints in the Con- Moreover, we want to provide an approach which
straint Decision Table. Another approach that is is flexible throughout the whole spatial database
sometimes used to express spatial relationships in development cycle. The approach is such that
UML models of spatial data is to use OCL (Object from the early stage of design the user designs the
Constraint Language) constraints as presented in database visually with high clarity and less effort.
Belussi et al. (2004) and Pinet et al. (2007). OCL Then the design is transformed semi-automatically
has a high expressive power (Warmer & Kleppe, to implementation enabling choice among avail-
2003) but we argue that the textual description able implementation options but in such a way
of OCL constraints makes the model less under- that irrespective of the chosen implementation
standable by the model readers and writing OCL option, the implementation contains functions
constraints requires the database designer to master that will help the user to maintain topological
OCL syntax. It is desirable to restrict the use of integrity constraints at the time of manipulating
OCL constraints to the cases where they are the the database developed.
only possible option.
One of the issues treated in this research is the
implementation of topology related to the work by TRANSFORMATIONAL DESIGN
Hoel et al. (2003). Their work presents a physical OF SPATIAL DATABASES
topology implementation which is built into the
ArcGIS Geodatabase (ESRI, 2002). This topol- The spatial database design methodology pre-
ogy implementation has an interesting feature of sented in this paper is a transformational design
flexibility and it is supported in ArcGIS products which follows the principles of Model Driven
which are very popular. While the ArcGIS Geo- Architecture (MDA). The MDA approach has been
database is an ESRI (Environmental Systems chosen for its feature of simplifying the design
Research Institute) proprietary spatial database task by separating concerns at different design
format, the current trend in the GIS domain is to steps as explained in the MDA guide (Miller
use a full-fledged DBMS and a GIS in conjunction & Mukerji, 2003). A conceptual schema of the
(De By, 2004). In this situation, popular DBMSs database is built using a set of modeling primi-
such as Oracle (Loney, 2004) and PostgreSQL tives. The conceptual schema is then transformed
(PostgreSQL Global Development Group, 2008) into a logical schema which is then transformed
are good candidates for spatial data management into a physical schema for the implementation.
while popular GIS packages such as ArcGIS are A detailed discussion of database schema and its
good candidates for spatial data manipulation. transformation can be found in (Halpin & Proper,
These DBMSs have a well known standard data 1995). While the concepts of database design and
management technology and some of them have transformational design are not new, in this paper
an extension to support spatial data. For instance a focus is on handling spatial data peculiarities in
Murray (2003) describes the spatial data support those existing concepts. Figure 1 illustrates this
in Oracle while Ramsey (2005) and Refractions transformational design process. Simple concepts
Research (2005) describe the spatial data support in are modeled using standard UML modeling primi-

297
Methodical Spatial Database Design with Topological Polygon Structures

Figure 1. The transformational design process


For each of the topological classes, a modeling
primitive has been designed. Each topological
class has properties which are the attributes of the
object being modeled and methods which are the
functions that will be used to perform database
modification operations such as deleting the object
without violating topological integrity constraints.
An area collection can be defined as a spatial
object made of one or more related area objects.
It can be represented by a polygon if it is made of
only one element or by a set of polygons (a mul-
tipolygon) if it is made of more than one element.
The inherent characteristic of area collections is
that a collection is made of areas that have mutu-
ally disjoint interiors. The consideration of other
specific characteristics allows us to define a number
of categories of area collections. Specific charac-
teristics of area collections are such as whether
the objects making all the related area collections
form a contiguous area, whether the elements in
one collection are disjoint, touch in points or in line
segments and whether there are area collections
with a semantic spatial aggregation connection. A
contiguous area is an area in which from any point
there exists a path fully contained in the area to any
other point in the area. A semantic spatial aggrega-
tion connection means a relationship in which an
object at one level is made of the geometric union
tives. For modeling area features with topological
of some objects at another level.
dependencies, we have designed new modeling
Based on these characteristics, we have defined
primitives by extending UML.
the following categories of area collections which
are illustrated in Figure 2:
Area Collection Modeling Primitives
• Collections made of polygons that make
The spatial objects of main interest in this work are
up a contiguous area (Figure 2a)
collections of area features that display topological
• Collections made of multipolygons that
dependencies referred to as area collections. We
make up a contiguous area (Figure 2b)
have designed the modeling primitives for area
• Collections made of disjoint areas (Figure
collections based on the concept of Abstract Data
2c)
Types (ADT). A detailed discussion of the concepts
• Collections of discretely touching areas
of abstract data types can be found in Dale and
(Figure 2d)
Walker (1996). A number of classes of area col-
• Collections of continuously touching areas
lections have been identified based on the types
(Figure 2e)
of topological dependencies that they display.
• Hierarchy of areas (Figure 2f)

298
Methodical Spatial Database Design with Topological Polygon Structures

Figure 2. Example of categories of area collections

This is not an exhaustive list of categories of class. The names of the six stereotypes in the order
area collections. A different categorisation can of their corresponding classes are; contiguous-
be made based on other specific characteristics PolygonalArea, contiguousMultipolygonalArea,
for a specific application. For instance if a land disjointArea, pointTouchArea, lineTouchArea and
parcels management application is considered a areaHierarchyAssociation.
more focused categorisation can be developed One of the unique features of the spatial data-
The categorisation given here is a possible general base design methodology presented in this paper
example that considers mostly encountered real is the consideration of maintenance of topologi-
world objects. It is up to the user to decide which cal integrity constraints from the early stages of
modeling primitive most suits the application the design. Topological integrity constraints are
at hand. For each of these classes, a modeling maintained through the maintenance functions
primitive has been developed. The new modeling carried by the modeling primitives. For instance,
primitives are made available as a UML profile the following functions are associated with the
containing six stereotypes, one for each topological modeling primitive contiguousPolygonalArea:

299
Methodical Spatial Database Design with Topological Polygon Structures

• AddToContiguousArea used to add a new Step 1: Check if the new object has an intersec-
contiguousPolygonalArea object to the tion relationship other than “touches” with
set of existing contiguousPolygonalArea any existing object. If it does, set the error
objects, message then go to step 4
• DeleteFromContiguousArea used to delete Step 2: Check if the geometric union of existing
a contiguousPolygonalArea object, and new objects would have gaps or holes.
• MergePolygonalAreas used to merge two If it would, set the error message then go
contiguousPolygonalArea objects, to step 4.
• ModifyPolygonalAreaBoundary used to Step 3: Insert the new object. Set the message
modify a part of the boundary of a con- to Success.
tiguousPolygonalArea object shrinking or Step 4: Return the message
enlarging it.
This function is conceptually expressed using
The specification of the function AddToCon- the pseudocode and the relational calculus nota-
tiguousArea is as follows: tions as shown in Figure 3.

Input: The name of the table that will contain the A Semi-Automatic Transformation
objects, an array of names of non geometry Process
columns into which values are to be inserted,
an array of values to insert into non geom- The transformational design presented in this
etry columns and the geometry value of the paper separates three design levels: the concep-
object to be added. tual design, the logical design and the physical
Output: the text Success if the operation was design. The conceptual design uses UML to build
successful or the cause of failure otherwise. a conceptual schema of the database. A detailed
Algorithm: AddToContiguousArea (T: text, u: discussion of the object-oriented concepts used in
text[] v: text[], w: geometry): text UML can be found in (Ambler, 2004). The logi-
cal design uses the concepts of the relational data

Figure 3. Example of a conceptual function

300
Methodical Spatial Database Design with Topological Polygon Structures

model to produce the structure of the database. A oped using the programming environment of the
detailed discussion of the relational data model modeling tool.
can be found in (Elmasri & Navathe, 2007). The The transformation from logical schema to
physical design specifies the schema obtained physical schema produces from the final logical
from logical design in the SQL (Structured Query schema a set of statements written in the SQL of
Language) of the DBMS chosen for implementa- the DBMS chosen for implementation. This is a
tion. The physical design also fine-tunes the logi- PSM to PSM transformation. These SQL state-
cal schema to make efficient use of the physical ments include Data Definition Language (DDL)
storage space and enhance the performance of statements used to create the database structure
the database according to the requirements of the and Data Manipulation Language (DML) state-
application (Marcos et al., 2003). In particular at ments used to work on the data in the database.
this design step, spatial indexes are considered. The DML statements include the maintenance
The major advantage of the MDA approach is functions to manipulate area collections. Some
the ability to transform one type of model into modeling tools provide the functionality to gen-
another type with a certain level of automation. erate DDL and DML statements from the logical
In the context of MDA, the conceptual schema is schema. However, this built-in functionality is
a Platform Independent Model (PIM) whereas the only suitable in a few simple cases. For instance,
logical schema and the physical schema are Plat- these modeling tools do not come with support for
form Specific Models (PSM). The design approach modeling spatial database and therefore their DDL
presented here contains two levels of transforma- statement generation functionality does not handle
tion; from conceptual schema to logical schema spatial data peculiarities in DBMS such as specific
and from logical schema to physical schema. The treatment for geometry columns. For this reason
transformation process is semi-automatic. The a custom tool for performing the logical schema
semi-automation simplifies the design task as the to physical schema transformation is preferred.
database designer has to concentrate on specify-
ing the data content of the database (conceptual
schema) and not on its structure and performance. IMPLEMENTATION OF THE
The semi-automation also provides flexibility by TRANSFORMATIONAL SPATIAL
allowing the database designer to influence the DATABASE DESIGN
transformation; during the transformation, the
designer can choose among available options The implementation of the proposed transfor-
such as the implementation model. mational spatial database design consists of the
The transformation from conceptual schema creation of the UML profile containing the mod-
to logical schema takes a UML class diagram, eling primitives for area collections followed by
which is the conceptual model of the database the development of tools to perform the transfor-
and produces another UML class diagram that mation operation. In the current implementation
represents the relational model of the database. example, Enterprise Architect has been chosen
The transformation is done following a set of as the modeling tool and PostgreSQL/PostGIS
MDA transformation rules. The transformation as the target DBMS mainly because of the wide
from conceptual schema to logical schema is done use of these tools and the support of concepts
using the functionalities offered by the modeling required in this work. The UML profile has been
tool. These functionalities include the standard created using the UML extensibility mechanisms
transformation definitions built in the tool and (Jacobson et al., 1998) available in Enterprise
the transformation definitions that can be devel- Architect. This UML profile is shown in Figure

301
Methodical Spatial Database Design with Topological Polygon Structures

4. The transformation from conceptual schema objects based on the information extracted from
to logical schema has been implemented using the data structures that have been built during
Enterprise Architect’s transformation definitions XMI file parsing. In order to test the implemen-
(Sparx Systems, 2008). Existing transformation tation, The design of the spatial database of the
definitions have been customised by modifying provinces of Rwanda has been carried out using
and adding transformation templates to handle the implemented transformational design. The
specific needs especially the transformation of database was designed using the new modeling
concepts specific to spatial data such as spatial data primitives and implemented though the transfor-
types and Spatial Reference System information. mation process. Finally the database was populated
The transformation from logical schema to physi- and the administrative units were manipulated
cal schema has been implemented by developing using the implemented functions that contributed
a custom tool. The approach adopted here is to to the maintenance of topological relationships.
export the final logical schema to XMI (XML
Metadata Interchange) file, an XML (eXtensible
Markup Language)-based file format, which is CONCLUSION AND FUTURE WORK
then taken as input to the custom transforma-
tion tool for the second transformation step. The In this paper we have presented a spatial database
custom transformation tool parses the XMI file design methodology that integrates the mainte-
and generates SQL statements to create database nance of topological integrity constraints into the

Figure 4. UML profile for modelling area collections

302
Methodical Spatial Database Design with Topological Polygon Structures

whole database development cycle. In this respect Dale, N., & Walker, H. M. (1996). Abstract data
a set of modeling primitives have been developed types: Specifications, implementations, and ap-
to model area features that display topological plications. Lexington, MA: Heath and Company.
dependencies, and a semi-automatic transforma-
De By, R. (Ed.). (2004). Principles of geographic
tion process has been developed to simplify the
information systems (3rd ed.). Enschede, The
design task by transforming semi-automatically
Netherlands: ITC Education Textbook Series.
the conceptual schema to logical schema and then
the logical schema to physical schema. The meth- Egenhofer, M., Clementini, E., & Felice, P.
odology is flexible; the new modeling primitives (1994). Topological relations between regions
are easy to use and the database designer has the with holes. International Journal of Geo-
opportunity to influence the transformation for graphical Information Systems, 1, 129–142.
specific needs for instance by choosing among the doi:10.1080/02693799408901990
spaghetti and topological implementation models.
Egenhofer, M., & Franzosa, D. (1991). Point-
The methodology provides a user-friendly way
set topological relations. International Journal
of maintaining topological integrity constraints
for Geographical Information Systems, 1(2),
in a spatial database. The future work will focus
161–174. doi:10.1080/02693799108927841
on developing more maintenance functions with
editing capabilities such that the user will get Elmasri, R., & Navathe, S. (2007). Fundamentals
more support on solving topological integrity of database systems (5th ed.). Reading, MA:
constraint violations in case they would occur. The Addison-Wesley.
future work will also be on improving the area
ESRI. (2002). Building a geodatabase. Redlands,
feature topology classification by considering a
CA: ESRI Press.
specific application area and on the application
of the methodology on the class of hierarchy of Halpin, T. A., & Proper, H. A. (1995). Database
area features. A work is also needed on the imple- schema transformation & optimization. In Pro-
mentation of the methodology by targeting more ceedings of the 14th International Conference on
Database Management Systems and on testing Object-Oriented and Entity-Relationship Model-
maintenance functions for their performance on ing (pp. 191-203).
very large datasets.
Hoel, E., Menon, S., & Morehouse, S. (2003).
Building a robust relational implementation of
topology. In T. Hadzilacos, Y. Manolopoulos, J.
REFERENCES
Roddick, & Y. Theodoridis (Eds.), Proceedings
Ambler, S. (2004). The object primer (3rd ed.). of the 8th International Symposium on Advances
Cambridge, UK: Cambridge University Press. in Spatial and Temporal Databases (LNCS 2750,
doi:10.1017/CBO9780511584077 pp. 508-524).

Belussi, A., Negri, M., & Pelagatti, G. (2004). Jacobson, I., Booch, G., & Rumbaugh, J. (1998).
GEOUML: A geographic conceptual model The unified modeling language reference manual.
defined through specialization of ISO TC211 Reading, MA: Addison-Wesley.
standards. Paper presented at the 10th EC GI & Loney, K. (2004). Oracle database 10g: The
GIS Workshop on ESDI State of the Art, Warsaw, complete reference. New York, NY: McGraw-Hill.
Poland.

303
Methodical Spatial Database Design with Topological Polygon Structures

Marcos, E., Vela, B., & Cavero, J. (2003). A meth- Shekhar, S., Vatsavai, R. R., Chawla, S., & Burk,
odological approach for object-relational database T. E. (1999). Spatial pictogram enhanced concep-
design using UML. Software and Systems Model- tual data models and their translation to logical
ing, 2(1), 59–72. doi:10.1007/s10270-002-0001-y data models. In P. Agouris & A. Stefanidis (Eds.),
Proceedings of the International Workshop on
Miller, J., & Mukerji, J. (2003). MDA guide ver-
Integrated Spatial Databases (LNCS 1737, pp.
sion 1.0.1. Needham, MA: Object Management
77-104).
Group (OMG).
Sparx Systems. (2008). Enterprise architect.
Murray, C. (2003). Oracle spatial topology and
Retrieved from http://www.sparxsystems.com.
network data models, 10g release 1 (10.1). Nashua,
au/products/ea/index.html
NH: Oracle Corporation.
Tveite, H. (2001). Structured modelling for
Pinet, F., Duboisset, M., & Soulignac, V. (2007).
geographical information systems using icons.
Using UML and OCL to maintain the consistency
In Proceedings of the Scandinavian Research
of spatial data in environmental information sys-
Conference on Geographical Information Science
tems. Environmental Modelling & Software, 22(8),
(pp. 185-201).
1217–1220. doi:10.1016/j.envsoft.2006.10.003
Wang, F., & Reinhardt, W. (2007). Extending geo-
PostgreSQL Global Development Group. (2008).
graphic data modeling by adopting constraint de-
PostgreSQL 8.3.0 documentation. Retrieved from
cision table to specify spatial integrity constraints
http://www.postgresql.org/files/documentation/
(pp. 435–454). Berlin, Germany: Springer-Verlag.
pdf/8.3/postgresql-8.3-A4.pdf
Warmer, J., & Kleppe, A. (2003). The object
Ramsey, P. (2005). PostGIS manual. Victoria, BC,
constraint language: Getting your models ready
Canada: Refractions Research Inc.
for MDA (2 ed.). Reading, MA: Addison-Wesley.
Refractions Research. (2005). Postgis topology.
Worboys, M. H., Hearnshaw, H. M., & Maguire,
Retrieved from http://postgis.refractions.net/sup-
D. J. (1990). Object-oriented data modelling for
port/wiki/index.php?PostgisTopology
spatial databases. International Journal of Geo-
Shekhar, S., Chawla, S., Ravada, S., Fetterer, graphical Information Systems, 4(4), 369–383.
A., Liu, X., & Lu, C. (1999). Spatial databases doi:10.1080/02693799008941553
– Accomplishments and research needs. IEEE
Transactions on Knowledge and Data Engineer-
ing, 11(1), 45–55. doi:10.1109/69.755614

This work was previously published in the International Journal of Applied Geospatial Research (IJAGR), Volume 3, Issue 1,
edited by Donald Patrick Albert, pp. 21-30, copyright 2012 by IGI Publishing (an imprint of IGI Global).

304
305

Chapter 20
Methodologies for
Augmented Discovery of
Geospatial Resources
Mattia Santoro Cristiano Fugazza
National Research Council, Italy Institute of Environment and Sustainability, Italy

Paolo Mazzetti Carlos Granell


National Research Council, Italy Universitat Jaume I de Castellón, Spain

Stefano Nativi Laura Díaz


National Research Council, Italy Universitat Jaume I de Castellón, Spain

ABSTRACT
Presently, solutions for geo-information sharing are mainly based on Web technologies, implementing
service-oriented frameworks, and applying open (international or community) standards and interoper-
ability arrangements. Such frameworks take the name of Spatial Data Infrastructures (SDIs). The recent
evolution of the World Wide Web (WWW), with the introduction of the Semantic Web and the Web 2.0
concepts and solutions, made available new applications, architectures, and technologies for sharing and
linking resources published on the Web. Such new technologies can be conveniently applied to enhance
capabilities of present SDIs—in particular, discovery functionality.

Different strategies can be adopted in order to enable new ways of searching geospatial resources, le-
veraging the Semantic Web and Web 2.0 technologies. The authors propose a Discovery Augmentation
Methodology which is essentially driven by the idea of enriching the searchable information that is
associated with geospatial resources. They describe and discuss three different high-level approaches
for discovery augmentation: Provider-based, User-based, and Third-party based. From the analysis of
these approaches, the authors suggest that, due to their flexibility and extensibility, the user-based and
the third-party based approaches result more appropriate for heterogeneous and changing environments
such as the SDI one. For the user-based approach, they describe a conceptual architecture and the main
components centered on the integration of user-generated content in SDIs. For the third-party approach,
the authors describe an architecture enabling semantics-based searches in SDIs.

DOI: 10.4018/978-1-4666-2038-4.ch020

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Methodologies for Augmented Discovery of Geospatial Resources

INTRODUCTION Friis-Christensen, 2007; Lemmens, et al., 2006;


NASA-JPL, 2011).
In recent years, the World Wide Web (WWW) has Information technology and geo-science are
undergone several important changes in terms of worlds in continuous change. Semantics and Web
available applications, architecture, and related 2.0 are the present challenges, but new ones will
technologies. The need for a more effective re- emerge in the future. This raises the conceptual is-
source sharing through the Web raised awareness sue of enhancing geospatial information discovery
on efforts aiming to enable machine-to-machine capabilities in order to accommodate present and,
applications on top of the Web architecture by mak- possibly, future needs. This chapter describes two
ing semantics explicit. These efforts are currently approaches based on the methodology of augment-
coordinated in the W3C Semantic Web Activity ing semantically the discovery process to enhance
which “provides a common framework that allows the search and retrieval of geospatial resources.
data to be shared and reused across application,
enterprise, and community boundaries […]. It is Data Discovery in the Geospatial
based on the Resource Description Framework Information Domain
(RDF)” (W3C, 2011). At the same time, new use
cases, new available applications and technologies It is estimated that more than 80% of data that
have made possible the WWW revolution which human beings have collected so far are geospatial
is known as Web 2.0 (O’Reilly, 2005). This term data in a wide sense (Frankling & Hane, 1992;
actually refers to an entirely new paradigm in MacEachren & Kraak, 2001), i.e. data with an
the use of the Web as a platform for applications explicit or implicit spatial/temporal reference.
characterized by features like: delivery of services Moreover, this spatial/temporal reference is rel-
instead of packaged software, with cost-effective evant, and even fundamental, for many applica-
scalability; control over unique, hard-to-recreate tions. Therefore, it is of major importance to be
data sources that get richer as more people use capable of discovering geospatial data according
them; trusting users as co-developers; harnessing to their content and geospatial characteristics
collective intelligence; leveraging the long tail (i.e., spatial coverage and temporal extent), and
through customer self-service; design of software to effectively describe them.
above the level of a single device; lightweight Presently, solutions for sharing geo-infor-
user interfaces, development models, and busi- mation implement service-oriented frameworks
ness models (O’Reilly, 2005; O’Reilly & Bat- applying open (international or community) stan-
telle, 2009). dards and interoperability arrangements (Nativi,
These two main changes jointly make new 2010). Such frameworks take the name of Spatial
resources available, and new technologies to Data Infrastructures (SDIs). Typically, in service-
discover them through semantic relationships. oriented frameworks such as SDIs, discovery
Unavoidably, these changes would and should functionalities are provided by catalog compo-
affect the geo-information sharing domain that nents. A formal definition of these components
is mostly based on web paradigms and technolo- is given by the ISO/TC211-Geographic Informa-
gies. Recently many efforts aim to provide more tion/Geomatics, stating that a catalog service is a
powerful tools for the discovery of geospatial “service that provides discovery and management
information that is made available through tradi- services on a store of metadata about instances.
tional or Web 2.0 services, basing on explicit or The metadata may be for dataset instances, e.g.,
implicit semantics (Klien, et al., 2004; Smits & dataset catalogue, or may contain service metadata,
e.g., service catalogue” (ISO, 2003b). At present,

306
Methodologies for Augmented Discovery of Geospatial Resources

the Open Geospatial Consortium (OGC1) Catalog publicly available by thousands of users. Such an
Service Specification (OGC, 2007) defines a com- amount of information cannot be ignored when
mon core specification for geospatial resources designing and developing a modern SDI.
discovery providing a consolidated framework. The conceptual challenge behind the above
Services implementing this standard can be que- issues concerns the enablement of new ways of
ried using common geospatial constraints (i.e., searching in present SDIs. To address this, we
basing on what, where, when, who, etc.). define a Discovery Augmentation Methodology.
In addition to standard solutions based on This methodology is essentially driven by the
ISO metadata and data models and OGC Web idea of enriching the searchable information that
Services (OWS), many other de-facto standards is associated with geospatial resources. In fact,
are extensively used, defined in the context of whether we consider semantic information or
specific Communities-of-Practice (CoP), such user-generated content, the current capabilities of
as THREDDS Data Server and OPeNDAP in geospatial catalog solutions need to be enhanced
the Meteo-Ocean domain, Global Biodiversity for enabling searches on additional content.
Resource Discovery Service (GBRDS) in the Next section introduces three high-level
biodiversity domain, and so on. The advent of approaches for discovery augmentation taking
Web 2.0 further widens the geo-information world into account not only semantics and Web 2.0,
with services providing geospatial resources (Geo- but with a more general perspective in order to
Names2, GeoCommons3, OpenStreetMap4, etc.). address possible future needs. In the following
two sections, we analyze in depth the User-based
Statement of the Problem Approach and the Third-Party Approach. For the
former, we describe a conceptual architecture and
In state-of-the-art catalog implementations, while the main components centered on the integration
the information about geospatial reference is of user-generated content in SDIs. For the latter,
usually precise (through the spatial and temporal we describe an architecture based on a third-party
extent expressed with well defined Spatial and approach to enable semantics-based searches in
Temporal Reference Systems), the information SDIs. The chapter is finished with some conclud-
about content is often inadequate for some rel- ing remarks.
evant use-cases. Indeed, it is often expressed us-
ing free text in multiple metadata fields or using
controlled vocabularies for some specific fields DISCOVERY AUGMENTATION
(e.g. keywords), as it happens in the INSPIRE METHODOLOGY: HIGH-
profile (INSPIRE, 2010). This limits the effective- LEVEL APPROACHES
ness of geospatial data discovery because of the
limitations of searching words in free text instead Enhancing geospatial resource discovery capabili-
of matching concepts. Therefore, the fact that ties can be achieved by augmenting the searchable
the same word may have different meanings in descriptions of resources. Examples of additional
different domains, languages, or contexts makes descriptions (that is, something not searchable
the search error-prone. with typical geospatial discovery services) are:
Moreover, existing geospatial catalog solu- semantic information and user-generated anno-
tions do not address the issue of searching and tations. Searching for geospatial resources that
retrieving resources from the emerging world of are constrained by one or more of the previous
Web 2.0 services. In such a context, new content descriptions implies to characterize such resources
and resources are continuously created and made properly. Several approaches can be followed in

307
Methodologies for Augmented Discovery of Geospatial Resources

order to cope with the required additional descrip- consists of making explicit as much as possible
tions. Roughly, these can be classified in: additional metadata description for each resource.
Clearly, this approach is provider-based since the
• Provider-based approach characterization of resources with new searchable
• User-based approach content is completely entrusted to the provider.
• Third-party approach The main advantage of this approach consists of
the high accuracy in the description of resources.
Each of the above approaches has advantages In fact, they are directly supplied by the resource
and drawbacks. To analyze them in the next sec- providers and then allow agents to execute que-
tions it is important to underline which are the ries against the additional description. Thus, an
main issues to be addressed by a discovery aug- authoritative, quality description can be ensured.
mentation solution, in particular: interoperability, On the other hand, it is not always possible
accuracy, and extensibility. to apply this approach. Resources, especially for
Interoperability is critical for the geo-informa- global systems, are maintained by several provid-
tion domain, where large and distributed infra- ers at different stages. Two providers, one the role
structures must manage resources from different of creator and the other as a custodian, should
and heterogeneous scientific domains. A good generate additional descriptions in a consistent
solution for discovery augmentation must then way. This implies extra synchronization tasks
be able to address the different interoperability among diverse providers involved in the life cycle
issues related to interconnecting resources in such of resources, which is not always possible. This
a heterogeneous environment. Secondly, Accuracy might be a quite complex and expensive operation,
of search results is central to all discovery sys- since the continuous changes in geo-information
tems; it should be as high as possible also when domain would require constant updates of large
discovery capabilities are augmented. An impor- repositories. In conclusion, this approach does
tant aspect that impacts on accuracy is metadata not result an extensible solution.
quality because poorly documented resources
(e.g. a wrong translation of a term indicating its User-Based Approach
semantics) may lead to lower precision-recall
values. Finally, extensibility, which is more related This approach moves the task of augmenting
to the augmentation approach. As stated in the the searchable resources from data providers to
introduction, a general approach should be able data users. From this perspective, we address the
to address possible future needs and not only the discovery augmentation in two different ways:
present need raised by semantics interoperability a) augmenting information sources by extending
and Web 2.0 resources. A general augmentation current systems in order to access Web 2.0 ser-
approach that is relatively easy to extend for vices, which are typically based on user-generated
satisfying new requirements is highly desirable. content; and b) by delegating to users the descrip-
tion of resources by adding the so-called resource
Provider-Based Approach annotation capability to resource sharing systems
or providing mechanisms that allow users to an-
This approach represents the straightforward notate resources once they are found.
solution. In this case, data providers enrich This approach distributes the task of enriching
resource metadata by adding related semantic large repositories of metadata to a wide range of
information based on controlled vocabularies users, scaling and making potentially use of a
or even on full ontologies. In fact, this approach much higher amount of knowledge. Allowing users

308
Methodologies for Augmented Discovery of Geospatial Resources

to annotate resource with new meta-information In fact, the business logic necessary to classify
highly increases the available knowledge; how- resources is concentrated in a separate component
ever, this raises the challenge of metadata qual- (provided by a third party) that can be adapted to
ity control. This is still an open issue not only satisfy new requirements without affecting the
in the geospatial domain (Stvilia, et al., 2008), other existing systems.
but it assumes great importance in this domain
as scientific data must be described in a proper
way to support scientific and decision-making AN ARCHITECTURE TO INTEGRATE
applications (Craglia, et al., 2008). AND ANNOTATE USER RESOURCES

Third-Party Approach With the emergence of the Web 2.0, ordinary citi-
zens have begun to produce and share geospatial
The main principle of the third-party approach information on the Internet. These Web 2.0-based
is to build on existing systems, a widely applied geospatial activities show that users are willing
concept in creating SDIs based on the System of to engage more actively in the production and
Systems approach (Global Earth Observation Sys- provision of content. The aim of this section is to
tem of Systems5). The idea is that existing systems describe how web services offering volunteered
continue to operate within their own mandates, geographic content can be considered to augment
because additional capabilities are provided by the number of data sources in Geospatial Infor-
new components that interconnect with existing mation Infrastructures (GII) (Díaz, et al., 2011).
systems generating added value, for instance, Data contained in GIIs are usually produced
additional meta-information to perform searches. and maintained by the official providers, like sci-
In our case, existing systems are: a) currently entific or government institutions that guarantee
available discovery services and b) repositories data quality and completeness. These providers
of additional meta-information such as controlled also register metadata descriptions of data and
vocabularies, ontologies, user-generated annota- resources in standard catalog services. In this
tions, etc. Following this approach, a third-party way, common tasks of GII users (like data search,
component is in charge of classifying existing re- discovery, and evaluation for a particular purpose)
sources according to available meta-information. are performed against these catalog services, since
Clearly, this approach is not as accurate as they contain metadata descriptions that should
the provider-based one. Indeed metadata quality point to the resource itself or the data service
is still ensured by the data provider but the auto- serving it.
matic classification required to elicit additional Web 2.0 services are meant to be easily ac-
information (e.g. semantic information) may be cessible, via specific web sites or APIs, in order
inaccurate in some cases. However, this approach to integrate them in different applications. The
allows to characterize resources with proper ad- contents of Web 2.0 services are mainly user-
ditional meta-information (provided with a basic generated and are being continuously updated by
set of metadata) stored in existing archives and non experts. There is then a lack of authoritative
repositories, without any modification. Moreover, indicators regarding completeness, accuracy, or
it does not prevent from adding explicit new meta- even veracity of data, but on the other hand, it
information on existing resources. can be easily rated, improved, and updated by
Besides, another advantage of this approach users. Due to the easy deployment and publica-
consists of being able to accommodate future tion mechanisms, the rate of participation is high
needs in a relatively easy way, i.e., it is extensible. and then resources are also quite up-to-date. In

309
Methodologies for Augmented Discovery of Geospatial Resources

contrast, GII publication mechanisms are still In the user-based approach, we propose a cou-
complex and users do not have the knowledge to ple of techniques around the concept of tagging:
publish easily new content.
GII is built on services implementing OGC • Augmenting the discoverable sources:
standard interfaces. Service interoperability is Users define tag-based search queries that
reached by applying the same standard interfaces expand over a great range of heterogeneous
to the different components deployed in the in- sources, augmenting the range of potential
frastructure to be used in as many use cases as discoverable sources.
possible, at the cost of certain abstraction level • Annotating the discovered resources: After
and format complexity. The use of OGC standards discovery, when a user annotates and ag-
is then beneficial in terms of integration and in- gregates one or more search results into a
teroperability, though, these can be rather complex collection to improve future searches.
when compared to Web 2.0 services, which are
built upon simple application-level protocols (the Augmenting the
so-called APIs) and lightweight data formats re- Discoverable Sources
lying on simplicity, ease of implementation, and
fast adaptation to user’s needs. Open standards are The first architecture consists of exposing a simple
used where they serve the keep-it-simple principle. common query interface and common response
Each service offers different functionalities, so formats for several discoverable sources. These
each one provides its own public API. Some com- backend sources represent Web 2.0 services, which
mon operations (such as searching and accessing in principle contain distinct types of resources,
content) could benefit from simple standards as both in nature and format. In order to provide
GeoRSS6, GeoJSON7, or KML (OGC, 2008), thus interoperability across these services, increased
increasing interoperability to some extent. data accessibility, and make client implementa-
An application of the user-based approach for tion much easier, we proposed the use of a Web
discovery augmentation is the implementation of 2.0 Broker (Nuñez, et al., 2011) that enables a
user annotation or tagging, which covers a descrip- simple common search interface to query the set
tive perspective in terms of discovering resources of Web 2.0 services. In this context, tags play
through user-generated tags close to the content an essential role in this architecture by allowing
of resources. There are user-centered techniques users to discover heterogeneous resources from
that may improve the discovery experience from different sources.
the user perspective. A recent study (Strohmaier, Figure 1 illustrates the proposed architecture
et al., 2010) differentiates between users who use for tag-based discovery and the augmentation
tags for categorization and those who use tags for of discoverable sources via the Web 2.0 Broker
description purposes. The first group of users is integrated with the EuroGEOSS Discovery Bro-
motivated toward tagging because they want to ker (EuroGEOSS, 2010). This architecture is
construct and maintain a navigational aid for the supported, at least, by the following components:
resources being tagged. This implies a limited,
stable set of tags that represent categories. As the • Recommendation Module, which seeks
tags are very close to the hierarchical, structured recommended tags from previous user’s
representations of certain models (e.g. forestry, queries.
environment, floods, etc.) they can act as suitable • Web 2.0 Broker, which forwards the user’s
facilitators for navigation and browsing. query over a large set of Web 2.0 services.

310
Methodologies for Augmented Discovery of Geospatial Resources

Figure 1. Tag-based discovery and sources augmentation via the web 2.0 broker

In a normal discovery scenario, the user starts recommend other tags that were used in previous
typing the tags that she thinks best describe the queries and have been related somehow with the
information she is looking for. The Qtag in Figure actual tag, the strategy could consider synonyms
1 denotes the initial user query in terms of typed that have been previously highly ranked, highly
tags in an unrestricted manner. In the following, used, etc. In this sense, we benefit from the previ-
we describe the main components involved in a ous tags used by users to perform similar queries,
tag-based query (Qtag) augmented over discover- that is, exploiting the social knowledge in terms
able resources. of tag clouds built by the users. Recent works
(Barragáns-Martínez, et al., 2010) are exploring
Recommendation Module with success the use of social tags to improve rec-
ommendations to users, other works like (Vilches-
The functionality of this module is to help users Blázquez & Corcho, 2009) disambiguate between
choosing the tags to improve search accuracy, on syntactic terms to find semantic correspondences
the basis of historical tags used in previous queries. in known taxonomies. On the other hand, the
The Recommender component takes a tag (Qtag in Query Log component will store and keep track
Figure 1) and provides a list of tags (Q’tag) related of the different tag-based queries performed by
to the input tag. Its aim is to propose related tags users. This allows us to improve the quality and
in function of the actual tag by analyzing histori- accuracy of further tag-based queries.
cal searches. The idea is to find correspondences The Recommendation Module will augment
between tags that, although syntactically different, the original query Qtag into an extended query
could be semantically related. For instance, if a Q’tag ready to be consumed by the main Discovery
user types the tag “fire,” this component should Broker or directly by the Web 2.0 Broker. We

311
Methodologies for Augmented Discovery of Geospatial Resources

also identify some challenges in implementing based on the use of annotation techniques around
this module. One for example is to deal with the concept of collections. This technique makes
multilingual tags. Another challenge is to identify use of the OAI-ORE8 (Open Archive Initiative—
the target fields in the ISO metadata records used Object Reuse & Exchange) abstract model to
to match the tags. Current metadata records are compose and annotate collections of resources that
not annotated with user tags as other Web 2.0 are of interest to the users, along with the use of a
services are. RDF-based repository to persist such collections.
In this context, tags play an essential role in this
Web 2.0 Broker Component: architecture in letting users annotate collections of
OpenSearch as Integration Protocol heterogeneous resources from different sources.
Figure 2 illustrates the proposed architecture
As mentioned, Web 2.0 services offer different for the annotation of collections of search results
contents and functionalities and provide their (discovered resources) through the set of brokers.
own public API. In our approach, we look for Note that this architecture may be an extension
a mechanism to access common functionalities of the previous, to provide added-value func-
(search interface, geographic content data type) by tionalities over the previous tag-based discovery
offering a common entry point and experimenting approach. For this reason, some components are
with simple open standards. shared by both approaches, like the Recommenda-
OpenSearch (Clinton, 2009) and its geo exten- tion and Web 2.0 Broker components:
sion (Turner, 2010; Gonçalves, 2010) is proposed
as the minimal query interface that can be used to • Recommendation Module and Web 2.0
access geospatial content, actually for both Web Broker.
2.0 services and SDI services. In this sense, Open- • Annotation Module, which let users anno-
Search becomes the “common query interface” tate collections of resources and then pub-
to query the set of Web 2.0 services (Figure 1). lish them in a repository.
This would allow for easy client implementations • Repository Module, which persists re-
that could search and integrate all data sources source collections as RDF triples.
regardless of their origin. In addition, the use of
the OpenSearch as a common interface to access Before going into the description of the An-
to all Web 2.0 services greatly alleviates the inte- notation and Repository modules, we describe in
gration with the Discovery Broker. This approach the following the basic notions and entities behind
implies that the Web 2.0 Broker is designed as a the OAI-ORE specification.
set of adapters that transform and propagates the
original OpenSearch query to the different Web OAI-ORE Specification
2.0 APIs. The Web 2.0 broker encompasses all the
adapters for the selected services for the integration The Open Archive Initiative - Object Reuse and
scenario, which for an initial prototype would be: Exchange (OAI-ORE) protocol (OAI, 2008)
Twitter, Flickr, OpenStreetMap, and Geonames. defines an abstract data model (OAI, 2008b) for
describing, reusing, and exchanging collections of
Annotating Discovered Resources Web resources. The OAI-ORE protocol is initially
conceived of in the context of digital libraries and
Complementary to the previous approach, where e-print resources, in order to expose rich content
the Web 2.0 Broker component augmented the set (text, images, data, video, etc.) into aggregations
of discoverable sources, the following approach is to be then reused by client applications.

312
Methodologies for Augmented Discovery of Geospatial Resources

Figure 2. Tag-based discovery and annotation of collections with the broker

Conceptually, the OAI-ORE’s abstract data Aggregation and AR are abstract entities that
model builds strongly on the notion of “address- must still refer to concrete resources, which can
able resources” to indicate that any resource (file, be discovered resources of any type and even a
image, text document, metadata, process, etc.) is chain of geospatial services (Abargues, et al.,
identified using HTTP URI (Uniform Resource 2010). The OAI-ORE specification makes use of
Identifiers9). The simplified diagram in Figure 3 the Resource Map entity to provide a concrete
shows the main entities that form part of the OAI- representation for the whole aggregation, mostly
ORE’s abstract model. The entity Aggregation derived from RDF. Some suggested formats in
plays a central role as it represents a collection the specification are the Atom syndication for-
of addressable resources that in turn are called mat10, RDF/XML11, and RDFa12 (a microformat
Aggregated Resources (AR). The ore:aggregates for extending XHTML to support RDF).
relation denotes the inclusion of related resources Finally, OAI-ORE also defines a useful abstract
in the same collection.. In addition, both Aggrega- entity called Proxy (P) by which it is possible
tions and AR entities are addressable resources in to express the role an aggregated resource has
the sense that both use HTTP URI as referencing explicitly in the context of an aggregation. The
method, i.e., which just means looking up a URI intended meaning of the role is expressed via
on the Web in order to get either the resource itself ore:proxyFor relationships. For example, two
or its representation. resources may have a temporal relationship that
connects to each other and this is only meaning-

313
Methodologies for Augmented Discovery of Geospatial Resources

Figure 3. Simplified OAI-ORE’s abstract data model

ful within the aggregation context in which they (resources) into OAI-ORE-based collections,
are defined. annotate them, and finally publish all this infor-
mation in a repository. Several components are
Annotation Module required to carry out these tasks. The Collection
Annotator (Figure 3) component allows users to
Annotating search results is a means to refine future create a collection from a set of aggregated re-
searches. The use of collections (aggregations) sources. Such resources are the result of a search
of resources combined with user knowledge can made by the Discovery Broker. An interesting
improve the discovery of these concepts and oth- point here is that, by using the OAI-ORE abstract
ers related to them. Currently, there exist different model, the collection is no further a plain list
techniques to model collections or aggregations but a graph or map of resources with typed links
of heterogeneous resources. After some analysis, and established relationships among them and
we selected the OAI-ORE paradigm because it even with entities outside of the collection (i.e.,
defines a straightforward abstract data model for Geonames13 or DBpedia14 entries). Users should
the description and exchange of aggregations/ interact with the Annotation Module to discard
collections of Web resources. The resources that and select only those resources that better fits their
compose an aggregation are identified by their requirements to form a collection. At this point,
URIs. Although OAI-ORE specification does not the user can add extra metadata (tags, terms) to the
specify a concrete serialization format, RDF, and aggregated resources but also and most important
Atom are the preferred ones. to the aggregation itself. Tags at collection-level
The aims of the Annotation Module is to may describe common features of the aggregated
allow the user to group different search results

314
Methodologies for Augmented Discovery of Geospatial Resources

resources and meaningful information that makes OpenSearch interface ensures the communication
only sense as a whole. between the broker and the RDF-aware reposi-
Building a collection amounts to grouping tory. As the collections of resources can appear
a list of resources with regard to some links or in the search results, the contained resources may
relationships. In a first moment, maybe only a point to other related resources, enriching then
basic set of relationships can be set, such as those the search experience. Nevertheless, a dedicated
specifying the internal structure of the aggrega- SPARQL-interfaced accessor would be preferred
tion, and some others regarding the resources’ to grant access to a great amount of public RDF
metadata obtained in the search. These types of triples available elsewhere.
relationships could be incremented by using the
Relationship Generator component (Figure 2)
that generates new ones based on the geometric SEMANTIC AUGMENTATION
and temporal topology that the different resources
exhibit among them. These new relationships Grounding Semantics-
could be generated automatically by gathering Aware Discovery
the resource’s information collected by the broker,
and by calculating the different relationships using Organizing the concepts that web contents (and,
for instance ontologies or vocabularies that could among them, geospatial resources) are referring
formally specify them, similarly to the ones used to is essentially what the Semantic Web (SW)
currently by the Ordnance Survey15. Terms from is all about. Despite the immense work done in
controlled vocabularies and taxonomies might this research field in the last decade, most of the
be also used at this stage to identify aggregated design principles driving the development of the
resources and typed links. SW can be found in the enlightening book, Weav-
As the OAI-ORE defines an abstract model, ing the Web: The Original Design and Ultimate
the newly-created collections need to be serialized Destiny of the World Wide Web by its Inventor
in a concrete format. The Collection Publisher (Berners-Lee & Fischetti, 1999). Since contents
component serializes an abstract collection into on the Web are constituted by (or at least anno-
a RDF or Atom representation to be added in tated by) text, the only means for indexing them
suitable repositories. has traditionally been text-matching criteria. Of
course, hyperlinking of web resources has been
Repository Module playing a predominant role in the ranking of search
results (Brin & Page, 1998); however, hyperlink-
The Repository Module, in right side of Figure2, ing functionalities are not available in all formats
will make available all the information, struc- currently delivered by the Web and, particularly,
tured and annotated as collections, to the user. not in the spatial data sets and services we are
This module will be composed by a specialized addressing, nor in the metadata annotating them.
RDF storage system commonly known as “triple Moreover, text-matching techniques are charac-
store”16. These systems usually offer a SPARQL terised by a number of shortcomings that make
endpoint as a way to access and query the under- them inefficient; among these, the more relevant
lying repository. are related to homonymy, synonymy (biasing,
As the Discovery Broker supports OpenSearch- respectively, precision, and recall in the discovery
interfaced sources (accessors), the OS2SPARQL process) and of course the difficulty of carrying
component will address the translation of Open- out searches in a multilingual context.
Search-based queries into SPARQL syntax. The

315
Methodologies for Augmented Discovery of Geospatial Resources

In a nutshell, the SW consists of statements elements from some widely acknowledged data
that relate resources to each other or, alternatively, schema for structuring publications (here, we
relate resources to literal values (e.g., the string consider the chapter and book elements of the
“foo”); statements are defined by specifying a DocBook format17). The titles associated with
subject, a predicate (or property) and an object the two resources can be rendered by specifying
according to the RDF data model (W3C, 2004), the title Dublin Core metadata term18; instead, for
a triple in RDF jargon. Resources are singled out expressing compositions of chapters in a book
by using URIs, which provide a straightforward we can leverage on predicate aggregates from
means to provide unique identifiers to complex the OAI-ORE standard (OAI-ORE, 2008), that
entities. As an example, consider the sentence has been introduced earlier in this chapter. The
“chapter ‘Methodologies for Augmented Dis- resulting triples are shown in Listing 1.
covery of Geospatial Resources’ is included in Lists of triples are quite verbose and not apt
the book Discovery of Geospatial Resources: to human consumption; a more convenient rep-
Methodologies, Technologies, and Emergent Ap- resentation format is that of directed labelled
plications.” Apparently, the sentence is defining graphs: In this formalism, ellipses represent re-
a single statement relating a chapter to a book. sources (note that, in the SW, everything that is
Instead, if we want to unambiguously define the not a literal value is a resource) whose identifiers
resources involved, we need to specify the URIs are often shortened by substituting a long
representing them, increasing the number of neces- namespace (e.g., “http://example.com/”) with
sary statements. A set of sentences for expressing prefixes (e.g., “ex:”). Rectangles represent literal
this may be the following: values, such as the string ‘Discovery of Geospa-
tial Resources: Methodologies, Technologies, and
• chapter <http://example.com/chapter123> Emergent Applications.’ Finally, arcs represent
is entitled ‘Methodologies for Augmented predicates. Figure 4 is showing the graph repre-
Discovery of Geospatial Resources sentation of the triples in Listing 1. Despite its
• book <http://example.com/book456> simplicity, the RDF model is capable of represent-
is entitled Discovery of Geospatial ing very complex data structures, such as n-ary
Resources: Methodologies, Technologies, relations and statements about statements (that is,
and Emergent Applications considering a whole triple as subject or object of
• resource <http://example.com/chap- a statement) (see Figure 4).
ter123> is included in resource <http://ex- The graph representation of RDF makes ap-
ample.com/book456> parent one of the main strengths of the data
model, that is, its unstructured nature that makes
As customary in RDF triple representation, it possible to seamlessly aggregate information
URIs are distinguished by enclosing them in angle (statements) from heterogeneous sources. How-
brackets. Even in the extended set of sentences, ever, RDF is just the medium for expressing in-
some of the semantics conveyed by the original formation on the SW; for structuring it as spe-
one is still not formalized. Specifically, the infor- cific data models, more standards were layered
mation that the two resources are, respectively, a on top of RDF, with different degrees of expres-
book chapter and a book is missing; also, prop- sive power and computational capabilities. These
erties “is entitled” and “is included in” do not will be briefly introduced in the following section.
explicitly refer to entities that an automated agent
may understand. The former can be expressed by
using the type predicate of RDF for referencing

316
Methodologies for Augmented Discovery of Geospatial Resources

Listing 1. Example of RDF triples

<http://example.com/chapter123>
<http://www.w3.org/1999/02/22-rdf-syntax-ns#type>
<http://www.oasis-open.org/docbook/xml/4.2/docbookx.dtd#chapter>

<http://example.com/book456>
<http://www.w3.org/1999/02/22-rdf-syntax-ns#type>
<http://www.oasis-open.org/docbook/xml/4.2/docbookx.dtd#book>

<http://example.com/chapter123>
<http://purl.org/dc/terms/title>
‘Methodologies for Augmented Discovery of Geospatial Resources’

<http://example.com/book456>
<http://purl.org/dc/terms/title>
‘Discovery of Geospatial Resources: Methodologies, Technologies, and Emer-
gent Applications’

<http://example.com/book456>
<http://www.openarchives.org/ore/terms/aggregates>
<http://example.com/chapter123>

Representing Domain Knowledge may need to characterise book456 as a scientific


book (that is, ascribe the resource to a specific
An aspect of data modelling that the basic RDF data class of resources) and not, for instance, as a novel,
model cannot handle is characterizing the entities as a cooking manual, etc. Still we may want to
that are referred to by triples. As an example, we retain some of the characteristics that all books are

Figure 4. RDF graph induced by the statements in Listing 1

317
Methodologies for Augmented Discovery of Geospatial Resources

sharing, such as the editor, the number of pages, cates in a hierarchal way (e.g., for defining class
the selling price, etc. In a word, we would like to ScientificBook as a specialization of class Book)
structure data in a more object-oriented fashion. but OWL expressiveness is required to express
The schema languages that were defined for the that predicates aggregates and isAggregatedBy
SW, such as the RDF Schema (RDFS) (W3C, are one the inverse of the other. Because of the
2004b) and the Web Ontology Language (OWL) associated inference capabilities, data sources that
(W3C, 2004c), are addressing this requirement; are defined by SW schema languages are typically
schemata defined through the language primitives referred to as Knowledge Bases (KB) (Russell &
defined by RDFS and OWL are typically referred Norvig, 2005).
to as ontologies. Another desirable feature is the A prominent example of OWL ontology in the
capability of structuring predicates and defining geospatial domain is the schema grounding the
relations among them. As an example, OAI-ORE GeoNames geographical database19. Queries to the
defines predicate isAggregatedBy for expressing GeoNames web service can specify the parameter
part-whole relations (not the whole-part relations type=rdf for obtaining results encoded as RDF
expressed by predicate aggregates) but the relation according to the schema defined by the service.
between these (one being the inverse of the other) As an example, Listing 2 is showing a fragment
is not made explicit. The RDF/XML serialisation taken from the RDF response retrieved from the
of an OAI-ORE resource map containing the last service for the search pattern “france.” On the
triple in Listing 1 may, or may not, feature also basis of the associated schema, a reasoner can
the following triple: derive that France (represented by the URI http://
sws.geonames.org/3017382) is a parentFeature
• <http://example.com/chapter123> of Île-de-France because predicate parentCountry
• <http://www.openarchives.org/ore/terms/ is a sub-property of the former. Another interest-
isAggregatedBy> ing feature of GeoNames is that it also provides
• <http://example.com/book456> data as LinkedData: this means that, by accessing
any of the URIs in a query response (such as the
In fact, one of the main advantages of SW data one corresponding to France) a user agent may
formats is that an RDF graph may convey more retrieve the RDF data fragment corresponding to
information than that explicitly stated by the graph the resource. This feature allows clients to selec-
itself. This is due to the semantics underlying the tively navigate the huge RDF graph represented by
schema languages, which defines the entailments GeoNames data without requiring the download
(that is, the logical implications) that shall hold. of the whole resource.
These may be given at different formalization Other than creating data schemas that can be
levels; for example, RDFS provides a set of instantiated by individuals, such as ex:chapter123,
rules indicating exactly which inferences shall SW schema languages can also be used for the
be supported by RDFS reasoners (the automated sole purpose of structuring domain knowledge
agents deriving implications). Instead, most OWL (that is, defining classes and predicates) with only
sub-languages (e.g., OWL-Lite, OWL-DL) reflect few or no individuals (members of a class) in-
the expressive power of some (description) logics stantiating them. This approach allows for a high
(Baader, et al., 2003) and then do not define entail- expressiveness when defining controlled vo-
ments as a finite set of production rules. RDFS cabularies (i.e. thesauri) for modelling a given
and the different flavors of OWL have a well- domain. In the context of SDI-related thesauri,
defined expressiveness. As an example, RDFS is this approach is exemplified by the Semantic Web
expressive enough to arrange classes and predi- for Earth and Environmental Terminology

318
Methodologies for Augmented Discovery of Geospatial Resources

Listing 2. Fragment of the RDF output of the GeoNames web service

<gn:Feature rdf:about=”http://sws.geonames.org/3012874/”>
<rdfs:isDefinedBy>http://sws.geonames.org/3012874/about.rdf</
rdfs:isDefinedBy>
<gn:name>Île-de-France</gn:name>
<gn:alternateName xml:lang=”fr”>Île-de-France</gn:alternateName>
<gn:alternateName xml:lang=”en”>Île-de-France</gn:alternateName>
<gn:alternateName xml:lang=”es”>Isla de Francia</gn:alternateName>
<gn:alternateName xml:lang=”de”>Île-de-France</gn:alternateName>
<gn:featureClass rdf:resource=”http://www.geonames.org/ontology#A”/>
<gn:featureCode rdf:resource=”http://www.geonames.org/ontology#A.ADM1”/>
<gn:countryCode>FR</gn:countryCode>
<gn:population>11598866</gn:population>
<wgs84_pos:lat>48.5</wgs84_pos:lat>
<wgs84_pos:long>2.5</wgs84_pos:long>
<gn:parentCountry rdf:resource=”http://sws.geonames.org/3017382/”/>
<gn:childrenFeatures rdf:resource=”http://sws.geonames.org/3012874/con-
tains.rdf”/>
<gn:locationMap rdf:resource=”http://www.geonames.org/3012874/region-ile-
de-france.html”/>
</gn:Feature>

(SWEET) (NASA-JPL, 2011), a set of 201 Developing Thesauri for SDIs


highly modularised OWL ontologies whose terms
are expressed as Class instances and arranged The discovery of geospatial data and services on
hierarchically by using predicate subClassOf from the Internet is a topic that, despite the hype nour-
RDFS. The lexical representation of terms is ished by the novel notion of SDI (Masser, 2005),
constituted by the fragment identifiers of URIs still lacks efficient technologies and techniques for
(e.g., “#ClimateChange”). The exploitation of enabling an effective retrieval of resources. The
constructs from the meta-language (OWL) allows non-textual nature of this category of resources,
for defining custom predicates that class in- the scarce set of metadata that is often annotat-
stances may have and constrain ontology con- ing them, the multilingual gap (and in general
structs according to them. As an example, Listing the linguistic issues hindering the indexing of all
3 is portraying the definition of class #Mesoclimate categories of resources) motivates the recourse to
whose members are defined as either instances specialized applications, geoportals (Maguire &
of super-class #Climate and also instances whose Longley, 2005), for retrieval of spatial data and
property #hasSpatialScale takes values from class services. Even when an exhaustive set of metadata
#Mesoscale. is provided, the distance between the terminology
adopted by the domain expert during annotation
and that of the end user (which can be herself a
thematic user, but coming from a different do-
main) makes it difficult to reconcile the metadata

319
Methodologies for Augmented Discovery of Geospatial Resources

Listing 3. Definition of class #Mesoclimate in the SWEET ontology

<owl:Class rdf:about=”#Mesoclimate”>
<rdfs:subClassOf rdf:resource=”#Climate”/>
<rdfs:subClassOf>
<owl:Restriction>
<owl:onProperty rdf:resource=”&scale;#hasSpatialScale”/>
<owl:allValuesFrom rdf:resource=”&scale;#Mesoscale”/>
</owl:Restriction>
</rdfs:subClassOf>
<rdfs:comment xml:lang=”en”>The climate of a natural region of small ex-
tent, for example, valley, forest, plantation, and park. Because of subtle
differences in elevation and exposure, the climate may not be representative
of the general climate of the region.</rdfs:comment>
</owl:Class>

descriptions on the one hand and the search pat- as for the SWEET ontologies, may be awkward
terns on the other. Also, specifying metadata in to domain experts because of the inherent tech-
a language that is different from that of the end nicalities. Moreover, applying inference to large
user may utterly prevent retrieval of a resource. schema definitions may be particularly expensive
Albeit easing access to geospatial information, on the computational side.
geoportals often fall short of providing advanced Instead, a different approach that is gaining
search functionalities that allow the end user for wide acceptance is constituted by relying on a
bridging this gap. fixed OWL schema and articulate thesauri by
Acknowledging these shortcomings, the SDI instantiating the classes and predicates defined by
community is increasingly considering recourse the former. The clear advantage with respect to
to thesauri as one of the most promising means to the previously described approach is the capabil-
annotate resources in a consistent way and to ease ity of bounding the complexity of data structures
retrieval of this category of resources in a multi- on the basis of the underlying schema. As for the
lingual and cross-thematic context. In particular, expressiveness, it is depending on the specific
thesauri based on SW formats provide language- data schema that is taken into consideration. The
neutral identifiers (URIs) that can be related with research on Knowledge Organisation Systems
distinct syntactic forms in order to be selected for (KOS) (Bechhofer & Goble, 2001) is focusing on
annotation through multiple, language-dependant the development of thesauri organizing terms in
textual representations. Also, the internal structure a coherent way. The intuition this research elabo-
of thesauri (relating terms with each other ac- rated on is that it is possible to take advantage of
cording to specificity and relatedness) allows for the Semantic Web infrastructure in a lightweight,
advanced functionalities such as query refinement easily implementable way. The Simple Knowledge
on the basis of semantic properties among terms. Organisation System (SKOS20) constitutes the
Finally, by aligning independent thesauri produced main output of this research thread: it is a low
in distinct thematic areas, it is possible to easily complexity ontology that allows to easily structure
bridge between different domains. Using SW terms (also providing translations into multiple
schema languages for articulating thesauri, such languages), to group them into collections of terms

320
Methodologies for Augmented Discovery of Geospatial Resources

Figure 5. The SKOS ontology: a) class hierarchy


and to relate terms from independent thesauri to
and b) property hierarchy
one another.

The Simple Knowledge


Organisation System

SKOS represents a breakthrough in knowledge


organisation inasmuch it provides an exhaustive
set of constructors for creating thesauri without
involving the technicalities of more expressive
ontology schema languages. Making a comparison
with XML technologies, it is like exploiting an
XML custom vocabulary for expressing a given
category of data structures versus dealing with
XML Schema constructs for expressing them.
Figure 5 is showing a general view of the entities
defined by SKOS, as rendered in the Protégé on-
tology editor21 (the names in bold building on the
entities of OWL and RDFS). Class ConceptScheme
is used for creating a whole thesaurus; Collection
and OrderedCollection allow to group terms;
finally, class Concept represent individual terms
in the thesaurus (see Figure 5).
More complex is the organisation of properties.
A set of properties can be used to associate labels
(that is, human-readable text representations) to
terms that are, by themselves, represented by
URIs: prefLabel, altLabel, and hiddenLabel rep-
but are typically not introduced explicitly because
resent, respectively, preferred labels, alternative
it is up to the underlying reasoning facilities to
labels and labels that are possible misspellings of
deduce them. As we will see, all categorisations
term labels. A second set of properties allow the
of SDI-related terms considered in this Chapter
grouping of entities in a thesaurus: properties
are based on SKOS or on an extension to it.
inScheme and topConceptOf allow to associate
terms (Concepts) with the thesaurus (Con-
SKOS Thesauri in the SDI-Domain
ceptScheme) they belong to and allow to specify,
among terms, which are the more general. Fi-
The development of thesauri is a key factor for
nally, a set of properties can be used to categorise
geographic information retrieval and led to the
terms according to specificity (broader, nar-
inclusion in the GEO Work Plan 2009-2011 (GEO,
rower) and relatedness (related); their variants
2010) of a sub-task of the GEOSS Common In-
terminating with “Match,” together with proper-
frastructure devoted to ontology and taxonomy
ties exactMatch and closeMatch, are used for
development. Early adopters of SKOS as either
relating terms from distinct thesauri while those
the native or an alternative format for represent-
terminating with “Transitive” are introduced to
ing thesauri include the United Nations Food and
support the transitive closure of sub-properties

321
Methodologies for Augmented Discovery of Geospatial Resources

Agriculture Organization which is maintaining Meteorology. In general, since knowledge man-


AGROVOC (FAO, 2011), a multilingual resource agement has a direct application in the internal
comprising over 28.000 terms covering all sub- functioning of companies and institutions, more
ject fields in agriculture, forestry, fisheries, food and more domain-specific thesauri are expected
and related domains. The United States National to be developed in the future.
Agricultural Library is maintaining a bilingual ag- With regard to the thesauri that were exploited
ricultural thesaurus made up by more than 80.000 as reference thesauri during development of the
terms (USDA). In Europe, the GEMET Thesaurus semantics-aware component that is described in
(EIONET, 2011) is providing a narrower set of this Section, the source that was taken into con-
terms (around 6000 terms) but is covering all sideration has been the RDF repository developed
languages spoken in the European Union. in the context of the GENESIS FP7 project25. The
Also in Europe, the EU Publications Office is repository is an instance of the Sesame frame-
providing the SKOS version of EuroVoc (2011), a work26, an open source Java web application for
multilingual and multidisciplinary thesaurus cov- storing and querying RDF data, that is currently
ering all areas of activity of the EU. Recently, the hosting the following thesauri in the SKOS format:
categorization of Societal Benefit Areas (SBAs)
underlying GEOSS has been encoded as SKOS • The General Multilingual Environmental
and translated into four more languages in the Thesaurus (GEMET27): 28 of the 31 lan-
context of the EuroGEOSS FP7 project22. Even if guages currently provided by the EIONET
this resource can barely be considered a thesaurus, portal.
since it is featuring only 68 terms, it constitutes • The INSPIRE Feature Concept Dictionary
a good example of how resources not natively and Glossary: 21 of the 23 EU official lan-
encoded as SKOS can be easily translated and guages for INSPIRE Themes, monolingual
extended in order to support multiple languages. the other terms.
Also, a large amount of data sets and services are • The ISO 19119 categorisation of spatial
expected to be categorized according to SBAs data services: 21 of the 23 EU official
and, consequently, the availability of this small languages.
thesaurus, when properly interconnected with • The GEOSS Societal Benefit Areas: 5
other reference thesauri, may prove very useful languages.
for resource discovery.
Finally, several other institutions and proj- However, as explained in the following
ects are similarly digitising SDI-related domain Section, in principle any SPARQL-compliant
knowledge by using SKOS as the encoding repository can be seamlessly integrated and also
format. Starting from the CUAHSI Hydrologic sources based on a different protocol can be easily
Information System23, a thematic thesaurus for integrated. Another important precondition for
water has been developed in GEOSS AIP-3 applying the semantic augmentation paradigm
and is currently being translated into SKOS for to the retrieval of geospatial resources was the
inclusion in the knowledge base considered in harmonization of the distinct thesauri by align-
this Chapter. The Australian agency CSIRO is ing corresponding terms among the resources
developing vocabulary services24 for thematic listed above.
thesauri that are made accessible through an API
akin to the one implemented by GEMET; among
these, the extensive water thesaurus derived from
the WDTF schema of the Australian Bureau of

322
Methodologies for Augmented Discovery of Geospatial Resources

An Architecture for Transparent pean approach to GEOSS) project28 to provide


Semantic Augmentation semantics-aware capabilities to the EuroGEOSS
discovery broker (EuroGEOSS, 2010; Nativi &
One possible concrete application of the third- Bigagli, 2009).
party discovery augmentation approach is to en-
able semantic discovery of geospatial resources. System Architecture Overview
The objective is to develop a flexible framework
that aims to provide users with semantics-aware The transparent semantic augmentation frame-
query capabilities to discover geospatial resources work was designed applying the following general
from one (or more) traditional/standard discovery principles:
services. The semantic augmentation mechanism
is transparent to resource providers because no • Layered Architecture;
additional meta-information needs to be explicitly • Separation of Concerns;
added to existing metadata repositories. • System of Systems (SoS) Approach.
According to the third party approach, the
framework we designed makes use of a specific The layered architecture is a well-known
Discovery Augmentation Component (DAC) approach for designing complex systems (ISO,
that implements the business logic required for 1994), where functionalities are grouped and lay-
semantic augmentation. The DAC implements ered according to their abstraction level. Figure 6
a query expansion strategy according to which shows the three layers of the proposed architecture,
multiple, traditional geospatial queries are derived following the SOA taxonomy:
from a single semantic query. Existing semantic
services are used to expand queries and combine • Service Consumer Layer: This layer
the related results in a meaningful way. implements presentation functionalities, in
In fact, the DAC is able to accept a query, ex- fact it is comprised of components which
pand it as multiple, semantically-related queries by implement Graphical User Interfaces
accessing a customizable set of external semantic (GUIs);
services (thesauri, ontologies, gazetteers, etc.), • Semantic Augmentation Service Layer:
and finally issue them to a geospatial discovery This layer is composed of service compo-
service. The DAC combines query results in a nents which implement the business logic
meaningful way and sends them back to the client. necessary to integrate semantic and geo-
This framework realizes the Information spatial services;
Expert design pattern (Larman, 2005) assigning • Query Service Provider Layer: This lay-
specific tasks (responsibilities) to components er contains services providing both seman-
that have the information needed to carry out the tic and geospatial query functionalities.
task, making the architecture flexible and scalable.
Moreover, it does not affect existing geospatial The DAC clearly falls into the semantic aug-
services interfaces by implementing a loosely- mentation service layer, which makes use of the
coupled solution. However, in order to enable query service provider layer.
this architecture, specific solutions are required in By separating the concerns, we obtain a
order to define the exposed interface, to address set of “dedicated” components: each of them
resources heterogeneity and performances issues. implementing a precise and well-defined set of
This framework was implemented and tested functionalities. The catalog service component
in the context of the FP7 EuroGEOSS (A Euro- is dedicated to execute “traditional” geospatial

323
Methodologies for Augmented Discovery of Geospatial Resources

Figure 6. Architecture of the transparent semantic augmentation framework

queries. The semantic service provides access to sure interoperability with existing systems. For
semantic repositories (e.g. thesauri, ontologies, the interaction between the DAC and the catalog
etc.). The DAC is in charge of executing the query service, the OGC CSW/ISO AP (OGC, 2007b)
expansion (accessing the semantic services) and (Application Profile) interface is used. Among
distributes the resulting set of “traditional” queries the present OGC CSW APs, ISO AP is presently
to the federated catalog service. Finally, the user one of the most widely implemented. Besides, it is
agent provides results visualization. the INSPIRE-compliant catalog service interface.
The System of Systems approach implies the The query syntax to access the semantic service
capability of connecting with existing systems follows the W3C standard SPARQL (W3C, 2008).
without any modification to their interfaces and/ In keeping with the SoS approach, the DAC was
or functionalities. This is achieved by providing conceived to be flexible enough to interoperate
interoperability arrangements to mediate from a with other query languages published by different
geospatial semantic query to a set of semantic types of semantic services.
queries and traditional geospatial queries. The DAC publishes an interface for user agent
The choice of the service interfaces was mainly bindings. This interface allows clients to submit
driven by the need of being compliant with widely queries with a possible combination of semantic,
adopted catalog service specifications and to en- geospatial, and free text constraints. At the time

324
Methodologies for Augmented Discovery of Geospatial Resources

being, there is no standard interface or syntax sidering the basic OpenSearch interface, three
allowing such combined queries. Hence, we de- additional semantic parameters can be specified
cided to introduce an extension of the OpenSearch in the query:
interface (Clinton, 2009). The OpenSearch is a
lightweight interface that allows agents to query A. Subject
catalogs with a simple free text search. There B. Predicate (semantic relation)
exist several extensions of the basic OpenSearch C. Object
syntax; two widely used extensions for geospatial
queries are: All parameters are optional and can be com-
bined with the free text (searchTerm) parameter
• The Geo extension (Turner, 2010; in many different ways. The three parameters
Gonçalves, 2010): this allows agents to are used to represent an RDF statement that the
specify a spatial extent/location as con- selected concepts must satisfy. When the search-
straint in a query; Term (free text) parameter is also specified, the
• The Time extension (Gonçalves, 2010): selected concepts are filtered to match the given
this allows agents to build queries based searchTerm—that is, concepts in whose descrip-
on time and time span constraints. tions the free text (searchTerm) appears.
The three semantic parameters represent se-
We introduced a new extension called: “Con- mantic concepts; thus, they can be represented in
cept-driven extension.” This allows the discovery different ways by semantic services. When the
of well-defined concepts and their relations from user agent and the semantic service make use
semantic services. The DAC query interface of different representations, the DAC is able to
implements all these three extensions. implement the necessary mediation functionality.
Examples of possible query parameters com-
OpenSearch Concept-Driven binations are given in Tables 1, 2, 3, 4. They use
Extension and Query Expansion the URI “http://www.eionet.europa.eu/gemet/
concept/9221” from the GEMET Thesaurus
The introduced extension is based on the concept for representing the semantic concept “water
of RDF triple (see see paragraph ‘Grounding resource.”
semantics-aware discovery’) (W3C, 2004). Con-

Table 1. Semantic query parameters, example 1

searchTerm Subject Predicate Object


drought http://www.eionet.europa.eu/gemet/concept/9221 skos:narrower -
This query selects all concepts more specific than the concept “water resource” and whose description contain the keyword “drought”

Table 2. Semantic query parameters, example 2

searchTerm Subject Predicate Object


- http://www.eionet.europa.eu/gemet/concept/9221 skos:narrower -
This query selects all concepts more specific than the concept “water resource”

325
Methodologies for Augmented Discovery of Geospatial Resources

Table 3. Semantic query parameters, example 3

searchTerm Subject Predicate Object


drought http://www.eionet.europa.eu/gemet/concept/9221 - -
This query selects all concepts which are related to the concept “water resource” according to the DAC default relation (customizable)
and whose description contain the keyword “drought”

Table 4. Semantic query parameters, example 4

searchTerm Subject Predicate Object


drought - - -
This query selects all concepts whose description contain the keyword “drought”

The implementations of the “Concept-driven Using the concept-driven extension, it was


extension” must support at least the GeoRSS and possible to implement two different types of
X-Suggestion29 return types. GeoRSS is a recent query expansion:
extension of the RSS30 feed format; it enriches
traditional feeds with geographic and temporal 1. Automatic query expansion
information. When the query asks for a GeoRSS 2. User-assisted query expansion
return type, the DAC builds a set of traditional
geospatial queries using the multilingual text For the first type, the main discovery steps are
representation of the selected concepts in order depicted in Figure 7.
to retrieve metadata records form the catalog
service. A. The query keywords (the what constraint)
In order to create requests such as those in are “expanded” with the concepts retrieved
Table 1, 2, 3 the client software must be able to from the semantic services accessed by the
retrieve the URIs (or any other kind of represen- DAC;
tation of semantic concepts) from the semantic B. The selected concepts are used to build a
service. To achieve this, the X-Suggestion return set of “traditional” geospatial queries to be
type is used. When this return type is requested, submitted to the catalog service accessed
the DAC selects the concepts without submit- by the DAC;
ting queries to the catalog service. The selected C. The DAC performs a “smart” aggregation of
concepts and their textual description are returned the queries results and provides them back
to the components of the service consumer layer to the client (user agent).
(user agents) in order to be viewed by the user
and, eventually, used in subsequent queries to the The second type differs from the first one only
DAC. This feature enables the use of the DAC for the first step. In fact, in this case, the selection
for browsing the content of the semantic service. of the concepts of interest is not automatic; the
More return types can be defined and imple- user can freely navigate the content of the seman-
mented for addressing specific needs. Examples tic services and select which concepts will be used
of suitable return types are: XML documents for to query the catalog service. Figure 8 depicts the
RDF graphs, ISO 19115 (ISO, 2003), metadata steps of this second type of query expansion
encodings for geospatial resources.

326
Methodologies for Augmented Discovery of Geospatial Resources

Figure 7. Sequence diagram of the automatic query expansion

Figure 8. Sequence diagram of the user-assisted query expansion

327
Methodologies for Augmented Discovery of Geospatial Resources

Discovery Augmentation Component The design of the DAC internal modules is


based on the Mediation pattern: a central module
In order to implement the architecture described (the mediator) orchestrates a set of modules that
above, the DAC design is crucial. As for the operate independently from each other. This way,
functional requirements, the main functionality of it is possible to concentrate the business logic that
the DAC is the implementation of the two query is required to expand the query into one central
expansion strategies; the DAC must be able to: module: the Orchestrator. The other modules
provide specific functionalities, such as generat-
• Interpret a semantic geospatial query; ing and executing semantic queries, generating
• Expand the semantic geospatial query into geospatial queries, executing geospatial queries,
one or more semantic queries; and transforming the results to the desired return
• Build a set of geospatial queries and sub- type. In order to achieve the desired flexibility,
mit them to the catalog service; communication between the Orchestrator and
• Combine the results of a set of geospatial the other modules is completely decoupled. The
queries in a meaningful way; Orchestrator module interprets the semantic geo-
• Transform the results to the requested re- spatial query and, accordingly, calls the necessary
turn type. modules in the right order to process the request.
Figure 9 depicts the main DAC internal modules.

Figure 9. DAC internal modules

328
Methodologies for Augmented Discovery of Geospatial Resources

The OpenSearch Profiler reads incoming re- The CSW Manager module communicates with
quests, checks for validity, and forwards them to the catalog service. It should be noted that this
the Orchestrator in a more structured form. These component does not make use of any adaptation
functionalities must not be implemented by the functionalities. In fact, solutions for mediating
Orchestrator to guarantee more flexibility; in fact, different types of geospatial discovery services
the Orchestrator must be agnostic to the com- already exist (EuroGEOSS, 2010). Thus, where
munication protocols published by the DAC. Thus, necessary, they should be coupled with the DAC.
future useful protocols can be added just by imple- Finally, the Result Converter module encodes
menting new profilers. the results (which can be both metadata records
The Semantic Service Manager is in charge of and semantic concepts) into the desired return type.
generating and submitting requests to the semantic The DAC data model considers two main
service(s). This module makes of use of a set of information elements:
Adapters to manage semantic services heterogene-
ity. These adapting modules mediate between the • Metadata Records
internal Semantic Service Manager interface and • RDF elements (concepts and semantic
the specific semantic service query language (e.g. relations).
SPARQL). Thus, in order to add a new semantic
service type (e.g. a gazetteer service), it is suf- Figure 10 depicts a simplified schema of the
ficient to implement the corresponding Adapter. DAC data model. This model connects ISO meta-
The OpenSearch2GetRecords Mapper module data records and concepts represented as RDF
is able to read the non-semantic parameters of the nodes, allowing building an RDF graph between
request (i.e. spatial and temporal constraints) and metadata records (see Figure 11).
create an ISO GetRecords request—which can be Referring to Figure 11, relations among meta-
submitted to a standard OGC CSW/ISO catalog. data records are more “relaxed” than those ob-
Given an ISO GetRecords request and a set of tained by linking the metadata elements directly:
concepts (represented as terms), the CSW Request Figure 12a) depicts the case of direct links; while
Builder module expands the original request gen- Figure 12b) shows the case of metadata elements
erating a set of related ISO GetRecords requests. linked through concepts.

Figure 10. DAC data model

329
Methodologies for Augmented Discovery of Geospatial Resources

Figure 11. Graph connecting concepts and metadata records

In the design of the data model, an important concept if this appears in one of metadata record
choice is the strategy adopted to link metadata “keyword” elements OR in any other textual ele-
records to concepts. In fact, these two kinds of ele- ment of the metadata record.
ments can be linked in several ways, determining
the reliability of the Relaxed Relation. Basically,
this can be decided case by case depending on CONCLUSION
the overall system needs.
Metadata records are composed of several ele- In this chapter, we discussed the enhancement
ments (e.g. title, abstract, keywords, etc). These of geospatial resources discovery services to
elements are used to link the metadata records to support semantic queries. After explaining the
one or more matching semantic concepts—see general vision, we exposed specific issues related
Figure 12 to the semantic discovery of existing geospatial
One strategy is to match the “keyword” ele- resources provided through standards services and
ment content to one or more semantic concepts. Web2.0 services. Methodologies, architectures,
This strategy generates a very reliable Relaxed and current experimentations are also discussed.
Relation, resulting quite restrictive however. A possible methodology to enhance discovery
A more general but less reliable strategy capabilities of SDIs is to augment the searchable
consists of taking into account also other meta- content that is associated with geospatial resources.
data elements such as “title,” “abstract,” etc. We We have defined three high-level approaches for
adopted this strategy, defining the link between providing the required additional meta-informa-
metadata records and semantic concepts as it fol- tion: provider-based, user-based, and third party.
lows: a metadata record is linked to a semantic We have selected the more extensible and flex-

330
Methodologies for Augmented Discovery of Geospatial Resources

Figure 12. Relations between metadata records in the DAC data model

ible ones: the user-based approach, and the third concepts related to the free-text constraint in the
party approach. original query. This architecture is centered on a
Two architectures have been presented, each third party component, the Discovery Augmenta-
implementing one of the selected approaches. In tion Component (DAC), which implements the
the first one, regarding the user-based approach, business logic needed to integrate semantic and
the architecture makes use of tags in two different geospatial capacities.
but complementary scenarios. The first scenario The two architectures have been prototyped in
considers tagging at discovery time, when users the context of the EC-funded EuroGEOSS project
launch a query based on a set of tags. These tags and are part of the EuroGEOSS multidisciplinary
are either introduced by the users or suggested by interoperability infrastructure. This was success-
the system given the pool of previous tags used fully tested in several use scenarios of the GEOSS
by others. In this case, the previous queries give AIP-3 (Architecture Implementation Pilot—Phase
clues to non-expert users to successfully discovery 3) in collaboration with the GENESIS project
geospatial resources. The second scenario involves (Nativi, et al., 2011; Fugazza, et al., 2011; Pozzi,
the grouping of the resources that are discovered et al., 2011). Demonstration videos are available
into collections. Users are allowed to annotate at http://www.ogcnetwork.net/pub/ogcnetwork/
with tags the collection of discovered resources GEOSS/AIP3/pages/Demo.html.
on the basis of certain relationships. In this case, With regard to the user-centered approach,
not only individual resources are annotated but future research threads will be the inclusion of
whole collections that contain resources that are new Web 2.0 services, exchange formats and
related to the same topics. protocols, and the potential integration with of-
The second solution we presented implements ficial SDI content, the recommender module is
the third party approach in order to extend tra- still an open issue and existing and innovative
ditional discovery functionalities with semantic techniques must be tested.
capabilities. Leveraging the Semantic Web With regard to the DAC, future work will
approach, we described a flexible framework extend the set of semantic service interfaces that
that transparently augments user queries with are currently supported. Besides, new controlled

331
Methodologies for Augmented Discovery of Geospatial Resources

vocabularies and ontologies will be integrated. In Clinton, D. (2009). OpenSearch 1.1 specification
our experimentation, the alignment of different (draft 4). Retrieved June 8, 2011, from http://www.
thesauri and ontologies was carried out manu- opensearch.org /Specifications/OpenSearch/1.1/
ally by domain experts; an automatic alignment Draft_4.
approach for the DAC is another challenge that
Craglia, M., Goodchhild, M. F., Annoni, A.,
will be considered in the future.
Camara, G., Gould, M., & Kuhn, W. (2008).
Next-generation digital earth: A position paper
from the Vespucci initiative for the advancement
ACKNOWLEDGMENT
of geographic information science. International
Journal of Spatial Data Infrastructures Research,
The work was partially funded by the European
3, 146–167.
Commission’s Seventh Framework Programme
under contracts 226487 and 223996. Díaz, L., Granell, C., Gould, M., & Huerta,
J. (2011). Managing user generated informa-
tion in geospatial cyberinfrastructures. Future
REFERENCES Generation Computer Systems, 27(3), 304–314.
doi:10.1016/j.future.2010.09.002
Abargues, C., Granell, C., Díaz, L., & Huerta, J.
(2010). Aggregating geoprocessing services using EIONET. (2011). The general multilingual envi-
the OAI-ORE data model. International Journal ronmental thesaurus (GEMET). Retrieved May 20,
on Advances in Intelligent Systems, 3(3-4). 2011, from http://www.eionet.europa.eu /gemet.

Baader, F., Calvanese, D., McGuinness, D., Nardi, EuroGEOSS WP2. (2010). Deliverable 2.2.2
D., & Patel-Schneider, P. (2003). The descrip- – Specification of the EuroGEOSS initial op-
tion logic handbook: Theory, implementation erating capacity. Retrieved May 20, 2011,
and applications. Cambridge, UK: Cambridge from http://www.eurogeoss.eu /Documents/
University Press. EuroGEOSS_D_2_2_2.pdf.

Barragáns-Martínez, A. B., Rey-López, M., EuroVoc. (2011). EuroVoc – The EU’s multilingual
Costa-Montenegro, E., Mikic-Fonte, F. A., Bur- thesaurus. Retrieved May 20, 2011, from http://
guillo, J. C., & Peleteiro, A. (2010). Exploitation eurovoc.europa.eu/drupal/.
of social tagging for outperforming traditional FAO. (2011). FAO – The AGROVOC thesaurus.
recommender system techniques. IEEE Internet Retrieved May 20, 2011, from http://aims.fao.org
Computing, 14(6), 23–30. /website/ AGROVOC-Thesaurus/sub.
Berners-Lee, T., & Fischetti, M. (1999). Weaving Franklin, C., & Hane, P. (1992). An introduction
the web: The original design and ultimate destiny to GIS: Linking maps to databases. Database,
of the world wide web by its inventor. San Fran- 15(2), 17–22.
cisco, CA: Harper.
Fugazza, C., Nagai, M., Nativi, S., Santoro, M.,
Brin, S., & Page, L. (1998). The anatomy of a & Pozzi, W. (2011). GEOSS AIP-3 engineering
large-scale hypertextual web search engine. Paper report – Vocabularies and semantics scenario.
presented at the Seventh International World-Wide Retrieved May 20, 2011, from http://www.ogc-
Web Conference. New York, NY. network.net /pub/ogcnetwork/GEOSS/AIP3/
documents/AIPscenario_Semantics_1.5.pdf.

332
Methodologies for Augmented Discovery of Geospatial Resources

Gonçalves, P. (2010). OpenGIS® OpenSearch Lemmens, R., Wytzisk, A., de By, R., Granell, C.,
geospatial extensions draft implementation stan- Gould, M., & van Oosterom, P. (2006). Integrat-
dard, ver 0.0.2. OGC 09-084r3. Retrieved from ing semantic and syntactic descriptions to chain
http://www.ogcnetwork.net. geographic services. IEEE Internet Computing,
10(5), 42–52. doi:10.1109/MIC.2006.106
Group on Earth Observation. (2010). GEOSS
– 2009-2011 work plan. Retrieved May 20, MacEachren, A. M., & Kraak, M. J. (2001).
2011, from http://www.earthobservations. Research challenges in geovisualization. Cartog-
org /documents/work%20plan/ geo_wp0911_ raphy and Geographic Information Science, 28,
rev3_101208.pdf. 3–12. doi:10.1559/152304001782173970
INSPIRE. (2010). INSPIRE metadata implement- Maguire, D. J., & Longley, P. A. (2005). The
ing rules: Technical guidelines based on EN ISO emergence of geoportals and their role in spatial
19115 and EN ISO 19119, version 1.2. Retrieved data infrastructures. Computers, Environment
May 20, 2011, from http://inspire.jrc.ec.europa.eu and Urban Systems, 29, 13–14.
/documents/Metadata/ INSPIRE_MD_IR_and_
Masser, I. (2005). GIS worlds: Creating spatial
ISO_v1_2_20100616.pdf.
data infrastructures. New York, NY: ESRI Press.
International Organization for Standardization.
NASA-JPL. (2011). Semantic web for earth and
(1994). Information technology -- Open systems
environmental terminology (SWEET). Retrieved
interconnection -- Basic reference model: The
May 20, 2011, from http://sweet.jpl.nasa.gov/.
basic model. ISO/IEC 7498-1. Geneva, Switzer-
land: ISO. Nativi, S. (2010). The implementation of interna-
tional of geospatial standards for earth and space
International Organization for Standardization.
sciences. International Journal of Digital Earth,
(2003a). Geographic information – Metadata. [E]
3(1), 2–13. doi:10.1080/17538941003764412
[. Geneva, Switzerland: ISO.]. ISO, 19115, 2003.
Nativi, S., & Bigagli, L. (2009). Discovery, me-
International Organization for Standardization.
diation, and access services for earth observation
(2003b). Geographic information – Services.
data. IEEE Journal of Selected Topics in Applied
ISO/IS, 19119:2003. Geneva, Switzerland: ISO.
Earth Observations and Remote Sensing, 2(4),
Klien, E., Einspanier, U., Lutz, M., & Hubner, S. 233–240. doi:10.1109/JSTARS.2009.2028584
(2004). An architecture for ontology-based discov-
Nativi, S., Santoro, M., Dubois, G., Skoien, J.
ery and retrieval of geographic information. In F.
O., De Jesus, J., De Longueville, B., & Fugazza,
Toppen & P. Prastacos (Eds.), Proceedings of the
C. (2011). GEOSS AIP-3 engineering report –
Seventh Conference on Geographic Information
eHabitat. Retrieved May 20, 2011, from http://
Science (AGILE 2004), (pp. 179-188). Heraklion,
www.ogcnetwork.net /pub/ogcnetwork/GEOSS/
Greece: Crete University Press.
AIP3/documents/ CCBio-eHabitat-ER-v2.0-
Larman, C. (2005). Applying UML and patterns FINAL.pdf.
- An introduction to object-oriented analysis and
Nuñez, M., Díaz, L., Granell, C., & Huerta, J.
design and iterative development (3rd ed.). Upper
(2011). Web 2.0 broker: A tool for massive col-
Saddle River, NJ: Prentice Hall PTR.
lection of user information. In Proceedings of the
European Geosciences Union (EGU) General As-
sembly 2011 (EGU 2011). Retrieved 8 June, 2011,
from http://meetings.copernicus.org /egu2011/.

333
Methodologies for Augmented Discovery of Geospatial Resources

O’Reilly, T. (2005). What Is web 2.0 - Design Russell, S., & Norvig, P. (2005). Artificial intel-
patterns and business models for the next genera- ligence: A modern approach. Upper Saddle River,
tion of software. Retrieved May 20, 2011, from NJ: Prentice Hall.
http://oreilly.com /pub/a/web2/archive/what-is-
Smits, P., & Friis-Christensen, A. (2007). Resource
web-20.html.
discovery in a European spatial data infrastruc-
O’Reilly, T., & Battelle, J. (2009). Web squared: ture. IEEE Transactions on Knowledge and
Web 2.0 five years on. Retrieved May 2, 2011, Data Engineering, 19(1), 85–95. doi:10.1109/
from http://www.web2summit.com /web2009/ TKDE.2007.250587
public/ schedule/detail/10194.
Strohmaier, M., Koerner, C., & Kern, R. (2010).
Open Archives Initiative. (2008a). Object reuse Why do users tag? Detecting users’ motivation for
and exchange specification. Retrieved May 20, tagging in social tagging systems. In Proceedings
2011, from http://www.openarchives.org /ore/. of the Fourth International AAAI Conference on
Weblogs and Social Media (ICWSM 2010). AAAI.
Open Archives Initiative. (2008b). ORE user
guide – Abstract data model. Retrieved May 20, Stvilia, B., Twidale, M. B., Smith, L. C., & Gasser,
2011, from http://www.openarchives.org /ore/1.0/ L. (2008). Information quality work organization
datamodel. in Wikipedia. Journal of the American Society
for Information Science and Technology, 59,
Open Geospatial Consortium. (2007a). OpenGIS®
983–1001. doi:10.1002/asi.20813
catalogue services specification, ver. 2.0.2. OGC
07-006r1. Retrieved from http://www.ogcnet- Turner, A. (2010). OpenSearch Geo extension 1.0
work.net. (draft 2). Retrieved June 8, 2011, from http://www.
opensearch.org /Specifications/OpenSearch/ Ex-
Open Geospatial Consortium. (2007b). OpenGIS®
tensions/Geo/1.0/Draft_2.
catalogue services specification - ISO metadata
application profile, ver. 1.0.0. OGC 07-045. Re- USDA. (2011). National agricultural library
trieved from http://www.ogcnetwork.net. thesaurus and glossary. Retrieved May 20, 2011,
from http://agclass.nal.usda.gov /agt.shtml.
Open Geospatial Consortium. (2008). OGC KML,
ver 2.2.0. OGC 07-147r2. Retrieved from http:// Vilches-Blazquez, L. M., & Corcho, O. (2009).
www.ogcnetwork.net. A heuristic approach to generate good quality
linked data about hydrography. In Proceedings of
Pozzi, W., Fugazza, C., Brewer, M. J., Santoro,
the Database and Expert Systems Applications,
M., Nativi, S., Lee, B., & Enenkel, M. (2011).
International Workshop, (pp. 99-103). IEEE Press.
GEOSS AIP-3 engineering report – Global
drought monitoring service through the GEOSS World Wide Web Consortium. (2004a). Resource
architecture. Retrieved May 20, 2011, from http:// description framework (RDF): Concepts and
www.ogcnetwork.net /pub/ogcnetwork/GEOSS/ abstract syntax. Retrieved May 20, 2011, from
AIP3/documents/ GEOSS_AIP3_DroughtWa- http://www.w3.org /TR/rdf-concepts/.
ter_ER.pdf.
World Wide Web Consortium. (2004b). RDF vo-
Richards, D. (2007). A social software/Web 2.0 cabulary description language 1.0: RDF schema.
approach to collaborative knowledge engineer- Retrieved May 20, 2011, from http://www.w3.org
ing. Information Sciences, 179(15), 2515–2523. /TR/2004/REC-rdf-schema-20040210/.
doi:10.1016/j.ins.2009.01.031

334
Methodologies for Augmented Discovery of Geospatial Resources

World Wide Web Consortium. (2004c). OWL web 11


http://www.w3.org/TR/rdf-syntax-gram-
ontology language reference. Retrieved May 20, mar/
2011, from http://www.w3.org /TR/2004/REC- 12
http://www.w3.org/TR/rdfa-syntax/
owl-ref-20040210/. 13
http://www.geonames.org
14
http://dbpedia.org
World Wide Web Consortium. (2008). SPARQL 15
http://www.ordnancesurvey.co.uk/ontology
query language for RDF. Retrieved May 20, 2011, 16
http://en.wikipedia.org/wiki/Triplestore
from http://www.w3.org /TR/rdf-sparql-query/ 17
http://www.docbook.org/specs/cs-docbook-
World Wide Web Consortium. (2011). Semantic docbook-4.2.html
web activity. Retrieved May 20, 2011, http://www. 18
http://www.dublincore.org/documents/
w3.org /2001/sw/. dcmi-terms/
19
http://www.geonames.org/ontology/ontol-
ogy_v2.2.1.rdf
ENDNOTES 20
http://www.w3.org/2004/02/skos/
21
http://protege.stanford.edu/
1
http://www.opengeospatial.org/ 22
http://www.eurogeoss.eu/
2
http://www.geonames.org/about.html 23
http://his.cuahsi.org/
3
http://geocommons.com/help/About 24
http://auscope-services.arrc.csiro.au/vocab-
4
http://wiki.openstreetmap.org/wiki/Main_ service/client/
Page 25
http://www.genesis-fp7.eu/
5
http://www.earthobservations.org/geoss. 26
http://www.openrdf.org
shtml 27
http://www.eionet.europa.eu/gemet
6
http://www.georss.org/Main_Page 28
http://www.eurogeoss.eu/
7
http://geojson.org/ 29
http://www.opensearch.org/Specifications/
8
http://www.openarchives.org/ore OpenSearch/Extensions/Suggestions/1.1
9
http://tools.ietf.org/html/rfc3986 30
http://www.rssboard.org
10
http://www.ietf.org/rfc/rfc4287.txt

This work was previously published in Discovery of Geospatial Resources: Methodologies, Technologies, and Emergent Ap-
plications, edited by Laura Díaz, Carlos Granell and Joaquín Huerta, pp. 172-203, copyright 2012 by Information Science
Reference (an imprint of IGI Global).

335
336

Chapter 21
Semantic Interoperability
of Geospatial Services
Iftikhar U. Sikder
Cleveland State University, USA

Santosh K. Misra
Cleveland State University, USA

ABSTRACT
This article proposes a multi-agent based framework that allows multiple data sources and models to be
semantically integrated for spatial modeling in business processing. The paper reviews the feasibility of
ontology-based spatial resource integration options to combine the core spatial reasoning with domain-
specific application models. We propose an ontology-based framework for semantic level communica-
tion of spatial objects and application models. We then introduce a multi-agent system, ontology-based
spatial information and resource integration services (OSIRIS), to semantically interoperate complex
spatial services and integrate them in a meaningful composition. The advantage of using multi-agent
collaboration in OSIRIS is that it obviates the need for end-user analysts to be able to decompose a
problem domain to subproblems or to map different models according to what they actually mean. We
also illustrate a multi-agent interaction scenario for collaborative modeling of spatial applications us-
ing the proposed custom feature of OSIRIS.

INTRODUCTION to manage spatial data, a trend often described as


a “growth surge”(Warnecke, Beattie, Cheryl, &
Recent studies have revealed that over 80% of Lyday, 1998). With the growth of Internet, there
local governments in the U.S. have locational is an increasing demand for location specific data
references in their data (Nedovic-Budic & Pinto, and analytical solutions requiring GIS to locate
1999) and a majority of local governments use and integrate multiple databases. This, in turn,
geographic information systems (GIS) technology requires federal, state, and local government
agencies to develop capabilities so that their data
DOI: 10.4018/978-1-4666-2038-4.ch021 can interoperate. For example, a real estate entre-

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Semantic Interoperability of Geospatial Services

preneur, looking for a suitable location for a new business services with relative ease if it can un-
business, would require data that combines GIS derstand another application’s service descriptions
data with that of the local government’s zoning and representations of workflows and information
and tax incentive areas. Home owners and home flows within and across organizations. However,
buyers, looking for information about environ- these representations become complicated when
mental hazards, can use e-maps that combine one needs to embed complex data structures and
data from several sources including the EPA’s models into an application. For instance, suppose
environmental data, and the HUD’s (Department we are interested in a mobile commerce applica-
of Housing and Urban Development) housing tion that would provide geospatial information as
community programs (GAO, 2003). Similarly, a a prelude to completing a business transaction. The
water/sewer storm water utility company evaluat- transaction protocol for such an application would
ing the feasibility of a new project to expand the require access to and representation of geographic
existing infrastructures in mountain areas may data and models. These models themselves may
need information about geomorphologic forma- require chaining of multiple services that depend
tions and associated potential landslide risk from on service level description of geo-processing
the local and federal government databases. models, spatial geometries, spatial analysis, and
Agencies in various levels of government rarely implementation logic. Typical query such as “Find
coordinate the development of their applications. the nearest Italian restaurant along the highway”
Consequently there is often redundancies and could possibly be answered by chaining multiple
duplications of data even within organizations services such as geocoding points of interest,
belonging to the same jurisdiction (Nyerges, integrating transport networks, creating dynamic
1989). Users often have to deal with proprietary segmentation of network, providing routing net-
systems that require the understanding of the work, rendering cartographic information, and
systems’ native command language, input data possibly converting text to voice. It is possible
format, and output presentations. The problem to envision integration and chaining of services
is further compounded when there is a need for to provide higher levels of functionality if such
communicating with more than one modeling services are distributed all over the enterprise
paradigms or when spatial analysis and model- and are accessible in a uniform standard manner.
ing techniques are used in application areas for (Peng & Tsou, 2003).
which they were not necessarily designed. In most Methodological artifacts, techniques for cor-
cases, users’ access, inference, and analytical rect description, and interpretation of resources,
ability of spatial dataset and services are limited collectively known as the semantic layer (Vetere
by proprietary standards, platform dependence, & Lenzerini, 2005), are pre-requisites to high level
and incompatibility. interoperability in a service-oriented environment.
In an e-government environment, simple trans- High level or semantic interoperability is of vital
actions can require interactions among multiple importance if collaborative business processes
resources possibly from different entities within are involved (Padmanabhuni, 2004). A complex
the government, and meaningful understanding collaborative process is often needed to compose
of system architectures and the service composi- complex services like dynamic visualization and
tions. Interagency transactions become simple query processing of geo-spatial data in real time.
if the agencies involved in a transaction have The representation of semantic content of space
homogeneous representation structures as well as and spatial knowledge has a special significance
the same discourse domain (Malucelli, Palzer, & in information exchange and integration. As
Oliveira, 2006). A geospatial application can use representation of spatial features are scale depen-

337
Semantic Interoperability of Geospatial Services

dent, multiple representations of spatial objects GIS SERVICES IN THE FRAMEWORK


require contextual information of the spatial OF E-GOVERNMENT
reference domain that allows spatial features to
be integrated into application models. Therefore, As stated earlier, government agencies within
typical business transactions involving spatial the same jurisdiction rarely coordinate their ap-
data interoperability require processes beyond the plication development as well as standardization
data translation or the conversion of geometric in data representation. Making data available
primitives. In collaborative business processes electronically then becomes a problem whether the
involving multi-agent interactions, the semantics potential user of the data is a government agency
of the complex spatial application models needs or otherwise. A major initiative in the USA seeks
to be shared to infer or draw conclusions from to address this problem. The National Spatial Data
such annotations. However, understanding the Infrastructure (NSDI) seeks to build an organiza-
collaborative aspects of multi-agent interaction tional and virtual network to promote the sharing
to solve complex spatial problems is still in its of spatial data among federal, state, regional, and
infancy (Sikder & Gangapadhayay, 2002). local government agencies and the private sector.
This article proposes a framework for a The Federal Geographic Data Committee (FGDC)
semantic-level communication between geo- (FGDC, 2006) is tasked to develop a spatial data
spatial services in business processes and appli- and metadata standard as well as to create data
cation models. The article presents an overview clearinghouse. FGDC is also responsible for
of interoperability efforts with specific reference coordinating the development of a “framework”
to geo-spatial databases and application models data (Wayne, 2005). Similar initiatives have been
and reviews the feasibility of an ontology-based taken by European communities for developing a
spatial resource integration to combine the core spatial data infrastructure (SDI) for e-government
spatial reasoning with domain-specific applica- (Craglla & Signoretta, 2000). As a part these initia-
tion models. Existing industry standards and tives to share data and services among public and
practices in geo-spatial interoperability are identi- private agencies as well as to achieve interoper-
fied. This is followed by a discussion of the role ability, various legal frameworks and standards
of ontology in explicating the implicit semantics have been instituted. The E-government Act of
of spatial data models and the need for formal- 20021 requires federal agencies to coordinate the
ism in descriptions of spatial categories. Use of development of standard protocols for sharing
markup languages for spatial resource description geographic information to reduce redundant data
and the tagging of spatial ontology are illus- collection and promote collaboration and the use
trated. Finally, a multi-agent based architecture of standards (GAO, 2003). As a result, the role
(OSIRIS-Ontology-Based Spatial Information of NSDI and SDI is evolving from a simple data
and Resource Integration Services) for semantic access facilitator to integrated service provider
interoperability of spatial data sets and models is where complex geospatial services and function-
proposed. The architecture is illustrated using an alities can be chained in a distributed environment
application model that uses domain ontology of to meet user needs. The framework of SDI and
urban environmental hydrology. the role of e-government are viewed as a portal
that can be used to access e-government services.
The geospatial one-stop portal, implementing the
NSDI framework, seeks to coordinate and align
geospatial data collection and maintenance across
all levels of government. This portal envisions a

338
Semantic Interoperability of Geospatial Services

virtual repository of spatial data and Web services between multiple agencies and clients using Web
and can provide support to local, state, and federal services can provide opportunity to interoper-
programs as well as decision making at various ate through open interfaces and communication
level of e-government operation. The portal can protocols. Typical geo-processing services may
be used by users to access the following services include data management tools like projection and
(Cummens, 2003): transformation, topology manipulation, indexing,
and spatial join.
1. Searching the geospatial one-stop network The interoperability issue in the context of
for image and feature Web services; e-government services is not limited to technical
2. Registering for notification when new or issues such as linking computer networks. There
updated data, maps, activities, and references is a fundamental requirement to share and re-use
are added to system repositories; knowledge networks and reorganize adminis-
3. Viewing metadata to determine if a product trative processes to better support the services
is suitable for the intended use; and themselves. The key areas of interoperability
4. Publishing (registering) map services, im- in need of consideration when implementing e-
ages, geographic data sets, geoservices, government services include the organizational,
spatial solutions, geographic, and land semantic, and technical issues (CEC, 2003, 2006).
reference materials. At the semantic level, there is an imperative that
the meaning of the information exchanged is not
The geospatial one-stop portal can to be ap- lost in the process of acquiring e-government
plied in the real world environment. For example, services from all levels of government. Thus,
in a real estate business, the portal can perform semantic interoperability entails seamless integra-
the task of a ‘broker’ to discover and chain land tion of information and services from agencies at
information or cadastre services from various different levels whether they are local, regional,
nodes according to the specifications set by ser- or national. This ensures organizational or local
vice requesters (Radwan, Bishir, Emara, Saleh, autonomy as well as global integration.
& Sabrah, 2005). However, there is growing skepticism whether
The application of Web services in geospatial the vision of geospatial one-stop’s objectives will
service management offers an opportunity towards be fulfilled (GAO, 2003) and interoperability
the local autonomy of databases. Web services can be achieved. The main obstacles are lack of
can be used to dynamically query various GIS standards and the requirement of a huge metadata.
layers while maintaining local agency level in- While existing FGDC metadata provides content
dependence in a distributed environment. From standards, they lack a framework for semantic
an organizational point of view, this may be very annotation of the geospatial content and services.
appealing. Local governments, such as counties The geospatial one-stop portal requires a semantic
can independently collect and manage data and layer that would contain domain level specifica-
still integrate information and services using tion of knowledge, mediator, and mapping agents
Web services. A client of the local government to coordinate among mapping components. The
entity, for example, a transportation company, semantic layer should be able to decompose
can directly access the government’s base map users’ declarative requests, compose complex
without maintaining its own dataset. With ap- spatial services within the context of user-defined
propriate permissions, a client can also update constraints.
the government’s dataset from the client’s own
record. An extended collaboration and partnership

339
Semantic Interoperability of Geospatial Services

EPISTEMOLOGY OF est to study the role of an agent in system level


INTEROPERABLE AGENTS IN interoperability by offering interface with special-
BUSINESS PROCESSES ized systems that require agents to understand
the native command language, input data format,
Epistemologically, the intrinsic state of an agent’s and output presentations. In such heterogeneous
reflection of a business process is partially char- environments, agents require mapping one-to-one
acterized by the internal schema (e.g., relational objects on a syntactic basis or measuring the se-
or object relational) of the agent. In addition, the mantic equivalence of objects. However, when the
agent also offers manipulation methods that can be representation is complex, for example, spatial da-
used to interface with other agents through specific tabases, simple semantic translation is inadequate
access protocol. Such autonomous agents can be as it can result in semantic conflicts. Three different
considered as a proxy who works collectively on types of semantic conflicts have been identified
behalf of some higher entity having rich schema in spatial applications (Goh, 1997): confounding
or domain knowledge. For example, in a B2B conflicts, scaling conflicts, and naming conflicts.
e-commerce system, a cooperative mechanism Confounding conflicts occur when information
agent may negotiate with multiple suppliers, items have the same meaning but have different
monitor auctions, and infer tactical business strate- realizations. Information items representing the
gies in a competitive environment. Agents may same value but with different reference systems
not only involve in collaborative interactions to lead to scaling conflicts; and naming conflicts
achieve a common goal, they may also identify are generated when same items have different
best partner through negotiations (He, Jennings, rules in different systems. A typical example of
& Leung, 2003). Business applications can then spatial semantic heterogeneity can start with both
exploit these agent benefits. Specifically, a large naming and confounding conflicts. For instance, a
complex problem can be decomposed into subtasks “river” and a “body of water” have two different
that can then be distributed among the agent com- realizations with respect to the scale and context
munities specializing in solving domain problems. of user perceptions. A location-based query such
In a multi-agent environment, the internal state as “find the intersections of a river and a high-
of agent’s reflection needs to be translated to an way in Ohio,” would need a semantic translator
export schema to map conceptual correspondence. to identify the spatial object that corresponds to
Therefore, multi-agent communication paradigms the concepts referred by “river” and “highway”.
require semantic level agreement. Such semantic The spatial agent needs to be able to characterize
interoperability eventually boils down to the prob- the semantic content of spatial data and model.
lem of the identification of semantically similar The representation of “river” or “body of water”
objects that belong to different components and depends on the context of the requesting agent.
the resolution of their semantic differences (Yeap Depending on the context, the geometric repre-
& Handley, 1990). In this sense, the semantic sentations of “river” and “body of water” could
interoperability problem is somewhat similar to be also different; the resulting intersection of a
the problem of schema integration in traditional “river” and a “highway” could be a point feature
multi-databases. in one representation while it could be a polygon
Different levels of business process conceptu- feature in another.
alization have given rise to many domain specific An agent’s access to data layers of spatial data-
heterogeneous systems, which are often very bases is dependent on the semantic interpretation
difficult to interoperate in applications. In recent and corresponding topological definition where
years, there has been a significant level of inter- spatial relationships such as adjacency relation-

340
Semantic Interoperability of Geospatial Services

ships can be reflected in a topological model. A of data translation or the conversion of geometric
topological model of a highway can be a graph primitives. The semantic interoperability eventu-
theoretic model characterized by its “from” and ally boils down to the problem of the identification
“to” nodes as well as by its left and right polygon of semantically similar objects that belong to dif-
objects. An agent can interpret the semantics of a ferent databases and the resolution of their seman-
textual query that includes “longest highway in tic differences (Kashyap & Sheth, 1996). The use
Ohio” in many ways since the agent would need of an ontology (Guarino & Giaretta, 1995) as a
to understand the definition implicit in topologi- framework for defining similarity among objects
cal containment of part and whole relationship. has the benefit of a formal definition for concepts
Examples of interpretation of “in Ohio” may in- in different metadata, a definition that could be
clude or exclude highways that run along the state used to define axioms for semantic translation
border, that are wholly contained within the state, between ontologies. The term “ontology” has its
or those crossing state boundaries. Thus, different root in the philosophical literature as the study of
interpretations will result in different access and being. In the domain of information systems and
processing requirements to different layers and AI, ontology has a somewhat different connotation
spatial joins. There is no standard or common as an “explicit specification of a conceptualiza-
formal convention for the description and categori- tion” (Gruber, 1993; Farquhar, Fikes, & Rice,
zation of semantic content in the query which can 1996) and provides a more pragmatic definition:
be used for the global interoperability of spatial Ontologies are explicit specifications of domain
agents. In addition, the lack of context-dependent conceptualization names and describe the entities
data views can lead to monolithic systems that that may exist in that domain and relationships
do not promote interoperability. For example, it among those entities. In other words, the tacit and
will be difficult for a local government or busi- implicit knowledge hidden in a particular domain
ness service provider to be able to update its land is explicitly conceptualized in ontology (Guarino,
records while serving them to other organizations 1997). Ontology is considered as a logical theory
without transforming geometric and topological accounting for the intended meaning of a formal
semantics to make them compatible to the client’s vocabulary while conceptualizations are the
structure. A state highway company would have formal structures of reality as perceived and or-
to maintain and reflect the changes made by local ganized by an agent. In spatial ontology, although
governments for its own data view to be able to agents may have a shared vocabulary capable of
use that local government’s base map directly. At establishing relationships, or mapping between
present, there is no accepted convention, which corresponding instances, the conceptualization
can be used to facilitate global interoperability of space as “object-based” and “field-based” may
of spatial objects. be still implicit among agents.
Although spatial ontology is an established
concept, and is capable of providing a naturalistic
ONTOLOGY FOR SEMANTIC representation of spatial objects, in the sense of
INTEROPERABILITY “naive geography” (Egenhofer & Mark, 1995),
it is still a complex specification to be realized.
The objective of semantic interoperability is to Spatial semantic interoperability goes beyond
be able to attach meaning to information entities simple data translation or the conversion of
or services and thereby draw inferences from the geometric primitives based on a-priori ontology.
semantic annotation. In spatial semantic interop- Classical ontology, offering limited expression of
erability, the integration goes beyond the process spatial relations to simulate spatial processes, can

341
Semantic Interoperability of Geospatial Services

be used to express spatially complex phenomena, CRAFTING ONTOLOGY-


is not well understood. Specifically, such expli- SEMANTIC WEB INITIATIVES
cation requires an understanding of mereology
and mereotopology (Smith, 1996). The study of While Web service description language (WSDL)
parts and boundaries of the whole is an important has become de facto standard for defining Web
aspect of representing the multiple constituents of services, its implication is yet to be understood for
a complex spatial object in relation to adjacency, geospatial services. In particular, XML encoding
containment, selection, or the separateness of of geospatial objects namely OGC’s (open GIS
constituent parts. Ontology in this sense needs consortium) geographic markup language (GML)
also to be viewed as a mediator for knowledge provides mechanisms for the encoding of spatial
exchange, to build a business application that feature data without regard to how the data is to
can enable data integration and avoid problems be presented. The spatial feature instance con-
such as inconsistency between ad-hoc ontologies tains feature properties, each as an XML element
which might be built into the system (Fonseca & whose name is the property name. Each of these
Egenhofer, 1999). This approach to ontology in contains another element whose name is the type
geo-spatial service management would address of the property value or instance. This produces a
issues concerning knowledge sharing by creating “layered” syntax in which properties and instances
components from ontology in an object-oriented can be interleaved. GML’s encoding convention
fashion, using classical object-oriented concepts helps distinguish properties from instances. Thus,
such as multiple inheritances. Thus, spatial ontol- a feature like settlement area might be composed
ogy should allow overcoming the implicit seman- of households, a roadway, utility lines, and many
tics of apparently disparate spatial data models by other features. Such structure-independent rep-
providing a higher-level description to prepare a resentation of spatial features allows multiple
cognitive correspondence among multiple com- views of the same entity in a multimodal manner.
ponents. Such ontology is relevant not only for For instance, point features can assume different
spatial data models to unravel the representational symbol sets consistent with the predefined design
scheme of topological part-and-whole dichotomy principle of the organizations. XML namespace
but also for the application models of GIS where prefixes for the domain specific vocabularies,
certain spatial operators (e.g., overlay, buffer, or for example, “urban hydrology” or “watershed
interpolation, etc.) must be executed in specific planning,” can be used to define feature types
sequences to reach a goal. Spatial application from common geometric vocabulary. Since the
developers should be able to combine classes namespace defines a unique identifier, there is
from diverse ontologies and create new classes little concern for naming conflicts. For instance,
that represent the user needs. For example, a class the underground cable line and road network
that represents a land parcel for a specific city databases are maintained by two different organi-
can be built from land parcel components speci- zations. In response to the user query to identify
fied in urban ontology, from polygons specified the locations where the cable lines intersect with
in geometry ontology and from crisp boundary highways, the two data sets need to be spatially
specifications in boundary ontology (Fonseca & overlaid. However, the assigned meanings of
Egenhofer, 1999). XML schema must still be inferred by a human.
A partial solution to these semantic issues can
be done using semantic markup of spatial Web ser-
vices and by using an expressive semantic markup
language that offers well-defined semantics of

342
Semantic Interoperability of Geospatial Services

spatial processing. A possible solution can be the OSIRIS framework will be used to illustrate
found through the use of RDF (resource descrip- the collaborative aspect of multi-agent interaction
tion framework) (Lassila & Swick, 2004). RDF to solve complex spatial problems. The advantage
provides a metadata description to allow for the of using multi-agent collaboration in OSIRIS is
creation of shareable and extensible vocabular- that it obviates the need for end-user analysts to
ies. The so-called self-expressibility of RDF can be able to decompose a problem to subproblems
be achieved by using a triple (resource, property, or to map different models according to what they
value). It provides a globally unique URI to de- actually mean. The proposed approach provides
scribe a resource in a Web page in a serialized and a means for communicating different models in a
machine-readable format. Built on the top of RDF, common semantic framework without the loss of
DAML (DARPA agent markup language) offers native autonomy. Having a comprehensive source
semantics of higher order relationships by includ- description of different modeling representation
ing modeling constructs from descriptive logic. has a unique advantage in understanding how the
DAML overcomes many of the shortcomings of models are organized in terms of interconnection
RDF semantics and is widely used for formulating of schema and scope.
Web ontology language. Thus, users’ declarative
requests can be decomposed to initiate generic
procedures to compose complex spatial services MULTI-AGENT AGENT
within the context of user-defined constraints. The ARCHITECTURE OF OSIRIS
implication of these constructs in spatial databases
is likely to have a fundamental shift in interoper- The OSIRIS architecture (Figure 1) is composed
ability research, allowing distributed access to of several integrated components. The model
multiple stakeholders’ data sets and integrating manager is in charge of providing the overall
them in a seamless Web interface. However, lack coordination of the agents by acting as a media-
of inference and logical layer of RDF makes it tor for instantiating an appropriate model through
difficult to express complex spatial semantics. the ontology agent. It also maintains a catalog of
models and ontological semantics with the help
of ontology agents. The model manager exposes
ONTOLOGY-BASED SPATIAL the model schema to the ontology agent to be
RESOURCES INTEGRATION used in response to specific user request. Using
SERVICES (OSIRIS) the encapsulated description of the model from
the ontology agent, the model manager creates
So far, the feasibility of an ontology-based spatial appropriate instances. The integration of data and
resource integration that combines the core spatial corresponding models is coordinated by the model
reasoning with spatial data models and domain spe- manager which ensures a proper correspondence
cific application models has been discussed. It has through the ontology agent and resolves any dis-
been noted that the application of ontology-based agreements between data and models. Through the
semantic mediation can lead to model integration. interaction of ontology agent and user interface
In this section, the proposed OSIRIS architecture agent, the model manager composes services or
that involves different levels of ontology to create custom models from discrete services to produce
meta-models, especially for spatial services, has a meaningful interconnection among them.
been described. In addition, a specialized prototype OSIRIS architecture incorporates spatial and
for decision making in spatial planning in a multi- non-spatial databases as well as a collection of
agent environment has been demonstrated. Later, application models (GIS resources). A wrapper

343
Semantic Interoperability of Geospatial Services

Figure 1. Multi-agent OSIRIS architecture

is used to encapsulate application models as well query results to the ontology agent. These descrip-
as the database. The wrapper provides a compre- tions could be encoded in the extended RDF or
hensive description of model parameters, scope, OWL-enabled ontology. Such descriptions pro-
input/output options, limitations, context, method mote consistency, exchangeability, and reuse of
of use, and other necessary semantics. Encapsula- modules through the shared repositories in a
tion agents encapsulate models and data to create distributed environment. Above the communica-
meta-models and created interfaces necessary for tion layer of the OSIRIS are two components
the wrapper object. Meta-models may include which ensure the consistent construction of the
legacy models, domain models, generic proce- agent platform. The internal platform message
dures, or high-level schema of interconnected transport (IPMT) provides a message routing
data. Encapsulation agents process queries origi- service for agents in this particular platform. The
nating from ontology agents and retrieve informa- IPMT can use either OMG’s IIOP (Internet inter-
tion back to the system. In addition, these agents ORB protocol) or HTTP protocol. The agent
translate query to the local query language, process platform security manager (APSM) is responsible
the query in a local system, and then forward

344
Semantic Interoperability of Geospatial Services

for managing the agent’s transport level security custom models from discrete services to produce
by creating an audit log. a meaningful interconnection among them.
A model manager can be divided into two-core
Models and Databases components: a) custom model manager and b)
generic model manager. A custom model manager
The OSIRIS framework relies on a repository of is designed to create and manage modules dynami-
models and databases. The choice of models and cally from core ontology and extended domain
data set would depend on the type of system be- ontology while a generic model manager is used
ing considered in a particular case. For example, to integrate legacy models through encapsulated
a system for studying land slide hazard may interfaces by maintaining reusable libraries of
need composite of several application models, models. Generic model managers offer services
algorithms including geoprocessing algorithms, to custom model managers as encapsulated meth-
discrete event simulation models, and non-linear ods and services of a particular class module and
dynamic models for estimating rainfall runoff. embed them in the model hierarchy.
In an e-government system, the set of models
and algorithms may include land use policies, Ontology Agent
zoning regulations, and economic development
priorities. Encapsulation agent is responsible to An ontology agent establishes correspondences
provide appropriate wrapper around such models and interrelations among different domain knowl-
and make them available to an ontology agent for edge and data models and relations. It defines a
semantic annotation and thereby help create a meta common semantic framework and formalism to
model ontology. The creation of wrapper involves share data and models and identifies a set of model-
interaction with the ontology repository through ing approaches and corresponding data sources to
the ontology agent to determine if an existing resolve semantic ambiguities. An ontology agent
ontological specification is adequate; otherwise is responsible for ensuring the class consistency
a new specification and taxonomy is created by and implicit relationships of application models
the encapsulation agent and the specification is and data. It exposes higher level model schema of
deposited in the ontology repository. legacy models and native data structure to model
manager and help generate meta-model catalog.
OSIRIS Model Manager An ontology agent provides standardized
descriptions of model constructs in conformity
The model manager provides the overall coordi- with ontological specifications. Its responsibility
nation of the agents by acting as a mediator for also includes ontology deployment, validation of
instantiating an appropriate model corresponding specification leading to creation of new wrapper
to the meta-model ontology. It also maintains a objects in the database, and updating the ontology
catalog of models. The export model schema to repository. For example, consider an e-government
be used in response to specific user request is ex- system that maintains databases regarding water/
posed to model manager by an ontology agent. The sewer storm water utility. A network infrastructure
integration of data and corresponding models is provider company may need access to the existing
coordinated by the model manager which ensures a water sewer database. This client’s request can be
proper correspondence through the ontology agent realized by dynamically populating a template
and resolves any disagreements between data and schema of water utilities data model.
models. The model manager composes services or The ontology agent populates the template
schema from existing databases for which it

345
Semantic Interoperability of Geospatial Services

receives the necessary request from the model domain ontology following the semantic structure
manager or the user interface agent (UIA) of the of the core ontology.
OSIRIS. The template schema can be created if In OSIRIS, core ontology is related to the
a user ontology matching the specific client’s domain ontology to specify appropriate semantic
request exists. In case such a match is not avail- characterization of the domain model. An example
able, the ontology agent can create a user ontology would be the relationship between the classes of
through an interactive process between itself, the core ontology and the classes of domain ontology
client, and the UIA. The ontology agent invokes that can be expressed using an user-defined seman-
the model manager to service the client request. tics such as: equivalentClassOf (DomainClass,
In the case of complex requests, the ontology CoreClass) or aggregateClassOf (DomainClass,
agent invokes the services of collaborative agent CoreClass). In specific cases, core ontology may
to resolve ambiguities or semantic disagreement include reusable UpperCyc (OpenCyc, 2006)
between domain ontologies and user ontologies. ontology which covers most of the top concepts
Embedded within the ontology agent is a reason- applicable to the domain of e-government. The
ing support engine. At the implementation level, it class relationships allow generating ontology in a
may include a reasoning engine like FaCT (Hor- hierarchy with the child class having explicit at-
rocks, 1998) system that can translate ontology tributes to specialize the parent classes. Thus, the
into SHIQ (D) DL to indicate inconsistencies and orthogonal property of ontology can be maintained
implicit subsumptions. In addition, axioms could by decomposing compound concepts into constitu-
be added to make implicit subsumptions explicit. ent parts. An ontology agent relies on Web services
and generic procedures to discover new services
Ontology Repository and modules. Web procedure/services ontology
manages this function by defining the levels of
The ontology repository consists of i) core ontol- abstractions of external services and fitting them
ogy, ii) domain ontology, iii) user ontology, and with the repository structure. At an implementation
iv) Web procedure/services ontology. In general, level, Web services can be expressed with a Web
these repositories contain a set of axioms (e.g., service description language (WSDL).
assertion of class subsumptions/equivalence),
and taxonomic class hierarchies. All of these on- Ontology Collaboration Agent (OCA)
tologies are strongly related to one another. The
core ontology contains immutable and generic OCA is designed to assert inter-ontology relation-
concepts and top-level categorical abstractions of ships within ontology repository and compute
physical relationships and corresponding semantic class hierarchy/consistency. It merges multiple
descriptions and axioms. For example, in the case ontologies by mapping domain classes with core
of complex spatial data required for representing ontology and provides semantic agreement of user
water and sewer network, it will be necessary to ontology with domain ontology. For example, a
use core ontology that contain high-level defini- road network classification by water/sewer net-
tions of spatial entities and geometric primitives. work authorities may not match with a specific
Domain ontologies are generally associated with domain of a user’s ontologies. In such cases, the
specific application domains such as geographic collaboration agent first tries to match the dispa-
information metadata-ISO 19115 (Islam, Bermu- rate schema by mapping individual road network
dez, Beran, Fellah, & Piasecki, 2006) and infra- classes with classes from core ontology. If the
structure product ontology (Osman & El-Diraby, mapping does not result in consistent well-formed
2006). A domain model is incorporated into the class hierarchy, the collaboration agent may pro-

346
Semantic Interoperability of Geospatial Services

ceed to modify user ontology through mediation tive session, the UIA generates and updates log
of user interface agent and ontology agent. It also files of frequently requested services and creates
integrates specialized Web procedure services corresponding index structure to facilitate faster
ontology (WPSO) and provides Web resources for retrieval of future request.
custom model formulation. WPSO also includes
a set of triggers that gets activated in the event Agent Platform Security Manager
of any change of Web-service composition or
changes in domain ontology. OCA also reacts to The agent platform security manager verifies
these changes and carries out necessary updates inputs from ontology agents and enforces compli-
to user ontology. ance with underlying system security protocol.
The nature of the compliance may differ based
User Interface Agent (UIA) on the application domain. For example, the se-
curity standards specified by FIPA (FIPA, 2006)
This component of OSIRIS is the primary inter- would be enforced by the security manager in an
mediary between the system user and OSIRIS. application involving landslide hazards.
It resolves ambiguities in the user expression in
formulating structured requests through an interac- Internet Platform Message Transport
tive session using the user profile. The ontology
agent collaborates with the UIA to develop model- The Internet platform message transport is de-
ing profiles of the user following user needs and signed to transfer a message from one system to
strategies. Some of the tasks achieved through the another. Such a transfer may involve technologies
interaction between the ontology agent and the such as Web services or CORBA. The intention of
UIA include the user modeling (Cali, Calvanese, the OSIRIS model is to be a high level protocol that
Colucci, Noia, & Donini, 2004; Rich, 1989), iden- interacts with the underlying system’s network.
tification of trends in user requests, and extreme
or unusual requests. Embedded within the UIA Multi-Agent Collaboration:
is a query formulation engine that has access to an Event Flow Scenario
native data semantics of user profiles via ontol-
ogy agent. Once the UIA completes the query In this section, two different scenarios for process-
formulation, the resultant query is parsed by the ing user requests are illustrated: composition of
model manager for the identification of specific services using generic models and custom mod-
model or service as requested by the UIA, and els. A generic model can be created by means of
finally validated by the user. The user ontology searching the model catalog (see the description of
in the ontology repository is updated through the model manager). Model manager maintains reus-
interaction among UIA, ontology agent, and the able libraries of geo-processing models that can be
collaborating agent. In the absence of the requested integrated with legacy models through the wrapper
service in the OSIRIS’meta-model catalog, custom objects in the database. For example, in order to
model is composed dynamically and validated extend an existing water utility infrastructure, the
by an interactive session involving the UIA and local government agency would need to identify
multiple agents in OSIRIS. The UIA also includes the optimum location and network of services
a set of triggers to respond to events such as matching those of its customers’ locations. A user
changes of user ontology in ontology repository, request for identifying location for the proposed
and meta-model catalog updates. As a practical water utility infrastructure would result in a service
implementation issue, at the end of an interac- request to OSIRIS. The resolution of this request

347
Semantic Interoperability of Geospatial Services

Figure 2. Message sequence of agent interactions for composing generic procedures

may consist of searching geocoded customer Unlike the generic model formulation, custom
location (geocoding combines map information models are composed dynamically from core
with street addresses to locate a point uniquely) ontology and extended domain ontology. The
and combining the search results with the gravity composition process involves logical inferences
model (Haynes & Fotheringham, 1984) for loca- from core ontology to generate generic procedures
tion allocation. The OSIRIS can accomplish this from domain ontology and WPSO (Figure 3). The
task by initiating a series of message transports derivation of the logical inference is iterative and
through inter-agent communication. Figure 2 requires user feedback for semantic refinement
illustrates a scenario where a generic model is of the model design. In the next section, an ex-
formulated from the user’s declarative request. ample of custom model composition from axi-
A request for geocoding customer location is omatic proposition in core ontology is described.
initiated by a series of message transports and the
creation of the request object. While the model Composing Custom Models by
manager (MM) provides access to meta-model Collaborative Modeling: Example of
of geo-processing catalogue, the ontology agent Logical Inference
(OA) and the ontology collaboration agent (OCA)
provide semantic support to resolve semantic mis- The interaction of multi-agents in collaborative
match between user request and domain ontology. environments can lead to the solution of complex
Additionally, the data encapsulation agent (DEA) problems not otherwise amenable to analysis in
interprets the schema of existing spatial data a stand-alone system [43]. The custom model
and corresponding meta-data and populates the manager feature of OSIRIS exploits the interaction
instances in the parameters of processing model. of multi-agents in a collaborative environment in
modeling spatial applications. A simplified ap-

348
Semantic Interoperability of Geospatial Services

Figure 3. Composing custom procedure through logical inference in OSIRIS agent interactions

plication scenario of OSIRIS for custom model is completed, output such as landslide risks and
management can be visualized in a spatial plan- zonation can be produced.
ning problem. Consider, for instance, an insurance Suppose the model server does not have ac-
company is interested in finding out the regions cess to the domain model for the landslide hazard
that are prone to landslides in the proposed ex- zonation model. This scenario would invoke the
tended water/sewer utility network. This problem custom modeling abilities of OSIRIS which then
can be addressed in two different ways. First, the proceeds to answer the landslide query indirectly
domain expert model for the landslide hazard in some less rigorous way by using core ontology
zone modeling (a domain model developed by and user templates. Let us assume that the user
geomorphologists) can be identified by OSIRIS input is only a DEM class. The system should be
model manager as the candidate model to ad- able to reason that the DEM class can be used to
dress the issue of landslide hazard. The model create a slope map class that would eventually
parameters, requirements, constraints, and schema identify the locations of highly steep areas that
can be described using description logics. This are likely to be landslide-prone. Creating a slope
description would constitute a meta-model of the map from the DEM is a standard and simple GIS
actual landslide model that eventually becomes procedure and the system should have no problem
the domain ontology for this particular problem. converting the map to a slope map using standard
The model may require different input classes, functions from domain ontology. The core ontol-
for example, 3D digital elevation model (DEM), ogy should be able to infer that higher the slope,
soil map, moisture index map, and satellite im- the greater is the probability of landslide and
ages. Once the processing of the available input specifies to the model manager the requirements

349
Semantic Interoperability of Geospatial Services

for generating the slope map. The common sense HeavyRain ⊆ ∃causes.Moisture
understandings are encoded and inferred from the Moisture ⊆ ∃causes.Landslide
knowledge of core ontology agent. This knowledge ⇒ HeavyRain ⊆ ∃causes.Landslide
includes generic axioms and contexts such as rocks
fall downward under gravity; rocks move faster Inverse roles for example, causes/causedBy
as the slope increases; and rocks do not move relation provides similar to the power of adduc-
in flat plains. The model can progressively lead tive reasoning.
to more sophistication with enhanced inference
and reasoning. Since soil types are associated Moisture ∩ ∃causedBy.HeavyRain ⊆
with landslide phenomena, the system may infer Landslide
that it needs to create a soil class in addition to ⇒ HeavyRain ⊆ ∃causes.Landslide
DEM and therefore, would begin searching for
soil data before it actually processes the query. The inference also provides cardinality restric-
While developing the “custom” model, the system tions and adds consistency constraints.
might “consult” the domain ontology and the user
ontology to gain further “insight” to solve similar RiskIndex ⊆ ∃hasValue.(High ∩ Low) ∧
complex problem. ≤1hasValue)
In the following example, consider the fol- High ⊆ ¬ Low ⇒ HighLowRiskindex ⊆ ⊥
lowing 4 axioms: a plot is sameClassAs a parcel;
landslide is the result of (or subClassOf) steep The inference allows one to express risk
slope and high moisture content; and the slope score of landslide of a given area from the core
can be derived from DEM. Given a set of well- ontology. It should be noted that causal inference
formed axioms, the semantics can be defined via can be derived from the core as well as domain
a translation into an expressive description logic ontology of OSIRIS.
(DL) which is equivalent to SHIQ (D) DL (Ian
Horrocks, Patel-Schneider, & Harmelen, 2003). Semantic Matching of Spatial
Features in OSIRIS
Plot Β Parcel
Landslide ⊆ steepSlope ∩ highMois- The relationship between domain ontology and
ture core ontology in OSIRIS corresponds to the
Slope ⊆ ∃hasSlopefunction.DEM relationship between the concepts of specializa-
tion and generalization. Just as the RDF schemas
Using multiple equality/inclusion axioms, one allow the creation of subtypes using multiple
can derive following assertion. inheritance properties like subClassOf or sub-
PropertyOf, the domain classes can be derived
steepPlot Β Plot ∩ hasHighval.Slope from core classes or ontology. In a spatial context,
moistPlot Β Plot ∩ hasHighval.Mois- by using GML’s core schema, one can develop
ture fundamental building blocks for features and fea-
Slope ⊆ ∃hasSlopefunction.DEM ture geometry and define application or domain
schemas following RDF/DAML constraints. For
Transitive role of classes or causality can be example, new feature types or collection types
captured as: can be defined from gml:AbstractFeatureType or
gml:AbstractFeatureCollection which could be
real world categories such as “roads” or “bridges”.

350
Semantic Interoperability of Geospatial Services

A user-defined complex geometric feature can from two different agencies. At the implementa-
consist of many other geometric elements. Such tion level, this approach can be used to identify
schema can be enriched by DAML+OIL con- spatial features and objects in different Web pages
straints. The DAML+OIL enabled spatial agent and to initiate spatial reasoning among disparate
is capable of new classifications by inference and data sources. The serialized and machine-readable
reasoning. Using GML’s spatial primitives within semantic descriptions, as supported by DAML,
RDF, a domain specific application schema can GML, and other, provide a means for ontology
be expressed to describe the different associative, agents to exchange spatial models and schema
causal, and functional relationships of the domain. on the Web.
Suppose the water/sewer authority in an
e-government system defines a namespace Domain Ontology for Spatial
<myDef:River> using GML’s namespace and Application: An Example
type hierarchy.
Using features of the OSIRIS framework, do-
< -- xmlns:myDef=”http://www.xyzcorp/ main ontology for urban environmental hydrol-
example.rdf --> ogy tightly coupled with a Web-based “what-if”
<myDef:River> scenario generator have bben developed. The
<gml:description> T1</ ontology expresses the widely used specifica-
gml:description> tion of a “non-point pollution model” or what is
<myDef:classification>water body</ commonly known as “simple method” [41] for
myDef:classification> estimating surface runoff or pollutant load. The
<gml:CenterLineOf> first step in markup of the domain components
<gml:LineString srsName=”GFD:987”/> is to describe each constituent component of
<gml:coordinate>0.00, 1000.0 1000.0, the pollution model. It consists of a declarative
0.0</gml:coordinate> description of the model’s properties. The objec-
</gml:LineString> tive is to model pollutant loads originating from
</gml:CenterLineOf> different changes of land use category. Depending
</myDef:River> on the land use category, the planning authority
classifies a certain number of zones. Zones are
It is possible that there exists other definitions spatial as well as administrative features managed
for the river class, say <usgs:River>, defied by by an environmental agency in accordance with
some other agency such as USGS. The use of the environmental regulations. The definition of
a namespace prefix <myDef:River> allows the the organization (in this case-environment and
user to specify the agent’s domain or to scale conservation-and-wildlife-organizations) relevant
specific vocabularies which essentially differenti- to zonal planning is derived from another top level
ate other namespace of similar categories such as ontology namespace to ensure the semantic cor-
<usgs:River>. DAML allows constructs such as respondence. Land use classes have essentially
daml:equivalentTo or daml:samePropertyAs that spatial features. A specific land use category
can be used to inherit specific properties from can result in several “pollutant” types which are
existing type definitions and associated descrip- expressed as enumeration types—daml:one of
tions. Thus, it is possible to have two distinct element as a closed lists of the daml:collection
spatial objects, having two different application parse type. Since the ontology is extensible, any
schemas, sharing the common geometric primitive new ontology component can be added to describe
of GML and the semantics of the two data sets wider algorithm of the domain. An excerpt from

351
Semantic Interoperability of Geospatial Services

an example of an urban hydrology developed <rdfs:subClassOf>


using DAML is shown. <daml:Restriction>
<daml:onProperty
<rdf:RDF rdf:resource=”#zoneID”/>
xmlns:rdf =”http://www. <daml:toClass
w3.org/1999/02/22-rdf-syntax-ns#” rdf:resource=”http://www.
xmlns:daml=”http://www.daml. w3.org/2000/10/XMLSchema#string”/>
org/2001/03/daml+oil#” <daml:maxcardinality>1</
xmlns:rdfs=”http://www.w3.org/ daml:maxcardinality>
TR/1999/PR-rdf-schema-19990303#”> </daml:Restriction>
xmlns=”http://cis.csuohio.edu/daml/ </rdfs:subClassOf>
ontologies/Non-point-pollution#” <rdfs:subclassOf>
> <daml:Restriction>
<daml:Ontology rdf:about=”http:// <daml:onProperty rdf:resource=”#e
opengis.net/gml/gml.rdfs#”> nvironmentalRegulation”/>
<daml:versionInfo>$1.0</ <daml:toCalss rdf:resource=”#Mana
daml:versionInfo> gementAgency”/>
<rdfs:comment>An ontology for </daml:Restriction>
Non-point Pollution modelling</ </rdfs:subClassOf>
rdfs:comment>
<daml:imports <daml:DataProperty rdf:ID=”environme
rdf:resource=”http://www.daml. ntalRegulation”>
org/2001/03/daml+oil#”/> <daml:DataProperty rdf:ID=”#zoneID”>
<daml:imports
rdf:resource=”http://opengis.net/gml/ <rdfs:Class rdf:ID=”LandUse”>
gml.rdfs#”/> <rdfs:subClassOf
rdf:resource=”Zone”>
</daml:Ontology> <rdfs:subClassOf>
<daml:Class <daml:Restriction>
rdf:ID=”ManagementAgency”> <rdfs:domain
<rdfs:subClassOf rdf:resource=”#LandUse”/>
rdf:resource=”http://www.daml. <daml:onProperty
org/cgi-bin/hyperdaml?http:// rdf:resource=”#name”>
opencyc.sourceforge.net/daml/ <daml:toClass rdf:resource=”http://
naics#Environment,-Conservation-and- www.w3.org/2000/10/
Wildlife-Organizations;813312”/> XMLSchema#string”/>
<rdfs:label>Urban Land Land manage- <daml:maxcardinality>1</
ment Agency</rdfs:label> daml:maxcardinality>
</daml:Class> </rdfs:subClassOf>
</daml:Restriction>
<rdfs:Class rdf:ID=”Zone”> <rdfs:subClassOf>
<rdfs:subClassOf <daml:Restriction>
rdf:resource=”http://opengis.net/gml/ <daml:onProperty
gml.rdfs#Polygon”/> rdf:resource=”#pollutant”/>

352
Semantic Interoperability of Geospatial Services

<daml:toClass to illustrate inheritance from a generic class. Us-


rdf:resource=”#Pollutant”/> ing the set of axioms provided by DAML+OIL,
<daml:Maxcardinality>1</ one can assert class subsumption, equivalence of
daml:Maxcardinality> class or property, and various constructors such
</daml:Restriction> as intersectionOf, and Maxcardinality to create
</rdfs:subClassOf> complex semantic constraints. An intelligent
agent with an inference engine can easily deduce
<daml:DatatypeProperty new knowledge about the environment from the
rdf:ID=”LandUseID”> ontology.
<daml:toClass rdf:resource=”http://
www.w3.org/2000/10/ Explorative Land Use Change
XMLSchema#string”/> Analysis: Simulations of
<rdfs:range rdf:resource=”http:// Multi-Agent Interaction
www.w3.org/2000/10/
XMLSchema#nonNegativeInteger”/> Given ontology, multiple agents can interact
<rdf:type rdf:resource=”http:// meaningfully. The following example illustrates
www.w3.org/2001/10/ an agent’s understanding of the domain model
daml+oil#UniqueProperty”/> that can be used to simulate different disicion
</daml:DatatypeProperty> scenarios visualized by map rendering. A “what-
if” type of pollution simulation model has been
<rdfs:Class rdf:ID=”Pollutant”> developed in response to the decision of changing
<daml:unionOf rdf:parseType=”daml:c land use from one category to another category
ollection”> (see Figure 4). This model can be used to simulate
<Pollutant rdf:ID=”BOD”/> the consequence of an agent’s decision, when for
<Pollutant example, an agent changes a certain category of
rdf:ID=”TotalNitrogen”/> land use from “vacant” to “landfill.” The result is
<Pollutant a corresponding increase or decrease in pollution
rdf:ID=”TotalPhosphorous”/> content. Every request to change in land use type
</daml:unionOf> results in recalculation of the mass export of pol-
</rdfs:Class> lutant and corresponding statistics. The resulting
pollution map can be visualized with multiple
<daml:DatatypeProperty theme overlay. The system logs individual user’s
rdf:ID=”pollutant”/> preferences to input into mediating algorithm to
resolve conflict among user preferences of land
<…..X..……> use choice. Built on top of the ESRI’s ArcIMS,
the system uses ArcXML (native XML encod-
The full XML serialization is not shown ing of spatial object) to communicate between
here due to space limitations. In the example, the custom middleware and the Web mapping
the daml:collection represents a DAML+OIL server. The services offered by the Web mapping
extension of RDF to provide a “shorthand” server are similar to the requirements of OGC’s
representation of structured list that defines implementation specification for a Web map
triples. It should be noted that the daml:Class service (WMS).
rdf:ID=”ManagementAgency” is derived from The map server contains registered model
an external URI of Opencyc (OpenCyc, 2006) components which are instantiated following a

353
Semantic Interoperability of Geospatial Services

Figure 4. Implementing map service for simulation of land use change

request from an agent. The server side application feature, visual queries, rendering legends cor-
processes the agent’s request and makes necessary responding to feature types (classification for
updates in the database to reflect the correspond- both continuous and unique data type), multiple
ing changes of the pollutant coefficients. Every themes, and overlays. The solution space of the
request to change in land use type results in re- explorative scenario generated by the simulation
calculation of the mass export of pollutants and process can be further resolved using different a
corresponding statistics. The processed result is mediation algorithm in an evolutionary process.
then sent back to the Web server and then to the For instance, genetic algorithms can be used to
client agents. For a given planning zone, the en- construct links between an agent’s template and a
vironmental regulation of land use constraints is possible solution space and let the system evolve
stipulated by the management agency. The domain until a mutually acceptable solution converges.
ontology includes several such constraints:

Zone (low_density_residential) →Per CONCLUSION


mittedLandUseCategory(multifamily_
dwelling) In this article, how the use of semantic reason-
∀x,y∃d HighDensityResZone(x)^Preserva ing can be used to aggregate and reason over
tionZone(y)→MinDistApart((x,y),d) geographic features taken from multiple sources
has been demonstrated. It has been noted that a
Similarly, other spatial contiguity constraints or semantic layer is essential to fulfill the objec-
topologic constraints can be expressed in domain tives of e-government’s geospatial portal. To the
ontology in relation to the neighboring zone or best of knowledge, there is no current standard
land use. The system provides cartographic ren- specification for re-use of spatial models which
dering options for typical mapping manipulation could enable high level effective communication
procedures such as selecting and identifying a among different agents. There is still a need for a

354
Semantic Interoperability of Geospatial Services

generic formalism to link geo-spatial models to CEC. (2003). Linking up Europe: the importance of
the domain specific application classes. Using interoperability for e-government services- com-
a multi-agent paradigm, a promising direction mission of the European communities (Commis-
to solve complex spatial problems that would sion Staff Working Paper, No. 801). Commission
be otherwise difficult to solve has been demon- of the European Communities & Interchange of
strated. The OSIRIS framework holds that the Data between Administrations (IDA).
description of the infrastructure of spatial data is
CEC. (2006). European interoperability frame-
essential for ontology-assisted interoperability of
work for pan-European e-government services-
heterogeneous sources. It has been demonstrated
Communication from the commission to the
that ontology can be used to provide a common
council and the European parliament. Brussels:
context for the semantic grounding for spatial
Commission of the European Communities.
application models which agents can relate to
their native terminologies and thereby enable ac- Craglla, M., & Signoretta, P. (2000). From global
cess to multiple services. Spatial services can be to local: the development of local geographic
composed or manipulated along with other type information strategies in the United Kingdom.
of services, including Web services. The imple- Environment and Planning, B(27), 777–788.
mentation of the sample prototype of OSIRIS
Cummens, P. (2003). Geospatial one-stop portal is
framework indicates that a scaleable Web based
key to President’s e-government strategy. ArcNews
spatial ontology can be developed using ontology
Online, Summer.
modeling language to enable real world spatial
decision-making. However, further research is Egenhofer, M., & Mark, D. (1995). Naive geog-
needed to understand the requirements of ontol- raphy. Paper presented at the International Con-
ogy language to represent the semantic structure ference COSIT ‘95. Lecture Notes in Computer
for complex spatial and temporal objects. This is Science, Berlin.
especially important for ontology matching and
Farquhar, A., Fikes, R., & Rice, J. (1996). The on-
merging of complex spatial systems. In particu-
tolingua server: a tool for collaborative ontology
lar, the implication of imprecision related to the
construction. Stanford, CA: Knowledge Systems
finite resolution observation of an agent is not
Laboratory-Stanford University.
well addressed in current ontology research. A
robust specification for expressing topology and FGDC. (2006). Geospatial metadata standards.
mereological aspect of spatial reasoning in a se- Retrieved September 2006, from http://www.
mantic language is still needed. Further research fgdc.gov/metadata/geospatial-metadata-standards
is necessary to identify the mechanism by which
FIPA. (2006). The foundation for intelligent
spatial representation and different level of explic-
physical agents. Retrieved May, 2006, from http://
itness at multiple scales affects an agent’s logical
www.fipa.org/
reasoning and comprehension of spatial processes.
Fonseca, F., & Egenhofer, M. (1999). Ontology-
driven geographic information systems. In C. B.
REFERENCES Medeiros (Ed.), 7th ACM Symposium on Advances
in Geographic Information Systems (pp. 14-19).
Cali, A., Calvanese, D., Colucci, S., Noia, T. D., Kansas City: ACM Press.
& Donini, F. M. (2004). A description logic based
approach for matching user profiles. Unpublished
manuscript.

355
Semantic Interoperability of Geospatial Services

GAO. (2003). Geographic information systems: Horrocks, I., Patel-Schneider, P. F., & Harmelen,
Challenges to effective data sharing (Testimony F. v. (2003). From SHIQ and RDF to OWL: The
before the subcommittee on technology, informa- making of a Web ontology language. Journal of
tion policy, intergovernmental relations and the Web Semantics, 1(1), 7–26. doi:10.1016/j.web-
census, committee on government reform. House sem.2003.07.001
of Representatives No. GAO-03-874T). United
Islam, A. S., Bermudez, L., Beran, B., Fellah, S.,
States General Accounting Office.
& Piasecki, M. (2006). Ontology for geographic
Goh, C. H. (1997). Representing and reason- information—metadata (ISO 19115:2003). Re-
ing about semantic conflicts in heterogeneous trieved May, 2006, from http://loki.cae.drexel.
information sources. Boston: Sloan School of edu/~wbs/ontology/iso-19115.htm
Management, MIT.
Kashyap, V., & Sheth, A. (1996). Semantic het-
Gruber, T. R. (1993). A translation approach to erogeneity in global information system: The
portable ontology specifications. Knowledge role of metadata, context and ontologies. In M.
Acquisition, 5(2). doi:10.1006/knac.1993.1008 Papazoglou & G. Schlageter (Eds.), Cooperative
information systems: Current trends and directions
Guarino, N. (1997). Semantic matching: Formal
(pp. 139-178). London: Academic Press.
ontological distinctions for information organiza-
tion, extraction, and integration. In M. Pazienza Lassila, O., & Swick, R. (2004). Resource de-
(Ed.), Information extraction: A multidisciplinary scription framework (RDF) model and syntax
approach to an emerging information technology specification. Retrieved from http://www.w3.org/
(pp. 139-170). Frascati, Italy: International Sum- TR/REC-rdf-syntax/
mer School.
Malucelli, A., Palzer, D., & Oliveira, E. (2006).
Guarino, N., & Giaretta, P. (1995). Ontologies Ontology-based services to help solving the het-
and knowledge bases: Towards a terminologi- erogeneity problem in e-commerce negotiations.
cal clarification. In N. Mars (Ed.), Towards very Electronic Commerce Research and Applications,
large knowledge bases: Knowledge building and 5, 29–43. doi:10.1016/j.elerap.2005.08.002
knowledge sharing (pp. 25-32).
Nedovic-Budic, Z., & Pinto, J. K. (1999). Interor-
Haynes, K. A., & Fotheringham, A. S. (1984). ganizational GIS: Issues and prospects. The Annals
Gravity and spatial interaction models. Beverly of Regional Science, 33, 183–195. doi:10.1007/
Hills, California: Sage Publications. s001680050100
He, M., Jennings, N. R., & Leung, H. F. (2003). Nyerges, T. (1989). Information integration for
On agent-mediated electronic commerce. multipurpose land information systems. URISA,
IEEE Transactions on Knowledge and Data 1, 27–38.
Engineering, 15(4), 985–1003. doi:10.1109/
OpenCyc. (2006). OpenCyc 1.0. Retrieved May,
TKDE.2003.1209014
2006, from http://www.cyc.com/cyc/opencyc/
Horrocks, I. (1998, May 5-- 8). The FaCT sys-
Osman, H., & El-Diraby, T. E. (2006, June 14-16).
tem. Paper presented at the TABLEAUX ‘98, In
Interoperable decision support model for routing
Automated Reasoning with Analytic Tableaux
buried urban infrastructure. Paper presented at
and Related Method, International Conference
the Joint International Conference on Comput-
Proceedings, Oisterwijk, The Netherlands.
ing i& Decision Making in Civil and Building
Engineering, Montreal.

356
Semantic Interoperability of Geospatial Services

Padmanabhuni, S. (2004). Semantic interoper- Smith, B. (1996). Mereotopology: A theory


ability for service oriented architecture (White of parts and boundaries. Data & Knowledge
Paper): Infosys Technologies Limited. Engineering, 20, 287–303. doi:10.1016/S0169-
023X(96)00015-8
Peng, Z. R., & Tsou, M. H. (2003). Internet GIS:
Distributed geographic information services for Vetere, G., & Lenzerini, M. (2005). Models for
the Internet and wireless networks. John Wiley. semantic interoperability in service oriented ar-
chitectures. IBM Systems Journal, 44.
Radwan, M., Bishir, Y., Emara, B., Saleh, A., &
Sabrah, R. (2005, April 16-21). Online cadastre Warnecke, L., Beattie, J., Cheryl, K., & Lyday,
portal services in the framework of e-government W. (1998). Geographic information technology
to support real state industry in Egypt. Paper in cities and counties: A nationwide assessment.
presented at the 8th International Conference Washington, DC: American Forests.
of Global Spatial Data Infrastructure (GSDI-8),
Wayne, L. (2005). Metadata in action: expand-
Cairo, Egypt.
ing the utility of geospatial metadata. Federal
Rich, E. (1989). Stereotypes and user modeling. Geographic Data Committee.
In A. Kobsa & W. Wahlster (Eds.), User models
Yeap, W. K., & Handley, C. C. (1990). Four im-
in dialog systems. Springer.
portant issues in cognitive mapping. Otago.AI Lab.
Sikder, I., & Gangapadhayay, A. (2002). Design
and implementation of a Web-based collaborative
spatial decision support system: Organizational ENDNOTES
and managerial implications. Information Re-
sources Management Journal, 15(4), 34–49.
1
E-government Act of 2002, P.L. 107-347
(December 17, 2002)

This work was previously published in International Journal of Intelligent Information Technologies, Vol. 4, Issue 1, edited by
V. Sugumaran, pp. 31-51, copyright 2008 by IGI Publishing (an imprint of IGI Global).

357
358

Chapter 22
New Discovery
Methodologies in GIS:
Improving the Information
Retrieval Process

Nieves R. Brisaboa
University of A Coruña, Spain

Miguel R. Luaces
University of A Coruña, Spain

Diego Seco
University of A Coruña, Spain

ABSTRACT
In the last decade, the availability of on-line resources, and also the number of users accessing those
resources, has grown exponentially. The information retrieval process, which aims at the improvement
of the access to such resources, has been the focus of interest of many researchers. The presence of geo-
graphic data in these repositories of information is surprisingly high (for example, note that most of the
web pages about business contain information about the locations of their offices). In order to properly
manage this geographic data, the information retrieval process has been extended using architectures,
data structures, and other techniques developed by the GIS community. This has meant the beginning
of a new research field called Geographic Information Retrieval. In this chapter, the authors present a
study of the state-of-the-art of this new field, and they also highlight the main open problems that will
concentrate efforts during the next years.

DOI: 10.4018/978-1-4666-2038-4.ch022

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
New Discovery Methodologies in GIS

INTRODUCTION field of corporate GIS and it remarks the future


importance of geographic information. Thanks
The need to manage information has been one of to these initiatives, many public organizations
the key factors behind the consolidation of infor- are working in the development of spatial data
mation technology as an essential driving force for infrastructures (GSDI, 2011) that enable them to
the development of our society. Over the years, share their spatial information.
many system architectures, index structures, and These two research areas have progressed
other components have been proposed with the independently over the years. On the one hand,
fundamental goal of allowing efficient access to the index structures and techniques from the IR
information stored in huge document databases. field do not take into account the spatial nature
The research field that focuses on this goal is of geographic references that appear in text docu-
called Information Retrieval (IR) (Baeza-Yates & ments. On the other hand, spatial index structures
Ribeiro-Neto, 1999) and it started with the work are not directly applicable in information retrieval
of Salton (1963). This research field has recently systems. However, users increasingly demand
undergone a spectacular development motivated services that allow them to locate the informa-
by the growth of the Internet and the need to search tion in its spatial context and even to access this
the Web. A very important feature of IR is that it information using queries that take into account the
deals with the problem of retrieving information spatial information. These demands have caused
by its content rather than its metadata. Thus, there that researchers in each area have began to pay at-
are a number of techniques for retrieving docu- tention to the other one resulting in a new research
ments of various types: texts, images, sound and field called Geographic Information Retrieval
video files, etc. (GIR). The aim of this field is to propose new
Textual information often includes geographic system architectures, index structures, and other
references in the text (for example, press releases components in order to develop systems to retrieve
usually mention the place where the events hap- documents both thematically and geographically
pened). Taking these geographic references into relevant in response to queries of the form <sub-
account provides added value to classic informa- ject, place>. An example of the type of queries
tion retrieval systems. The research on Geographic studied in this new field is the following: “Ph.D.
Information Systems (GIS) (Worboys, 2004) has dissertations regarding geographic information
dedicated much effort to study the special fea- systems published in Spain.” The reader familiar
tures of geographic information and to develop with classic information retrieval systems knows
systems able to use and take advantage of them. that the relevance of the documents in a textual
This field has received much attention in recent search engine is based on the frequency of the
years due to recent improvements in hardware words that appear in the text of the documents.
that have made possible the development of such Therefore, if the word Spain does not appear ex-
systems by many organizations. In addition, two plicitly in a document its relevance will be low
international organizations ISO (ISO/IEC, 2002) with respect to this query. This happens even if the
and the Open Geospatial Consortium (OGC, word Madrid appears in the document (or any other
2003) are undertaking a major collaborative autonomous region, province or city of Spain)
effort to define standards and specifications to because traditional IR systems are not prepared
develop interoperable systems. At the European to take into account the special characteristics
level, the INSPIRE (Infrastructure for Spatial of the geographic information space (e.g., the
Information in Europe) directive (European Com- contained by spatial relationship between Madrid
mission, 2011) has enabled a breakthrough in the and Spain). Query expansion techniques in clas-

359
New Discovery Methodologies in GIS

sical IR systems reformulate queries by adding are similar to those of IR systems, that is, objects
new terms to the original query in an attempt to belong to the result with a certain probability. Fol-
provide a better context (Baeza-Yates & Ribeiro- lowing the example queries, in the GIS query only
Neto, 1999) (for instance, in the previous example monuments in the municipalities adjacent are part
we could expand the query using the term thesis of the results. On the other hand, in a GIR system
that is related with the term dissertation). Some query, monuments located in a city very close to
examples of these query expansion techniques are: London may be relevant to the user.
term reweighting, local clustering, local context In this chapter, we review the most important
analysis, or those based on thesaurus. However, contributions made in recent years to the field of
all the particularities of the geographic references geographic information retrieval as part of the
are not properly represented by any of them. description of our system architecture and index
Among the topics of interest in the area of structure (Brisaboa, et al., 2010).
geographic information retrieval are the definition
of system architectures, index structures, and other
components that allow to model, capture, store, SYSTEM ARCHITECTURE
manipulate, access, retrieve, analyze, and display FOR GEOGRAPHIC
information efficiently. In addition, these tasks INFORMATION RETRIEVAL
involve additional difficulties over the same tasks
in the area of IR because of the special features Figure 1 shows our proposal for the system ar-
and requirements of geographic information. chitecture of a geographic information retrieval
Despite the common geographical nature of system. The architecture can be divided into
the information, there are two fundamental differ- three independent layers: the index construction
ences on the requirements between GIS and GIR workflow, the processing services, and the user
systems that must be taken into account. First, the interfaces. The influence of GIS architectures and
spatial component of the queries in GIR system spatial data infrastructures can be clearly identi-
is much simpler than the queries that are usually fied. This influence is also reflected in the use of
posed to GIS because the latter involve complex the standards WMS (OGC, 2002) (map generation)
spatial relationships. For example, a typical query and SFS (OGC, 2006) (geographic data storage).
in GIS can be monuments located in municipali- The bottom part of the figure shows the index
ties adjacent to the location of a particular hotel. construction workflow, which in turn consists of
In this case, the municipality where the hotel is three modules: the document abstraction module,
located must be found, and then the monuments the index structure, and the index construction
in adjacent municipalities must be retrieved. In module.
contrast, a typical query to a GIR system can be The processing services are shown in the
monuments in London where the only spatial re- middle of the figure. The Geographic Space Ontol-
lationship to check is whether the spatial scope of ogy Service used in the spatial index construction
the document lies within the geometry associated is shown on the left side. This service is used
with London. extensively in the index construction module. On
The second difference is related to the type the right side, one can see the two services that
of results expected for queries to both systems. are used to solve queries. The rightmost one is the
In GIS queries, the expected results are similar query evaluation service, which receives queries
to those of traditional databases, that is, objects and uses the index structure to solve them. The
in the database either belong to the result or not. other service is a Web Map Service following the
However, in GIR systems, the expected results OGC specification (OGC, 2002) that is used to

360
New Discovery Methodologies in GIS

Figure 1. System architecture

create cartographic representations of the query may be stored using different file formats (plain
results. On top of these services a Geographic text, XML, etc.), but also because their content
Information Retrieval Module is in charge of schema may be different. The set of attributes
coordinating the task performed by each service that have to be stored in the index may be differ-
to respond to the user requests. ent in each document collection. For instance, a
The topmost layer of the architecture shows the document collection may have a set of attributes
two user interfaces that exist in the architecture: (such as document ID, author, and document text),
the Administration User Interface and the Query whereas other document collection may have a
User Interface. different set (such as document ID, summary, text,
author, and source).
To solve this problem, we have defined an
INDEX CONSTRUCTION abstraction for documents similar to the one used
WORKFLOW in the Lucene text search engine (Apache, 2011).
We have extended this idea adding the spatial in-
Document Abstraction dexing possibility. In our abstraction, a document
consists of a set of fields, each one with a value
Given that the system must be generic, it must sup- that is extracted from the document text. Each
port indexing several kinds of documents. These field can either be stored, indexed, or both. If a
documents will be different not only because they field is stored, its contents are stored in the index

361
New Discovery Methodologies in GIS

structure and they can be retrieved by a query. If In more recent works (Martins, et al., 2005;
a field is indexed, then this field is used to build Chen, et al., 2006), the authors survey the work
the index structure. Furthermore, a field can be in the SPIRIT project and propose improvements
indexed textually, spatially, or in both indexes. to the system and the algorithms defined. In their
work, two naive algorithms are proposed: Text-
Indexing First and Geo-First. Both algorithms use the
same strategy: one index is first used to filter the
Some work has been done to combine textual documents (textual index in Text-First and spa-
indexes and spatial indexes in structures able to tial index in Geo-First), the resulting documents
solve the queries of interest in GIR systems. These are sorted by their identifiers and then filtered
structures can be broadly classified into hybrid using the other index (spatial index in Text-First
structures (i.e., textual and spatial indexes are kept and textual index in Geo-First). These two naive
separate) and double-index structures (i.e., both algorithms provide a broad classification of GIR
indexes are merged in one single structure). The index structures. Note that it can also be used to
index structure proposed in the SPIRIT project classify hybrid structures. Figure 2 shows the three
(Vaid, et al., 2005) is based on the combination basic structures according to this classification.
of a grid (Nievergelt, et al., 1981) and an inverted The left most structure belongs to the double-index
index. In Vaid et al. (2005), the authors conclude class and it can be both Text-First and Geo-First
that keeping separate text and spatial indexes, depending on the algorithm used to solve the
instead of merging both in one, results in less queries. The other two are hybrid structures. The
storage costs but it could lead to higher response first one belongs to the Geo-First class (the spatial
times. Keeping both indexes separated has many index is always accessed first) and the second one
advantages (Martins, et al., 2005). First of all, all belongs to the Text-First class (the textual index
textual queries can be efficiently processed by is always accessed first).
the textual index and all spatial queries can be In Zhou et al. (2005), the use of an inverted
efficiently processed by the spatial one. Moreover, index and an R-tree is proposed. Authors combine
queries combining textual and spatial aspects are both structures in the three ways described above
supported. Updates in each index are handled in- and they conclude that keeping both indexes
dependently, which makes easier the addition and separated is less efficient than combining them
removal of data. Finally, specific optimizations can (a similar conclusion had been presented in Vaid
be applied to each individual indexing structure. et al. [2005]) and that the use of the R-tree out-
performs the efficiency of the grid based structures.

Figure 2. GIR index structures

362
New Discovery Methodologies in GIS

In Chen et al. (2006), authors propose the use • GeopoliticalEntity is subclass of


of space-filling curves (Morton, 1966; Böhm, GeographicalRegion
et al., 1999) and compare the resulting structure • GeographicalRegion is subclass of
both with the grid based and the R-tree based GeographicalThing and
structures. Space-filling curves are based on the • GeographicalThing is subclass of
storage of the spatial objects according to the SpatialThing.
order determined by a filling curve. According
to their experiments, the use of the space-filling That is, these four classes are organized into a
curves outperforms both the grid and the R-tree superclass-subclass hierarchy, which is also known
based approaches. as taxonomy. Subclasses specialize (are subsumed
Finally, in the STEWARD project (Lieberman, by) their superclasses. GeopoliticalEntity has four
et al., 2007), a double-index structure based on an subclasses: PopulatedPlace, Country, Continent,
inverted index and a Quad-tree (Nelson & Samet, and Region. All the individuals are members of
1986) is presented. In addition, the authors propose these subclasses. These four subclasses have an
the use of a query scheduler in charge of choosing additional necessarily asserted condition regard-
a Text-First or a Geo-First algorithm according to ing their relations with each other. They are con-
which index may return fewer results (this decision nected by the property spatiallyContainedBy that
is based on statistics collected by the system). describes the existence of a spatial relationship
Nevertheless, none of these approaches among them. For instance, all the individuals of
take into account the relationships between the class PopulatedPlace are spatiallyContainedBy
geographic objects that they are indexing. A individuals of class Region (described in OWL
structure that can properly describe the specific as PopulatedPlace spatiallyContainedBy only
characteristics of geographic space is an ontol- [AllValuesFrom] Region). Figure 3 shows an ex-
ogy, which is a formal explicit specification of ample of these relationships. Ontology classes are
a shared conceptualization (Gruber, 1993). An represented as circles, individuals as rectangles,
ontology provides a vocabulary of classes and and the relationships as labeled lines.
relations to describe a given scope. In Brisaboa et After having defined this ontology, we can
al. (2010), we present an index structure based on define an spatial index structure based on it. This
an ontology of the geographic space that describes structure is a tree with four levels, one for each of
the concepts in our domain and the relationships the subclasses of GeopoliticalEntity. The top-most
that hold between them. Our spatial ontology is level contains a node for each of the instances of
described in OWL-DL (W3C, 2011) and it can be the class Continent. Each node in this level refer-
downloaded from the following URL: http://lbd. ences the instances of the class Country that are
udc.es/ontologies/spatialrelations. OWL classes connected by the spatiallyContainedBy relation-
can be interpreted as sets that contain individu- ship. The levels of Region and PopulatedPlace
als (also known as instances). Individuals can be are built using the same strategy. That is, the
considered instances of classes. Our ontology structure of the tree follows the taxonomy of the
describes eight classes of interest: SpatialTh- ontology. Figure 4 shows the spatial index structure
ing, GeographicalThing, GeographicalRegion, built from the instances shown in Figure 3.
GeopoliticalEntity, PopulatedPlace, Region, The main advantage of this spatial index
Country, and Continent. In our ontology there structure over other alternatives is that intermedi-
are hierarchical relations among SpatialThing, ate nodes in the structure have a meaning in the
GeographicalThing, GeographicalRegion, Geo- geographic space and they can have additional
politicalEntity because: information associated. For instance, we can as-

363
New Discovery Methodologies in GIS

Figure 3. Ontology instances

sociate a list of documents that reference a given structure also refer to all nodes in its subtree.
Country and use this list of documents to solve Furthermore, the index structure is general in the
combined textual and spatial queries. Moreover, sense that the ontology of geographic space can
given that there is a superclass-subclass relation- be adapted to each particular application. For
ship between the levels, the bottom levels can example, if a particular application uses a re-
inherit the properties of the top levels. Particu- stricted area of the geographic space where the
larly, the documents associated to a node in the classes Continent and Country are not necessary

Figure 4. Example of the index structure

364
New Discovery Methodologies in GIS

and, on the other hand, the classes Province, document. This stage comprises two steps. First,
Municipality, City, and Suburb are needed, we the system analyses the document fields that are
could define a different ontology of space and spatially indexable and extracts candidate location
base the index structure on it as long as the rela- names from the text (i.e., discovery of location
tionship spatiallyContainedBy still holds between names). In a second step, these candidate locations
the classes. Finally, we could define additional are processed in order to determine whether the
spatial relationships in the ontology such as spa- candidates are real location names, and, in this
tiallyAdjacent and maintain these relationships case, to compute their geographic locations (i.e.,
in the index structure to improve the query capa- translation of location names to a geographic
bilities of the system. model).
This structure automatically performs query The main problem that can happen at this point
expansion over the geographic component of a is the ambiguity of the geographic references. A
query. In classical information retrieval systems, recent research by Garbin and Mani (2005) claims
the use of ontologies to perform query expansion that more than the 67% of the geographic refer-
is well known as an ontology can be regarded as ences cited in texts are ambiguous. Furthermore,
a generalization of a thesaurus. However, our in Tjong et al. (2003) two kinds of ambiguity are
structure does not use an additional geographic presented. First, a location name can be ambiguous
ontology but it is defined using one. Thus, the query (polysemy). For instance, “London” is the capital
expansion process is intrinsic to the nature of the of the United Kingdom and it is a city in Ontario,
index and not an additional process. Furthermore, Canada too. Second, there can be multiple names
our whole structure combines this geographic for the same geographic location, such as “Los
ontology-based index with a textual index (an Angeles” and “LA.” A third class of ambiguity
inverted index), and classical query expansion could be considered. This is due to the use of
techniques can be used over the textual compo- the same word to refer both a place name and a
nent achieving full query expansion (in the sense organization, company, or person (e.g., Santiago).
that both the textual and geographic components
of a query are expanded). Considering again the Discovery of Location Names
example Ph.D. dissertations in Spain, using this
full query expansion technique we could retrieve Unlike geographic information systems, infor-
documents containing the terms thesis and Madrid. mation in GIR systems is not structured. It is
not possible to know a priori where geographic
Geo-Referencing of Documents references are stored, nor their categories (e.g.,
city, state, country, etc.). In this kind of systems,
This is probably the most complex stage of the geographic references are contained in the text
workflow. In this stage, a geographic footprint is of the indexed documents. Therefore, these texts
assigned to each document allowing its spatial have to be analyzed in order to discover the geo-
indexing. A geographic footprint shows the geo- graphic references.
graphic scope of the document and it can be set In this analysis, all the spatially indexable
as a list of geographic coordinates, a bounding fields are processed in order to discover the place
box grouping that coordinates, etc. For example, names contained within. There are two Linguistic
if the cities of London and Liverpool are cited in Analysis techniques that are widely used for this:
a document, the geographic coordinates of these Part-Of-Speech tagging (Brill, 1992) and Named-
cities or the minimum bounding box containing Entity Recognition (Chinchor & Robinson, 1997;
them can be used as geographic footprint of the Pustejovsky, et al., 2005). On the one hand, Part-

365
New Discovery Methodologies in GIS

Of-Speech tagging is a process whereby tokens are we will assume that the cited place is Santiago de
sequentially tagged with syntactic labels, such as Chile. Perform this process in an automatic, and
verb or gerund. On the other hand, Named-Entity even semi-automatic, way becomes a challenge
Recognition is the process of finding mentions of for the GIR community. Bruno Martins (2008)
predefined categories such as the names of per- describes in his Ph.D. some basic principles to
sons, organizations, locations, etc. Combine both guide this automatic disambiguation process:
techniques is a good solution to discover possible
place names contained in the text of documents. In • One referent per discourse: A location
our prototype, we use the Natural Language Tool name cited several times in the same text
LingPipe (Alias-i, 2011) to find locations. It is a is likely to mean the same place. For ex-
suite of Java libraries for the linguistic analysis ample, if Santiago is cited several times in
of human language free for research purposes a document it should refer always either
that provides both Part-Of-Speech tagging and Santiago de Compostela or Santiago de
Named-Entity Recognition. LingPipe involves the Chile.
supervised training of a statistical model to rec- • Related referents per discourse:
ognize entities. The training data must be labeled Geographic references appearing in the
with all of the entities of interest and their types. same document tend to refer to related lo-
In spite of the good performance of these lin- cations. In our previous example, we use
guistic analysis techniques, when the discovered the place name A Coruña or Atacama to
location names are translated to a geographic disambiguate the place name Santiago.
model many problems related with the ambiguity • Default senses: Important places are
of the location names arise. Although these prob- more likely to be referenced. Therefore,
lems mainly affect the next step (i.e., translation if no clues are available to disambiguate
of location names), some issues are related with a location name, the most important place
this step. First, the system must determine if the should be assigned. For example, countries
discovered location names are true place names. are most important than cities, a capital is
Gazetteers have been widely used for this purpose. more important than each other city, a city
A Gazetteer is a geographical dictionary that is more important than a street, etc.
contains, in addition to location names, alterna-
tive names, populations, location of places, and In the SPIRIT project (Jones, et al., 2001; Jones,
other information related to the location. Although et al., 2003; Jones, et al., 2004; Fu, et al., 2005),
some years ago the availability of these gazetteers a spatial ontology is used instead of a gazetteer.
was very scarce (Petasis, et al., 2000), nowadays Therefore, searches in the ontology both check
there are many resources that provide this kind if the location name is a true place and provide a
of information. A more complex problem is the disambiguation based on the height of the node
disambiguation of a location name once it has in the ontology. This follows the default senses
produced a true positive in the gazetteer. Many principle (the height of the regions is less than the
clues within the whole text of the documents height of the cities). Besides this project, the most
are used by human beings to disambiguate each important works in the area are: Web-a-where
location name cited in the text. For example, if (Amitay, et al., 2004), which uses spatial contain-
the location name Santiago is cited in a document ers in order to identify locations in documents;
near other location names, such as A Coruña, we MetaCarta (the commercial system described by
will assume that the cited place is Santiago de Rauch et al. [2003]), which uses a natural language
Compostela. However, if it is cited near Atacama,

366
New Discovery Methodologies in GIS

processing method; and STEWARD (Lieberman, the topographic representation that makes the
et al., 2007), which uses an hybrid approach. area relevant.
In the SPIRIT project, each location name
Translation of Location Names cited in the text of a document is translated to a
bounding box and the footprint of a document
Once the location names have been located and consists of several bounding boxes (one for each
disambiguated, they have to be translated to a location name cited in its text). This schema was
geographic model, i.e., a geographic footprint also used by Zhou et al. (2005). In Smith and Crane
has to be assigned to each document in order to (2001), authors propose the use of a set o points
make it indexable by a spatial index. There are as the footprint of a document. Thus, the footprint
some differences in the methods proposed in the of a document consists of the geographic coordi-
bibliography. The first one is the type of geographic nates of all the locations cited in its text. These
object used to represent those footprints. Some of coordinates are weighted by the frequency of the
the common options chosen for this purpose are: location name in the document. Then, the centroid
geographic points (the geographic coordinates of this set of points and its standard deviation are
of all the places cited in document), minimum computed according to the weights of the points,
bounding boxes (the boxes of minimum extent that and all the points that are more than twice from
cover the geographic positions of all the places the centroid are prune (remaining points make up
cited in the document), and the centroid of such the footprint of the document).
bounding boxes. A second feature that makes Finally, as we mentioned in the previous sec-
proposed methods different is the uniqueness tion, in the Web-a-where (Amitay, et al., 2004)
of the footprint. Although documents are better project the disambiguation process is based on
described when several footprints are allowed, spatial containers. These containers are defined
because many distant places can be cited in the according to the topological relationships that
same document (consider for example a document exist in a gazetteer. Most of these relationships
about the evolution of the world economy), many belong to the class part-of (for example, Galicia
of the approaches in the bibliography suggest the is part-of Spain). Once place names have been
use of a single footprint. The usage of one footprint disambiguated, all the related places are merged
simplifies the process and improves the perfor- in a taxonomy. The levels in this taxonomy are
mance of both the indexing and querying process. ordered according to their relevance and those
One of the pioneering projects, previous to the levels which height is less than a threshold make
SPIRIT project, aiming at the geo-referencing of up the footprint of the document.
contents in digital libraries is GIPSY (Georefer- In our prototype, we have developed a service
enced Information Processing SYstem) (Larson, based on an ontology of the geographic space
1995). In this project, each location name is trans- that is built using a Gazetteer (Geonames, 2011)
lated to a geographic representation (for example, and a Geometry Supplier (NIMA, 2011). This
a polygon) and a weighting value is assigned to it. service uses information available in the gazetteer
These values depend on features intrinsic to the (such as, place type, population, capital, etc.). All
content of the documents (e.g., frequency of the these data are combined to compute the intrinsic
location name in the text of the document). Then, importance of each place.
all these geographic representations are combined
in three-dimensional topographic representations
that consider the weighting values. Finally, a
threshold determines the minimum elevation of

367
New Discovery Methodologies in GIS

QUERY EVALUATION SERVICE All scores are guaranteed to be 1.0 or less. More
information about the Lucene scoring can be found
The Query Evaluation Service is the component in Gospodnetic and Hatcher (2005) and Apache
in charge of using the index structure to answer Lucene (2011).
the queries posed by the users. Moreover, in
order to return a useful result, this service must Pure Spatial Queries
also provide a relevance ranking of the results.
In this section, we describe the types of queries An example of this type of queries is “retrieve all
in GIR systems, the algorithms to solve them documents that refer to the following geographic
and the equations used to compute the relevance area.” The geographic area in the query can be a
of the result for each of such types: pure textual point, a query window, or even a complex object
queries, pure spatial queries, and hybrid queries. such as a polygon. The spatial index that is part
Note that the concept of relevance of a docu- of the index structure is used to solve them. Given
ment, although well known in the field of IR, that a document in the result set of a query can
had not been introduced in the GIS before of the include geographic references to one or more lo-
arisen of the GIR systems. In Godoy and Rodrí- cation names relevant to the query, the relevance
guez (2004) some qualitative measures for the of the document d with respect to the query q due
spatial relevance of a document are introduced to each location name l has to be computed. We
based on concepts such as bounding boxes, dis- denote this relevance as toponymRelevanceq,d,l. We
tances, overlapping, and relative sizes. In Jones guarantee that this value is 1.0 or less in order to
et al. (2001), an hybrid approach combining the make the integration of both spatial and textual
distance in an ontology of the geographic space relevancies easier. In van Kreveld et al. (2005) both
and the Euclidean distance in the geographic space spatial and textual relevancies are also normalized
is presented. Furthermore, in the context of the to values between 0 and 1. Finally, we compute
Tumba project (Martins, et al., 2005; Andrade the relevance of the document d with respect to
& Silva, 2006), authors use some well-known the query q as the maximum relevance due to any
semantic concepts (e.g., adjacency, connectivity, location name (Equation 1).
etc.) to calculate this relevance when an ontology
is available in the system. A different approach, spatialRelevanceq ,d
which does not assume the use of an ontology, but
= max{toponymRelevanceq ,d ,l }
based on similar concepts is presented in Zhou
(1)
et al. (2005).
The computation of toponymRelevanceq,d,l for
Pure Textual Queries queries specified selecting a node in the spatial in-
dex is a simplification of the previous one because
These are queries such as “retrieve all documents
in this case we have the certainty that the query
where the words hotel and sea appear.” The textual
refers to a specific node in the tree. Therefore, the
index that is part of the index structure is used to
documents associated to this node have relevance
solve them. In our prototype we use Lucene to
1.0. The relevance of a document associated to
implement this textual index, and thus, the rel-
the nodes in the subtree is computed using the
evance ranking depends on its scoring. Lucene
previous equation. This is reflected in Equation 2.
scoring uses a combination of the vector space
model and the boolean model of information
retrieval (Baeza-Yates & Ribeiro-Neto, 1999).

368
New Discovery Methodologies in GIS

toponymRelevanceq,d,l = classical algorithm of spatial indexes. Therefore,


1 if l is specifiied in the query the computation of toponymRelevanceq,d,l must be
 performed using the distance (dcq,l) and the over-
 0.5 distance
 otherwise lap area (oaq,l) between the query window and the
importance
location name. Equation 3 defines this computa-
(2)
tion. We use parameters wdc and woa to weight the
relevance of each factor and we use the importance
The sketch of the index structure shown in
of the location name to assign more relevance to
Figure 5 is useful to understand the difference
the most important nodes that reference the loca-
between both types of queries. Each node in the
tion name.
figure is annotated with its importance between
parentheses. On the one hand, when the user
wdc × dcq ,l + woa × oaq ,l
specifies a query using the location name England, toponymRelevanceq ,d ,l =
the relevance of a document due to England (an importance
important city of Arkansas) will be higher than (3)
the relevance due to England (a small city of
Oppland Fylke), and lower than the relevance Equation 4 defines how to calculate the rel-
due to England (a part of the United Kingdom). evance due to the distance to the query window.
Concrete values of relevance are 0.5 for England centerDistanceq,l represents the Euclidean distance
in Arkansas, 0.33 for England in Oppland, and 1.0 between the location name l and the query window
for England in the United Kingdom. Moreover, q. Similarly, cornerDistanceq is a weight factor
the relevance of the document due to important that represents the maximum distance to the center
cities of England (UK) like London or Liverpool of the window.
is 0.5. This value is high enough to be taken into
consideration. On the other hand, when the query centerDistanceq ,l
dcq ,l = 1 − (4)
is specified selecting the node for England in cornerDistanceq
Arkansas the relevance of a document due to this
node is 1.0 because the user explicitly indicates
The relevance due to the overlap area with the
the interest about documents with geographic
query window is calculated according to Equation
references to that location.
5. When the geometry stored in the node is a point
Finally, in the case of queries specified using
(leaf node), the overlap area is not significant.
a query window the nodes are selected using the
Thus, we use 1/[area(q)+1]0.15. This value depends

Figure 5. Queries specified using a location name vs. queries specified selecting a node

369
New Discovery Methodologies in GIS

only on the query window and is inversely propor- Moreover, three distances used to compute the
tional to its area. The concrete equation has been relevance due the distance to the query window
constructed based on the average area of the nodes (Equation 4) are shown. The weight factor cor-
in each level of the ontology of geographic space. ner distance is depicted as a solid line, and the
distances from Rome and Abruzzi to the center
oaq,l = of the query window are depicted as dotted lines.

 1

 if l is a po
oint

  
0.15
Textual Queries over a Geographic
 area(q) + 1
   Area

  area(l ∩ q) area(l ⊗ q) 

max  0, −  otherwise

  

 
 area(q) area(l) In this case, a geographic area of interest is given
(5) in addition to the set of words. An example is
“retrieve all documents with the word hotel that
Figure 6 uses an example query window in cen- refer to the following geographic area.” As in the
tral Italy to clarify the aforementioned equations. previous case, the geographic area in the query
The bounding boxes of two regions, Umbria and can be a point, a query window, or a complex
Abruzzi, and a populated place, Rome, are shown object. Both the textual and spatial indexes are
in this figure. These bounding boxes as well as used to solve them. Hence, we use the previous
the query window q are used to compute the area equations to compute the spatial and textual
of their respective entities (i.e., area(Umbria), relevance. Equation 6 defines how we combine
area(Abruzzi), and area(q)). The region of both relevance rankings. The weighted sum of the
Umbria is used to illustrate the relevance due to spatial and textual ranking values is one of the
the overlap area (Equation 5). This relevance is simplest methods and is commonly used (Mar-
computed using the area of the intersection of the tins, et al., 2005; Zhou, et al., 2005; Andrade &
region with the query and the area in the part of Silva, 2006). Furthermore, it is the base of more
the region that does not intersect with the query. complex ranking methods (Yu & Cai, 2007). We

Figure 6. Queries specified using a query window

370
New Discovery Methodologies in GIS

assume wt = 1-ws and calculate wt to normalize the USER INTERFACES


differences between textual and spatial rankings.
The system has two different user interfaces: an
relevanceq ,d = wt × textualRelevanceq ,d administration user interface and a query user
interface. The administration user interface can
+ ws × spatialRelevanceq ,d
be used to manage the document collection. The
(6)
main functionalities are: creation of indexes,
addition of documents to indexes, loading and
Textual Queries with Place Names
storing indexes, etc. Figure 7 shows a screen-shot
of the query user interface. This interface was
In this type of queries, some of the words are
developed as a web application using the Open
place names. For instance, “retrieve all documents
Layers API (OSGeo, 2011). This API provides
with the word hotel that refer to Spain.” Both the
a number of utilities for manipulating maps and
spatial and textual indexes are used to solve them
adding content to the map.
and, as in the previous case, the weighted sum
In the previous section, we have presented the
can be used to combine the relevancies obtained
types of queries that can be solved with this sys-
in each index.
tem. These queries have two different aspects: a
Our index structure presents an improvement
textual aspect and a spatial aspect. In our prototype,
over the rest of proposals: it can easily perform
the query user interface allows the user to indicate
query expansion on geographic references because
both aspects. The spatial context can be introduced
the index structure is built from an ontology of the
in three ways that are mutually exclusive:
geographic space. Consider the following query
“retrieve all documents that refer to Spain.” The
• Typing the location name: In this case,
query evaluation service will discover that Spain
the user types the location name in a text
is a geographic reference and then the internal
box. This is the most inefficient way be-
node that represents the geographic object Spain
cause the system has to obtain all the geo-
will be quickly located. Then, all the documents
graphic references associated with the
associated to this node are part of the query result.
place name typed by the user, which is a
Moreover, all the children of this node are geo-
time-expensive process.
graphic objects that are contained within Spain
• Selecting the location name in a tree: In
(for instance, the city of Madrid). Therefore, all
this case, the user sequentially selects a
the documents referenced by the subtree are also
continent, a country within this continent, a
part of the result of the query. The consequence is
region within the country, and a populated
that the index structure has been used to expand
place within the region. If the user wants
the query because the result contains not only
to specify a location name of a higher level
those documents that include the term Spain, but
than a populated place, it is not necessary
also all the documents that contain the name of
to fill in all the levels. The operation is very
a geographic object included in Spain (e.g., all
easy and intuitive because the interface is
the cities and regions of Spain). This geographic
implemented with a custom-developed
query expansion is complementary to other clas-
component using the AJAX technology
sical techniques expanding the textual component
that retrieves in the background the loca-
of the query.
tion names for the next level. When the
user selects a place in the component, the

371
New Discovery Methodologies in GIS

Figure 7. Query User Interface

map on the right zooms in automatically to The work presented in this paper can also be
the selected place. easily integrated in a spatial data infrastructure.
• Selecting the spatial context of interest First, the query processing functionality can be
in the map: The user can navigate using implemented as a Web Processing Service (OGC,
the map on the right to visualize the spatial 2007), which aims at the standardization of the
context of interest. After that, a rectangle way that GIS calculations are made available
can be drawn over it. The system will use on the Internet. This service can then be used to
this rectangle as the query window if the index document collections such as administra-
user did not type a place name or did not tive archives.
select a location name. Furthermore, some of the internal components
of the architecture could also be implemented us-
ing Web Processing Services. As an example, in
CONCLUSION Ladra et al. (2008) we show how the Geographic
Space Ontology Service, which is enclosed in
Nowadays, Geographic Information Systems the intermediate layer of the architecture, can be
constitute a consolidated area in computer sci- integrated in a Spatial Data Infrastructure using
ence. Many impressive research results have been the OGC Web Processing Service. Moreover, even
presented and, more importantly, an effective though our prototype uses a database as Gazetteer
technology transfer has improved the manage- Service, other implementations could easily use a
ment of geographic information in traditional Gazetteer from a spatial data infrastructure.
information systems. For example Spatial Data Finally, the GIR architecture proposed in
Infrastructures are prominent examples of this this chapter is a perfect complement to OGC
technology transfer. catalogues. A GIR system like the one described

372
New Discovery Methodologies in GIS

in this chapter can be built over a collection of structures considering the semantic of the space
OGC catalogues. In this case, instead of index- present valuable improvements for GIR systems
ing digital documents, the system would index over classical spatial index structures (e.g., query
metadata records which are composed of textual expansion, relevance ranking, etc.). Efficient
descriptions and geographic references of the da- implementations of these structures will also be
tasets and services. The user interface of the GIR crucial in their applicability. Finally, some efforts
system would allow a user to query the catalogues must be devoted to improve the system usability.
with keywords and a geographic reference and The spatial component of GIR systems entails the
it would return a list of metadata records ranked need of user interfaces that allow users to properly
by their relevance. An additional advantage of a express the spatial scope and represent the results
GIR system is that an OGC catalogue is built by in a user-friendly way.
a human being that decides what is relevant and
that categorizes the documents using thesaurus.
Furthermore, an OGC catalogue is oriented to REFERENCES
structured searches applying filters to metadata
fields. On the other hand, a GIR system catego- W3C. (2011). World wide consortium - OWL
rizes the documents automatically deciding what 2 web ontology language document overview.
is relevant using the contents and allowing non- Retrieved June 9, 2011 from http://www.w3.org
structured searches. /TR/owl2-overview.
The application of this research to improve Alias-i. (2011). LingPipe, natural language tool.
the task of information retrieval turns out to be a Retrieved June 9, 2011 from http://www.alias-i.
rather challenging problem. Due the importance com /lingpipe.
of this task (millions of users perform queries
on-line each day), many research efforts have Amitay, E. Har’El, N., Sivan, R., & Soffer, A.
been devoted to this new research topic and a (2004). Web-a-where: Geotagging web content.
new research area, named Geographic Informa- In Proceedings of the ACM Special Interest Group
tion Retrieval, has emerged covering the topics on Information Retrieval Conference, SIGIR 2004,
in the intersection between GIS and IR. In this (pp. 273-280). ACM Press.
chapter, we have presented the state-of-the-art in
Andrade, L., & Silva, M. J. (2006). Relevance
this new field. Our own architecture (Brisaboa, et
ranking for geographic IR. In Proceedings of the
al., 2010) was used as the framework that embraces
Workshop on Geographic Information Retrieval,
the majority of the topics in the area.
GIR 2006. ACM Press.
Many new research topics have emerged in
this young area. First, Geo-referencing tech- Apache. (2011). The Apache Lucene project. Re-
niques must be improved to solve the ambiguity trieved June 9, 2011 from http://lucene.apache.org.
problems. This is a crucial task as its influence
Apache Lucene. (2011). Apache Lucene - Scoring.
in the precision and recall of the system is ex-
Retrieved June 9, 2011 from http://lucene.apache.
tremely high. Second, the development of new
org /java/2_2_0/scoring.html.
index structures is also very important as they
are a key factor in the performance of the GIR Baeza-Yates, R., & Ribeiro-Neto, B. (1999).
systems. The use of ontologies in these structures, Modern information retrieval. Boston, MA: Ad-
enhancing their semantic, represents an exciting dison Wesley.
problem that was just sketched with some initial
proposals. As we shown in Brisaboa et al. (2010),

373
New Discovery Methodologies in GIS

Böhm, C., Klump, G., & Kriegel, H.-P. (1999). Geonames. (2011). GeoNames gazetteer. Re-
XZ-ordering: A space-filling curve for objects with trieved June 9, 2011 from http://www.geonames.
spatial extensión. In Proceedings of the Advances org.
in Spatial Databases Conference, SSD 1999, (pp.
Godoy, F., & Rodríguez, A. (2004). Defining
75-90). Springer.
and comparing content measures of topologi-
Brill, E. (1992). A simple rule-based part of speech cal relations. GeoInformatica, 8(4), 347–371.
tagger. In Proceedings of the Conference on Ap- doi:10.1023/B:GEIN.0000040831.81391.1d
plied Natural Language Processing, ANLP 1992,
Gospodnetic, O., & Hatcher, E. (2005). Lucene
(pp. 152-155). Association for Computational
in action. New York, NY: Manning.
Linguistics.
Gruber, T. R. (1993). A translation approach to por-
Brisaboa, N. R., Luaces, M. R., Places, A. S., &
table ontology specifications. Knowledge Acquisi-
Seco, D. (2010). Exploiting geographic refer-
tion, 5(2), 199–220. doi:10.1006/knac.1993.1008
ences of documents in a geographical information
retrieval system using an ontology-based index. GSDI. (2011). Global spatial data infrastructure
GeoInformatica, 14(3), 307–331. doi:10.1007/ association, online documentation. Retrieved June
s10707-010-0106-3 9, 2011 from http://www.gsdi.org.
Chen, Y.-Y., Suel, T., & Markowetz, A. (2006). ISO/IEC. (2002). Geographic information - Refer-
Efficient query processing in geographic web ence model. ISO 19101. Geneva, Switzerland: ISO.
search engines. In Proceedings of the ACM Special
Jones, C. B., Abdelmoty, A. I., & Fu, G. (2003).
Interest Group on Management of Data, SIGMOD
Maintaining ontologies for geographical infor-
2006, (pp. 277-288). ACM Press.
mation retrieval on the web. In On The Move to
Chinchor, N., & Robinson, P. (1997). MUC-7 Meaningful Internet Systems, ODBASE 2003, (pp.
named entity task definition. In Proceedings of 934-951). Springer.
the Message Understanding Conference. ACM.
Jones, C. B., Abdelmoty, A. I., Fu, G., & Vaid,
European Commission. (2011). INSPIRE direc- S. (2004). The SPIRIT spatial search engine:
tive. Retrieved June 9, 2011 from http://www. Architecture, ontologies and spatial indexing. In
ec-gis.org /inspire. Proceedings of the International Conference on
Geographic Information Science, (pp. 125-139).
Fu, G., Jones, C. B., & Abdelmoty, A. I. (2005). On-
Springer.
tology-based spatial query expansion in informa-
tion retrieval. In [Springer.]. Proceedings of On the Jones, C. B., Alani, H., & Tudhope, D. (2001).
Move to Meaningful Internet Systems, ODBASE, Geographical information retrieval with ontolo-
2005, 1466–1482. doi:10.1007/11575801_33 gies of place. In Proceedings of the International
Conference on Spatial Information Theory, COSIT
Garbin, E., & Mani, I. (2005). Disambiguating top-
2001, (pp. 322-335). Springer.
onyms in news. In Proceedings of the Conference
on Human Language Technology and Empirical
Methods in Natural Language Processing, HLT-
EMNLP 2005, (pp. 363-370). The Association for
Computational Linguistics.

374
New Discovery Methodologies in GIS

Jones, C. B., Purves, R., Ruas, A., Sanderson, M., Nievergelt, J., Hinterberger, H., & Sevcik, K. C.
Sester, M., van Kreveld, M. J., & Weibel, R. (2002). (1981). The grid file: An adaptable, symmetric
Spatial information retrieval and geographical multi-key file structure. In [Springer.]. Proceed-
ontologies an overview of the SPIRIT project. In ings of the European Cooperation in Informatics,
Proceedings of the ACM Special Interest Group ECI, 1981, 236–251.
on Information Retrieval Conference, SIGIR 2002,
NIMA. (2011). National imagery and mapping
(pp. 387-388). ACM Press.
agency, vector map level 0. Retrieved June 9, 2011
Ladra, S., Luaces, M. R., Pedreira, O., & Seco, D. from http://www.mapability.com.
(2008). A toponym resolution service following
OGC. (2002). OpenGIS web map service
the OGC WPS standard. In Proceedings of the
implementation specification. OpenGIS Project
International Sympoium on Web and Wireless
Document 01-068r3. New York, NY: Open GIS
Geographic Information System, W2GIS 2008,
Consortium, Inc.
(pp. 75-85). Springer.
OGC. (2003). OpenGIS reference model. OpenGIS
Larson, R. R. (1995). Geographic information
Project Document 03-040. New York, NY: Open
retrieval and spatial browsing. In Geographic
GIS Consortium, Inc.
Information Systems and Libraries: Patrons,
Maps, and Spatial Information (pp. 81–123). OGC. (2006). OpenGIS implementation specifica-
Champaign-Urbana, IL: University of Illinois. tion for geographic information - Simple feature
access - Part 2: SQL option. OpenGIS Project
Lieberman, M. D., Samet, H., Sankaranarayanan,
Document 06-104r3. New York, NY: Open GIS
J., & Sperling, J. (2007). STEWARD: Architecture
Consortium, Inc.
of a spatio-textual search engine. In Proceedings
of the ACM International Symposium on Advances OGC. (2007). OpenGIS web processing service
in GIS, ACMGIS 2007, (pp. 186-193). ACM Press. implementation specification. OpenGIS Project
Document 05-007r7. New York, NY: Open GIS
Martins, B. (2008). Geographically aware web
Consortium, Inc.
text mining. Unpublished Doctoral Dissertation.
Lisbon, Portugal: Universidade de Lisboa. OSGeo. (2011). Open layers API. Retrieved June
9, 2011 from http://openlayers.org.
Martins, B., Silva, M. J., & Andrade, L. (2005).
Indexing and ranking in geo-IR systems. In Petasis, G., Cucchiarelli, A., Velardi, P., Paliouras,
Proceedings of the Workshop on Geographic G., Karkaletsis, V., & Spyropoulos, C. D. (2000).
Information Retrieval, GIR 2005, (pp. 31-34). Automatic adaptation of proper noun dictionar-
ACM Press. ies through cooperation of machine learning and
probabilistic methods. In Proceedings of the ACM
Morton, G. M. (1966). A computer oriented
Special Interest Group on Information Retrieval
geodetic data base and a new technique in file se-
Conference, SIGIR 2000, (pp. 128-135). ACM
quencing. Technical Report. Palo. Alto, CA: IBM.
Press.
Nelson, R. C., & Samet, H. (1986). A consistent
Pustejovsky, J., Knippen, R., Littman, J., & Saurí,
hierarchical representation for vector data. In Pro-
R. (2005). Temporal and event information in
ceedings of the Annual Conference on Computer
natural language text. Computers and the Hu-
Graphics and Interactive Techniques, SIGGRAPH
manities, 39, 123–164.
1986, (pp. 197-206). ACM Press.

375
New Discovery Methodologies in GIS

Rauch, E., Bukatin, M., & Baker, K. (2003). A Yu, B., & Cai, G. (2007). A query-aware docu-
confidence-based framework for disambiguating ment ranking method for geographic information
geographic terms. In Proceedings of the Human retrieval. In Proceedings of the Workshop on
Language Technology Conference, HLT-NAACL Geographic Information Retrieval, GIR 2007.
2003, (pp. 50-54). Association for Computational ACM Press.
Linguistics.
Zhou, Y., Xie, X., Wang, C., Gong, Y., & Ma, W.
Salton, G. (1963). Associative document re- Y. (2005). Hybrid index structures for location-
trieval techniques using bibliographic informa- based web search. In Proceedings of the ACM
tion. Journal of the ACM, 10(4), 440–457. International Conference on Information and
doi:10.1145/321186.321188 Knowledge Management, CIKM 2005, (pp. 155-
162). ACM Press.
Smith, D. A., & Crane, G. (2001). Disambiguating
geographic names in a historical digital library.
In Proceedings of the European Conference on
KEY TERMS AND DEFINITIONS
Research and Advanced Technology for Digital
Libraries, ECDL 2001, (pp. 127-136). Springer. Architecture: Formal description of the set
Tjong, E. F., Sang, K., & Meulder, F. D. (2003). of services and structures that compose a system.
Introduction to the CoNLL-03 shared task: Geographic Information Retrieval (GIR):
Language-independent named entity recognition. Research area related with the properly manage-
In Proceedings of the Conference on Natural ment of the geographic information available in
Language Learning, CoNLL 2003, (pp. 142-147). IR repositories.
Springer. Geo-Reference: Geographic references con-
tained in repositories of information in textual
Vaid, S., Jones, C. B., Joho, H., & Sanderson, M. form (e.g., place names, postal codes, etc.)
(2005). Spatio-textual indexing for geographical Geo-Referencing Process: Common task in
search on the web. In Proceedings of the Sympo- GIR involving the location of geo-references and
sium on Spatial and Temporal Databases, SSTD its translation to a formal model of the geographic
2005, (pp. 218-235). Springer. space.
van Kreveld, M., Reinbacher, I., Arampatzis, Information Retrieval (IR): Research area
A., & van Zwol, R. (2005). Multi-dimensional related with the access to non-structured reposi-
scattered ranking methods for geographic infor- tories of information.
mation retrieval. GeoInformatica, 9(1), 61–84. Relevance: Measure of the importance of a
doi:10.1007/s10707-004-5622-6 resource regarding to a specific query.
Workflow: Sequence of tasks that have to be
Worboys, M. F. (2004). GIS: A computing per- performed to achieve a goal.
spectiva. Boca Raton, FL: CRC.

This work was previously published in Discovery of Geospatial Resources: Methodologies, Technologies, and Emergent
Applications, edited by Laura Díaz, Carlos Granell and Joaquín Huerta, pp. 37-55, copyright 2012 by Information Science
Reference (an imprint of IGI Global).

376
377

Chapter 23
A Multidimensional Model
for Correct Aggregation of
Geographic Measures
Sandro Bimonte
Cemagref, UR TSCF, France

Marlène Villanova-Oliver
Laboratoire d’Informatique de Grenoble, France

Jerome Gensel
Laboratoire d’Informatique de Grenoble, France

ABSTRACT
Spatial OLAP refers to the integration of spatial data in multidimensional applications at physical, logical
and conceptual levels. The multidimensional aggregation of geographic objects (geographic measures)
exhibits theoretical and implementation problems. In this chapter, the authors present a panorama of
aggregation issues in multidimensional, geostatistic, GIS and Spatial OLAP models. Then, they illustrate
how overlapping geometries and dependency of spatial and alphanumeric aggregation are necessary
for correctly aggregating geographic measures. Consequently, they present an extension of the logical
multidimensional model GeoCube (Bimonte et al., 2006) to deal with these issues.

INTRODUCTION of facts and dimensions. Facts are described by


values called measures. Dimensions, structured
A Data Warehouse (DW) is a centralized reposi- in hierarchies, permit to analyze facts accord-
tory of data acquired from external data sources ing to different analysis axes and at different
and organized following the multidimensional levels of detail. An instance of a dimension is a
model (Kimball, 1996) in order to be analyzed by set of members organized according to the hier-
On-Line Analytical Processing (OLAP) systems. archies. An instance of the conceptual model is
Multidimensional models rely on the concepts represented by a hypercube whose axes are the
dimension members at the finest levels. Each cell
DOI: 10.4018/978-1-4666-2038-4.ch023 of a hypercube contains the value of the detailed

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
A Multidimensional Model for Correct Aggregation of Geographic Measures

measure. This basic cube (also called facts table) decisional hypothesis, and to guide their spatial
is then enhanced with cells that contain aggregated decision making processes. SOLAP technologies
values of the measures for each combination of have been usefully applied in several domains:
higher level’s members. Aggregation operators geo-marketing, urban, health, environment, crisis
applied on the measures must be specified in the management, etc. (Bédard et al., 2001) as they
conceptual model and depend on the semantics allow non-computer science users to exploit data-
of the application. The classical functions used bases, statistical analysis and spatial analysis tools
to aggregate numeric measures are the standard without mastering complex query languages and
SQL operations “COUNT”, “SUM”, “MIN”, Geographic Information Systems functionalities,
“MAX” and “AVG”. The multidimensional and understanding underlying complex spatial
model allows pre-computation and fast access to datasets. SOLAP redefines main multidimensional
summarized data in support of multidimensional concepts: spatial dimensions, spatial measures and
analysis through OLAP operators which permit to spatial aggregation functions. In this approach,
explore the hypercube. Drill operators (Roll-Up spatial measures are not numerical values, but
and Drill-Down) permit to navigate in the dimen- spatial objects (geometries) which are aggregated
sions hierarchies aggregating measures. Cutting using spatial aggregation functions (union, inter-
operators (Slice and Dice) select and project a part section, etc.) (Shekar et al., 2001). As shown in
of the hypercube. The multidimensional model this work, SOLAP models only partially support
and OLAP operators have been formalized in dependency of spatial and numerical values,
some logical models (Abello et al., 2006) as a which can lead to wrong aggregation of spatial
support to correct aggregation of measures which and numerical measures (geographic measures).
plays a central role in multidimensional analysis In this paper, we identify a three-step ag-
(Pedersen et al., 2001). They define constraints gregation process for the correct aggregation
on the aggregation functions in compliance with of geographic measures, and we formalize it by
the semantics of the measure and explicit the providing an extension of the logical multidimen-
dimensions that can be used in the multidimen- sional model, GeoCube (Bimonte et al., 2006). The
sional queries. model provides a set of rules to ensure the valid
Most of 80% of transactional data contain aggregation of geographic measures.
spatial information, which represents the form This paper is organized as follows. In the Sec-
and the location on the earth surface of real world tion ”Background”, we introduce main concepts
objects (Franklin, 1992). The heterogeneity of of geographic data and Spatial OLAP. Section
physical spaces and the strong spatial correlation “Related Work” discusses aggregation issues in
of thematic data (Anselin, 1989) are not taken into multidimensional databases, GIS, geostatistic
account into multidimensional models. Then, a and Spatial OLAP domains. We investigate the
new kind of systems have been developed, which problem of the correct aggregation of geographic
intended to integrate the spatial component of the measures in the Section “Research Motivations”.
geographic information into multidimensional The extension of the multidimensional model Geo-
analysis: Spatial OLAP (SOLAP) (Bédard et al., Cube is presented in Section “Correct Geographic
2001). Spatial OLAP allows decision-makers to Multidimensional Aggregation”. Conclusions and
explore, analyze and understand huge volume of discussions about implementation issues are given
geo-spatial datasets, in order to discover unknown in the Section “Conclusion and Discussion”.
and hidden knowledge, patterns and relations.
This useful information can help spatial analysts
and decision-makers to validate and reformulate

378
A Multidimensional Model for Correct Aggregation of Geographic Measures

BACKGROUND creating new geographic objects by splitting and/


or aggregating geographic objects. Aggregation of
Geographic Data geographic objects is performed using spatial ag-
gregation functions (i.e. union, centroid, etc.) (Tao
Geographic data are objects or phenomena of the & Papadias, 2005) to aggregate the spatial com-
real world described by both a spatial component ponent, and alphanumeric aggregation functions
and a semantic component (Degrene & Salgé, to aggregate alphanumeric attributes (SUM, AVG,
1997). The spatial component gives the object’s etc.) (Rigaux et al., 2002). Spatial disaggregation
geometry and its position on the earth’s surface functions are used to split geographic objects. A
(e.g. shape of a department). The semantic com- spatial disaggregation is a spatial function that
ponent is a set of descriptive properties (e.g. name does not preserve all the original geometries into
and population of a department) and of spatial, its final result (i.e. intersection, centroid, etc.).
thematic and map generalization relationships
that the object shares with other (geographic) Spatial OLAP
objects (e.g. a department belongs to a region).
Map generalization is the process of obtaining Integration of spatial data in OLAP leads to the
less detailed maps at different scales (Weibel & definition of Spatial OLAP (SOLAP). (Bédard
Dutton, 2001). et al., 2001) define SOLAP as “a visual platform
Geographic Information Systems (GIS) are built especially to support rapid and easy spatio-
main tools to handle geographic information temporal analysis and exploration of data fol-
(Longley et al., 2001). GIS allow storing, visual- lowing a multidimensional approach comprised
izing and analyzing geographic data. GIS store of aggregation levels available in cartographic
geographic data using two different models: vector displays as well as in tabular and diagram dis-
and raster models (Rigaux et al., 2002). The raster plays”. SOLAP systems integrates OLAP and GIS
model represents the space as a single continu- functionalities into a unique framework allowing
ous space by means of a grid. Thematic data are visualizing measures on maps, and triggering
associated with each cell of the grid. The vector OLAP and spatial analysis operators by simply
model represents the space through geographic interacting with the cartographic interactive com-
objects. A geographic object is a set of spatial ponent of the SOLAP client (Rivest et al., 2005;
objects composed by points. Alphanumeric at- Escribano et al., 2007).
tributes are associated with each spatial object. In order to exploit the spatial component into
Querying and analyzing methods for geographic OLAP systems, SOLAP re-think main OLAP con-
data extends classical ones, explicitly exploiting cepts: dimensions and measures. SOLAP models,
its spatial component (Longley et al., 2001). Some using the vector model, introduce geographic
models, representing the geographic informa- information as analysis axes thanks to spatial
tion as a set of uniform geographic objects have dimensions (Bédard et al., 2001; Fidalgo et al.,
been proposed in literature (Erwig & Schneider, 2004; Malinowski & Zimányi, 2005). A spatial
2000; Guting, 1998; Guting & Schneider, 1995, dimension is described by spatial hierarchies that
Voisard & David, 2002). They provide classical are characterized by the presence of the geometric
GIS operators such as spatial selection, overlay, attribute at the different levels (spatial levels). Spa-
merge, fusion, etc (Longley et al., 2001). These tial hierarchies, usually very complex, can present
operators allow selecting geographic objects us- spatial and classical levels. Members of different
ing thematic and spatial predicates (i.e. “Which spatial levels are related by topologic inclusion or
cities are located at 50 Km far from Paris?”) and intersection relationships (Malinowski & Zimányi,

379
A Multidimensional Model for Correct Aggregation of Geographic Measures

2005). An example of spatial hierarchy is French ena” dimension groups phenomena (i.e. fire,
administrative organization, which groups cities avalanche, etc.) into categories of phenomena
into departments and departments into regions. A (i.e. dangerous, etc.).
topological inclusion relationship exists between The measure is the damaged zone, which is
these geographic objects. described by a geometry, the number of damaged
Spatial data can be also used as analysis subject trees and the area. The area is a derived measure
using the concept of spatial measure. Different as it is calculated using other measures (Abello
definitions of spatial measure can be found in et al., 2006). Then, to calculate the area for the
literature: a collection of pointers to spatial objects aggregated geographic object, no aggregation
or the result of topological or metric operators function is applied. It is calculated using the
(Bédard et al., 2001; Malinowski & Zimányi, 2004; geometry resulting from the spatial aggregation.
Stefanovic et al., 2000). In (Bimonte et al., 2006), Figure 2 shows an example of the facts table
the concept of geographic measure is introduced and the cartographic representation of two geo-
as a geographic object representing a real world graphic measures.
entity and described by some alphanumerical at- The model allows users to answer queries like:
tributes and geometry. Aggregation of geographic “What regions have been damaged by fires during
measures corresponds to the aggregation of their 1978 ?”. This query implies the aggregation of
alphanumeric and spatial attributes by means of the geometry and the number of damaged trees.
classical, spatial and user-defined aggregation Some works study the aggregation of spatial
functions. and numerical measures in spatial data warehous-
An example of SOLAP application concerns es. Performance is fundamental for SOLAP analy-
the monitoring of natural phenomena in a park sis. Then, several authors, focusing on numerical
(Figure 1). The dimensions of that spatio-multi- measures and spatial dimensions, integrate spatio-
dimensional application are time and the type of temporal indexes with pre-aggregation techniques
phenomenon (e.g. fire, landslide, etc.). “Time” (i.e. Tao & Papadias, 2005; Rao et al., 2003 Zhang
dimension is described by a calendar hierarchy & Tsotras, 2005). These methods speed-up ag-
(“Day” < “Month” < “Year”). “Natural Phenom- gregation computation on evolving user-defined

Figure 1. Multidimensional application with a geographic measure

380
A Multidimensional Model for Correct Aggregation of Geographic Measures

Figure 2. Facts table and cartographic representation of geographic measures

spatial hierarchies. While numerical aggregation tidimensional models, GIS models, geostatistic
functions have been well defined (i.e. SUM, MIN, models and logical SOLAP models.
MAX, etc.), a standard set of spatial aggregation
functions for spatial measures (geometries) has Multidimensional Models
not been defined yet. (Shekar et al., 2001) classify
spatial aggregations as distributive, algebraic, and The correct aggregation of measures is of crucial
holistic in order to grant summarazability (c.f. importance in the multidimensional analysis
Sec “Multidimensional Models”) in spatial data process. The correctness of aggregation depends
warehouse. Lopez & Snodgrass (2005), and Silva on both the semantics of the measure and the
et al., (2008) formally define a set of spatial and multidimensional structure of the data warehouse
numerical aggregation functions (i.e. AvgArea, (Horner et al., 2004; Pedersen et al., 2001). For
Count_AtNorth, etc.). In addition to the conceptual example, the sum of pollution values has no
aspects of spatial aggregation, some efforts have sense, while the max or the min operators can
been done for improving queries computation safely be applied. Also, it is possible to sum
in large spatial data warehouses. They provide the populations of cities of a region, but to sum
indexes materialization, selection of aggregated these values on the time dimension is not correct
measures and computational geometry algorithms (the same inhabitants would be counted several
(i.e. Stefanovic et al., 2000; Han et al., 1998). times). This problem is known in OLAP literature
Finally, some models have been proposed for as Additivity (Kimball, 1996). A measure is: (1)
spatial data warehouses, which formally define Additive if the sum operator can be applied on all
SOLAP main concepts: spatial dimensions, spa- dimensions (2) Semi-additive if the sum can be
tial measures, multidimensional operators and applied on a subset of dimensions (3) Non additive
aggregation functions. In particular, the correct if the sum makes no sense whatever the dimen-
aggregation of geographic information is a quite sion. The correct aggregation means providing
complex task, which raises some unresolved aggregation constraints (i.e. a control on the type
problems as described in the next sections. of the aggregation, considering the semantics of
the measure (i.e. the nature of the measure) and
used dimensions). Some logical multidimensional
RELATED WORK models define explicitly the type of aggregation
functions that can be applied to measures (Abello
In this Section, we investigate problems and et al., 2006; Lenher, 1998; Pedersen et al., 2001;
solutions related to aggregation of geographic Trujillo et al., 2000). These models, based on
data according to different models: logical mul- the definitions introduced by Rafanelli & Ricci

381
A Multidimensional Model for Correct Aggregation of Geographic Measures

(1983), classify measures according to three dif- analysis dimension is not present. Indeed, all the
ferent types of aggregation functions that can be aggregation rules are defined exclusively on the
applied to them: ∑ (data can be added together, spatial dimension, while we claim that performing
e.g. population), ω (data can be used for average analysis on other dimensions (e.g. time dimension)
calculations, e.g. temperature) or c (constant data can reveal itself interesting.
implying no application of aggregation operators,
e.g. name). Considering only the SQL standard GIS Models
aggregation functions (AF) applying to each type
of data (c, ω and ∑), an inclusion relationship Spatial aggregation operators have been formal-
exists so that AFc = {COUNT} ⊂ AFω = {AVG, ized and implemented in Geographic Information
MIN, MAX, COUNT} ⊂ AF∑ = {SUM, AVG, Systems (GIS) (Rigaux & Scholl, 1995; Voisard
MIN, MAX, COUNT}. This inclusion relation- & David, 2002; Longley et al., 2001). In all these
ship allows deducing that “∑ data” which can be approaches, unlike the geostatistic solutions, the
summed can also be averaged and identified as the semantics of alphanumeric attributes are not taken
minimum or the maximum of a set (as “ω data”) into account. This lack is also evident from the
and that “ω data” can also be counted as “c data”. implementation point of view. Nowadays, com-
By transitivity, “∑ data” can also be counted. mercial GIS systems (i.e. ArcGIS, MapInfo, etc.)
implement only union and splitting operators. They
Geostatistic Models provide a simple control on the aggregations ap-
plied to alphanumeric attributes using only their
The aggregation of geographic information is cru- type (i.e. numeric, alphanumeric, etc.) but not their
cial in spatial analysis especially in the geostatistic semantics. These tools propose some numeric ag-
domain. Spatial aggregation operators (i.e. union, gregation functions (i.e. sum and average) for the
convex hull, etc.) and the management of aggre- numeric attributes and some particular methods, as
gations applied to alphanumeric attributes have for example the usage of a default value or of the
been widely discussed. Different frameworks, count operator, for textual attributes. They do not
dealing with the type of alphanumeric attributes consider the semantics of the attributes. Therefore,
and with their aggregation functions have been it is possible to sum temperatures or population
proposed to address the issue of disjoint spatial densities, which makes no sense. Moreover, all
units aggregation into bigger spatial units (Charre other spatial aggregation operators (i.e. the convex
et al., 1997; Chrisman, 1974; Chrisman, 1998). hull, etc.) only create new geometries without
Alphanumeric attributes are grouped into classes aggregating alphanumeric attributes.
and they are associated with particular aggregation
rules. For example, a rule defines that attributes Spatial OLAP Models
representing relative values such as the tempera-
ture or the population density, can be aggregated Some SOLAP logical models have been proposed
using a weighted average. Another rule expresses in literature (Ahmed & Miquel, 2005; Damiani &
that attributes used to represent raw quantitative Spaccapietra, 2006; Jensen et al., 2004; Pourab-
values, such as the number of inhabitants, can be bas, 2001; Sampaio et al, 2006, Silva et al., 2008).
aggregated using the SUM operator. For the disag- Ahmed & Miquel (2005) define a spatial
gregation process (moving to smaller units), the dimension according to the raster model as an
sum is forbidden as it makes non-sense (Charre infinite set of members. They simulate this “infi-
et al., 1997; Chrisman, 1974; Egenhofer & Frank, nite” set using interpolation functions to calculate
1986). Finally, we underline that the concept of measures in every point of the analysis region

382
A Multidimensional Model for Correct Aggregation of Geographic Measures

represented by spatial dimensions. Pourrubas by classifying measures according valid aggrega-


(2003) provides a formal framework to integrate tion functions (c.f. the Section “Multidimensional
multidimensional and geographic databases. The Models”). Nevertheless, the model does not ex-
model handles spatial dimensions in which mem- plicitly represent spatial measures nor associated
bers of different levels are related by topological aggregation constraints.
inclusion relationship. These models only handle Pedersen & Tryfona (2001) investigate pre-
numerical SQL aggregation functions. Sampaio et aggregation in multidimensional applications
al. (2006) define a logical model for spatial data with spatial measures. In particular, they study
warehouse using the object-oriented approach. the pre-aggregation of alphanumeric attributes
Spatial measures and numerical measures can be associated with spatial measures. In other words,
aggregated using SQL spatial and numerical ag- they investigate the correct aggregation of alpha-
gregation functions. The model does not provide numeric attributes of geographic objects according
any support for correct aggregation. Damiani & to their spatial components. The model represents
Spaccapietra (2006) define a model in which measures as bottom levels of dimensions. Ag-
all levels of a spatial dimension can be used as gregated measures are less detailed dimensions
measures, allowing multi-granular analysis. The levels. Since a topological inclusion relationship
model aims at supporting measures as geographic always exists between spatial objects at different
objects, but it does not explicitly represent spatial levels, the model does not allow applying spatial
and alphanumeric attributes. Then, no aggregation disaggregation functions. Then, only union can
constraint is defined on measures (attributes of be used. Spatial disaggregation functions, such
geographic objects). as intersection or centroid, can not be applied.
Silva et al., (2008) define formally the ele- To conclude, few SOLAP models consider
ments of a spatial data warehouse with particular aggregation constraints on spatial and numerical
attention to aggregation functions for spatial data. measures without correctly support aggregation
They introduce a set of aggregation functions that of geographic measures. Indeed, as shown in the
combine numerical and spatial functions. These next section, correct aggregation of geographic
aggregation functions are classified according to measures should take into account: semantics
whether numerical aggregation is scalar, distribu- of measures, used dimensions, overlapping ge-
tive or holistic, and whether spatial aggregation ometries and dependency between spatial and
is unary or n-ary function. However, these ag- alphanumeric functions.
gregation functions are not associated with any
aggregation constraints and numerical functions
are applied exclusively to metric values of spatial RESEARCH MOTIVATIONS
data (i.e. perimeter, etc.).
Only, Jensen et al., (2004) and Pedersen et al., The aggregation of geographic measures raises
(2001) define logical multidimensional models several problems from both theoretical and
for spatial data warehouses taking into account implementation points of view. In particular,
aggregation constraints. In particular, Jensen et alphanumeric and spatial functions used for the
al., (2004) propose a model for location-based aggregation are dependent: used spatial func-
services. It supports partial inclusion of spatial tion dictates alphanumeric functions that are
dimension members granting imprecise measures. allowed for non-spatial attributes. This implies
Measures are numerical and alphanumeric values. a redefinition of the OLAP additivity concept
The model extends (Pedersen et al., 2001) and it for spatio-multidimensional databases. Existing
grants correct aggregation of non-spatial measures spatio-multidimensional models do not complete-

383
A Multidimensional Model for Correct Aggregation of Geographic Measures

ly investigate this issue, which covers different measure. The number of trees is calculated as a
research domains: multidimensional databases, weighted average on the surface. Areas of Zone
geostatistic models and Geographic Information 1 of A and of Zone 2 of A are the same (55), and
Systems (GIS) as discussed in the previous sec- a weighted average on the surface is used for the
tion. Let us consider the application of Figure 1, number of trees. Then, this latter is the same (10)
and let us suppose we need to get information at for the two new zones. By the same way, spatial
the year level. A Roll-Up operation on the year and alphanumeric attributes values of Zone 1 of
level aggregates the geographic measures Zone A B and Zone 2 of B are calculated.
and Zone B. In such a situation, how to aggregate Then, using these geographic objects, we
these geographic objects? calculate another set of geographic objects apply-
Using union to aggregate the geometries, que- ing the average to the number of trees for the
ries like: “Where and how many trees have been geographic objects with the same geometry and
damaged by some fires during 1978?” (Query 1) coordinates (Figure 4). We use average because
can be answered. the aggregation is made on the time dimension,
To calculate the number of damaged trees which requires not counting several times the
aggregating on the time dimension, we propose same tree. For instance in Figure 3, Zone 2 of A
to apply the average operator, as the number of and Zone 1 of B have the same geometry and
trees is non-additive, and then to sum all these coordinates, and some trees (note that the number
averages in order to have the total of trees in the of trees varies in time) (Figure 3). Then, from
merged zone. More in details, Zone A (resp. Zone these two geographic objects, we create a new
B) is first splitted into two zones (Zone 1 of A and geographic object Zone V2 whose geometry is
Zone 2 of A (resp. Zone 1 of B, and Zone 2 of the same as the ones of Zone 2 of A and Zone 1
B)), and then its alphanumeric attributes values of B and whose number of trees is the average
are recalculated (trees and area) (Figure 3). Please (8=(10+6)/2) (Figure 4).
note that a user-defined function is used to calculate Finally, we apply union to these zones and sum
alphanumeric non-derived attributes of geographic the numbers of trees (Figure 5). In particular, the

Figure 3. Decomposition of geographic measure (Step 1)

384
A Multidimensional Model for Correct Aggregation of Geographic Measures

Figure 4. Vertical aggregation of trees using average

geometry of Zone E is the union of the geometries In this case, the multidimensional query is:
of Zone V1, Zone V2 and Zone V3. The number “Where and how many trees have been damaged
of trees is the sum of the number of trees of Zone by all fires during 1978?” (Query 2). The number
V1, Zone V2 and Zone V3 (24= 10+8+6). Without of trees cannot be calculated using the sum of the
this approach, damaged trees for the Zone E will averages because it is not representative of the
be erroneously 32 as it is the union of Zone A and number of trees in the intersected zone. Therefore,
Zone B and as the associated sum operation does instead of applying sum, we apply a weighted
not consider that the damaged trees of the Zone 2 average on the area (Figure 6). Consequently,
of A, Zone 1 of B are counted twice when aggre- the number of trees of Zone E is the same than
gating on the time dimension. Finally, the result the one of Zone V2 (8). Damages trees are not
of the Roll-Up operator is shown in Table 1. counted twice.
Let us use another spatial function: intersec- These two examples show that:
tion. In the OLAP context, intersection (Shekar
et al., 2001) is a spatial aggregation function, but 1. aggregation process has to take into account
it should be better to considered it as a spatial overlapping geometries and
disaggregation function following the geostatistic 2. alphanumeric aggregation functions applied
approach (c.f. Sec. “Geographic Data” and “Geo- to the descriptive attributes of geographic
statistic Models”). measures depend on their semantics (∑:
measures that can be summed, ω: measures
Table 1. Roll-up on the time dimension that can be averaged, c: measures that can be
only counted), on used dimensions (as for
Year Phenomenon Aggregated Zone classical OLAP measures), and on spatial
1978 Fire Zone E function (as in geostatistic models). In other
… … …

Figure 5. Aggregation of park’s zones on the time Figure 6. Aggregation of park’s zones on the time
dimension dimension

385
A Multidimensional Model for Correct Aggregation of Geographic Measures

words, additive measures cannot be added As discussed in Section “Related Work”, exist-
if a spatial disaggregation function is used. ing spatio-multidimensional models (Abello et al.,
2006; Damiani & Spaccapietra, 2006; Pourabbas,
Finally, the aggregation process of geographic 2001, Sampaio et al., 2006, Silva et al., 2008)
measures can be seen as a three-step process using do not introduce any constraint on spatial and
two alphanumeric aggregation functions (noted alphanumeric functions.
φ and κ) for each alphanumeric attribute and a Jensen et al. (2004) do not extend aggregation
spatial function (spatial aggregation or spatial constraints for spatial measures. Moreover, since
disaggregation) for the spatial attribute. the model provides a symmetrical representation
The three steps are: of measures and dimensions and since it does
not support dimensions attribute, each attribute
• Step 1-Calculate disjoint geographic ob- of a geographic object has to be represented as a
jects: Calculate the set of geographic ob- dimension which implies aggregating attributes
jects whose geometries are obtained using separately. This means that the model cannot sup-
the intersection operator on the geographic port the dependency of spatial and alphanumeric
measures (i.e. Figure 3). Alphanumeric at- functions in the aggregation process.
tributes are calculated using user-defined Only Pedersen & Tryfona, (2001) model the
functions. Geographic objects with the dependency of spatial and alphanumeric functions,
same geometry and coordinates could be but as described above, they restrict spatial func-
created. tions to union, discarding spatial disaggregation
• Step 2-Vertical Aggregation: Geographic functions. This limits the analysis capabilities of
objects created at step 1 with the same ge- the model because, as shown by our case study,
ometry and coordinates are unified to create spatial disaggregation functions could be neces-
one geographic object, by applying an ag- sary for spatial analysis.
gregation function (κ) to the alphanumeric
attributes (i.e. Figure 4). These aggrega-
tions have to be coherent with the seman- CORRECT GEOGRAPHIC
tics of the measures and with the dimen- MULTIDIMENSIONAL AGGREGATION
sions used in the multidimensional query
(Semantic of the measures “∑, ω, c data” In this section we introduce the base concepts
cf. Sec. “Multidimensional Models”). of the logical multidimensional model GeoCube
• Step 3-Horizontal Aggregation: (Bimonte et al., 2006) and its extension to cor-
Aggregate the geometries of original geo- rectly aggregate geographic measures.
graphic measures. Aggregate the alphanu-
meric attributes of the geographic objects An Overview of the GeoCube Model
created at step 2 (i.e. Figures 5 and 6). The
alphanumeric aggregations (φ) have to be Bimonte et al., (2006) present a logical multidi-
coherent with the semantics of the measure mensional model called GeoCube, which supports
(∑, ω, c) and with the used spatial function. measures and dimension members as geographic
objects. GeoCube allows the usage of a set of
It is important to underline that this process spatial and/or alphanumeric attributes as one
can be applied also to lines, points, etc. To best of single complex measure (geographic measures).
our knowledge, no work deals with this problem. It also proposes user-defined aggregation func-
tions for each attribute of the geographic object

386
A Multidimensional Model for Correct Aggregation of Geographic Measures

representing the aggregated measure. GeoCube Definition-Geographic Entity Schema: An


also provides an algebra that redefines common Entity Schema Se is a Geographic Entity
OLAP operators. Schema if the domain of one attribute is a
set of spatial objects.
Data Model Example 1: In the case study presented previously,
the geographic measure representing the
The main concepts of multidimensional data model zones is Szone = 〈geometry, nbDamagedTrees,
are: Entity, Hierarchy and Base Cube. The con- farea〉 where farea: dom(geometry) → N is a
cepts of Entity Schema and Entity Instance permit function to calculate the area of a zone. An
to represent indifferently the data of the analysis instance of Szone is 〈pt01, 20, 110〉 (Zone A)
universe: dimension members and measures. An (Figure 2).
Entity is a set of attributes and functions used to
represent derived attributes (data calculated using Entities are organized in hierarchies thanks
other data). Derived attributes are necessary to to the concepts of Hierarchy Schema and Hier-
model metric attributes (i.e. area, perimeter, etc.) archy Instance. A level of a hierarchy is an Entity
of geographic objects. Schema and a member is an Entity Instance. The
Entity Schemas and their instances are orga- Hierarchy Schema organizes levels into a lattice.
nized into hierarchies (Hierarchy Schema and Thanks to the Hierarchy Instance, the members’
Hierarchy Instance). The Base Cube represents levels are organized in a tree structure. The root
the facts table. is an instance of the Entity Schema which repre-
In what follows, we only provide the defini- sents the top level of the lattice represented by
tions that are necessary to describe the framework the Hierarchy Schema. Leafs are the instances of
we propose. the Entity Schema which represents the bottom
level of the lattice.
Definitions-Entity Schema and Entity Instance: This definition allows modeling non-balanced
An Entity Schema Se is a tuple 〈a1, …an, and non-strict hierarchies, which are necessary
[F]〉 where: for spatio-multidimensional applications (Ma-
◦◦ ai is an attribute defined on a domain linowski & Zimányi, 2005).
dom(ai)
◦◦ F, if it exists, is a tuple 〈f1,…fm〉 where Definition-Hierarchy: A Hierarchy Schema is a
fi is a function defined on a sub-set of tuple Hh = 〈L h, ⌊h, ⌈h, ⇞h〉 where:
attributes ar, …ak. ◦◦ Lh is a set of Entity Schemas,
◦◦ ⌊h and ⌈h are two Entity Schema and
An Instance of an Entity Schema Se is a tuple ⌈h contains one instance (‘all’),
ti such as ◦◦ /h is a partial order defined on the
levels of the hierarchy (L h ∪ ⌊h ∪ ⌈h)
• If F exists then ti = 〈val(a1), val(an), and /h is a lattice where ⌊h and ⌈h are
val(b1),…val(bm)〉 where val(ai) ∈ dom(ai) respectively the bottom and the top
and val(bj) = fj(val(ar),…val(ak)) levels of the order.
• If F does not exist, ti = 〈val(a1),…val(an)〉
where val(ai) ∈ dom(ai) We call Entity Schema levels (L(Hh)) the entities
belonging to the set {L h ∪ ⌊h ∪ ⌈h}. An Instance
F permits to model derived measures and/or of a Hierarchy Hh is a partial order ↑h defined on
dimension attributes. L(Hh), such as:

387
A Multidimensional Model for Correct Aggregation of Geographic Measures

• if ti ↑h tj then Si /h Sj, where ti ∈ I(Si) and the most detailed levels of all hierarchies. These
tj ∈ I(Sj) measures are not aggregated. Then, in order to
• ∀ ti not belonging to the top level, then ∃ tj represent a multidimensional query (i.e. cuboid),
such as ti ↑h tj we introduce the concepts of Aggregation Mode
• ∀ ti not belonging to the bottom level, ∃ tj and View. A multidimensional query defines the
such as tj ↑h ti dimension levels used (e.g. “Type” and “Year”),
the (geographic) object used as measure (“Zone”)
Schema and data of the spatio-multidimension- and a set of aggregation functions (Aggregation
al application are represented by Base Cube. Base Mode) to aggregate its attributes (e.g. union for
Cube Schema defines dimensions (Hierarchies) geometry and sum for number of trees) (Figure
and measures (e.g. the spatio-multidimensional 7). For simplicity, we provide only the definition
application schema of Figure 1). The instance of of the Aggregation Mode and we give a example
the Base Cube represents facts table data (e.g. data of View.
of Figure 2). In our approach, dimensions and mea- Aggregation Mode defines a function for each
sures are (geographic) objects, for example phe- non-derived attribute of the (Geographic) Entity
nomena and zones. Then, our model must define representing the measure. The result of the ag-
measures in the same way as dimensions levels. gregation is another (Geographic) Entity whose
Following the approach of Vassiliadis (1998), we attributes values are calculated using these func-
define a Base Cube as a tuple of Hierarchies and tions.
a boolean function which represents the tuples
of the facts table. Here, the bottom levels of the Definition-Aggregation Mode: An Aggregation
hierarchies are (geographic) objects. They can be Mode Θk is a tuple 〈Sa, Sb, Φ〉 where:
used as dimensions and measures. This definition ◦◦ Sa is an Entity Schema 〈a1,…am,
allows defining measures as (geographic) objects, [Fa]〉 (the detailed measure)
and not as numerical value. ◦◦ Sb is an Entity Schema 〈b1,…bp,
[Fb]〉 (the aggregated measure)
Example 2: The multidimensional model of Fig- ◦◦ Φ a set of p ad-hoc aggregation func-
ure 1 is represented by the Base Cube Schema tions ϕi
BCnaturalrisks = 〈Hnatural_phenomena, Htime, Hzone, δ〉
where Hnatural_phenomena, Htime and Hzone are the An Aggregation Mode defined on geographic
hierarchies representing the phenomena, objects (geographic measures) is called Geo-
time and zones. δ: I(Sphenomen)×I(Sday)×I(Szone) graphic Aggregation Mode.
is a boolean function defined on the bottom
levels of the hierarchies. The instance of BC- Definition-Geographic Aggregation Mode:
naturalrisks
represents the facts table (Figure 2). An Aggregation Mode Θk =〈Sa, Sb, Φ〉 is a
Geographic Aggregation Mode if Sa and Sb
In this application, each bottom level of a are Geographic Entity Schemas.
hierarchy (object and/or geographic object) can Example 3: Let the aggregation of the geo-
be used as measure. graphic Measures Szone be represented by
the Entity Schema Szone_agg = 〈geometry,
Multidimensional Query Model nbDamagedTrees, F〉.

Base Cube represents the basic cuboid, which is The area is a metric attribute of the damaged
represented by measures values associated with zone and so it can be directly and univocally

388
A Multidimensional Model for Correct Aggregation of Geographic Measures

Figure 7. Geographic aggregation mode: Zones of a park

calculated using the geometry. For this reason, it the set of Entity Schemas used for the multidimen-
has to be considered as a derived measure. Then, sional query, and an Aggregation Mode which
an example of Geographic Aggregation Mode for defines the Entity used as measure and aggregation
the zones is Θzone= 〈Szone, Szone_agg, Φunion〉 where functions to aggregate it.
Φunion is the set of functions:
Example 4: The multidimensional query Where
1. ϕ g e o m e t r y : d o m ( S z o n e . g e o m e t r y ) →
n
and how many trees (which zones) have
dom(Szone_agg.geometry), i.e. the Geometric been damaged by some fires during 1978?”
Union (Query 1, cf. Sec. “Research Motivations”)
2. ϕnbtrees:dom(Szone.geometry, Szone.nbDam- is represented by the View Vzones-year = 〈BC-
agedTrees) n → dom(S zone_agg .nbDam- naturalrisks
, 〈Sphenomenon, Syear〉, Θzone, γ〉. This query
agedTrees), i.e. the sum aggregates geographic measures along the
time dimenson. We note that the View is
An example of aggregation of two geographic composed of the Base Cube, the levels
measures (Zone 3 and Zone 4) using Θzone is shown used in the query Sphenomenon (phenomena)
in Figure 7. Szone is a schema entity with attributes and Sye(years) and the Aggregation Mode
a1 (the number of trees damaged by fires), a2 (the Θzone which defines Szone as measure and
geometry of the region) and a a3 area attribute a3 establishes how to aggregate it. The instance
(a3 is derived from the geometry). Zone 3 and Zone of Vzones-year is shown in Table 1.
4 in the figure are two Szone entities. Szone-aggreg is
another schema entity, which represents the result Correct Geographic Aggregation
of aggregating Zone 3 and Zone 4 (as shown in
the dark grey area in the right panel of the figure). In order to formalize the constraints that ensure
The attribute b1 in Szone-aggreg is the result of apply- the correct aggregation of geographic measures
ing the operator ϕ1 (sum) to attribute a1, and the according to the algorithm described in the Sec-
attribute b2 is the result of applying the operator tion “Research Motivations”, we formalize the
ϕ2 (union) to attribute a2. semantics of alphanumeric attributes of geo-
GeoCube introduces the concept of View to graphic measures and the dependency of spatial
represent the result of a multidimensional query. and alphanumeric functions, and then we introduce
A View is a tuple of a Base Cube (the facts table), the constraints.

389
A Multidimensional Model for Correct Aggregation of Geographic Measures

Semantics of Alphanumeric Attributes Example 5: In our example, SemanticsMeasure(


of Geographic Measures nbDamagedTrees) = AF∑, because the sum
of number of trees makes sense, and Dim
Following the approach to ensure correct aggrega- ensionsMeasure(nbDamagedTrees, Syear,
tion, described in the Section “Multidimensional Sphenomenon)= AFω, because number of trees
Models”, we introduce the functions Dimen- is not additive on the time dimension (the
sionsMeasure and SemanticsMeasure. level Syear, which is not the most detailed
They classify aggregation functions of at- level of the “Time” dimension, is used in
tributes of geographic objects in three different the multidimensional query).
groups, called aggregation types: AF∑, AFω and
AFc, where AF∑ > AFω > AFc, “>” being a total Dependency of Spatial and
order. Functions classified as AF∑ can be applied Alphanumeric Functions
to measures that can be summed, AFω are func-
tions that can be applied to measures that can be In order to model dependency of spatial and
averaged, and AFc are functions that can be applied alphanumeric functions, we introduce a function
to measures that can be only counted. SpatialType, which takes as inputs the type of
SemanticsMeasure takes into account the spatial function (AFU, AFΩ) and the semantics of
semantics of attributes. It takes as input an al- the alphanumeric attribute (SemanticsMeasure),
phanumeric attribute and returns an aggregation and returns the aggregation type (AF∑, AFω, or
type (AF∑, AFω, AFc). For example, functions that AFc) that can safely be applied to the alphanumeric
can be applied to the number of trees are AF∑, as attribute. Let us note spatial aggregation functions
number of trees can be summed. as AFU, and spatial disaggregation functions as
AFΩ. Then, spatial functions classified as AFU al-
Definition-SemanticsMeasure: For each Entity lows summing additive data (e.g. number of trees)
Schema Se, we assume a function Seman- because they preserve the set of input geometries
ticsType: A(Se) → {AF∑, AFω, AFc} which in the result (i.e. union, convex hull, etc.). AFΩ do
returns the aggregation type. not allow summing data as they leave out some
spatial objects (i.e. centroid, intersection, etc.).
DimensionsMeasure takes into account the se-
mantics of attributes, and used dimensions levels. Definition-SpatialType function: We define a
It takes as input an alphanumeric attribute and the function SpatialType: {AFU, AFΩ} × { AF∑,
dimensions levels of the View, and it returns an AFω, AFc} → { AF∑, AFω, AFc } as defined
aggregation type (AF∑, AFω or AFc). For example, in Table 2.
functions that can be applied to the number of trees
aggregating on the time dimension are AFω, as the The function SpatialType models the depen-
number of trees cannot be summed on the time dency of spatial and alphanumeric functions as
dimension (some trees must not be counted twice. defined by the Step 3 of our algorithm. In par-

Definition-DimensionsMeasure: For each Entity


Table 2. SpatialType function
Schema Se and set of levels of a View Ve,
we assume a function DimensionsMeasure: SpatialType AF∑ AFω AFc
A(Se)× L(Ve) → {AF∑, AFω, AFc} which AFU AF∑ AFω AFc
returns the aggregation type. AFΩ AFω AFω AFc

390
A Multidimensional Model for Correct Aggregation of Geographic Measures

ticular, when using spatial aggregations (AFU), it Using the definitions previously given, we
is possible to sum additive alphanumeric attributes formalize the three steps of our algorithm by
(AF ∑ ) values of geographic measure defining two functions (DisjGeoObjects and
(SpatialType(AFU, AF∑) = AF∑). For example, OverlayGeoObjects) and two constraints (Vertical
when using the union, it is possible to sum num- Aggregation Constraint and Horizontal Aggrega-
ber of trees as it is additive (AF ∑ ): tion Constraint) for the Geographic Aggregation
SpatialType(UNION, AF∑) = AF∑ (see Figure 5).. Mode.
When using spatial disaggregations (AFΩ), it is These two constraints allow the Geographic
not possible to sum alphanumeric attributes values Aggregation Mode ensuring a correct geographic
of geographic measures, but it should be possible aggregation of geographic measures.
to use average (AF ω ) or count (AF c ) In particular, each alphanumeric aggregation
(SpatialType(AFΩ, AF∑)= AFω). For example, function ϕ of the Geographic Aggregation Mode
when using intersection, it is not possible to use is decomposed in two functions: κ and φ used
sum, even if number of trees is additive (AF∑): to calculate attributes values of the geographic
SpatialType(INTERSECTION, AF∑) = AFω (see objects created at Step 2 and 3 respectively.
Figure 6).. Then, the two constraints are defined on κ and φ
Finally, if the alphanumeric attribute is not according to the requirements of the Vertical and
additive (AFω or AFc) and spatial aggregation is the Horizontal steps respectively.
used, then it is not possible to add alphanumeric The Vertical Aggregation Constraint defines
attribute values (SpatialType(AFU, AFω) = AFω correct functions for κ considering disjoint
and SpatialType(AFU, AFc)= AFc). geographic measures using the function Dis-
jGeoObjects (see Figure 3), and taking into
Correct Geographic Aggregation Mode account semantics and used dimensions (i.e. Di-
mensionsMeasure) of the alphanumeric attributes
Before defining the Correct Geographic Aggrega- (e.g. number of trees cannot be summed on the
tion Mode, we need to introduce the concept of time dimension).
overlay. The GIS operator overlay takes as inputs For example, in order to calculate geographic
two maps and overlays them one on the top of the objects of Figure 3 resulting from an aggregation
other to form a new map. The Union Overlay is an on the time dimension, the Vertical Aggrega-
operator that takes as inputs 2 maps and returns a tion Constraint does not allow using the sum
map whose geometries are the set of all disjoint (κnbDamagedTrees = AVG) because number of trees is
geometries obtained by applying the topological not additive on the time dimension (κnbDamagedTrees∈
intersection among all features. Figure 8 shows an AFω = Min(DimensionType(nbDamagedTrees,
example of overlay of two maps whose geometries Syear,Stype), where DimensionType(nbDamagedT
are polygons. rees,Syear,Stype)= AFω).

Figure 8. Union overlay

391
A Multidimensional Model for Correct Aggregation of Geographic Measures

The Horizontal Aggregation Constraint de- geometry and coordinates than tiD1,
fines correct functions for φ taking into account the … tiDr of the vertical instances (See
semantics of the alphanumeric attributes thanks to Zone V1, Zone V2 and Zone V3,
the function SemanticsType (e.g.. number of trees Figure 4)
can be summed), and the dependency between ◦◦ SpatialAggregationType be the spa-
spatial and alphanumeric functions thanks to the tial function type {AFU, AFΩ}
SpatialType function (when using intersection, it ◦◦ geomAggregate be the result of the
is not possible to sum the number of trees). spatial aggregation on the geom-
For example, in order to aggregate geographic etries of t1 …,tn
objects resulting from Vertical Aggregation Step of ◦◦ The alphanumeric aggregation func-
Figure 3, the Horizontal Aggregation Constraint tion ϕi of Φe be defined by means of:
does not allow using the sum for number of trees. ◦◦ φi (tdi.am, … tdi.al, tdi.geom)
Therefore, φnbDamagedTrees = Weighted Average on (Horizontal Aggregation)
surface because a spatial disaggregation (inter- ◦◦ κi (tiDi.am,…,tiDi.al) (Vertical
section) is used for geometry (Figure 6).Indeed, Aggregation) such as:
sum is not representative of number of trees in 1. ϕi = φ1
the intersection region (φnbDamagedTrees ∈ AFω = 2. tdi.ai=κ1r(tiDi.am,…,tiDi.al)
SpatialType(AFΩ, Min(SemanticType(nbDama
gedTrees))). then the Geographic Aggregation Mode Φe is
By this way, user chooses the functions φ and correct if are respected the following constraints:
κ, and the model avoids building any non-sense
aggregated geographic measures. 1. Vertical Aggregation Constraint: κ i ∈
In the following, we formally introduce the Min(DimensionsMeasure(am, Sl1,…Slm),…
Correct Geographic Aggregation Mode and ,DimensionsMeasure(al, Sl1, …, Slm))
then the functions DisjGeoObjects and Overlay- 2. Horizontal Aggregation Constraint:
GeoObject. φ i ∈ SpatialType(SpatialAggregation
Type, Min(SemanticsMeasure(a m ),…,
Definition-Correct Geographic Aggregation SemanticsMeasure(al))
Mode: Let: Definition-DisjGeoObjects: DisjGeoObjects is a
◦◦ Ve be a View (See example 4: function which takes as inputs n Geographic
Vzones-year = 〈BCnaturalrisks, Entity Instances and returns l ≥ n Geographic
〈Sphenomenon, Syear〉, Θzone, γ〉) Entity Instances whose geometries are
◦◦ t1 …,tn be instances of the Geographic obtained using the geometric intersection
Entity which must be aggregated: in- operator.
put instances (See Zone A and Zone
B, Figure 2) Figure 3 shows the results of the DisjGeoOb-
◦◦ tiD1, … tiDr be the instances calcu- jects function on geographic objects of Figure 2.
lated using DisjGeoObjects on input
instances: vertical instances (See Definition-OverlayGeoObjects: Overlay-
Zone 1 of A, Zone 2 of A, Zone 1 of B GeoObjects is a function which takes as
and Zone 2 of B, Figure 3) inputs n Geographic Entity Instances repre-
◦◦ td1 …,tdm be the instances calcu- senting n maps and returns m≥ n Geographic
lated using OverlayGeoObjects on Entity Instances whose geometries are
input instances and tdi have the same obtained using the Union Overlay.

392
A Multidimensional Model for Correct Aggregation of Geographic Measures

Figure 4 shows the results of the Overlay- • Geographic Aggregation Mode Θzone =
GeoObjects function on geographic objects of 〈Szone, Szone_agg, Φunion〉 where Φunion is (Figure
Figure 3. 5)
We present now the two Geographic Aggre- 1. ϕ geometry: dom(S zone.geometry) n →
gation Modes for our case study: the first uses dom(Szone_agg.geometry) is Union
union and the second intersection. We show how • ϕgeometry is AFU (spatial aggregation)
the Geographic Aggregation Mode constraints 2. ϕnbDamagedtrees: dom(Szone.geometry, Szone.
oblige the user to use sum only in the first case. nbDamagedTrees)n → dom(Szone_agg.nb-
Moreover, these examples show how the user is DamagedTrees)= φnbDamagedTrees where:
forced to aggregate the number of trees consider- • φnbDamagedTrees is SUM
ing the semantics of the attributes and the used • κnbDamagedTrees = Average
dimensions.
The multidimensional query Vzones-year defines
Example 6: Let us consider the query “Where a Correct Geographic Aggregation Mode because
and how many trees have been damaged by the Geographic Aggregation Mode Constraints
some fires during 1978?” (Query 1, cf. Sec are satisfied:
“Research Motivations”). We define a View
with a Correct Geographic Aggregation 1. κnbDamagedTrees∈ AFω =Min(DimensionType(n
Mode, which uses average for the Vertical bDamagedTrees,Syear,Stype)
Aggregation and sum and union for the (It is not possible to apply the sum operator
Horizontal Aggregation (Figure 9). to numbers of trees because this mea-
sure is not additive on time dimension)
Formally, let Vzones-year = 〈BCnaturalrisks, 〈Sphenomenon, 2. φnbDamagedTrees∈ AF∑ =SpatialType(AFU,Min(
Syear〉, Θzone, γ〉 represent the Query 1 where: SemanticsType(nbDamagedTrees)))
(It is possible to apply the sum to the num-
• DimensionsMeasure(nbDamagedTrees, bers of trees for the geographic objects
Syear, Stype) = AFω resulting from VerticalAggregation
• SemanticsMeasure(nbDamagedTrees) = because the spatial function is union
AF∑ and it is a spatial aggregation)

Figure 9. Correct Geographic Aggregation Mode on Time dimension and using spatial union

393
A Multidimensional Model for Correct Aggregation of Geographic Measures

The Geographic Aggregation Mode Θzone uses ◦◦ φnbDamagedTrees is Weighted Average on


spatial Union (ϕgeometry) to aggregate geometry. surface
The aggregation of the number of damaged trees ◦◦ κnbDamagedTrees = Average
is performed using the AVG for the vertical ag-
gregation (κnbDamagedTrees), as the number of damaged The multidimensional query Vzones-year uses a
trees is not additive on time dimension (Dimensio Correct Geographic Aggregation Mode because
nsMeasure(nbDamagedTrees, Syear, Stype) = AFω), the Geographic Aggregation Mode constraints
and the SUM (φnbDamagedTrees) for the horizontal ag- are satisfied:
gregation, as ϕgeometry is a spatial aggregation (AFU).
1. κnbDamagedTrees ∈ AFω = Min(DimensionType(
Example 7: Let us suppose now we want to nbDamagedTrees, Syear, Stype))
answer to the query “Where and how many (The sum cannot be applied to numbers of
trees have been damaged by all fires during trees because this measure is not additive
1978?” (Query 2, cf. Sec “Research Moti- on time dimension)
vations”). We define a View with a Correct 2. φnbDamagedTrees ∈ AFω = SpatialType(AFΩ, Mi
Geographic Aggregation Mode, which uses n(SemanticType(nbDamagedTrees)))
an interpolation function and intersection for (The sum cannot be applied to numbers of
the Horizontal Aggregation, and average for trees of the geographic objects resulting
the Vertical Aggregation (Figure 10). from VerticalAggregation because the spatial
function is intersection and it is a spatial
Let Θzone= 〈Szone, Szone_agg, Φintersect〉 (Query 2) disaggregation).
where Φintersect is:
The Geographic Aggregation Mode Θzone uses
1. ϕ g e o m e t r y : d o m ( S z o n e . g e o m e t r y ) n → spatial Intersection (ϕgeometry) to aggregate geom-
dom(Szone_agg.geometry) is Intersection etry. The aggregation of the number of damaged
◦◦ ϕgeometry is AFΩ (spatial disaggregation) trees is performed using the AVG for the vertical
2. ϕ nbDamagedtrees: dom(S zone.geometry, S zone. aggregation (κnbDamagedTrees), as the number of dam-
nbDamagedTrees)n → dom(Szone_agg.nbDam- aged trees are not additive on time dimension (D
agedTrees) = φnbTrees where: imensionsMeasure(nbDamagedTrees, Syear, Stype)
= AFω), and the Weighted Average on surface

Figure 10. Correct geographic aggregation mode on time dimension and using spatial intersection

394
A Multidimensional Model for Correct Aggregation of Geographic Measures

(φnbDamagedTrees) for the horizontal aggregation, as taking into account the spatial component of the
ϕgeometry is a spatial disaggregation (AFΩ). geographic information. Finally, only Pedersen
We have presented and illustrated the approach & Tryfona (2001) provide a model that explicitly
we propose for Correct Geographic Aggregation supports aggregation constraints for spatial and
Mode. To complete our case study, we have to alphanumeric data. The model supports overlap-
recall that no aggregation function is provided to ping geometries and aggregation constraints as
get the value of the attribute representing the area well as our 3-step framework. However, it sup-
of the damaged zone. Indeed, the area is a derived ports only partially the dependency of spatial
measure calculated using the geometry resulting and alphanumeric functions as it does not allow
from the spatial aggregation of the component disaggregation functions. We have addressed this
geographic objects. issue in this paper
Currently, we are working on the implemen-
tation of our framework in the SOLAP system
CONCLUSION AND DISCUSSION GeWOlap (Bimonte et al., 2006b). We are imple-
menting the three steps of our framework using
Spatial OLAP refers to the integration of spatial PL/SQL functions and user-defined aggregation
data in OLAP. Correct aggregation is crucial functions provided by Oracle. Semantics of mea-
in multidimensional analysis. In this paper, we sures (valid aggregation functions) are defined
provide an overview of solutions for aggrega- in XML files representing the multidimensional
tion of geographic objects in multidimensional, application in the ROLAP Server Mondrian.
geostatistic, GIS and SOLAP models. By intro- Semantics of measures have to be parsed before
ducing a case study concerning the monitoring of any OLAP query computation in order to verify
natural phenomena, we define the aggregation of dependencies between spatial and alphanumeric
geographic measures as a –three-step process in functions. The main issues of this approach deal
which two constraints on aggregation functions are with performances in large spatial data ware-
defined. They extend classical OLAP aggregation houses. Developing ad-hoc pre-aggregation and
constraints with dependence between spatial and indexing techniques for geographic measures are
alphanumeric aggregation functions applied to the our future work.
attributes of geographic measures. We present an
extension of the logical multidimensional model
GeoCube (Bimonte et al., 2006), which formal- ACKNOWLEDGMENT
izes our approach. Alphanumeric aggregation
functions are decomposed into two aggregation Authors wish to thank Pr. Pierre Dumolard for
functions and two constraints are applied to them to precious discussions and material on GIS and
take into account the additivity of the measures and geostatistics.
the dependency of spatial and alphanumeric ag-
gregation functions. By this way, GeoCube ensures
the correct aggregation of geographic measures. REFERENCES
This requirement is not supported by existing
spatio-multidimensional model. Indeed, most of Abelló, A., Samos, J., & Saltor, F. (2006). YAM2:
SOLAP models do not provide any aggregation a multidimensional conceptual model extending
constraint for aggregation functions. Jensen et al. UML. Information Systems, 31(6), 541–567.
(2004) define aggregation constraints, but they doi:10.1016/j.is.2004.12.002
reduce spatial measures to classical data without

395
A Multidimensional Model for Correct Aggregation of Geographic Measures

Ahmed, T., & Miquel, M. (2005). Multidimen- Degrene, J., & Salge, F. (1997). Les systèmes
sional Structures Dedicated to Continuous Spatio- d’information géographique, (2nd Ed.). Paris:
temporal Phenomena. In Proceedings of the 22th Presses Universitaires de France.
BNCOD (pp. 29-40). Berlin: Springer.
Egenhofer, M., & Frank, A. U. (1986). Connec-
Anselin, L. (1989). What Is Special about Spatial tion between Local and Regional: Additional
Data? Alternative Perspectives on Spatial Data Intelligence Needed. In Proceedings of the 18th
Analysis, (Technical Report). Santa Barbara, CA: International Congress of figure
National Center for Geographic Information and
Egenhofer, M., & Frank, A. U. (1992). Object-
Analysis.
Oriented Modeling for GIS. URISA Journal,
Bédard, Y., Merrett, T., & Han, J. (2001). Funda- 4(2), 3–19.
ments of Spatial Data Warehousing for Geographic
Erwig, M., & Schneider, M. (2000). Formaliza-
Knowledge Discovery. In Geographic Data
tion of advanced map operations. In Proceedings
Mining and Knowledge Discovery (pp. 53-73).
of the 9th Int. Symp. on Spatial Data Handling
London: Taylor & Francis.
(pp. 3-17).
Bimonte, S., Tchounikine, A., & Miquel, M.
Escribano, A., Gomez, L., Kuijpers, B., & Vais-
(2006). GeoCube, a Multidimensional Model
man, A. (2007). Piet: a GIS-OLAP implementa-
and Navigation Operators Handling Complex
tion. In Proceedings of the ACM 10th International
Measures: Application in Spatial OLAP. In Pro-
Workshop on Data Warehousing and OLAP (pp.
ceedings of the 4th ADVIS (pp. 100-109). Berlin:
73-80). New York: ACM Press.
Springer-Verlag.
Fidalgo, R., Times, V., Silva, J., & Souza, F. (2004).
Bimonte, S., Wehrle, P., Tchounikine, A., &
GeoDWFrame: A Framework for Guiding the
Miquel, M. (2006). GeWOlap: A Web Based Spa-
Design of Geographical Dimensional Schemas.
tial OLAP Proposal. In Meersman, R., Tari, Z., &
In Proceedings of 6th International Conference
Herrero, P. (Eds.), In Proceedings of the Workshop
on Data Warehousing and Knowledge Discovery
on Semantic-Based Geographical Information
(pp. 26-37). Berlin: Springer.
Systems (pp. 1596-1605). Berlin: Springer.
Franklin, C. (1992). An Introduction to Geographic
Charre, J., Dumolard, P., & Le Berre, I. (1997).
Information Systems: Linking Maps to databases.
Initiation aux pratiques statistiques en geographie.
Database, 15(2), 13–21.
Paris: Masson.
Guting, R. (1998). Geo-Relational Algebra: A
Chrisman, N. (1974). Attributes of geographic
model and query language for geometric data-
entities. Technical Report, Harvard Laboratory
base systems. In Proceedings of the EDBT (pp.
for Computer Graphics, Cambridge, MA.
506-527).
Chrisman,N.(1998).RethinkingLevelsofMeasure-
Güting, R., & Schneider, M. (1995). Realm-based
ment in Cartography. Cartography and GIS, 25(4),
spatial data types: the ROSE algebra. The VLDB
231–242. doi:10.1559/152304098782383043
Journal, 4(2), 243–286. doi:10.1007/BF01237921
Damiani, M., & Spaccapietra, S. (2006). Spatial
Data Warehouse Modeling. In Processing and
Managing Complex Data for Decision Support
(pp. 1-27). Hershey, PA: IGP.

396
A Multidimensional Model for Correct Aggregation of Geographic Measures

Han, J., Stefanovic, N., & Koperski, K. (1998). Pedersen, T. B., & Tryfona, N. (2001). Pre-aggre-
Selective Materialization: An Efficient Method for gation in Spatial DataWarehouses. In Proceedings
Spatial Data Cube Construction. In Proceedings of the 7th International Symposium on Spatial and
of 6th PAKDD Conference (pp. 144-158). Berlin: Temporal Databases (pp. 460-478).
Springer.
Pourabbas, E. (2003). Cooperation with Geo-
Horner, J., Song, I., & Chen, P. (2004). An Analysis graphic Databases. In Multidimensional data-
of Additivity in OLAP Systems. In Proceedings bases: problems and solutions (pp. 393-432).
of the 7th DOLAP (pp. 83-91). New York: ACM Hershey, PA: IGP
Press.
Rafanelli, M., & Ricci, F. (1983). Proposal of a
Jensen, C., Kligys, A., Pedersen, T., & Timko, logical model for statistical databases. In Pro-
I. (2004). Multidimensional data modeling for ceedings of the 2nd International Workshop on
location-based services. The VLDB Journal, 13(1), Statistical and Scientific Database Management,
1–21. doi:10.1007/s00778-003-0091-3 (pp. 264-272).
Kimball, R. (1996). The Data Warehouse Toolkit. Rao, F., Zhang, L., Yu, X., & Li, Y. & Chen, Y.
New York: John Wiley. (2003). Spatial hierarchy and OLAP-favored
search in spatial data warehouse. In Proceedings
Lehner, W. (1998). Modeling large scale OLAP
of the 6th ACM International Workshop on Data
scenarios. In Proceedings of the 6th EDBT (pp.
Warehousing and OLAP (pp. 48-55). New York:
153–167).
ACM Press.
Longley, P., Goodchild, M., Maguire, D., & Rhind,
Rigaux, P., & Scholl, M. (1995). Multi-Scale
D. (2001). Geographic Information Systems and
Partitions: Application to Spatial and Statistical
Science. New York: John Wiley & Sons.
Databases. In Proceedings of the 4th International
Lopez, I., & Snodgrass, R. (2005). Spatiotemporal Symposium on Advances in Spatial Databases,
aggregate computation: A survey. IEEE Transac- (pp. 170-183). Berlin: Springer-Verlag
tions on Knowledge and Data Engineering, 17(2),
Rigaux, P., Scholl, M., & Voisard, A. (2002).
271–286. doi:10.1109/TKDE.2005.34
Spatial databases with applications to Gis. New
Malinowski, E., & Zimányi, E. (2004). Represent- York: Academic Press.
ing spatiality in a conceptual multidimensional
Rivest, S., Bédard, Y., Proulx, M., Nadeaum, M.,
model. In Proceedings of the 12th ACM GIS (pp.
Hubert, F., & Pastor, J. (2005). SOLAP: Merging
12-22). New York: ACM Press.
Business Intelligence with Geospatial Technology
Malinowski, E., & Zimányi, E. (2005). Spatial for Interactive Spatio-Temporal Exploration and
Hierarchies and Topological Relationships in Analysis of Data. Journal of International Society
SpatialMultiDimER model. In Proceedings of the for Photogrammetry and Remote Sensing, 60(1),
22th British National Conference on Databases 17–33. doi:10.1016/j.isprsjprs.2005.10.002
(pp. 17-22). Berlin: Springer.
Sampaio, M. C., Sousa, A. G., & Baptista, C.
Pedersen, T. B., Jensen, C., & Dyreson, C. (2001). (2006) Towards a logical multidimensional model
A foundation for capturing and querying complex for spatial data warehousing and olap. In Proceed-
multidimensional data. Journal of Information ings of the 9th ACM international workshop on
Systems, 26(5), 383–423. doi:10.1016/S0306- Data warehousing and OLAP (pp. 83-90). New
4379(01)00023-0 York: ACM Press.

397
A Multidimensional Model for Correct Aggregation of Geographic Measures

Shekar, S., Lu, C., Tan, X., Chang, S., & Vatsrai, Weibel, R., & Dutton, G. (2001). Generalizing
R. (2001). Map Cube: A Visualization Tool for Spatial Data and Dealing with Multiple Rep-
Spatial Data Warehouses. In Geographic Data resentations. Geographic Information Systems
Mining and Knowledge Discovery (pp. 74-90). and Science. New York: John Wiley & Sons (pp.
London: Taylor & Francis 125-155)
Silva, J., Times, V., Salgado, A., Souza, C., Fidalgo, Zhang, D., & Tsotras, V. (2005). Optimizing spa-
R., & Oliveira, A. (2008). A set of aggregation tial Min/Max aggregations. The VLDB Journal,
functions for spatial measures. In Proceedings of 14(3), 170–181. doi:10.1007/s00778-004-0142-4
the ACM 14th International Workshop on Data
Warehousing and OLAP (pp. 25-32). New York:
ACM Press. KEY TERMS AND DEFINITIONS
Stefanovic, N., Han, J., & Koperski, K. (2000). Geographic Information System: Informa-
Object-Based Selective Materialization for Ef- tion system for capturing, storing, analyzing,
ficient Implementation of Spatial Data Cubes. managing and presenting data that are spatially
IEEE TKDE, 12(6), 938–958. referenced
Tao, Y., & Papadias, D. (2005). Historical Multidimensional Model: Conceptual model
spatio-temporal aggregation. ACM Transac- for the multidimensional analysis of huge datasets,
tions on Information Systems, 23(1), 61–102. based on the concepts of dimensions, facts and
doi:10.1145/1055709.1055713 measures.
Spatial Data Warehouse: Subject-oriented,
Trujillo, J., Palomar, M., & Gómez, J. (2000). An non volatile, time variant and integrated reposi-
Object Oriented Approach to Multidimensional tory of spatial data that is designed to facilitate
Databases & OLAP Operations. International reporting and spatial analysis.
Journal of Computer and Information Science, Spatial Decision Support System: Interac-
2(3), 75–85. tive, computer-based system designed to support
Vassiliadis (1998). Modeling Multidimensional a user or group of users in achieving a higher
Databases, Cubes and Cube Operations. In Pro- effectiveness of decision making while solving a
ceedings of 10th International Conference on Sci- semi-structured spatial problem
entific and Statistical Database Management (pp. Spatial OLAP: Visual platform built espe-
53-62). Alamitos, CA: IEEE Computer Society. cially to support rapid and easy spatio-temporal
analysis and exploration of data following a mul-
Voisard, A., & David, B. (2002). A Database tidimensional approach comprised of aggregation
Perspective on Geospatial Data Modeling. IEEE levels available in cartographic displays as well
TKDE, 14(2), 226–243. as in tabular and diagram displays.

This work was previously published in Evolving Application Domains of Data Warehousing and Mining: Trends and Solu-
tions, edited by Pedro Nuno San-Banto Furtado, pp. 162-183, copyright 2010 by Information Science Reference (an imprint
of IGI Global).

398
399

Chapter 24
Location Privacy in
Automotive Telematics
Muhammad Usman Iqbal
University of New South Wales, Australia

Samsung Lim
University of New South Wales, Australia

ABSTRACT
Over the past few decades, the technologies of mobile communication, positioning, and computing have
gradually converged. The automobile has been a natural platform for this convergence where satellite-
based positioning, wireless communication and on-board computing work in tandem offering various
services to motorists. While there are many opportunities with these novel services, significant risks to
the location privacy of motorists also exist as a result of the fast-paced technological evolution. These
risks must be confronted if trust and confidence are to prevail between motorists and service providers.
This chapter provides an overview of the current situation of location privacy in automotive telematics
by exploring possible abuses and existing approaches to curb these abuses followed by a discussion of
possible privacy-strengthening measures.

INTRODUCTION filter. These services can be triggered by an event,


for example, the location of the vehicle can be
The proliferation of location-aware computing transmitted to an emergency response center on
devices promises an array of “quality-of-life deployment of air bags. Some services can be
enhancing” applications. These services include explicitly requested by the driver, for example,
in-car navigation, roadside assistance, infotain- in-car navigation or road side assistance. While
ment, emergency response services, vehicle other applications can be quietly running at all
diagnostics and prognostics. The key idea is to times, passing on real-time information of the
provide services using “location” as a geographic vehicle’s movements such as Global Positioning
System (GPS) enabled Pay-As-You-Drive (PAYD)
DOI: 10.4018/978-1-4666-2038-4.ch024
insurance (Grush, 2005).

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Location Privacy in Automotive Telematics

Although location data is critical to the opera- How Positioning Systems


tion of such applications, there is a precarious can be Privacy Invasive?
balance between the necessary dissemination of
location information and the potential for abuse Positioning systems can be categorized into either
of this private information. Spatio-temporal (loca- being ‘Self-positioning’ or ‘Remote-positioning’.
tion in time) information continuously monitored In Self-positioning systems, the vehicle is either
(and logged) about the places a person visits can fitted with a GPS receiver or Dead-Reckoning
reveal a lot about one’s persona. Given the current system (based on one or more gyroscopes, a
capabilities of inference by combining disparate compass and odometer) to locate where it is on
sources of information, a lot can be inferred about the road. Remote-positioning systems require a
an individual. These derived profiles can then central site to determine the location of the vehicle
be used to make judgments about a person or (Drane and Rizos, 1997). The result is a set of
used for unsolicited marketing by location-based coordinates (or position) of the vehicle expressed
marketers. Orwell (1949), in his criticism against in relation to a reference frame or datum. Self-
totalitarianism, would have most likely referred positioning systems inherently protect location
to these “Small Brothers” (location-based retail privacy because they do not report the location
marketers) had he known about these inference of the vehicle to any other system. On the other
attacks. hand, remote-positioning systems track, compute
In the next few sections a background on loca- and retain the location information at the central
tion privacy is presented, some possible privacy monitoring site and creates a risk to the individual’s
abuses of telematics services are discussed, and privacy. Self-positioning systems also pose a pri-
existing approaches to curb these abuses are vacy risk if they report the vehicle’s GPS-derived
investigated. The chapter then suggests possible location to a server through the communications
measures to strengthen location privacy. infrastructure.

BACKGROUND PRIVACY ATTACKS

Before delving into the core issue of location ACME Rent a Car Company
privacy, it is important to agree on a definition of
privacy itself. Much of the literature pertaining Most readers would be familiar with the highly
to privacy refers to Westin’s precise definition. publicized abuse of GPS technology where
In the context of telematics, location privacy is a ACME charged its customers $150 for speeding
special case of privacy, relating to the privacy of occurrences of more than 80mph. A customer
location information of the vehicle, and ultimately took ACME to court and won on grounds that
the user of the vehicle. the company failed to clearly explain how the
Privacy is the claim of individuals, groups and location tracking system would be used (Ayres
institutions to determine for themselves, when, and Nalebuff, 2001). This is an obvious scenario
how and to what extent information about them of how personal information can be exploited. It
is communicated to others. (Westin, 1967) is not unreasonable to imagine that an ordinary
car trip can become an Orwellian ordeal when
one’s location information can be used in ways
not imagined.

400
Location Privacy in Automotive Telematics

Location-Based Spam Electronic Toll Collection

Figure 1 illustrates a possible threat scenario Electronic toll collection seeks to alleviate traffic
where a vehicle is equipped with an on-board GPS congestion at toll gates, and provides a convenient
receiver and the vehicle periodically transmits its method for drivers to pay tolls. Such schemes
location data to a tracking server. The tracking typically require the car to have an electronic tag
server is connected to various service providers attached to the front windscreen. Tolls are deducted
which have been authorized by the driver to ac- from the vehicle’s account when the scanner
cess location data in order to provide telematics senses the toll tag. Electronic toll can become
services. The service providers are not necessarily privacy invasive, for example, if the toll system
trusted and it is not unreasonable to expect loca- passes the entry and exit times of the vehicle to
tion information of individuals being sold on the law enforcement agencies giving them the ability
market (much like email address lists). to issue speeding tickets if the distance is traveled
in too short a time (Langheinrich, 2005).
Profiling Driving Behavior

Greaves and De Gruyter (2002) discuss how a PRIVACY DEFENSES


driving profile of a person can be derived from
GPS track data. They sought an understanding In the previous section location privacy threats
of driving behaviors in real-world scenarios by provoking some serious ambivalence about the
fitting low-cost GPS receivers to vehicles, and social and ethical telematics issues were discussed.
logging the vehicle movements. Consequently, There are some countermeasures that can be taken.
they were able to identify driving styles from The first and most simple one would be an opt-out
this data. Imagine a PAYD insurance provider approach. This would result in a denial of service
accessing this information, in order to identify for the vehicle driver. The more challenging issue
an individual with an ‘aggressive’ driving style. is how to preserve location privacy while at the
same time maximizing the benefits of telematics
services.
Figure 1. A typical privacy threat scenario
Legislation and Regulation

Location privacy can be considered to be a special


case of information privacy. However, because
this area of the law is in its embryonic stages,
one can consider ‘location’ and ‘information’ as
being synonymous.
In the United States, legislation to protect
location information arises primarily from the
Telecommunications Act of 1996 and the 1998
E911 amendments. As a result, there is ambiguity
about the so-called “opt-in” or “opt-out” approach
for customer consent. However, a bill specifi-
cally addressing location privacy, the Wireless
Location Privacy Protection Act of 2005, which

401
Location Privacy in Automotive Telematics

required location-based services (LBSs)to give principle that provides both security and privacy
their informed consent for disclosure of loca- within the telematics environment. There is one
tion information, was referred to the U.S. Senate problem with these implementations, since the
(Ackerman et al, 2003). policies serve as a mutual contract; the driver has
In Australia, the Privacy Act of 1988 (“Privacy to trust the organization to abide by the policies.
Act 1988 (Commonwealth)”, 2005) deals with The Internet Engineering Task Force (IETF),
consumers’ privacy. Besides legislation, Standards the standards body responsible for developing
Australia has published a guideline suggesting internet standards, has also realized the importance
procedures for toll operators and electronic parking of location privacy. It has proposed the Geographi-
operators to protect the personal privacy of their cal Location/Privacy Charter, referred to simply as
customers (Standards-Australia, 2000). geopriv. This standard seeks to preserve location
Japan and the European Union have well privacy of mobile internet hosts (IETF, 2006).
established laws for protecting consumers’ loca- Synnes et al (2003) have implemented secure
tion privacy (Ackerman et al, 2003).One issue location privacy, using a similar approach of using
that should be emphasized is that legislation is rules to implement policies. In the near future, it is
not the only defense against (location) privacy not hard to imagine automobiles having Internet
attacks. The corporate world is very good at Protocol (IP) addresses and ultimately using the
obscuring questionable practices with fine print geopriv solution to implement privacy policies.
in a service agreement or contract (Schilit et al,
2003). Therefore there has to be enforcement of Identity and Location Decoupling
laws as well as open audit of privacy practices.
One conclusion that can be drawn is that the
Policy-Based and Rule- vehicle can be uniquely identified when it com-
Based Protection municates with a particular Telematics Service
Provider (TSP). Therefore, decoupling of identity
Privacy protection regulation concludes that “user and vehicle location is essential at retention of
consent” is an essential requirement. If the growth data. This can be regulated through policy, and
in telematics services proceeds as predicted, then laws such as discussed above. Herborn et al (2005)
it would be difficult for a member of the public to have studied this concept in pervasive comput-
keep track of all details. Secondly, constant explicit ing networks. They argue that decoupling these
consent requirements can become a source of data from each other would have more benefits.
driver distraction. Hence an analogy can be drawn Name ‘hijacking’ would simply not be possible.
from the internet, where the Platform for Privacy The issue here is that for decoupling the identity
Preferences (P3P) is used to manage web server and other data to work, a robust scheme to resolve
privacy policies in Extensible Markup Language naming would be required. This, however, is still
(XML) machine readable format. Typically these an open research issue.
operate by comparing user profile rules of a web
client with the rules on a particular web server. Anonymous Access
Such is the importance of location privacy that
there are already efforts to extend the P3P for Researchers in the field of LBSs have looked at
location rules (Morris, 2002). This means that anonymous solutions to location privacy. The basic
rules like “Allow Alice to access my location on idea here is to access the services anonymously.
a weekday” can be created. Duri et al (2002) pro- Unfortunately, this cannot be regarded as a com-
posed an end-to-end framework based on a similar plete solution given the inference capabilities of

402
Location Privacy in Automotive Telematics

Geographical Information Systems (GIS) and one possible approach to protect location privacy
advanced surveillance techniques, as discussed in location-aware computing (Duckham and
already (Gruteser and Hoh, 2005). An adversary Kulik, 2005). This deliberate degradation of loca-
can apply data-matching techniques to indepen- tion information is performed by the individual,
dent samples of anonymous data collected, and through deciding which service would require
map them on predictable paths such as roads, what ‘granularity’ of information, often referred
and infer the identity of an individual based on to as the “need to know principle”. Snekkenes
where one is. (2001) constructed rules for implementing privacy
Drawing from techniques used by census policies using this principle. He emphasized that
boards and electoral commissions to obscure different services require different resolutions, or
data so that individuals are not identified, another accuracy, of location information. The advantage
methodology similar to anonymous access has of obfuscation over anonymity is that it allows
been proposed. It is called “k-anonymous access”. authentication and customization of the services.
This means that when the location information of However, it still is not the ideal remedy when high
a subject is requested, it will only be responded accuracy of reported location, instead of deliberate
to if there are k other nodes present in the vicinity degradation, is required.
(Gruteser and Grunwald, 2003). This approach can
give good protection against privacy attacks if the Privacy Aware Designs
value of k is set to a high number, however this
would affect the quality of LBSs. In this approach, While defenses discussed above propose measures
k is a variable that could only be altered globally. for limiting disclosure of location information,
A second approach deals with k on a per node others have sought to understand privacy aware
basis. This means that each user can specify his designs. The success of future LBSs depends on
or her privacy variable (Gedik & Liu, 2005). This designing systems with privacy in mind, not just
approach appears to more realistically simulate it being an “afterthought”. Langheinrich (2005)
user privacy preferences in the real world. discuss the need for anonymous location infra-
Apart from being identified through map- structures and transparency protocols allowing
matching techniques, there is one additional customers to understand and track how their data
problem that can affect the correct operation of is collected and used. Kobsa and Telztrow (2006)
telematics services using anonymous techniques. argue that clearly explaining privacy policies at
Existing approaches discussed here are aimed subscription would encourage users to disclose
at solving the location privacy problem in the information and create a sense of trust. They
context of LBSs. Telematics can be considered to conducted experiments to prove this comparing
be a special case of LBSs, the authors, however, their privacy friendly systems to the traditional
argue that it needs a totally different mindset for data collection systems.
addressing privacy problems of the mobile public Other examples of privacy aware designs in-
mainly because of differences such as higher aver- clude work by Coroama and Langheinrich (2006)
age speeds, predictable paths, and the magnanimity where they implemented a GPS based PAYD insur-
of the number of users. ance system depicting real-time risk assessment
of actual road conditions. Their system calculates
Obfuscation premiums on board the vehicle guaranteeing pri-
vacy of owners. There is periodic transmission of
The term “obfuscation” means the process of aggregated information to the insurance provider
confusing or obscuring. It has been identified as for bill generation. Iqbal and Lim (2006) extended

403
Location Privacy in Automotive Telematics

this idea further and proposed a GPS-based insur- in” privacy protection within such systems so that
ance product that preserves location privacy by as new applications become available, appropriate
computing distances traveled on the onboard unit. privacy measures are integral to them.
They additionally safeguarded “spend privacy”
by proposing smart card based anonymous pay-
ment systems. Their approach was to redesign a REFERENCES
closed system curtailing redundant exchange of
location data. Ackerman, L., Kempf, J., & Miki, T. (2003).
Wireless location privacy: A report on law and
policy in the United States, the European Union,
CONCLUSION and Japan: DoCoMo USA Labs Technical Report
DCL-TR2003-001.
Location privacy protection in telematics is indeed Ayres, I., & Nalebuff, B. (2001). Connecticut’s
a social issue. The authors have reviewed in this speeder-friendly crackdown. The New York Times.
short article location privacy threats and possible
countermeasures. Each countermeasure to protect Coroama, V., & Langheinrich, M. (2006). Per-
privacy has its own implications, and it is clear sonalized Vehicle Insurance Rates – A Case
that no general panacea exists. This suggests that for Client-Side Personalization in Ubiquitous
a combination of several different approaches may Computing. Paper presented at the Workshop on
be the best solution. Privacy-Enhanced Personalization at CHI 2006,
The reader might feel that the authors have Montréal, Canada, 22 April, 2006.
taken a pessimistic view of privacy issues. It is
Drane, C., & Rizos, C. (1997). Role of Position-
acknowledged that location disclosure would be
ing Systems in ITS. In Positioning Systems in
necessary in life threatening scenarios, or where
Intelligent Transportation Systems (pp. 298-299).
law enforcement officials need access to this
Boston: Artech House, Inc.
information. This critical information, however,
like other worthwhile liberties needs to be pro- Duckham, M., & Kulik, L. (2005). A formal
tected by law. Under normal circumstances, only model of obfuscation and negotiation for loca-
the location information subject has the right to tion privacy. Lecture Notes in Computer Science,
exercise control of one’s personal information. 3468, 152–170.
Development in telematics is through a coop-
Duri, S., Gruteser, M., Liu, X., Moskowitz, P.,
eration of companies that are involved in transport
Perez, R., Singh, M., et al. (2002). Framework
management, vehicle manufacture or information
for security and privacy in automotive telemat-
technology services. The current approach recog-
ics. Paper presented at the Proceedings of the 2nd
nizes privacy to be a “non-technical barrier” to
international workshop on Mobile commerce,
the implementation of ITS (US-Department-of-
Atlanta, Georgia, USA.
Transportation, 1997). Since research in transport
telematics is in its nascent stages, it is important Gedik, B., & Liu, L. (2005). Location Privacy in
to understand that these issues are not merely Mobile Systems: A Personalized Anonymization
social hindrances. Once such scenarios become Model. Paper presented at the 25th Institute of
commonplace, the general user may be reluctant Electrical and Electronic Engineers, Inc. (IEEE)
to use these telematics services at all. Therefore, International Conference on Distributed Comput-
it is important to dispel these privacy concerns ing Systems (ICDCS’05).
right from the beginning, and focus on “building

404
Location Privacy in Automotive Telematics

Greaves, S. P., & De Gruyter, C. (2002). Profiling Morris, J. (2002). P3P and the Privacy of Loca-
driving behaviour using passive Global Position- tion Information. Paper presented at the World
ing System (GPS) technology. Paper presented at Wide Web Consortium (W3C) Workshop on the
the Institute of Transportation Engineers Interna- Future of Platform for Privacy Preferences (P3P).
tional Conference, Melbourne, Australia.
Orwell, G. (1949). Nineteen Eighty-four. New
Grush, B. (2005). Optimizing GNSS-Based Mo- York: Harcourt, Brace.
bility Pricing for Road-Use, Parking, and PAYD
Privacy Act 1988 (Commonwealth). (2005).
Insurance. Paper presented at the 4th European
(Vol. Act No. 119 of 1988 as amended): Office
Traffic Congress, Salzburg, Austria.
of Legislative Drafting and Publishing (OLDP),
Gruteser, M., & Grunwald, D. (2003). Anonymous Australia.
Usage of Location-Based Services through Spatial
Schilit, B., Hong, J., & Gruteser, M. (2003). Wire-
and Temporal Cloaking. Paper presented at the
less location privacy protection. IEEE Computer
First International Conference on Mobile Systems,
Magazine, 36(12), 135–137.
Applications, and Services.
Snekkenes, E. (2001). Concepts for personal
Gruteser, M., & Hoh, B. (2005). On the Anonymity
location privacy policies. Paper presented at the
of Periodic Location Samples. Paper presented at
Proceedings of the 3rd ACM conference on Elec-
the 2nd International Conference on Security in
tronic Commerce, Tampa, Florida, USA.
Pervasive Computing, Boppard, Germany.
Standards-Australia. (2000). AS 4721-2000: Per-
Herborn, S., Boreli, R., & Seneviratne, A. (2005).
sonal privacy practices for the electronic tolling
Identity Location Decoupling in Pervasive Com-
industry: Standards Australia.
puting Networks. Paper presented at the 19th
International Conference on Advanced Informa- Synnes, K., Nord, J., & Parnes, P. (2003). Location
tion Networking and Applications (AINA ‘05). Privacy in the Alipes Platform. Paper presented at
the 36th Annual Hawaii International Conference
IETF. (2006). Internet Engineering Task Force
on System Sciences (HICSS’03).
(IETF) Geographic Location/ Privacy (geopriv).
Retrieved 27th March, 2006, from http://www. US-Department-of-Transportation. (1997). A
ietf.org/html.charters/geopriv-charter.html Report to Congress: Nontechnical Constraints
and Barriers to the Implementation of Intelligent
Iqbal, M. U., & Lim, S. (2006). A privacy preserving
Transportation Systems. Washington, D.C: US
GPS-based Pay-as-You-Drive insurance scheme.
Department of Transportation.
Symposium on GPS/GNSS (IGNSS2006), Surfers
Paradise, Australia, 17-21 July, 2006. Vidales, P., & Stajano, F. (2002). The Sentient
Car: Context-Aware Automotive Telematics. Paper
Kobsa, A., & Teltzrow, M. (2006). Convincing
presented at the LBS-2002.
Users to Disclose Personal Data. In A. Kobsa, R.
Chellappa, & S. Spiekermann, (Eds.), Proceed- Westin, A. F. (1967). In PRIVACY AND FREE-
ings of PEP06, CHI 2006 Workshop on Privacy- DOM (pp. 7). New York: Atheneum.
Enhanced Personalization, Montréal, Canada.
Whelan, R. (1995). Smart Highways, Smart Cars.
Langheinrich, M. (2005). Personal Privacy in Boston: Artech House, Inc.
Ubiquitous Computing – Tools and System Sup-
port. PhD thesis No. 16100, ETH Zurich, Zurich,
Switzerland.

405
Location Privacy in Automotive Telematics

KEY TERMS AND DEFINITIONS efficient movement of road transportation and


provision of travel information to the vehicles.
Context Aware Computing: The process of Location Privacy: Location privacy is the
customization of software and services to user ability of an individual to control access to his/
preferences. The computing mechanism changes her current and past location information.
based on the context, in telematics perspective, Obfuscation: Obfuscation is the deliberate
location is a context for customization. degradation of location information by respond-
Electronic Tolls: Electronic payment systems ing in a less granular fashion about requested
designed to identify an electronic tag mounted on location data.
a vehicle to deduct the toll charges electronically Telematics Service Provider: Telematics Ser-
from the vehicle owner’s account. vice providers offer services to vehicle drivers for
In-Car Navigation: Usually a voice-activated either a subscription fee or any other arrangement.
system with a liquid crystal display (LCD) screen These can be emergency services or informational
displaying maps and a combination of on-board services to improve the driving experience.
GPS receivers, accelerometers, compass and gy- Vehicle Prognostics: Factory installed sys-
roscopes for positioning the vehicle on the map. tems monitoring and reporting health of vehicle
Intelligent Transportation Systems: Tools, equipment to owner and manufacturer periodi-
software, hardware and services designed for the cally.

This work was previously published in Handbook of Research on Geoinformatics, edited by Hassan A. Karimi, pp. 293-301,
copyright 2009 by Information Science Reference (an imprint of IGI Global).

406
407

Chapter 25
Diagnostic Modeling of
Digital Systems with Multi-
Level Decision Diagrams
Raimund Ubar
Tallinn University of Technology, Estonia

Jaan Raik
Tallinn University of Technology, Estonia

Artur Jutman
Tallinn University of Technology, Estonia

Maksim Jenihhin
Tallinn University of Technology, Estonia

ABSTRACT
In order to cope with the complexity of today’s digital systems in diagnostic modeling, hierarchical
multi-level approaches should be used. In this chapter, the possibilities of using Decision Diagrams
(DD) for uniform diagnostic modeling of digital systems at different levels of abstraction are discussed.
DDs can be used for modeling the functions and faults of systems at logic, register transfer and behavior
like instruction set architecture levels. The authors differentiate two general types of DDs – logic level
binary DDs (BDD) and high level DDs (HLDD). Special classes of BDDs are described: structurally
synthesized BDDs (SSBDD) and structurally synthesized BDDs with multiple inputs (SSMIBDD). A
method of iterative synthesis of SSBDDs and SSMIBDDs is discussed. Three methods for synthesis of
HLDDs for representing digital systems at higher levels are described: iterative superposition of HLDDs
for high-level structural representations of systems, symbolic execution of procedural descriptions for
functional representations of systems, and creation of vector HLDDs (VHLDD) on the basis of using
shared HLDDs for compact representing of a given set of high level functions. The nodes in DDs can be
modeled as generic locations of faults. For more precise general specification of faults different logic
constraints are used. A functional fault model to map the low level faults to higher levels, particularly,
to map physical defects from transistor level to logic level is discussed.

DOI: 10.4018/978-1-4666-2038-4.ch025

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Diagnostic Modeling of Digital Systems with Multi-Level Decision Diagrams

INTRODUCTION have been introduced for test generation (Blanton


& Hayes, 2003) and fault diagnosis (Mahlstedt,
The most important question in testing today’s Alt & Hollenbeck, 1995; Holst & Wunderlich,
digital systems is: how to improve the test 2008). On the other hand, the defect-orientation
generation and fault simulation efficiency and is increasing the complexity of the diagnostic
quality at continuously increasing complexities modeling task even more. To get out from the
of systems? Two main trends can be observed: deadlock, these two opposite trends – high-level
defect-orientation and high-level modeling. To modeling and defect-orientation – should be com-
follow the both trends simultaneously, a hierar- bined into a hierarchical approach. The advantage
chical approach seems to be the only possibility. of hierarchical diagnostic modeling compared to
One of the attractive ways to manage hierarchy the high-level functional modeling lies in the pos-
in diagnostic modeling (test generation, fault sibility of constructing and analyzing test plans
simulation, fault location) in a uniform way on on higher levels, and modeling faults on more
different levels of abstraction is to use decision detailed lower levels.
diagrams (DD).
Traditional flat low-level test methods and
tools for complex digital systems have lost their BACKGROUND
importance, other approaches based mainly on
higher level functional and behavioral methods The difficulties in developing of analytical multi-
are gaining more popularity (Lee, Patel, 1997; level and hierarchical approaches to digital test
Makris, Collins, Orailoglu & Vishakantaiah, 2000; generation and fault simulation lay in using dif-
Fummi, & Sciuto, 2000; Vedula & Abraham, ferent languages and models for different levels
2002; Mirkhani, Lavasani & Navabi, 2002; Al- of abstractions. Most frequent examples are logic
Yamani, & McCluskey, 2004; Kundu, 2004; Yi, expressions for combinational circuits, state transi-
& Hayes, 2006; Misera, Vierhaus, Breitenfeld & tion diagrams for finite state machines (FSM), ab-
Sieber, 2006; Alizadeh, & Fujita, 2010; Misera stract execution graphs, system graphs, instruction
& Urban, 2010). Hierarchical mixed- or multi- set architecture (ISA) descriptions, flow-charts,
level approaches have also been used both, for hardware description languages (HDL, VHDL,
test generation (Lee, Patel, 1997; Makris, Collins, Verilog, System C etc.), Petri nets for system level
Orailoglu & Vishakantaiah, 2000; Vedula & Abra- description etc. All these models need dedicated
ham, 2002; Ravi, Jha, 2001; Ichihara, Okamoto, for the given language manipulation algorithms
Inoue, Hosokawa & Fujiwara, 2005), and fault and fault models which are difficult to merge into
simulation (Mirkhani, Lavasani & Navabi, 2002; hierarchical test methods. HDL based modeling
Kundu, 2004; Misera, Vierhaus, Breitenfeld & methods which are efficient for fault simulation
Sieber, 2006; Misera & Urban, 2010). A general lack the capability of analytical reasoning and
idea in these methodologies is detailed low-level analysis that is needed in test generation and
fault simulation in one module of a system, while fault diagnosis.
propagating the effects through other modules Excellent opportunities for multi-level and
modeled at a higher level abstraction. hierarchical diagnostic modeling of digital systems
The trend towards higher level modeling moves provide decision diagrams (DD) because of their
us away from the real life of defects and, hence, uniform cover of different levels of abstraction, and
from accuracy of testing. To handle adequately because of their capability for uniform graph-based
defects in deep-submicron technologies, new fault analysis and diagnostic reasoning (Lee, 1959;
fault models and defect-oriented test methods Ubar, 1976; Akers, 1978; Plakk & Ubar, 1980;

408
Diagnostic Modeling of Digital Systems with Multi-Level Decision Diagrams

Bryant, 1986; Minato, 1996; Sasao, & Fujita, derived from logic functions don’t represent in-
1996; Ubar, 1996; Drechsler & Becker, 1998). herently the structural details of the circuits they
Within the last two decades BDDs have be- represent, which are related to signal paths and
come the state-of-the-art data structure in VLSI structural location of faults. For the lack of direct
CAD for representation and manipulation of representation of such structural information in the
Boolean functions. They were first introduced model, indirect and less efficient methods should
for logic simulation in (Lee, 1959), and for test be applied when the structure related tasks are
generation in (Ubar, 1976, Akers, 1978). In 1986, being solved like test generation, fault simulation
Bryant proposed a new data structure called and fault diagnosis.
reduced ordered BDDs (ROBDDs). He showed Structurally synthesized BDDs (SSBDD)
simplicity of the graph manipulation and proved were proposed and developed with the goal to
the model canonicity that made BDDs one of the represent, simulate and analyze structural features
most popular representations of Boolean functions of circuits (Ubar, 1996; Jutman, Raik & Ubar,
(Minato, 1996; Sasao, & Fujita, 1996; Drechsler & 2002). The most significant difference between
Becker, 1998). Different types of BDDs have been the function-based BDDs and SSBDDs is in the
proposed and investigated during decades such as method how they are generated. While BDDs
shared or multi-rooted BDDs (Minato, Ishiura, & are generated on the functional basis by Shan-
Yajima, 1990), ternary decision diagrams (TDD), non expansion, or other types of expansions (like
or in more general, multi-valued decision diagrams Davio or Reed-Muller expansions), which handle
(MDD) (Srinivasan, Kam, Malik, & Bryant, 1990), only the Boolean function of the logic circuit, the
edge-valued binary decision diagrams (EVBDD) SSBDD models are generated by superposition of
(Minato, Ishiura, & Yajima, 1990), functional BDDs that extracts both, functions and data about
decision diagrams (FDD) (Kebschull, Schubert, structural signal paths of the circuit. The linear
& Rosenstiel,1992), zero-suppressed BDDS complexity of the SSBDD model results from the
(ZBDD) (Minato, 1993), algebraic decision dia- fact that a digital circuit is represented as a system
grams (ADD) (Bahar, Frohm, Gaona, Hachtel, of SSBDDs, where for each fanout-free region
Macii, Pardo, & Somenzi, 1993), Kronecker FDDs (FFR) a separate SSBDD is generated. SSBDDs
(Drechsler, Theobald, & Becker, 1994), binary have found efficient applications in simulation of
moment diagrams (BMD) (Bryant, & Chen, 1995), structural aspects and faults for test generation
free BDDs (Bern, Meinel, & Slobodova,1995), (Raik & Ubar, 1998), hazard and timing analysis
multiterminal BDDs (MTBDD) and hybrid BDDs with multi-valued simulation (Ubar, 1998; Jut-
(Clarke, Fujita, & Zhao, 1996), Fibonacci decision man, Ubar & Peng, 2001), fault simulation and
diagrams (Stankovic R.S., Astola, Stankovic, M., fault coverage analysis (Ubar, Devadze, Raik &
& Egiazarian, 2002) etc. Overviews about dif- Jutman, 2008 & 2010), design error diagnosis
ferent types of BDDs can be found for example (Ubar & Borrione, 1999).
in (Sasao, & Fujita, 1996, Drechsler, & Becker, Recent research has shown that generalization
1998; Karpovsky, Stanković, & Astola, 2008). of BDDs for higher levels provides a uniform
For more detail about the many ways to represent model for both gate and RT or behavioral level
logic functions, we refer for example to (Sasao, simulation (Ubar, Moraviec & Raik, 1999), mixed-
1999; Astola, & Stankovic, 2006). level fault simulation (Ubar, Raik, Ivask & Brik,
Most of these listed different types of BDDs, 2002), hierarchical test generation (Raik & Ubar,
however, suffer from the memory explosion, which 2000), and high-level design verification (Jenih-
limits their usability for large designs. Regarding hin, Raik, Chepurov & Ubar, 2009).
test and fault simulation, the listed types of BDDs

409
Diagnostic Modeling of Digital Systems with Multi-Level Decision Diagrams

The disadvantage of the traditional hierarchical bolic execution of procedural descriptions of the
approaches to test is the conventional application functionality of systems, and creation of vector
of gate-level stuck-at fault (SAF) model. It has HLDDs (VHLDD) on the basis of using shared
been shown that high SAF coverage cannot guar- HLDDs for compact representing of a given set
antee high quality of testing (Huisman, 1993). The of high level functions. The last section is devoted
types of faults that can be observed in a real gate to functional fault modeling which helps to map
depend not only on the logic function of the gate, faults from lower to higher levels in multi level
but also on its physical design. Good possibilities diagnostic modeling of systems with DDs.
to combine logical and physical level fault model-
ing are provided by the extensions of the SAF-
based fault model like pattern fault (Blanton & DECISION DIAGRAMS
Hayes, 2003) or conditional fault models (Holst & AND DIGITAL TEST
Wunderlich, 2008). A similar pattern related fault
modeling approach called functional fault model Diagnostic Modeling of Digital
was proposed in (Ubar, 1980) for the module level Circuits with SSBDDs
fault diagnosis in combinational circuits. Based
on this extended SAF fault model, a deterministic In the following we present a definition of
defect-oriented test pattern generator DOT was SSBDDs, method of synthesis of SSBDDs, and
developed in (Raik, Ubar, Sudbrock, Kuzmicz & describe briefly fault simulation and test genera-
Pleskacz, 2001), which allows to prove the logic tion algorithms for SSBDDs with the purpose to
redundancy of not detected physical defects from generalize all this topics for HLDDs.
the given fault model. Consider first, the following graph theoreti-
Whereas the methods of synthesis of different cal definitions of the BDD. We use the graph-
types of BDDs are well described in the literature, theoretical definitions instead of traditional ite
the formal methods of generation of high-level expressions (Bryant, 1986; Drechsler & Becker,
DDs (HLDD) have not been discussed in the 1998) because all the procedures defined further
literature so far. This gap will be filled up in this for SSBDDs are based on the topological reasoning
chapter. We present first, a short overview about rather than on graph symbolic manipulations as
SSBDDs including the definition, synthesis, and traditionally in the case of BDDs. The topological
main algorithms of diagnostic modeling. In the approach allows to generalize all the algorithms
next Section we show possibilities to improve related to low logic level BDDs (SSBDDs) for
the model by reducing its complexity while high functional or behavioral level DDs.
retaining all the information about the structure
of the original circuits. Next, we generalize the Definition 1: A BDD that represents a Boolean
graph model of logic level SSBDDs for higher function y=f(X), X = (x1,x2, …, xn), is a
levels of abstraction with introducing high level directed acyclic graph Gy = (M,Γ,X) with
DDs (HLDD). We compare them with BDDs a set of nodes M and a mapping Γ from M
and present main algorithms of diagnostic mod- to M. M = MN ∪ MT consists of two types
eling based on HLDDs as the generalizations of nodes: nonterminal MN and terminal MT
of similar algorithms for BDDs. Then, three nodes. A terminal node mT ∈ MT = {mT,,0,
methods for synthesis of HLDDs for represent- mT,1} is labeled by a constant e∈{0,1} and is
ing digital systems at higher levels are proposed: called leaf, while all nonterminal nodes m ∈
iterative superposition of HLDDs for high-level MN are labeled by variables x ∈ X, and have
structural representations of digital systems, sym- exactly two successors. Let us denote the as-

410
Diagnostic Modeling of Digital Systems with Multi-Level Decision Diagrams

sociated with node m variable as x(m), then input and a signal path of the given tree-like
m0 is the successor of m for the value x(m) = circuit. The BDD is called a structurally
0 and m1 is the successor of m for x(m) = 1. synthesized BDD (SSBDD) iff there exists
Γ(m) ⊂ M denotes the set of all successors one-to-one correspondence between the
of m ∈ M, and Γ -1(m) ⊂ M denotes the set literals x ∈ X and the nodes m ∈ MN given
of all predecessors of m ∈ M. For terminal by the set of labels { x(m) | x ∈ X, m ∈ MN},
nodes mT ∈ MT we have Γ (mT) = ∅. There and iff for all the possible vectors Xt ∈ {0,1}
is a single node m0 ∈ M where Γ -1(m) = ∅ n
a path l(m0, mT) is activated, so that y =
called root node. f(Xt) = x(mT).
Definition 2: For the assigned value of x(m) = e,
e ∈ {0,1}, we say the edge between nodes An example of such a SSBDD for the Boolean
m ∈ M and me ∈ M is activated. Consider a formula
situation where all the variables x ∈ X are
assigned by a Boolean vector Xt ∈ {0,1}n y = (x 11x 12 ) ∨ x 12x 31x 4 )(x 13x 22x 32 ) (1)
to some values. The activated by Xt edges
form an activated path through nodes l(m0,
mT) ⊆ M from the root node m0 to one of the which corresponds to the gate-level circuit in
terminal nodes mT∈ MT. Figure 1a is represented in Figure 1b. We call
Definition 3: We say that a BDD Gy = (M,Γ,X) this graph structurally synthesized because it is
represents a Boolean function y=f(X), iff derived from and represents the structure of the
for all the possible vectors Xt ∈ {0,1}n a Formula (1) or the corresponding circuit in Figure
path l(m0, mT) ⊆ M is activated so that y = 1a. There is a one-to-one mapping between the
f(Xt) = x(mT). nodes in graph and the literals in the Formula (1)
Definition 4: Consider a BDD Gy=(M,Γ,X) which or in the corresponding signal paths in the circuit
models a gate-level tree-like combinational in Figure 1a.
circuit represented by a Boolean formula
(equivalent parenthesis form [21]) y = P(X) Synthesis of SSBDDs
where X is the vector of literals, the nodes m
∈ MN are labeled by x(m) where x ∈ X, and For synthesis of SSBDDs for a given gate network,
|M | = |X |. Each literal x ∈ X represents an the graph superposition procedure is used. If the

Figure 1. A circuit and a SSBDD which correspond to the Boolean Formula (1)

411
Diagnostic Modeling of Digital Systems with Multi-Level Decision Diagrams

label x(m) of a node m in the SSBDD Gy is an form) y = (x1 ∨ x21) (x22 ∨ x3) is shown in Figure 2.
output of a subnetwork which is represented by For simplicity, the values of variables on edges
another SSBDD Gx(m) then the node m in Gy can of the SSBDD are omitted (by convention, the
be substituted by the graph Gx(m). In this graph right-hand edge corresponds to 1 and the down-
superposition procedure the following changes wards directed edge to 0). Likewise, the terminal
in Gy and Gx(m) are made. nodes with constants 0 and 1 are omitted: leaving
the graph to the right corresponds to y = 1, and
Algorithm 1: Graph superposition procedure down, to y = 0.
1) The node m will be removed from Gy. We start with the output AND gate and its BDD
2) All the edges in Gx(m) that were con- Gy which consists of two nodes a and b. The input
nected to terminal nodes mT,e in Gx(m) a of the AND gate is simultaneously the output
will be cut and then connected, corre- of the OR gate represented by the BDD Ga which
spondingly, to the successors me of the consists of the nodes x1 and x21. First, we substitute
node m in Gy. Here, mT,e is the terminal the node a in Gy by the graph Ga. Thereafter the
node labeled by constant e∈ {0,1} node b in Gy is substituted by the graph Gb which
3) All the incoming edges of m in Gy will consists of the nodes x22 and x3. The final graph
be now incoming edges for the initial which represents the whole circuit except the
node m0 in Gx(m). fanout stem x2 consists of the nodes x1, x21, x22,
and x3. The full SSBDD model for the given
We assume that the BDDs for elementary circuit consists of two graphs: the graph created
components (gates) of the circuit are given as a (in Figure 2), and the graph consisting of a single
library. Let us call these library BDDs as elemen- node x2.
tary SSBDDs. Starting from the SSBDD of the In general case, the SSBDD model for a given
output gate, and using iteratively Algorithm 1, we combinational circuit consists of SSBDDs for all
compress the initial model of the gate-network (by tree-like fanout-free regions (FFR) and of 1-node
each substitution we reduce the model by one node SSBDDs for all primary inputs which have fan-
and by one graph). To avoid the explosion of the out branches.
SSBDD model, we generate the SSBDDs only Unlike the “traditional” BDDs, SSBDDs
for tree-like subnetworks. As the result we get a directly support test generation and fault simula-
macro-level logic network where each macro (a tion for gate-level structural SAF faults without
tree-like subcircuit) is represented by a SSBDD. representing these faults explicitly. The advantage
An example of the graph superposition pro- of the SSBDD based approach is that the library
cedure for a combinational circuit represented of components is not needed for structural path
by a Boolean formula (equivalent parenthesis activation, as it is in the case of traditional gate-

Figure 2. Example of superposition of SSBDDs

412
Diagnostic Modeling of Digital Systems with Multi-Level Decision Diagrams

level path activation. This is the reason why on the macro-level (FFRs as macros) with direct
SSBDD based test generation procedures do not representation of the faults in the model.
depend on whether the circuit is represented on
the logic gate level or on the logic macro-level Test Generation with SSBDDs
where the logic macro means an arbitrary single-
output gate-level subcircuit of the whole gate Consider a combinational circuit as a network of
network. Moreover, the test generation procedures gates, which is partitioned into interconnected
developed for SSBDDs can be easily generalized tree-like subcircuits (macros). This is a higher
for higher level DDs to handle digital systems level (macro-level) representation of the same
represented at higher levels. circuit. Each macro is represented by a SSBDD
From the above-described synthesis of where each node corresponds to a signal path from
SSBDDs, automatic fault collapsing results for the an input of the macro to its output. In the tree-
given gate-level circuits which can be formulated like subcircuits only the stuck-at faults at inputs
as the corollary resulting from Algorithm 1. should be tested. This corresponds to testing all
the nodes in each SSBDD.
Corollary 1: Since all the SAF faults on the inputs Test generation for a node m in SSBDD, which
of a FFR according to the approach of fault represents a function y = f(X) of a tree-like sub-
folding (To, 1973) form the collapsed fault circuit (macro), is carried out by the following
set of the FFR, and since all these faults are procedure.
represented by the faults at the nodes of the
corresponding SSBDD, then the creation of Algorithm 2: Test generation for a node m in the
the SSBDD is equivalent to the fault col- SSBDD Gy, y = f(X)
lapsing procedure similar to fault folding. 1) A path lm = l(m0, m) from the root node
of SSBDD to the node m is activated.
Assume a node m with label x(m) represents 2) Two paths lm,e = l(m,mT,e) consistent with
a signal path p(m) in a circuit. Suppose the path lm, where e∈{0,1}, from the neighbors
p(m) goes through n gates. Then, instead of 2n me of m to the corresponding terminal
faults of the path p(m) in the circuit, only 2 faults nodes mT,e should be activated. If the
related to the node variable x(m) should be tested node m is directly connected via e-edge
when using the SSBDD model. to mT,e, no path lm,e should be activated
for this particular value of e.
Theorem 1: Let G(C) be the SSBDD model 3) For generating a test for a particular
generated for the combinational circuit C stuck-at-e fault x(m) ≡ e, e∈{0,1}, the
by Algorithm 1. Then, any set of tests that opposite assignment is needed:
checks all the SAF faults at the nodes of x (m ) = e.
G(C) checks all the SAF faults in C. 4) All the values assigned to node vari-
ables (to variables of X) form the local
The proof follows from Corollary 1 and Theo- test pattern Xt (input pattern of the
rem 5 in (To, 1973). macro) for testing the node m in Gy
Differently from the traditional gate level (for testing the corresponding signal
approaches to test generation and fault simula- path p(m) on the output y of the given
tion which use the collapsed fault list apart from tree-like circuit).
the simulation model, the SSBDD based test
generation and fault simulation are carried out

413
Diagnostic Modeling of Digital Systems with Multi-Level Decision Diagrams

As an example, consider test generation for path lm* = l(m*,mT,e*) activated by the
the fault x11 ≡ 0 at the node x11 in Figure 1b. To pattern Xt is determined; if mT,e* ≠ mT,e
test this fault we have to assign to x1 the opposite then the fault of the node m is detected
value x1 = 1. Since x11 is the root node, the path lm by Xt otherwise not.
is empty and there is no need to activate it. The
path lm,1 can be activated by assignment x2 = 1, As an example, consider the test pattern Xt =
and the path lm,0 can be activated by assignment (x1, x2, x3, x4)t = 1011 which activates the path l =
x3 = 0. The generated test pattern is: Xt = (x1, x2, (x11, x21, x12, x31, x4, #1) in Figure 1b. According
x3, x4)t = (110-). The pattern detects all the faults to Step 2 of Algorithm 3 we have: lx11*= lx12 = (x12,
(x11 ≡ 0, x21 ≡ 0, y ≡ 0) along the signal path from x31, x4, #1); lx21* = (#1); lx12* = lx13(x13, #0), lx31* =
x11 to y in the circuit in Figure 1a. lx13(x13, #0); lx4* = lx13(x13, #0). From this it follows
This example demonstrated test generation that by the given test pattern the faults x12 ≡ 0, x31
in a single SSBDD. In general case, the test gen- ≡ 0, x4 ≡ 0 are detected.
eration for embedded macro-components of the
circuit the local test patterns for macros should Fault Diagnosis with SSBDDs
be extended to the final test patterns in terms of
primary inputs of the circuit, similarly as for the The fault simulation procedure described by
gate-level circuits where fault propagation and Algorithm 3 can be used during fault diagnosis
line justification procedures are needed. The based on the effect-cause fault location concept.
difference, however, is that these procedures are Suppose there has been an error detected on the
carried out on the higher macro level (instead of output y of the circuit in Figure 1a by the pattern
the gate level) whereas the macros of the circuit Xt = 1011. According to the example, the fault
are represented by SSBDDs. The fault propagation candidates should be suspected first along the path
through a macro from the input x to its output y is l activated by the pattern. After fault simulation
carried out similarly to the test generation for the the set of candidate faults has been pruned to the
node m labeled by x in the corresponding SSBDD subset { x12 ≡ 0, x31 ≡ 0, x4 ≡ 0}
Gy as explained in Algorithm 2. Line justification The experiments of logic-level simulation with
for the task y = e is carried out by activating a SSBDD model were carried out on ISCAS’85
path in the graph Gy from the root node to the benchmarks. They were run for each circuit on
terminal node mT,e. two levels of abstraction: gate-level and SSBDD.
This allows to directly measure the effect of the
Fault Simulation with SSBDDs chosen model on simulation speed. Figure 3 il-
lustrates the average speed-up brought by usage
Fault simulation of a test pattern Xt on the SSBDD of the SSBDD model compared to the gate-level
Gy, which represents a function y = f(X) of a simulation speed. The fault simulation (fat dashed
tree-like subcircuit (macro), is carried out by the line) shows the most noticeable acceleration.
following procedure. Other simulation algorithms vary in decreasing
the runtime by 2,5 up to almost 4 times compared
Algorithm 3: Fault simulation of a test pattern to algorithms working on the gate-level netlist
Xt in Gy, y = f(X) model. This effect is possible due to shift from
1) The path l = l(m0,mT,e) activated by the lower gate level to a higher macro level when
pattern Xt in Gy is determined. working with SSBDD model.
2) For each node m ∈ l, its successor m*
is determined so that m* ∉ l, and the

414
Diagnostic Modeling of Digital Systems with Multi-Level Decision Diagrams

Figure 3. Logic level simulation speed-up for different algorithms

MODELING DIGITAL CIRCUITS WITH can be calculated by entering the graph G1 (G2)
MULTIPLE INPUT SSBDDS at additional inputs 4 and 5 (6 and 7).

Introduce now a new type of SSBDD model called Algorithm 4: Generation of SSMIBDDs
Structurally Synthesized Multiple Input BDDs
(SSMIBDD) by extending the superposition The general procedure of constructing the
procedure beyond the fanout nodes of the given SSMIBDD is as follows. Let us have a circuit
circuit. The goal is to further compress the SSBDD C with a set of outputs Y, a set of fanout stems
model by exploiting the effect of the superposi- FS, and a library of BDDs for all the gates in C.
tion procedure where by each superposition step For all outputs of Y we start a generation of SS-
a node and a graph from the whole BDD model of MIBDD by Algorithm 1. Let us have two graphs
the gate-level network are removed. To avoid the to be superpositioned: G2 for the gate g2 into G1
negative effects of fanout stems xi with branches for the gate g1. We have now three possibilities:
xi,1, xi,2,... xi,k, the superposition in the graph G
under construction is allowed only for a single 1) If for all input variables x of the gate g1 , x ∉
branch xi,j, 1 ≤ j ≤ k. For the remaining branches FS, then Algorithm 1 is carried out as it is.
of the same fanout stem a link will be established 2) If there is a single x: x ∈ FS, then before the
from other graphs via corresponding additional superposition the nodes in G1 are reordered
inputs inserted into the graph G. in such a way that the node m labeled with
In Figure 4, a hypothetical initial circuit is x ∈ FS will be the last node in G1, i.e. Γ(m)
represented with three outputs 1, 2, 3 and four = ∅. The node m will get an additional input
fanout stems 4, 5, 6, 7. The circuit is modeled x as the link to other branch variables (in
by two SSMIBDDs G1 and G2 with root nodes, other graphs) of this fanout stem. Note that
respectively, 1 and 2, and by a SSBDD G3 with for all other nodes m*∈ Γ -1(m), no additional
the root node 3. The links from the nodes 4, 5, 6 inputs can be inserted.
in G2, and from the nodes 6, 7 in G3 to the graph 3) If there are more than one input variables x: x
G1 are available via additional inputs to G1. In ∈ FS, then only one of them can be selected
other words, when calculating the output values for further superpositioning according to the
2 and 3, respectively, in graphs G2 and G3, the case 2, and for all other variables we have to
values of variables in nodes 4 and 5 (6 and 7) start the construction process of new graphs.

415
Diagnostic Modeling of Digital Systems with Multi-Level Decision Diagrams

Figure 4. Concept of SSMIBDDs

An example of a combinational circuit with The SSMIBDD model can be used for test
three outputs and 9 internal fan-outs is depicted generation in a similar way as the SSBDD
in Figure 5a, and the corresponding SSMIBDD model was used for test generation and for fault
model consisting of only 3 graphs is shown in simulation. Consider as an example test generation
Figure 5b. SSBDD model for this circuit would for the fault SAF-13/1 (stuck-at-1 on the upper
consist of 12 graphs. The number of faults to be branch of the node 13) in Figure 5a with SS-
processed in the gate level circuit in Figure 5a is MIBDDs in Figure 5b. Let find by Algorithm 2
before fault collapsing 84. Using the FFR based the input pattern which propagates the faulty
SSBDD model with 27 nodes we can collapse signal from 13 to the output 20 to make the fault
the fault set up to 54 faults (corresponding to the observable. To activate the paths l13 = (17,16,13)
representative faults at 9 inputs and 18 internal and l13,0 = (13,5,#0) (l13,1 is empty) in G20 we as-
fan-out branches of the circuit). Using the SS- sign: x17 = 0, x16 = 1, x13 = 0, and x5 = 0. In G19 we
MIBDD model with 18 nodes in Figure 5b allows see that x17 = 0 can be justified by x15 = 0. How-
collapsing the fault set up to 36 faults. ever, in G20 we see that x15 = 0 is already justified.

Figure 5. Circuit with high number of fan-outs and its SSMIBDDs

416
Diagnostic Modeling of Digital Systems with Multi-Level Decision Diagrams

In G21 we justify x16 = 1 by x8 = 1, and x13 = 0 by are merged into the same graph can be found in
x6 = 0. Hence, the test pattern for detecting the (Ubar, Mironov, Raik & Jutman, 2010).
fault SAF-13/1 is: Xt = (x5, x6, x8) = 001. The fault A comparison of the complexities of gate-
is propagated to the output 20 via the highlighted level circuits, SSBDDs and SSMIBDDs in terms
path in Figure 5a. To solve the same task on the of fault collapsing was carried out using ISCAS’85
gate-level would be more complex because of the and ISCAS’89 circuits. The differences in the
higher complexity of the model. number of nodes between the gate level, SSBDD
The method of generating SSMIBDDs de- and SSMIBDD models are shown in Figure 7.
scribed above starts from the outputs of the The average minimization gained for SSMIBDDs
given circuits. Using this procedure, in the best in the number of nodes (and also in the size of
case, a circuit with n outputs can be represented collapsed fault sets) for ISCAS’85 and ISCAS’89
by n SSMIBDDs. However, it is easy to see that circuits is up to 2,4 times compared to the gate
the number of graphs can be further reduced by level, and up to 1,4 times compared to the SSBDD
sharing the same subgraphs by different output model.
graphs, and correspondingly, the number of nodes The results prove that the SSMIBDD model
in the whole model can be further reduced, which is more compact than the previously discussed
results in further fault collapsing. SSBDD or gate level models, and as the result
An example of the minimal SSMIBDD for the allows better fault collapsing which in its turn has
circuit in Figure 6a is presented in Figure 6b. The the influence to the efficiency and speed of test
new SSMIBDD model with 7 nodes contains only generation and fault simulation.
14 collapsed faults as targets for test generation.
The nodes x1, and x5 represent signal paths in the
circuit from x1 to y1 and from x5 to y2, respectively. MODELING DIGITAL SYSTEMS WITH
The node x3 represents the path from the lower HIGH LEVEL DECISION DIAGRAMS
input of g1 to y1. The node x2 represents the path
from the upper input of g3 to y1. The nodes ¬x3 and Test generation methods developed for SSBDDs
¬x4 represents the paths from the inputs of g2 to have an advantage compared to other logic level
both outputs y1 and y2. And, finally, the node ¬3 methods, namely that they can be easily gen-
represents the path from the upper input of g6 to eralized to handle the test generation and fault
y2. More detailed discussion of using SSMIBDDs simulation problems at higher levels of systems.
where the graphs for different output functions The possibility of generalization results from the

Figure 6. Minimized SSMIBDD for the ISCAS’85 benchmarks family circuit c17

417
Diagnostic Modeling of Digital Systems with Multi-Level Decision Diagrams

Figure 7. Comparison of the complexities of gate-level circuits, SSBDDs and SSMIBDDs

topological similarity of representing systems with Definition 6: For the assigned value of z(m) = e,
DDs at lower and higher levels. e ∈ V(m), we say the edge between nodes
Consider a digital system S as a network of m ∈ M and me ∈ M is activated. Consider
components (or subsystems) where each compo- a situation where all the variables z ∈ Z are
nent is represented by a function z = f (z1, z2,..., zn) assigned by a vector Zt to some values from
= f (Z) where Z is the set of variables (Boolean, the domains of definition V(z), z∈ Z. The
Boolean vectors or integers), and V(z) is the set activated by Zt edges form an activated path
of possible values for z ∈ Z which are finite. In through nodes l(m0, mT) ⊆ M from the root
the case of representing next state functions z ∈ Z. node m0 to one of the terminal nodes mT∈ MT.
Definition 7: We say that a decision diagram Gz
Definition 5: A decision diagram (DD) which represents a function z = f (z1, z2, …, zn) = f
represents a digital function z=f(Z) is a (Z), iff for each value V(Z) = V(z1) × V(z2)
directed acyclic graph Gz = (M,Γ,Z) with ×... × V(zn), a path in Gz is activated from
a set of nodes M and a mapping Γ from M the root node z0 to a terminal node mT ∈MT,
to M. M = MN ∪ MT consists of two types so that z = z(mT) is valid.
of nodes: nonterminal MN and terminal MT
nodes. The terminal nodes m ∈ MT may be It is easy to see that SSBDDs represents a
labeled either by variables z(m) ∈ Z, digital special case of DDs where for all z ∈ Z, V(z) =
functions z(m) = fm(Zm), Zm ⊆ Z, or constants. {0,1}, and two terminal nodes are labeled by the
The nonterminal nodes m ∈ MN are labeled Boolean constants 0 and 1.
by variables z(m) ∈ Z, and have successors Depending on the class of the system (or its
whose number may 2 ≤ |Γ(m)| ≤ |V(z(m))|. representation level), we may have various DDs,
A mapping exists between the values of where nodes have different interpretations and
z(m) and the successors in Γ(m). Denote the relationships to the system structure. In RTL
successor of m for a given value of z(m) as descriptions, we usually partition the system
mz(m) ∈ Γ(m). Γ(m) ⊂ M denotes the set of into control and data paths. Nonterminal nodes
all successors of m ∈ M, and Γ -1(m) ⊂ M in DDs may correspond then to the control path,
denotes the set of all predecessors of m ∈ M. and they are labeled by state or output variables
For terminal nodes mT ∈ MT we have Γ (mT) of the control part serving as addresses or control
= ∅. There is a single node m0 ∈ M where Γ words. Terminal nodes in DDs correspond in this
-1
(m) = ∅ called root node. case to the data path, and they are labeled by the

418
Diagnostic Modeling of Digital Systems with Multi-Level Decision Diagrams

data words or functions of data words, which cor- Algorithm 5: High-level test generation for a
respond to buses, registers, or data manipulation non-terminal node m∈ MN in the DD Gz
blocks. When using DDs for describing complex 1) A path lm = l(m0, m) from the root node
digital systems, we have to first, represent the of SSBDD to the node m is activated.
system by a suitable set of interconnected com- 2) For all e ∈ V(m), non-overlapping
ponents (combinational or sequential subcircuits). paths lm,e consistent with lm from me up
Then, we have to describe these components by to terminal nodes mT,e are activated. If
their corresponding functions which can be rep- the node m is directly connected via
resented by DDs. e-edge to mT,e, no path lm,e should be
In Figure 8, a RTL data-path and its compressed activated.
DD is presented. The variables R1 and R2 represent 3) A proper set of data (the values of the
registers, IN denotes the input bus, the integer variables in Z) is found, so that the in-
variables y1, y2, y3, y4 represent control signals, equality z(mT,,1) ≠ z(mT,2) ≠ … ≠ z(mT,n)
M1, M2, M3 are multiplexers, and the functions holds, where n = |Γ(m)|.
R1+R2 and R1*R2 represent the adder and multiplier, 4) The test data consist of a static part TS
correspondingly. Each node in DD represents a (the values generated in Steps 1-3), and
subcircuit of the system (e.g. the nodes y1, y2, y3, y4 of a dynamic part TD (all the values
represent multiplexers and decoders,). The whole from V(m)). To implement the test, a
DD describes the behavior of the input logic of the loop is created where TS is repeated
register R2. To test a node means to test the cor- |V(m)| times for all the values of z(m)
responding to the node component or subcircuit. ∈ V(m) in TD. Before each loop all the
Test pattern generation and fault simulation values in TS are restored.
algorithms can be derived easily by generalization
of algorithms 2 and 3, respectively. The test gen- The paths in the SSBDD activated by the
eration for terminal and non-terminal nodes described procedure are illustrated in Figure 9.
slightly differs, as it can be seen from the follow- Note, the test generation according to Algo-
ing algorithms 5 and 6. rithm 5 is carried out entirely on the high-level,
based on the high-level fault model, according to

Figure 8. Representing a data path by a decision diagram

419
Diagnostic Modeling of Digital Systems with Multi-Level Decision Diagrams

Figure 9. Test generation for the node m with low level and high-level DDs

which the behavior of the node m is tested ex- = D3


haustively. It is reasonable, since the domains of Read the value of R2,e
definitions of control variables are not large. END.
Satisfaction of the inequality in Step 3 of Algorithm
5 is the guarantee that the node m under testing Algorithm 6: Test generation for a terminal node
behaves correctly. m ∈ MT in the DD Gz
As an example, consider test generation for 1) A path lm = l(m0, m) from the root node
testing the multiplexer M3 represented by the node of SSBDD to the node m ∈ MT is ac-
y3 in DD in Figure 8b. We activate 4 paths lm,e for tivated. This will be the static part TS
each value e = 0,1,2,3 of y3. Two of them, lm,1, of the test.
lm,2, for values y3 = 1 and y3 = 2, respectively, are 2) The proper sets of values TD = {Ztm},
“automatically” activated since the successors of t =1,2,..., n, for testing the function
y3 for these values are terminal nodes. The control fm(Zm) represented by the terminal node
values for the test are found by activating the path m is selected. This operation should
lm with assigning y4 = 2, and by activating two be carried out at the lower (e.g. gate)
paths lm,0 and lm,3 with assigning y1 = 0 and y2 = level if the implementation details for
0, respectively. The test data R1 = D1, R2 = D2, IN fm(Zm) are given, which directly leads
= D3 are found to satisfy the inequality R1 + R2 ≠ to hierarchical approach to test genera-
IN ≠ R1 ≠ R1 * R2. tion. Otherwise high level fault models
According to Step 4 of Algorithm 5, the follow- for testing fm(Zm) should be used.
ing test program for the control variable y3 results: 3) To implement the test, a loop is created
Test program for control part: where TS is repeated n times for all the
patterns in TD. Before each loop the
For e = 1,2,3,4 values in TS are restored.
BEGIN
Load the data registers R1 = D1, R2 = As an example, consider test generation for test-
D2 ing the multiplexer M3 represented by the node y3
Carry out the tested working mode at in DD in Figure 8b. By activating the path to this
y3 = e, y1 = 0, y2 = 0, y4 = 2 and IN node (shown in bold in Figure 8b) we generate a

420
Diagnostic Modeling of Digital Systems with Multi-Level Decision Diagrams

control word (y2, y3, y4) = (0, 3, 2). To find the proper The feasibility and advantages of using HLDDs
values of R1 and R2 we need to descend to the lower in diagnostic modeling of digital systems was dem-
level (e.g. gate level) and generate test patterns by onstrated by using the test generator DECIDER
a low level ATPG for the low level implementation (Raik & Ubar, 2000). The results regarding the
of the multiplier. Let us have a test set of n test speed of test generation and fault coverage (Table
patterns (D1,1,D2,1; D1,2,D2,2; … D1,n,D2,n) generated 1) were compared with other known test generators
for the multiplier with input registers R1 and R2. for complex sequential digital systems HITEC
From above, according to Step 3 of Algorithm 6, (Niermann, Patel, 1991) and GATEST (Rudnick,
the following test program results: Patel, Greenstein, Niermann, 1994). The experi-
Test program for data part: mental results show the high speed of the ATPG
DECIDER which is explained by the DD based
For all the values of t = 1,2, …, n hierarchical approach used in test generation
BEGIN The advantage of using DDs for test generation
Load the data registers R1 = D1,t, R2 and fault simulation is the very close similarity of
= D2,t handling low logic level and higher behavior or
Carry out the tested working mode register transfer level representations of digital
at the control values (y2,y3 y4) = systems. This helps to develop uniform approach-
(0,3,2) es and generalize the concepts of solving different
Read the value of R2,t test problems like test generation, fault simulation,
END. fault diagnosis, testability analysis etc. from logic
level to higher levels. To do this, we need formal
In test pattern simulation, a path is traced in methods for HLDD synthesis from common high-
the graph, guided by the values of input variables level representations of digital systems.
in Z until a terminal node is reached, similarly as
in the case of SSBDDs. In Figure 8, the result of
simulating the vector Zt = (y1, y2, y3, y4, R1, R2, IN) SYNTHESIS OF HIGH-LEVEL
= 0,0,3,2,10,6,12 is R2 = R1*R2 = 60 (bold arrows DECISION DIAGRAMS FROM THE
mark the activated path). Instead of simulating by NETWORK OF COMPONENTS
a traditional approach all the components in the
circuit, in the DD only 3 control variables are vis- In this chapter we present two methods for syn-
ited during simulation (y4, y3, y2,), and only a single thesis of high level DDs for representing digital
data manipulation R2 = R1*R2∈ MT is carried out. systems. The first method is based on iterative

Table 1. Comparison of test generators

HITEC GATEST DECIDER


Circuit # Faults
FC% T, s FC% T,s FC% T, s
gcd 454 81.1 170 91.0 75 89.9 14
sosq 1938 77.3 728 79.9 739 80.0 79
mult 2036 65.9 1243 69.2 822 74.1 50
ellipf 5388 87.9 2090 94.7 6229 95.0 1198
risc 6434 52.8 49020 96.0 2459 96.5 151
diffeq 10008 96.2 13320 96.4 3000 96.5 296
Average FC% 76.9 87.9 88.6

421
Diagnostic Modeling of Digital Systems with Multi-Level Decision Diagrams

superposition of HLDDs, and can be regarded as able corresponds to the state sj of the FSM. The
a generalization of Algorithm 1 used for synthesis apostrophe refers to the value from the previous
of SSBDDs in Section 2. This method can be used clock cycle. Assume that the functions λ and δ are
in cases when the system is given structurally by represented by HLDDs y = Gy (q’, x), and q = Gq
a network of components (subsystems), and for (q’, x), respectively.
each component its HLDD is given. An example The data path consists of the memory block M
of such presentation of a system is depicted with three registers A, B, C together with the ad-
in Figure 10a. The second method is based on dressing block ADR, represented by three DDs: A
symbolic execution of procedural descriptions, = GA (yM, z), B = GB (yM, z), C = GC (yM, z); of the
which correspond to the functional representa- data manipulation block CC where z = Gz (yz, z1,
tions of systems at higher behavioral levels. This z2); and of two multiplexers z1 = Gz,1 (yz,1, M) and
method can be used in cases when the system is z2 = Gz,2 (yz,2, M). The block COND performs the
given functionally as a procedure in a hardware calculation of the condition function x = Gx (A, C).
description language or in the form of a flow-chart. The component level model of the system
An example of such presentation of a system is consists of the following set of DDs: N={Gy,Gq,
depicted in Figure 10b. Consider in this Section GA,GB,GC,Gz,Gz,1,Gz,2,Gx}. Using now the follow-
the first method. ing chain of superpositions of DDs:
The system in Figure 10a consists of the con-
trol and data parts. The finite state machine (FSM) A = GA (yM, z) = GA (yM, Gz (yz, z1, z2))= GA (yM,
of the control part is given by the output function Gz (yz, Gz,1 (yz,1, M), f4 (yz,2, M))) = GA (yM, yz, yz,1,
y = λ (q’, x) and the next-state function q = δ (q’, yz,2, M) = GA (y, M) = GA (Gy (q’, x), M) = G’A
x), where y is an integer output vector variable, (q’, x, A, B, C),
which represents a microinstruction with four
control fields y = (yM, yz, yz,1, yz,2), x = (xA, xC) is and repeating a similar chain of superposition
a Boolean input vector variable, and q is the in- steps for the graphs B = GB (yM, z) and C = GC
teger state variable. The value j of the state vari- (yM, z), we create a new compact DD model of the

Figure 10. Digital system as a network of components and a flowchart of its behavior

422
Diagnostic Modeling of Digital Systems with Multi-Level Decision Diagrams

Figure 11. High Level Decision Diagrams for the system represented as a network in Figure 10a

system N’= {Gq, G’A, G’B, G’C} which is presented (binary vs. integers in V(m)). If a terminal node
in Figure 11. The part of the model related to the m in a graph Gy is labeled by a data variable z(m)
data path is represented by three DDs G’A, G’B, which is represented by another graph Gz(m) then
G’C where, for simplicity, the terminals nodes for the procedure is trivial: the node m in Gz can be
the cases where the value of the function variable simply substituted by the graph Gz(m). Superposi-
does not change, are omitted. tion of a non-terminal node m labeled by z(m) in
For synthesis of the HLDD model for a given Gz by the graph Gz(m) is only then possible when
network of component HLDDs, the iterative graph all the values z(m) in V(z(m)) have one-to-one
superposition of two HLDDs is used, which can mapping to the terminal nodes of Gz(m) labeled by
be regarded as the generalization of Algorithm 1. constant values from the same set V(z(m)).
If the label z(m) of a node m in the HLDD Gz is Consider as an example the data path as a
an output of a component in the network, which network of components in Figure 8a and the
is represented by another HLDD Gz(m), then the descriptions of the working modes of these com-
node m in Gz can be substituted by the graph Gz(m). ponents in Table 2.
In this graph superposition the following chang- Superposition of two graphs R2 = GR2 (y4, R2,
es in Gy and Gx(m) are made. e) for the register input logic and the multiplexer
M3 with internal variable e, e = Ge (y3, R1, IN, d)
Algorithm 7 is illustrated in Figure 12. The whole fully com-
1) The node m labeled by z(m) is removed pressed HLDD for the data-path in Figure 8a is
from Gy. depicted in Figure 8b.
2) All the edges in Gz(m) that were con-
nected to terminal nodes mT,e in Gz(m)
will be cut and then connected, to the SYNTHESIS OF HIGH-LEVEL
successors me of the node m in Gy, DECISION DIAGRAMS FROM
respectively. PROCEDURAL DESCRIPTIONS
3) All the incoming edges of m in Gy will
be now the incoming edges for the root Consider a procedure representing the behavior
node m0 in Gz(m). level description of a digital system. The proce-
dure can be represented by a flow chart which is
Note, this procedure corresponds exactly (!) a directed graph where a path Pi can be defined
to Algorithm 1 for synthesis of SSBDDs with as usual in the graph theory.
the only difference in the ranges of values for e

423
Diagnostic Modeling of Digital Systems with Multi-Level Decision Diagrams

Table 2. Working modes of the components in


path condition PCi = ∧ C ij is extracted as a
Figure 8a j :cij ∈ci

set of constraints on the input variables (condi-


y1 M1/a y2 M2/b y3 M3/e y4 R2 tions), such that the path is feasible iff the PCi is
0 R1 0 R1 0 M1+R2 0 0 satisfied. The path Pi is not feasible iff PCi ≡ 0.
1 IN 1 IN 1 IN 1 R2 Denote the set of all feasible paths as P.
2 R1 2 M3 In this section, a procedure similar to symbolic
3 M2*R2 execution is used for HLDD synthesis. We assume
that all the assignment statements correspond to
updating of the related data-path registers during
Denote by Y = {y1, y2,... ym} a set of system the current cycle. Based on that we introduce a
variables which values will be changed when “simplified” cycle based symbolic execution pro-
the procedure is executed. These changes are cedure. The procedure of HLDD synthesis based
determined by the assignment statements along on the procedural description of a digital system
the paths of the flow chart. A path Pi represents is carried out as follows.
a sequence of assignment statements Si = {sik}
and a sequence of conditional expressions Ci = Algorithm 8
{cil}. Here k = 1,2,...,m, and each sik ∈ Si denotes 1) Insertion of states into the flow-chart
an expression whose value will be assigned to yk. of the digital system
If the value of yk will not change in the current 2) Cycle based symbolic execution of
path Pi then formally sik ≡ y’k where the apostro- the procedure given by the flow-chart.
phe means that the value of y’k is taken from the Generation of the table of paths by
previous cycle. determining for each feasible path Pi
A path Pi is executable (or feasible) if there the sets of assignment statements Si
are input data such that the program can be exe- and the sets of constraints Ci.
cuted along that path. Otherwise, it is un-execut- 3) Generation of the conditional expres-
able (or infeasible). For deciding the feasibility sions yk = ∨i:P ∈PPC i ⋅ si,k for each
of program paths, symbolic execution of programs i

system variable yk ∈ Y.
and constraint solving techniques are used (Zhang,
2004). By symbolic execution for each path Pi, a

Figure 12. Superposition of two HLDDs

424
Diagnostic Modeling of Digital Systems with Multi-Level Decision Diagrams

4) Synthesis of HLDDs for each system of system variables is 4 (y1 = A, y2 = B, y3 = C,


variable yk by factorization of its con- y4 = q). Only these assignment statements sik, k =
ditional expression. 1,2,3,4, are noted in the last column of Table 3,
where the system variable may change its value
As an example, consider the procedure in during the transition.
Figure 10b which represents the system in Figure Next, for all the left-hand side system variables
10a. First, according to Step 1 of Algorithm 8, A, B, C and q in the last column of Table 3 we
the states are inserted into the flow chart of the create HLDDs which describe the cycle-based
procedure under analysis in a similar way as the behavior of these variables during execution of
algorithmic state machines are marked by states the procedure in Figure 10b.
during synthesis of FSM [46]. There are six states Steps 3 and 4 of Algorithm 8 are illustrated in
s0, s1,..., s5, inserted into the procedure in such a the following for the system variable A. In Table
way that during the state transfer each data variable 4, all the paths Pi are depicted where the value of
can be changed only once. A global state variable A may be changed according to the expression A
q is introduced to map the states to values of q. = y1= si,1 if the path constraint PCi = ∧ j :c ∈c C ij
For example, q = i for the state si. ij i

is satisfied.
During cycle based symbolic execution (Step
According to Step 3, from Table 4 the follow-
2 of Algorithm 8), the path trees are created only
ing expression for the behavior of the system
for the restricted regions between neighboring
variable A can be derived:
states, avoiding in this way the explosion of
paths. As a result of tracing all the transitions
A = (q=0)(B+C) ∨ (q=1)(xA=0) (¬A + 1) ∨
between the states in the flow graph, a table is
constructed where each row corresponds to a path
(q=3)(xC=1)(¬C+B) ∨
between neighboring states. The result of cycle
based symbolic execution of the procedure in (q=4)(xA=0)(xC=0)(A+ ¬B + C + 1)
Figure 10b is presented in Table 3. The number
According to Step 4, by factorization of this
Table 3. Results of symbolic execution of the expression, a HLDD for the variable A can be
procedure in Figure 10b derived in a similar way as BDDs are derived by
using Shannon factorization [16]. The only differ-
Path Constraints Ci Assignment statements ence is that instead of Boolean factorization we
Pi Si = {sik}
q xA xB xC use multi-valued factorization, depending on the
1 0 A = B + C; q = 1 possible number of values |V(z)| of the constraint
2 1 0 A = ¬A + 1; q = 4 variable z.
3 1 1 B = B + C; q = 2
4 2 0 C = A + B; q = 5
Table 4. Results of symbolic execution for the
5 2 1 C = ¬C; q = 3
variable A
6 3 0 C = A + B; q = 5
7 3 1 A = ¬C + B; q = 5 q xA xB xC A
8 4 0 B = ¬B 0 B+C
9 4 0 0 A = A + ¬B + C; q = 5 1 0 ¬A + 1
10 4 0 1 q=5 3 1 ¬C + B
11 4 1 C = ¬C; q = 5 4 0 0 A + ¬B + C

425
Diagnostic Modeling of Digital Systems with Multi-Level Decision Diagrams

The HLDDs created by factorization of the to calculate by a single run the values of all the
expressions yk = ∨i:P ∈PPC i ⋅ si,k , k = 1,2,3,4, for components of the vector variable Y = (y1, y2,... yn).
i
An example of merging the DDs GA, GB, GC
all the four variables A, B, C and q are shown in
and Gq in Figure 11 into a single VHLDD GM =
Figure 11. To underline that the values of variables
Gq, A, B, C to calculate the value of the vector M =
are taken from the previous cycle we use the
A.B.C.q is shown in Figure 13. For calculating and
apostrophes at the data and state variables.
assigning the new values to different components
In this model, for simplicity, all the edges in the
of the vector variable M, we introduce a new type
graph Gz for the function z = f(Z), which lead into
of a node in VDD called addressing node labeled
the terminal node labeled by the graph variable z,
by an addressing variable i. The vector decision
are dropped. This case corresponds to the assign-
diagrams offer the capability to efficiently repre-
ment where the value of z remains unchanged.
sent the array variables (corresponding to register
In other words, if we reach a non-terminal node
blocks and memories) for calculating and updating
m where the successor me for the value z(m) = e
their values. VHLDDs are particularly efficient for
is missing, the graph variable z holds the same
representing functional memories with complex
value as in the previous cycle. For example, in
input logic – with shared and dedicated parts for
the graph A in Figure 11, if q’ = 4 and xA ∨ xC =
different memory locations. In general case, all
0, the variable A will hold its value, i.e. A = A’.
the registers of the data path can be combined in
the VHLDD model as a single memory block.
Using VHLDDs allows significally to increase
VECTOR HIGH-LEVEL
the speed of simulation. Consider, as an example,
DECISION DIAGRAMS
the input pattern Zt = (q’,xA, xB, xC) = (1,1,0,0). As
the result of simulation of this pattern on all the
To further reduce the complexity of HLDDs
4 graphs in Figure 13 (the paths traversed are
we introduce the concept of vectorizing a set of
shown by bold lines and grey nodes) we get: A =
HLDDs. A set of HLDDs {Gy1, Gy2,... Gyn} can be
A’, B = B’ + C’, C = C’ and q = #2. The simulation
merged into a single vector HLDD (VHLDD) Gy1,
needs 11 visits to the decision nodes (note that
,... yn with less complexity as a result of sharing
y2
for A and C we stop simulation in the node where
similar parts in separate DDs. Such a graph allows

Figure 13. Representing a set of 4 HLDDs by a single vector HLDD

426
Diagnostic Modeling of Digital Systems with Multi-Level Decision Diagrams

the successors for xA=1 and q’=1, respectively,


y * = fy * (x 1, x 2 ,..., x n , ∆) = ∆fy ∨ ∆fy ∆
are missing, which means that the variables hold
their values). When simulating the same input
where fyΔ represents the faulty function of the
pattern in the joint VHLDD in Figure 13, we get
component because of the defect Δ. The solution
the same result by visiting only 5 decision nodes
Wy(Δ) of the Boolean differential equation
whereas the variables A and C need no simulation
at all. The addressing node i in the joint graph
points by the values on the output edges at the ∂fy *
= 1
vector components which should be assigned by ∂∆
the new value. These values are calculated by the
expressions in the terminal nodes. describes a condition which activates the defect Δ
to produce an error on the output y of the compo-
nent. The parametric modeling of a given defect Δ
FUNCTIONAL FAULT MODELING by the condition Wy(Δ) = 1 allows to use it either
IN DIGITAL SYSTEMS for defect-oriented fault simulation (to check
whether the condition Wy(Δ) = 1 is fulfilled), or
New failure mechanisms in today’s deep-sub- for defect-oriented test generation under the con-
micron electronic devices cannot be modeled by straint Wy(Δ) = 1 when a test pattern is searched
traditional stuck-at faults (SAF) which in case of for detecting the defect Δ. A similar approach is
DDs are directly associated with the nodes of the described in (Devadze & Ubar, 2010) for simula-
graphs. As the result, new advanced fault models tion of extended class of faults.
are continuously being developed to improve the If the components of the circuit represent
confidence of test quality measures and to increase standard library (complex) gates, the described
the accuracy of fault diagnosis. The types of faults analysis for finding conditions should be made
that can be observed in a real gate depend not only once for all library components, and the sets of
on the logic function of the gate, but also on its calculated conditions will be included into the
physical design. Good possibilities to combine library of components in the form of fault tables.
logical and physical level fault modeling provide The defect characterization may be computation-
pattern fault model (Blanton & Hayes, 2003) or ally expensive, but it is performed only once for
conditional fault models (Mahlstedt, Alt & Hol- each library cell. An example of the functional
lenbeck, 1995; Holst & Wunderlich, 2008). A fault table for the complex gate AND2,2/NOR2
similar pattern related fault modeling approach is presented in Table 5 (Ubar, Kuzmicz, Pleskacz
called functional fault model was proposed in & Raik, 2001). The defect lists WFy of library
(Ubar, 1980) for the module level fault diagnosis components embedded in the circuit can be ex-
in combinational circuits. tended by additional physical defect lists WSy for
Consider a parametric model of a component the interconnect structure in the neighboring of
(e.g. a complex gate) in a combinational circuit the component to take into account also different
with a correct function y = fy (x1,x2,...xn), and in- defects (bridging faults, crosstalks etc.) outside
cluding a Boolean fault variable Δ to represent an the components. For these defects additional
arbitrary physical defect (Δ = 0 when the defect characterization should be carried out by a similar
is missing, and Δ = 1 when the defect is present) way as for the library cells.
as a generic function The functional fault model Wy(Δ) allows to use
for test generation and fault simulation for any

427
Diagnostic Modeling of Digital Systems with Multi-Level Decision Diagrams

Table 5. Library defect table for a complex gate AND2,2/NOR2

i Fault di Erroneous function fdi Input patterns tj


0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
1 B/C not((B*C)*(A+D)) 1 1 1 1
2 B/D not((B*D)*(A+C)) 1 1 1 1
3 B/N9 B*(not(A)) 1 1 1 1 1 1 1
4 B/Q B*(not(C*D)) 1 1 1 1 1 1 1 1 1
5 B/VDD not(A+(C*D)) 1 1 1
6 B/VSS not (C*D) 1 1 1
7 A/C not((A*C)*(B+D)) 1 1 1 1
8 A/D not((A*D)*(B+C)) 1 1 1 1
9 A/N9 A*(not(B)) 1 1 1 1 1 1 1
10 A/Q A*(not(C*D)) 1 1 1 1 1 1 1 1 1
11 A/VDD not(B+(C*D)) 1 1 1
12 C/N9 not(A+B+D)+(C*(not((A*B)+D))) 1 1 1 1 1
13 C/Q C*(not(A*B)) 1 1 1 1 1 1 1 1 1
14 C/VSS not(A*B) 1 1 1
15 D/N9 not(A+B+C)+(D*(not((A*B)+C))) 1 1 1 1 1
16 D/Q D*(not(A*B)) 1 1 1 1 1 1 1 1 1
17 N9/Q not((A*B)+(B*C*D)+(A*C*D)) 1
18 N9/VDD not((C*D)+(A*B*D)+(A*B*C)) 1
19 Q/VDD SA1 at Q 1 1 1 1 1 1 1 1
20 Q/VSS SA0 at Q 1 1 1 1 1 1 1 1 1

physical defect Δ traditional stuck-at fault test (Raik, Ubar, Sudbrock, Kuzmicz & Pleskacz,
generators or fault simulators developed for the 2005). Column 2 shows the total number of de-
logic level. Denote by yW(Δ) the value of a node y fects simulated in the circuits as described in this
determined by the functional fault condition Wy(Δ) Section. Column 3 reflects the number of gate
= 1. To generate a test for a defect Δ, a test pattern redundant defects (GRD). These are defects that
should be generated for the stuck-at fault y ≡ (yW(Δ) cannot be covered by any input pattern of the gate.
⊕1) under the additional constraint Wy(Δ) = 1. For example, AND-short between the two inputs
Otherwise, if a test pattern detects a SAF fault at of the AND gate is a GRD. In column 4, circuit
the node y, then all the defects Δ are detected as redundant defects (CRD) are counted. CRDs
well if the test pattern does not contradict with are defects that cannot be covered by any input
Wy(Δ) = 1.The described approach has been used pattern of the circuit. Next, three fault coverages
for defect oriented test generation (Raik, Ubar, to characterize the capability of SAF model for
Sudbrock, Kuzmicz & Pleskacz, 2005) and fault testing physical defects are provided. The defect
simulation (Ubar, Devadze, Raik & Jutman, 2010) coverages are measured for the test sets which
based on the SSBDD model. cover 100% stuck-at faults. The first coverage
Table 7 gives the results of test generation (column 5) shows the ratio of detected defects
for ISCAS’85 circuits using the deterministic versus the total set of defects. In column 6, gate
defect-oriented test generator DOT generation redundancy has been excluded (i.e. all GRDs are

428
Diagnostic Modeling of Digital Systems with Multi-Level Decision Diagrams

Table 7. Results of defect oriented test generation

Redundant defects Defect coverage


Object Defects Gate level System level SAF test with different redundancies
New method
GL SL With GL&SL With SL Without
c432 1519 226 0 78.6 99.05 99.05 100
c880 3380 499 5 75.0 99.50 99.66 100
c2670 6090 703 61 79.1 97.97 99.44 100
c3540 7660 985 74 80.1 98.52 99.76 99.97
c5315 14794 1546 260 82.4 97.53 100 100
c6288 24433 4005 41 77.0 99.81 100 100

excluded from the list of possible defects). The eration: they are generated at lower level and used
third coverage measure (column 7) shows the in a test plan which is generated at higher level.
test efficiency (or test coverage). In this column,
both, gate redundancy and circuit redundancy
have been excluded. The last column shows the CONCLUSION
defect coverage achieved by the defect oriented
test generator DOT. In this chapter a unified method for diagnostic
When using HLDDs for test generation pur- modeling of digital systems at lower logic and
poses, the idea of functional fault model was used higher functional levels was described. The
in generating tests for terminal nodes in DDs. method is based on using decision diagrams
Remind that the main concept of the fault model where the well known BDDs can be regarded as
used in HLDDs was to test exhaustively each a special case of (high level) DDs. A special class
node. For non-terminal nodes which model the of BDDs called structurally synthesized BDDs
control variables such a concept is meaningful was introduced and analyzed. SSBDDs allow to
because of the low number of possible values for represent the internal structure of the circuit in
these variables. The situation is different with the model which is important regarding the fault
terminal nodes which model the units of data modeling. It was also shown that a byproduct
paths. In this case, hierarchical approach is advis- of the SSBDD synthesis is fault collapsing. A
able. Assume the terminal node mT in a graph Gy method for diagnostic modeling of complex digital
is labeled by a multiplication expression A * B. systems with high level DDs was described and
Test pattern for activation of the working mode compared with using BDDs. We described three
y = A * B is generated at the higher level using methods for synthesis of HLDDs for representing
the HLDD Gy. The set of local test patterns {At, digital systems at higher behavior, functional or
Bt} to be applied to the inputs A and B of the register-transfer levels. The first method is based
multiplier are generated at the lower gate-level on symbolic execution of procedural descriptions,
(possibly, even in the defect oriented way). The which corresponds to functional representation
set of patterns {At, Bt} can be regarded as a set of of systems at the behavioral level. An approach
constraints (as the functional fault model) when called cycle-based symbolic execution was dis-
testing the terminal node mT. The set of local test cussed as a tool for formalized HLDD synthesis,
patterns {At, Bt} can be regarded also as an inter- which prevents the explosion of the number
face between two levels in hierarchical test gen- of paths to be executed. The second method is

429
Diagnostic Modeling of Digital Systems with Multi-Level Decision Diagrams

based on iterative superposition of HLDDs, and Baranov, S. (2008). Logic and System Design of
the created model corresponds to the high-level Digital Systems. Tallinn, Estonia: Tallinn Univer-
structural representation of the system. The sec- sity of Technology Press.
ond method can be regarded as a generalization
Bern, J., Meinel, C., & Slobodova, A. (1995).
of the superposition of BDDs developed for the
Efficient OBDD-based manipulation in CAD
creation of SSBDDs. Surprisingly, the procedure
beyond current limits. In 32nd Conference on
for superposition of HLDDs corresponds exactly
Design Automation, (pp. 408-413).
(!) to the superposition procedure of SSBDDs. The
third method aims at creation of vector HLDDs Blanton, R. D., & Hayes, J. P. (2003). On the Prop-
(VHLDD) on the basis of using shared HLDDs erties of the Input Pattern Fault Model. ACM Trans-
for compact representing of a given set of high actions on Design Automation of Electronic Sys-
level functions. A functional fault model known tems, 8(1), 108–124. doi:10.1145/606603.606609
also as pattern fault model or conditional fault
Bryant, R. E. (1986). Graph-based algorithms
model was described and connected to the concept
for Boolean function manipulation. IEEE
of diagnostic modeling of digital systems with
Transactions on Computers, C-35(8), 667–690.
DDs. It was shown that this fault model serves as
doi:10.1109/TC.1986.1676819
a natural interface between two levels in hierarchi-
cal or multi-level diagnostic modeling. Bryant, R. E., & Chen, Y.-A. (1995). Verifica-
tion of arithmetic functions with binary moment
diagrams. In Proc. 32nd ACM/IEEE DAC.
REFERENCES
Clarke, E. M., Fujita, M., & Zhao, X. (1996).
Akers, S. B. (1978, Oct). Functional Testing with Multi-terminal binary decision diagrams and
Binary Decision Diagrams. J. of Design Automa- hybrid decision diagrams. In Sasao, T., & Fujita,
tion and Fault-Tolerant Computing, 2, 311–331. M. (Eds.), Representations of Discrete Functions
(pp. 93–108). Amsterdam: Kluwer Academic
Al-Yamani, A., & McCluskey, E. J. (2004). Test Publishers.
quality for high level structural test. In Proc. of
9th IEEE High-Level Design Validation and Test Drechsler, R., & Becker, B. (1998). Binary Deci-
Workshop, (pp.109-114). sion Diagrams. Amsterdam: Kluwer Academic
Publishers.
Alizadeh, B., & Fujita, M. (2010). Guided Gate-
level ATPG for Sequential Circuits using a High- Drechsler, R., Theobald, M., & Becker, B. (1994).
level Test Generation Approach (pp. 425–430). Fast FDD based minimization of generalized
ASP-DAC. Reed-Muller forms. In European Design Automa-
tion Conf., (pp. 2-7).
Astola, J. T., & Stanković, R. S. (2006). Funda-
mentals of Switching Theory and Logic Design. Fummi, F., & Sciuto, D. (2000). A hierarchical test
Berlin: Springer. generation approach for large controllers. IEEE
Transactions on Computers, 49(4), 289–302.
Bahar, R. I., Frohm, E. A., Gaona, C. M., Hachtel, doi:10.1109/12.844343
G. D., Macii, E., Pardo, A., & Somenzi, F. (1993).
Algebraic decision diagrams and their applica- Holst, S., & Wunderlich, H. J. (2008). Adaptive
tions. In Int. Conf. on CAD, (pp. 188-191). Debug and Diagnosis Without Fault Dictionaries.
In Proc. 13th European Test Symposium, (pp.
199-204).

430
Diagnostic Modeling of Digital Systems with Multi-Level Decision Diagrams

Huisman, L. M. (1993). Fault Coverage and Lee, J., & Patel, J. H. (1997). Hierarchical test
Yield Predictions: Do We Need More than 100% generation under architectural level functional
Coverage? In Proc. of European Test Conference, constraints. IEEE Trans. Computer Aided Design,
(pp. 180-187). 15, 1144–1151.
Ichihara, H., Okamoto, N., Inoue, T., Hosokawa, Mahlstedt, U., Alt, J., & Hollenbeck, I. (1995).
T., & Fujiwara, H. (2005). An Effective Design Deterministic Test Generation for Non-Classical
for Hierarchical Test Generation Based on Strong Faults on the Gate Level. In 4th Asian Test Sym-
Testability. In Proc. of Asian Test Symposium – posium, (pp. 244-251).
ATS’2005, (pp. 288-293).
Makris, Y., Collins, J., Orailoglu, A., & Vishakan-
Jenihhin, M., Raik, J., Chepurov, A., & Ubar, R. taiah, P. (2000, May). Transparency-Based Hierar-
(2009). PSL Assertion Checking Using Tempo- chical Test Generation for Modular RTL Designs.
rally Extended High-Level Decision Diagrams. In Proc. of ISCAS’2000, (pp. II-689 - II-692).
Journal of Electronic Testing: Theory and Ap-
Minato, S. (1995). Zero-suppressed BDDs for set
plications – JETTA, 25(6), 1-12.
manipulation in combinational problems. In Proc.
Jutman, A., Raik, J., & Ubar, R. (2002, May). 30th ACM/IEEE DAC, (pp. 272-277).
On Efficient Logic-Level Simulation of Digital
Minato, S. (1996). BDDs and Applications for
Circuits Represented by the SSBDD Model. In
VLSI CAD. Amsterdam: Kluwer Academic Pub-
23rd Int. Conf. on Microelectronics, (Vol. 2, pp.
lishers.
621-624).
Minato, S, Ishiura, N., & Yajima. (1990). Shared
Jutman, A., Ubar, R., & Peng, Z. (2001). Algo-
binary decision diagrams with attributed edges for
rithms for Speeding-Up Timing Simulation of
efficient Boolean function manipulation. In Proc.
Digital Circuits. In IEEE Proc. of Design Auto-
27th IEEE/ACM ICCAD’90, (pp.52-57).
mation and Test in Europe (pp. 460–465). DATE.
Mirkhani, S., Lavasani, M., & Navabi, Z. (2002).
Karpovsky, M. G., Stanković, R. S., & Astola,
Hierarchical Fault Simulation Using Behavioral
J. T. (2008). Spectral Logic and Its Applications
and Gate Level Hardware Models. In Proc. of
for the Design of Digital Devices. New York:
Asian Test Symposium – ATS’2002, (pp.374-379).
Wiley-Interscience. doi:10.1002/9780470289228
Misera, S., & Urban, R. (2010). Fault simulation
Kebschull, U., Schubert, E. & Rosenstiel, W.
and fault injection technology based on SystemC.
(1992). Multilevel logic synthesis based on func-
In Ubar, R., Raik, J., & Vierhaus, H. T. (Eds.),
tional decision diagrams. IEEE EDAC’92.
Design and Test Technology for Dependable
Kundu, S. (2004). Pitfalls of Hierarchical Systems-on-Chip. Hershey, PA: IGI Global.
Fault Simulation. IEEE Trans. on CAD of IC
Misera, S., Vierhaus, H.-T., Breitenfeld, L., &
and Systems, 23(2), 312–314. doi:10.1109/
Sieber, A. (2006). A Mixed Language Fault
TCAD.2003.822099
Simulation of VHDL and SystemC. In Proc. of
Lee, C. Y. (1959, July). Representation of Switch- 9th EUROMICRO Conf on Digital System Design
ing Circuits by Binary Decision Programs. The (DSD’06).
Bell System Technical Journal, 985–999.

431
Diagnostic Modeling of Digital Systems with Multi-Level Decision Diagrams

Niermann, T., & Patel, J. H. (1991). HITEC: A Stanković, R. S., Astola, J., Stanković, M., &
test generation package for sequential circuits. Egiazarjan, K. (2002). Circuit synthesis from
In Proc. European Conf. Design Automation Fibonacci decision diagrams. VLSI Design. Spe-
(EDAC), (pp.214-218). cial Issue on Spectral Techniques and Decision
Diagrams, 14, 23–34.
Plakk, M., & Ubar, R. (1980). Digital Circuit Test
Design using the Alternative Graph Model. Auto- To, K. (1973). Fault Folding for Irredundant and
mation and Remote Control, 41(5, Part 2), 714- Redundant Combinational Circuits. IEEE Trans-
722. New York: Plenum Publishing Corporation. actions on Computers, C-22(11), 1008–1015.
doi:10.1109/T-C.1973.223637
Raik, J., & Ubar, R. (1998). Feasibility of Structur-
ally Synthesized BDD Models for Test Generation. Ubar, R. (1976). Test Generation for Digital Cir-
In Proc. of the IEEE European Test Workshop, cuits with Alternative Graphs. In [in Russian)].
(pp.145-146). Proceedings of Tallinn Technical University,
409, 75–81.
Raik, J., & Ubar, R. (2000). Fast Test Pattern
Generation for Sequential Circuits Using Decision Ubar, R. (1980). Detection of Suspected Faults
Diagram Representations. Journal of Electronic in Combinational Circuits by Solving Boolean
Testing: Theory and Applications – JETTA, 16(3), Differential Equations. Automation and Remote
213–226. Control, 40(11, Part 2), 1693–1703.
Raik, J., Ubar, R., Sudbrock, J., Kuzmicz, W., & Ubar, R. (1996). Test Synthesis with Alternative
Pleskacz, W. (2005). DOT: New Deterministic Graphs. IEEE Design & Test of Computers,
Defect-Oriented ATPG Tool. In Proc. of 10th (Spring): 48–57. doi:10.1109/54.485782
IEEE European Test Symposium, (pp.96-101).
Ubar, R. (1998). Multi-Valued Simulation of Digi-
Ravi, S., & Jha, N. K. (2001). Fast test generation tal Circuits with Structurally Synthesized Binary
with RTL and gate-level views. In International Decision Diagrams. In Multiple Valued Logic,
Test Conference, (pp.1068-1077). (Vol.4, pp.141-157). New York: OPA (Overseas
Publ. Ass.) N.V. Gordon and Breach Publishers
Rudnick, E. M., Patel, J. H., Greenstein, G. S.,
& Niermann, T. M. (1994). Sequential circuit test Ubar, R., & Borrione, D. (1999). Single Gate De-
generation in a genetic algorithm framework. In sign Error Diagnosis in Combinational Circuits.
Proc. Design Automation Conference, (pp. 698- Proceedings of the Estonian Acad. of Sci. Engng,
704). 5(1), 3-21.
Sasao, T. (1999). Switching Theory for Logic Syn- Ubar, R., Devadze, S., Raik, J., & Jutman, A.
thesis. Amsterdam: Kluwer Academic Publishers. (2007). Ultra Fast Parallel Fault Analysis on
Structural BDDs. IEEE Proc. of 12th European
Sasao, T., & Fujita, M. (Eds.). (1996). Representa-
Test Symposium – ETS’2007, (pp.131-136).
tions of Discrete Functions. Amsterdam: Kluwer
Academic Publishers. Ubar, R., Devadze, S., Raik, J., & Jutman, A.
(2008). Fast Fault Simulation in Digital Circuits
Srinivasan, A., Kam, T., Malik, Sh., & Bryant.
with Scan Path. In IEEE Proc. of 13th Asia and
(1990). Algorithms for discrete function manipu-
South Pacific Design Automation Conference –
lation. In Proc. of Informations Conference on
ASP-DAC’2008, (pp. 667-672).
CAD, ICCAD-90, (pp.92-95).

432
Diagnostic Modeling of Digital Systems with Multi-Level Decision Diagrams

Ubar, R., Devadze, S., Raik, J., & Jutman, A. KEY TERMS AND DEFINITIONS
(2010a). Parallel X-Fault Simulation with Critical
Path Tracing Technique. IEEE Proc. of Design Binary decision diagrams (BDD): A directed
Automation and Test in Europe - DATE. acyclic graph based data structure that is used to
represent a Boolean function.
Ubar, R., Mironov, D., Raik, J., & Jutman, A. Fan-out free region (FFR): A subcircuit in
(2010b). Structural Fault Collapsing by Super- the given network of logic gates, which does not
position of BDDs for Test Generation in Digital include reconverging in this region fan-out stems.
Circuits. In 11th Int. Symp. on Quality Electronic Functional fault model: A model which rep-
Design – ISQED’2010, (pp. 1-8). resents any arbitrary change in the logic function
Ubar, R., Moraviec, A., & Raik, J. (1999). Cycle- of a subcircuit or a circuit component because of
based Simulation with Decision Diagrams. In the fault, by determining (1) a signal line where
IEEE Proc. of Design Automation and Test in the erroneous value will appear, and (2) a logic
Europe (pp. 454–458). DATE. condition which is needed for activating the fault.
High-level DDs (HLDD): A general case of de-
Ubar, R., Raik, J., Ivask, E., & Brik, M. (2002). cision diagrams which represent digital systems at
Multi-Level Fault Simulation of Digital Systems higher than Boolean level such as register transfer,
on Decision Diagrams. In IEEE Workshop on Elec- behaviour or transaction levels using word level
tronic Design, Test and Applications – DELTA’02, variables and expressions as node labels in DDs.
(pp. 86-91). Shannon expansion: A method by which a
Vedula, V. M., & Abraham, J. A. (2002). Program Boolean function can be represented by the sum
Slicing for Hierarchical Test Generation. In Proc. of two sub-functions of the original function.
of VTS, (pp. 237-243). Structurally synthesized BDDs (SSBDD):
A BDD which represents both the function and
Yi, J., & Hayes, J. (2006). High-Level Delay the structure of a single FFR of a digital circuit
Test Generation for Modular Circuits. IEEE through one-to-one mapping between the nodes
Transactions on Computer-Aided Design of In- in the BDD and the signal paths in the circuit.
tegrated Circuits and Systems, 25(3), 576–590. Structurally synthesized multi-input BDDs
doi:10.1109/TCAD.2005.853697 (SSMIBDD): A general case of SSBDD which
Zhang, J. (2004). Symbolic Execution of Program represents two or more FFRs in a single BDD
Paths Involving Pointer and Structure variables. with separate root nodes for each FFR.
In 4th Int. Conf. on Quality Software – QSIC’04, Superposition of BDDs: A method to construct
(pp. 1-6). a composite BDD from two given simpler BDDs.
Vector high-level DDs (VHLDD): A com-
posite HLDD constructed on the basis of sharing
similar sub-HLDDs to represent concurrently and
in a compact way the high-level functions of more
than two modules of a digital system.

This work was previously published in gn and Test Technology for Dependable Systems-on-Chip, edited by Raimund Ubar, Jaan
Raik and Heinrich Theodor Vierhaus, pp. 92-118, copyright 2011 by Information Science Reference (an imprint of IGI Global).

433
434

Chapter 26
Semantic Annotation of
Geospatial RESTful Services
Using External Resources
Victor Saquicela
Universidad Politécnica de Madrid, Spain

Luis. M. Vilches-Blázquez
Universidad Politécnica de Madrid, Spain

Oscar Corcho
Universidad Politécnica de Madrid, Spain

ABSTRACT
RESTful services are increasingly gaining traction over Web Services (WS-*). As with WS-* services,
their semantic annotation can provide benefits in tasks related to their discovery, composition, and media-
tion. In this chapter, the authors present an approach to automate the semantic annotation of geospatial
RESTful services using a cross-domain ontology like DBpedia, domain ontologies like GeoNames, and
additional external resources (suggestion and synonym services). They use combinations of these re-
sources to discover meanings for each of the parameters of the geospatial RESTful services and perform
semantic annotations of them.

INTRODUCTION Machine-oriented Web applications and APIs that


are conformant to the REST architectural style
In recent years, since the advent of Web 2.0 ap- (Fielding, 2000), normally referred to as REST-
plications and given some of the limitations of ful Web services, have started appearing mainly
“classical” Web services (WS-*) based on SOAP, due to their relative simplicity and their natural
WSDL, and Open Geospatial Consortium (OGC) suitability for the Web. Nowadays the largest
services, Representational State Transfer (REST) online repository of information about Web 2.0
services have become an increasing phenomenon. mashups and APIs is ProgrammableWeb.com.
This aggregator site provides information on 5,465
DOI: 10.4018/978-1-4666-2038-4.ch026 mashups and 2,647 APIs that were registered be-

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Semantic Annotation of Geospatial RESTful Services Using External Resources

tween September 2005 and December 2010, as of services (Maleshkova, et al., 2009b; Alowisheq,
the time of the writing of this chapter. Mashups et al., 2009) still require a large amount of human
tagged as “mapping” represent a 34% mashups intervention. First, humans have to understand the
(2,411 mashups) of the listed ones, what represents informal descriptions provided in the RESTful
the importance of geospatial information in the service description pages, and then the semantic
generation of these applications. With respect to annotation of RESTful services has to be auto-
APIs, GoogleMaps is the most used with a 42%, mated as much as possible.
that is, this API is used on 2,149 mashups. These In this chapter, we address these two main
data show the importance of geospatial informa- challenges by: (1) providing syntactic descrip-
tion in the context of the REST world. tions of geospatial RESTful services that allow
However, using geospatial RESTful services their automatic registration and invocation, and
still requires much human intervention since the (2) interpreting and enriching the geospatial
majority of their description pages are given in the RESTful services’ parameters, by means of their
form of unstructured text in a Web page (HTML), semantic annotation.
which contains a list of the available operations, The main contribution of our work is an ap-
their Uniform Resource Identifiers (URIs) and proach to perform an automatic lightweight se-
parameters (also called attributes), expected mantic annotation process of RESTful services.
output, error messages, and a set of examples This process is implemented in a system, which
of their execution. This hampers the discovery, takes into account a cross-domain and a geospatial
composition and mediation between services, ontology (DBpedia ontology combined with Geo-
which may be required in the development of Names ontology), its SPARQL Endpoint, as well
applications, and which are also important tasks as different external resources such as synonyms
in WS-* approaches. and suggestion services. We use combinations of
An approach in order to solve this difficulty these resources to discover meanings for each of
is to think of semantic annotation activities of the parameter of the RESTful services and perform
these services. semantic annotations of them. These semantic an-
Besides, it can also play an important role in notations might provide benefits in tasks related
the Semantic Web by providing data to semantic to discovery, composition, and mediation of these
software agents, as can be seen in (Ferreira, et services. Furthermore, semantic annotations of
al.,2009; Alarcon, et al., 2010). RESTful services might help to developers and
Semantic annotation refers to the task of at- users in the creation of mashups.
taching meaningful descriptions to the service and The remainder of this paper is structured as fol-
the served geospatial data or processes (Maué, lows: Next, we present background in the context
et al., 2009). Traditionally, semantic annotation of semantic annotation of WS-* and RESTful ser-
approaches for services have focused on defining vices (from a general and geographical viewpoint).
formalisms to describe services, and have been Later, we introduce our approach for automating
normally applied to WS-* service description the annotation of RESTful services, including
formalisms and middleware. More recently, these explanations on how we derive their syntactic
(usually heavyweight) approaches have started description and semantic annotation. Finally,
to be adapted in a more lightweight manner for we present some conclusions of this chapter and
the semantic description of RESTful services identify future lines of work.
(Maleshkova, et al., 2009a; Kopecký, et al., 2008;
Lathem, et al., 2007). However, most of the
processes related to the annotation of RESTful

435
Semantic Annotation of Geospatial RESTful Services Using External Resources

BACKGROUND AND SPARQL Query Language for RDF


RELATED WORK
SPARQL Query Language for RDF (SPARQL)
In this section, we describe briefly main topics (Prud’hommeaux, et al., 2008) is a query language
related to our work such as: RDF, SPARQL and for RDF. This language can be used to express
REST. Likewise, we present existing approaches queries across diverse data sources, whether the
associated with semantic annotations and their data is stored natively as RDF or viewed as RDF
relationships with our proposal. via middleware. SPARQL contains capabilities
for querying required and optional graph patterns
Resource Description along with their conjunctions and disjunctions.
Framework (RDF) SPARQL also supports extensible value testing
and constraining queries by source RDF graph.
The Resource Description Framework (RDF) More details about SPARQL can be found in the
(Klyne, et al., 2004) is a metadata model and W3C Specification2.
language recommended by the W3C for building A SPARQL endpoint is a conformant SPARQL
an infrastructure of machine-readable semantics protocol service as defined in the SPROT (stands
for the data on the Web, a long-term vision known for SPARQL Protocol for RDF) specification.
as Semantic Web. The motivation behind the A SPARQL endpoint enables users (human or
development of RDF by the W3C was, as Tim other) to query a knowledge base via the SPARQL
Berners-Lee pointed out for the Semantic Web, language. Results are typically returned in one or
to have a common and minimal language to en- more machine-processable formats. Therefore,
able to map large quantities of existing data onto a SPARQL endpoint is mostly conceived as a
it so that the data can be analyzed in ways never machine-friendly interface towards a knowledge
dreamed of by its creators (Berners-Lee, 1998). base. Both the formulation of the queries and the
In the RDF model (Gutierrez, et al., 2005), human-readable presentation of the results should
the universe to be modeled is a set of resources, typically be implemented by the calling software,
essentially anything that can have a universal re- and not be done manually by human users.
source identifier, URI. The language to describe
them is a set of properties, technically binary Representation State
predicates. Descriptions are statements very much Transfer (REST)
in the subject-predicate-object structure. In ad-
dition, the RDF specification includes a built-in Representation State Transfer (REST) is a style
vocabulary with a normative semantics (RDFS). of software architecture for distributed systems
This vocabulary deals with inheritance of classes such as the World Wide Web. This term was in-
and properties, as well as typing, among other troduced by Roy Thomas Fielding in his doctor
features allowing the descriptions of concepts dissertation “Architectural Styles and the Design
and relationships that can exist for a community of Network-based Software Architecture.” REST
of people and software agents, enabling knowl- emphasizes scalability of component interactions,
edge sharing and reuse. That is, using this simple generality of interfaces, independent deployment
model, it allows structured and semi-structured of components, and intermediary components to
data to be mixed, exposed, and shared across dif- reduce interaction latency, enforce security, and
ferent applications. More details about RDF can encapsulate legacy systems (Fielding, 2000).
be found in this website1. A RESTful web service is based on the fol-
lowing principles (Fielding, 2000):

436
Semantic Annotation of Geospatial RESTful Services Using External Resources

• Resources and representations: A re- Related Work


source is a single piece of web applica-
tion that you make accessible to clients. Most research in the semantic annotation of
Because the resources you have are not RESTful services has focused on the definition of
transportable over the network, “providing formal description languages for creating semantic
them” means providing representations of annotations. The main proposed formalisms for
their state. describing these services are: the Web Application
• Addressability and connectedness: Description Language (WADL) which describes
Resources have their representations. In RESTful services syntactically, MicroWSMO
REST, every resource must have at least (Maleshkova, et al., 2009c) which uses hREST
one address, that is, one URI. To address (HTML for RESTful services) (Maleshkova, et
the resources, you simply specify the URI. al., 2009c; Kopecký, et al., 2008), and SA-REST
This concept is called “addressability.” (Maleshkova, et al., 2009b; Lathem, et al., 2007)
• Uniform interface: You need a communi- which uses SAWSDL (Maleshkova, et al., 2009a)
cation protocol/interface to use. In a REST and RDFa to describe service properties.
architecture, such an interface must be uni- In Battle et al. (2008), the authors introduced
form which means that whatever URI you an approach to annotate WADL documents linking
acceded, the interface should be the same. them to ontologies. Among these approaches, some
• Statelessness: Statelessness means that authors propose rather heavyweight approaches
a web application is not responsible for for semantic description, which are normally
keeping any information about the state derived from Web Service (WS-*) semantic de-
of its clients. The client is responsible for scription frameworks like WSMO or OWL-S.
tracking its own actions. The service main- An example is proposed in (Ferreira et al., 2009),
tains its resources and provides a uniform which makes use of a specific selection of existing
interface to the clients. languages and protocols, reinforcing its feasibility.
Firstly, OWL-S is used as the base ontology for
The following semantics are defined (Field- services, whereas WADL is used for syntactically
ing, 2000): describing them. Secondly, the HTTP protocol is
used for transferring messages, defining the action
• HTTP GET is used for obtaining a rep- to be executed, and also defining the executing
resentation of a resource. A consumer scope. Finally, URI identifiers are responsible
uses it to retrieve a representation from a for specifying the service interface. Neverthe-
URI. Services provided through this inter- less, these languages are strongly influenced by
face must not incur any obligation from existing traditional Web Services.
consumers. Other approaches are more lightweight, for in-
• HTTP DELETE is used for removing rep- stance, the proposals of Maleshkova et al. (2009a,
resentations of a resource. 2009b). The authors advocate an integrated light-
• HTTP POST is used for updating or creat- weight approach for formally describing semantic
ing the representations of a resource. RESTful services. This approach is based on use
• HTTP PUT is used for creating representa- of the hREST and MicroWSMO microformats,
tions of a resource. which enable the creation of machine-readable
service descriptions and the addition of semantic
annotations. Furthermore, the authors present
SWEET, a tool that effectively supports users in

437
Semantic Annotation of Geospatial RESTful Services Using External Resources

creating semantic descriptions of RESTful ser- only considered RESTful services without Open
vices based on the aforementioned technologies. Geospatial Consortium (OGC) standards.
Our work can be considered as an extension In the same way, ESRI (2010) has developed the
of the work presented in Lerman et al. (2007) and GeoServices REST Specification that provides a
Ambite et al. (2009), in which the development standard way for Web clients to communicate with
of domain-independent approaches to semanti- Geographic Information System (GIS) servers
cally label Web services is described. The authors through Representational State Transfer (REST)
propose to automatically learn the semantics of technology. Clients issue requests to the server
information sources labelling the input and output through structured URLs. The server responds with
parameters used by the source with semantic types map images, text-based geographic information,
of the user’s domain model. In our approach we or other resources that satisfy the request.
have dealt with RESTful services and have used
semantic repositories to try to semantically label
these services. AN APPROACH FOR AUTOMATIC
Another approach is presented in Heß et al. SEMANTIC ANNOTATION OF
(2003). This approach classifies data type us- GEOSPATIAL RESTFUL SERVICES
ing HTML treated Web form files as the Web
service’s parameters. They use Naïve Bayes to In this section, we present our approach visual-
classify assigned semantic types to the input and ized in Figure 1 for automating the syntactic
output parameters. However, this approach does and semantic annotation of geospatial RESTful
not consider values (instances) associated with services. Our prototypical system consists of
different parameters in the annotation process of three main components, including invocation
a web service. and registration, repository, and semantic annota-
Likewise, our work is also similar to the ap- tion components, which are enriched by diverse
proaches related to schema matching or integration external resources. Next, we briefly describe the
(Rahm, et al., 2001; Doan, et al., 2003). In these different components, illustrating the descriptions
proposals, the main goal is to establish semantic with some sample services.
mappings between two different schemas. In our
approach, the developed system sets matchings A Sample Set of RESTful Services in
between different parameters of a RESTful service the Geospatial Domain
and the DBpedia ontology.
Regarding the geospatial approaches, in Gang In this section, we present two different REST-
(2009) the authors illustrate using REST semantic ful services related to the geospatial domain to
methods to construct geospatial web services. They guide the explanation of the proposed semantic
present two methods of manual mapping geospa- annotation process. These services are taken from
tial resource to semantic world according to the programmableweb.com:
elements of REST. Moreover, in Sousa (2009), the
authors propose an approach based on Semantic • Service 1. http://ws.geonames.org/
annotations that use geographic metadata and on- countryInfo?country=ES
tologies to describe heterogeneous geospatial data.
It uses RDF/XML files and Federal Geographic This service retrieves information related to a
Data Committee (FGDC) metadata schema to link ‘country.’ More specifically, it returns informa-
ontology terms and metadata. In our approach, we tion about the following parameters: ‘capital,’
‘population,’ ‘area’ (km2), and ‘bounding box

438
Semantic Annotation of Geospatial RESTful Services Using External Resources

Figure 1. RESTful service semantic annotation system

of mainland’ (excluding offshore islands). In the manually a URL of an available RESTful service.
specified URL, we retrieve information about In this case, we add manually different URLs of
Spain. services and obtain automatically information
related to each of the aforementioned RESTful
• Service 2. http://nominatim.openstreetmap. service. Once URLs have been added, our system
org/reverse?format=xml&lat=40.4167&l invokes the RESTful service with a sample of
on=-3.7003& zoom=18&addressdetails=1 parameters and analyzes the response to obtain
a basic syntactic description of a parameter set,
This service retrieves information about which is used like inputs and outputs.
places. More specifically, it returns parameters In this process, our system uses the Service
like: ‘latitude,’ ‘longitude,’ ‘road,’ etc. In the Data Object (SDO) API to perform the invocation
specified URL, we retrieve information about a of the RESTful service and determine whether it
theatre (Teatro Reina Victoria, Madrid, Spain). is available or not. SDO is a specification for a
In these examples, each service has a URL programming model that unifies data program-
with different input parameters. In the case of the ming across data source types and provides robust
‘Service 1,’ input parameter is ‘Country’ and its support for common application patterns in a
associated value is ‘ES.’ The inputs parameters of disconnected way (Resende, 2007). The invoca-
‘Service 2’ are constituted by ‘format,’ ‘lat,’ ‘lon,’ tion process is performed as follows: first, it takes
‘zoom’ and ‘addressdetails.’ These parameters are the input parameters and their values, which are
used in our approach to automate the semantic given to the service as part of a URL. Then, the
annotation of geospatial RESTful services. system invokes the service, which translates our
“RESTful service call” into a query to the specific
Syntactic Description: service, including the URL and related parameters.
Invocation and Registration The service invocation of a specific REST-
ful service may return diverse formats, such as
Our system takes as input Web applications and HTML, JSON, XML, etc. In our work, we use
APIs, which are known by users, or users can add only the XML response for describing the service.

439
Semantic Annotation of Geospatial RESTful Services Using External Resources

Figure 2. XML response of two sample RESTful services

The results of invoking both services are shown our approach only takes into account parameters
in Figure 2. with an associated value. Therefore, parameters
These XML responses are processed using as ‘addresparts’ (see Figure 3) are not considered.
SDO, which enables to navigate through the XML The output parameters are registered and stored
and extract output parameters of each service. in a repository. This repository is a database spe-
The result of this invocation process is a syntac- cifically designed to store syntactic descriptions
tic definition of RESTful services in XML, which of RESTful services. We selected this storage to
can be expressed in description languages like increase efficiency in the recovery of the REST-
WADL or stored into a relational model. In this ful services.
work, we use a relational model as data model as Finally, both input and output parameters are
a consequence of the simplicity of WADL for considered as inputs in our semantic annotation
showing concepts. Figure 3shows the different process.
output parameters of each service. In this case,

Figure 3. Syntactic description of RESTful service

440
Semantic Annotation of Geospatial RESTful Services Using External Resources

Semantic Annotation parameters as show in Figure 4. Next, we describe


the main components of the semantic annotation.
Without proper descriptions, the use of such a
RESTful service is limited to a small user group. Ontological Resources
Therefore, they have to be annotated with descrip-
tive metadata to make it usable to a broad audience. A growing amount of geo-data, being produced
Otherwise people will neither be able to find the and made available using various representa-
resource using search engines nor will they be tion formats, need to be used in the context of
able to evaluate if the discovered resource satis- domain-specific use cases where the semantic
fies their current information need (Maué, 2009). accuracy of the data descriptions is of the essence.
Thus, in our approach once the RESTful service Such accuracy is achieved by using controlled
is syntactically described with all its identified vocabularies (ontologies) defined and used by
input and output parameters, we proceed into its authoritative entities in each of these domains. In
semantic annotation. We understand the semantic our approach, we use two widespread ontologies,
annotation as a reference which establishes a Link such as the DBpedia and GeoNames.
Annotation (Bechhofer, et al., 2002) between the DBpedia is a community effort to extract struc-
application specific metadata (a parameter of a tured information from Wikipedia and to make
RESTful service) and a shared external vocabulary this information available on the Web. DBpedia
(an ontology). We follow a heuristic approach allows you to ask sophisticated queries against
that combines a number of external services and Wikipedia, and to link other data sets on the Web to
semantic resources to propose annotations for the Wikipedia data. We hope this will make it easier for

Figure 4. Semantic annotation process

441
Semantic Annotation of Geospatial RESTful Services Using External Resources

the amazing amount of information in Wikipedia typically be implemented by the calling software,
to be used in new and interesting ways, and that and not be done manually by human users. Next,
it might inspire new mechanisms for navigating, we describe the process:
linking, and improving the encyclopaedia itself.
With respect to its ontology3, it is a shallow, • First, the system retrieves all the classes
cross-domain ontology, which has been manually from the DBpedia ontology whose names
created based on the most commonly used info- have an exact match with each parameter
boxes within Wikipedia. The ontology currently of the RESTful service. If the system ob-
covers over 259 classes, which form a subsumption tains correspondences from the matching
hierarchy and are described by 1,200 different process, it uses these DBpedia concepts
properties. Moreover, this ontology currently individually to retrieve samples (concept
contains about 1,478,000 instances (413,000 instances) from the DBpedia SPARQL
instances belong to “Place” class). Endpoint. Likewise, when a parameter
GeoNames is integrating geographical data matches an ontology class related to some
such as names of places in various languages, geospatial information; such as latitude,
elevation, population and others from various longitude, or bounding box, our system
sources. All lat/long coordinates are in WGS84 retrieves samples from the GeoNames
(World Geodetic System 1984). Users may manu- SPARQL Endpoint. The resulting informa-
ally edit, correct, and add new names using a user tion (RDF) is suggested automatically to
friendly wiki interface. Likewise, the GeoNames the system and registered as a possible val-
geographical database is available for download ue for the corresponding parameter. When
free of charge under a creative commons attribu- a parameter matches more than once in the
tion license. DBpedia ontology, our system only con-
Regarding its ontology4, it makes it possible to siders concepts that have information (in-
add geospatial semantic information to the World stances), and automatically discards those
Wide Web. All over 6.2 million GeoNames top- ontology concepts without instances.
onyms have a unique URL with a corresponding
RDF web service. In order to retrieve information about identi-
fied parameters of RESTful services the system
Using Semantic Sources in has registered the DBpedia and/or GeoNames
the Annotation Process SPARQL Endpoint as a service. This service
enables automatically invocation of SPARQL
At this stage, the list of syntactic parameters queries over DBpedia and GeoNames Endpoints.
obtained previously is used to query the DB- Next, we present queries used by the system for
pedia and GeoNames SPARQL Endpoints and retrieving DBpedia/GeoNames information.
retrieve associated results for each parameter. This SPARQL query (see Figure 5) enables
A SPARQL endpoint enables users (human or to retrieve classes of the DBpedia or GeoNames
other) to query a knowledge base via the SPARQL ontology. The results of this query are compared
language. Results are typically returned in one or to the concepts with each parameter of a service.
more machine-processable formats. Therefore,
a SPARQL endpoint is mostly conceived as a • Next, the system tries to find correspon-
machine-friendly interface towards a knowledge dences between parameters of the RESTful
base. Both the formulation of the queries and the service and DBpedia properties. If the sys-
human-readable presentation of the results should tem obtains some correspondences, it uses

442
Semantic Annotation of Geospatial RESTful Services Using External Resources

Figure 5. SPARQL query for retrieving classes of the ontologies (filter omitted)

these DBpedia properties individually to This SPARQL query (see Figure 7) enables to
retrieve information of the DBpedia or retrieve possible values for a certain class of the
GeoNames SPARQL Endpoint, as de- previous ontologies.
scribed above. Furthermore, this informa- This SPARQL query (see Figure 8) enables to
tion is registered as a possible correct value retrieve possible values for a certain property of
for the corresponding parameter. the aforementioned ontologies.

This SPARQL query (see Figure 6) enables to Enriching the Semantic Annotations
retrieve properties of the DBpedia or GeoNames
ontology. The system uses results to compare them Since we request exact matches with DBpedia
with each parameter identified in the syntactic (and GeoNames) classes and properties, our
description. system does not establish correspondences with
ontology classes or properties for all parameters of
• Finally, with the obtained classes and prop- the RESTful service, since there are many lexical
erties, the system calls the DBpedia and and syntactic variations that the parameter names
GeoNames SPARQL Endpoints to retrieve may have, and because in some cases the informa-
values (instances) for those classes and tion that is being requested may not be available
properties, so that now we have possible in any of the external sources that are consulted.
values for them. Next we show some query In order to annotate semantically the parameters
examples: that did not match any DBpedia resource, we use
additional external services to enrich the results.

Figure 6. SPARQL query for retrieving properties of the ontologies (filter omitted)

Figure 7. SPARQL query for retrieving possible values for a class

443
Semantic Annotation of Geospatial RESTful Services Using External Resources

Figure 8. SPARQL query for retrieving possible values for a property

Below we describe the main characteristics of the of ‘address’ such as extension, reference, men-
external services that are considered in the system. tion, citation, denotation, destination, source, cite,
Spelling Suggestion Services: Web search acknowledgment, and so on. These outputs are
engines (e.g. Google, Yahoo, and Microsoft) registered and stored into the repository, and then,
usually try to detect and solve users’ writing the service calls to the DBpedia (and GeoNames)
mistakes. Suggestions services, also called “Did SPARQL Endpoints for results.
You Mean,” are spelling algorithms, which aim
at solving these spelling mistakes. For example, Checking the Semantic
when a user writes ‘countryName’these algorithms Annotation of RESTful Services
suggest ‘country’ and ‘name’ separately.
In our system, we use the Yahoo Boss service In order to check the collected sample individuals
to retrieve suggestions about the parameters that and initial semantic annotations of the previous
we have obtained in the previous steps and for process, our system invocates the RESTful service
which we have not obtained any candidate in our that was already registered in the repository and
semantic resources. Thus, for each parameter that validates the input and output parameters.
the system did not find a correspondence with For the validation of the input parameters, our
classes or properties in DBpedia (nor GeoNames), system selects, for each parameter, a random subset
this service is invocated for obtaining a list of of the example instances (of classes and/or proper-
suggestions to query DBpedia (and GeoNames) ties) coming from the DBpedia (and GeoNames)
again. The output is registered and stored into ontology that we have obtained and registered
the repository. Following the previous example, before. Next, it makes several invocations of the
the parameter ‘countryName’ is not found in the RESTful service iterating over these registered
DBpedia ontology. Nevertheless, the added service values. The system does not check this with all the
allows separating this parameter in ‘country’ and possible combination of collected instances for all
‘name,’ and then it calls to the DBpedia SPARQL parameters for two reasons: first, because of the
Endpoint for obtaining results. combinatorial explosion that may be produced in
Use of Synonyms: This external service is such a case, and second because many RESTful
incorporated into the system to retrieve possible services have invocation limitations.
synonyms for a certain parameter. This service If the service returns results from the invoca-
tries to improve the semantic annotation process tion, then the service is deemed as executable,
when our system does not offer results for the and the corresponding annotations are marked as
previous steps, that is, when we still have param- valid. If a service cannot be invoked successfully,
eters in a RESTful service without any potential the service is classified as non-executable and is
annotations. automatically discarded from the list of services
As an example, we may have a parameter that can be automatically annotated.
called ‘address.’ The invocation process uses the For the validation of the output parameters,
synonyms service to retrieve a set of synonyms our system only takes into account executions

444
Semantic Annotation of Geospatial RESTful Services Using External Resources

with the correct inputs from the input sets that EXPERIMENTAL RESULTS
have been considered before. Next, the system
compares the outputs obtained after execution with In order to evaluate our approach we use 12 dif-
the information already stored in the repository ferent RESTful services founded in http://www.
due to the initial retrieval processes done before programmableweb.com/, which are characterized
with DBpedia (and GeoNames), and external to contain geospatial information. The list of
utility services. If the output can be matched, our RESTful services can be seen in this website5.
system considers the output annotation as valid. This analysis follows the three steps described in
Finally, the correspondences that have been the semantic annotation. First, our system identi-
established between the different parameters fies correctly 16 parameters calling directly the
of the RESTful service and the DBpedia (and DBpedia and GeoNames ontologies, but it fails to
GeoNames) ontology are registered and stored recognize 161 parameters. Second, the system uses
in the repository, so that they can be used later. the suggestion service and calls the used ontolo-
In such a way, the RESTful service is annotated gies. In this case, it identifies 41 correspondences,
semantically and it will allow generating semantic but it fails to recognize 120 parameters. Third, the
descriptions or annotations of any of the types that system uses the synonyms service and calls the
were identified in the background section (WADL, ontologies. It identifies 19 correspondences, but
hREST, SA-REST, MicroWSMO, etc.). Figure 9 fails to recognize 101. A detailed view of these
provides an abbreviated form of this description results is shown in Figure 10.
for our exemplar “Service 2.” In this description, We cannot guarantee the success of the system
inputs/outputs parameters are associated with in all the cases, because in some cases the system
classes and properties of the aforementioned on- has not found any correspondence between REST-
tologies (DBpedia or GeoNames). For instance, the ful service parameters and the concepts or prop-
“Country” parameter is related to www.dbpedia. erties of the DBpedia and GeoNames ontologies.
org/resource/Country. After having analysed our findings, we have
seen that some parameters are useless, because
they refer to a navigation process through RESTful

Figure 9. Semantic annotation of a RESTful service

445
Semantic Annotation of Geospatial RESTful Services Using External Resources

Figure 10. Results of the service test

service results, for example: page, total, hits, etc. To illustrate our work and guide the explana-
These parameters make it difficult to carry out the tions of the proposed semantic annotation process
right annotation semantic process. We are planning we have used two exemplary RESTful services
to discard these types of parameters in the future. related to the geospatial domain. Besides, we
To the best of our knowledge, there are no have presented some preliminary experimental
available results from existing research works results that prove the feasibility of our approach,
to compare our results against. Likewise, these at least in the geospatial domain, and show that
preliminary results prove the feasibility of our it is possible to carry out a semantic annotation
system and highlight that is possible to carry out of RESTful services automatically, again at least
an automatic semantic annotation of RESTful in this domain.
services. Future work will focus on the development of
a GUI that will ease the introduction of existing
services by users for their semantic annotation,
CONCLUSION probably incorporated in any existing RESTful
semantic annotation tool/utility suite. Further-
In this chapter, we have proposed an approach to more, we also plan to make improvements to the
perform an automatic semantic annotation process proposed system, related to the matching process
of RESTful services. This process is implemented and the use of similarity metrics, so as to improve
in a system that takes into account the DBpedia the results that have been demonstrated in our
ontology and its SPARQL Endpoint, for gen- evaluation. In the same sense, we also aim at
eral annotation, and GeoNames and its SPARQL improving in the SPARQL queries to DBpedia
Endpoint for geospatial specific results, as well and other semantic resources that may be associ-
as different external resources such as synonyms ated or not to a specific domain to better explore
and suggestion services. We use combinations the knowledge of this resource in the annotation
of these resources to discover meanings for each process, and optimize the use of suggestion and
of the parameters of the RESTful services that synonyms services. Additionally, we will adapt the
a user may select and perform semantic annota- described process to add semantic annotation to the
tions of them. Open Geospatial Consortium services; concretely

446
Semantic Annotation of Geospatial RESTful Services Using External Resources

we will focus on Web Feature Services (WFS). Berners-Lee, T. (1998). Principles of design.
Moreover, we will incorporate ontology domains Retrieved from http://www.w3.org /DesignIssues/
in the semantic process for taking advantage of Principle s.html.
specific domain characteristics. Finally, we are
Doan, A., Domingos, P., & Halevy, A. Y. (2003).
planning to generate outputs of semantic annota-
Learning to match the schemas of data sources:
tion process based on different formats, such as:
A multistrategy approach. Machine Learning,
hRESTS, SA-REST and MicroWSMO. These
50(3), 279–301. doi:10.1023/A:1021765902788
files (outputs) will be published in a SPARQL
Endpoint and/or in a public repository. ESRI. (2010). GeoServices REST specifica-
tion version 1.0. White Paper. Retrieved from
http://www.esri.com /library/whitepapers/pdfs/
REFERENCES geoservices-rest-spec.pdf.

Alarcon, R., & Wilde, E. (2010). Linking data from Ferreira Filho, O. F., & Grigas Varella Ferreira,
RESTful services. Linked Data on the Web (LDOW M. A. (2009). Semantic web services: A RESTful
2010). Retrieved from http://www.linkeddata.org. approach. Paper presented at the IADIS Inter-
national Conference WWW/INTERNET 2009.
Alowisheq, A., Millard, D. E., & Tiropanis, T. Rome, Italy.
(2009). EXPRESS: Expressing restful semantic
services using domain ontologies. In Proceedings Fielding, R. (2000). Architectural styles and the
of the International Semantic Web Conference design of network-based software architectures.
2009, (pp. 941-948). ACM. PhD Dissertation. Irvine, CA: University of
California.
Ambite, J. L., Darbha, S., Goel, A., Knoblock,
C. A., Lerman, K., Parundekar, R., & Russ, T. A. Gang, Y. (2009). A research on semantic geospa-
(2009). Automatically constructing semantic web tial web service based REST. In Proceedings of
services from online sources. In Proceedings of the International Forum on Computer Science-
the International Semantic Web Conference, (pp. Technology and Applications, 2009, (vol 1), (pp.
17-32). ACM. 208-210). IFCSTA.

Battle, R., & Benson, E. (2008). Brinding the Gutierrez, C., Hurtado, C., & Vaisman, A. (2005).
semantic web and web 2.0 with representational Lecture Notes in Computer Science: Vol. 3532.
state tranfer (REST). Web Semantics, 6, 61–69. Temporal RDF (pp. 93–107). Berlin, Germany:
doi:10.1016/j.websem.2007.11.002 Springer.

Bechhofer, S., Carr, L., Goble, C. A., Kampa, S., Heß, A., & Kushmerick, N. (2003). Learning
& Board, T. M. (2002). The semantics of semantic to attach semantic metadata to web services. In
annotation. In Proceedings of On the Move to Proceedings of the International Semantic Web
Meaningful Internet Systems, 2002 - DOA/Coo- Conference 2003. ACM.
pIS/ODBASE 2002 Confederated International Klyne, G., & Carroll, J. J. (Eds.). (2004). RDF
Conferences DOA, CoopIS and ODBASE 2002 concepts and abstract syntax. W3C Recommenda-
(pp. 1152–1167). London, UK: Springer-Verlag. tion. Retrieved from http://www.w3c.org.
doi:10.1007/3-540-36124-3_73

447
Semantic Annotation of Geospatial RESTful Services Using External Resources

Kopecký, J., Gomadam, K., & Vitvar, T. (2008). Prud’hommeaux, E., & Seaborne, A. (2008).
hRESTS: An HTML microformat for describing SPARQL query language for RDF. Retrieved from
RESTful web services. In Proceedings of Web http://www.w3.org /TR/2008/REC-rdf-sparql-
Intelligence, (pp. 619-625). Web Inelligence. query-20080115/.
Lathem, J., Gomadam, K., & Sheth, A. P. (2007). Rahm, E., & Bernstein, P. (2001). On matching
SA-REST and (s)mashups: Adding semantics to schemas automatically. The VLDB Journal, 10(4).
RESTful services. In [ICSC.]. Proceedings of doi:10.1007/s007780100057
ICSC, 2007, 469–476.
Resende, L. (2007). Handling heterogeneous data
Lerman, K., Plangprasopchok, A., & Knoblock, C. sources in a SOA environment with service data
A. (2007). Semantic labeling of online informa- objects (SDO). In Proceedings of the ACM SIG-
tion sources. International Journal on Semantic MOD International Conference on Management
Web and Information Systems, 3(3), 36–56. of Data, (pp. 895-897). ACM.
doi:10.4018/jswis.2007070102
Sousa, S. R. (2009). A semantic approach to
Maleshkova, M., Gridinoc, L., Pedrinaci, C., describe geospatial resources. In Proceedings of
& Domingue, J. (2009c). Supporting the semi- the ER 2009 Workshops (CoMoL, ETheCoM, FP-
automatic acquisition of semantic RESTful service UML, MOST-ONISW, QoIS, RIGiM, SeCoGIS) on
descriptions. Paper presented at European Seman- Advances in Conceptual Modeling—Challenging
tic Web Conference 2009. Geneva, Switzerland. Perspectives, (pp. 327-336). Berlin, Germany:
Springer.
Maleshkova, M., Kopecky, J., & Pedrinaci, C.
(2009a). Adapting SAWSDL for semantic annota-
tions of RESTful services. Paper presented at the
ENDNOTES
Workshop: Beyond SAWSDL at OnTheMove
Federated Conferences & Workshops. Vilamoura, 1
http://www.w3.org/RDF/
Portugal. 2
http://www.w3.org/TR/rdf-sparql-query/
Maleshkova, M., Pedrinaci, C., & Domingue,
3
http://wiki.dbpedia.org/Ontology
J. (2009b). Semantically annotating RESTful
4
http://www.geonames.org/ontology/ontol-
services with SWEET. Paper presented at the 8th ogy_v2.2.1.rdf
International Semantic Web Conference. Wash-
5
http://www.oeg-upm.net/files/RESTFulAn-
ington, DC. notationWeb/RESTfulservices.xlsx

Maué, P. (Ed.). (2009). Semantic annotations in


OGC standards. OGC Discussion Paper 08-167.
Retrieved from http://portal.opengeospatial.org /
files/?artifact_id=34916.

This work was previously published in Discovery of Geospatial Resources: Methodologies, Technologies, and Emergent Ap-
plications, edited by Laura Díaz, Carlos Granell and Joaquín Huerta, pp. 156-171, copyright 2012 by Information Science
Reference (an imprint of IGI Global).

448
449

Chapter 27
Framework for Graphical
User Interfaces of Geospatial
Early Warning Systems
Martin Hammitzsch
Helmholtz Centre Potsdam, GFZ German Research Centre for Geosciences, Germany

ABSTRACT
An important component of Early Warning Systems (EWS) for man-made and natural hazards is the
command and control unit’s Graphical User Interface (GUI). All relevant information of an EWS is
concentrated in this GUI and offered to human operators. However, when designing the GUI, not only
the user experience and the GUI’s screens are relevant, but also the frameworks and technologies that
the GUI is built on and the implementation of the GUI itself are of great importance. Implementations
differ based on their applications in different domains but the design and approaches to implement the
GUIs of different EWS often show analogies. The design and development of such GUIs are performed
repeatedly on some parts of the system for each EWS. Thus, the generic GUI framework of a geospatial
EWS for tsunamis is introduced to enable possible synergistic effects on the development of other new
related technology. The results presented here could be adopted and reused in other EWS for man-made
and natural hazards.

INTRODUCTION disaster management related activities regarding


threats (Raape et al., 2010) of natural or man-
As part of an Early Warning and Mitigation System made hazards such as tsunamis. Thus, the DSS
(EWMS), the Decision Support System (DSS) is intended to help the officer on duty to become
provides processing, assessment, visualization, aware of a current situation, to assess incoming
decision support, analysis, warning and man- information, to exploit synergies of information
agement functions for the purpose of supporting fusion and analysis, to assess the impact and con-
sequences and to make informed decisions (Raape
DOI: 10.4018/978-1-4666-2038-4.ch027

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Framework for Graphical User Interfaces of Geospatial Early Warning Systems

et al., 2010). As a result, the DSS and especially its utilization of existing resources with accelerated
user interface are taken into account for a tsunami generation of large volumes of data.
EWMS initially designed and implemented in the
project Distant Early Warning System (DEWS).
Hammitzsch et al. (2010) outline that the DEWS METHODOLOGY
project has the objective of creating a new gen-
eration of interoperable early warning systems After providing the motivation for the presented
based on an open sensor platform. This platform results and the application of free and open source
integrates Open Geospatial Consortium (OGC) software (FOSS), the technological framework for
Sensor Web Enablement (SWE) compliant sen- a command and control unit’s GUI to be applied
sor systems for the rapid detection of hazardous in each EWS for man-made and natural hazards
events, such as earthquakes, sea level anomalies, is introduced with a simplified architecture for
ocean floor occurrences and ground displacements the Command and Control User Interface (CCUI)
for the case of tsunami early warnings. Based on used in natural crisis management (NCM) for
the upstream information flow, DEWS focuses tsunamis developed in DEWS and TRIDEC. In
on the improvement of downstream capacities this context, the selected GUI frameworks and geo
for warning centers, especially by improving frameworks are introduced together with an ap-
information logistics for effective and targeted plication oriented Geographic Information System
warning message aggregation for a multilingual (GIS) and the relevant standards and technologies.
environment. Even though DEWS was primarily
focused on tsunami early warning in the Indian
Ocean region, Lendholt and Hammitzsch (2011) MOTIVATION AND PRECONDITIONS
explain that the system offers a modular design
that serves as a reference architecture for early In ‘The Global Survey Of Early Warning Systems,’
warning systems independent of the hazard type Annan (2006) concludes that considerable short-
and region. A generic reference architecture for comings and gaps remain, especially in developing
early warning systems should have components countries, where basic capacities, equipment and
designed for serving in new deployments and resources are often not available. The UN (2006)
new sites without re-programming or compila- reports that systems for some hazards, such as
tion procedures. Instead, deployment-specific tsunamis, are often absent. In this regard, Löwe et
add-ons or plug-ins should be easy to add in al. (2011) summarize that the Boxing Day Tsunami
a configurable system based on the needs of a of 2004 killed over 240,000 people in 14 coun-
specific scenario. This means, also, that the user tries and inundated the affected shorelines with
interface has to be not bound to specific hazard waves reaching heights of up to 30 m. Whereas
characteristics and must be developed and imple- tsunami early warning capabilities have improved
mented independently from specific infrastructure in the meantime by continuing the development
characteristics. Wächter et al. (2011) report that of modular Tsunami Early Warning Systems
these developments are resumed and continued by (TEWS), recent tsunami events, such as the Chile
the project Collaborative, Complex, and Critical 2010 and the Honshu 2011 tsunami, demonstrate
Decision-Support in Evolving Crises (TRIDEC, that there are still shortcomings in the systems.
TRIDEC 2010) focusing on real-time intelligent Furthermore, most of the disaster management
information management and including the design systems currently in place are proprietary systems
and implementation of a robust and scalable ser- (Chen et al., 2010). These proprietary systems are,
vice infrastructure supporting the integration and however, challenged by limitations such as high

450
Framework for Graphical User Interfaces of Geospatial Early Warning Systems

cost, low flexibility, and constrained compatibil- infrastructures (SDIs). They conclude that ‘open-
ity (Ashcroft et al., 2002; Burghardt, 2006). For source software provides great potential to make
example, Chen et al. (2010) exemplify adopting available components for SDI implementations
the FOSS concept due to following reasons: to that are affordable by resource poor organizations’
evade many of the existing challenges, such as a and add that ‘many building blocks for SDIs are
lack of funds to generate proprietary software; to already available.’ Ramsey (2007) remarks that
endorse greater flexibility, wealth of knowledge, ‘existing [FOSS GIS] products are now entering a
and improved compatibility; and to make wide- phase of rapid refinement and enhancement’. He
spread deployment and customization a reality. further emphasizes that ‘open source [GIS] soft-
In this regard, Steiniger and Bocher (2009) ware can provide a feature-complete alternative
argue that the development of FOSS has experi- to proprietary software in most system designs.’
enced a boost over the last few years. The variety Thus, a FOSS concept is followed to increase
of FOSS that can be found on desktop computers the utility of the CCUI and to leverage benefits such
ranges from word processors and web browsers as low cost, flexibility, and improved compatibil-
to drawing and scientific applications. In the GIS ity. Furthermore, the presented framework based
domain, the widespread use of FOSS is apparent on FOSS tries to fill the gaps outlined partially
as well. This rise in popularity of free GIS tools by providing an approach to copy findings and
can be measured using four indicators. The first development to implement a CCUI for natural
indicator is the number of projects started in the last and man-made hazards.
few years. The second indicator is the increasing
financial support by governmental organizations
for the foundation of FOSS GIS projects. The third COMMAND AND CONTROL
indicator is the download rates of free desktop USER INTERFACE
GIS software. Finally, a fourth indicator is an
increasing number of cases of using open source Based on profound FOSS community activities
GIS software. Along with this trend towards the and results described hereinafter, the CCUI will be
application of FOSS goes the number of research based on the Eclipse Rich Client Platform (RCP)
publications that mentions the use of open source and the User-friendly Desktop Internet GIS (uDig)
software tools and libraries (i.e., Mitasova & Ne- Software Development Kit (SDK), with their
teler, 2004). Moreover, software and algorithms behind frameworks and standards. Hammitzsch
developed in research projects are increasingly et al. (2010) summarize that existing standards
published under open source licenses (e.g., Badard have been integrated, wherever possible. The
& Braun, 2003, Pebesma, 2004, Burghardt et al., CCUI integrates various services standardized
2005, Buliung & Remmel, 2008). As such, it is by OGC. Using the Web Map Service (WMS)
important to note that the FOSS movement that and Web Feature Service (WFS) specifications,
postulates freedom of use and modification for spatial data are utilized to present the situation
software is not restricted to software only. Rather, picture and to integrate a simulation system via
one regards free software as ‘the foundation of a the Web Processing Service (WPS), to identify the
learning society where we share our knowledge affected areas. Warning messages are compiled and
in a way that others can build upon’ (Lee, 2011). transmitted using the Common Alerting Protocol
Hence, this movement also includes the free avail- (CAP) for the content together with addressing
ability of data that forms a basis for our knowledge. information using the Emergency Data Exchange
Reid and Martin (2001) discussed the potential of Language – Distribution Element (EDXL-DE).
open source software for implementing spatial data Both standards are specified by the Organization

451
Framework for Graphical User Interfaces of Geospatial Early Warning Systems

for the Advancement of Structured Informa- (Figure 1, Element 2), exemplarily shown in
tion Standards (OASIS). Internal interfaces are Figure 2, provides a survey of a specific area and
realized with SOAP web services. Standards contributes an overall situation picture to the
and functionalities are implemented mainly via operator, with geo-spatial information to track
plug-ins, a central concept of the Eclipse RCP running events. The Forecasting Perspective
that enables the building of platform applications (Figure 1, Element 3), exemplarily shown in
with user interfaces. Figure 1 depicts the stack of Figure 3, supports the operator in analyzing dif-
frameworks, libraries, technologies and plug-ins ferent probable forecasts provided by the simula-
adopted and realized in the CCUI. tion system. The Message Composition Perspec-
Atop of various library and functionality plug- tive (Figure1, Element 4), exemplarily shown in
ins, the CCUI reveals its functionalities to the Figure 4, facilitates the operator to prepare and
operator via a GUI assembled by so-called per- send warning messages or system messages. The
spectives (Figure 1, Element 1). Each perspective Dissemination Perspective (Figure 1, Element 5)
determines visible actions and provides views provides a comprehensive overview of the status
within a window that regard the dedicated major of disseminated messages sent through the dif-
tasks of the operator. The composition of the GUI’s ferent dissemination channels and allows observa-
graphical part (i.e., the perspectives) is described tion of all disseminations initiated for specific
in Hammitzsch et al. (2012) and forms the basis user groups. The Situation Picture Perspective
for the technological approach presented in this (Figure 1, Element 6) also known as the Map
paper. Hammitzsch et al. (2012) explain that the Perspective is included via uDig and provides a
following main perspectives support operators set of preconfigured and selected thematic map
doing their tasks with EWSs for man-made and layers, also allowing the incorporation of dy-
natural hazards. The Monitoring Perspective namic additional information on the map to ana-

Figure 1. CCUI framework stack

452
Framework for Graphical User Interfaces of Geospatial Early Warning Systems

Figure 2. CCUI monitoring perspective with exemplary setup for Thailand developed in DEWS

lyze the specifics of a given situation. The Main- disconnecting them. The Administration Perspec-
tenance Perspective (Figure 1, Element 7) enables tive (Figure 1, Element 8) allows the administra-
the operator to maintain sensors and sensor net- tion of configurations and settings needed for
works by means of requesting sensor observations information logistics. Additionally, each of the
and planning services as well as rebinding and perspectives ships its own libraries, employed

Figure 3. CCUI forecasting perspective with evaluation setup for Turkey developed in TRIDEC (Ozel
et al., forthcoming)

453
Framework for Graphical User Interfaces of Geospatial Early Warning Systems

Figure 4. CCUI message composition perspective with validation setup for Indonesia developed in
continued DEWS activities (Hammitzsch et al., 2012)

solely by the relevant perspective. This distribu- Eclipse and Eclipse Rich
tion allows the deployment of each perspective Client Platform
as a standalone application or, as outlined here,
in a compound application together with other Aniszczyk (2010) describes Eclipse as an open
perspectives. source community focused on providing an
extensible development platform for building
CCUI Application Framework software and developing a universal platform
for application frameworks and exemplary tools
The CCUI application framework (Figure 1, that make it easy and cost-effective to build and
Element 9) forms the basis of the introduced deploy software. There is a large consortium of
framework stack and is composed of the Java major software vendors, solution providers, cor-
Platform Enterprise Edition (Java EE; Figure 1, porations, educational and research institutions,
Element 10) together with components built upon and individuals working cohesively to create an
GeoTools including a GeoAPI implementation ecosystem that enhances and cultivates the Eclipse
(Figure 1, Element 11), Eclipse RCP (Figure 1, platform with complementary products, capabili-
Element 12) and uDIG (Figure 1, Element 13). ties, and services.
These components form the basis for the imple- While the Eclipse platform is designed to
mentation of CCUI specific components. The serve as an open tools platform, it is architected
resulting platform that is employed to build the so that its components could be used to build
CCUI on top is introduced briefly in this section. almost any client application. The minimal set of

454
Framework for Graphical User Interfaces of Geospatial Early Warning Systems

plug-ins needed to build a rich client application is a thick client, natively on Windows, Mac OS/X
collectively known as the Eclipse RCP. Applica- and Linux; Internet oriented, adhering to standards
tions can be built using a subset of this platform. such as OGC WMS, WFS, Web Coverage Service
This rich set of applications is still based on the (WCS) and de facto geospatial web services such
dynamic plug-in model, and the user interface is as GeoRSS, Keyhole Markup Language (KML)
built using the same toolkits and extension points. and tiles; and GIS ready, providing the framework
The layout and function of the workbench is under on which complex analytical capabilities can be
the fine-grained control of the plug-in developer, built and gradually subsuming those capabilities
in this case (Irawan, 2009). into the main application.
With regard to Eclipse RCP, Irawan et al. (2010) Thus, the integration of uDig as an open source
comments that many people that have built, or GIS framework in Eclipse RCP based applica-
are building RCP applications state that the main tions saves efforts on developing necessary GIS
value they get from using the RCP is that it allows functionality and leverages the flexibility of the
them to build a professional-looking application underlying Eclipse framework. However, Steini-
quickly, with a native look-and-feel, on multiple ger and Bocher (2009) also point out that there are
platforms, allowing them to focus on their value two perceived disadvantages that result from the
added. They appreciate that the components that use of Eclipse RCP as the core component: The
form the RCP are of high quality, are actively first disadvantage is the size of the application,
maintained, and are open source. They often and the second disadvantage is that the GUI is
discover, after the initial adoption of RCP, that quite similar to the development environment for
there are many other Eclipse components avail- programming and thus, may be too complex for
able. Several individuals have also discovered end users. On the positive side, the user and the
that the inherent extensibility of Eclipse allows developer documentation are very good, and the
them to build not only a closed-form product but project management and software development
also an open-ended platform in their own domain. processes are outlined and documented.
Using Eclipse RCP, the development of uDig
User-Friendly Desktop Internet GIS started between 2004 and 2005 and has been
‘uDig,’ its Software Development Kit initiated by Refractions Research Inc. in Canada,
and the Eclipse RCP Target Platform which also develops the spatial extension PostGIS
for the PostgreSQL database (Steiniger & Bocher,
uDig is an open source desktop application frame- 2009). Based on Eclipse RCP, uDig is developed
work, built with RCP technology. uDig can be used in Java and was initially focused on the editing
as a stand-alone application, can be extended with of vector data. However, since 2007 the uDig
RCP plug-ins and can be used as a plug-in for an team has been joined by the JGrass team, which
existing RCP application (Refractions, 2010a). works on raster analysis functionality (Antonello
Thus, uDig is used as an off-the-shelf product, & Francheschi, 2007). Besides JGrass, further
and it also provides an environment to imple- projects are involved in the development. Because
ment solutions for domain specific applications of the joint work of these projects and the possi-
(WhereGroup, 2010). Refractions (2010a) reports bility of using uDig as a frontend for the PostGIS
that the goal of uDig is to provide a complete Java database (which has already a very wide user
solution for Desktop GIS data access, editing, and base), the user and developer community grows
viewing. uDig aims to be the following: user- consistently (Steiniger & Bocher, 2009). uDig is
friendly, providing a familiar graphical environ- further developed and maintained at the present
ment for GIS users; desktop located, running as time under the Refraction Research and uDig user

455
Framework for Graphical User Interfaces of Geospatial Early Warning Systems

community to make it a better and more powerful easily usable code base. GeoTools implements the
application. This activity follows the vision to fill core global geospatial standards of the Interna-
in functional gaps in the two technology com- tional Organization for Standardization (ISO) and
munities: the open source geospatial community of the OGC to ensure maximal interoperability
and the open standards geospatial community, as with other geospatial software. GeoTools is a
represented by OGC (Tress & Moestofa, 2009). project that is growing continuously and breaking
To perform standardized operations for core GIS new grounds by implementing untested standards.
functionality such as data reading, coordinate The library is therefore still evolving to address
re-projection, and rendering, uDig uses the well- weaknesses in certain areas, to implement new
established GeoTools library (Refractions, 2010b). functionality, to work around defects in existing
standards and to embrace newer standards when
Open Source Java GIS proposed. GeoTools provides almost no graphical
Toolkit ‘GeoTools’ user interface functionality. Thus developers plan-
ning applications that will depend on the library
GeoTools is a free, open source Java geospatial should carefully assess the relevant strengths and
toolkit for working with both vector and raster weaknesses of GeoTools.
data (GeoTools, 2010) and is a library that pro- Importantly, not only is GeoTools an open
vides classes, objects, and methods required to source Java GIS toolkit providing implementations
implement standards conformant to geospatial of many OGC specifications as they are developed,
applications (Santokhee, 2008). The library con- but it also is associated with the GeoAPI project
sists of a large number of various modules that that creates geospatial Java interfaces (OSGEO,
can be combined according to programmer needs 2010).
(Santokhee, 2008) and that allow you to do the fol-
lowing: access GIS data in many file formats and GeoAPI’s Java Language Programming
spatial databases; work with an extensive range of Interfaces for Geospatial Applications
map projections; filter and analyze data in terms
of spatial and non-spatial attributes; compose and In 2003 the GeoTools project started an outreach
display maps with complex styling; and create and program in the form of the GeoAPI project as an
analyze graphs and networks (GeoTools, 2010). effort to collaborate with other projects working in
Santokhee (2008) reports that GeoTools can be the same domain and as a line of communication
readily extended by adding new modules, either to the standards community (GeoTools, 2011). The
for custom applications or as contributions to the GeoAPI provides a set of Java language program-
library. GeoTools supports additional formats ming interfaces which include many of the data
through the use of plug-ins. A developer can structures and manipulation methods needed for
control the formats supported by the application GIS applications (OGC, 2011). It provides neutral,
by only including the plug-ins required. Thus, interface-only APIs derived from OGC and ISO
GeoTools can be used to build applications of standards to reduce duplication and to increase
many different types covering most processes of interoperability (Desruisseaux & Garnett, 2010).
GISs, including network server applications and The need for a standardized set of programming
desktop clients. GeoTools leverages a modern, interfaces that remains constant across different
object oriented structure and uses common, well toolkits is reflected in GeoAPI’s history, which
documented computational design patterns. The includes an attempt for GeoAPI to become a
library consists of loosely dependent modules to standard. Desruisseaux et al. (2010) recollects
present third party programmers with a coherent, that the GeoAPI project was initially an initiative

456
Framework for Graphical User Interfaces of Geospatial Early Warning Systems

of various open source communities wanting to the API, from library code, which implements the
reduce work duplication. The goal was to make API. These interfaces are not an implementation.
it easier to exchange pieces of software between Clients can use the API without concern for the
independent projects, so that a project does not particular implementation which they will use. One
need to reinvent functions already provided by of various groups, that have implemented differ-
another project. In 2004, GeoAPI merged with the ent subsets of GeoAPI, is GeoTools introduced.
GO-1 initiative from OGC. In September 2004, the
creation of a GeoAPI working group has been ap- Java Programming Language
proved by OGC voting members. In May 2005, the and Computing Platform
GO-1 final specification, which includes GeoAPI
interfaces, finally, has been accepted as an OGC Based on the use of available frameworks and
standard. Discussing why a standardized set of libraries, the characteristics of the underlying
programming interfaces is required instead of the technology used (i.e., Java) will be summarized,
standardized OGC web services, Desruisseaux et to outline the potentials of this technology for
al. (2010) proposes that web services are efficient the future adoption of the CCUI in related EWS
ways to publish geographic information using application domains.
existing software. However, some users need to Jendrock et al. (2006) argue that developers
build their own solution, for example as a wrap- today increasingly recognize the need for dis-
per on top of their own numerical model. Many tributed, transactional, and portable applications
existing software packages provide sophisticated that leverage the speed, security, and reliability
developer toolkits, but each toolkit has its own of server-side technology. In the world of infor-
learning curve, and one cannot easily switch from mation technology, enterprise applications must
one toolkit to another or mix components from be designed, built, and produced for less money,
different toolkits. Using standardized interfaces, with greater speed, and with fewer resources. With
a significant part of the API can remain constant the Java EE platform, development of enterprise
across different toolkits, thus reducing both the applications is easy and fast. The aim of the Java
learning curve and the interoperability challenges. EE platform is to provide developers with a pow-
OGC (2011) summarizes that the GeoAPI erful set of APIs while reducing the development
interfaces closely follow the abstract model and time, reducing the application complexity, and
concrete specifications published collaboratively improving the application performance. Further-
by the ISO in its 19100 series of documents more, Jendrock et al. (2010) adds that the Java
and the OGC in its abstract and implementation EE platform is developed through the Java Com-
specifications. GeoAPI provides an interpretation munity Process (JCP), which is responsible for all
and adaptation of these standards to match the ex- Java technologies. Expert groups, composed of
pectations of Java programmers. These standards interested parties, have created Java Specification
provide GeoAPI with the richness which comes Requests (JSRs) to define the various Java EE
from the expertise of the specification writers. technologies. The work of the Java community
Clients benefit from the potential for interoper- under the JCP program helps to ensure Java tech-
ability which comes from using a well-defined, nology’s standard of stability and cross-platform
standardized data model. Implementors benefit compatibility.
from having a pre-defined set of well-considered, Importantly, Java EE is platform independent
formal boundaries to modularize their develop- and supports applications on various computing
ment work. So the GeoAPI interfaces provide a platforms ranging from super computers and
layer which separates client code, which would call enterprise servers on the one end to mobile and

457
Framework for Graphical User Interfaces of Geospatial Early Warning Systems

embedded devices on the other end. It is a de-facto EWS Specific CCUI Plug-Ins
industry standard for developing applications. Java
EE is one of the most widely used technologies The application plug-in (Figure 1, Element 21) is
in its field. Finally, Java EE is free and supported based on the introduced application framework, the
by a large worldwide community and with lots required plug-ins and the optional plug-ins. The
of vendors in the information technology (IT) application plug-in is the plug-in from which the
domain. application is started and which forms the basis
for specific EWS implementation. On the basis of
Generic CCUI Plug-Ins the described layers of the CCUI framework, EWS
specifics are implemented by further plug-ins.
The CCUI application framework introduced in Special purpose plug-ins (Figure 1, Element 22)
Figure 1 is supplemented with required plug-ins extend functionality, for example, with extensions
(Figure 1, Element 14). The core plug-in (Figure to the GeoTools library (Figure 1, Element 23) and
1, Element 15) provides core functionality for required messaging functionality for communica-
graphical parts of the application such as perspec- tion (Figure 1, Element 24). Standards defined by
tives, views, and internal events. The configura- the OGC are covered by OGC specific plug-ins
tions plug-in (Figure 1, Element 16) contains the (Figure 1, Element 25). uDig already provides
central configurations and settings used to store the required functionality to communicate via
all of the plug-in preferences, e.g., region and WMS and WFS (Figure 1, Element 26). However,
hazard dependent configurations for the differ- other functionalities to access sensors via SWE
ent setups depicted in Figures 2 through 4. The (Figure 1, Element 27) and to instruct geospatial
libraries plug-in (Figure 1, Element 17) contains processing via WPS (Figure 1, Element 28) are
the libraries accessed by other plug-ins, e.g., to added by further plug-ins.
work with standards such as CAP, EDXL-DE
and SOAP. Further plug-ins provide function- Hazard Specific CCUI Plug-Ins
alities to be used optionally (Figure 1, Element
18) by other plug-ins built on top. For example, Additional sensor specific plug-ins (Figure 1,
the AspenCloud widgets (Figure 1, Element 19) Element 29) such as the Tsunami Service Bus
extend the available Standard Widget Toolkit (TSB) plug-in (Figure 1, Element 30) and the
(SWT) widgets integrated in Eclipse RCP and thus earthquake (Figure 1, Element 31) and sea level
form a new widget with an entirely new capacity, (Figure 1, Element 32) plug-ins extend the generic
meeting the specific requirements of the CCUI, and standardized access covering system and haz-
which are related to time management. Another ard specifics (Figure 1, Element 33), in this case
optional plug-in named gazetteer (Figure 1, Ele- for earthquake and sea level monitoring. Further
ment 20) contributes to the main application by hazard specifics and convenience functionality
providing access to a gazetteer service to assist might be provided by other plug-ins such as the
the operator in the task of navigating around the tsunami functionalities plug-in (Figure 1, Element
map by typing a location name and centering the 34), which provides specific functionality to deal
map on its coordinates. Other optional plug-ins with aspects related to tsunami risks in a geospatial
might be integrated if desired. EWS for tsunami warning and mitigation. Also,
the already introduced perspectives that reveal the
functionality of one or more operators on duty are
hazard specific plug-ins that expose graphically
the required and implemented functionality.

458
Framework for Graphical User Interfaces of Geospatial Early Warning Systems

Realization as Desktop GIS communication between the elements of a GIS


preferred to WebGIS (Avraam, 2009).
However, the WhereGroup (2010) argues that
The CCUI is developed as a Desktop GIS utilizing Desktop GIS enables a wide range of users to
an open standards based Service Oriented Archi- process geospatial data and to utilize a wealth of
tecture (SOA) with the adoption of free and open geospatial functionality based on comparatively
source technologies, software and frameworks, cheap and readily available Personal Computers
as discussed earlier. This section outlines briefly (PCs). Over time, Desktop GIS has become a typi-
why the Desktop GIS approach has been applied. cal example that represents a tendency to move
Nalani (2009) explains that the rapid growth of from powerful centralized computing hardware
the Internet has led to an increasing concern over to decentralized networks of PCs for geo-data
the WebGIS approach, with the World Wide Web processing. With the introduction of distributed
(WWW) offering new application areas. This ap- architectures and web based processing services,
proach provides access to geographic information the use of geospatial data has moved from its lim-
via the WWW and is used to distribute informa- ited traditional application areas to a mainstream
tion to potentially numerous users. Because of source of information. At the same time, the
the considerable interest in geospatial data visu- relevance of stand-alone Desktop GIS products
alization through web services that employ GIS is steadily declining. While the total number of
the widespread availability of internet and related installed seats is more or less stable, most Desktop
services such as WWW technology have been GIS now plug into service oriented SDIs as data
widely applied in geospatial data visualization sources and thus combine the advantages of local
and in providing the infrastructure to allow wide processing capabilities with distributed architec-
access to geo-referenced data or GIS services. tures. In this regard, Avraam (2009) summarizes
Further, these technologies provide an advanced that the interaction between the components is
means to assist with visual data exploration and usually very direct in Desktop GIS. Data, maps
spatial decision making systems in the context and analysis happen on the same computer, so
of landslides, earthquakes, flood modeling and communication is done internally. Enterprise
debris flow. Desktop GIS often allows the user to commu-
Additionally, Siddiquee et al. (2009) state that nicate with data remotely, and sometimes even
the WebGIS is comparatively new but is a very analysis is performed remotely. WebGIS enables
fast growing sub-set of GIS. WebGIS is achieving the communication of all components to happen
particular significance for spatial data handling through the web, enabling diverse data, analysis
over the web. In other words, WebGIS are used for algorithms, users and visualization techniques
distributing and processing geographic informa- that may be hosted at any location on the web.
tion via the Internet and the WWW. It is attain- Summarizing the WebGIS approach, there is
ing increasing momentum and acceptability for a major focus on cloud computing (Armbrust et
different levels of users, such as geospatial data al., 2009) in which the Desktop GIS approach
handlers and producers as well as governmental changes dramatically from a traditional mono-
and non-governmental agencies. Because the lithic desktop application, accessing a few re-
WebGIS is platform independent, this capabil- mote services only, to a desktop application that
ity reduces the necessity for purchasing costly provides access to both integrated functionality
Desktop GIS software. WebGIS then refers to and distributed services on servers and the cloud.
GIS that uses web technologies as a method for With the increasing performance of accessing
network nodes in the WWW and the growing

459
Framework for Graphical User Interfaces of Geospatial Early Warning Systems

availability of widespread services in the cloud, FOSSLAB is a platform for the development of
a modern GIS has to support the access and usage FOSS projects in a geospatial context, allowing
of distributed services to be suitable for the future. users to save, advance and reuse results achieved
However, cloud computing is still an emerging in previous and on-going project activities and
approach in its current state, and does not cover enabling further development and collaboration
professional requirements that need to be met with a wide community including scientists, de-
to be implemented in (enterprise) Desktop GIS. velopers, users and stakeholders. Furthermore,
Apparently, WebGIS solutions do not provide the FOSSLAB constitutes an umbrella encompass-
large amount of professional functionalities today ing a number of related geoinformatic activities,
provided by Desktop GIS solutions. Also, Web- such as the documentation of best practices for
GIS solutions are often specialized applications working experiences and results with SDIs, GIS,
with single components in what is termed GIS, spatial processing on clusters and in clouds, and
providing limited functionality tailored to specific geomatics related topics.
and not extensive application fields. Moreover,
limited web mapping applications accessing the
so-called GeoWeb (Leclerc et al., 2001) are often APPLICATIONS
misleadingly described as WebGIS. Finally, a ma-
jor drawback of the cloud dependent WebGIS is In this regard, readers interested in the ideas cov-
that the response time can be long, depending on ered in this paper and readers who want to make
a number of factors, such as connection capacity, use of those ideas in constructive ways are enabled
data volume, network traffic, and processor power to act on or apply those ideas by firstly visiting the
(Bonnici, 2005). Thus, an application specific GIS related FOSSLAB sub-projects at www.fosslab.
such as an EWS for tsunamis should implement a org. Secondly, using the products and results of-
Desktop GIS approach based on a geo framework fered by the FOSSLAB sub-projects, asking and
that provides both access to distributed services answering questions, reporting bugs and making
and access to integrated functionality, allowing feature requests are critical parts of the projects’
utilization of the appropriate functionality based communities. Still only a few key persons repre-
on the required needs. sent the communities in small, gradually evolv-
ing groups and provide instructions successively
and on request. It is the user feedback which can
DEVELOPMENT AND RESEARCH strongly drive the projects and the technologies.
CONTINUED AS FOSS INITIATIVE Beyond the user activities, it would be greatly
appreciated getting help with the development
Löwe et al. (2011) summarize that all develop- of the projects. This can mean getting involved
ments of the CCUI are based to the largest extent with the discussions on the development mailing
on FOSS components and industry standards. They lists, answering user questions, providing patches
announce that emphasis has been and will be made for bug-fixes or features, and helping to improve
on leveraging open source technologies that sup- the documentation.
port mature system architecture models wherever In this context it is important to mention that
appropriate. All open source software produced is behind the work presented here various software
foreseen to be published on FOSSLAB, a publicly components form the overall system by provid-
available software repository provided by the Cen- ing the required data and services, which finally
tre for GeoInformation Technology (CeGIT) of the expose their information and functionality to an
GFZ German Research Centre for Geosciences. operator on duty (OOD) with the CCUI. These

460
Framework for Graphical User Interfaces of Geospatial Early Warning Systems

required software components have to be built, by the stakeholders and domain experts and is cur-
deployed and configured appropriately on an op- rently under further development. More extensive
erating system (OS) with other suitably installed evaluation and revision of this system is expected
FOSS before the CCUI can be used. Based on to be completed in the future with multiple experts
activities in the DEWS successor TRIDEC stable from research and government agencies offering
versions of these components, used in the problem their opinions on the system. Through open-ended
domain of natural crisis management for tsunamis, survey questions, the requirements of the system
are bundled irregularly together with the pre- have to be controlled and successively refined.
installed OS on a raw disk image to be used with In addition, the unique benefits of the system in
many virtualization software packages. Alterna- providing a FOSS multi hazard framework concept
tively, there is the opportunity to download these implementation, and the relevance of the approach
components, publicly shared on the FOSSLAB followed here should be affirmed by the respective
sub-projects one by one, and to install them with stakeholders. Finally, decision-makers in local
the help of available instructions. and regional authorities already have expressed
interest in the system (ICT, 2010).

CONCLUSION
ACKNOWLEDGMENT
This paper covers the investigation and results
of selecting the appropriate GUI and geospatial The work presented in this paper is part of the
frameworks for the implementation of the CCUI research and development in the DEWS project
and exposes the generic framework designed and (contract no. 045453), partially funded by the 6th
developed in conjunction with the realization of the Framework Program of the European Commis-
CCUI. Specifically, the user interface framework sion. DEWS is based on the collective creativity,
presented for early warning systems is an attempt idealism and optimism of a global community
to provide an appropriate and useful concept for the sharing achievements of performed work and find-
business of public safety, emergency management ings with free and open source licenses. Thus, it is
and homeland security. The concept presented is not possible to acknowledge all of the individuals
independent of a specific hazard domain, thus and institutions that have contributed. The author
provides a generic solution for specific applica- would like to express his sincere thanks to all of
tions in the area of EWS and might be adopted in those who contributed towards the development
other systems for managing natural and man-made of DEWS and its CCUI by providing the vital
hazards. Within DEWS and TRIDEC, the designed foundations with their efforts and collectively
concepts have been realized and implemented shared results. Further findings and develop-
as generic software components validated and ments are conducted within the TRIDEC project
approved by different interest groups in several (contract no. 258723), partially supported by the
project-internal and public live-demonstrations, 7th Framework Program of the European Com-
demonstrating the complete early warning system mission. The author is grateful to the anonymous
prototype including the CCUI as a concept for an reviewers for their comments, which have helped
adaptable and extensible user interface for early to improve the earlier version of this paper.
warning systems (Lendholt & Hammitzsch, 2011).
In this regard, Hammitzsch et al. (2012) assert that
the overall system, including the CCUI and its
framework, has received preliminary evaluation

461
Framework for Graphical User Interfaces of Geospatial Early Warning Systems

REFERENCES Burghardt, D., Neun, M., & Weibel, R. (2005).


Generalization services on the web – classification
Aniszczyk, C. (2010). Eclipse IDE project re- and an initial prototype implementation. Cartog-
sources. Retrieved October 31, 2010, from http:// raphy and Geographic Information Science, 32(4),
www.ibm.com/developerworks/opensource/top- 257–268. doi:10.1559/152304005775194665
projects/eclipse/eclipse-starthere.html
Chen, R., Thiyagarajan, T., Rao, R. H., & Lee,
Annan, K. A. (2006). Foreword to the global J. K. (2010). Design of a FOSS system for flood
survey of early warning systems. In Proceedings disaster management. In Proceedings of the 7th
of the Third International Conference on Global International Conference on Information Systems
Survey of Early Warning Systems, Bonn, Germany. for Crisis Response and Management, Seattle, WA.
Armbrust, M., Fox, A., Griffith, R., Joseph, A. Desruisseaux, M., & Garnett, J. (2010). Home.
D., Katz, R. H., & Konwinski, A. …Zaharia, Retrieved October 31, 2010, from http://docs.
M. (2009). Above the clouds: A Berkeley view codehaus.org/display/GEO/Home
of cloud computing. Retrieved December 27,
2011, from http://www.eecs.berkeley.edu/Pubs/ Desruisseaux, M., Garnett, J., et al. (2010). FAQ.
TechRpts/2009/EECS-2009-28.html Retrieved October 31, 2010, from http://docs.
codehaus.org/display/GEO/FAQ
Ashcroft, J., Daniels, D. J., & Hart, S. V. (2002).
Crisis Information Management Software (CIMS) DEWS. (2010). Distant early warning system.
feature comparison report. Washington, DC: Retrieved December 17, 2011, from http://www.
National Institute of Justice. dews-online.org

Avraam, M. (2009). Geoweb: Web mapping and GeoTools. (2010). GeoTools user guide frequently
web GIS. Retrieved November 3, 2010, from asked questions. Retrieved October 29, 2010,
http://michalisavraam.org/2009/03/geoweb-web- from http://docs.geotools.org/stable/userguide/
mapping-and-web-gis/ faq.html

Badard, T., & Braun, A. (2003). OXYGENE: GeoTools. (2011). End of GeoAPI involvement.
An open framework for the deployment of geo- Retrieved December 18, 2011, from http://geoto-
graphic web services. In Proceedings of the 21st olsnews.blogspot.com/2011/01/end-of-geoapi-
International Cartographic Conference, Durban, involvement.html
South Africa. Hammitzsch, M., Lendholt, M., & Esbrí, M. A.
Bonnici, A. M. (2005). Web GIS software com- (2012). User interface prototype for geospatial
parison framework. Retrieved November 8, 2010, early warning systems – a tsunami showcase.
from http://www.webgisdev.com/webgis_frame- Natural Hazards and Earth System Sciences, 12,
work.pdf 555–573. doi:10.5194/nhess-12-555-2012

Buliung, R. N., & Remmel, T. K. (2008). Open Hammitzsch, M., Lendholt, M., & Wächter, J.
source, spatial analysis, and activity-travel behav- (2010). Distant early warning system for tsuna-
iour research: Capabilities of the aspace package. mis – A wide-area and multi-hazard approach.
Journal of Geographical Systems, 10(2), 191–216. In Proceedings of the Conference of the Euro-
doi:10.1007/s10109-008-0063-7 pean Geosciences Union: Vol. 12. Geophysical
Research Abstracts. Retrieved August 17, 2010,
from http://meetingorganizer.copernicus.org/
EGU2010/EGU2010-4496.pdf

462
Framework for Graphical User Interfaces of Geospatial Early Warning Systems

Irawan, H., McCrary, T., Barbareau, L., et Mitasova, H., & Neteler, M. (2004). GRASS as
al. (2010). RCP FAQ. Retrieved October 31, an open source free software GIS: Accomplish-
2010, from http://wiki.eclipse.org/index.php/ ments and perspectives. Transactions in GIS, 8(2),
RCP_FAQ#Why_should_I_build_my_applica- 145–154. doi:10.1111/j.1467-9671.2004.00172.x
tion_on_the_Eclipse_Rich_Client_Platform.3F
Nalani, H. A. (2009). WebGIS based 3D visu-
Irawan, H., Vogel, L., Ebert, R., et al. (2009). alization of geospatial data. In Proceedings of
Rich client platform. Retrieved October 31, 2010, the Applied Geoinformatics for Society and En-
from http://wiki.eclipse.org/index.php/Rich_Cli- vironment Conference, Stuttgart, Germany (pp.
ent_Platform 212-215). Retrieved November 3, 2010, from
http://www.gis-news.de/papers/AGSE_Proceed-
Jendrock, E., Ball, J., Carson, D., Evans, I., Fordin,
ings_2009_07_01_1025.pdf
S., & Haase, K. (2006). The Java EE 5 tutorial:
For Sun Java System Application Server Platform OGC. (2011). GeoAPI. Retrieved December 18,
Edition 9 (Java Series). Upper Saddle River, NJ: 2011, from http://www.geoapi.org
Prentice Hall.
OSGEO. (2010). GeoTools InfoSheet. Retrieved
Jendrock, E., Evans, I., Gollapudi, D., Haase, K., October 31, 2010, from http://www.osgeo.org/
& Srivathsa, C. (2010). The Java EE 6 tutorial geotools
1: Basic concepts (Java Series). Upper Saddle
Ozel, N. M., Necmioglu, O., Yalciner, A. C.,
River, NJ: Prentice Hall.
Kalafat, D., Yilmazer, M., & Comoglu, M. …
Leclerc, Y., Reddy, M., Iverson, L., & Heller, A. Erdik, M. (forthcoming, June 17-22). Tsunami
(2001). The GeoWeb - A new paradigm for finding early warning in the Eastern Mediterranean, Ae-
data on the Web. In Proceedings of the Interna- gean and Black Sea. Paper to be presented at the
tional Cartographic Conference, Beijing, China. 22nd International Ocean and Polar Engineering
Conference, Rhodes (Rodos), Greece.
Lee, M. (2011). What is free software and why
is it so important for society? Retrieved June 13, Pebesma, E. J. (2004). Multivariable geosta-
2011, from http://www.fsf.org/about/what-is- tistics in S: The gstat package. Computers &
free-software Geosciences, 30(7), 683–691. doi:10.1016/j.
cageo.2004.03.012
Lendholt, M., & Hammitzsch, M. (2011). Generic
information logistics for early warning systems. In Raape, U., Teßmann, S., Wytzisk, A., Steinmetz,
Proceedings of the 8th International Conference T., Wnuk, M., & Hunold, M. …Jirka, S. (2010).
on Information Systems for Crisis Response and Decision support for tsunami early warning in
Management, Lisbon, Portugal. Indonesia: The role of OGC standards. In M.
Konecny, T. L. Bandrova, & S. Zlatanova (Eds.),
Löwe, P., Hammitzsch, M., & Lendholt, M.
Geographic information and cartography for risk
(2011, May 19-20). Significance of FOSSGIS
and crisis management (Vol. 2, pp. 233-247).
in the TRIDEC Project. Paper presented at the
Berlin, Germany: Springer-Verlag.
Conference of Geoinformatics, Prague, Czech
Republic. Retrieved June 11, 2011, from http:// Ramsey, P. (2007). The state of open source GIS.
geoinformatics.fsv.cvut.cz/gwiki/Significance_ Paper presented at the annual Free and Open
of_FOSSGIS_in_the_TRIDEC_Project Source Software for Geospatial Conference,
Vancouver, BC, Canada.

463
Framework for Graphical User Interfaces of Geospatial Early Warning Systems

Refractions Research. (2010a). A GIS framework Steiniger, S., & Bocher, E. (2009). An overview
for eclipse. Retrieved October 31, 2010, from on current free and open source desktop GIS de-
http://udig.refractions.net/ velopments. International Journal of Geographi-
cal Information Science, 23(10), 1345–1370.
Refractions Research. (2010b). Developing with
doi:10.1080/13658810802634956
uDig. Retrieved October 31, 2010, from http://
udig.refractions.net/developers/ Tress, S., & Moestofa, A. (2009). uDig – An
overview of open source desktop GIS application.
Reid, J., & Martin, F. (2001). The open source
In Proceedings of the Conference of the Applied
movement and its potential in implementing
Geoinformatics for Society and Environment,
spatial data infrastructures. Paper presented at
Stuttgart, Germany.
the International Symposium on Spatial Data
Infrastructure, Melbourne, Australia. TRIDEC. (2010). Collaborative, complex, and
critical decision processes in evolving crises.
Results, I. C. T. (2010). New tsunami early warn-
Retrieved December 17, 2011, from http://www.
ing system stands guard. Retrieved August 17,
tridec-online.eu
2010, from http://cordis.europa.eu/ictresults/
index.cfm?tpl=article&ID=91371 UN. (2006). Global survey of early warning sys-
tems. In Proceedings of the Third International
Santokhee, A. (2008). GeoTools. Retrieved June
Conference on Early Warning, Bonn, Germany.
13, 2011, from http://www.resc.rdg.ac.uk/twiki/
bin/view/Resc/GeoTools Wächter, J., Fleischer, J., Häner, R., Küppers,
A., Lendholt, M., & Hammitzsch, M. (2011).
Siddiquee, Z. H., Strzalka, A., & Eicker, U.
Development of tsunami early warning systems
(2009). Publication of energy consumption data of
and future challenges. In Proceedings of the Con-
Scharnhauser Park via Web GIS. In Proceedings
ference of the European Geosciences Union: Vol.
of the Conference of the Applied Geoinformatics
13. Geophysical Research Abstracts. Retrieved
for Society and Environment, Stuttgart, Germany
December 17, 2011, from http://meetingorganizer.
(pp. 186-190). Retrieved November 3, 2010, from
copernicus.org/EGU2011/EGU2011-11015-2.pdf
http://www.gis-news.de/papers/AGSE_Proceed-
ings_2009_07_01_1025.pdf WhereGroup. (2010). Desktop GIS. Retrieved
November 3, 2010, from http://www.wheregroup.
com/en/desktop_gis

This work was previously published in the International Journal of Open Source Software and Processes (IJOSSP), Volume 3,
Issue 4, edited by Stefan Koch, pp. 49-63, copyright 2011 by IGI Publishing (an imprint of IGI Global).

464
465

Chapter 28
User-Friendly Geoportal
Interfaces for Geospatial
Resource Discovery
Victor Pascual Ayats
Spatial Data Infrastructure of Catalonia (IDEC), Spain

ABSTRACT
The Spatial Data Infrastructure of Catalonia (IDEC) was launched in 2002. From the beginning, the
Metadata Catalog (MC) service has been considered as one of the main pieces of the infrastructure.
Building a metadata catalog is important for any Spatial Data Infrastructure to foster resource interop-
erability and integration. In addition to organizing, classifying, and sorting metadata records, one of
the hardest parts in the IDEC was to design web applications that allow users to easily discover and
access such geospatial resources. This chapter reviews the different trends in building friendly user
interfaces of web applications to search and discovery metadata records through the evolution of user
interface of the IDEC Geoportal.

INTRODUCTION focuses more on the design of web applications


and how the knowledge and observation of the
This chapter attempts to explain how to create “user experience” has permitted the creation of
a web application to search metadata records more useful and effective applications in terms
focusing on the evolution experimented by the of user interfaces and functionalities.
IDEC (Spatial Data Infrastructure of Catalonia) To build a metadata catalog from a technologi-
metadata search client application over the past cal point of view means implementing a complex
8 years. The first part provides a general intro- system based on ISO1 standards (International
duction to the metadata catalog technologies Standards Office) for the metadata description
and the standardized process. The second part and Open GeoSpatial Consortium2 (OGC) speci-
fications for the catalog service implementation,
DOI: 10.4018/978-1-4666-2038-4.ch028 which allows geodata and geoservices providers

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
User-Friendly Geoportal Interfaces for Geospatial Resource Discovery

to register resources, and final users—consum- of a search engine like Google5. It has changed
ers—to locate and use these resources. the users’ behavior when using web search appli-
Nevertheless, a Spatial Data Infrastructure cations. This impact has affected all web search
(SDI) is not only a technological implementa- applications, even the metadata catalogs, and
tion, but also is a set of institutional agreements has been called “googlization” (Battelle, 2003;
and policies to identify and encourage local and Salkever, 2003). Google offers a single search
regional institutions, research centers, universi- box. Usually what we are looking for appears well
ties, and companies to describe—creating meta- positioned in the first page and it is also very fast.
data—and to share its geographic information. This does not happen in most metadata search
We might say that the IDEC has achieved some engines, despite having a smaller and very well
significant results as its catalog had more than described volume of information.
37,000 metadata records from over 200 providers Finally, the future work section will introduce
at the end of 2010. the latest improvements developed in the meta-
All of this work may go unnoticed if an SDI data search using natural language programming
is not able to build good front-end applications— techniques.
such as a shop window—to discover, find, or bind
the metadata records. We are talking about web
services and a Web Catalog Client Application OGC CATALOG SERVICES
(WCCA). SPECIFICATION
This chapter explains and justifies the changes
applied during the last 8 years to the IDEC WCCA. Among the applications, which lie at the core of
With every change and update, there is always all SDIs, perhaps the most important ones are the
the same aim of trying to facilitate access to the metadata catalogs. Implementing a catalog may
information in the most accessible and easy way be a tedious and think-less task but it is vital to
possible. All of these changes have been follow- the success of an SDI. The job basically consists
ing the same trend, decreasing the complexity and of locating all possible producers of cartographic
search options in the design of the web interface, data (departments, agencies, companies, or even
while the server processes—hidden from the end universities) and convincing them to make an
user—have been increasing exponentially. inventory of all their geographic information and
This trend in minimal user interfaces has had to describe it following a standardized procedure.
nothing to do with successive versions released by This is what we call “creating metadata.”
the specification producers such as OGC or ISO. This metadata, which is stored in eXtensible
The key factors have been: (1) improvements re- Markup Language6 (XML) encoded files accord-
lated to the design and web programming; (2) web ing to the International Standards Office (ISO)
usability and a commitment to focus on respond- standards, 19115, are loaded into the metadata
ing to user demands; and (3) a perspective change catalog so that they can be interrogated by any
from data centric offer to user centric demand. user. It has to be said that the creation of metadata
Obviously, the emergence of new web-pro- is not an easy task and involves describing fields
gramming techniques such as AJAX (Asynchro- such as map creator, creation date, revisions data,
nous JavaScript and XML) and Javascript API accessibility, information formats, quality, distri-
(Application Programming Interface) frameworks bution, and the coordinate box corresponding to
like jQuery3 and OpenLayers4 have impacted web the information described.
applications’ design and performance; however,
the biggest impact has probably been the existence

466
User-Friendly Geoportal Interfaces for Geospatial Resource Discovery

Context different geographic data from different servers in


the same environment. Each request contains a set
In an optimal scenario, all SDI users should be of specific parameters defined by the WMS speci-
able to find out what geographic information ex- fication which are understood by all map servers
ists in a particular area of the territory (catalog) that meet the specification. Thus, when we say a
in question, view this information (viewer), and map server is standard and is WMS compliant, it
finally, if it is of interest, download it (process). means that the server is capable of responding to
To achieve this goal any SDI development should, such requests, while the already mentioned WMS
adopt geospatial interoperability standards in order or others such as WCS (Web Coverage Service)
to facilitate communication between components. and WFS (Web Feature Service) are specifications
There is a consortium called the Open Geospa- related to data visualization.
tial Consortium (OGC) whose job is to decide on The OGC specification related to the discovery
the specifications to be used. OGC’s role in the of data is OGC Catalogue Services Specification.
geospatial world is not that different from W3C’s In the late 90’s the OpenGIS consortium—nowa-
role in the standardization of the World Wide Web. days Open GeoSpatial consortium—needed to
OGC is a nonprofit consortium, created in 1994 define a specification that allowed interoper-
and made up of over 400 enterprises, governmental ability among metadata catalogs focused on
agencies, and universities, which work together geospatial information. The motivation to define
towards developing open specifications. a specification was very clear due to vast amounts
The purpose of these specifications is to pro- of geospatial information resources that were
mote geospatial interoperability. In any SDI the distributed among multiple databases, stored in
interoperability between components is based on many formats, and accessed by many different
the OGC’s specifications. The success of these interfaces (Nerbert, 1999).
specifications lies in the fact that they are pro- Within the range of OpenGIS documents and
moted and created by the same companies that specifications, the Abstract Specification docu-
will later implement them in their commercial ment (OGC, 1999) describes, documents and cre-
products. In particular, an OGC specification is a ates a conceptual model sufficient enough to allow
technical document describing the communication for the creation of Implementation Specifications.
interfaces between servers and how they are to be Among them, the topic number 13 focuses on
implemented. These specifications make no men- Catalog Services.
tion of the architecture, platform, or programming The term “Catalog” is described as a “set of
languages to be used. Before being considered as service interfaces which support organization,
an OpenGIS Implementation Specification, each discovery, and access of geospatial information.
document must be developed and tested by various Catalog services help users or application soft-
working groups within the OGC and then voted on. ware to find information that exists anywhere in
One of the most commonly used OGC a distributed computing environment” (Kottman,
specifications is Web Map Service (WMS). The 1999).
purpose of WMS is to enable the superimposed Kottman (1999) defines catalogs as having
visualization of complex geographic information three essential purposes:
distributed over various servers. A client may re-
quest other servers, which are also based on this • To assist in the organization and manage-
specification in order to search for geographic ment of diverse Geospatial data and ser-
information. Once the data is found, the client vices for discovery and access,
may make use of it simultaneously and visualize

467
User-Friendly Geoportal Interfaces for Geospatial Resource Discovery

• To discover resource information from Protocol Bindings


diverse sources and gather it into a single
searchable location, and Catalog Service for the Web (CS-W) is prob-
• To provide a means of locating, retrieving ably the most used protocol binding in metadata
and storing the resources indexed by the catalogs OGC implementations. It is based on
catalog. Hypertext Transfer Protocol (HTTP) a protocol
that is widely used to exchange information on
The OGC Catalog Services Specification- ver- the web. CS-W has been available since version
sion 1.0.0 was released in 1999. It establishes a 2.0 and permits the interaction between a client
general framework to access metadata and speci- and a server using a standard request-response
fies a design pattern to define interfaces to publish model of the HTTP protocol, that is, a client sends
and search collections of descriptive information a request to a server using HTTP and expects to
(metadata). It also defines both an information receive a response to the request or an exception
model and a service model. Figure 1 shows the message. Request and response messages are
conceptual architecture proposed by OGC. encoded as keyword-value pairs within a request
From version 1.0.0 until present day, version URI or using an XML entity-body. Requests may
2.0.2, OGC has defined several protocol bindings also be embedded in a messaging framework
and application profiles of the OGC Catalog such as SOAP.
Services to meet the needs of stakeholders in a
wide variety of application domains. Table 1 shows Application Profiles
the different versions and key issues in the
specification. There are several profiles of the current OGC CS-W
binding. On the one hand, the ISO 19115/19139
metadata profile7 specifies an application profile
for ISO metadata with support for XML encoding
per ISO 191398. This application profile is widely

Figure1. Catalog service architecture

468
User-Friendly Geoportal Interfaces for Geospatial Resource Discovery

Table 1. OGC catalog services specification history

Year Version Key Issues


1999 1.0 Implementation for the OLEDB, CORBA, and ANSI
2001 1.1 Z39.50 (ISO 23950) environments.
2002 1.1.1
2003 2.0.0 HTTP Protocol Binding. CS-W (Catalog Service for the
Web)
2004 2.0.0 ISO19115/ISO19119 Application Profile for CS-W 2.0
2005 2.0.1
2005 2.0.1 Ebrim (ISO / TS 15000-3) Application profile for CS-W
2007 2.0.2
2009 2.0.2 OWL Application profile for CS-W
201? 3.0.0 OGC OpenSearh protocol binding

implemented in Europe, and has been chosen by WEB CLIENT APPLICATION


INSPIRE9 as part of the discovery services.
Om the other hand, the ebRIM (ISO / TS A web client is an application that accesses a remote
15000-3) profile10 applies the CS-W interfaces to service on another computer system, known as a
the OASIS11 ebXML registry information model server and is reliant on a common web browser
(ebRIM 3.0) so as to provide a general and flex- to render the application executable.
ible web-based registry service that enables users, The client presents a Graphical User Interface
software agents or human-to locate, access, and (GUI) that allows users to interact with the server;
make use of resources in an open, distributed a GUI represents the information and actions avail-
system. It provides mechanism to retrieve, store, able to a user through graphical icons, buttons,
and manage many kinds of resource descriptions visual indicators and typed text box navigation.
beyond ISO metadata. The actions are usually performed through direct
The application profiles constrain CS-W re- manipulation of the graphical elements.
quests and therefore the interoperability among The GUI design has been evolving based on
catalogs. A CS-W request needs to include: details trends in internet. In a web catalog client appli-
of how search criteria will be encoded; what XML cation the evolution has clearly been influenced
application profile will be used to encode meta- by the Google search engine, both in its behavior
data; and how specific metadata content will be and design.
encoded in the chosen application profile.
The future version 3 will be based on Open- First Web Catalog Client
Search12 with Geo and Time extensions as the Applications
“baseline” query operation for all profiles, which
means the complete CSW 2.0.2 implementation In the early 2000s, the web client applications
without any additional profiles or extension focused on searching metadata that showed a web
packages. interface with many search options. Many of these
user interfaces were like a one by one mapping
process between the web search options and the
core queryable elements defined by the OGC
Catalogue Service Specification (see Table 2).

469
User-Friendly Geoportal Interfaces for Geospatial Resource Discovery

Table 2. OGC catalog service common queryable elements

Name Definition
Subject The topic of the content of the resource b
Title A name given to the resource
Abstract A summary of the content of the resource
AnyText A target for full-text search of character data types in a catalogue
Format The physical or digital manifestation of the resource
Identifier An unique reference to the record within the catalogue
Modified Date on which the record was created or updated within the catalogue
Type The nature or genre of the content of the resource. Type can include general categories, genres or
aggregation levels of content.
BoundingBox A bounding box for identifying a geographic area of interest
CRS Geographic Coordinate Reference System (Authority and ID) for the BoundingBox
Association Complete statement of a one-to-one relationship

Figure 2 shows the web catalog client of IDEC up to eight concatenated different filters. So users
in 2003. Notice within the GUI you can see up to were asking what exist about this place? Instead
18 boxes text that the user could fill out before to of What is in this place?
launch a query. We can separate a catalog system in two sides:
The web application was structured and de- the server side, which contains the database, the
signed following the three classic search op- metadata records, and the client side, which is the
tions—WHAT, WHERE and WHEN—that are web client application. It is possible to observe that
heritage from applications used mainly in librar- these client applications delegate the complexity
ies (Baldonado, 1997). Looking at this web user to the client side and therefore to the end user.
interface we may notice that, obviously, these The server side, does not have to interpret the
kind of application were focused on a small set request, it just has to send it directly to the catalog
of expert users, who might build a request with as a mapping translation by just replacing values.

Figure 2. IDEC catalog web client: year 2003

470
User-Friendly Geoportal Interfaces for Geospatial Resource Discovery

This way of proceeding provides a few options to keywords. The “WHEN” option was removed
potential users, not experts, and most importantly, from the user interface because it was rarely used
it provides high quality information, since the user by users. The map is presented as a complex tool
only finds what has been accurately described in with many options, and with the unique purpose
the metadata. of populating the coordinates text boxes within
This can work in an environment such as the WHERE section.
libraries, for example, where a small group of Figure 3 shows a first evolution of the previ-
people are responsible for describing informa- ous design (see Figure 2). This reduces the text
tion. But how does it work in an SDI distributed boxes 18 to 11 and it begins to show the map as
environment? And when do you have more than important piece of the GUI.
200 metadata producers? These changes were made, more or less, in an
intuitive way, just attending to feedback received
First Attempt at Simplification from some well-known users, but a year after the
change (2006) we note that visits to the catalog
At the end of 2005, the IDEC catalog systems had just grown a little. We decided to develop a
implemented the version 2 of the OGC speci- study about user behavior.
fication with the Ebrim application profile and
the CS-W HTTP protocol binding. The CS-W
protocol facilitates the possibility to promote the A WEB SEARCHING STUDY
catalog as a web service, which means to offer the
possibility to query the catalog without needing Web searching studies fall into three categories: (1)
to enter into the IDEC website. those that primarily use transaction-log analysis,
At the client application level, this is a first (2) those that incorporate users in a laboratory
attempt to “simplify” the user interface, by adding survey or other experimental setting, and (3) those
a new search option called “quick search.” In fact, that examine issues related to or affecting Web
this option simultaneously searches several fields searching (Bernard, 2003).
of metadata: title, description, purpose, scale, and

Figure 3. IDEC catalog web client: year 2005

471
User-Friendly Geoportal Interfaces for Geospatial Resource Discovery

In 2007, an internal study was performed, queries made by combining more than two search
based on use transaction-log analysis—all user options. A query example that would be made
actions are stored in log files—and focusing on with zero results found:
the following research questions: FREETEXT {Enology} + KeyWord {Base
Mapping} + ISOThemes {Ecology}
• What are the search options used by the Indeed, there was not any metadata record that
user? met these conditions, but there were other metadata
• Does the user combine them? related to Ecology and Enology that could be of
• What is the number result pages queried? interest to the user and were never found.
• What are the trends in search topics? Second, there was an extensive use of the
• Does the user find what he is looking for? “quick search” option and more than 66% of the
inputs referred to place names. This meant, for
We defined 11 variables (see Table 3), using example that although there are metadata from a
them to classify 5,000 sessions. particular place, if the place name did not appear
Trying to determine how the user searches for explicitly written in the metadata, than there was
information: If he uses a single text box or com- no way to find it, unless a direct coordinate—
bines search options and especially if he uses the spatial—search was used. This only happened
map as a spatial interface. Another variable referred in 11% of the requests. Notice that probably the
to categorizing the results and whether the user most powerful searches supported by an OGC
found what he was looking for; to determine if catalog are the spatial ones, such as those based
the user go forward to see more search results on a bounding box.
(pagination) or used the print option. The final Third, 42% of the searches contain the search
purpose was to determine if the catalog was a option “by organization,” and fourth, 84% of
useful tool, and the conclusion was a negative users only view one result page. This trend will
response. inform us how persistent searchers are when
Notice that the study revealed some very in- locating information or services on the catalog.
teresting issues. First of all, 32% of the requests Table 4 shows percentages in the search options
return zero results. Most of these requests were used by the users.

Table 3. Variables defined in the web study

Variable Definition
Single search option Use only “quick search option”
Spatial search option Use only spatial search using the Map
Combined search options Combine several search options
Combined+ spatial search option Combine several search option but using the Map.
Due Pagination Perform pagination or not
Query metadata details Query the candidates or not
Search metadata Services Interested in services
Search metadata Datasets Interested in Datasets.
Search category Evaluating search option try to classify the search in topics
User IP address User origin (University, Government,...)
User Type Classify the user depends precedence (Public sector, research,..

472
User-Friendly Geoportal Interfaces for Geospatial Resource Discovery

Table 4. Interoperability levels in the literature


pened in many catalog client implementations.
Search options % used request The unity of the interface is important because
parameters it has the potential to link concepts and objects
Quick search 69.63% together (Bernard, 2003).
By Organizations 42.01% A second step was to reduce the number of
By Map (coordinates) 11.28% search options to 4 and permit a direct search us-
By Keywords 23.34% ing one of them. These options were in some way
By IsoTopics 35.12% “controlled”—using suggestions list—in order to
avoid possible no result found.
The four search options were:
The final conclusions of the study were the
following: • By place (WHERE): this option offers
a suggestion list while the user is typing
• Most people only use the simple text box a place name, retrieved from a gazetteer
like Google. service. The purpose was to translate the
• Most users will not pass the first page (no place name into concrete coordinate posi-
pagination). tions and request the catalog in a spatial
• Many requests contain references to a way, so that the metadata does not contain
place name. specific place names, but the geographical
• Expert users who combine more than two extent belonging to that place.
or three search options obtain unsatisfac- • By keyword (WHAT): as by place, this
tory results. option offers a suggestion list from an in-
• The information contained in some meta- dexed table that indexes not only keywords
data records were wrong or, in some cases, but all the terms contained in the metadata.
limited. This allows searching for any word, not
• Many of these do not contain explicit refer- only keywords.
ences to their geographical location. • By Organization: this option shows a bul-
leted list with a hyperlink based text for
each organization (provider) and the num-
ACTIONS FOR FRIENDLY ber of metadata from datasets, series and
USER INTERFACE services available in the catalog. A user
just need one click to access the metadata
Looking at the conclusions and adding some other provider. As mentioned by Bernard (2003)
inputs received from end users and metadata pro- a bulleted list of links is useful, for instance
viders, we decided to create guidelines in order to in a drop-down list.
improve the catalog system performance. • By Map: The map is considered as a search
option too, not as before when the map was
Make the Search More only a “tool” used to obtain bounding box
Intuitive, Useful, and Promote coordinates. With just one click on the map
the Spatial Search or by drawing a box, a spatial request is
sent to the catalog.
The first step was trying to maintain the unity of
the interface and, for instance, not showing the Figure 4 shows how these four search op-
result candidates in a separate window as hap- tions were localized among the web catalog cli-

473
User-Friendly Geoportal Interfaces for Geospatial Resource Discovery

Figure 4. IDEC catalog web client with 4 search options

ent application. As you see “by place” and “by sponse content information, avoiding again the “no
keyword” options are top-right side, while “by results found” message. The results were drawn
organization” is at the bottom-right side and “by in the map as rectangles, showing the extent that
map” is in left side. was described in the metadata record. The map
rectangles and candidates are both connected, so
Making the Candidates’ Response it is possible to select a rectangle and to highlight
More Useful the candidate and vice versa.
Another important point was when the results
The candidates’ response found in a request are contains metadata from services—for instant
shown as a link list with summaries and intuitive OGC Web Map Service (WMS)—allowing the
icons, as Bernard (2003) mentions, “the summary user to connect the service and more importantly
condition was perceived as being the easiest in to facilitate information about the availability of
finding information, being visually pleasing, pro- the service. To do this a service monitoring tool
moting comprehension, participants’ satisfaction called Service Status Checker (SSC) was used.
with the site, and looking professional.” The SSC is a tool for monitoring services,
The results may be filtered by useful topics which was designed by the U.S. Federal Geo-
such as: scale, format, time, provider, or just free graphic Data Committee13 (FGDC) to test the
text. Notice that the values of these filters—less geospatial Web services such as WMS, WFS,
free text option—are populated depending on re- ArcIMS services, and Z39.50 (FGDC, 2010).

474
User-Friendly Geoportal Interfaces for Geospatial Resource Discovery

The SSC provides real-time tests (accepts XML Minimize the Response Time
requests, perform testing, and immediately returns
test results in XML) and file tests (tests performed To achieve this objective, the use of the CS-W
daily services previously recorded and stored the protocol binding as a message request-response
results in a database). between client and server has been discarded.
To access this second service, servers need to This does not affect the interoperability of the
be previously registered. The service can offer catalog, which remains accessible to third parties
two important facts regarding the WMS server: via CS-W as a web service.
The CS-W implies the use of XML meta-
• Availability: Indicates whether the service language to build requests and responses. In par-
is working or not. ticular, a web client send request parameters to a
• Reliability: Measures the percentage of middle ware service that builds a CS-W-compliant
successful responses. An unsatisfactory re- XML request and sends it to the server. Then, the
sponse is any slow or incorrect response. server receives the XML request, parses it to build
a Structured Query Language (SQL) sentence to
Figure 5 shows a candidate response and the the database, and translates the responses database
linkage existing between candidates and the map

Figure 5. IDEC catalog web candidates and filter options

475
User-Friendly Geoportal Interfaces for Geospatial Resource Discovery

Figure 6. New conceptual architecture

to CS-W-compliant XML responses and sends it Extending the Concept and


back to the client. Implementation of the Catalog
The best way to eliminate steps and minimize
the response time would be: (1) to use Structured They are a small set of actions aimed to promote
Query Language14 (SQL) sentences directly to the use of the catalog:
request the catalog’s database: (2) to use JavaS-
cript Object Notation15 (JSON) message format to • Provide additional channels to disseminate
interchange messages between client and server the catalog results such as syndication RSS
(see Figure 6). JSON is lighter and easier to ma- and visualization with Google Earth.
nipulable than CS-W xml. • Provide an OpenSearch geo16 wrapper to in-
tegrate the catalog in the third application.
Review and Improve the Quality and • Facilitate searching several online catalogs
Identification of Important Details of simultaneously. This is why we developed
Metadata the software CatalogConnector17, which
is a Java servlet application that sends re-
A review quality protocol has been setup. Basi- quests to several OGC CSW catalogs, pro-
cally, its function is to review whether the meta- cesses the responses and shows them in a
data has been described properly and assure that unitary way. It supports different CSW ver-
there is consistency in the information provided. sions (2.0.0, 2.0.1, 2.0.2) and application
For instance, some of the controls established by profile implementations (ebRIM and ISO)
the protocol that look at “title” and “description” and it also implements OpenSearch-Geo
metadata items are self-explanatory and discour- interfaces.
age the use of codes or abbreviations in these • Use the catalog to search other resources
items. Another control is related to bounding box related to geospatial data. Thanks to the
definition. A municipality metadata cannot have ebRIM application profile it is possible to
regional bounding box coordinates, for instance. catalog other kinds of resources such as

476
User-Friendly Geoportal Interfaces for Geospatial Resource Discovery

sensors, images data models or schema. So on the client side. So it went from a value of -8 in
it is feasible to set up specialized catalogs. 2002 to a value of + 4 in 2010. In other words, the
value +4 means that there are four search options
SERVER SIDE VS. CLIENT in web application and 8 server processes to deal
SIDE COMPLEXITY and setup these 4 search options.
For the search by place option, it has been
Reducing the search options to four (see Figure necessary to implement a gazetteer server and a
4) in the web client side has meant a significant process to build spatial queries from a point (x,
increase in the server process. y coordinates). This means that when a user re-
There is a clear relationship between usability quests for the place called “X,” the request that
and simplifying the search in the client side and will be sent to the catalog server will contain the
a growing complexity and concatenation of pro- coordinates of “X” place not the string “X.”
cesses in the server. The tendency to create web For the search by keyword indexing processes
applications that are easier to use and focused have been created to relieve significant informa-
on the “web-mass” implies building a complex tion from metadata and eliminate duplicated
system; on the server side to understand and to keywords. For the organization list and filters to
answer the user properly. candidates, statistical processes based on unique
One way to quantify this trend is to subtract values categorizations have been created. In this
the number of processes on the server side and the way, it is possible to offer the user a link to a text
number of search options on the client side. A posi- list with relevant information about the number
tive result means that there are more processes on of records provided by providers. All of this is
the server and more complexity, whereas a negative available without any click by the user. Finally,
result means more complexity at the client side statistical processes based on unique values cat-
of the application. Figure 7 shows the evolution egorizations have been created to start automati-
between server processes and web client search cally processes each time the catalog has been
options, from 2002 to 2010. The IDEC catalog updated with new metadata records in order to
has had clear tendencies to increase the server keep the catalog synchronized.
side processes and decrease the search options

Figure 7. Server side and client side complexity

477
User-Friendly Geoportal Interfaces for Geospatial Resource Discovery

FUTURE WORK: A WEB The use of these techniques should provide


SEMANTIC APPROACH more reliable results and permit the non-expert
user to interact normally with a metadata catalog.
Nowadays there are two objectives that must be Another key issue will be the new versions
filled, simplifying the client web application, of the OGC Catalog Specification. The fact that
joining the search boxes “by place” and “by this future version will be based on OpenSearch
keyword” in a single one, and providing mecha- application profile, must allow a better interoper-
nisms to understanding what the user is looking ability among catalogs and the most important, the
for. To achieve this, it is mandatory to provide efforts made implementing semantic techniques,
the catalog with some semantic capabilities and will not remain in only a web application domain
more complexity on the server side. and may be re-used as a OGC standard catalog
Basically, we are testing several approaches service.
using semantic techniques dealing with text
strings, such as:
REFERENCES
• The stemming process reducing each word
to its stem or root (not exactly their mot- Baldonado, M., Chen-Chuan, K., Gravano,
to lexicon that would be a pure linguistic L., & Paepcke, A. (1997). The stanford digital
process, but rather an algorithmic known library metadata architecture. International
approach). Journal on Digital Libraries, 1(2). doi:10.1007/
• Removing stop words (stop words) such s007990050008
as prepositions, adverbs, conjunctive, and Battelle, J. (2003). The creeping googlization
disjunctive clauses and ultimately all those meme. BusinessWeek Online. Retrieved May
words that do not provide information in 10, 2011, from http://battellemedia.com /ar-
the context of the problem. chives/000145.php.
• Calculating the distance between what us-
ers search for and the content indexed. Bernard, J., & Spink, A. (2005). How are we
• Automatic place names recognition— searching the world wide web? A comparison of
geoparsing—and connection with gazet- nine search engine transaction logs. Retrieved
teer service—translating a places name May 10, 2011, from http://citeseerx.ist.psu.edu
into coordinates. /viewdoc/download?doi=10.1.1.90.4299&rep=r
• Decomposing the request into bigrams ep1&type=pdf.
(two words) and providing a scoring sys-
Bernard, M. (2003). Optimal web design. Re-
tem for matching. For instance, the fol-
trieved May 10, 2011 from http://uwf.edu /ddaw-
lowing search “ Barcelona’s water quality”
son/d3net/documents/ web_usability/optimal%20
will be searched as:
web%20design.pdf.
◦◦ Exacts string “Barcelona’s water
quality.” ISO 19115. (2003). Geographic information -
◦◦ By each bigram “water quality” and Metadata. ISO 19115:2003. Geneva, Switzerland:
“water Barcelona.” International Organization for Standardization
◦◦ By every word “Barcelona,” “water,” (ISO).
“quality.”
◦◦ By coordinates of Barcelona and
bigrams.

478
User-Friendly Geoportal Interfaces for Geospatial Resource Discovery

Kottman, C. (1999). The OpenGIS abstract Voges, U., & Senkler, K. (2004). ISO19115/
specification: Topic13: Catalog services, ver 4.0. ISO19119 application profile for CSW 2.0 (CAT2
OpenGIS Project Document 99-113. New York, AP ISO19115/19). OpenGIS Project Document
NY: OpenGIS Consortium Inc. 04-038r2. New York, NY: OpenGeoSpatial Con-
sortium Inc.
Martell, R. (2005). OpenGIS catalogue services
- ebRIM (ISO/TS 15000-3) profile of CSW. Open-
GIS Project Document 05025r3. New York, NY:
ENDNOTES
OpenGeoSpatial Consortium Inc.
Nebert, D. (2002). OpenGIS catalog services
1
International Standards Office, http://www.
specification, ver 1.1.1. OpenGIS project docu- iso.org
ment 02-087r3. New York, NY: Open GIS Con-
2
GeoSpatial Consortium, http://www.open-
sortium Inc. geospatial.org/
3
Jquery, http://jquery.com/
Nebert, D. (2004). OpenGIS - Catalogue services 4
OpenLayers, http://openlayers.org
specification, ver 2.0. OpenGIS Project Document 5
Google Search, http://www.google.com
04-021. New York, NY: Open GIS Consortium Inc. 6
eXtensible Markup Language, http://www.
Nebert, D. (2004). Developing spatial data in- w3.org/XML/
frastructures: The SDI cookbook. Retrieved May
7
ISO 19115/19139 metadata profile,
10, 2011, from http://www.gsdi.org /docs2004/ http://www.iso.org/iso/catalogue_detail.
Cookbook/ cookbookV2.0.pdf. htm?csnumber=32557
8
ISO 19139, http://www.iso.org/iso/cata-
Nebert, D., & Whiteside, A. (2005). OpenGIS logue_detail.htm?csnumber=32557
- Catalogue services specification, ver 2.0.1. 9
Infrastructure for Spatial Information in the
OpenGIS Project Document 04-021r3. New York, European Community, http://www.inspire-
NY: OpenGeoSpatial Consortium Inc. geoportal.eu/
Nebert, D., Whiteside, A., & Vretanos, P. (2007).
10
Ebrim (ISO / TS 15000-3) profile, http://
OpenGIS - Catalogue services specification, ver portal.opengeospatial.org/files?artifact_
2.0.2. OpenGIS Project Document 07-006r1. New id=31137
York, NY: OpenGeoSpatial Consortium Inc.
11
Organization for the Advancement of Struc-
tured Information Standards, http://www.
Nogueras-Iso, J., Zarazaga-Soria, F. J., Béjar, R., oasis-open.org
Álvarez, P. J., & Muro-Medrano, P. (2005). OGC 12
OpenSearch interface, http://www.open-
catalog services: A key element for the develop- search.org/Home
ment of spatial data infrastructures. Computers 13
Federal Geographic Data Committee, http://
& Geosciences, 31(2), 199–209. doi:10.1016/j. www.fgdc.gov
cageo.2004.05.015 14
Structured query language, http://
Salkever, A. (2003). Google here, there, and ev- en.wikipedia.org/wiki/SQL
erywhere. BusinessWeek Online. Retrieved May
15
JavaScript Object Notation, http://www.json.
10, 2011, from http://www.businessweek.com / org/
technology/content/dec2003/ tc20031216_9018_
16
OpenSearch geo, http://www.opensearch.
tc047.htm. org/Specifications/OpenSearch/Extensions/
Geo/1.0/Draft_1

This work was previously published in Discovery of Geospatial Resources: Methodologies, Technologies, and Emergent Ap-
plications, edited by Laura Díaz, Carlos Granell and Joaquín Huerta, pp. 261-276, copyright 2012 by Information Science
Reference (an imprint of IGI Global).
479
480

Chapter 29
Development and Application
of a Spreadsheet-Based Spatial
Decision Support System (SDSS)
Dossay Oryspayev
University of Northern Iowa, USA

Ramanathan Sugumaran
University of Northern Iowa, USA

John DeGroote
University of Northern Iowa, USA

ABSTRACT
Spatial decision support systems (SDSS) are decision support tools which have been used widely in ad-
dressing complicated issues involving a spatial component. The use of SDSS has increased greatly over
the last few decades especially in fields such as planning, natural resources management, and environ-
mental science. Traditionally, SDSS have been developed with Geographic Information Systems (GIS)
technology as a major component and used in application areas in which the use of GIS technology
has been common. GIS software is often expensive and requires significant expertise, which can lead
to under-utilization of GIS-based SDSS. In this paper, we describe the development of a freely avail-
able SDSS extension developed for Microsoft Excel, a very commonly used spreadsheet application.
The purpose of this SDSS is to expand potential SDSS use to a wider potential audience for research,
management, and teaching purposes.

INTRODUCTION computer systems that support decision makers in


addressing semistructured or unstructured spatial
Spatial decision support systems (SDSS) are problems in an interactive and iterative way with
decision support systems (DSS) which utilize functionality for handling spatial and nonspatial
geographical or spatial data. Sugumaran and De- databases, analytical modeling capabilities, deci-
Groote (2010, p. 14) defined SDSS as “integrated sion support utilities such as scenario analysis,
and effective data and information presentation
DOI: 10.4018/978-1-4666-2038-4.ch029 utilities.” Research on SDSS has been strongly

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Development and Application of a Spreadsheet-Based Spatial Decision Support System (SDSS)

influenced by that of DSS (Malczewski, 1999) and which can range in cost from several hundred to
has followed advances in geospatial technologies; many thousands of US dollars.
especially Geographic Information Systems (GIS) Due to the GIS-centric nature of SDSS, adop-
software advances (Sugumaran & DeGroote, tion has naturally occurred in disciplines which
2010). The use of geospatial techniques and were also adopters of geospatial technologies. In
technologies such as GIS, remote sensing, and a review of SDSS publications, Sugumaran and
Global Positioning Systems (GPS) has grown DeGroote (2010) listed the application of SDSS
tremendously over the last several decades. As from most to least frequent being natural resources
Keenan (2006) pointed out, many phenomena have management, environmental, urban, agriculture,
a geographic component which can be captured, emergency planning/hazard analysis, transporta-
described, and analyzed in GIS which has led to tion, business, utility/communications/energy,
its playing an important role in a wide range of and public health. Malczewski (2006) found
fields. The development and utilization of SDSS that what he classified as environment/ecology
technologies grew tremendously from the mid- was the most common application area. Fields
1990s based on the number of articles published such as natural resources management were early
on the subject (Malczewski, 2006; Sugumaran & adopters of GIS and remote sensing technologies.
DeGroote, 2010). This followed a similar trend in Other fields such as business and public health
the uptake of geospatial technologies. Drivers of have been later adopters of technologies such as
this increased use were more powerful desktop GIS and thus have had slower uptake of SDSS.
computers, the development of GIS software Some of the reasons for lack of adoption are the
with more friendly graphical user interfaces (e.g., lack of spatial data knowledge of the users, lack
ArcView (ESRI, 1996)), as well as built-in GIS of training and education for these technologies
development environments (e.g., Avenue which within their fields, and the expensiveness of GIS
was bundled with ArcView (ESRI, 1996)). systems. Uran and Janssen (2003) analyzed five
The vast majority of SDSS applications devel- SDSS applications and found one of the major
oped to date have had GIS as a major component drawbacks to effective use was that they required
of the architecture. The architectures of SDSS a high level of GIS expertise in order to carry out
have varied with some of them being developed operations in the SDSS. They specifically indi-
entirely within the GIS software (embedded), some cated that the outputs of the model (spatial data)
with a tight coupling (common user-interface) still required further spatial processing in order
between GIS and other modeling or mutli-criteria to properly evaluate results. This type of situation
evaluation software, or a loose coupling between limits effective use of SDSS to GIS experts.
GIS and other components through formatting Although spatial data processing, analysis,
and exchange of data files. In a review of SDSS management, and visualization functionality are
composed of GIS and multi-criteria decision often essential for SDSS, a given SDSS often uti-
analysis (GIS-MCDA) techniques, Malczewski lizes a mere fraction of the tools available in GIS
(2006) found that loose coupling of systems was software. However, as the GIS software have been
the most common method of integration. Thus, developed to offer a wide-range of functionality
the development and application of SDSS has for use in many disciplines, the interfaces and tools
generally required possession of GIS software will not necessarily be intuitive for non-expert
and experience in using such software. While users. Thus for many SDSS, unless the users are
open-source and free GIS software are becoming experienced GIS users, or the developers invest
more widely available, the majority of SDSS have in the careful design of user interfaces and tools
been developed using commercial GIS software for ease of use, the SDSS might be under-utilized.

481
Development and Application of a Spreadsheet-Based Spatial Decision Support System (SDSS)

There are two main data models utilized for Most GIS software will have functionality and
storing and displaying spatial data in GIS soft- mechanisms for handling both raster and vec-
ware. These are vector and raster. In a vector tor data models often with vector functionality
data model, entities in the real world are divided making up the majority of tools available. The
into clearly defined features which are based on major commercial GIS software applications
point, line, and polygon geometry (Sugumaran from companies such as ESRI (http://www.esri.
& DeGroote, 2010). The simplest vector feature com), Intergraph (http://www.intergraph.com),
is a point which is defined by x, y (and possibly and Bentley (http://www.bentley.com/en-US/)
z) coordinates. Line and polygon geometry are are vector-based although they usually have ex-
defined by points and connecting line segments tensions for raster data handling. In his review
with the polygon having the beginning and end of 319 articles detailing GIS-MCDA (type of
point at the same point. The raster data model on SDSS) applications, Malczewski (2006) reported
the other hand divides space into an array of (usu- approximately equal number of studies utilizing
ally square) cells (Longley, Goodchild, Maguire, raster and vector data models with some of these
& Rhind, 2011) with each cell having a value that using both data models. The simple nature of the
represents some aspect of the real world (e.g., land raster data model is similar to the structure of
use, elevation, precipitation amount, etc.). Both other data concepts (e.g., a matrix) and the way
of these data models have advantages and disad- other computer applications (e.g., spreadsheets
vantages in terms of data storage, management, and databases) store data. Although spreadsheet
and analysis. The vector data model is better for software were not developed with the idea of
topological or network analyses, provides explicit storing spatial data, they are capable of being
geographic representation at all scales, allows formatted so that the cells are square with each
for production of high quality cartographic prod- cell holding a single value similar to the raster
ucts, and allows sophisticated attribute handling. data model in GIS software. One of the most
However, the vector data model has complex widely used spreadsheet applications, Microsoft
data structures, can require significant computer Excel, provides excellent tools for resizing cells
resources for processing and display, and requires (square) and for changing the representation (e.g.,
expensive data collection techniques (Burrough & color, pattern, texture) of each cell according to
McDonnell, 2000; Lo & Yeung, 2002). The raster the value stored. Microsoft Excel has a wealth
data model has the advantages of being easy to of mathematical, logical, and other functional-
understand, allows rapid reading and analysis, and ity that can be carried out on a cell-by-cell basis
is easily displayed on devices that display pixels and based on examination of neighboring cells.
(Clarke, 2011). The raster model is often used for Although these attributes/tools do not approach
remotely sensed imagery collected from airplanes the level of cartographic capabilities present in
and satellites. Image processing software allows GIS software, they do provide the possibility of
the production of derived data products from this mapping both input and modeled values in a map.
remotely sensed imagery which also is in a raster The main advantages of using spreadsheet soft-
data format. The raster data model is insufficient ware such as Microsoft Excel are that the software
for representing each individual entity from the real is much less expensive and is much more widely
world (e.g., signs, trees, fire hydrants). However, used throughout the world. There are a broad
the raster data model is useful for storing and dis- range of potential users that are familiar with the
playing phenomena which vary continuously over functionality of Microsoft Excel that also might
space such as elevation or topography, rainfall, or be interested in analysis of some data that has a
categorical representations of land use. spatial component. For these reasons, spreadsheet

482
Development and Application of a Spreadsheet-Based Spatial Decision Support System (SDSS)

software such as Microsoft Excel is conducive to user interface, familiarity to students, low cost,
the development of SDSS which utilize raster data and easy access to viewing code or macros used
models and which do not require the extensive to calculate outputs. The authors pointed out that
set of spatial data, management, and analysis spreadsheet applications could be especially well
functionality present in expensive GIS software. suited for SDSS using cellular automata models.
This article is not the first to propose the idea Indeed, Microsoft Excel had already been used in
of implementing spatially-explicit modeling or a coupled SDSS system utilizing cellular automata
the development of an SDSS using spreadsheets. methods (Engelen, White, Uljee, & Drazan, 1995).
As early as 1991 (Ali, Ross, & Beveridge, 1991) A custom application with a vegetation dy-
the idea was put forward in a paper describing the namics model combined with models of envi-
implementation of a system for assessing spatial ronmental determinants was developed within
parameters related to potential fish aquaculture in Microsoft Excel (Berardi, 2002). The purpose of
Pakistan. The authors transformed hard-copy map the application was to predict vegetation change
data for climatic, topography, transportation, and within the Astroni crater, a state nature reserve
market characteristics into a series of separate grids near Naples, Italy. The application was developed
which stored values representing the suitability using the Visual Basic programming language and
for each cell for aquaculture. Using spreadsheet included simulation models, a spatial data visual-
functionality, they weighted each separate grid ization facility, and user friendly interfaces. The
and combined them in order to come up with a choice of Microsoft Excel was made because it
final grid representing potential suitability for was commonly-used software that was relatively
aquaculture in each square. The authors concluded inexpensive and had user friendly interfaces both
that their approach was a cheap and effective for usage and programming through Visual Basic
alternative to using GIS. (VB). The conclusion was that it is not necessary
In 1997, a paper describing an approach to to use expensive and complex software to develop
teaching raster GIS operations using spread- modeling systems for ecosystem management.
sheets was published in the Journal of Geography Spatially explicit modeling capabilities within
(Raubal, Gaupmann, & Kuhn, 1997). The authors spreadsheet applications have continued to be de-
proposed and demonstrated the usage of spread- veloped within spreadsheet applications, especial-
sheets for spatial analysis (SDSS) using Microsoft ly using Microsoft Excel. Li, Shen, and Li (2004)
Excel. They proposed the use of Excel for execut- developed an SDSS which was based in Microsoft
ing common categories of raster operations used Excel that embedded mapping functionality from
in GIS software such as local, neighborhood, and MapObjects. The natural analogy between raster-
zonal functions. They demonstrated the use of based data structures and analysis techniques in
simple overlay in which two grids (precipitation both GIS and spreadsheets is described in a paper
and elevation) were combined on a cell-by-cell for the 2009 International Conference on System
basis based on some constraints and an output grid Sciences (Keisler, Blake, & Wagner, 2009). This
representing suitability for fir (type of tree) growth paper presented two examples of analytical models
was calculated. Their overall goal was to develop built within Microsoft Excel. The first example
tools that could be used in introductory courses for was an Excel-based tool used to assist adminis-
teaching raster operations. They wanted the tools trators in deciding on a good location for a new
to allow users to easily manipulate the raster data parking facility at a university while the second
and additionally let the user interactively analyze was a model for examining potential locations
the operations in a process. The advantages to us- for a retail store. While the authors acknowledge
ing the spreadsheet software included the simple limitation of spreadsheet technology for spatial

483
Development and Application of a Spreadsheet-Based Spatial Decision Support System (SDSS)

analysis in comparison to GIS software, they argue Spreadsheet-SDSS and the techniques used, give
that for non-GIS experts spreadsheet technologies an overview of the user interface, explain the com-
provide a less-complex and relatively inexpensive ponents of the extension, and finally use a case
alternative which allows for greater intra- and study to demonstrate the functionality available.
inter-organizational collaboration. Specifically
they mention that millions of business and other
spreadsheet users would be exposed to spatial DEVELOPMENT OF SPREADSHEET-
processing functionality that they otherwise likely SDSS EXTENSION
would not be. They also point out that there are
great opportunities for these types of applications The Spreadsheet-SDSS was developed using
to be used in teaching settings. The usefulness the Visual C# programming language, using
of spreadsheet-based decision support tools with tools from Microsoft such as Visual Studio (VS)
spatially-explicit capabilities has been demon- 2008, Visual Studio Tools for Office (VSTO),
strated. However, all of the applications described and Microsoft Office Excel 2007. Figure 1 shows
were developed for very specific purposes and the general architecture and development tools
would not be applicable in other disciplines. To used for Spreadsheet-SDSS. When installed, the
this point, as known to the authors, there have extension can be added to Excel as a new ribbon.
been no generic SDSS tools developed within a The Spreadsheet-SDSS extension was originally
spreadsheet environment. These types of tools created as an add-in to the Microsoft Office 2007
could be used by a wide variety of disciplines. environment.
The goal of this paper is to describe the de- In Microsoft Excel 2007, the process of adding
velopment, implementation and application of a new tabs and groups to the ribbon was not avail-
Microsoft Excel extension that can be used for able as an option from inside the Excel environ-
spatial decision support. This extension is called ment itself directly. Rather, the user had to go
Spreadsheet-SDSS. The following sections will through outside options to do this which were not
describe the rationale behind the development of very efficient. In order to package the tools for

Figure 1. Overall architecture and development environments used for Spreadsheet-SDSS

484
Development and Application of a Spreadsheet-Based Spatial Decision Support System (SDSS)

easier installation, we utilized VS 2008 with VSTO from many other raster data formats using open-
support to do this in a much more efficient way. source and free software such as the Geospatial
However, in Microsoft Office 2010 customization Data Abstraction Library (GDAL, 2010), which
of the ribbon is considerably easier. Users can is a translator library for geospatial data formats.
add custom tabs and groups of his/her own choice In the future, functionality for importing other
and associate them with their own macros using raster data formats will be made available within
the Visual Basic for Applications development Spreadsheet-SDSS.
environment. We have tested and verified that our Several types of modeling operations which
add-in properly works in a Microsoft Office 2010 have been used in numerous SDSS have been
environment also. implemented in Spreadsheet-SDSS. The first and
Microsoft Excel was chosen as it is one of simplest method is Boolean Overlay including the
the most popular spreadsheet based applications following Boolean operators: AND, OR, NOT, and
used around the world. It is used in a very wide XOR. An in-depth discussion of Boolean opera-
range of disciplines and has extensive math- tions is given by Malczewski (1999). Boolean
ematical, statistical, financial, logical and other Overlays are often carried out using raster data in
functionality. It also has capabilities that make it GIS environments for combining multiple layers
amenable to the use of cell-based raster spatial for applications such as land suitability analysis.
data. Specifically it has the capability for manu- An example SDSS application using Boolean
ally, or through programming, resizing the width Overlay was carried out in locating a new hotel
and height of the cells in order that square cells project based on transportation features, land
can be utilized for raster representation. It also use, elevation, existing tourist facilities, and the
has a zoom functionality that lets a user view a coastline in Mauritius (Beedasy & Whyatt, 1999).
small or large number of cells. This functionality The second major class of modeling operation
approximates zooming functionality in mapping implemented is Weighted Linear Combination
and GIS software. (WLC) which is a map overlay technique that at-
There have been a significant number of tempts to capture the value judgments and expert
discussions concerning what functionality an opinions within the spatial analysis procedures by
architecture must have before being considered the assigning of weights of relative importance
an SDSS (Densham & Goodchild, 1989; Keenan, to each map layer (Malczewski 1999; Sugumaran
2003; Sugumaran & DeGroote, 2010). For the & DeGroote, 2010). In the hotel example given
purpose of this paper we are considering an SDSS above, higher weights would be given to layers
to consist of at least the following components: considered more important resulting in output that
isn’t expressed as suitable/unsuitable but which
• Database management utilities has a range of scores indicating the level of suit-
• Modeling capability ability. A WLC technique was used in an SDSS for
• Dialog management environmental planning and watershed manage-
• Map display and report generation utilities ment in Missouri, USA (Sugumaran, Meyer, &
Davis, 2004). A more complicated method called
In the Spreadsheet-SDSS, database manage- Ordered Weighted Averaging (OWA) has also been
ment tools were created to allow the importation implemented. The OWA method, as implemented
of ESRI ASCII raster files (ESRI, 2011b). These in Spreadsheet-SDSS, was originally described by
types of files are one of the most common distri- Yager (1988) and is explained for spatial contexts
bution formats for raster spatial data. Also ESRI in greater detail in other publications (Jiang & East-
ASCII raster files (ESRI, 2011b) can be created man, 2000; Malczewski, 1999; Malczewski et al.,

485
Development and Application of a Spreadsheet-Based Spatial Decision Support System (SDSS)

2003). In addition to the above listed models, we for specific cell values. It is envisioned that the
have implemented weight estimating methods. An report generation utility will be enhanced in later
example is the Analytical Hierarchical Processing versions of the extension.
(AHP) method, which is a pairwise comparison
method (Malczewski, 1999). This method has USER INTERFACE OF THE
the user set the relative importance of different SPREADSHEET-SDSS EXTENSION
factors as compared to each other and then uses
an algorithm to set weights for the factors for In Figure 2, the user interface of Spreadsheet-
use in a model such as WLC. So for example, in SDSS is shown as a Microsoft Excel Add-In. The
relation to the hotel example mentioned above, extension is divided into five categories: Data,
the user might indicate distance to coastline is Model, Map, Navigation, and Help.
more important than the distance to other tourist The first tool (Add Layers) on the Data section
facilities and this will be reflected in how the AHP allows users to import ESRI ASCII raster files
algorithm assigns the weight for the respective (EARF, 2011) into Excel. The second tool (Change
layers. Numerous studies have used AHP and Colors) allows users to manipulate the way the
WLC methods. For example these techniques layer is displayed (i.e., choose color schemes).
were used by Strager and Rosenberger (2006) for The Change Values tool allows the user to reclas-
identifying priority areas for land conservation. sify values in the layer. In the Model section, the
The user interface or dialog management com- user selects the model they would like to run.
ponent of Spreadsheet-SDSS was implemented They can choose from WLC Direct (without AHP),
in VS 2008 and VSTO tools using the Visual C# OWA, WLC using AHP, or Boolean Overlay. The
programming language and will be described in next section, Map, provides some basic carto-
more detail below. The map display capability graphic capabilities. In this category there are
was realized in Microsoft Excel worksheets by tools that allow users to insert a north arrow (North
adding functionality for resizing the width and Arrow) and text (Text), receive a report on a
height of the cells (to square cells). In addition layer (Report), change the legend of the layer
code was written that alters the color of cells based (Change Legend), and lastly view a print preview
on the values of the cells. Currently the report (Print Preview) of the layer. The Navigation sec-
generation functionality generates reports which tion provides the user the ability to zoom in and
describe the distribution of values (count of cells) out for viewing data at different scales while the
and the area in acres, hectares, and square miles Contents (has been changed to Info in latest imple-

Figure 2. The Spreadsheet-SDSS extension or plug-in in Microsoft Excel

486
Development and Application of a Spreadsheet-Based Spatial Decision Support System (SDSS)

mentation) section provides help and information chosen are worksheets in the Excel workbook.
on the extension. The Spreadsheet-SDSS (for The next step in this example, as the WLC using
Microsoft Excel 2007 and 2010), as well as instal- AHP model is being used, is to set the relative
lation instructions, can be freely downloaded at importance (more detail below) of each layer in
http://www.geotree.uni.edu/SpreadsheetSDSS. relation to each other in the AHP dialog (Figure
aspx (GeoTREE Center, 2011). 5). After the user sets these values and clicks
Compute the model runs and a new raster output
(worksheet) is produced and added to the Excel
EXAMPLE APPLICATION workbook. The user can then use the Map tools
to add cartographic elements, produce a descrip-
In order to demonstrate the utility of the Spread- tive report, or use the navigation tools to zoom in
sheet-SDSS application, we provide an example and out on the new layer.
of a potential real-world application using the In the example provided, the selection of
WLC using AHP technique. In this example ap- values in the AHP pairwise comparison is based
plication, the goal is to find a suitable location on the perceived level of importance of the layers
for construction of a new elementary school in a in relation to each other. The original AHP
growing small city (~40,000 people). Four raster method developed by Saaty (1980), and adopted
layers (Table 1) are considered in this example as in Spreadsheet-SDSS, used a range of values from
important determinants of locating a new school. 1 (equal importance of two variables) to 9 (one
In this example all layers had a spatial resolution of variable has extreme importance in relation to
30 m2 square and the layers were made up of 250 other variable) in a pairwise comparison. In our
columns and 398 rows for a total of 99,500 cells. example, shown in Figure 5, the pairwise com-
The general flow for using the Spreadsheet- parison indicates that population change is con-
SDSS is illustrated in Figure 4. In our case study sidered of strong importance (score of 5) in
example, we first import (using Add Layer tool) comparison to schools, schools are considered
the four layers (Table 1 and Figure 3) into Micro- equally important as compared to parcel size, and
soft Excel. Next the model is chosen (in this case land use is considered of moderate importance in
the ‘WLC using AHP’ button). After clicking on relation to parcel size. These pairwise comparison
this button a user interface is displayed in which scores (and their reciprocals – e.g., reciprocal
the user chooses which layers are to be included score of 1/5 for schools in comparison to popula-
in the modeling effort. The layers that can be tion change) are used in Spreadsheet-SDSS, using

Table 1. Description of raster layers used in hypothetical real-world application of Spreadsheet-SDSS

Layer # Layer Name Description


1 Population Population change in the city from 1990-2000 derived from Census blocks in city limits. The two
change categories (0 and 1) represent whether the Census block gained (1) or lost (0) people over the 10 year
time period.
2 Schools Represents a 2 km buffered area around the six elementary schools that already exist in the city. The ‘0’
represents areas within 2 km of an elementary school. The ‘1’ represents areas that lie outside of this
area.
3 Land use Another binary category that represents land uses that are either (1) suitable for purchase and construc-
tion of a new school or (0) not suitable for building a new elementary school. This classification was
based on landcover datasets (e.g., agricultural land was suitable while wetlands were not).
4 Parcels A raster layer that portrays the size of land parcels in the city based on the assumption that a parcel
needs to be a minimum of 10 acres in size.

487
Development and Application of a Spreadsheet-Based Spatial Decision Support System (SDSS)

Figure 3. The raster layers used in hypothetical Figure 4. Generalized workflow for running a
real-world application of Spreadsheet-SDSS model in Spreadsheet-SDSS

Figure 5. The AHP pairwise comparison interface in Spreadsheet-SDSS

488
Development and Application of a Spreadsheet-Based Spatial Decision Support System (SDSS)

algorithms described in more detail by Saaty est gray areas are determined by the model to be
(1980) and Malczewski (1999), to develop weights the most suitable locations for a new elementary
that are used in the WLC method. In a real ap- school. The darkest areas were determined by
plication of this type of spatial decision situation the model to be unsuitable for the construction
(i.e., locating a new school), a variety of stake- of a new school (the center of town). Note that
holders (e.g., school administrators, city planners, this analysis produces a range of suitability. The
parent teacher association members) would help output produced from the WLC Direct and OWA
decide on the important layers and also the pair- would look similar to that seen in Figure 6 in
wise comparison scores. In our example, popula- that the output would be a range of scores with
tion growth is determined to be the most important the lowest score being the least suitable and the
factor in selecting a site for the new school. The highest score being the most suitable. The Bool-
logic for this is that these areas of the city would ean Overlay model only accepts rasters that have
likely have the largest increase in the number of values of zero (bad or unsuitable) or one (good or
school age children entering the system. It is suitable) and the output would always be a raster
deemed to be preferred over the locations of exist- with cells of zero (unsuitable) and one (suitable)
ing elementary schools as these schools are also. All commercial raster GIS software (e.g.,
likely already at or near full capacity. Land use ArcGIS (ESRI, 2011a) and IDRISI (Clark Labs,
is slightly favored when compared to existing 2011)) can carry out Boolean Overlays. In recent
school locations as it is necessary to build a new versions, the ArcGIS (ESRI, 2011a) software has
school on land that can be easily converted for
other uses and zoned as such. Population growth
was the most important factor in locating a new Figure 6. The output raster data from the hypo-
school building, so it is slightly favored over both thetical real-world application Spreadsheet-SDSS
parcel size and land use. When using the AHP
using WLC model in the Spreadsheet-SDSS, the
decisions of how to score the pairwise comparisons
are very important and require careful consider-
ation by the stakeholders who are involved in the
spatial decision making process. Of course it is
possible for the users to alter the values and run
the model multiple times in order to see how the
outcome changes.
The output produced from running the AHP
using WLC model in Spreadsheet-SDSS for the
school location example is shown in Figure 6. The
output worksheet contains a score for each cell
representing its suitability for locating the school.
In this example, the scores range from zero (dark)
to one (light gray) as all of the input cells from
the layers were classified as ones (favorable) or
zeroes (unfavorable). The input rasters to this
model could have had any numeric values (i.e.,
not only zero and ones) and the resulting score
would have been scaled accordingly. The light-

489
Development and Application of a Spreadsheet-Based Spatial Decision Support System (SDSS)

added a simple WLC tool (called Weighted Sum) commonly used spatial raster data format, reclas-
added. IDRISI (Clark Labs, 2011) has WLC, sifying values, running models, and visualizing
AHP and OWA tools (Clark Labs, 2011). For us- modeling results cartographically. The modeling
ers familiar with the ArcGIS (ESRI, 2011a) and tools include commonly used methods in SDSS
IDRISI (Clark Labs, 2011) environments these including Weighted Linear Combination, Ordered
tools are relatively easy to use. Weighted Averaging, Analytical Hierarchical
Processing (with WLC), and Boolean Overlay.
Although, these modeling techniques have been
CONCLUSION AND FUTURE built into GIS software to provide SDSS capa-
DIRECTIONS bilities, the inclusion of these capabilities within
Microsoft Excel provides a means of exposing a
In this article we have described the development wider range of users and decision-makers to their
and application of a new Spatial Decision Support applicability in spatial decision making situations.
System (SDSS) which is available as an exten- It is envisioned that disciplines such as business,
sion to Microsoft Excel. The Spreadsheet-SDSS various agencies of government, economics, and
provides spatial decision support capabilities in a social sciences would benefit as there is a higher
widely used spreadsheet program and thus opens rate of use of Microsoft Excel as compared to
up this technology to a wider audience than is GIS tools/software. Additionally, there are in-
typical of SDSS. The vast majority of SDSS de- stances in which these disciplines are involved
velopment has traditionally included a commercial with issues or problems that at least have a partial
GIS component. These GIS packages are often spatial component. The extension would provide
expensive and require a high level of experience. an inexpensive yet powerful tool to improve the
Thus, SDSS based on these GIS packages can be efficiency of many organization’s workflows.
under-utilized. Spreadsheet software packages are Finally, the Spreadsheet-SDSS extension could
generally less expensive and more widely used prove to be an extremely valuable aid in teaching
than GIS software. The spreadsheet environment environments. The extension could be used in a
is analogous to the raster data model often used wide range of disciplines in order to demonstrate
within GIS software and thus provides an envi- real-world issues that have a spatial component.
ronment which allows spatial data to be modeled. Given the value of Spreadsheet-SDSS de-
Indeed numerous examples of existing SDSS scribe in this article, it must be pointed out that
applications developed in spreadsheet software the software cannot be considered as having the
have been documented. However, these SDSS breadth of functionality that many GIS software
were generally developed for specific applica- offers for SDSS development. For example, the
tion areas or purposes. The Spreadsheet-SDSS Spreadsheet-SDSS, based on the Microsoft Excel
extension described in this article provides a environment, has limited cartographic function-
more generic framework which includes several ality for display purposes. Also, commercial
modeling techniques which have often been used GIS software packages such as ArcGIS (ESRI,
in GIS and SDSS applications. A hypothetical 2011a) and IDRISI (Clark Labs, 2011) are the
real-world example was used to demonstrate the result of decades of development and thus there
functionality of the Spreadsheet-SDSS extension. are tremendous number of spatial analysis and
The Spreadsheet-SDSS is provided as an exten- processing tools that could potentially be utilized
sion or plug-in to Microsoft Excel and provides for SDSS development by the experienced user
a rich set of functionality and a user-friendly or developer. Some of these tools or functional-
interface. There is functionality for importing ity could be added to Spreadsheet-SDSS with

490
Development and Application of a Spreadsheet-Based Spatial Decision Support System (SDSS)

further development efforts. An example of a Clarke, K. C. (2011). Getting started with geo-
class of functions which are in commercial GIS graphic information systems. Upper Saddle River,
but not in Spreadsheet-SDSS are neighborhood NJ: Pearson/Prentice Hall.
analysis functions, which consider the influence
Densham, P. J., & Goodchild, M. F. (1989). Spatial
of proximal cells.
decision support systems: a research agenda. In
Although this article demonstrated the potential
Proceedings of the Annual Conference on Geo-
utility of the Spreadsheet-SDSS extension, there
graphic Information Systems and Land Informa-
are still improvements that could be made in the
tion Systems, Orlando, FL (pp. 707-716).
future. Improvements to be made are inclusion
of more models, more flexible import/export Engelen, G., White, R., Uljee, I., & Drazan, P.
functionality for different raster formats, and (1995). Using cellular automata for integrated
improved cartographic functionality. modeling of socio-environmental systems. En-
vironmental Monitoring and Assessment, 34,
203–214. doi:10.1007/BF00546036
REFERENCES
Environmental Systems Research Institute (ESRI).
Ali, C. Q., Ross, L. G., & Beveridge, M. C. M. (1996). ArcView GIS: The geographic information
(1991). Microcomputer spreadsheets for the imple- system for everyone. Redlands, CA: ESRI.
mentation of geographic information systems in Environmental Systems Research Institute (ESRI).
aquaculture: a case study on carp in Pakistan. (2011a). ESRI Products: Desktop GIS. Retrieved
Aquaculture (Amsterdam, Netherlands), 92, February 22, 2011, from http://www.esri.com/
199–205. doi:10.1016/0044-8486(91)90021-X products/index.html#desktop_gis_panel
Beedasy, J., & Whyatt, D. (1999). Diverting the Environmental Systems Research Institute (ESRI).
tourists: A spatial decision support system for (2011b). ESRI ASCII Raster format. Retrieved
tourism planning on a developing island. Inter- February 23, 2011, from http://resources.esri.
national Journal of Applied Earth Observation com/help/9.3/arcgisengine/java/GP_ToolRef/spa-
and Geoinformation, 1(3), 163–174. doi:10.1016/ tial_analyst_tools/esri_ascii_raster_format.htm
S0303-2434(99)85009-0
GDAL. (2010). Geospatial Data Abstraction Li-
Berardi, A. (2002). ASTROMOD: a computer brary: Version 1.6.0 [Software]. Beaverton, OR:
program integrating vegetation dynamics mod- Open Source Geospatial Foundation. Retrieved
eling, environmental modeling and spatial data from http://www.gdal.org
visualization in Microsoft Excel. Environmental
Modelling & Software, 17, 403–412. doi:10.1016/ GeoInformatics Training, Research, Education,
S1364-8152(02)00011-7 and Extension (GeoTREE) Center. (2011) Spread-
sheet Spatial Decision Support System (SDSS):
Burrough, P. A., & McDonnell, R. A. (2000). Version 1.1 [Software]. Retrieved February 22,
Principles of geographical information systems. 2011, from http://www.geotree.uni.edu/Spread-
Oxford, UK: Oxford University Press. sheetSDSS.aspx
Clark Labs. (2011). IDRISI Taiga GIS and Image Jiang, H., & Eastman, J. R. (2000). Applica-
Processing Software. Retrieved February 22, tion of fuzzy measures in multi-criteria evalu-
2011, from http://www.clarklabs.org/products/ ation in GIS. International Journal of Geo-
idrisi-taiga.cfm graphical Information Science, 14(2), 173–184.
doi:10.1080/136588100240903

491
Development and Application of a Spreadsheet-Based Spatial Decision Support System (SDSS)

Keenan, P. B. (2003). Spatial decision support Malczewski, J., Chapman, T., Flegel, C., Walters,
systems. In Mora, M., Forgionne, G., & Gupta, D., Shrubsole, D., & Healy, M. A. (2003). GIS
J. N. D. (Eds.), Decision making support sys- – multicriteria evaluation with ordered weighted
tems: Achievements and challenges for the new averaging (OWA): case study of developing wa-
decade (pp. 28–39). Hershey, PA: Idea Group. tershed management strategies. Environment &
doi:10.4018/978-1-59140-045-5.ch003 Planning A, 35, 1769–1784. doi:10.1068/a35156
Keenan, P. B. (2006). Spatial decision support Raubal, M., Gaupmann, B., & Kuhn, W. (1997).
systems: A coming of age. Control and Cyber- Teaching Raster GIS operations with spreadsheets.
netics, 35(1), 9–27. The Journal of Geography, 96(5), 258–263.
doi:10.1080/00221349708978800
Keisler, J., Blake, R., & Wagner, J. (2009). Imple-
menting Geographic Information Systems (GIS) Saaty, T. L. (1980). The analytical hierarchy pro-
in spreadsheet models: What, why, and how. In cess. New York, NY: John Wiley & Sons.
Proceedings of the 42nd Hawaii International
Strager, M. P., & Rosenberger, R. S. (2006). In-
Conference on System Sciences (pp. 1-10).
corporating stakeholder preferences for land con-
Li, Y., Shen, Q., & Li, H. (2004). Design of spatial servation: Weights and measures in spatial MCA.
decision support systems for property profession- Ecological Economics, 58, 79–92. doi:10.1016/j.
als using MapObjects and Excel. Automation ecolecon.2005.05.024
in Construction, 13, 565–573. doi:10.1016/j.
Sugumaran, R., & DeGroote, J. (2010). Spatial de-
autcon.2004.04.004
cision support systems: Principles and practices.
Lo, C. P., & Yeung, A. K. W. (2002). Concepts Boca Raton, FL: CRC Press/Taylor & Francis
and techniques of geographic information systems. Group. doi:10.1201/b10322
Upper Saddle River, NJ: Prentice Hall.
Sugumaran, R., Meyer, J. C., & Davis, J. (2004).
Longley, P. A., Goodchild, M. F., Maguire, D. J., A web-based environmental decision support
& Rhind, D. W. (2011). Geographic information system for environment planning and watershed
systems & science. Hoboken, NJ: John Wiley & management. Journal of Geographical Systems,
Sons. 6, 1–16. doi:10.1007/s10109-004-0137-0
Malczewski, J. (1999). GIS and multicriteria deci- Uran, O., & Janssen, R. (2003). Why are spa-
sion analysis. New York, NY: John Wiley & Sons. tial decision support systems not used? Some
experiences from the Netherlands. Computers,
Malczewski, J. (2006). GIS-based multi-
Environment and Urban Systems, 27, 511–526.
criteria decision analysis: a survey of the
doi:10.1016/S0198-9715(02)00064-9
literature. International Journal of Geo-
graphical Information Science, 20(7), 703–726. Yager, R. R. (1988). On ordered weighted av-
doi:10.1080/13658810600661508 eraging aggregation operators in multicriteria
decisionmaking. IEEE Transactions on Sys-
tems, Man, and Cybernetics, 18(1), 183–190.
doi:10.1109/21.87068

This work was previously published in the International Journal of Decision Support System Technology (IJDSST), Volume 3,
Issue 3, edited by Pascale Zaraté, pp. 1-14, copyright 2011 by IGI Publishing (an imprint of IGI Global).

492
493

Chapter 30
A Package-Based Architecture
for Customized GIS
Andrés Pazos
Universitat Jaume I, Spain

José Poveda
University of Texas, USA

Michael Gould
Universitat Jaume I, Spain

ABSTRACT
In this chapter we present a package-based component architecture for the specific deployment and
maintenance of public sector applications, specifically corporate Geographical Information Systems
(GIS). The three-tier architecture defines a group of server-side packages, a kernel and external packages,
that are dynamically requested and assembled at execution time according to the needs of individual
users to produce on demand customized GIS applications. An intermediate middleware layer handles
user authentication and version control. In order to demonstrate the proposed architecture, a practical
prototype has been implemented using Java WebStart. This prototype demonstrates the creation of several
GIS client applications, with increasing levels of functionality, and based on free software packages.

INTRODUCTION and in general any service relying on location


information as part of the context within which
Geographic Information Systems (GISs) have other information is interpreted. Corporate GISs
migrated over the past decade, from stand-alone are complex, multi-participant (cost-sharing) and
scientific niche applications to key parts of corpo- multi-user computing environments complicating
rate IT infrastructures in public sector (and com- practical tasks such as data sharing among depart-
mercial) organizations (Longley et al, 2001). GISs ments and routine system maintenance. Each of
provide services such as vehicle fleet tracking, potentially thousands of users may have specific
spatial planning (optimal location of resources), requirements based on their capacity to use the
GIS, and hardware and software configuration
DOI: 10.4018/978-1-4666-2038-4.ch030 requirements. Conventionally, system administra-

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
A Package-Based Architecture for Customized GIS

tors install custom upgrades and extensions one or open source) and adapting them to the CBSD.
workstation at a time, or offer all users the same These applications may be composed of a light
monolithic package to be downloaded remotely. kernel, which implements the basic functionality,
Until recently, GIS software packages were largely and a set of independent packages offering ex-
monolithic; although experience shows that many tended functionality (commonly called plug-ins).
users exercise only a fraction of the functionality In this case, each package implements a specific
offered by a full-featured GIS, system admin- part of the GIS—data conversion, visualization,
istrators were required to install and support a and so forth, is connected to the kernel through
full, often costly, commercial license for each published interfaces in order to compose the final
and every user. In search of just the right set of application desired by the user.
functionality for particular users, two primary op- The architecture described here centralizes
tions have emerged: end user development (EUD) software maintenance and distribution, exploiting
which assumes the user has software development the existence of ubiquitous Internet infrastructure
skills (Morch, 2004) or server-side configuration throughout the public sector (in the developed
of software modules, here termed packages based world). Conventional client/server architectures
on the terminology common in the Java, Perl, and do not cope well with certain authentication and
Python communities. monitoring requirements. For this, the authors
In organizations with an elevated number of moved to 3-tier software architecture. A central
users, and especially in the public sector which server stores and dispatches the different pack-
is extremely cost conscious, an extensible and ages that would compose the final application,
easily-customizable GIS application solution is while the middleware provides a natural place to
needed. This is possible assuming two levels of locate the adaptive behaviour (McKinley, 2004).
application flexibility. On one hand at administra- The client interface is through a specific HTTP
tion level the system should present scalability that browser that connects to the middleware services.
allows the use of existing software modules or System administrator takes charge of controlling
components, and on the other hand, at user level, the server side adapting and updating the pack-
the user should be able to determine locally the ages to the CBDS.
customization, installing or updating only the parts The case study that motivated this work stems
from the kernel of the system needed to perform from our interaction with the public administra-
the work (Morch, 1997). tion, specifically the Infrastructure and Transport
The component-based software development Department of a regional government authority.
(CBSD) paradigm (Brown, 2000) is helping to This organization has more than 400 computers
bring this extensibility and flexibility to reality. users, many of whom utilize GISs on a sporadic
In the CBSD approach new code development is basis. They commonly use proprietary desktop
minimized and system upgrades become the task applications with high annual license payments.
of replacement of well-bounded functional units However, according to internal surveys, many of
of the system. In our proposed architecture, the these users require only a specific subset of GIS
components are grouped into packages, defined as functionality. Moreover, users are distributed
a functional, replaceable set of services. A pack- geographically, so it is important to centralize
age is a higher-level aggregation of functionality the remote maintenance and the installation of
compared to an ActiveX component or a JavaBean. the software in order to save time and money.
Adherence to interface standards allows acqui- The architecture proposed here is valid for
sition of packages developed from third-party both commercial and free software, but additional
developers (Commercial off-the-shelf (COTS) monetary benefit naturally accrues for the case

494
A Package-Based Architecture for Customized GIS

of GIS applications based on free software com- ActiveX components is viable for groups of users,
ponents (Anderson, 2003). Today, it is possible but not for individual customization.
to reuse existing software libraries that cover the In the field of GIS a widely-implemented
basic functionality of a GIS application, and in example of CBSD is the ESRI MapObjects (Hart-
many cases source code is available so that these man, 1997) platform: an ActiveX control and a
libraries can be modified or extended by system collection of dozens of automation objects that
administrators. Examples of these libraries are provide basic GIS and mapping capabilities. These
Geotools (www.geotools.org) and Terralib (Ca- components must be embedded in an application
mara, 2000), which are distributed under the Lesser development environment, such as Visual Basic,
General Public License (LGPL) (GNU, 1999), so beyond the capability of typical GIS users and
they are valid for open source developments as therefore the domain of end user composition.
well as for commercial purposes. We are interested in a higher aggregation of
components (self-contained packages) that can
be included into the main application without
RELATED WORK the need to modify the source code of the base
system, and that can be maintained and updated
The structure of the package-based architecture is centrally by the system operator
directly related to the object-oriented frameworks
(Fontoura, 2001) concept. The main difference
is that instead of a collection of smaller compo- ARCHITECTURE OVERVIEW
nents, we implement a collection of higher-level
packages with predefined collaborations between In search of a low cost GIS solutions with an
them and extension interfaces. In order to con- optimal level of functionality, we modify the tradi-
nect each developed package to the kernel, it tional concept of a monolithic GIS application and
must apply the extension interfaces, and connect redistribute it to better fit a changing corporative
to the points of extension called hot spots (Pree, environment, one which is integrated in a high-
1995). Our packages are pieces of software that speed network (100 Mbps LANs). The proposed
implement a specific functionality, deployable as architecture would not have been viable in most
a unit of versioning and replacement. The package organisations 10 years ago, as shown in previous
contains immutable code, and metadata informa- studies (Anitto, 1994) due to the inherent com-
tion necessary for connection to the kernel. This munication capabilities required. The proposed
structure must allow a dynamic recomposition architecture is defined keeping in mind two key
of the software transparent with respect to the system features: minimal system maintenance
application source code. costs due to centralization and scalability of the
Software components are well defined by system in a natural way, avoiding costly reengi-
Szyperski (1998) as binary units of independent neering processes.
production, acquisition, and deployment that In order to achieve the desired flexibility, the
interact to form a functioning system. The most system is separated into a kernel and a set of ex-
well-know component model is ActiveX/COM+ ternal packages. The kernel implements the main
(Rogerson, 1997), an interesting model for thread of the system and is in charge of building
Windows programming integration but not very the client side GIS application. In this process, the
interesting for dynamic composition. Moreover, requested packages are loaded in order to build
the option of programming custom solutions using the final application.

495
A Package-Based Architecture for Customized GIS

System maintenance is high-overhead cost, the clients for the its representation. At the server
and so we follow Isnard (2004), in assuring that side, the system loses flexibility and introduces a
the system administration design also builds performance bottleneck at executing time. On the
maintenance tasks into the computing environ- other hand, at the client side the system overloads
ment. The proposed architecture centralizes all communication between the server and the clients
software maintenance and distribution, by means due to redundant functionality sent to the different
of a central server that contains the packages for clients. Taking into consideration these aspects,
future composition of custom GIS applications. the schema that has been adopted corresponds to
a three-tier software architecture (see Figure. 1) as
Corporate Needs in GIS a result of adding to a simple server-client schema
a new tier located between the client and server
After studying the existing solutions for Corporate in charge of user authentication. If we take into
Geographic Information Systems (CGIS), we have consideration the possible geographic informa-
identified the following key factors to take into tion databases of the system, the architecture (see
consideration: Figure. 1) can be naturally extended to a N-tier
software architecture.
• Customization. Following EUD prin- The proposed architecture exploits the main
ciples, the user must be able to select and benefits of the web as transfer protocol (ubiquity,
install only the specific packages needed to portability, reliability and trust) to deliver the
perform his/her particular work and not all different pieces that will conform to the GIS ap-
the functionalities of the CGIS, assuming plication. The client layer is responsible for the
access priveledges. interaction with the user and for the generation
• Extensibility. The system must be scalable of the queries sent to the middleware with the
according to the new needs of the organi- user profile validation and authentication ser-
zation, allowing the total re-use of the ex- vices. These requirements, in practical terms, are
isting system. achieved with a web-based system architecture
• Software distribution. Organizations where the client is a standard web browser and the
operate in a decentralized fashion, and middleware is implemented as a web server. The
conduct business in many geographical server layer, as repository, contains the packages
regions, therefore they require distributed that will be sent to the client after the processing
computer support. The software must be of the middle tier messages. The interchange of
downloaded and installed across a net- messages between the different layers is imple-
work, avoiding repetitive, manual local mented as XML files (Aloisio, 1999).
installations. The process to get the final GIS application at
• Maintenance. A centralized architecture the client side is as follows:
helps in the administration of the system,
because it allows updates of each indepen- 1. Connection to the middleware layer through
dent piece (package) with a minimal cost. the client browser, the first time the user
composes the GIS application and each
General Structure time that the GIS application configuration
is updated.
Traditionally, the GIS architecture follows a client- 2. Validation and authentication with user
server schema, where the data processing takes name and password. According to the user
place on the server side and the results are sent to profile and package visibility privileges, the

496
A Package-Based Architecture for Customized GIS

middleware displays a list of the possible matically. Figure 2 shows the general structure of
packages (read here functionalities) that can the packages composition.
be added to the final GIS application. The CGIS has been designed in such a way
3. Selection of packages (functionalities) that that packages fit to the general structure of the
the user requests to install. kernel. All the parts of the kernel to be accessed
4. XML configuration. After receiving the from external packages must be defined according
result of the selection, the middleware dy- to an open and well defined specification. In the
namically creates a XML file that contains literature we find that these parts have been called
the user profile and packages configuration “hot spots” or “point of extension” (Anderson
information. This configuration file is sent and Moreno-Sanchez, 2003). After receiving the
to the server. requested packages, the kernel examines them
5. Processing at server side. The server pro- and extracts the information contained in a valid
cesses the XML configuration file and sends XML configuration file. This configuration file
the selected packages to the client. These must be defined according to the Document Type
packages are automatically connected to the Definition (DTD) of the kernel and contains in-
client kernel thereby composing the updated formation about the points of extension to be
GIS application. extended. For example, if for a particular GIS
application (an instantiation of the CGIS) a par-
ticular extension of the GIS application menu is
Application Composition required, the XML configuration file must contain
a label for the package to be included in the menu
After downloading the requested packages, the and for the associated actions.
kernel builds and launches the application auto- Besides the hot spots, the kernel of the applica-
tion exposes an interface that allows the packages
to the access internal objects. For access to the
internal kernel objects, the packages must use this
Figure 1. Architecture overview public interface of the kernel.

FURTHER FEATURES

Once the interfaces for the interaction of the pieces


(kernel and other packages) and the hot spots of
the CGIS are defined, developers are able to build
up new packages with additional functionality and
to simply include them in the CGIS for possible
use by any client. The remainder of the CGIS
does not have to be modified, because of the
structure of the CGIS that allows by design the
direct inclusion or replacement of new packages.
The task of uploading new packages to the CGIS
repository corresponds to the administrator of the
system. The users (CGIS clients) connected to the
middleware tier, are able to add to the GIS client

497
A Package-Based Architecture for Customized GIS

Figure 2. Package composition

application new packages (functionalities) if they appropriate design for corporate environments in
are visible according to the user profile. an effort to minimize maintenance costs, while at
The user needs to connect to the middleware the same time promoting the interoperability and
only the first time that the application is built and flexibility of the resulting software framework.
downloaded or when an updating of any installed In this article, we proposed an architecture,
packages is need After the first time the GIS ap- named CGIS, that allows the optimum distri-
plication has been downloaded and installed at bution and installation of the GIS application,
the client side, the GIS application remains in where each user can customize his/her applica-
the local computer. However, the user always tion interactively. The software architecture has
may uninstall existing packages or the whole GIS been validated through the implementation of a
application and download it again. testing prototype. This development has been a
The administrator of the CGIS or any user with key aspect in order to improve and demonstrate
administrator privilege through the middleware the design features. The architecture is universal,
tier is able to list the available packages, to delete not strictly related to any protocol, programming
any of them or to add new ones. The administrator language or platform, though some characteris-
of the CGIS grants user privileges. tics of the selected implementation language, as
for example platform independence, makes the
system even more universal. In particular, in the
CONCLUSION validation implementation Java WebStart was
utilized with success.
In this paper we examine some of the limitations Finally, the architecture creates a simple yet
of GIS applications involved in public sector effective framework for the development of GIS
corporate environments. Specifically, we con- applications. With the inclusion of independent
centrated on some crucial issues of integrating packages in a central server, different needs of
component-based software development and the corporative environment can be supported.
three-tier architectures for geographic information As future work, we propose the implementation
systems. These ideas are the result of previous of new packages to extend functionalities of the
investigation into the uses of GIS software in the current prototype. It is important to make an effort
public administration and the possibilities of open to incorporate more open source libraries such as
source in this field. We have attempted to find the Geotools 2 and Terralib that allow the creation

498
A Package-Based Architecture for Customized GIS

of new packages with a minimum effort. On the Isnard E., Perez, E., & Galatescu, A. (2004).
other hand, the system can be extended in the MECASP – An Environment for Software Main-
middleware layer. We propose to add functional- tenance and Adaptation. ERCIM News, No.58,
ity for monitoring and controlling of downloads 45-46.
of the different users in order to produce useful
Longley, P. A., Goodchild, M. F., Maguire, D.
statistics for administration purposes.
J., & Rhind, D. (2001). Geographic information
systems and science. Wiley and Sons.
REFERENCES McKinley, P., Masoud, S., Kasten, E., & Cheng, B.
(2004). Composing adaptive software, computer
Aloisio, G., Millilo, G., & Williams, R. D. (1999). IEEE. Computers & Society, 37(7), 56–64.
An XML architecture for high performance web-
based analysis of remote-sensing archives. Future Monson-Haefel, R. (1999). Enterprise JavaBeans.
Generation Computer Systems, 16, 91–100. O’Reilly Press.

Anderson, G., & Moreno-Sanchez, R. (2003). Morch, A. (1997). Three Levels of End-User Tai-
Building Web-based spatial information solu- loring: Customization, Integration, and Extension.
tions around open specifications and open source M. Kyng & L. Mathiassen (Eds.), Computers and
software. Transactions in GIS, 7, 447. Design in Context. The MIT Press, Cambridge,
MA, 51-76.
Anitto, R. N., & Patterson, B. L. (1994). A new
Paradigm for GIS data Communications. URISA Morch, A., Stevens, G., & Won, M. (2004).
Journal, 64-67. Component-based technologies for end-user
development. Communications of the ACM,
Brown, A. W. (2000). Component-Based Develop- 47(9), 59–62.
ment. Prentice Hall PTR.
Open Geospatial Consortium Inc. (OGC). Web
Camara, G., et al. (2000). Terralib: Technology in Map Service 1.3. (2004). www.opengeospatial.org
Support of GIS Innovation. II Workshop Brasileiro
de Geoinformática, Geoinfo2000. Sao Paulo. Pissinou, N., Makki, K., & Park, E. K. (1993).
Towards the Design and Development of a New
Fontoura, M., Pree, W., & Rumpe, B. (2001). UML Architecture of Geographic Information Systems,
Profile for Framework Architectures. Addison- ISBN:0-89791-626-3, 565-573, Washington.
Wesley/ACM Press.
Pree, W. (1995). Design Patterns for Object-
Gosling, J., & McGilton, H. (1996). White Paper: Oriented Software Development. Wokingham:
The Java Language Environment. Java Articles: Addison-Wesley/ACM Press.
http://java.sun.com
Rogerson, D. (1997). Inside COM: Microsoft’s
Guo, J. (2003). An Approach for Modeling and Component Object Model. Microsoft Press.
Designing Software Architecture. 10th IEEE
International Conference and Workshop on Schmidt, R. (2001). Java Networking Launching
the Engineering of Computer-Based Systems Protocol & API Specification. Java Articles http://
(ECBS’03). Alabama. java.sun.com.

Hartman R. (1997). Focus on GIS Component


Software. OnWordPress.

499
A Package-Based Architecture for Customized GIS

KEY TERMS AND DEFINITIONS Component: In object-oriented programming


and distributed object technology, a component
Application Programming Interface (API): is a reusable program building block that can be
The interface to a library of language-specific combined with other components in the same or
subroutines, for instance a graphics library that other computers in a distributed network to form
implement higher level graphics functions. an application.
Binding: Language-dependent code that al- Corporate Geographic Information System
lows a software library to be called from that (CGIS): A Corporate GIS is a GIS defined to be
computer language. used as a corporate resource by the members of
Client-Server Architecture: The term client/ an enterprise or institution.
server was first used in the 1980s in reference to Extensible Markup Language (XML): Is
personal computers (PCs) on a network. The actual a flexible way to create common information
client/server model started gaining acceptance in formats and share both the format and the data
the late 1980s. The client/server software archi- on the World Wide Web, intranets, and elsewhere.
tecture is a versatile, message-based and modular Graphical User Interface (GUI):The graphi-
infrastructure that is intended to improve usabil- cal user interface: or GUI, provides the user with
ity, flexibility, interoperability, and scalability as a method of interacting with the computer and its
compared to centralized, mainframe, time sharing special applications, usually via a mouse or an-
computing. A client is defined as a requester of other selection device. The GUI usually includes
services and a server is defined as the provider of such things as windows, an intuitive method of
services. A single machine can be both a client and manipulating directories and files, and icons.
a server depending on the software configuration. Middleware: Layer(s) of software between
Commercial Off-the-Shelf (COTS): A gen- client and server processes that deliver the extra
eral term for software products that are made and functionality behind a common set of APIs that
available for sale or lease. client and server processes can invoke.

This work was previously published in Handbook of Research on Geoinformatics, edited by Hassan A. Karimi, pp. 311-319,
copyright 2009 by Information Science Reference (an imprint of IGI Global).

500
Section 3
Tools and Technologies

This section presents extensive coverage of various tools and technologies available in the field of Geographic
Information Systems that practitioners and academics alike can utilize to develop different techniques. These
chapters enlighten readers about fundamental research on the many tools facilitating the burgeoning field of
Geographic Information Systems. It is through these rigorously researched chapters that the reader is provided
with countless examples of the up-and-coming tools and technologies emerging from the field of Geographic In-
formation Systems. With 18 chapters, this section offers a broad treatment of some of the many tools and tech-
nologies within the Geographic Information Systems field.
502

Chapter 31
Algorithms for 3D Map
Segment Registration
Hao Men
Stevens Institute of Technology, USA

Kishore Pochiraju
Stevens Institute of Technology, USA

ABSTRACT
Many applications require dimensionally accurate and detailed maps of the environment. Mobile map-
ping devices with laser ranging devices can generate highly detailed and dimensionally accurate coor-
dinate data in the form of point clouds. Point clouds represent scenes with numerous discrete coordinate
samples obtained about a relative reference frame defined by the location and orientation of the sensor.
Color information from the environment obtained from cameras can be mapped to the coordinates to
generate color point clouds. Point clouds obtained from a single static vantage point are generally
incomplete because neither coordinate nor color information exists in occluded areas. Changing the
vantage point implies movement of the coordinate frame and the need for sensor position and orienta-
tion information. Merging multiple point cloud segments generated from different vantage points using
features of the scene enables construction of 3D maps of large areas and filling in gaps left from occlu-
sions. Map registration algorithms identify areas with common features in overlapping point clouds and
determine optimal coordinate transformations that can register or merge one point cloud into another
point cloud’s coordinate system. Algorithms can also match the attributes other than coordinates, such
as optical reflection intensity and color properties, for more efficient common point identification. The
extra attributes help resolve ambiguities, reduce the time, and increase precision for point cloud regis-
tration. This chapter describes a comprehensive parametric study on the performance of a specialized
Iterative Closest Point (ICP) algorithm that uses color information. This Hue-assisted ICP algorithm,
a variant developed by the authors, registers point clouds in a 4D (x, y, z, hue) space. A mobile robot
with integrated 3D sensor generated color point cloud used for verification and performance measure-
ment of various map registration techniques. The chapter also identifies various algorithms required to
accomplish complete map generation using mobile robots.

DOI: 10.4018/978-1-4666-2038-4.ch031

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Algorithms for 3D Map Segment Registration

INTRODUCTION quality varies depending upon the sensor resolu-


tion and the extent of overlap between the map
Complete and dimensionally accurate maps of segments. Different techniques exist for merging
the environments are of interest to many domains 3D maps by exploiting geometric features and
including surveying, search and rescue, security, measuring surfaces. The most popular registra-
defense and construction. Laser based scanning tion algorithm for point cloud registration is the
devices (Light Detection And Ranging-LIDAR) iterative closest point (ICP) algorithm (Thrun,
are generally used to generate point clouds that 2003). In ICP, the corresponding closest points in
describe spatial information in the form of numer- different point clouds are associated and optimal
ous discrete point coordinate measurements. Point rigid transformation required to minimize a mean-
data are acquired by measuring time of flight of square error of separation between the associated
scattered light or phase shift between incident points (Bsel, 1992) is iteratively found. The color
and reflected light to find the distance between attributes of the sampled point can be utilized in
the object surface and the scanning device (Blais, ICP progress to increase computational speed and
2004). The speed of scanning discrete points can provide higher accuracy. Anderson (2007) filtered
be enhanced by pulse and phase based measure- the point set data based on hue before conducting
ment technologies (Blais, 2004). Precise rotation traditional ICP. Houng et al., (2009) processed
mechanisms with high-resolution encoders spin a images to extract corresponding visual features
2D LIDAR device to generate a 3D point cloud. that are used in registration process.
Point cloud scanners have been mounted on air- In this chapter, we examine the algorithms
planes (Browell et. al. 1990) and ground vehicles required for a mobile robot to generate a dimen-
(Gebre, et al. 2009) to create large area terrain sionally accurate and complete map of an area
maps. When vision sensors are integrated with without prior information about the area. We focus
the laser ranging systems, point clouds can also particularly on the techniques for registration of
contain the color information of the scene. Opti- map segments taken from various vantage points.
cal imagery from the camera is associated with The chapter also describes a mobile robotic sys-
point coordinates to produce color point clouds tem with a color point cloud scanner and various
(Andresson, 2007). algorithms required for accomplishing the mis-
A 3D point cloud obtained from a single sion of generating a complete and dimensionally
vantage point is seldom adequate to construct a accurate map of an area.
complete map. Generation of a complete map of
an environment requires merging or registration of
map segments taken from various vantage points. MOBILE MAPPING WITH COLOR
The registration enables construction of large-scale POINT CLOUD SCANNERS
global 3D maps (Thrun, 2003). Registering the
map segments is trivial if precise position and ori- Color point clouds are created by synchronizing
entation of the sensor are accurately known about range sensors such as the LIDAR with video/still
a global reference frame. Position sensors such cameras. LIDAR devices discretely measure the
as inertial measurement units or those relying on distance between a light source and a reflection
global positioning systems are prone to errors and target at a high frequency. By changing the path
can be highly inaccurate under certain conditions. of the light through mirrors and actuators, a point
The map registration process determines the rigid cloud of a 3D space is produced. A calibrated
body translation and rotation of the sensor as its vision sensor maps the color information to the
output (Thrun, 1993, 2003). The map registration sampled points. Installing such a scanning sensor

503
Algorithms for 3D Map Segment Registration

on a mobile platform extends its range and enables to generate the coordinate in a spherical coordinate
mapping of large areas. system, which is transformed to Cartesian system
as necessary. Figure 1 also shows a picture of a
3D Color Scanner compact version of the system.
The 3D color scanner is mounted on a mobile
The 3D color scanner used in this effort consists vehicle for mapping large areas. This mobile
of a 2D LIDAR and two 1.3 megapixel high-frame mapping system generates color point cloud data.
rate video cameras installed on the LIDAR scan- Figure 2 shows the mobile system with the scan-
ning plane. The LIDAR and the cameras move ner installed on top of the vehicle. The vehicle
such that the scan plane is rotated about an axis has no global positioning devices other than wheel
within the plane, thus generating 3D color point encoders. Cameras and short-range infrared sen-
clouds. Figure 1 shows that the LIDAR consists sors enable observation of terrain conditions,
of a rotating mirror which is driven about Y axis collision avoidance and allow a remote operator
(degree of freedom: θ) and the scan plane is to drive the vehicle. Map data and video feeds are
rotated about Z axis (φ: degree of freedom). The transmitted using an on-board wireless commu-
rotations are controlled by servomotors installed nication system. This mapping system performs
on the axes. The cameras are calibrated to be on scans only when it is stopped. The vehicle can
the LIDAR scan plane and a forty-pixel wide localize itself from the map observations and
image stripe is extracted from the cameras. The moves directly from one vantage point to the next
color information is then matched, in real-time, and acquires additional map information. This
to the points ranged by the LIDAR. The relative system can generate color point cloud maps with
distance between cameras and LIDAR is pre- 0.25° angular resolution in the vertical scanning
configured and images are pre-aligned. The 2D direction with a coverage angle of 100°. In the
LIDAR generates scans at a frequency of 38 Hz rotation (φ) direction, the resolution is at 0.1° with
and the cameras provide imagery at 60 frames per coverage angle 300°. The map segment from one
second. Time synchronization establishes that the vantage point covers a maximum radius of 80
pixel color is mapped to each ranged point. Use of meters.
two cameras reduced occlusions due to the offset The data elements produced by the scanner
between the LIDAR mirror and the camera lens. are shown in Figure 3. Figure 3(a) shows the
All areas visible to the LIDAR are visible to one camera image taken from the vantage point de-
of the two cameras. The 2D range measurement picting scene visible to the scanner. The 3D color
along with the scanner rotation position (φ) is used point cloud generated at that vantage point is

Figure 1. 3D scanning devices built with 2D commercial scanners

504
Algorithms for 3D Map Segment Registration

Figure 2. Mobility platform used for 3D color map construction in large area (Gebre et al., 2009)

shown in Figure 3(b). In this figure, the coordinate data and shown in Figure 3(c). The object surface
(x,y,z) and the color (r,g,b) for all the pixels are material, color and distance towards scanner cause
known. The point density (spatial resolution of variations in intensity data. Similarities between
the point cloud) varies on the left and right sides intensity point cloud and color point cloud can
of the color scan scene depending upon the distance be observed between Figure 3(b) and (c) on
between the scanned point and the scanner. The edges, doors, and windows.
closest area to the scanner has the highest den-
sity of points. The scanner also records the optical
reflection intensity of laser beam. The intensity
information is combined with range measurement

Figure 3. High dimensional point cloud map segment taken from a single vantage point: (a) Image of
an urban building (b) Color point cloud map (c) Laser reflection intensity map

505
Algorithms for 3D Map Segment Registration

Algorithms for Complete Mapping tion, we discuss the algorithms that address each
of these tasks.
An autonomous robot with the color point cloud
scanner can reduce the surveying and map building Robot Self-Localization
cost and time. However, several methodologies
for robust self-localization, map completeness The self-localization problem requires mobile
evaluation, map based navigation and 3D map mapping robot to determine its location in an
registration must evolve before a high degree of unknown environment. Localization is critical
autonomy can be achieved. because robot cannot effectively navigate to the
A mapping robot deployed at initial start next waypoint without the location information.
position must go through the four phases of the Map registrations require location and pose es-
mapping processes as shown in Figure 4. The timates. Usually robot is equipped with multiple
robot must be able to localize itself so it can navi- position and orientation sensors like GPS, Inertial
gate the scene. This can be accomplished by 2D Measurement Unit (IMU), odometer, and wheel
SLAM (Simultaneous Localization and Mapping) encoders to measure real-time pose and position.
techniques or other methods. Methodologies for Multiple position and location sensors return robot
establishing the map completeness and detection position information with certain level of error due
of occluded areas are necessary. Determination to reasons like sensor precision, GPS signal noise
of the optimal vantage point for filling in the and errors, sensor drift for IMU and inaccurate
occluded areas and exploring unmapped areas is measurements from other sensors.
also a critical step. As the navigation is based on The main challenge for robot localization is
imprecise mapping and localization information, to escape location sensor noise, drift errors, and
the map segment registration based on 3D color constantly provide accurate location and position
point clouds is the last but crucial step in building reference for the robot. Probabilistic self-local-
the complete map of a given area. In this subsec- ization techniques based on maximum-likelihood
estimation have been applied to address this
problem. These techniques assume that the noise
of position sensor follows certain probabilistic dis-
Figure 4. Map completeness orientated robotic
tribution, which can be described mathematically.
mapping process
They also assume that two subsequent map results
are highly comparable to each other and several
landmarks can be quickly identified. Therefore,
accurate relative position and location can be
solved by comparing current map with a previous
map in short time intervals, and probabilistically
maximizing similarity between two maps (Olson,
2000). Map could be generated by different sensors
like stereo cameras, sonar or laser range finders.
Landmarks extracted from maps are commonly
applied in the self-localization process to reduce
computation cost. Whyte and Bailey (2006) uti-
lize the relative localization results between two
neighbor vantage points to merge the two maps.

506
Algorithms for 3D Map Segment Registration

A two-step process, termed as Simultaneous P (z t | x t , m ) (2)


Localization and Mapping (SLAM), typically lo-
calizes the robot. The robot position is established The robot motion model describes probability
from multiple but imprecise sensor measurements on state transition of robot state vector, xt with
and comparison of landmarks in the scene. The known previous state xt-1 and control input ut
position sensor data is improved using sensor
fusion techniques by Spletzer (2003). Location
P (x t | x t −1, ut ) (3)
information is estimated based on previous loca-
tion, driving command information and current
sensor measurements. In SLAM, probabilistic The transition of state vector is assumed as a
methods are applied to reduce sensor noise ef- Markov process, implying that the next robot state
fects. Extended Kalman filter and particle filters xt can only be determined on previous state xt-1
and noise models improve the location estimates and latest control input ut and not the history of
(Montemerlo, et al. 2003). The SLAM solution states. The state of robot is independent of both
has been expanded into 3D space with a six de- observations and landmarks. Equation (1) can be
gree of freedom (6DOF) SLAM which applies recursively solved in a Prediction (time update)
sensor measurement and robot kinematics models and Correction (Measurement update) form.
(Nücher, 2005). Landmark extraction and map
comparison entail the major computation effort Prediction:
during the SLAM progress. Real-time SLAM
has been demonstrated with stereovision sensors P (x t , m | z 0:t −1, u 0:t , x 0 ) =
(Davison, 2003).
The SLAM technique simultaneously consid-
∫ P(x t
| x t −1, ut )P (x t −1, m | z 0:t −1, u 0:t −1, x 0 )dx t −1
(4)
ers the localization and mapping mission (Thrun,
et al., 2000). The SLAM problem can be described
Correction:
by a joint posterior:
P (x t , m | z 0:k , u 0:t , x 0 )
P (x t , m | z 0:t , u 0:t , x 0 ) (1)
P (z t | x t , m )P (x t , m | z 0:t −1, u 0:t , x 0 ) (5)
=
P (z t | z 0:t −1, u 0:t )
Where, xt is the state vector representing the
robot location and orientation, mi is the vector
representing the ith landmark location, zit is the Equation (4) and Equation (5) recursively solve
robot mapping measurement about ith landmark latest robot state joint posterior. Robot state can be
at time t, and ut is the control vector applied at t-1 predicted from the motion model P (x t | x t −1, ut )
time to drive robot to state xt at time t. and control input at time t. The observation model
The SLAM problem requires that Equation (1) P (z t | x t , m ) is applied to correct state prediction
be solved for the time, t, and the latest robot state with observation and mapping at time t.
vector xt be computed. Solving the joint posterior In order to find solutions to the SLAM prob-
from, 0-t requires an observation model and a mo- lem, proper practical descriptions about motion
tion model based on Bayes Theorem (Whyte and and observation model in Equation (2) and Equa-
Bailey, 2006). The observation model determines tion (3) should be provided with reliability and
the probabilistic distribution of observation zt with efficiency. Extended Gaussian Filter (EKF) is
known vehicle state and landmarks location as: applied to represent these models on state-space

507
Algorithms for 3D Map Segment Registration

model with additive Gaussian noise (Welch and extracted from current incomplete map for pos-
Bishop, 1995). The EKF based SLAM simplifies sible paths for navigation. The map evaluation
motion model as: also returns possible explorative area that is acces-
sible to the mobile robot but not mapped. If map
x t = f (x t −1, ut ) + wt (6) completeness is the most important factor for the
mission, algorithms that evaluate latest exploration
status after every scan may require assessment of
f(xt-1,ut) is the robot kinematics model and wt
the complete map and not just the current map
is the additive uncorrelated Gaussian disturbances
segment. There are many techniques to evaluate
with zero mean and covariance Qt. The observation
the completeness of mapping, namely, grid based
model can be described as:
occupancy map (Thrun, 2003), network/graph,
cell based map (Zelinsky, 1994) and template
z t = h(x t , m ) + vt (7)
based completeness evaluation (Oh et al, 2004).
The occupancy grid map is one of the most
In which, h(xt,m) is the observation geometry commonly used methods to determine map
description and vt is the additive uncorrelated completeness. Area of interest is gridded and
Gaussian disturbance with zero mean and covari- acquired maps from different vantage position
ance Rt. Equations (6) and (7) can be applied to the are transferred into or projected onto the grid.
SLAM prediction and correction. In EKF-SLAM Grid is marked as occupied when data exists on
process, the mean and covariance of both motion this grid, every grid should be represented with
model and observed motion should be updated at certain level of occupancy, which is computed by
every time t. Other probabilistic methods such as density of point cloud map on this grid. Map can
Particle Filter (PF) (Montemerlo et al., 2003) and be assumed as complete all the mapped objects
Graph Filter (GF) are used to solve the SLAM form self-closed contours or closed contours with
problem. A typical SLAM method is implemented the boundaries of the mapped area.
on 2D space, however, SLAM in 3D space with 6 A major challenge in map completeness evalu-
Degree of Freedom (6DOF) on robot kinematics ation is deciding whether an area can be mapped.
have been implemented by expanding landmarks For example, when mapping robot is performing
state, motion model and observation model into indoor exploration, space behind wall of the hall-
3D space (Nücher, 2005). way may not be accessible. Contours extracted
from latest global map may be used to determine
Map Completeness Evaluation possible navigation paths. Possible mapping area
exists for contours with gaps. Ascertaining that
The map completeness problem can be addressed the gaps in map contours are indeed traversable
with several methodologies including grid oc- paths requires discerning traversable pathways
cupancy, obstacle recognition and object view in the map.
completion detection. The completeness of map is
calculated by occupancy grid map (Thrun, 2003), Map-Based Navigation
which entails projecting the acquired map on an
occupancy grid and calculating the occupancy Determination of the next vantage point may
level. Possible mapping area is determined based depend upon several criteria: best view, coverage
on the contour of the objects and separating the of unmapped areas, areas of overlap with current
map into areas that can be potentially mapped or map, localization, accessibility and traveling costs.
impossible to map (Oh et al., 2004). Terrains are Two steps are required for determination of the

508
Algorithms for 3D Map Segment Registration

next vantage position. The first step is the genera- of this chapter remains on the registration aspect
tion of candidate positions and second step is the of the mapping exploration.
selection of optimal vantage point from the list.
The candidate vantage positions can be created
based on frontier exploration algorithm (Basilico ALGORITHMS FOR REGISTERING
& Amigoni, 2009) considering obstacles, position MAP SEGMENTS
and terrain conditions. The vantage position is
selected between candidate positions that have Three-dimensional point cloud segments acquired
the best view coverage and shortest traveling cost. from different locations have to be combined to-
Next vantage point should be decided based on gether as complete large-scale map. Position and
the best view to fill occluded regions and cover orientation information required for registration
as much new area as possible. Frontier based can be provided directly by mobile platform sen-
exploration algorithm provides vantage point sors such as GPS and IMU (Thrun 1993). In most
candidates for the best view point, these candidate cases, position information acquired from sensor
points are evaluated to determine best vantage is reasonably accurate. However, the orientation
point for next mapping. information is costly and relatively imprecise
Computing vantage position for mapping based because orientation sensor measurement can be
on previous vantage positions and incomplete map affected by external disturbances like magnetic
is known as the Next Best View (NBV) problem field variations and sensor integration drift with
(Yamauchi, 1997; Basilico & Amigoni, 2009). time. Position and orientation information can also
NBV algorithms navigate robot to acquire maxi- be provided by indirect techniques based on both
mum uncovered area. A certain level overlapped rough position sensor measurement and common
area ensures that the robot has enough landmarks geometric feature identification. Figure 5 shows
to navigate between the current and the next best two maps generated from separate vantage points.
view vantage point. Frontier based algorithm can The left map on the top row shows map gener-
be applied to provide candidate positions for the ated with robot facing towards one side of the
next best view point. Based on the regions on the building, the right map shows the map generated
boundary between mapped and unmapped space, from the second vantage point. The bottom figure
the frontier can be extracted. Considering the range shows the map data from the first vantage point
for mapping sensor constraints, next mapping registered into the coordinate system of the second
position on the frontier can then be generated. location. Registering the two segments produces
Current frontier should be evaluated in occupancy the complete map of the façade of the building.
grid map so that the frontier grid positions that Comparing with the SLAM algorithm, map
cover more unoccupied can be selected to accel- registration techniques focus on generating ac-
erate the coverage of the area. These candidate curate map details rather than localization of the
points can be evaluated based on the criteria for robot in a global coordinate system (Arun, 1987;
the exploration and time and power requirements Bsel, 1992; Lorusso, 1995; Rusinkiewucz, 2001).
for reaching the vantage point. Discrete range points received from color point
The map data acquired from various vantage cloud sensor contain detailed spatial information
points must be registered into global map space about the environment. Different techniques exist
using various registration algorithms. Although for merging such point clouds together by exploit-
this section describes the various algorithms ing geometric features and measuring surfaces.
required for complete map generation, the focus Map registration techniques such as Iterative
Closest Point (ICP) algorithm proposed by Bsel

509
Algorithms for 3D Map Segment Registration

Figure 5. Map segments generated from two vantage points (Top) and registered map (Bottom)

(1992) has been applied to stitch two neighbor responding points. Parallel ICP algorithms have
3D point cloud maps together into one map based been developed by Robertson (2002) to accelerate
on their common coverage area. Upon conver- computation speed. Point to plane registration
gence, ICP algorithm terminates at a minimum. method (Lorusso, 1995, Rusinkiewucz, 2001,
Several algorithms are in existence for calculating Salvi et al., 2007) accelerates the ICP iteration
the minimum average distance between two point and convergence.
clouds. Singular Value Decomposition (SVD) Other techniques include the point signature
method by Arun (1987), eigen-system methods method by Chua (1997), which uses signature
that exploit the orthonormal properties of the points to describe curvature of point cloud and
rotation matrices, and unit and dual quaternion matches corresponding signature points during the
techniques were adopted in ICP process. Quater- registration process. Spin image based methods
nion based algorithms have been used in ICP for compute 2D spin image to represent surface char-
map fusion by Bsel (1992), SVD based algorithms acterization and solve the registration problem by
are widely used in ICP and 6DOF SLAM (Arun finding best correspondence between two different
1987, Nucher, 2005, Joung et al., 2009) as they scan spin images (Johnson 1997). Other methods
are robust to reach local minimum and easy to like principle component analysis (Chung and Lee,
implement. Several variants of ICP are reported 1998) and algebraic surface model (Tarel et al.,
by Rusinkiewucz (2001) to increase the speed 1998) are based on the point cloud surface geo-
and precision. Corresponding points sampling, metrical features. The normal vector distribution
matching, weighting and rejecting are some can be translated into an orientation histogram in
methods used to accelerate the ICP algorithm. In an Extended Gaussian Image (EGI) (Makadia &
the ICP algorithm, associating corresponding Daniilidis, 2006). Rigid motion required to register
points in two point cloud data sets is the most two point clouds is solved from the cross covari-
critical step. Nearest neighbor search in 2D or 3D ance function (Chibunichev & Vilizhev, 2008) of
space is commonly used for associating the cor- the two EGI images. Rigid motion could also be

510
Algorithms for 3D Map Segment Registration

solved in Fourier domain by computing Discrete HUE-ASSISTED ITERATIVE


Fourier Transform on Rotation Group on SO(3) CLOSEST POINT (H-ICP)
(SOFT) (Joistekecm and Ricjnirem, 2008). ALGORITHM
Registration of color point clouds has been
considered (Ferbabdez, et al., 2007; Druon, The primary hypothesis of this algorithm is that the
2007; Newman et al., 2006; Anderson, 2006, hue value can be applied to increase the accuracy
2007). By applying proper calibration on the of point association and accelerate the registration
hybrid sensor system (Joung et al., 2009; New- process. The major time and computation cost
man, Cole, Ho, 2006), range measurement and during ICP is finding the correct points pairs.
visual information can be integrated together to Closest spatial distance is typically applied in 3D
construct a visually accurate representation of the ICP method. The distance value in 3D space can
scene. Color mapped 3D data was used in map be expanded into 4D space by adding weighted
registration by weighted red, green, blue data. hue value as the 4th dimension. By integrating hue
The corresponding point search during the ICP is value into the closest point search, accuracy of
conducted on both the coordinate and color data point association can then be improved.
(Johnson, Kang, 1997). Hue filters were also used
to constrain the closest point search in every ICP Hue Invariance with Vantage Point
iteration (Druson, 2007). Color data can be used
to estimate initial alignment of pair wise scans Hue value remains consistent about the same
using Scale Invariant Feature Transform (SIFT) point between images taken from two vantage
techniques. Color attributes transferred in YIQ points, while the color values represented in red,
color model can also be weighted to construct green and blue quantities usually differ because
new variant together with range information for of variation in light conditions. In order to apply
ICP fine registration. Depth-interpolated Image color to improve the association process, light-
Feature (DIFT) algorithm solves corresponding ing effect should be removed. Color raw data
points between two images and registers color are transformed into representation of separate
point clouds based on extracted correspondences chroma, lightness and brightness value. Figure 6
(Anderson, Lilienthal, 2010). shows two camera images of different angles of
In this chapter, we introduce hue assisted ICP a color palette on a Rubik’s cube, four colors are
algorithm for registration of color point clouds. used on the same surface. Figure 6 also shows the
The criteria for association are defined on a 4D color pixels with the background and black frame
space rather than 3D geometric space. The fourth removed. Histograms showing the red, green and
dimension selected is the hue, representing the blue value in RGB space for all the pixels are
intrinsic color values of the pixel. While achieving shown in Figure 7. In the RGB histogram, R, G,
the effect of a hue-based filter, hue-association and B distributions of the image vary considerably
reduces the nearest neighbor search burden con- with the vantage point. When the RGB color space
siderably (Men & Pochiraju, 2010). The remaining is transformed into HSL space and histograms of
sections of the paper describe the approach and hue, lightness and saturation are plotted in Figure
the performance of the algorithm under several 8, the hue values remain relatively invariant with
hue distributions in the scene. the position of the camera. Therefore, hue value of
the pixel, taken from the Hue-Saturation-Lightness
(HSL) model, is used as the fourth dimension in
the point association process. In Figure 9, the
hue rendered point cloud of color point cloud in

511
Algorithms for 3D Map Segment Registration

Figure 6. Rubik’s cube camera images take from two vantage points

Figure 3(b) is shown. Hue values are normalized weighted hue value. Hue values are normalized
between 0 and 1. The hue distribution is typically to a 0-1 range and must be weighted during the
similar to the color distribution in Figure 3(b). closest point search in the four-dimensional space.
In order to normalize the coordinates, we find the
Construction of a Weighted 4D bounding box for each point cloud segment and
Search Space the coordinate space is rescaled to a 0-1 range.
The normalized variant for point association is
Both hue and range value have to be combined {x, y. z, hw}, where x=xo/rx, y=yo/ry, z=zo/rz. rx, ry,
together in the H-ICP variant as {xo, yo, zo, hw} rz are the dimensions of the bounding box in x,
for point association. xo, yo, zo are the original y, z directions.
coordinate values with distance units and hw is the

Figure 7. RGB distribution varies with camera positions

512
Algorithms for 3D Map Segment Registration

Figure 8. HSL distribution: Hue remains invariant

Figure 9. Hue rendered point cloud of the scene shown in Figure 3

The weight value for the hue dimension should k-d Tree Based Point
be properly selected for point association. Since Cloud Association
both range and hue value are normalized from 0
to 1. Weight for hue represents its influence in In 3D ICP algorithm, corresponding points are
the nearest neighbor search process. Low weight searched according to the closest distance rule.
biases the point association towards the range This may cause incorrect matching during single
data and a high weight towards the hue values. iteration loop as Figure 10. Dashed line circle
Small weight values for the hue correspond to the illustrates range based nearest point association
traditional 3D-ICP. Hue weight should be selected results, in which all points in data set look for
between 10% and 35% for accurate point asso- nearest neighbor in 3D space. It takes more than
ciation. Error in H-ICP will be evaluated by the one iteration to pair correct nearest neighbor points
average mean square root distance of normalized for given data points set. Grey circle denotes the
associated point pairs. H-ICP nearest point search that also uses the
correct hue property in finding the best neighbor
in the model. Depending on the correct color
information, corresponding point can be locked
with less iteration.

513
Algorithms for 3D Map Segment Registration

Figure 10. Point association based on nearest distance (dotted) and nearest distance and hue (solid)

The ICP computation speed and precision are tree and follows the same procedure as insertion.
highly dependent on association process. Use of Search stops at a leaf node of the tree and the
a k-d tree for closest point search and association points in the model tree within the search radius
or the Nearest Neighbor Search (NNS) problem are identified. The nearest point is obtained using
increases the speed and efficiency of the search. distance computation. Figure 11(b) shows the
The k-d tree is a spatial partitioning data structure nearest neighbor (red square) for the search point
that stores and organizes data in a k dimensional at the center of the circle. The nearest point is
space. The k-d tree is a generalized type of bi- then regarded as the point associated with the
nary tree, with every leaf node is a k-dimension- search point.
al data point that splits the hyperspace into two In 3D closest point search, the distance between
subspaces. Splitting is done sequentially from the 2 points between 2 point clouds is:
first dimension to the kth dimension. A typical k-d
tree in 2D space is shown in Figure 11(a). Each rij = (mix − d jx )2 + (miy − d jy )2 + (miz − d jz )2
point in the 2D space divides the space sequen-
(8)
tially into a left-right spaces (about x-axis) or into
a top-bottom spaces (about y-axis).
in which, di{dix,diy,diz} and mj{mjx,mjy,mjz} are
Nearest neighbor search can be done very ef-
point spatial coordinates in data and model point
ficiently on k-d trees. For a given point with known
cloud respectively.
coordinates in the data point cloud and a search
radius, the algorithm recursively moves down the

Figure 11. k-d tree construction and nearest neighbor search in 2D space: (a) k-d tree construction in
2D space (b) 2D space nearest neighbor search in k-d tree

514
Algorithms for 3D Map Segment Registration

In 4D space, the 4th dimension for each point Use of hue as a fourth dimension is significant
should be weighed hue value dhw or mhw. The in those instances where the coordinate based
spatial value of points should be normalized by matching results in a non-unique registration. For
3D search radius rij as mentioned in section 4.1. example, if the points in the model and the data
In order to accomplish closest point search in point clouds belong to a plane, traditional coordi-
4D space, the distance between two normalized nate based ICP results in non-unique association
points di{dix,diy,diz,,hihw} and mj{mjx,mjy,mjz,mjhw } of points. In such cases, using the hue value may
should be: result in unique registration of the points. The
color assisted ICP algorithm in this paper can be
rij ' = (mix − d jx )2 + (miy − d jy )2 + (miz − d jz )2 + (mihw − d jhw )2 described as follows.
(9)
1. Estimate the initial transformation matrix R
or and T;
2. Construct k-d tree of model point cloud
M{m1,m2,m3…mM}, hue value has been
rij ' = rij2 + ∆hijw 2 (10)
weighted as the 4th dimension;
3. While merging error ε >preset error
In the ICP process, search radius effects the Use R and T to transfer data point cloud
computation time and final result. A constant D{d1,d2…dN}.
search radius is applied for all iteration loops.
 
Once the search radius is large, too many points D = RD + T
will be included as candidates and increases the
computational burden. Candidate points cloud will
4. For i=1 to length of data point cloud
be missed if search radius is too small. The search
Search closest point for point di {dix, diy, diz
radius is determined by the density of point cloud.
, dih} in model k-d tree
In 4D k-d tree search, the search radius comprises
If closest point mj exists in search range r
of two parts -- a distance part and weighted hue
Pair di and mj as {dk, mk};
part as seen in Equation (9). The search range for
k++;
3D distance is selected such that it ensures about
End If
50 candidate points within search radius. As hue
End For
value is not transformed at iteration, hue search
5. Acquire paired point cloud Dp and Mp, con-
is analogous to filtering. If the weight for hue is
tain N Points, calculate normalized mean
high, k-d tree search will bias toward hue dimen-
square root distance ε as error,
sion. Therefore, appropriate hue weighting ensures
that spatial search dominates over hue filtering. ε=
The ICP algorithm iteratively converges at 1 N

minimum error, which is described by mean square N
∑ (dix − mix )2 + (diy − miy )2 + (diz − miz )2
i =1
root of the spatial distance between paired points.
At each iteration, a rigid transformation matrix
is computed so that the distance error metric be- 6. Construct orthonormality matrix H
tween the associated points is minimized. Data (Equation14) and solve rigid rotation R
point cloud is transformed into the model space and translation T (Equation15, 16) for next
using the computed transformation matrix. This iteration;
iteration continues until error metric converges. End While

515
Algorithms for 3D Map Segment Registration

Solving Rigid Transformation matrix H can be constructed based on m ′ {mi ′ ,


i=1… N} and d ′ {di ′ , i=1… N}.
ICP algorithm is an iteration process to calculate
rigid transformation matrix based on associated
S S xy S xz 
point clouds. Mi = {mix,miy,miz} represent the co-  xx
ordinates of the ith point in the model point cloud H = Syx Syy Syz 
and dj = {djx,djy,djz} is the jth point in data point  
S zx S zy S zz 

cloud. Rigid transformation (R) that minimizes
the error measure E(R,T) shown in Equation (11)
is determined. Where

N
N
1 S xx = ∑ m 'ix d 'ix
E (R,T ) =
N
∑m i
− (Rdi + T ) (11)
i =1
i =1 N
Syy = ∑ m 'iy d 'iy
A centroid for the associated points is calcu- i =1
N
(14)
lated as the first step (Equation12) and associ- S zz = ∑ m 'iz d 'iz
ated points are translated into centroid relative i =1
N
coordinates (Equation13). Orthonomal matrix of S xy = ∑ m 'ix d 'iy
associated points can then be constructed as shown i =1

in Equation14. Rotation R and translation T are


decoupled based on the gravity center of associ- Singular value decomposition is performed
ated points. Using Singular Value Decomposition on constructed H matrix for optimal rotation R
(SVD) methods, R can be determined as shown
in Equation15. Translation T is computed using H = U ΛV T (15)
Equation16.
where optimal rotation R = VU T .
N N
1 1 The translation T can be calculated as
m=
N
∑ m ,.d = N ∑ d
i i
(12)
i =1 i =1
T T
T = m − Rd (16)
In which, m = {m x , m y , m z } a n d
d = {d x , d y , d z } are the center points of associ- Convergence Criteria
ated points in model and data point clouds. N is
the amount of point pairs. The coordinated of Convergence and stopping criterion for the H-ICP
associated point in center point relative space iteration are discussed in this sub-section. An as-
should be sociation stability criteria is introduced as the one
of the convergence criteria. Association stability,
mi ′ = mi − m, di ′ = di − d (13) denoted as S, is defined as the number of points
which changed their paired point in any iteration.
If a point comes into association or changes its
In which, mi′ = {mix′ , miy′ , miz′ } and nearest neighbor, S is incremented. Large value of
di′ = {dix′ , diy′ , diz′ } are the ith associated point with S signifies that point association has not stabilized.
center relative coordinates. The orthonormality H-ICP iteration is terminated when S vanishes and
the distance error converges.

516
Algorithms for 3D Map Segment Registration

A pair wised color point H-ICP registration MAP REGISTRATION WITH ICP AND
was accomplished based on above criteria. Model H-ICP
point cloud contained 122,409 points with color
attributes. Data point cloud is extracted from The hue distribution or the color of the model
model point cloud with a known rotation (θ z=5o). is generally independent of the geometry. If the
The H-ICP registration process is compared with entire body is painted with a color of a single hue,
3D ICP, error as shown in Figure 12(a). The as- H-ICP is as effective as the traditional ICP. In
sociated point number reaches maximum after this section, we describe the performance of the
the 5th iteration (Figure 12(b)), but error has not algorithm under various hue distribution scenarios.
converged. From Figure 12(c) the association The Stanford bunny point cloud is considered as
stability (S) reaches 0 after 15th and 26th iteration the benchmark data set. In HSL color space, hue
for H-ICP and 3D ICP respectively. Error and value varies from 0- 360. The color correspon-
rigid transformation are shown in Figure 12(a) dence between RGB and hue is given in Table 1.
and Figure 13. The known transformation (θ z=5o)
is recovered by the H-ICP and ICP algorithms. Environments with Fixed Hue
Distributions

For the first experiment, we textured the Stanford


bunny point cloud model as shown in Figure
14(a). In this model, the hue varies from 0 to 360
with from bottom to top at Z direction in seven

Figure 12. Building color point cloud registration comparison between H-ICP and 3D ICP algorithm:
(a) Comparison of error convergence (b) Association number convergence (c) Association stability
convergence

517
Algorithms for 3D Map Segment Registration

Figure 13. Convergence of translation and rota-


Continuously Varied Hue along One
tion estimates during registration
Dimension

In the second simulation, a continuous hue distri-


bution is assigned to the bunny model. The hue
value is varied from 0 to 360, smoothly, along
the z (vertical) direction. The resultant model
and data clouds are shown in Figure 16 (a), (b).
Saturation and lightness value have been set as
constant at every point inside dataset. Hue value
can be calculated by Equation (17).

z j − z min
h = 360 (17)
z max − z min

h is the hue value at range point i, zi is the


coordinate distance for ith point at z direction, zmax
and zmin are maximum and minimum coordinate
Table 1. Hue and RGB values for several com- of the point cloud at z direction.
mon colors Continuous hue distribution on point cloud
data is registered together (Figure 16 (c)) and the
Color R G B Hue results are shown in Figure 17. A comparison of
Gray 128 128 128 0 model performance on discrete and continuous
Yellow 255 255 0 60 distribution of hue on the same model shows
Green 0 255 0 120 the expected acceleration in performance due to
Cyan 0 255 255 180 uniform distribution of hue on the model.
Blue 0 0 255 240
Magenta 255 0 255 300
Red 255 0 0 360
Figure 14. Registration of point clouds with
uniformly distributed hues: (a) Stanford Bunny
segments. Figure 14(b) also shows the initial point clouds with hue distributed as seven distinct
registration of the model and data point clouds stripes (b) Registered color point cloud
used for this simulation.
The H-ICP registration progress is shown in
Figure 15(a) and Figure 15(b). Figure 15(a) shows
the mean square error during the ICP process and
Figure15 (b) shows the number of points associ-
ated during iteration loops. Both data and model
point cloud after registration is shown in Figure
14(b). The hue-assisted ICP registers the point
and data clouds faster than the traditional coor-
dinate based ICP.

518
Algorithms for 3D Map Segment Registration

Figure 15. Registration comparisons between H-ICP and 3D ICP algorithm: (a) Mean square error
comparison (b) Associated point number comparison

Figure 16. Bunny model with continuous hue variation in one axis: (a) Data point cloud (b) Model point
cloud (c) Merged view

Figure 17. Registration comparisons between 7 segment hue model and continuous hue model: (a) Mean
square error comparison (b) Associated point number comparison

519
Algorithms for 3D Map Segment Registration

Randomized Hue on the Model Effect of Imaging Noise

In this case, the model considered has a continu- In the previous simulation, the imaging sensor is
ously distributed hue but with a randomized and assumed perfect. The hue on a point is assumed
noisy pattern. In this case, there is no geometric to be recorded by the imaging sensor perfectly
pattern for the color on the object. The color in both model and data clouds. Some noise in
point clouds are rendered in Figure 18 (a, b). the color measurement can be expected when
The merged cloud point cloud after registration the point clouds are generated from two vantage
is shown in Figure 18(c). Figure 19 shows the points (Gebre et al., 2009). Considering this situa-
error minimization iteration and comparison with tion, we colorized the bunny model but with 50%
the seven-segment hue distribution model. In this noise in the sensor. The points in the model and
case, the hue confuses the nearest neighbor search. data clouds differ in color by as much as 50%.
The registration accuracy is also not as good as a The resulting point clouds are shown in Figure
patterned hue case. 20(a, b). The merged color point cloud is shown
as Figure 20(c).

Figure 18. Bunny point cloud with randomized hue distribution: (a) Data point cloud (b) Model point
cloud (c) Merged view

Figure 19. Comparison between discrete and random hue distribution case: (a) Mean square error
comparison (b) Associated point number comparison

520
Algorithms for 3D Map Segment Registration

Figure 20. Hue mapped with noise: (a) Data point cloud (b) Model point cloud (c) Merged view

Hue assisted color ICP matching result in point clouds with known (and exact) registration
camera noise color point cloud is compared with transformation. Both H-ICP algorithm and 3D ICP
3D ICP matching performance. From Figure 21, algorithm have been applied on a building data set
noise in hue decreases the matching accuracy and (Gebre et al., 2009). The data point cloud is taken
reduces the iteration efficiency. Two groups of from a view position that is 10° off in Y and Z axis
cloud point clouds are selected to evaluate the from the model point cloud. Translation between
performance of H- ICP algorithm compared with the point clouds is known to be 2.46, 2.612 and
typical 3D ICP. A known transformation point 0.347 along the X, Y, and Z respectively. Same
cloud data pair was generated by transforming parameters for registrations are selected to be
model point cloud at 6DOF to compare the con- the same as in the previous 1-DOF registration.
vergence speed and registration accuracy as the Error comparison and associated point number
rigid transformation is already known. Outdoor comparison are shown in Figure 22(a) and (b).
large scale area pair wised registration includes Association stability is shown in Figure 22(c).
8 pair wised data registration. The evolution of rigid transformation during ICP
is shown in Figure 23. The H-ICP completes reg-
Registration with Six DOF Rotation istration after 102th iteration and the traditional 3D
ICP after the 164th iteration, which demonstrates
In this experiment, registration speed between 3D the effectiveness of H-ICP for registering complex
ICP and H-ICP are compared using data and model

Figure 21. Comparison between H- ICP in and 3D ICP for noisy hue case: (a) Mean square error com-
parison (b) Associated point number comparison

521
Algorithms for 3D Map Segment Registration

Figure 22. Registration comparisons between 3D ICP and H-ICP algorithm: (a) Mean square error
comparison (b) Associated point number comparison (c) Stability comparison

and realistic point clouds. The merged color point required to register the point clouds is shown in
cloud about building is shown in Figure 24. Table 2. H-ICP requires less number of iterations
than 3D ICP.
Sequential Registration of Multiple
Point Clouds Figure 23. Convergence of translation and rota-
tion estimates during registration
3D ICP and H ICP algorithms have been applied
on several outdoor map segments. Color point
clouds taken from eight different vantage points
have been registered together to construct a large
scale color point cloud map. Figure 25 shows the
top view of outdoor mapping area in aerial image.
This scene includes trees, road, electrical poles
and buildings. Figure 26 shows the registered map
and the vantage points from which map segments
are obtained. Pair-wise registration is applied to
construct a single map about the reference coor-
dinate of the first map segment. 3D search radius
in k-d tree was set as 1.5 and the 3D range data
was normalized based on this radius. Hue value
was normalized to a 0-1 range, hue search radius
was set to be 0.15, and hue weight was set to
5.0. The final error and the number of iterations

522
Algorithms for 3D Map Segment Registration

Figure 24. Registered data and model point clouds

Figure 25. Aerial image of outdoor mapping area and vantage positions

Figure 26. Top view of eight sequentially registered color point cloud maps

523
Algorithms for 3D Map Segment Registration

Table 2. Sequential registration of multiple point


FUTURE RESEARCH DIRECTIONS
cloud maps

3D ICP H- ICP 3D ICP H- ICP Point clouds are inefficient representations of


Position Iterations Iterations Error Error geometry. Some of the future research directions
2 45 35 0.842 0.856 can include:
3 54 44 0.929 0.961
4 77 54 0.039 0.290 a. Efficient generation of higher order geo-
5 49 43 0.104 0.319 metric representations --- lines, surfaces and
6 66 59 0.165 0.179 solids from the point cloud data;
7 73 69 0.129 0.128 b. Map completeness measures that predict
8 99 95 0.068 0.070 the geometry missing in the occluded areas
based on a knowledge-base; and
c. Extra sensing modalities such as infrared
This experiment proves that faster registration or thermal imaging, acoustic/ultrasonic and
will be conducted by adding color value into radio frequency imaging to help determina-
registration progress. Position 3 and 4 acquired tion of materials in the scene.
point clouds have been registered together and
shown in Figure 27, Figure 27(a) describes two Architecture, surveying and engineering fields
different point clouds with two different colors; have considerable needs for automatic or semi-
point cloud at position 4 (black) has been registered automatic conversion of 3D point clouds into
into position 3 point cloud (blue). Combined point higher order line, surface and solid models that are
clouds with color are shown in Figure 27(b). compatible with commercial CAD software. This

Figure 27. Map registered from scans taken from two vantage points: (a) Registered position 4(black)
point cloud into position 3 (blue) point cloud (b) Color point cloud after registration

524
Algorithms for 3D Map Segment Registration

enables bringing the point cloud data into existing REFERENCES


business processes like generation of drawings for
code compliance, additions and modifications to Andreason, H., & Lilienthal, A. J. (2010). 6D scan
existing built areas and remodeling interior spaces. registration using depth-interpolated local image
features. Robotics andAutonomous Systems, 59,
157-165.
CONCLUSION Andresson, H. (2007). Vision aided 3D laser
scanner based registration. Paper presented at
This chapter describes an algorithm to introduce European Conference on Mobile Robots: ECMR.
color attribute into point cloud registration process
and fundamental algorithms for autonomous ro- Arun, K. S., Huang, T. S., & Blostein, S. D.
botic complete mapping. Normalization of range (1987). Least square fitting of two 3D-Point sets.
data and hue value have been applied during the IEEE Transactions on Pattern Analysis and Ma-
registration process and quantitatively evaluate the chine Intelligence, 9(5), 698–700. doi:10.1109/
effect of hue search range and weight for the point TPAMI.1987.4767965
association process. Different hue distribution and
Basilico, N., & Amigoni, F. (2009). Exploration
noise effect have been discussed with specific hue
strategies based on multi-criteria decision mak-
rendered color point clouds. A building data set and
ing for an autonomous mobile robot. Presented at
large-scale outdoor point cloud has been registered
European Conference on Mobile Robots, Mlini/
using image data assisted algorithm. Use of the
Dubrovnik, Croatia.
hue value to assist the point association and error
minimization is shown to be effective during the Blais, F. (2004). Review of 20 years of range sen-
ICP iteration schemes. Higher dimensional point sor development. Journal of Electronic Imaging,
association based on weighted hue and range data 13, 231–243. doi:10.1117/1.1631921
leads more accurate point matching result, conduct
Browell, E. V., Butler, C. F., & Ismail, S. (1990).
earlier convergence of ICP progress, and reduce
Airborne lidar observations in the wintertime
computation time. When rigid transformation is
arctic stratosphere: Polar stratospheric clouds.
been application in every iteration loop during the
Geophysical Research Letters, 17, 385–388.
ICP period, hue value does not change in space
doi:10.1029/GL017i004p00385
transformation. However, in HSL data space,
Lightness should change according to the view Bsel, P. J. (1992). A method for registration of 3D
angle and light position. Corresponding point shapes. IEEE Transactions on Pattern Analysis
search using additional lightness value could and Machine Intelligence, 14, 239–256.
be a further research field to increase Color ICP
Chibunichev, A. G., & Velizhev, A. B. (2008).
algorithm.
Automatic matching of terrestrial scan data
using orientation histograms. Presented at the
International Archives of the Photogrammetry,
ACKNOWLEDGMENT
Remote Sensing and Spatial Information Sciences,
Beijing, China.
The authors acknowledge the partial support
from US/ARMY ARDEC for the development Chua, C. J. R. (1997). Point signatures: A new
of ROAMS robot. We acknowledge the help of representation for 3D object recognition. Inter-
Mr. Biruk Gebre for generating some of the scan national Journal of Computer Vision, 25, 63–85.
point cloud maps used in this chapter. doi:10.1023/A:1007981719186

525
Algorithms for 3D Map Segment Registration

Chung, D., & Lee, Y. D. S. (1998). Registration of Lorusso, A. (1995). A comparison of four algo-
multiple range views using the reverse calibration rithms for estimating 3D rigid transformations.
technique. Pattern Recognition, 31(4), 457–464. Presented at British Machine Vision Conference.
doi:10.1016/S0031-3203(97)00063-0
Makadia, A., Iv, E. P., & Daniilidis, K. (2006).
Davison, A. J. (2003). Real-time simultaneous Fully automatic registration of 3D point clouds.
localization and mapping with a single camera. Presented at the 2006 Computer Society Confer-
Presented at the 9th IEEE Computer Society ence on Computer Vision and Pattern recognition,
Conference on Computer Vision and Pattern New York, NY.
Recognition, Madison, Wisconsin.
Men, H., & Pochiraju, K. (2010). Hue assisted reg-
Druon, S. (2007). Color constrained ICP for istration of 3D point clouds. Presented at ASME
registration of large unconstructed 3D/Color data 2010 Interaional Design Engineering Technical
sets. Presented at IEEE International Conference Conference, Montreal, Canada.
on Robotics and Automation, Roma, Italy.
Montemerlo, M., Thrun, S., et al. (2003). Fast-
Fernandez, J. C., & Singhania, A. (2007). An SLAM 2.0: An improved particle filtering algo-
overview of lidar point cloud processing software. rithm for simultaneous localization and mapping
GEM Center Report, University of Florida. that provably converges. Presented at the Interna-
tional Joint Conference on Artificial Intelligence,
Gebre, B., Men, H., & Pochiraju, K. (2009). Re-
Acapulco, Mexico.
motely operated and autonomous mapping system
(ROAMS). Paper presented at the 2nd Annual Newman, P., Cole, D., & Ho, K. (2006). Outdoor
IEEE International Conference on Technologies SLAM using visual appearance and laser rang-
for Practical Robot Applications, Woburn, MA. ing. Presented at IEEE International Conference
on Robotics and Automation, Orlando, Florida.
Johnson, A. E., & Kang, S. B. (1997). Registration
and integration of textured 3D data. Presented Nüchter, A. (2005). 6D SLAM with approximate
at International Conference on Recent Advances data association. Presented at IEEE International
in 3D Digital Imaging and Modeling, Ottawa, Conference on Robotics and Automation, Barce-
Canada. lona, Spain.
Joung, J. H., An, K. H., Kang, J. W., et al. (2009). Nuchter, A. (2007). Catched k-d tree search for
3D environment reconstruction using modified ICP algorithms. Presented at 6th International
color ICP algorithm by fusion of a camera and a Conference on 3-D Digital Imaging and Model-
3D laser range finger. Presented at the 2009 IEEE ing, Montreal, Canada.
International Conferrnce on Intelligent Robos and
Oh, J. S., & Choi, Y. H. (2004). Complete
Systems, St. Louis, USA.
coverage navigation of cleaning robots using
Johnson, A. (1997). Spin-images: A representation triangular-cell-based map. IEEE Transactions on
for 3d surface matching. Doctoral dissertation, Industrial Electronics, 51, 718–727. doi:10.1109/
Carnegie Mellon University, USA. TIE.2004.825197
Kostelec, P. J., & Rockmore, D. N. (2008). FFT Olson, C. F. (2007). Probabilistic self-local-
on the rotation group. Journal of Fourier Analy- ization for mobile robot. IEEE Transactions
sis and Application, 14, 145–179. doi:10.1007/ on Robotics and Automation, 16, 55–67.
s00041-008-9013-5 doi:10.1109/70.833191

526
Algorithms for 3D Map Segment Registration

Robertson, C. (2002). Parallel evolutionary Yamauchi, B. (1997). A frontier based approach


registration of range data. Computer Vision and for autonomous exploration. Presented at IEEE
Image Understanding, 87, 39–50. doi:10.1006/ International Conference on Autorotation and
cviu.2002.0981 Robotics, Albuquerque, NM.
Rusinkiewucz, S. (2001). Efficient variants of Zelinsky, A. (1994). Planning paths of com-
the ICP algorithm. Presented at 3rd International plete coverage of an unstructured environ-
Conference on 3-D Digital Imaging and Modeling, ment by a mobile robot. The International
Quebec City, Canada. Journal of Robotics Research, 13(4), 315.
doi:10.1177/027836499401300403
Salvi, J., Matabosch, C., & Fofi, D. (2007).
A review of recent range image registration
methods with accuracy evaluation. Image and
ADDITIONAL READING
Vision Computing, 25, 578–596. doi:10.1016/j.
imavis.2006.05.012 Blais, F. (2004). Review of 20 Years of Range Sen-
Spletzer, J. R. (2003). Sensor fusion techniques for sor Development. Journal of Electronic Imaging,
cooperative localization in robot teams. Doctoral 13, 231–243. doi:10.1117/1.1631921
dissertation, University of Pennsylvania. Druon, S. (2007), Color Constrained ICP for Reg-
Tarel, J., Civi, H., & Cooper, D. (1998). Pose istration of Large Unconstructed 3D/Color Data
estimation of free-form 3D objects without point Sets, Presented at IEEE International Conference
matching using algebraic surface models. Pre- on Robotics and Automation, Roma, Italy.
sented at IEEE Workshop on Model-based 3D. Gebre, B., Men, H., & Pochiraju, K. (2009),
Thrun, S. (1993). Exploration and model building Remotely Operated and Autonomous Map-
in mobile robot domains. Paper presented at IEEE ping System(ROAMS), Paper presented at the
International Conference on Neural Networks, 2nd Annual IEEE International Conference on
San Francisco, USA. Technologies for Practical Robot Applications,
Woburn, MA.
Thrun, S. (2003). Robotic mapping: A survey. In
Lakemeyer, G. (Ed.), Exploring artificial intel- Fernandez, J. C., & Singhania, A. (2007). An
ligence in the new millennium (pp. 1–35). Morgan Overview of Lidar Point Cloud Processing Soft-
Kaufmann Publishers Inc. ware, GEM Center Report. University of Florida.

Thrun, S., Fox, D., Burgard, W., & Dellaert, Lorusso, A. (1995), A Comparison of Four Algo-
F. (2000). Robust Monte Carlo localization for rithms for Estimating 3D Rigid Transformations,
mobile robots. Artificial Intelligence, 128(1-2), Presented at British Machine Vision Conference.
99–141. doi:10.1016/S0004-3702(01)00069-8 Makadia, A., Iv, E. P., & Daniilidis, K. (2006),
Welch, G., & Bishop, G. (1995). An introduction Fully Automatic Registration of 3D Point Clouds,
to the Kalman filter. Technical Report TR 95-041. Presented at the 2006 Computer Society Confer-
University of North Carolina, Department of ence on Computer Vision and Pattern recognition,
Computer Science. New York, NY.

Whyte, H. D., & Bailey, T. (2006). simultaneous Men, H., & Pochiraju, K. (2010), Hue Assisted
localization and mapping (slam): part i- the es- Registration of 3D Point Clouds, presented at
sential algorithms, IEEE Robotics and Automation ASME 2010 Interaional Design Engineering
Magazine. Technical Conference, Montreal, Canada.

527
Algorithms for 3D Map Segment Registration

Montemerlo, M., Thrun, S., et al. (2003), Fast- Thrun, S. (2003). Robotic Mapping: A Survey,
SLAM 2.0, An Improved Particle Filtering Exploring artificial intelligence in the new mil-
Algorithm for Simultaneous Localization and lennium (pp. 1–35). Morgan Kaufmann Publish-
Mapping that Provably Converges, Presented at ers Inc.
the International Joint Conference on Artificial
Intelligence, Acapulco, Mexico.
KEY TERMS AND DEFINITIONS
Nüchter, A. (2005), 6D SLAM with Approximate
Data Association. Presented at IEEE Interna- 3D Map: 3D map represents certain level of
tional Conference on Robotics and Automation, spatial information about the geometry features in
Barcelona, Spain. specific area. The map is dimensionally accurate
Nuchter, A. (2007), Catched k-d tree Search for and may have a relative or absolute reference.
ICP Algorithms, Presented at 6th International 3D Mapping: 3D mapping is the process of
Conference on 3-D Digital Imaging and Model- applying measurement devices to construct 3D
ing, Montreal, Canada. map about specified environment.
Color Point Cloud: Discrete points group
Rusinkiewucz, S. (2001), Efficient Variants of with both dimensiaonl accurate measurement
the ICP Algorithm, Presented at 3rd International and texture property, normally generated by both
Conference on 3-D Digital Imaging and Modeling, ranging device and color camera.
Quebec City, Canada. Map Registration: A process to accurately
Salvi, J., Matabosch, C., & Fofi, D. (2007). A stitch pair or multiple point clouds together into
Review of Recent Range Image Registration single point cloud.
Methods with Accuracy Evaluation. Image and Point Cloud: Discrete points group with ac-
Vision Computing, 25, 578–596. doi:10.1016/j. curate 3D coordinates describing object surface
imavis.2006.05.012 dimensional measuremnts, usually contructed by
laser ranging devices.

This work was previously published in Depth Map and 3D Imaging Applications: Algorithms and Technologies, edited by Aamir
Saeed Malik, Tae Sun Choi and Humaira Nisar, pp. 56-86, copyright 2012 by Information Science Reference (an imprint of
IGI Global).

528
529

Chapter 32
TVAL-Farm:
A Qualitative Enhancement
of the LESA Model

Art Rex
Appalachian State University, USA

Leah Greden Mathews


University of North Carolina at Asheville, USA

Anne Lancaster
Independent Researcher, USA

ABSTRACT
The Total Value Assessment Tool for Farmland (TVAL-Farm) is a tool which incorporates scenic qual-
ity and cultural heritage elements to create an enhanced Land Evaluation and Site Assessment (LESA)
model. The enhancement of the LESA model provides insight and a framework on how to collect and
incorporate qualitative public values within the quantitative environment of a Geographic Information
Systems (GIS). Inclusion of these public values is essential for holistically valuing land parcels and us-
ing LESA to make land protection decisions.

INTRODUCTION has been used across the country as a decision


making resource particularly in the context of
This research presents the Farmland Values Proj- farmland preservation programs. This project
ect1’s TVAL-Farm tool which incorporates scenic develops a method for enhancing LESA’s Site
quality and cultural heritage elements to create an Assessment Factor 3 (SA-3), factors measuring
enhanced Land Evaluation and Site Assessment “Other Public Values.” The enhancement of the
(LESA) model. The LESA model, developed LESA model provides insight and a framework on
by the United States Department of Agriculture how to collect and incorporate qualitative public
Natural Resource Conservation Service (NRCS), values data within the quantitative environment
of a Geographic Information System (GIS). This
information is not typically included in farmland
DOI: 10.4018/978-1-4666-2038-4.ch032

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
TVAL-Farm

preservation benefit estimates and thus represents While LESA has the potential for being a
a significant bridge between the qualitative and useful tool for farmland preservation, it has not
quantitative studies of the importance of farmland been used in Western North Carolina. The need
to rural areas. for linking LESA and GIS has been stressed
LESA is a numeric rating system created by the (Soil and Water Conservation Society, 2003).
NRCS to evaluate a parcel’s relative agricultural Once site and benefit measures are tied to GIS
importance. The land evaluation (LE) component data, future land use scenarios can be applied to
of a LESA system measures soil quality (Pease identify which strategies maximize all the factors
& Coughlin, 1996). It is usually based on land determined best for farmland preservation. The
capability classes, important farmland classes, soil enhanced GIS-LESA model we have developed,
productivity ratings and/or soil potential ratings TVAL-Farm, is a tool to help identify farmland
(Pease & Coughlin, 1996). The site assessment most suitable for protection efforts.
(SA) component evaluates other factors that
contribute to the site’s agricultural importance
such as parcel size and on-farm investments. SA THE STUDY AREA
factors may also consider indicators of develop-
ment pressure such as distance to sewer and water The study area used to test our methodology
infrastructure and public amenities like wildlife consists of four contiguous counties in the North
habitat or scenic views (Pease & Coughlin, 1996). Carolina Department of Agriculture and Consumer
LESA systems assign points and a relative weight Service’s (NCDACS) Western Mountain District:
to each of the LE and SA factors. The sum of the Buncombe, Henderson, Madison, and Haywood
weighted ratings is the LESA score; the higher (Figure 1). These counties provide an excellent
the LESA score, the more significant the site for test region for our methodologies because parts of
agriculture. States and localities often adapt the the region are fairly urbanized while others have
federal LESA system to meet their needs. Once a great potential for growth. Buncombe is the most
local system is approved by NRCS, it supersedes populated county in our study area with 314 people
the federal system: NRCS is required to use the per square mile and 6,454 non-farm establish-
local version to review federal projects (Pease et ments. Henderson is less populated but growing
al., 1994, p. 62). at a faster rate with 238 people per square mile
Our enhancements focused on two benefits, and 2,302 non-farm establishments. Haywood’s
scenic quality and cultural heritage significance, population density is 97.6 people per square mile;
derived from survey information gathered in a the county has 1,411 non-farm establishments.
community mapping activity conducted as part Madison is the least populated with 44 people
of the Farmland Values Project. Following the per square mile and 309 non-farm establishments
methodology of LESA, we assigned weights to (U.S. Census Bureau, 2004). Madison County is
each benefit category in order to derive the holistic perceived to be under an urgent threat of urban-
benefit valuation score. Alternative weighting ization since the recent completion of Interstate
measures may also be calculated based on public 26 through the county now makes it more acces-
input from various stakeholder groups. The pri- sible to commuters and tourists, thus raising the
mary product of this research is an enhanced geo- likelihood that property values will increase and
spatial database and assessment tool that includes create additional stressors on farmland.
qualitative layers for several types of farmland According to the NCDACS, Buncombe and
benefits; we call it the Total Value Assessment Henderson counties have the highest cash receipts
Tool for Farmland (TVAL-Farm). from farm goods in Western North Carolina (NC-

530
TVAL-Farm

Figure 1. The study area

DACS, 2004). In the most recent Census of Ag- 20 years. Land use change is expected to continue,
riculture, 2002-2007, Buncombe County led the in part, because the region has relatively lax land
region in loss of farmland acreage with a 24% use regulation. An exception is the steep slope
loss. Henderson and Madison saw similar losses ordinance that Haywood and Buncombe counties
(22% and 21%, respectively) while Haywood lost have recently adopted to regulate development
13% of farmland acreage (USDA NASS, 2009). on mountainsides. These regulations have the
The area is popular with tourists largely consequence of shifting development pressure
because of its scenic quality (Brothers & Chen, to mid-slope and valley lands which are also the
1997; Kask et al., 2002; Mathews, Stewart, & most desirable lands for agriculture.
Kask, 2003), and regularly attracts individuals
who choose to retire or purchase a second home
in the region because of its high quality of life. AN ENHANCED LESA MODEL
USDA’s National Resources Inventory shows
a rapid decrease in farmland in Western North Figure 2 illustrates the TVAL-Farm model input,
Carolina over the past 20 years (USDA NRCS, the selected ModelBuilder (ESRI ™) tool, and the
2000); if this data is projected into the future much modeled results. All data are raster format with a
of the remaining farmland will be lost in the next spatial resolution of 90 x 90 feet. The traditional

531
TVAL-Farm

Figure 2. TVAL-Farm, a qualitative enhancement of the LESA model

LESA has four input data sets: population, land cludes cultivated crops and pasture, areas which
value per acre, agricultural soils potential, and are often considered as prime development lands.
land use/land cover. The TVAL-Farm enhance- Lands that were already developed or classified
ment adds two more--scenic value and cultural as water were valued at the lowest end of the
value--bringing the total to six model input layers. scale (1) as these areas would be near impossible
to develop further without large capital outlays.
Model Input, Data Collection, and
Processing Population

Land Use/Land Cover Population data was acquired from Landscan’s 1


km population data base available from the Oak
2001 Land Use/Land Cover data was collected Ridge National Laboratory (ORNL, 2003). Data
from the U.S. Geological Survey’s National Land was re-sampled and recalculated to reflect the
Cover Dataset (Homer et al., 2004). Fourteen land population per square mile. Class ranges were
cover categories identified by the National Land then assigned as indicated in Table 1. Areas with
Cover Dataset (NLCD) are found in our study moderate population density bring both threats
area; these were regrouped into five classes for (development including roads, infrastructure,
the TVAL-Farm model (Table 1). A value of 5 housing, and commercial development) and op-
indicates the land cover type most likely under portunities (proximity to markets, opportunities for
pressure to be developed; in our model, this in- roadside stands, and walk-in or drive by business

532
TVAL-Farm

Table 1. TVAL-Farm model values

Model Land Use/ Population (people/


Agricultural Soils Potential Land Value ($/acre)
Value Land Cover square mile)
Cultivated Crops, 0-5% slope with deeper loamy soils often in
5 41-400 < $10,000
Pasture/Hay flood prone areas that may be waterlogged
5-12% slope with lowland soils that are
Forest (Deciduous,
4 4-40 fairly deep and well drained, transitioning to $10,001-25,000
Evergreen and Mixed)
foothills
Shrub/Scrub, Grass- 12-20% slope with well drained mid-slope
3 <4 $25,001-75,000
land soils of varying depth and quality
Barren Land, Wet- 20-30% slope, well drained soil with varied
2 401-600 $75,001-250,000
lands depth but primarily shallow
Open Water, Devel-
> 30% slope with very shallow, very well
1 oped > 1600 > $250,000
drained, and often acidic soils
(all levels)

that are not economically viable in more remote are protected from flooding. Prime soils are deep
locations). As a result, in our model the Wildland loamy soils with gentle slopes (less than 5%);
Urban Interface (WUI)/Mixed classification these are classified as excellent in the TVAL-
(population density of 41-400 people per square Farm model with a value of 5. The poorest soils,
mile) was allocated the most important value of assigned a model value of 1, are on the steepest
5, as this would indicate areas that are under the slopes (greater than 30%), are very shallow, are
most pressure to be lost to development based on very well drained and are often acidic.
population density. Values of 4 and 3 are classified
rural and wildland, respectively. These areas are Land Value per Acre
also susceptible to development but are further
from the densely populated urban and suburban Land value per acre is based on the tax value of
classes and as such are less likely to be developed land parcels in each county. Cadastral data from
than WUI/Mixed category. each county was combined into a single dataset
and polygon features were converted to point
Agricultural Soils Potential features with the full parcel attribution attached.
Large parcels were given multiple points with the
Data for TVAL-Farm’s Agricultural Soils Potential same attribution as to cover those parcels with an
layer consisted of two parts: each county’s digital even distribution of values (approximately one
soils data published by Natural Resources and Con- point for every 100 acres). From this feature point
servation Service (USDA NRCS, 2009) and slope dataset, a raster surface was generated based on
data. The model’s classification of agricultural an interpolation of land values using a nearest-
soils potential is based on the NRCS definition neighbor algorithm. The least expensive land (less
of prime agricultural soils. Prime agricultural soil than 10,000 dollars per acre) was assigned the
is comprised of the best combination of physical, highest value of 5; because it is most affordable,
chemical and organic characteristics that is avail- the land is the most vulnerable to development
able for agricultural uses. Prime agricultural soils while also being the most cost-effective to retain
are not excessively erodible or saturated with in agriculture. The most expensive land (greater
water for long periods and infrequently flood or than 250,000 dollars per acre) was assigned the

533
TVAL-Farm

lowest value of 1 because it is less cost-effective present here two variations derived from survey
for agricultural use and is less vulnerable to de- data for illustration purposes: the rank importance
velopment. and choice models.

The Enhancement Layers: Scenic Rank Importance Model


Quality and Cultural Heritage
The Rank Importance Model weightings were
To gather data on scenic quality and cultural heri- derived directly from responses to a question on
tage value, The Farmland Values Project conducted the Farmland Values Project survey. The question
a community mapping activity in 2008 with 33 (Farmland Values Project, n.d.) asked respondents
residents of the four-county study region. Com- to rank a set of statements in order of importance;
munity members were recruited to participate via the community preferences identified in this
the Farmland Values Project survey and flyers, question were used to determine the weighting
radio and print media invitations. Participants scheme for the Rank Importance Model. Partici-
were asked to identify and locate the places that pants identified local food as the most important
were important to them using Google Earth ™, benefit provided by farmland followed by scenic
and assign a scenic and cultural value rating to value and cultural value; these preferences led to
each place they identified. Participants identified heavier weights for these layers in the model. Since
236 places of interest in the study region. Ratings we didn’t have data on local food production, we
ranged from 1 through 5 based on each individual’s used Agricultural Soils Potential, an indicator of
perception of value for that specific location. These soil productivity and an essential factor in food
values were converted to raster format using the production, as a proxy for local food produc-
nearest-neighbor method, resulting in a “scenic tion. Because local food was ranked highest by
value surface” and a “cultural value surface,” respondents, in this model the Agricultural Soils
where each cell in the interpolated grid was as- Potential layer was weighted heaviest (37%). The
signed a value from 1 to 5. cultural and scenic layers were assigned lower but
equal weights (18% each). The remaining layers
Modeling and Output (Land Value, Land Cover Type, Population) were
weighted the least in this model (9% each). The
Modeling and the Weighted model influence of each layer is summarized in
Overlay Technique Table 2; results appear in Figure 3.

Weighted overlay is a technique for applying a


common scale of values to diverse and dissimilar Table 2. Percent influence of input layers in rank
input to create an integrated analysis. The TVAL- importance and choice models
Farm model classifies 6 different raster layers on
a standardized 1-5 scale: land value in dollars, Rank Impor- Choice
Input Layer
tance Model Model
agricultural soils potential, population per square
Land Cover 9% 11%
mile, land cover type, and scenic and cultural pref-
Population 9% 11%
erences. Each input raster is weighted based on its
Agricultural Soils Potential 37% 23%
relative influence and layers are added together
Land Value 9% 11%
to result in a model output that varies according
Scenic Value 18% 22%
to the weighting scheme applied. While many
Cultural Value 18% 22%
different weighting schemes can be applied, we

534
TVAL-Farm

Figure 3. Rank importance results

Choice Model contribution required for that scenario (Farmland


Values Project, n.d.).
Choice models are used to learn how consum- One of the strengths of the choice model is
ers make decisions and to estimate nonmarket that it enables implicit prices to be estimated for
environmental benefits or costs. We used a attributes, which in our case allows us to weight
choice model in the Farmland Values Project to the importance or influence of the various raster
determine respondent preferences for farmland in layers in TVAL-Farm Choice Model. The Choice
western North Carolina. Specifically, we asked Model weightings were estimated using data from
respondents to choose between different scenarios the choice model responses to the Farmland Values
representing the following bundles of character- Project survey. The Choice Model results indi-
istics: the likelihood of farmland conversion, the cated that scenic quality, cultural heritage, and the
scenic quality of the rural landscape, the presence value of access to local food were approximately
of indicators of agricultural heritage are present, equally valued by respondents (Mathews, 2009).
the extent to which the land contributes to local Thus in the TVAL-Farm Model, these layers are
food production, the amount of natural services weighted equally (Table 2). Once again, we used
that are provided by farmland, and the financial the Agricultural Soils Potential layer as a proxy for

535
TVAL-Farm

Figure 4. Choice model results

local food production. To complete the modeling SUMMARY


scenario the remaining three layers (Land Value,
Land Cover Type, Population) were weighted The Farmland Values Project developed the
equally but at only half the value of the other TVAL-Farm Model to identify and quantify the
three. Table 2 summarizes each layer’s model values that rural residents place on farmland. Our
influence; results appear in Figure 4. methodology asked local residents to define and
Comparison of the Rank Importance and rank important attributes of farmland that are
Choice Model results yield some overlap of lands important in their communities. Two of these at-
valued the same in each model. Figure 5 illustrates tributes, scenic quality and cultural heritage, are
all lands from both the Rank Importance and characteristics that are not typically included in
Choice Model results with the highest model land valuations. The TVAL-Farm Model results
value of 5. This indicates the clusters of areas reflect local community preferences in a way that
where land is considered the most valuable the traditional LESA does not.
within our modeled parameters. A related innovative feature of the spatial
database is its potential to identify clusters of
tracts that are most appropriate for preservation.

536
TVAL-Farm

Figure 5. Rank importance and choice models of concentrated benefits

Research by Brabec and Smith (2002) suggests Quantifying data that is ordinarily qualita-
that while the number of preserved agricultural tive, such as scenic quality and cultural heritage,
acres is important, whether the preserved land re- is challenging for community mapping activity
mains in active farming is also dependent on other respondents. They aren’t acclimated to thinking
factors including the adjacency and contiguity of about scenic quality or cultural heritage quanti-
protected parcels. Being able to readily identify tatively. They may interpret qualitative attributes
currently preserved parcels in a spatial database differently from researchers, and from other
which also incorporates information about the respondents, which is a limitation of the model.
other benefits of the tract will be useful to policy Additionally, the limited sample size represents
makers, land trusts, and individual farmers. The a small population and may not accurately reflect
high and rising cost of farmland protection efforts the views of the residents within the study region.
increases the need for efficient targeting of scarce Increasing the sample size could alter the surfaces
preservation dollars. Our spatial database can be used in this research and the model outcomes.
used to identify the most valuable farmland tracts Regardless of sample size, the surfaces generated
to preserve in order to maximize the efficiency from our community mapping activity participants
of farmland preservation expenditures by govern- represent the preferences of this select group.
ment and nonprofit groups. Accordingly, as sample size increases, individual

537
TVAL-Farm

responses are often generalized or lessened in the Kask, S. B., Mathews, L. G., Stewart, S., & Ro-
generated data surface. tegard, L. (2002). Blue Ridge Parkway Scenic
It is important to incorporate qualitative data Experience Project Final Report. Report submit-
sources in available quantitative tools like GIS ted in fulfillment of obligations for Cooperative
in order to better understand the areas we study; Agreement #CA5143990137, National Park
TVAL-Farm provides a method for doing so. The Service. Retrieved August 26, 2009, from http://
result is a GIS mapping tool that will help our www.nps.gov/blri
communities better understand the relationship
Mathews, L. G., Stewart, S., & Kask, S. B. (2003).
between people, values, threats, opportunities
Blue Ridge Parkway Scenic Experience Project
and farmland in our region.
Phase 2 Final Report. Retrieved August 26, 2009,
from http://www.nps.gov/blri
REFERENCES North Carolina Department of Agriculture and
Consumer Services. (2004). North Carolina agri-
Brabec, E., & Smith, C. (2002). Agricultural cultural statistics. Retrieved March 14, 2005, from
land fragmentation: The spatial effects of three http://www.ncagr.com/stats/general/general1.htm
land protection strategies in the eastern United
States. Landscape and Urban Planning, 58(2-4), Oak Ridge National Laboratories (ORNL). (2003).
255–268. doi:10.1016/S0169-2046(01)00225-0 LandScan (2003)TM High Resolution Global Popu-
lation Data Set. Retrieved April 10, 2005, from
Brothers, G., & Chen, R. J. C. (1997). 1995-96 http://www.ornl.gov/sci/landscan/index.shtml
Economic impact of travel to the Blue Ridge
Parkway: Virginia and North Carolina. Asheville, Pease, J. R., & Coughlin, R. E. (1996). Land
NC and Roanoke. VA: The Coalition for the Blue evaluation and site assessment: A guidebook for
Ridge Parkway and the National Park Service. rating agricultural lands, second edition. Ankeny,
IO: Soil and Water Conservation Society.
Environmental Systems Research Institute (ESRI).
(2009). Online documentation. Retrieved No- Pease, J. R., Coughlin, R. E., Steiner, F. R., Suss-
vember 10, 2010, from http://webhelp.esri.com/ man, A. P., Papazian, L., Pressley, J. A., & Leach,
arcgisdesktop/9.2 J. C. (1994). State and local LESA systems: Status
and evaluation. In F. R. Steiner, J. R. Pease, &
Farmland Values Project. (n.d.). Final report. Re- R. E. Coughlin (Eds.), A decade with LESA: The
trieved from http://www2.unca.edu/farmlandval- evolution of land evaluation and site assessment
ues/pdfs/Farmland_Values_Project_Quick_Facts. (pp. 56-75). Ankeny, IO: Soil and Water Conser-
pdf vation Society.
Farmland Values Project. (n.d.). Results. Retrieved Soil and Water Conservation Society. (2003, June
from http://www.unca.edu/farmlandvalues/re- 3-4). Enhancing LESA: Ideas for improving the
sults.htm use and capabilities of the land evaluation and
Homer, C., Huang, C., Yang, L., Wylie, B., & site assessment system. Report from the Farmland
Coan, M. (2004). Development of a 2001 National Information Center Workshop, Nebraska City, NE.
Landcover Database for the United States. Pho- United States Census Bureau. (2004). State and
togrammetric Engineering and Remote Sensing, county quick facts. Retrieved March 14, 2005,
70(7), 829–840. from http://quickfacts.census.gov/qfd/maps/
north_carolina_map.html

538
TVAL-Farm

United States Department of Agricultural Natural ENDNOTES


Resources Conservation Service (NRCS). (2009).
Web soil survey. Retrieved November 10, 2010,
1
The Farmland Values Project was designed to
from http://websoilsurvey.nrcs.usda.gov inventory, analyze, and display the multiple
nonmarket values associated with farmland
United States Department of Agriculture Na- in four counties in Western North Carolina.
tional Agricultural Statistics Service. (2009). 2007 The Farmland Values Project was supported
census of agriculture county profile. Retrieved by the National Research Initiative of the
August 26, 2009, from http://www.agcensus. U.S. Department of Agriculture’s Coopera-
usda.gov/Publications/2007/Online_Highlights/ tive State Research, Education, and Exten-
County_Profiles/North_Carolina/index.asp sion Service, Grant # 2005-35618-15647.
United States Department of Agriculture Natural This manuscript is adapted from the Farm-
Resources Conservation Service. (2000). 1997 land Values Project final report. Complete
national resources inventory (Revised December information about the project, results, data
2000). Retrieved March 14, 2005, from http:// and maps are available at http://www.unca.
www.nrcs.usda.gov/technical/NRI edu/farmlandvalues.

This work was previously published in the International Journal of Applied Geospatial Research (IJAGR), Volume 3, Issue 3,
edited by Donald Patrick Albert, pp. 58-69, copyright 2012 by IGI Publishing (an imprint of IGI Global).

539
540

Chapter 33
OpenStreetMap
Kevin Curran
University of Ulster, UK

John Crumlish
University of Ulster, UK

Gavin Fisher
University of Ulster, UK

ABSTRACT
OpenStreetMap is a collaborative web-mapping project that collects geospatial data to create and distrib-
ute online maps, freely available to anyone with an Internet connection. Once accessed, OpenStreetMap
allows Internet users to contribute and edit geospatial data, effectively making it the mapping equivalent
of Wikipedia. OpenStreetMap is maintained by volunteer cartographers from around the world who
use GPS devices, portable cameras, and laptops for field mapping. Collected data are complemented
with digitised open source aerial photography and free maps from the governmental and commercial
sources. This report provides a summary of OpenStreetMap as a remarkable example of participatory
geographic information systems (GIS).

INTRODUCTION attention to the locations of particular interest to


him, and including much more detail than any
OpenStreetMap (OSM) is a collaborative web available online map service was providing at that
mapping project—an example of participatory time. He correctly envisioned that making such
GIS—which makes free crowd-sourced geospa- data augmentable, editable and freely available
tial data available to anyone with access to the online, where anyone could access or contribute
Internet. OpenStreetMap was started in 2004 by information, might create a snowball effect, which
Steve Coast, a programmer from the United King- would eventually lead to the development of,
dom residing in London. Using his laptop and a in his own words, a “jigsaw map of the world”
global positioning system (GPS) receiver, Steve (Musgrove, 2010).
Coast began mapping his neighborhood, paying Since its inception, OpenStreetMap has at-
tracted countless Internet users throughout the
DOI: 10.4018/978-1-4666-2038-4.ch033 world who have contributed to the project in differ-

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
OpenStreetMap

ent ways. At the time of the first State of the Map maps of Haiti contained little more than a few
(SoTM) conference, set up by OpenStreetMap highways and roads, with Port-au-Prince being
Foundation in July 2007 in Manchester, there were signified by a shaded outline. Haiti is one of the
only 9,000 registered users. Starting from 2007, poorest countries in the Western Hemisphere, and
Yahoo! allowed OSM to tap into its vertical aerial as very few of the population could afford GPS,
photography database, which greatly facilitated the there was no point for the commercial mapping
progress of the OSM project (in 2010 Microsoft’s services in creating accurate digital maps. The
Bing Maps also released its aerial imagery for the lack of an accessible online cartographic reference
OSM). By August 2008, the number of registered had serious impeding implications for the relief
users rose to 50,000; by March 2009, the number efforts once the catastrophic earthquake killed
rose to 100,000 members; and by the beginning hundreds of thousands of people, and made close
of 2010 the project had 200,000 members. In to one million homeless. International rescue and
November 2011 the total number of volunteers aid workers who were trying to provide much
reached 500,000. needed relief encountered many problems iden-
Among other milestones, one should mention: tifying what roads and routes were available for
(a) valuable functionality was added in January the rescue and delivery of the necessary supplies.
2008 making OSM available for cyclists who According to the website CrisisCommons.org,
started downloading map data to use it on trips; there were a number of stages in the develop-
(b) funding of 2.4 million euro became avail- ment of the inclusive Haiti maps to allow the aid
able from CloudMade, a company founded by workers to get help to the victims. Firstly, there
Steve Coast, that uses OpenStreetMap data for was an immediate search for all available period
providing commercial geospacial services; (c) and contemporary maps. These included various
in September 2009 Flickr announced that it be- historic maps and even maps unclassified by
gan supporting OpenStreetMap, which allowed the CIA. Due to critical circumstances, the New
geotagged photos of landmarks to be uploaded York Public Library, which has a specialized Map
to the OpenStreetMap server (Lardinois, 2009). Rectifier facility, provided volunteer cartographers
Currently, netizens around the world utilize and with NYPL Map Warper, a software application
update data available at the project’s website that allowed for digitizing of the historic maps of
OpenStreetMap.org. Citizen cartographers range Haiti and aligning them with contemporary maps.
from volunteers providing humanitarian relief At this point, OpenStreetMap volunteers began to
work in disaster regions—where up-to-date maps get deeply involved with the project. Before the
are essential and could be a matter of life and earthquake of 2010, they had already collected
death for the victims—to residents of cities, small some mapping data of the disaster areas, which
towns and villages; who simply want to map their were used in conjunction with historic maps.
neighborhoods or highlight certain landmarks that Starting from this point, the OpenStreetMap
might be of interest to others. At the time of this community began collecting and editing the field
writing, OpenStreetMap community continues to data and mapping collapsed bridges and impass-
grow with contributors across the globe submit- able roads, locations of hospitals and refugee tent
ting novel data or editing existing information. camps, as well as accurately marking the posi-
Probably the most significant use of Open- tion of damaged buildings where victims could
StreetMap has come in the aftermath of the massive remained trapped. The next vital step involved the
earthquake that struck Haiti on January 12, 2010 United Nations releasing satellite imagery of the
with the epicenter located 25 km to the west of the region for public use, a move that was followed
country’s capital Port-au-Prince. Pre-earthquake by commercial satellite overhead imagery provid-

541
OpenStreetMap

ers such as GeoEye and Digital Globe. All these OPENSTREETMAP


combined efforts helped to collect a vast base of PRODUCTION TECHNIQUES
cartographic knowledge, which has permitted the
OpenStreetMap community to build what has The data collection for OpenStreetMap is increas-
become the most comprehensive map of Haiti. ing rapidly. Basically, participatory digital map
The Haiti mapping effort may be the most production techniques stem from on-the-ground
significant example of how OpenStreetMap has GPS or paper surveys, intimate knowledge of a
had a major social impact, but there are many certain geographical community plus available
other examples of OSM performing valuable aerial imagery. Volunteers who travel by foot,
social service for society. A large part of the suc- bicycle, car or boat study the landscape using
cess of OpenStreetMap was due to its diverse handheld GPS, a paper log, or voice recorder, and
user base around the world. The OpenStreetMap create the initial OSM data file. The project can
official blog, user diaries and OSM forum threads greatly benefit from access to aerial photography
contain references to a number of local or even datasets from government agencies or commercial
global mapping events such, as Night of the Living services, and volunteer cartographers in different
Maps, a global OSM map editing marathon. The countries work on securing them. Technical teams
OpenStreetMap Wiki also highlights similar and are set up to manage the conversion and import
smaller events. These events could be anything the data when large datasets are available to use in
from a casual get-together for novice users to the OSM. In some cases, GPS would not be used
explore neighborhoods using on-the-ground GPS if satellite imagery had been previously uploaded
surveys or walking papers, with collected data to the OSM server. After the upload of data a map
to be later imported and processed in the OSM editor is used to edit the data.
editing environment; to mapping parties, where Another important form of contribution to the
cartography enthusiasts meet with the aim of OpenStreetMap project is based on an editor’s own
inclusive mapping of a particular area; or events local knowledge, which allows features such as
such as armchair mapping—editing cartographic schools, libraries, various community places and
information collected in the field, working with landmarks, roundabouts and busy roads bypasses
available sets of data or digitizing aerial photog- to be placed on the map. A number of enthusiasts
raphy. While armchair mapping provides only a who are extremely committed to the project will
virtual camaraderie, the mapping parties are made methodically map whole towns or rural districts
up of a number of people who then usually split over a period of time by systematically organising
into groups of two. Using GPS units, volunteers mapping parties to intensively map a particular
typically start with plotting the locations of ob- area in the evenings or over weekends. A score of
scure roads and footpaths, local landmarks, etc. other volunteers correct errors and add features
Coordinates can be recorded in handwritten form by performing a large number of smaller edits.
by novice users, or saved on GPS receivers by This is done via OpenStreetMaps built-in and
the more experienced. These coordinates are then stand-alone map editors.
brought to a person experienced with the OSM Several governments, including the United
editing environment such as JOSM, who uploads States, have released official cartographic data
the information into existing OpenStreetMap to OpenStreetMap, including Landsat 7 Satellite
dataset. The main objective of these and similar Imagery, Prototype Global Shorelines and Tiger
volunteer efforts is to chart exhaustively by adding Data. Other countries have also made available
layer after layer of detail to a map of a designated the use of detailed local aerial photography
area, and to keep it continuously up-to-date. through the Open Aerial Map initiative. Some

542
OpenStreetMap

commercial companies have also donated data to Map OS Matic, which can generate city maps
the OpenStreetMap project on suitable licences. by using OSM data with the map split in squares
For instance, the Automotive Navigation Data, a allowing to easily look for streets and an index of
well-known provider of the digital maps for GPS- the streets with references to the squares on the
based navigation devices, donated a complete map. Other examples of such software include
road dataset for the Netherlands and partial road Mapnik, Osmarender and Cartagen.
datasets for China and India. The use of Yahoo!’s Kosmos, which is currently being replaced
aerial photography and Bing Maps is an example by Maperitive, is an example of a lightweight
of a partnership with commercial map providers. OpenStreetMap rendering platform that provides
There is a long list of software that can be used many tasks, including the ability to render Open-
in the creation of online editable maps such as StreetMaps interactively, print the maps, set up a
OSM. The main purposes are map rendering and local tile map server, create separate map render-
representation, map data collection, and manage- ing rules or share rules stored in OpenStreetMap
ment. Map editing software used in the production Wiki pages, view maps on a Java enabled mobile
and presentation of OpenStreetMap came from phone, and to perform tasks from the command
different sources, each with its own licensing. Map line (Figure 1).
rendering and presentation software comes in the Other map rendering and presentation software
form of interactive online web-based software, applications created for use on mobile phones and
desktop software, and mobile phone applica- GPS devices include AndNav2 and Navit for
tions. OpenLayers is an open source JavaScript Android smartphones; OpenTouchMap, which is
library for displaying map data in web browsers. a is a slippy map optimised for touch screen de-
It provides an API for building rich web-based vices such as iPhone; and finally, GpsMid which
geographic applications similar to Google Maps is a Java ME application that is a free, fully of-
and Bing Maps (Surrey councils open access, fline, vector-based map application for mobile
2010). Slippy map interface is used to display phones. It displays one’s current position on a
map tiles rendering by other software such as

Figure 1. Kosmos OpenStreetMap rendering platform

543
OpenStreetMap

zoomable map and can be used to search for and and much more. It is an essential application for
navigate to roads or points of interest. users of OpenStreetMap (as shown in Figure 3).
A prime example of mapping and data col- Other map editing applications available to
lection software that is most commonly used users of OpenStreetMap include Potlatch, an
for OpenStreetMaps is GPSBabel, which many online flash editor; Merkaartor, a desktop editor
contributors use to convert GPS track data from similar to JOSM; OSM2Go, a mobile/desktop
various proprietary formats to the GPX format editor; and Quantum GIS, another desktop editor
that OpenStreetMap requires. GPSBabel is a which includes an OpenStreetMap plug-in. The
cross platform, free software to transfer routes, OpenStreetMap interface is easy to navigate and
tracks and waypoint data between GPS units and it is visually appealing. It includes a built-in search
mapping programs. It can convert over a hundred to find places. Overall, OpenStreetMap offers
different types of GPS data formats. GPSBabel many functions and features allowing users to
grants users the ability to freely move their own edit maps, review recent edits, export map data,
waypoint data between the programs and the view and upload GPS data, and make comments
hardware they choose to use. and suggestions.
In OSM there is no option of categorising
what is uploaded (e.g., whether it is a road, or
a river) therefore, the data needs to be edited. INTERACTING WITH
The volunteers perform this task with the use of OPENSTREETMAP
map editing software. The most commonly used
editing application for OpenStreetMap is JOSM, There are various methods to augment and edit the
the original Java OpenStreetMap desktop editing data in OpenStreetMap. Each individual method
software application (Figure 2). JOSM is a feature- is geared toward the individual technical skills
rich editor with an interface that can seem complex of users, and also depends on how much time
at first. It also requires certain installation and and effort the user would like to contribute. A
configuration efforts to get some plug-in features range of online and standalone editing software
when working particularly with Yahoo! imagery. applications support the OSM project, with the
The user can utilize JOSM software to edit most frequently used including Potlatch, JOSM
place names, build locations, add photo imagery, and Merkaator.

Figure 2. JOSM using Yahoo WMS plug-in

544
OpenStreetMap

Figure 3. JOSM displaying OpenStreetMap data and GPS track logs

Potlatch The user simply has to identify the road, etc,


on the Yahoo! or Bing overhead image. Single
Potlatch 2 is the OSM default online editor, which clicking at the beginning of a road will produce
opens when an Internet user goes to the edit tab on a node. The difference from the “point of inter-
the OSM interface—utilizing any major browser est” entry is that once the user moves the mouse
and requiring only a Flash plug-in. It allows the pointer, the movement is traced with waypoints.
user, for example, to work with the satellite imag- Another single click will produce another node,
ery provided by Yahoo! and Bing for the purposes and the tracing will continue from this point, al-
of creating a map of a neighborhood. By double- lowing the user to negotiate the tracing of bends
clicking on the image, the user creates what is in a motorway, road, track etc. Once the end of a
called a node. A node can be used to highlight particular road has been reached, a double click
any single point of interest on the map, such as will enter a final node that will signify the end of
a building, and can be used in conjunction with that particular entry. The user is then free to enter
other nodes. To complete the map, the nodes are the necessary details, such as what type of road
connected using waypoints. In the latest version it is, motorway, connecting road, cycle path, or
of OpenStreetMap, many public amenities such footpath, and any names that the entry may have,
as telephone boxes, bus stops, etc, have their own such as M1, Dunfield Terrace, etc.
highlighting icons, which the user simply needs to Once all the information that the user wants
drag from the toolbar and drop onto the required to include has been added, they are provided with
place on the map to mark the location. If a par- the option to save the changes that they have
ticular point of interest does not already have a made for editing later, or to save the changes
designated icon, the user can simply double click and upload them to the OpenStreetMap server.
the correct location on the image, creating a node. After the changes have been uploaded to the
The user can then manually add the description OpenStreetMap server, other members of the
of that particular point of interest. OpenStreetMap community can then review and
Roads, tracks and pathways can be added to edit them if necessary.
OpenStreetMap in a very similar way to places.

545
OpenStreetMap

Java OpenStreetMap that is added is actually a series of coordinates.


These coordinates can then be traced over by the
Java OpenStreetMap (JOSM) is an individual user, creating nodes. These nodes can then be used
stand-alone software application that allows to signify an individual object, such as a street
OpenStreetMap users to render GPS data. This postbox, or as part of a series of nodes, or way-
information can then be uploaded onto the Open- points making up a road or a track, etc. Photos or
StreetMap server. The user can either download images such as a statue, street sign, building, etc.,
existing data associated with the area on the map can also be uploaded, giving the user additional
that they would like to edit or augment, or they information on a particular location. Various types
can edit and upload their own raw GPS data to of information may be included about each of
JOSM. This would most likely be decided when the nodes and waypoints. For example, if a node
establishing exactly how much data already is signifies a street postbox, information should be
available for a particular area. For example, the entered to detail that it represents an amenity, and
accessible data, such as satellite imagery and exist- that it is indeed a postbox. If a series of nodes and
ing OpenStreetMap entries, for major cities and waypoints signifies the location of a road, then the
their surrounding areas is usually quite detailed. user should enter details of the type of road it is,
With this in mind, it may be helpful for the user e.g., motorway, residential, cycle path, footpath
to reference the existing data when making their etc., the name if applicable, and whether it is one
own entries, as it will enable them to establish way or not. JOSM uses a uniform tagging system,
how accurate their GPS readings are—if need which encourages users to tag all features on a
be—as well as establish whether the data they map in a consistent way.
have compiled about particular items (streets,
landmarks, points of interest, etc.) already exists Merkaator
within the server. The user may find that the cur-
rent data contained within the server may need to Merkaator, like JOSM, is a stand-alone map editing
be updated. On the flip side, as some urban areas software application that allows users to access,
in the United Kingdom are having quite detailed add and edit the datasets stored on the OpenStreet-
cartographic information available, some others, Map server. Once the data has been retrieved,
for example, Derry City in the Northern Ireland, Merkaators’ mapping functions are very similar
has very little. The satellite imagery available to that of JOSM, in that it deals with nodes which
for the Derry City is poor and the information are used individually to signify points of interest,
exhibited on the OpenStreetMap is quite minimal or as a series to create waypoints signifying roads
for anywhere other than the city center. With this and paths etc., which can in turn be linked to Ya-
in mind, users wanting to update data associated hoo! or Bing satellite imagery. It has an intuitive
with an undermapped urban area such as Derry user interface, providing a transparent display of
City may need to collect and upload the raw data map features. However, Merkaator places more
from their GPS to JOSM. emphasis on the aesthetics of mapping, since it
When using JOSM, the user will initially gather allows mapping more precisely curved roads. On
as much GPS data as possible using a GPS unit. the other side, the OpenStreetMap Wiki warns that
This information is then imported into JOSM as a this application, still being in the phase of devel-
GPX file. The coordinates of the GPX file can then opment, is “very memory hungry and can freeze
be read by JOSM, and this in turn allows the user the machine while page-swapping” (“Comparison
to retrieve any existing data on that particular area of editors,” 2012).
from the OpenStreetMap server. The information

546
OpenStreetMap

DISCUSSION AND CONCLUSION major advantage that OpenStreetMap would have


over any Ordnance Survey maps would be that it
Given that a lot of mapping information with a has the potential to be updated in real time. On the
high level of detail is now available online via other side, under the direct influence of the OSM,
MapQuest, Google, Yahoo! or Bing maps, it is Ordnance Survey has started to provide more free
worth asking the question, what benefit does services, such as OS OpenSpace, which allows
access to free, editable maps provide to regular embedding maps of Great Britain into web pages.
Internet users? It is important to recognize that the The practices of commercial online map and
OpenStreetMap project has had major influence image providers have also been influenced by the
on both the attitudes of citizens and governments OpenStreetMap project. As mentioned, first Yahoo!
towards the issue of cartographic data availabil- and then Bing Maps permitted volunteer mappers
ity and accessibility. For example, over the past to use their proprietary aerial photography as a
years, The Guardian has spearheaded a campaign backdrop. Yahoo! Flickr also allows tagging photo
in the UK to make cartographic data collected by images with OSM tags to be easily searched and
government-funded organizations freely available transferred on OSM maps if needed. At the 4th
to the public. The argument is that, if taxpayers’ annual international State of the Map 2010 confer-
money paid for the data to be collected, why ence, MapQuest, owned by AOL, announced that
should taxpayers have to pay any more to access it decided to become “the first major mapping site
the data? This in turn has resulted in Ordnance to embrace and encourage open source mapping
Survey, the UK national mapping agency, making at scale” (MapQuest Opens Up, 2010). As part of
maps in several formats available online for free this initiative, MapQuest founded an open source
downloads from April 2010 (Sung, 2010). The project – open.mapquest.co.uk mapping site – based
Independent newspaper (OS Maps, 2010) wrote on the OpenStreetMap data. Additionally, in 2008,
that Sir Tim Berners-Lee, the founder of the World Google started its own OSM-type collaborative
Wide Web, advised Prime Minister Gordon Brown project called Google Map Maker. According to
on allowing publicly funded map data to be freely the Times newspaper (Blakely, 2008), this service
available in an effort to boost entrepreneurship and is aimed mainly at amateur cartographers based
kick-start cutting-edge technology enterprises in in “hyper-growth countries” like India, where the
the United Kingdom. According to Dr. Christopher infrastructure has a tendency to change at an alarm-
Board, chairman of the Charles Close Society for ing rate in comparison to more developed countries.
the Study of Ordnance Survey Maps, having free Google Map Maker creator Lalitesh Katragadda felt
Ordnance Survey maps at the user’s disposal is that Google’s “enormous online reach” would be
very important. Board claims that if it was left the deciding factor on its popularity in comparison
up to the providers to map what they felt was to similar non-commercial applications such as
important, then the “most popular tourist areas OpenStreetMap. Four years later, Google Map
would be mapped regularly and kept up to date, Maker is available in 188 countries and its com-
but there would be huge areas of agricultural land, munity has reached 25,000 members. Google Map
moor or croft that would be left untouched” (OS Maker invites Internet users to correct and update
Maps, 2010). Ordnance Survey maps can only be Google Maps and provides step-by-step instructions
implemented as a point of reference for the user, and technical support. However, as appealing and
however if any drastic changes may occur within functional as Google’s mapping applications may
the landscape of the area mapped, this effectively look, one should remember that they serve as a
renders the map out of date and it is up to the Ord- means of generating corporate revenue—not for the
nance Survey cartographers to provide updates. A altruistic purposes of mapping the world. On the

547
OpenStreetMap

other hand, the OpenStreetMap project allows for Comparison of editors. (2012, January 3). Open-
uninterrupted programming access to its datasets StreetMap Wiki. Retrieved from http://wiki.
and provides a foundation for similar crowdsourced openstreetmap.org/wiki/Comparison_of_editors
initiatives such as Information Freeway, Open-
Google experiments with hotel prices on Google
CycleMap, OpenBusMap, OpenRouteService,
maps. (2010, March 24). The Independent. Re-
OpenOrienteeringMap, and other projects.
trieved from: http://www.independent.co.uk/
The OpenStreetMap initiative is growing
travel/news-and-advice/google-experiments-
and expanding with new volunteers joining the
with-hotel-prices-on-google-maps-1927650.html
effort to build and maintain a free-to-use, com-
prehensive map of the world. With continuing Haiti OpenStreetMap. (2010). CrisisCommons.
advances in GIS technology and new approaches Retrieved from http://www.crisiscommons.org/
to gathering and editing digital cartographic data, Haiti-Open-Street-Map
the seemingly utopian project steadily moves
Lardinois, F. (2009, September 28). Flickr now
towards fruition as time passes. New features
supports OpenStreetMap tags. ReadWriteWeb.
for OpenStreetMap are being added, more areas
Retrieved from http://www.readwriteweb.com/ar-
in developed countries are getting covered, and
chives/flickr_now_supports_openstreetmap_tags.
more and more developing countries are getting
php
involved with the OpenStreetMap project. Citizen
cartographers are enthusiastically exploring un- Mapping party. (2009, May 8). Weait.com. Re-
orthodox ways of using OSM, from highlighting trieved from http://weait.com/content/mapping-
the location of a communal event, to keeping a party
crisis map up-to-date during a humanitarian or
MapQuest Opens Up. (2010, July 9). Map-
relief mission. Haiti’s catastrophic earthquake in
quest. Retrieved from http://blog.mapquest.
2010 has shown how powerful participatory and
com/2010/07/09/mapquest-opens-up/
activist cartography can be when properly chan-
neled and implemented. A major contribution to Musgrove, M. (2010, January 31). Citizen car-
the success of OpenStreetMap work during the tographers’ map the microcosms of the world.
Haiti crisis is that OSM gave remote volunteer The Washington Post. Retrieved from http://
cartographers and onsite relief workers the ability www.washingtonpost.com/wp-dyn/content/ar-
to work together and update geographical data on ticle/2010/01/30/AR2010013000033.html
the OSM in real time. Despite the fact that much
OpenStreetMap. (2012). Retrieved from http://
of the work is still needed to complete the project,
www.openstreetmap.org
OpenStreetMap already serves as a remarkable
example of public participation GIS, and one of OpenStreetMap Wiki. (2012). Retrieved
the largest collective mapping endeavours in the from http://wiki.openstreetmap.org/w/index.
history of human civilisation. php?title=Main_Page&oldid=726612
OS maps: No.1 in the charts since 1747. (2010,
REFERENCES April 2). The Independent. Retrieved from http://
www.independent.co.uk/news/uk/this-britain/
Blakely, R. (2008, October 4). Google enlists pub- os-maps-no1-in-the-charts-since-1747-1934059.
lic to map fast-changing world. The Times Online. html
Retrieved from http://technology.timesonline.
co.uk/tol/news/tech_and_web/article4878039.ece

548
OpenStreetMap

Sung, D. (2010, April 1). Ordnance Survey fi- Surrey councils open access to aerial photography.
nally gives up the goods. Pocket-lint. Retrieved (2010, April 6). Guardian Professional. Retrieved
from http://www.pocket-lint.com/news/32400/ from http://www.guardian.co.uk/government-
ordnance-survey-free-up-data computing-network/2010/apr/06/surrey-heath-
aerial-photography-openstreetmap-gis-06apr10

This work was previously published in the International Journal of Interactive Communication Systems and Technologies (IJI-
CST), Volume 2, Issue 1, edited by Tatyana Dumova, pp. 69-78, copyright 2012 by IGI Publishing (an imprint of IGI Global).

549
550

Chapter 34
GeoCache:
A Cache for GML Geographical Data

Lionel Savary
PRiSM Laboratory, France

Georges Gardarin
PRiSM Laboratory, France

Karine Zeitouni
PRiSM Laboratory, France

ABSTRACT
GML is a promising model for integrating geodata within data warehouses. The resulting databases are
generally large and require spatial operators to be handled. Depending on the size of the target geo-
graphical data and the number and complexity of operators in a query, the processing time may quickly
become prohibitive. To optimize spatial queries over GML encoded data, this paper introduces a novel
cache-based architecture. A new cache replacement policy is then proposed. It takes into account the
containment properties of geographical data and predicates, and allows evicting the most irrelevant
values from the cache. Experiences with the GeoCache prototype show the effectiveness of the proposed
architecture with the associated replacement policy, compared to existing works.

INTRODUCTION 1999) are based on XML, which becomes a stan-


dard for exchanging data between heterogeneous
The increasing accumulation of geographical data sources. Proposed by OpenGIS (OpenGIS, 2003),
and the heterogeneity of Geographical Informa- GML is an XML encoding for the modeling,
tion Systems (GISs) make difficult efficient query transport, and storage of geographical informa-
processing in distributed GIS. Novel architectures tion including both the spatial and non-spatial
(Zhang, 2001) (Gupta, 1999) (Leclercq, 1999) fragments of geographical data (called features).
(Chen, 2000) (Paolucci, 2001) (Corocoles, 2003) As stressed in (Savary, 2003), we believe that
(Boucelma, 2002) (Stoimenov, 2000) (Voisard, GML is a promising model for geographical data
mediating and warehousing purpose.
DOI: 10.4018/978-1-4666-2038-4.ch034

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
GeoCache

By their nature, geographical data are large. proposed cache implementation and replacement
Thus GML documents are often of important size. policy. Finally, the conclusion summarizes our
The processing time of geographical queries over contributions and points out the main advantages
such documents in a data warehouse can become of the proposed GML cache-based architecture.
too large for several reasons:

1. The query evaluator needs to parse entire RELATED WORKS


documents to find and extract query relevant
data. Cache Replacement Policy
2. Spatial operators are not cost effective, es-
pecially if the query contains complex selec- In the literature, several approaches have been
tions and joins on large GML documents. proposed for cache replacement policy. The most
well-known is the Least Recently Used (LRU,
Moreover, computational costs of spatial op- Tannenbaum, 1992). This algorithm replaces the
erators are generally more expensive than those of document requested the least recently. Rather at
standard relational operators. Thus, geographical the opposite, the Least Frequently Used (LFU)
queries on GML documents raise the problem of algorithm evicts the document accessed the least
memory and CPU consumption. To solve this frequently. A lot of extensions or variations have
problem, we propose to exploit the specificities been proposed in the context of WWW proxy
of a semantic cache (Dar, 1996) with an optimized caching algorithms. We review some in the sequel.
data structure. The proposed structure aims at The LRU-Threshold (Chou, 1985) is a simple
considerably reducing memory space by avoid- extension of LRU in which documents larger
ing storing redundant values. Furthermore, a new than a given threshold size are never cached. The
cache replacement policy is proposed. It keeps in LRU-K (O’Neil, 1993) considers the time of the
cache the most relevant data for better efficiency. last K references to a page and uses such infor-
Related works generally focus on spatial data mation to make page-replacement decisions. The
stored in object-relational databases (Beckmann, page to be dropped is the one with a maximum
1990). The proposed cache organizations are backward K-distance for all pages in the buffer.
better suitable for tuple-oriented data structures The Log(size)+LRU (Abrams, 1995) evicts the
(Brinkhoff, 2002). Most cache replacement poli- document with the largest log(size), and apply
cies are based on Least Recently Used (LRU) and LRU in case of equality. The Size algorithm evicts
its variants. Other cache replacement policies the largest document. The Hybrid algorithm aims
proposed in the literature (Lorenzetti, 1996) (Cao, at reducing the total latency time by computing
1997) (Arlitt, 1999) deal with relational or XML a function that estimates the value of keeping a
databases, but have not yet investigated the area page in cache. This function takes into account
of XML spatial databases. the time to connect with a server, the network
The rest of the paper is organized as follows: bandwidth, the use frequency of the cache result,
Section 2 gives an overview of related works. Sec- and the size of the document. The document with
tion 3 presents our cache architecture adapted for the smallest function value is then evicted. The
GML geographical data. Section 4 discusses about Lowest Relative Value (LRV) algorithm includes
the inference rules of spatial operators and presents the cost and the size of a document in estimat-
an efficient replacement policy for geographi- ing the utility of keeping it in cache (Lorenzetti,
cal data considering inference between spatial 1996). LRV evicts the document with the lowest
operators. Section 5 shows some results of the utility value.

551
GeoCache

One of the most successful algorithms is the Let us recall that in GIS jargon, the MBR of
Greedy Dual-Size (GD-size) introduced by Cao an object is the minimum-bounding rectangle
and Irani (Cao, 1997). It takes into account the of this object. The area of a page of objects is
cost and the size of a new object. When a new the minimum rectangle including all MBRs of
object arrives, the algorithm increases the rank- that page. The margin is the border of an area.
ing of the new object by the cost of the removed Beckmann (1990) and Brinkhoff (2002) define
object. In the same spirit, the Greedy Dual-Size five spatial pages-replacement algorithms based
Frequency (GDSF) algorithm proposed by Arlitt on spatial criteria:
et al. (Arlitt, 1999) takes into account not only
the size and the cost, but also the frequency of 1. Maximizing the area of a page (A): a page
accesses to objects. As an enhancement of GDSF, with a large area should stay in the buffer
Yang et al. introduce the time factor (Yang, 2003). as long as possible. This result from the ob-
Combined to the Taylor series, it allows predicting servation that the larger is the area, the more
the time of the next access to an object. Thus, it frequently the page should be requested.
provides a more accurate prediction on future ac- 2. Maximizing the area of the entries of a page
cess trends when the access patterns vary greatly. (EA): instead of the area of a page, the sum
But the main bottleneck of this approach is the of the area of its entries (spatial objects) is
time consumption to recalculate the priority of maximized.
each object. 3. Maximizing the margin of a page (M): the
margin of a page p is defined as the margin
Spatial Cache Replacement Policy of the MBR containing all entries of p. The
larger a page margin is, the longer it will
Most proposed spatial cache replacement policies stay in the buffer.
are based on variants of LRU and are developed in 4. Maximizing the margin of the entries of a
the context of relational databases. In the area of page (EM): instead of the margin of a page p,
spatial database systems, the effect of other page- that of the composing MBRs are considered.
replacement strategies has not been investigated 5. Maximizing the overlaps between the entries
except in (Brinkhoff, 2002). of a page (EO): this algorithm tries to maxi-
Considering a page managed by a spatial data- mize the sum of the intersection areas of all
base system, one can distinguish three categories pairs of entries with overlapping MBRs.
of pages (Brinkhoff, 1993): directory pages (de-
scriptors), data pages (classical information), and As a synthesis, Brinkhoff (2002) proposes
object pages (storing the exact representation of a combination of LRU-based and spatial page-
spatial objects). Using the type-based LRU (LRU- replacement algorithms. To evict a document, a
T), first the object pages are evicted, followed by set of victim candidates is determined using the
the data pages, and finally by the directory pages. LRU strategy. Then, the page to be dropped out
Using primitive based LRU (LRU-P), pages are of the buffer is selected from the candidate set
removed from buffer according to their respective using a spatial page replacement algorithm. The
priorities. If a tree-based spatial access method page dropped by this selection is placed in an
is used, the highest priority is accorded from the overflow buffer, where a victim is evicted using
root to the index directory pages, followed by the the FIFO strategy. Depending on its spatial and
data pages, and finally the object pages. Thus, the LRU criteria, a requested page found in the over-
priority of a page depends on its height in the tree. flow buffer is moved to the standard part of the
buffer, influencing the size of the candidate set.

552
GeoCache

Buffer cache techniques are mainly used in spa- results, whereas the non-spatial region content
tial database systems in order to optimize queries contains non-spatial data of geographical query
response time. The work conducted by Brinkhoff results. These region contents are associated with
uses a spatial index for better management of the the geographical region descriptor. It contains
buffer cache. However, there is no spatial index information about each geographical query stored
for GML documents, as they are encoded in XML. in cache. The cache is then divided into two parts
Hence, the spatial criteria mentioned above could (see Figure 1): (i) the non-spatial part of the cache
not be applied. Other criteria must be considered to composed of the non-spatial region content and it
handle geographical queries. Moreover, semantic associated description contained in the geographi-
cache gives better performances than page or tuple cal region descriptor (section 3.1); (ii) the spatial
replacement strategies (Dar, 1996), but until now part of the cache composed of the spatial region
it has not been really studied for geographical contents and it associated description contained in
queries where data are stored in XML. the geographical region descriptor (section 3.2).

Non-Spatial Part of the Cache


CACHE STRUCTURE FOR
GEOGRAPHICAL QUERIES IN GML All data are encoded in XML. For simplicity and
standard enforcement, we encode XML data as
Generally spatial data consume a lot of memory DOM trees. More compact structures are possible,
space. Hence, caching spatial objects has a ten- but it would not change the relative results and
dency to flood the available space in cache. For would require specific developments. Thence,
example in a spatial join query, a spatial object A query results are stored in cache as DOM trees
can match with several objects B1, B2, etc. Thence, that compose the non-spatial region content (see
a same object A can be replicated many times in Figure 2). To separate non-spatial and spatial data,
spatial query results. This may considerably reduce a generated identifier replaces each GML sub-
the available space in cache, especially when a tree (XML fragment with “gml:” as namespace).
large amount of spatial fragments must be stored. The identifier value is used to uniquely retrieve
To avoid spatial object replication in cache, we the corresponding spatial fragment stored in the
propose a simple data structure which facilitates
object identification and isolation. This structure
is divided into two parts. The first is devoted to Figure 1. General cache structure
the non-spatial elements of the geographical data.
The second one contains non-redundant spatial
fragments of geographical data. i.e., only distinct
spatial objects are stored in cache. In semantic
cache, the semantic region is divided into two
parts (Chidlovskii, 1999): the region descriptor
describing each query result stored in cache, and
the region content where the data are stored. In
the case of geographical queries, we introduce two
kinds of region content: the non-spatial region
content, and the spatial region content.
The spatial region content contains non-
redundant spatial data of geographical query

553
GeoCache

Figure 2. Cache organization Figure 3. Spatial part of the cache

Cache Insertion Algorithm

When a geographical query result must be stored


spatial part of the cache structure. For non-spatial in the cache, the spatial and non-spatial fragments
data, the geographical region descriptor contains have to be identified and stored in the correspond-
classical information about each query (region ing part of the cache. The insertion algorithm is
formula, replacement values, and pointers) described in Figure 4. A key issue is the genera-
(Chidlovskii, 1999). tion of the GML fragment identifier. We simply
use the centroïde of the object defined as a point
Spatial Part of the Cache standing for the geometry center. More sophisti-
cated identifiers could be used, e.g., a numerical
The second part of the cache contains a specific encoding of the shapes to avoid confusion between
structure designed, from one hand, to store only geographical objects of same centroïde.
distinct spatial objects, and from the other hand, A new DOM tree is created (line 00) for the
to quickly retrieve spatial fragments. Each spatial non-spatial fragment. The XML query result is
fragment is stored as a DOM tree, since spatial read homogeneously fragment by fragment. If a
data are in GML. The spatial fragments compose non-spatial fragment is read (line 02), then it is
the spatial region content. Each spatial fragment placed in the non-spatial part of the cache (line
is referenced in a hash table. The identifier is 03). Else, if a spatial fragment is read (line 05),
used as a key of the hash table. It is composed of an identifier is automatically generated (line 06).
a counter value and a reference to the root tree If the identifier already exists in the hash table
of the spatial fragment (see Figure 3). Thus, each (line 07), the counter is incremented (line 08).
DOM tree corresponding to a spatial fragment Otherwise, a Dom tree for the spatial fragment is
can be quickly retrieved in the spatial part of the generated (line 11) and inserted into the hash
cache. The counter value indicates the number of table with the new identifier (line 12). The gener-
references to the spatial fragment from the non- ated identifier is added in the non-spatial DOM
spatial part of the cache. Thence, the entry can tree (line 14).
be deleted when the counter is 0. The hash table
composes the geographical region descriptor for
spatial data. It contains: identifiers ID, counters
and references to spatial fragments (see Figure 3).

554
GeoCache

Figure 4. Cache insertion algorithm


in the tree (line 02). The counter value related to
the spatial fragment is decremented (line 03). If
this value is equal to 0 (line 04), the DOM tree
corresponding to the spatial fragment is removed
from the hash table (line 05). Once all identifiers
have been scanned, if there is no identifier with
associated counter strictly greater than 0 (line 11),
the non-spatial tree corresponding to the query
result is removed from the cache (line 12).

GEOGRAPHICAL QUERY
PROCESSING

Spatial Operators with


Inference Rules

In this section, we introduce the spatial operators


we use to compose queries. We further detail the

Figure 5. Cache deletion algorithm

Cache Deletion Algorithm

When a geographical query result has to be evicted


from the cache, the DOM tree corresponding to the
non-spatial part of the result is logically removed.
Physically, only the referenced spatial fragments
with counter value 0 (no reference) are removed.
The algorithm is sketched in Figure 5 which
depicts the cache deletion algorithm. A Boolean
value defined at line 00 indicates if all counters
of a query result have no reference. The DOM
tree corresponding to the non-spatial part of the
query result is retrieved using the root node (line
01). Then the tree is scanned to get the spatial
fragments corresponding to each identifier stored

555
GeoCache

inference rules between geographical predicates, • Contains (Path1, Path2). Determines if the
which are useful for determining if a query result spatial object specified by the path Path1
is included in another one. This is important for ‘spatially contains’ the spatial object of
semantic cache management. path Path2.

Spatial Operators and Predicates Some spatial predicates are based on topologi-
cal relationships and can be deduced from other
For geographical queries, not only conventional spatial predicates. This means the results of spatial
comparison operators θ = {≤, <, =, ≠, >, ≥} must queries can be contained in results of other queries,
be taken into account, but also spatial operators. with different operators. If inference rules between
To query the database, we use XQuery extended spatial predicates can be established, then it could
with spatial operators as proposed in GML-QL be possible to determine whether the result of a
(Vatsavai, 2002). For illustration purpose, we geographical query is contained in another one.
select the following spatial operators (OpenGIS, This feature may allow using the cached results
2003): instead of accessing the original document, and
thus, optimizing the query.
• Overlaps (Path1, Path2). Determines if For example, consider the two GML docu-
two spatial objects specified by the paths ments: Feature1.xml in Figure 6, which contains
Path1 and Path2 are overlapping. the non-spatial elements T1, T2, T3 and a spatial
• Distance (Path1, Path2) θ d. Determines fragment of type linearGeometry (OpenGIS,
if the distance between two spatial objects 2003) standing for a polyline; and Feature2.xml
specified by the paths Paht1 and Path2 sat- in Figure 7, which contains the non-spatial frag-
isfies the relation θ d, where d is a given ments N1, N2, and a spatial fragment also of type
distance. linearGeometry.
• Touch (Path1, Path2). Determines if two
spatial objects specified by the paths Path1 Figure 6. The Feature1.xml data
and Path2 are touching.
• Intersects (Paht1, Path2). Determines if
two spatial objects specified by the paths
Path1 and Path2 are intersecting.
• Within (Path1, Path2). Determines if the
spatial object specified by Path1 is within
the spatial object of path Path2.
• Equals (Path1, Path2). Determines if the
spatial object specified by the path Path1
is ‘spatially equal’ to the spatial object of
path Path2.
• Disjoint (Path1, Path2). Determines if the
spatial object specified by the path Path1 is
‘spatially disjoint’ to the spatial object of
path Path2.
• Crosses (Path1, Path2). Determines if the
spatial object specified by the path Path1
crosses the spatial object of path Path2.

556
GeoCache

Figure 7. The Feature2.xml data


Return
<Result>
{ $b/T1 }
{$b/gml:LineString/gml:coordinates}
{ $c/N2 }
{$c/gml:LineString/gml:coordinates}
</Result>

R1 determines if two objects Feature1 and


Feature2 are within a distance closer than 1200. In
this case, the query returns the geometry of the two
objects and the specified values. R2 determines if
two objects Feature1 and Feature2 intersects. In
this case, the query returns their geometry with
the other specified elements.
Two objects are intersecting if their distance is
equals to 0. Hence, the result of R2 is contained
in the results of R1 as all the values returned by
R2 are included in the values returned by R1, and
1200 > 0. Thus the result of R2 can be extracted
Let R1 and R2 be two GML-QL queries: from that of R1 without scanning the database,
R1: provided that the result of R2 is kept in cache. As
For $b in doc(“Feature1.xml”)/Fea- semantic cache replacement policies are based
ture1/Object on the semantic of queries, the query-processing
For $c in doc(“Feature2.xml”)/Fea- algorithm shall determine if a geographical query
ture2/Object result is contained in cache. Thus, an originality
Where Distance ($b/gml:LineString/ of semantic caching in geographical databases is
gml:coordinates,$c/gml:LineString/ the ability to compare geographical predicates.
gml:coordinates) < 1200
Return
Spatial Predicate Inference Rules
<Result> { $b/T1 } { $b/T3 }
{$b/gml:LineString/gml:coordinates}
Let us denote a ⇒ b the implication and a ⇔ b
{ $c/N2 }
the equivalence inference rule. Let O1 and O2 be
{$c/gml:LineString/gml:coordinates}
two geometric objects. A simple inference rule is,
</Result>
for all real d positive or null: Intersect (O1, O2) ⇒
R2:
(Distance (O1, O2) < d).
For $b in doc(“Feature1.xml”)/Fea-
Using the nine operators defined in section
ture1/Object
3.1, more inference rules are given in Figure
For $c in doc(“Feature2.xml”)/Fea-
8. Notice that, for a couple of objects (O1, O2)
ture2/Object
verifying a predicate P at level i, if the predicate
Where Intersect ($b/gml:LineString/
P implies the predicate Q at level i+1 and if the
gml:coordinates, $c/gml:LineString/
predicate Q implies the predicate S at level i+2,
gml:coordinates)
then the operator P at level i, infers the operator S
at level i+2. That is the implication of predicates

557
GeoCache

Figure 8. Spatial predicates inference rules

is transitive. It is important to take into account results C stored in cache. The white box with full
all rules for better cache management. outline represents the new query Q. For an exact
In our proposed architecture (Savary, 2003), match, the query Q already exists in cache: ∃ Ci
spatial queries are computed using JTS, is a spa- ⊆ C \ Ci = Q. For a containing match, the query
tial Java API proposed by the OGC (Open Geo- Q is totally contained in the cache: ∃ D ⊆ C \ (Q
spatial Consortium) and based on the OpenGIS ⊂ D ∧ D ⊄ Q). For a contained match, the query
Simple Feature Specification (OpenGIS, 1999). Q contains some cache queries: ∃ D ⊆ C \ (Q
It implements the spatial operators introduced in ⊄ D ∧ D ⊂ Q). For an overlapping match, the
section 3.1. Hence, in this practical context, the query Q is partially contained in cache: ∃ D ⊆
use of spatial predicate inference rules makes it C \ (Q ⊄ D ∧ D ⊄ Q ∧ Q ∩ D ≠ ∅). Finally, for
possible to determine if a new query is contained a disjoint match, the query Q is not contained in
or partially contained in cache. cache: Q ∩ C =∅.
For exact and containing match, Q can be
Query Processing totally answered from the cache; but for contained
and overlapping match, an access to the database
Suppose an input query Q and a set C = {C1, is required in order to answer the remainder
C2…, CK} of K cache queries. Then, according query (Dar, 96). For geographical queries and
to (Lee, 1999) there are five general cases of especially for join queries, the processing time
query matching: exact match, containing match, can be prohibitive. To palliate this disadvantage,
contained match, overlapping match and disjoint we propose to exploit the geographical region
match. Figure 9 illustrates the various cases. The descriptor.
grey box in shop outline stands for the queries

Figure 9. Query match types

558
GeoCache

General Query Processing Scheme optimization. This will be detailed in sec-


tion 4.2.2 bellow.
The geographical query processing is depicted
in Figure 10. When a new query Q must be pro- The elementary probe query is performed by
cessed, the Query Analyzer determines if the query the cache Query Executor, with the non-spatial
can be answered from the cache, the database or and spatial data stored in cache. In the cases of
both. Then the Query Rewriter rewrites Q into contained match, overlapping match, and disjoint
the remainder and probe queries. We introduce match, if an optimization is possible using the spa-
two kinds of probe query: tial data stored in cache (as shown in the example
of section 4.2.2), then the required temporary
• The elementary probe query allows re- spatial data are determined using the optimized
trieving totally or partially the data stored probe query and forwarded to the Spatial Query
in cache to answer the query Q. Executor. The temporary spatial data is composed
• The optimized probe query aims at retriev- of n documents corresponding to the n features
ing the temporary spatial data for a further implied in the remainder query. Each document

Figure 10. Geographical query processing

559
GeoCache

only contains the geometries of a specific feature. avoid the process of spatial predicates for the
The remainder query is split by the Query De- remainder query result. Only comparisons with
composer into the non-spatial remainder query, geometry features of database temporary result
and the spatial remainder query. The non-spatial and geometry features of their corresponding
remainder query is free of spatial predicates and temporary spatial data are required. Moreover,
allows interrogating the database. The database it makes a better cache management, since the
temporary results (non-spatial remainder query size of those temporary spatial data documents
results) are then forwarded to the Spatial Query is reduced. Thus, performances are improved
Executor. These database temporary results are especially for join queries, where multiple scans
composed of n documents corresponding to the are often necessary. For example, if the query Q
n features implied in the remainder query. The is the following:
Spatial Query Executor performs the remainder
spatial query which contains the spatial predicates. For $b in doc(“Feature1.xml”)/Fea-
If the temporary spatial data is not empty, an ture1/Object
optimization is possible which avoids computing For $c in doc(“Feature2.xml”)/Fea-
spatial predicates. The result (remainder query ture2/Object
result) is then sent to the Query Combiner, which Where Intersect ($b/gml:LineString/
merges the remainder and elementary probe gml:coordinates, $c/gml:LineString/
query results into a single GML document R. gml:coordinates)
The query result R of Q is then sent to the cache Return <Result> { $b/a } { $c/b } {
Replacement Manager. The Cache Replacement $b/c } </Result>
Manager determines then if the result has to be
placed in cache or not, and which cache queries Let Ci be the closest cache query of Q contained
must be evicted. in cache, i.e., the query containing all the required
entities to answer Q with a minimal number of
Cache-Based Query Processing attributes not contained in Q (not mentioned in the
and Optimizations returned result of Q). Ci is found using the infer-
ence rules which have been previously presented
In the cases of contained match, overlapping (figure 6). For instance, Ci is found to be the query:
match, and disjoint match, data which are not
contained in cache are retrieved from the database. For $b in doc(“Feature1.xml”)/Fea-
To reduce the I/O and CPU costs, we propose to ture1/Object
exploit the spatial part of the cache. The idea is For $c in doc(“Feature2.xml”)/Fea-
to determine the set of spatial geometries implied ture2/Object
in the remainder query. If this set is known, then Where Distance ($b/gml:LineString/
it is possible to directly compare the geometries gml:coordinates, $c/gml:LineString/
contained in this set with the geometries contained gml:coordinates) < 9
in the documents of the database temporary re- Return
sults. This set of spatial geometries results from <Result> { $b/e } { $c/f } </Result>
the optimized probe query. The optimized probe
query results (the temporary spatial data) contain n Here, the optimized probe query contains the
documents corresponding to the n features implied intersect predicate. It will be applied to the cache
in the remainder query. Only spatial geometries query result Ci. The resulting temporary spatial
are stored in these documents. This allows to data will be composed of two documents contain-

560
GeoCache

ing only geometries of Feature1 and geometries cache and the minimal cost out of cache if it must
of Feature2 that match. The geometries of these be recomputed, as formalized by equation (i).
documents will be compared, using their cen- Secondly, the total size of the set must be equal
troïdes, with geometries of those of the database or greater than the size of the new query result,
temporary results. The required data a, b and c as formalized by equation (ii) bellow.
of respectively Feature1 and Feature2 are then
retrieved to answer the query Q. n
(i )MIN (∑ X i * CCacheCPU + (1 − X i ) * C DiskCPU )
i =1
Cache Replacement Policy: B&BGDSF n
(ii )∑ X i * S i ≥ T
i =1
When a new query result must be stored in a
saturated cache, the most irrelevant queries have
to be evicted. It is important to take into account In equation (i), Xi is set to 1 if object i is kept
the constraints on size but also the cost and fre- in cache, and 0 otherwise, CCacheCPU stands for the
quency of access to spatial objects. Thus, we base cost in CPU of the cached query result and CDiskCPU
our replacement policy on the Greedy Dual-Size stands for the cost in CPU to re-compute the query
Frequency (GDSF) algorithm proposed by Arlitt result I from the database. The second constraint
et al. (Arlitt, 99). This strategy replaces the object concerned the size of the set of cached queries to
with the smallest key value for a certain cost func- remove. It is given by equation (ii), where T stands
tion. When an object i is requested, it is given a for the size of the new query result, and Si, the
priority key Ki computed as follows: size of a cached query result i. This is equivalent
to the following expression, which can be resolved
Ci using the Branch and Bound algorithm:Where
K i = Fi * +L.
Si
n
(i )MAX (∑Yi * (CCacheCPU − C DiskCPU )).
where: Fi is the frequency usage of the object I; i =1
Yi = 1 − X i
Ci is the cost associated with bringing the object i n n
into cache.; Si is the size of the object I; and, L is (ii )∑Yi * S i ≤ ∑ Si − T
a running age factor that starts at 0 and is updated i =1 i =1

for each replaced object o to the priority key of


this object in the priority queue, i.e., L = Ko. PROOF:
Our cache replacement policy performs in two By replacing Xi by (1-Yi) in equation (i), we
steps:. First, the cost of each query result contained obtain:
in cache is computed using the GDSF policy. Ci
n
stands for the cost of a geographical query. The (i ) ⇔ MIN (∑ (CCacheCPU −Yi * (CCacheCPU − C DiskCPU ))
computation of Ci for the experimentations is dis- i =1

cussed in the experimentation section. Secondly,


The Branch and Bound algorithm (Pisinger, 04) is n
used to determine the most irrelevant query result Since ∑C CacheCPU
is a constant, the problem
in cache. In the case of spatial cache replacement i =1

policy as in the general case, the problem is to is reduced to the second expression:
determine the optimal set of candidates to be
removed. This set must have the minimal cost in

561
GeoCache

n the performances of this algorithm compared to


(i ) ⇔ MIN (−∑Yi * (CCacheCPU − C DiskCPU ))
i =1
existing ones.
n
⇔ MAX (∑Yi * (CCacheCPU − C DiskCPU ))
n
i =1 EXPERIMENTATIONS
(ii ) ⇔ ∑ (1 −Y ) * S i i
≥T
i =1
n n The Geographical Data
⇔ ∑Y i
* Si ≤ ∑ Si − T Warehouse Architecture
i =1 i =1

This formalization is generally used to resolve The cache has been implemented in a geographi-
the knapsack problem, in operational research cal data warehouse prototype. The architecture of
(Martello, 90). To solve this operational research this prototype is depicted in Figure 11.
problem, we choose the Branch and Bound algo- The geographical data warehouse is entirely
rithm. The Branch and Bound algorithm take as built using open sources components based on
parameter: the GDSF cost of each cache query Java and XML. The data extract from heteroge-
result, the size of each query result, and the size neous sources are encoded in SOAP (Simple
of the new query result. Object Access Protocol) - a standard protocol
Compared to GDSF, the proposed B&BGDSF defined by the W3C - messages and sent to the
algorithm determines the optimal set of candidates DSA (Data Staging Area) to be processed. In the
to be replaced by the new query result. Thus, DSA, the data are integrated using spatial ETL
better cache management is done. The results process before it feeds the database of the data
depicted in the experimentations below, show warehouse. The XML database eXist has been
chosen as the data warehouse manager, since it
is an open source and allows querying the non-
Figure 11. Geographical data warehouse archi- spatial data using the XQuery language. The
tecture geographical queries are computed using the JTS
(Java Topology Suite) packages which is OpenGIS
compliant. Users queries are sent to the data
warehouse embedded in a SOAP message.
The implemented architecture is composed of
four levels: sources, the DSA, the data warehouse
and the clients. The communication between these
four levels is based on SOAP protocol using the
apache tomcat Web application. At the source
level, Hibernate 2.0 is used for extracting data
from heterogeneous database. Hibernate has
been chosen since it allows interrogating any
database with the singular HQL language. The
data extracted from these sources are sent into a
SOAP message to the DSA in order to be cleaned
and integrated by the spatial ETL transformation
process. Once the data have been prepared, they
are sent to the data warehouse in SOAP messages
to feed the XML eXist database. By the same

562
GeoCache

Figure 12. Cache hit ratio


way, the user queries are sent into a SOAP mes-
sage to the data warehouse. The query analyzer
then extracts and processes the queries (section
4.2.1). The user level is composed of a graphi-
cal interface which has been implemented using
SWING Java technology. This interface allows
users to specify a request and display the queries
results. The queries results are encoded in SVG
(Scalable Vector Graphic) – the XML format for
vector graphic specified by the W3C - embedded
in SOAP messages and sent to the users to be
graphically displayed.

The Experimentation
Parameters and Results Figure 13. Cache coverage ratio

The experimentations were done on a Pentium IV


2.0 GHz with 512 MB of memory. The used dataset
is composed of 10000 and 1000 features stored in
two GML documents in the eXist (Meier, 2002)
XML database. They describe the road networks
of Lille (a French town) (Savary, 2005). Their
geometry is of type linearGeometry (OpenGIS,
2003). They have been generated by our ETL
process from the source, one was in format shape
(the exchange format of ArcGIS software), and
the other was stored in the Oracle 9i DBMS which
allows spatial data management.
Our cache replacement policy has been com-
pared with GDSF (without Branch and Bound
Figure 14. Cache hit ratio
selection), LFU and LRU, which is the most
popular in spatial database cache replacement
policy. The cache size has been moving from 10
to 100 Mega Bytes. To compare those policies,
we have simulated a flow of approximately 70
queries (point, range and join queries), possibly
made by different users. We have performed two
series of experimentations, consisting in two sets
of about 70 GML XQueries. The one (random.
gxq file) uses an arbitrary order of queries, while
in the second (freq.gxq) the most recent queries
are repeated. These two files contain different
queries. The evaluation of the first query set is
reported in Figures 12, 13 and 16. Whereas the

563
GeoCache

Figure 15. Cache coverage ratio


results obtained using freq.gxq file are illustrated
in Figures 14, 15 and 17.
Figure 12 shows the hit ratio performances
realized with a random dataset queries. Here we
can see that in general, GDSF and B&BGDSF,
outperform LRU and LFU. When the cache size
becomes larger, GDSF roughly scale like B&BGDSF.
But with smaller cache size, our B&BGDSF algo-
rithm performs better than the other cache replace-
ment policies. This is because the Branch and
Bound algorithm associated to GDSF only evicts
the most irrelevant cache results with smallest
cost and which size sum is greater but closest to
the new query which must be put in cache. Thus,
the cache is better managed. For example with a
cache size equal to 20 Mega Bytes, B&BGDSF
Figure 16. Performance gain
performs about 8% better than GDSF.
Figure 13 shows the cache coverage ratio in
percentage. Cache coverage hit shows the effect
of partial matching in semantic caching. This in-
formation is not readable in traditional cache hit
ratio. It is defined as follows (Lee, 1998):
Given a query set consisting of I queries q1,
…qI, let Ai be the number of answers found in
the cache for the query qi, and let Bi be the total
number of answers for the query qi for 1≤ i ≤I.


I
QCRi
CCR = i =1

I

Figure 17. Performance gain where

Ai
QCRi =
Bi

if Bi > 0.
Here again, we can notice that in general
GDSF and B&BGDSF outperform LRU and LFU.
But it is interesting to see that compared to Fig-
ure 12, when the number of hit ratio is equal for
GDSF and B&BGDSF with a cache size of 50 MB
or 60MB, the number of cache coverage ratio is
better using the B&BGDSF algorithm. Figure 16

564
GeoCache

shows the performance gain which is defined as but B&BGDSF outperforms GDSF, LFU and LRU,
follows (Brinkhoff, 2002): especially when the cache size is smaller (from
10 to 50 MB). This is due to a better selection of
disk _ acces _ for _ a _ given _ Policy cache queries to evict.
gain = −1
disk _ access _ of _ B & BGDSF These results show that the cache is better
managed using the B&BGDSF compared to GDSF,
LFU and LRU cache replacement policies. The
where Policy stands for one of the three cache cache hit ratio (Figures 12 and 14) show that
replacement policies: GDSF, LRU and LFU. B&BGDSF outperforms GDSF, LRU and LFU; but
The performance gain shows the reduction when the performances of B&BGDSF are roughly
in the number of disk access using the proposed equal to another cache replacement algorithm, the
cache replacement policy, compared to others. In cache coverage ratio (Figures 13 and 15) show
general, we can notice that the gain is positive us- that a better cache management is obtained us-
ing our proposed algorithm. However, depending ing the proposed algorithm. This is confirmed by
on the cache size, the gain with GDSF is more the performance gain in disk access presented in
or less important, especially when the cache size Figures 16 and 17.
becomes small or large. For example in Figure
12 with a cache size equal to 20 or 100 MB,
the performances of GDSF are the same or less CONCLUSION
than B&BGDSF. However, if we look at the gain
distribution, the values obtained are about 8 or In order to optimize the I/O-performance of a da-
18%. Especially with a cache size of 100 MB, tabase system, many techniques have been consid-
the hit ratio for GDSF and B&BGDSF are roughly ered. One of the most used in spatial database is the
the same, but the gain obtained is different about buffering technique associate to the LRU strategy
18%. These results show that the cache is bet- and its variants. Related works generally focus on
ter managed using B&BGDSF. For a cache size spatial data stored in object-relational databases.
equal to 30 Mega Bytes, we can see that the gain With the popularity of Internet and XML, the OGC
compared to GDSF is roughly equal to 23%. But proposed the GML semi-structured language,
for a cache size between 40 and 90 MB, the gain which is an extension of XML for the modeling,
compared to GDSF is roughly the same or less transport and storage of geographical information
than our proposed algorithm. But the Hit Ratio including both spatial and non-spatial fragments
and Cache Coverage Ratio show that better cache of geographical data. The GML language is more
management is obtained using the B&BGDSF algo- and more used in many distributed architectures.
rithm. Moreover, for a cache size of 40 or 60 MB In order to handle efficiently geographical queries
in Figure 12, LRU performs like B&BGDSF; but in stored in GML documents, we have proposed an
Figure 16 the gain obtained compared to LRU is appropriate cache structure and cache replace-
the same or better (for example with a cache size ment policy. The proposed cache structure aims
of 40 MB). For a cache size of 50 MB peaks of at considerably reduce the memory used to store
gain around 37% with LRU and 25% with LFU geographical queries. Only non redundant spatial
are obtained. fragments of a geographical query are stored. Thus,
Figures 14, 15 and 17 show other experimen- much more queries can be stored in cache. This
tations performed with another query set where architecture is associated to a semantic cache where
the most recent queries are repeated. In this case, inference rules between spatial predicates are ex-
we notice that LRU outperforms GDSF and LFU, ploited for a better cache management. Associated

565
GeoCache

with this semantic cache, a new cache replacement Brinkhoff, T. (2002). A robust and self-tuning
policy is proposed. It takes into account the cost page-replacement strategy for spatial database
implied by the different predicates and gives bet- systems. The 8th International Conference on
ter performances than existing cache replacement Extending Database Technology (EDBT 2002)
policies like GDSF, LRU and LFU. The proposed (pp. 533-552), Prague, Czech Republic, 2002.
cache replacement policy is based on Branch and LNCS, Vol. 2287, Springer-Verlag.
Bound and GDSF algorithms. It only evicts the
Brinkhoff, T., Horn, H., Kriegel, H. P., & Sch-
most irrelevant cache results according to the cost
neider, R. (1993). A storage and access architecture
and size of data stored in cache. The hit ratio, cache
for efficient query processing in spatial databases.
coverage ratio and performance gains show that the
In Proceedings of the 3rd International Symposium
proposed B&BGDSF replacement policy outperforms
on Large Spatial Databases (pp. 357-376), Sin-
GDSF, LRU and LFU algorithms. In perspective,
gapore. LNCS, Vol. 692, Springer.
this work will be extended to spatiotemporal data
stored in semi-structured documents in order to Cao, P., & Irani, S. (1997). Cost-aware WWW
optimize spatiotemporal queries. proxy caching algorithms. Proceedings of USENIX
Symposium on Internet Technologies and Systems
(USITS) (pp. 193-206), Monterey, CA, December.
REFERENCES
Chen, L., Wang, S., & Rundensteiner, E. A. (2004).
Aboulnaga, A., & Naughton, J. (2000). Accurate Replacement strategies for XQuery caching sys-
estimation of the cost of spatial selections. IEEE tems. Data & Knowledge Engineering, 49(2),
International Conference on Data Engineering 145–175. doi:10.1016/j.datak.2003.10.002
(ICDE ’00) (pp. 123-134), San Diego, CA. Chidlovskii, B., Roncancio, C., & Schneider, M.
Abrams, M., Standbridge, C. R., Adbulla, G., L. (1999). Semantic CacheMechanism for hetero-
Williams, S., & Fox, E. A. (1995). Caching prox- geneous Web querying. In Proceedings of the 8th
ies: Limitations and potentials. WWW-4, Boston World-Wide WebConference (WWW8).
Conference. Chou, H., & DeWitt, D. (1985). An evaluation
Arlitt, M., Friedrich, R., Cherkasova, L., Dilley, J., of buffer management strategies for relational
& Jin, T. (1999). Evaluating content management database systems.In Proceedings of the 11th VLDB
techniques for Web proxy caches. In Proceedings Conference.
of the Workshop on Internet Server Performance Corocoles, J. E., & Gonzalez, P. (2003). Querying
(WISP), Atlanta, GA. spatial resources. An approach to the semantic
Beckmann, N., Kriegel, H. P., Schneider, R., & geospatial Web. CAISE 2003 Workshop, Web,
Seeger, B. (1990). An efficient and robust access e-Business, and the Semantic Web (WES: Foun-
method for points and rectangles. In Proceeding dations, Models, Architecture, Engineering and
ACM SIGMOD International Conference on Man- Applications, LNCS, Springer-Verlag. Dar, S.,
agement of Data (pp. 322-331), Atlantic City, NJ. Franklin, M. J., Jonsson, B. T., Srivastava, D., &
Tan, M. (1996). Semantic data caching and replace-
Boucelma, O., Messid, M., & Lacroix, Z. (2002). ment. Proceedings of the 22nd VLBD Conference,
A WFS-based mediation system for GIS interoper- Bombay, India.
ability. The 10th ACM International Symposium on
Advances in Geographical Information Systems
(ACM GIS), McLean, VA.

566
GeoCache

Gupta, A., Marciano, R., Zaslavsky, I., & Baru, O’Neil, E. J., O’Neil, P. E., & Weikum, G. (1993).
C. (1999). Integrating GIS and imagery through The LRU-K Page replacement algorithm for da-
XML-based information mediation. In P. Agouris tabase disk buffering. In SIGMOD (pp. 297-306).
& A. Stefanidis (Eds.), Integrated spatial data-
Open GIS Consortium. (1999). Inc., OpenGIS
bases: Digital images and GIS, LNCS, Vol. 1737
Simple Features Specification for SQL Revision
(available from http://www.npaci.edu/DICE/
1.1. OpenGIS Project Document 99-049.
Pubs/).
Open GIS Consortium. (2003). Inc., OpenGIS®
Kellerer, H., Pferschy, U., & Pisinger, D. (2004).
Geography Markup Language (GML) Imple-
Knapsack problems. Springer.
mentation Specification, version 3.00, Document
Leclercq, E., Djamal, B., & Yétongnon, K. 02-023r4.
(1999). ISIS: A semantic mediation model and
Pagel, B., Six, H., Toben, H., Widmayer, P. (1993).
an agent-based architecture for GIS interoper-
Towards an Analysis of Range Query Performance
ability. Proceedings of the International Database
in Spatial Data Structures. PODS. 214-221.
Engineering and Applications Symposium IDEAS
(pp. 81-92), Montreal, IEEE Computer Society. Savary, L., & Zeitouni, K. (2003). Spatial data
warehouse—A prototype. In The 2nd EGOV In-
Lee, D., & Chu, W. (1998). A semantic caching
ternational Conference (pp. 335-340), Prague.
schema for wrappers in Web databases. The 1st
LNCS 2739.
ACM International Workshop on Web Information
and Data Management (WIDM), Washington DC. Savary, L., & Zeitouni, K. (2005). Automated lin-
ear geometric conflation for spatial data warehouse
Lee, D., & Chu, W. (1999). A semantic caching via
integration process. The 8th AGILE Conference on
query matching for Web sources. In Proceedings
GIScience. Estirl, Portugal.
of the 8th ACM International Conference on In-
formation and Knowledge Management (CIKM), Silvano, M., & Paolo, T. (1990). Knapsack prob-
Kansas City, Missouri, USA. lems: Algorithms and computer implementations.
John Wiley & Sons.
Lorenzetti, P., & Rizzo, L. (1996). Replacement
policies for a proxy cache. Technical report, Sindoni, G., Tininini, L., Ambrosetti, A., Bedes-
Universita di Pisa. chi, C., De Francisci, S., & Gargano, O. (2001).
SIT-IN: A real-life spatio-temporal information
Luo, O., Naughton, J. F., Krishnamurthy, R., Cao,
system. VLDB, 2001, 711–712.
P., & Li, Y. (2000). Active query caching for da-
tabase {Web} servers. ACM SIGMOD Workshop Stoimenov, L., Djordjevic-Kajan, S., & Stoja-
on the Web and Databases, WebDB. novic, D. (2000). Integration of GIS data sources
over the Internet using mediator and wrapper
Meier, W. (2002). eXist: An open source native
technology. MELECON 2000, 10th Mediterranean
XML database. In A. Chaudri, M. Jeckle, E.
Electrotechnical Conference (Vol. 1, 334-336).
Rahm, R. Unland (Eds.), Web, Web-Services,
and Database Systems. NODe 2002 Web- and Tanenbaum, A. S. (1992). Modern operating
Database-Related Workshops. Springer LNCS systems. Prentice-Hall.
Series, 2593. Erfurt, Germany.

567
GeoCache

Theodoridis, Y., & Sellis, T. (1996). A model for Yang, O., Zhang, H., & Zhang, H. (2003). Taylor
the prediction of r-tree performance. In Proceed- series prediction: A cache replacement policy
ings of the 15th ACM Symp. Principles of Database based on second-order trend analysis. The 34th
Systems (pp. 161-171). Annual Hawaii International Conference on
System Sciences, Maui, Hawaii.
Vatsavai, R. R. (2002). GML-QL: A spatial query
language specification for GML.UCGIS Summer, Zhang, M., Javed, S., Shaheen, A., & Gruenwald,
Athens, Georgia. L. (2001). A prototype for wrapping and visual-
izing geo-referenced data in distributed environ-
Voisard, A., & Juergens, M. (1999). Geographical
ments using the XML technology. ACMGIS at
information extraction: Querying or quarrying?
VA (pp. 27-32).
In M. Goodchild, M. Egenhofer, R. Fegeas, C.
Kottman (Eds.), Interoperating geographical in-
formation systems, New York: Kluwer Academic
Publishers.

This work was previously published in International Journal of Data Warehousing and Mining, Vol. 3, Issue 1, edited by D.
Taniar, pp. 67-88, copyright 2007 by IGI Publishing (an imprint of IGI Global).

568
569

Chapter 35
PDE-Based Image Processing:
Image Restoration

Rajeev Srivastava
Institute of Technology, Banaras Hindu University (ITBHU), India

ABSTRACT
This chapter describes the basic concepts of partial differential equations (PDEs) based image mod-
elling and their applications to image restoration. The general basic concepts of partial differential
equation (PDE)-based image modelling and processing techniques are discussed for image restoration
problems. These techniques can also be used in the design and development of efficient tools for various
image processing and vision related tasks such as restoration, enhancement, segmentation, registration,
inpainting, shape from shading, 3D reconstruction of objects from multiple views, and many more. As
a case study, the topic in consideration is oriented towards image restoration using PDEs formalism
since image restoration is considered to be an important pre-processing task for 3D surface geometry,
reconstruction, and many other applications. An image may be subjected to various types of noises
during its acquisition leading to degraded quality of the image, and hence, the noise must be reduced.
The noise may be additive or multiplicative in nature. Here, the PDE-based models for removal of both
types of noises are discussed. As examples, some PDE-based schemes have been implemented and their
comparative study with other existing techniques has also been presented.

INTRODUCTION resentation of the image. The scientific structure


of any image can be supposed to be based on an
Image processing is a rapidly growing field which intrinsic principles of mathematics i.e. from im-
can be defined as the manipulation of an image age analysis to image processing. Image analysis
for the purpose of either extracting information includes modeling and analysis of original image
from the image or producing an alternative rep- itself, i.e. from image space analysis to different
methods to represent image. The various tools of
DOI: 10.4018/978-1-4666-2038-4.ch035 image analysis include spectral analysis, wavelets,

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
PDE-Based Image Processing

statistics, level-sets and partial differential equa- acterization of the quantity that each picture ele-
tions (PDEs). On the other hand, image processing ment or pixel represents. An image could represent
is to modify the original image to improve the luminance of objects in a scene such as in digital
quality or extracting information from the given photography, the absorption characteristics of the
image, for example, image restoration, compres- body tissue as in X-Ray imaging, the radar cross
sion, segmentation, shape and texture analysis. section of a target in radar imaging, the tempera-
There are two dual fields that are directly con- ture profile of a region in infrared imaging or the
nected to the image processing in contemporary gravitational field in an area in geophysical imag-
Computer science. These are Computer vision ing. In general, any 2D function that contains
which is related to the construction of the 3D world information can be considered as an image. Image
from the observed 2D images; and another one models give a logical or quantitative description
is Computer graphics which pursues the opposite of the properties of this function. There are three
direction in designing suitable 2D scene images to crucial ingredients of image processing which
simulate our 3D world. Image processing can be include modeling, analysis, and efficient imple-
considered as the crucial middle way connecting mentation of processing tools. The design of a
the vision and graphics fields. Image process- successful image processing technique relies on
ing can be considered as an input-output system having a successful model for images. A funda-
shown in Figure 1. mental issue faced in the design of image analy-
I0 is the input data which represents an observed sis techniques is the identification and character-
or measured single image or image sequences; τ ization of the image space. The various
denotes a typical image processor e.g. restoration, approaches to image modeling and analysis in-
enhancement, segmentation, compression, inter- clude statistical representations or random field
polation, feature extraction, inpainting etc.; and modeling; spectral and wavelet representations;
I denotes output which is also an image or an and scale-space representations or regularity
image sequence (I1,I2,I3,…..) that contains all the spaces.
targeted image features. The problem very often The statistical approaches treat images as
encountered in this area is to design an efficient samples from random fields, which are often mod-
and cost effective and accurate image processor. eled by Markov or Gibbs fields or via statistical
Typical design tasks include: de-noising, de- learning from an image database. The statistical
blurring, edge detection, intensity enhancement, properties of the fields are often established
inpainting, interpolation, compression and de- through the filtering technique and learning
compression etc. In addition to these relatively theory. Statistical models describe an image as a
low-level tasks, there are mid- and high-level member of an ensemble, often characterized by
tasks like disocclusion, shape from shading, mo- its mean and covariance functions. This permits
tion analysis, image segmentation, and pattern development of algorithms that are useful for an
identification and recognition. In image represen- entire class or an ensemble of images rather than
tation (Jain, 2006), one is concerned with char- for a single image. Often the ensemble is assumed

Figure 1. An image processing system

570
PDE-Based Image Processing

to be stationary so that the mean and covariance Transform uses overlapping functions of variable
functions can easily be estimated. Random field size for analysis. The overlapping nature of the
modeling is the most appropriate approach for transform alleviates the blocking artefacts, as each
describing natural images with rich texture patterns input sample contributes to several samples of the
such as trees and mountains. Stationary models output. The variable size of the basis functions,
are useful in data compression problems such as in addition, leads to superior energy compaction
transform coding, restoration problems such as and good perceptual quality of the decompressed
Wiener filtering (Jain, 2006), and in other applica- image. Wavelets Transform is based on the concept
tions where global properties of an ensemble are of sub-band coding (Gonzalez, 1987; Soman,
sufficient. A more effective use of these models 2004; Topiwala, 1998). The current applications
in image processing is to consider them to be of wavelet include statistical signal processing,
spatially varying or piecewise spatially invariant. Image processing, climate analysis, financial time
The spectral and wavelet representations are series analysis, heart monitoring, seismic signal
also playing an important role in the design and de- noising, de-noising of astronomical images,
analysis of an efficient image processor. Wave- audio and video compression, compression of
let analysis is about analyzing signal with short medical image stacks, finger print analysis, fast
duration finite energy functions. They transform solution of partial differential equations, computer
the signal under investigation into another repre- graphics etc.
sentation which presents the signal in more useful The scale-space decomposition is a partial
form. This transformation of the signal is called differential equation (PDE) based approach to the
Wavelet Transform (Gonzalez, 1987) i.e. Wavelet multi-scale analysis of images. This theory is based
Transforms are based on small waves, called wave- on the linear filtering or heat diffusion process, and
lets, of varying frequency and limited duration. was highlighted by the famous nonlinear model as
Unlike the Fourier transform, we have a variety of proposed by (Perona, & Malik, 1990) for remov-
wavelets that are used for signal analysis. Choice ing small-scale noise images without losing large
of a particular wavelet depends on the type of ap- scale structures like edges. There are other novel
plication in hand. Wavelet Transforms provides theories and trends for image analysis, including
time-frequency view i.e. provides both frequency regular-space representations by the bounded
as well as temporal (localization) information and variation (BV) space, free boundary models, and
exhibits multi-resolution characteristics. Fourier shape models (Romeny,1994).
transform is good for periodic or stationary signals The first step of image processing is to con-
and Wavelet is good for transients. Localization struct suitable models for the given task. Currently,
property allows wavelets to give efficient repre- processing models are being developed using tools
sentation of transients. In Wavelet transforms a like Bayesian decision, variational optimization,
signal can be converted and manipulated while inverse problems and the like. These approaches
keeping resolution across the entire signal and still arise from fields such as statistical mechanics,
based in time i.e. Wavelets have special ability to the calculus of variations, nonlinear PDEs, dif-
examine signals simultaneously in both time and ferential geometry and topology, harmonic and
frequency. Wavelets are mathematical functions functional analysis, and numerical analysis. For
that satisfy certain criteria, like a zero mean, and example, numerous PDE based methods (Perona,
are used for analyzing and representing signals or & Malik, 1990; Rudin et al., 1992; Witkin, 1983;
other functions. A set of dilations and Translations Gilboa et al., 2004) have been proposed for image
of a chosen mother wavelet is used for the spatial/ de-noising problem using PDEs. Once an image
frequency analysis of an input signal. The Wavelet processing model is selected, the next step is its

571
PDE-Based Image Processing

analysis, in order to answer questions of existence is not available in many practical applications such
and uniqueness, stability, properties of solutions, as in space exploration and astronomy. Therefore,
and so on. Many image processing models are to retrieve unknown images one has to incorporate
nonlinear or non-convex. Analyzing them requires human visual perception mechanism in order to
new mathematical insights. Relatively few image accurately restore the image i.e. restores the im-
processing models have been analyzed in this way. age according to the requirement of the desired
There is a vast engineering literature of image information. The various possible reasons that can
processors, which were mostly developed based cause the observed image to deviate from the origi-
on empirical insights and leaps of good intuition. nal one may include atmospheric irregularities,
They require rigorous mathematical analysis that instrument aberrations, diffraction limit, detector
can be crucial for answering questions of great noise, thermal noise, electronic noise, and intrinsic
practical significance, such as which among the noise such as Poisson noise. These noises may be
many techniques proposed for the same task is additive, multiplicative (speckle noise) or may
superior, what effect the various parameters that be distributed according to a specific probability
appear in a method have on its behavior, and under distribution (PDF) function in an image. Several
what conditions a given technique can be expected restoration methods such as Weiner Filtering,
to perform well. Finally, an efficient algorithm for Inverse Filtering, Constrained Least Squares, and
implementing the processing model is devised. Lucy-Richardson iteration have been proposed in
The image restoration is a fundamental pre- literature (Gonzalez, &Wintz, 1987; Sonka et al,
processing step of many image processing or 2007; Lucy, 1974) that remove the noise either
computer vision related applications. The goal using Fourier Transformation in frequency domain
of image restoration problem is to restore the or by using optimization techniques. The various
degraded image for further analysis and use of methods to estimate the degradation function for
image data in a specific application. During the use in restoration are observation, experimenta-
capture or formation of an image through an tion and mathematical modeling. In recent past
imaging device it may be subjected to various years, a new idea has been evolved by the research
types of noise that may lead to degradation of community for image processing in continuous
the observed image. The noise may be additive domain using partial differential equation (PDEs)
or multiplicative in nature. The additive noise is formalism or scale-space decomposition in par-
easy to remove in comparison to multiplicative allel to the statistical modeling. The scale-space
one. Image restoration techniques are based on decomposition (Chambolle, 1994; Witkin, 1983)
modeling the degradation using a priori knowledge is a PDE approach to the multi-scale analysis of
and applying the inverse process in order to restore images. These PDE based approaches arise from
the original image (Gonzalez, & Wintz, 1987). the fields of mathematical physics such as various
For some images, the a priori information of the transport phenomenon e.g. heat flow, fluid flow,
degradation phenomenon may not be available that statistical mechanics, the calculus of variations,
can be further used for modeling the degradation nonlinear PDEs, and numerical analysis. In PDE
process. In such cases, blind restoration techniques based image processing (Caselles et al., 1998),
are applied for the restoration of the corrupted the basic idea is to obtain the processed image
image. In blind image restoration technique, the as the solution of PDE’s, being in general the
original image is estimated without the explicit initial condition a function of the original image.
knowledge of the underlying degradation process. The extensive research on numerical analysis al-
This process is difficult since information about the lows us the implementation of these equations,
original image or the underlying blurring process obtaining accurate image processing algorithms.

572
PDE-Based Image Processing

When the image is represented as a continuous PDE-BASED IMAGE PROCESSING


signal, PDEs can be seen as the iteration of local
filters with an infinitesimal neighborhood. This General Principles
interpretation of PDEs allows to unify and clas-
sify iterated filters as well as to derive new ones Partial Differential Equations (PDE’s) and
(Caselles, 1998). In literature, authors Price et al. Geometry-Driven Diffusion in image process-
(1990); Alvarez (1992); Alvarez (1993); Rudin et ing and analysis is a relatively new area which
al. (1992); Osher, (1990) have proposed various provides a strong theoretical framework for image
methods for image restoration and de-noising and processing and analysis. Attention to these meth-
edge detection using PDEs formalism including ods has increased recently due to the important
the pioneering work of Perona, & Malik (1990). theoretical and practical results that have been
The proposed chapter is divided in to five obtained. The use of PDEs and curve or surface
sections. Introduction, gives an introduction of evolution theory in image modeling and analysis
the topic; PDE-based image processing, presents became a major research topic in recent years. The
general principles, literature survey, examples and PDE-based approach permits to unify and extend
application areas of PDE-based image process- existent algorithms, as well as to develop novel
ing; Performance metrics, gives definitions of ones. Extensive research on numerical analysis
various performance metrics that can be used to allows the implementation of these equations,
evaluate the performance of a image processing obtaining accurate image processing algorithms.
model; Image restoration, is the main section of The basic idea is to obtain the processed image as
the chapter where various PDE-based techniques the solution of PDE’s, being in general the initial
used to restore an image corrupted with additive condition a function of the original image.
and multiplicative noises are discussed. The first To explain this (Caselles et al., 1998), suppose
subsection, deals with various PDE-based additive ϕ0: R2→R represent a gray level image, where ϕ0(x,
noise removal techniques. The second subsection, y) is the gray level value. Introducing an artificial
deals with multiplicative noise reduction from time t parameter the image deforms according to
images. The example of multiplicative noise in-
cludes speckle noise present in various imaging ∂ϕ
= F [ϕ(x , y, t )]; (1)
modalities such as ultrasound imaging, SAR imag- ∂t
ing and digital holography. The subsequent next
section, examines the statistics of speckle noise where ϕ(x, y, t)=R2×[0,τ) →R is the evolving
patterns for various imaging modalities, discusses image, F: R→R is an operator characterizing the
the standard speckle reduction techniques avail- algorithm and the image ϕ0(x, y) is the initial
able in literature, PDE-based speckle reduction condition. The solution ϕ(x, y, t) of the PDE gives
techniques, performance analysis and comparison the processed image at scale t.
of PDE-based techniques with other techniques In case of vector valued images, a system of
and finally last section, gives the conclusion of coupled PDEs of the form of (1) is obtained. The
the chapter. same formalism may be applied to planar curves
(boundaries of planar shapes), where ϕ(x, y) is
a function from R→R2 or surfaces, functions
from R→R3. In this case the operator F must be
restricted to the curve, and all isotropic motions
can be described as a deformation of the curve

573
PDE-Based Image Processing

or surface in its normal direction, with velocity Extensive research is also being done on the
related to its principal curvature(s) and flow of direct derivation of evolution equations which
the form are not necessarily obtained from the energy ap-
proaches. This is the case for a number of curvature
∂ϕ →
equations of the form (2).
= F (ki ) N , (2)
∂t The ideas on the use of PDEs in image pro-
cessing (Caselles et al, 1998) were introduced
is obtained, where ki’s are the principal curvatures very back by Gabor (1965); Jain (1977); and Jain
→ (1978). The notion of scale space i.e. representa-
and N is normal to the curve or surface ϕ(x, y). tion of images simultaneously at multiple scales
A tangential velocity can also be added, which which is now the basis of most of the research in
may help the analysis but does not affect the PDEs for image processing was introduced by
geometry of flow. Koederink (1983) and Witkin (1984) where they
The PDEs describing the evolution phenom- shown that the multiscale image representation is
enon can be obtained from variational problems obtained by Gaussian filtering which is equivalent
and for an image processing problem it is for- to deforming the optimal image via the classical
mulated as heat equation, which generates an isotropic dif-
fusion flow. Perona and Malik (1990) work on
arg{MinϕU (ϕ)}, (3) anisotropic diffusion proposed to replace Gaussian
smoothing which is equivalent to isotropic diffu-
sion via heat flow, by a directional diffusion that
where U is a given energy. preserves edges. Their work opened a number of
Let F(ϕ) denote the Euler-Lagrange derivative theoretical and practical questions that continue
i.e. first variation and under general assumptions, to occupy the PDE image processing community.
a necessary condition for ϕ to be minimizer of U In the same framework, the very good work of
is that F(ϕ)=0, then the local minima may be com- Osher and Rudin (1990) on shock filters and Rudin
puted via the steady state solution of the equation et al. (1992) on total variation (TV) decreasing
methods explicitly stated the importance and the
∂ϕ need for understanding PDEs for image process-
= F(ϕ), (4)
∂t ing applications.
Many other interesting problems in computer
where t is an artificial time parameter. One of the vision can be formulated as minimisation problems
most popular example for energy functional U, for for energy functional (Shen, 2002). In order to
an image, to be minimized is the Dirichlet integral solve these problems, the Euler- Lagrange equa-
tions of the functional are computed, resulting
2
in a set of necessary conditions. In effect, these
U (ϕ) = ∫ ∇ϕ (x )dx , (5)
conditions are partial differential equations which
are reformulated as a surface evolution problem.
which is associated with the heat equation Among the first successfully utilised for computer
vision problems was Snakes: Active Contours
∂ϕ Models (Kass et al., 1988). While originally
= ∆ϕ(x ). (6) developed for object segmentation in 2D, this
∂t
approach can be generalised to 3D. The minimal

574
PDE-Based Image Processing

surfaces may be employed for 3D reconstruction Surface Evolution


of static objects from multiple views as proposed
by Faugeras and Keriven (1998). The basic con- Analogous to curve evolution, one can define a
cepts of curve and surface evolution are as follows surface evolution on a surface S0=D→R3. Let u
(Osher, &Sethian, 1988; Romeny,1994; Evans, & and v denote the surface parameters and t≥0 the
Spruck,1991): necessary time parameter. The surface evolution
is then defined by
Curve Evolution


S (u, v, t ) = α1 (u, v, t ) tu (u, v, t )
The curve evolution deals with the task of deform- ∂t
ing a curve. Let C0: I→R2 be initial curve and let p → →
+α2 (u, v, t ) tv (u, v, t ) + β(u, v, t ) n(u, v, t ),
denote its parameterisation. To describe an evolu-
(9)
tion in time, a time parameter t≥0 is introduced.
The curve evolution is defined as
with S (u, v, 0) = S 0 (u, v ).
→ → →
→ →
∂ tu (u, v, t ) ,tv (u, v, t ) and n(u, v, t ) denote the
C (p, t ) = α(p, t ) t (p, t ) + β(p, t ) n(p, t ),
∂t tangential vectors in u and v direction and the
(7)
surface normal, respectively. Similar as in the
case of curves, the tangential components of the
with the initial curve C(p,0)=C0(p).
surface evolution act only as re-parameterisations
The movement of each curve point is written
→ →
and are therefore irrelevant for the geometric
in the local coordinate system ( t ; n ) of the curve, shape of the evolving surface. As in the curve
→ → evolution case, the constant velocity flows
where t denotes the tangential vector and n ∂ →

denotes the normal vector. This equation can be S = ± n describe dilations and erosions. As
∂t
simplified to a curve evolution restricted to the there are two different concepts of curvature on
normal direction, i.e. surfaces, one can define different flows, one de-
pending on the mean curvature H and one depend-

∂ ing on the Gaussian curvature k. Most of the PDEs
C (p, t ) = β(p, t ) n(p, t ) , (8)
∂t used in image processing and computer vision
are based on moving curves and surfaces with
if the normal velocity 𝛽(p, t) only depends on C(p, curvature based velocities. In this area, the level
t) and t. If this is the case, the tangential motion set numerical method developed by Osher and
only acts as a re-parameterisation of the curve. Sethian (1988) played a very important role. The
The curve evolution in Equation (8) depends on basic idea is to represent the deforming curve,
the definition of the velocity (p, t). If, for example surface or image, as the level set of a higher di-
(p, t) = 1 for all parameterisations, the evolving mensional hyper surface. This technique not only
curve at a certain time step t is a dilation of the provides more accurate numerical implementa-
initial curve C0. Erosion can thus be modelled by tions but also solves topological issues that were
reverting the sign of (p, t). As another example, very difficult to treat before. The representation
(p, t)= k(p, t) yields the prominent Euclidean of objects as level sets (zero-sets) is a fundamen-
geometric heat flow or curvature motion. Where tal mathematical technique in mathematical
k(p, t) denotes the Euclidean curvature. morphology (Osher, & Fedkiw, 2001; Evans,

575
PDE-Based Image Processing

&Spruck,1991). Another key contribution in the ∂Φ


= F1[Φ(x , y, t )] and
PDE formalism has been the general segmentation ∂t
frame work developed by Mumford and Shah ∂Φ
(1989). Their work has unified a large number of = F2 [Φ(x , y, t )], then they can be com-
∂t
image segmentation approaches, and opened as ∂Φ
well a large number of theoretical and practical bined as = αF1 + F2 , where α ∈ R + .
∂t
problems. Kimia et al. (1990) introduced curve If F1 and F2 above are the corresponding
evolution methods in to computer vision for a Euler-Lagrange operators of two energy
computational theory of a planar shape. Refer- minimization problems with energies U1
ences to works of other authors in related field and U2, then the flow above minimizes the
can be found in (Caselles et al., 1998). The frame energy αU 1 + U 2 .
work of PDE’s and geometry driven diffusion • Another important advantage of the PDE
have been applied to many problems in image approach is the possibility of achieving
processing and computer vision, since the seminal high accuracy and stability, with the help
works mentioned above. Examples include con- of extensive available research on numeri-
tinuous mathematical morphology, invariant shape cal analysis.
analysis, shape from shading, segmentation, object
detection, optical flow stereo, image de-noising,
image sharpening, contrast enhancement, image APPLICATION AREAS
interpolation etc.
Scale-space theory, anisotropic diffusion, vector-
Advantages of PDEs Evolution valued diffusion, systems of coupled PDE’s,
Approaches in Image Analysis PDE based segmentation, image and video en-
(Caselles et al., 1998) hancement, shape analysis, axiomatic and PDE
based theoretical approaches to image analysis,
• They bring out a new concept to the area. variational and PDE’s-from-energy approaches
• One can think about image processing in to image processing, curve and surface evolution,
continuous domain. PDE’s in hybrid systems, comparison of models,
• The problem in hand is then approached image and noise models and adaptation of PDE’s,
as an image deformation task, which help implementations and numerical analysis, Applica-
to arrive to novel solutions of classical tions (Mathematical morphology, Invariant shape
problems. analysis, Shape from shading, Segmentation,
• When the image is represented as a con- Object detection, Optical flow, Image Denois-
tinuous signal, PDEs can be seen as the it- ing, Image sharpening, Contrast Enhancement,
eration of local filters with an infinitesimal Image quantization, and Image Compression).
neighborhood. This interpretation of PDEs Examples of some classical PDE-based algo-
allows to unify and classify iterated filters rithms and explanation about how the modeling
as well as to derive new ones. of curves and surfaces may occur are explained
• PDEs approach allows not only to derive in following sections.
new algorithms, but also to unify previous
ones. PDE formulation is natural in order
to combine algorithms. If two different im-
age processing schemes are given by

576
PDE-Based Image Processing

Image Restoration evolution and may come from energy minimiza-


tion or from pertinent local heuristics (Kass et al.,
A noisy 2D image may be considered for instance 1988; Morel, 1988; Caselles, 1997).
as a surface I: (x, y)→(x, y, I(x, y). With this kind
of model, regularization or smoothing of the im- Image Registration and Optical Flow
age I may be equivalent smooth surface similar
enough to the original noisy one. This can be The idea is to find a function allowing to trans-
done by minimizing energy functional or directly form an image to another one (Zitova, & Flusser,
designing PDE’s with specific regularization 2003; Brown, 1992). It is used to detect motions
behaviors that evolve the noisy surface (Perona, in video sequences by registering consecutive
& Malik,1990; Rudin,1992; Gilboa et al., 2005; frames, or readjust two images in a way that they
You, & Kaveh, 2000). fit together such as in application medical image
analysis. Here, a vector field models the pixel
Image Segmentation motions between the two images and a PDE is
used to describe its evolution until it converges
A classical way of segmenting images with PDEs to the expected image transformation.
is to model a closed contour with a 2D curve and
then to evolve it from an initial position (random Shape from Shading
or user defined) until it fits the exact shape of the
objects present in the picture. Like image resto- The problem is reconstructing a 3D representation
ration purposes, a PDE is describing the curve of an object from a single photograph of it. It is

Figure 2. Noisy image (left), and restored image Figure 3. Image segmentation treated as the
(right) evolution of a 2D contour curve, original image
with initialization curve (left), and segmented
image (right)

Figure 4. Image registration treated as the evolution of a displacement field, Direct superposing of two
images (First two images from left), and superposing after image registration (last two images in right)

577
PDE-Based Image Processing

possible if one looks at the intensity variations of PERFORMANCE MEASUREMENT


the image pixels due to the shadows and the dif- METRICS
ferent illumination conditions during the snapshot.
PDEs can describe the flow of an originally flat The various performance measurement metrics
3D surface converging to the 3D shape of the real for an image of size m×n used for evaluating the
object. A survey report on shape and shading is performance and efficacy of the various proposed
presented in (Zhang, 1999). methods are defined as follows:

Image and Surface Inpainting Mean square error

Inpainting refers to technique of modifying an 1 m n 2

image in an undetectable form. It has numerous


MSE =
m ×n
∑ ∑ I (i, j ) − I (i, j )
'

i =1 j =1
applications such as restoration of scratched (10)
photographs, removal of undesirable objects,
reconstruction of 3D surfaces, etc. Inpainting is where I is the observed noisy image and I ' is the
done by propagation of information of surround- filtered image.
ing structure and intensity. The various methods
available in literature includes geometric approach Root mean square error
and iterative algorithm. PDE based methods can
be used for image and surface inpaintings (Chan,
RMSE = MSE (11)
2005).

Peak signal-to-noise ratio


Figure 5. Image inpainting: original scratched
image (left), and inpainted image (right)  255 
PSNR = 20 log10   (12)
 RMSE 
 

Figure 6. Image inpainting: original image (left), and inpainted image (right)

578
PDE-Based Image Processing

Signal-to-noise ratio Average SNR=1/SI. (17)

 m n  Effective Number of Looks (ENL)



 ∑ ∑ I 2 (i, j ) 

SNR = 10 log10  m n i =1 j = 1 
 The number of looks (Dewaele, 1990) in an
 2
 ∑ ∑ (I (i, j ) − I (i, j ))  intensity image is a measure of the statistical
'

 i =1 j =1  fluctuations introduced by speckle resulting from


(13) the interference between randomly positioned
scatterer. Thus ENL gives essentially an idea about
Normalized mean square error the smoothness in the regions on the image that is
supposed to have a homogeneous appearance but
m n
are corrupted by noise. ENL is generally defined as
∑ ∑ (I (i, j ) − I (i, j ))
' 2

i =1 j =1
NMSE = (14)
m n
µt2
∑ ∑ I (i, j ) 2
ENL = (18)
i =1 j =1 σt2

For optimal performance, measured values where t denotes the target area or region of inter-
of MSE, RMSE and NMSE should be small and est, µt and σt are the pixel mean and standard
PSNR and SNR should be large. deviation of a target area of the image. In this
paper, target area is the whole image. A large
Speckle Index value of ENL reflects better quantitative perfor-
mance of the filter.
Since speckle noise is multiplicative in nature,
average contrast of an image may be treated as a Correlation Parameter (CP)
measure of speckle removal. Speckle index (SI)
is defined as (Dewaele, 1990): CP (Salinas et al., 2007) is a qualitative measure
for edge preservation. If one is interested in sup-
var(I ) pressing speckle noise while at the same time
SI = , (15)
E (I ) preserving the edges of the original image then
this parameter can be used. Therefore, to evalu-
ate the performance of the edge preservation or
and its discrete version for an image reads
sharpness, the correlation parameter is defined
as follows
1 m n
σ(i, j )
SI =
mn
∑ ∑ µ(i, j ) , (16)

i =1 j =1
m n − ^ ^

∑ ∑ (∆I − ∆ I ) × (∆ I − ∆ I )
where m×n is the size of the image, μ is the mean CP = i =1 j =1


m n − m n ^ ^
and σ is the standard deviation.
The speckle index can be regarded as an aver-
∑ ∑ (∆I − ∆ I ) ×∑ ∑ (∆ I − ∆ I )
2 2

i =1 j =1 i =1 j =1

age reciprocal signal-to noise ratio (SNR) with (19)


the signal being the mean value and noise being
the standard deviation.

579
PDE-Based Image Processing

^
where ∆I and ∆ I are high pass filtered versions m n
1
^
of original image I and filtered image I obtained
MSSIM (X ,Y ) =
mn
∑ ∑ SSIM (X ij
,Yij ) .
i =1 j =1
via a 3x3 pixel standard approximation of the (22)

− ^
Laplacian operator. The ∆ I and ∆ I are the The MSSIM value should be closer to unity
^
for optimal measure of similarity.
mean values of I and I , respectively. The cor-
relation parameter should be closer to unity for
Histogram Analysis
an optimal effect of edge preservation.
Though the MSE analysis quantitatively measures
Structure Similarity
the average difference between the original im-
Index Map (SSIM)
age and the reconstructed image, it is often as
important and desirable to know how the errors
SSIM (Wang et al., 2004) is used to compare
are distributed in the image. A histogram analysis
luminance, contrast and structure of two different
can be carried out to quantitatively investigate the
images. It can be treated as a similarity measure
performance of histogram distribution. Based on
of two different images. The SSIM of two images
the histogram distribution of the zoomed/restored
X and Y can be defined as
image, one can evaluate the performance of the
proposed algorithms.
SSIM (X ,Y ) =
In a blind image restoration problem a priori
α β
l (X ,Y ) . c(X ,Y ) . s(X ,Y )
γ, (20)
information about the degradation process of an
     
image is not known. Only noisy image is available
as the input and it has to be restored. Since, we
where α > 0 , β > 0 and γ > 0 are parameters don’t have the original image therefore mean-
and can be used to adjust the relative importance square-error (MSE) and peak signal-to-noise
of three components. SSIM of two images X and ratio (PSNR) between original (without noise)
Y can be calculated as and restored image can’t be calculated and these
criterions can’t be used for performance evalu-
(2µx µy + C 1 ) × (2σxy + C 2 ) ation of the proposed algorithm. Therefore, the
SSIM (X ,Y ) = ,
(µ + µy2 + C 1 ) × (σx2 + σy2 + C 2 )
2
x performance of the blind restoration algorithms
(21) can either be evaluated subjectively by a human
observer or by some qualitative procedure that
where µi (i = X or Y) is the mean intensity, σi can be used to compare the various restoration
algorithms from the information available from
(i=X or Y) is the standard deviation, σxy = σx .σy
noisy image only. In this thesis, the performance
and Ci (i=1 or 2) is the constant to avoid instabil-
measurement metrics used are blurred signal-to-
ity when µx2 + µy2 is very close to zero and is noise ratio (BSNR) and average signal-to-noise
defined as C i = (ki L)2 in which ki << 1 and L ratio (SNR). In addition to BSNR and average
is the dynamic range of pixel values e.g. L=255 SNR, the histogram analysis technique is also
for 8-bit gray scale image. In order to have an used to compare the performances of the various
overall quality measurement of the entire image, schemes.
mean SSIM is defined as

580
PDE-Based Image Processing

Average signal-to-noise ratio (SAR) images, digital holographic images, optical


coherence tomography (OCT) images etc. Speckle
mean(I ) µ noise in 2D B-scan ultrasound images follow
Avg.SNR = = (23)
std .deviation(I ) Rayleigh’s probability distribution; speckle noise
σn2
present in SAR imagery follows k-distribution
and noise present in magnetic resonance imaging
A higher value of avg. SNR indicates less (MRI) follow Rician distribution. Depending upon
noise i.e. better restoration. Average SNR is also the type of noise a specific noise removal filter
related to speckle index (SI) which is a measure is required for restoration of a particular image.
of speckle noise. It is related as follows: Restoration of a digital image from its noisy
version is also referred to as regularization or
1 smoothing. In image regularization or smoothing,
Avg.SNR = (24)
SI we simplify the image data in a way that only
interesting features are preserved. The regulariza-
Blurred signal –to-noise ratio (bsnr) ∂I
tion term R in PDE formulations like =R
∂t
 1  introduces additional notion of scale space i.e.

 (M × N ) ∑ ∑ [I orignoisy − I restored ]2 
 the data are iteratively regularized or smoothed
BSNR = 10 log10  
 σ 2  and a continuous sequence of smoother images
 n 



 is generated as time t goes by. The regularization
(25) PDEs may be seen as nonlinear filters that sim-
plify the image little by little and minimize the
where M×N is the size of image, Iorignoisy is the image variations. The regularization term consti-
original noisy image; Irestored is the restored filtered tutes the key elements for solving ill-posed com-
image and σn2 is the variance of noise in original puter vision problems such as restoration, seg-
noisy image (Jain, 2006). A low value of BSNR mentation, registration, surface reconstruction
indicates less blurring or less noise or better res- etc. Specifically, the de-noising algorithms are
toration. usually based on a regularization term R coupled
with a data attachment term (I noisy-I) which is also
known as fidelity term and its corresponding PDE
IMAGE RESTORATION USING is
PDES FORMALISM
∂I
= R + (I noisy − I ) (26)
In this section, 2D image restoration problem ∂t
is addressed which can be generalized to 3D
restoration. The noise present in an image may Figure 7 shows the examples of various
be additive, multiplicative or may be distributed noise patterns present in an image which may
according to a specific probability distribution have specific noise statistics. Figure 8 shows the
function (pdf) depending on the application. For histogram plots of various noise patterns in an
example, additive noise in an image normally image. Histogram plot shows the noise variations
follows the Gaussian distribution. Speckle noise present in an image.
which is multiplicative in nature may be present
in ultrasound images, synthetic aperture radar

581
PDE-Based Image Processing

Figure 7. Various noise patterns present in an image

Figure 8. Histogram plots of noise patterns present in an image

582
PDE-Based Image Processing

ADDITIVE NOISE REMOVAL FROM Koenderink (1984) has shown that the solution
DIGITAL IMAGES of Equation (30) at a particular time t is the con-
volution of the original image Inoisy with a normal-
In PDE based noise removal techniques (Romeny, ized 2D Gaussian kernelGσ with variance
1994; Caselles et al., 1998), suppose I is a 2D
σ = 2t ,
scalar noisy image that we want to restore and
the noise can be considered as high frequency
It (x , y ) = ∫∫ I noisy (x − u, y − v ) * Gσ (u, v )dudv,
variations σ with low amplitude, added to the
pixels of the regular image (31)

I noisy = I regular + σ . (27) 1  x 2 + y 2 


  and σ = 2t
withGσ = exp − 
2πσ 2  2σ 2
 
To regularize Inoisy a common idea is to minimize which indicates that regularization is linear based
its variations estimated by gradient norm of image: on convolution. Here the image is blurred little
by little in an isotropic way during the PDE evo-
lution.
|| ∇I ||= (I x 2 + I y 2 ) . (28)
Some PDE-Based Image Restoration
Then the corresponding variational problem Models for Additive Noise
is the minimization of energy functional
Total Variation (TV) Approach
∫ || ∇I || dΩ . (29)
2
min E (I ) =
I :Ω→R
Ω Rudin et al. (1992) proposed an algorithm for
noise removal based on the minimization of the
The necessary condition for minimizing the total first variation (TV) of image I(x, y, t) given
energy functional E(I) described by Equation (29) by following energy functional:
can be obtained using Euler-Lagrange minimiza-
tion that results in following heat equation E = arg ∫ I x2 + I y2 dxdy = arg ∫ ∇I dxdy.
min image min image

(32)
∂I
= c∇2I = c(I xx + I yy ) , (30a)
∂t The minimization is performed under certain
constraints and boundary conditions (zero flow
I = I noisy with initial condition as the on the boundary). The constraints they employ
(t =0)
are zero mean and given variance of the noise.
observed noisy image given as:, (30b)
The Euler-Lagrange minimization of above
energy functional leads to following total variation
where ∇2I is Laplacian of image I and c is the (TV) based restoration model reads:
diffusion constant and I (x , y, t ) = I (x , y ) .This
equation describes the isotropic diffusion process.  
The basic disadvantage of the isotropic diffusion ∂I  ∇I 
= ∇.  , (33)
is that in addition to noise removal it may also ∂t  ∇I 
 
blur the edges and fine structures present in the
image after certain iterations.
with initial condition I (x , y; 0) = I noisy .

583
PDE-Based Image Processing

Anisotropic Diffusion Based Model that differentiates homogeneous area and regions
of contours and edges. The value of conductivity
Perona and Malik (1990) proposed a nonlinear coefficient ranges in between 20-50.
diffusion method to avoid blurring and localiza-
tion problem of linear diffusion filtering which is Fourth Order PDE-Based Model
termed as anisotropic diffusion. Anisotropic dif-
fusion is the opposite of isotropic i.e. to designate The second order PDEs such as anisotropic dif-
a regularization process that does not smooth the fusion model (Perona, & Malik, 1990) has side
image with the same weight in all the spatial direc- effects of producing the blocky effects during the
tions.This achieves both noise removal and edge evolution process (You, & Kaveh, 2000). The
enhancement through the use of a non-uniform fourth order PDEs avoids blocky effect while
diffusion which acts as unstable inverse diffusion achieving good trade off between noise removal
near edges and as linear heat equation like dif- and edge preservation. The fourth order PDE
fusion in homogeneous regions without edges. proposed by (You, & Kaveh, 2000) is derived
In paper (Perona, &Malik, 1990), authors have from a functional which is an increasing function
used the anisotropic diffusion process which is of the Laplacian of the image intensity function
the extension of isotropic diffusion process to i.e. ∇2I . The Laplacian of an image at a pixel is
remove additive noise from images. In anisotropic zero if the image is planar in its neighbourhood
diffusion based filter, the basic idea is that heat and these PDEs attempt to remove noise and
Equation (30) for linear diffusion can be written preserve edges by approximating an observed
in divergence form: image with a piecewise planar image. The energy
functional proposed by You and Kaveh (2000)
∂I   for additive noise reads
= ∇2I = div(gradI ) = ∇.∇I (34)
∂t
2

The introduction of a conductivity coeffi-


arg E (u ) =
min
∫ ∇2I d Ω . (37)

cient c in the above diffusion equation makes it


possible to make the diffusion adaptive to local The Euler-Lagrange (E-L) minimization of
image structure: above energy functional leads to following fourth
order PDE for edge enhancement and additive
∂I   noise removal from a digital image.
= ∇.c∇I = c∇2I + ∇c.∇I (35)
∂t
∂I
The two possible choices for diffusion coef- ∂t
(
= ∇2 C ( ∇2I )∇2I , ) (38)
ficient c are:
where the diffusion coefficient is defined as:
 || ∇I 1 ||
c1 = exp −  and c2 =
;
 k2 
|| ∇I ||2 1
1+ C ( ∇2I ) = . (39)
k2  ∇2I 
2

where k>0 (36)  


1 +  
 k 
Both expressions are equal up to first order
approximation and k is a fixed gradient threshold

584
PDE-Based Image Processing

Complex Diffusion Based Model I It = cI I Rxx + cR I Ixx , I I ,t =0 = 0, (43b)

In anisotropic diffusion based model (Perona,


where cR = cos θ and cI = sin θ .For small θ,
&Malik, 1990), if real time factor t is replaced by
complex time factor it and the diffusion coefficient relation I Rxx >> θI Ixx holds and second term in
c( ∇I tn ) by c(Im(I )) then it leads to following RHS of Equation (22a) can be omitted. Therefore,
under small θ approximation above equation re-
complex diffusion equation (Gilboa et al., 2004) duces to
originally proposed for image enhancement and
additive noise removal from digital images.
I Rt ≈ I Rxx (44a)

∂I
= div (c(Im(I ))∇I ) (40) I It ≈ θI Rxx + I Ixx (44b)
∂t

In linear complex diffusion based filter for Equation (44) describes the evolution of image
image enhancement and de-noising, the authors using complex diffusion process as proposed by
(Gilboa et al.,2004) proposed to replace the dif- Gilboa et al. (2004). The initial condition is the
fusion coefficient term in equation (40) with a noisy image and after certain number of iterations
complex diffusion coefficient c = exp(iθ) , utiliz- the de-noised image is evolved according to Equa-
ing the approximation cos θ = 1 + O(θ 2 ) and tion (44). From Equation (44) it can be observed
sin θ = θ +O(θ 3 ) and after introducing an op- that evolution of real part of the image is controlled
− by the linear forward diffusion, whereas evolution
erator H = c∆ , the equation (40) can be written of imaginary part of the image is controlled by

both the real and imaginary equations. Qualitative
as: I t = H I with initial condition I t=0 = I 0 .The

properties of edge detection i.e. second smoothed
solution of this PDE is given as I = exp(t H ) . derivative is described by the imaginary part of
The above assumed approximations generates the the image for small value of θ,whereas real values
following solution depicts the properties of ordinary Gaussian scale
-space. For large values of θ, the imaginary part
I (x , y, t ) ≈ exp(t ∆)(1 + iθt ∆)I 0 . (41) feeds back in to the real part creating the wave
like ringing effect which is an undesirable prop-
erty. Here, for experimentation purposes value of
Further separating the real and imaginary parts π
of the image, I = I R + iI I , and diffusion coef- θ is chosen to be .
30
ficient, c = cR + icI , the Equation (41) reads For nonlinear complex diffusion, the diffu-
sion coefficient is defined as follows (Gilboa et
I t = c∇2I = cI xx + cI yy (42) al., 2004):

e iθ
can be written in x-component form as follows: c(Im(I )) = . (45)
2
 Im(I )
1 +  
I Rt = cR I Rxx − cI I Ixx , I R,t =0 = I 0 , (43a)  k θ 

585
PDE-Based Image Processing

Here k is the edge threshold parameter. The is significantly better than currently known gra-
value of k ranges from 1 to 1.5 for digital images dient projections-based methods. The results are
having additive noise. The value of k is fine tuned applicable to both the anisotropic and isotropic
according to the application in hand. discretized TV functional. Initial numerical re-
sults demonstrate the viability and efficiency of
Other PDE-Based Models the proposed algorithms on image de-blurring
problems with box constraints. Image de-noising
Osher and Rudin (1990) proposed an algorithm methods based on gradient dependent regularizes
for image enhancement based on shock filters. In such as Rudin et al.’s total variation (TV) model
this case the image ϕ(x, y, t) evolves according to often suffer the staircase effect and the loss of fine
details. In order to overcome such drawbacks,
∂ϕ authors of the paper (Chen et al.,2009) presents
= − ∇ϕ F (L(ϕ)) , (46)
∂t an adaptive total variation method based on a new
edge indicator, named difference curvature, which
where function F(u) satisfies certain technical can effectively distinguish between edges and
conditions and L is a 2nd order nonlinear elliptic ramps. In another recent paper (Chang, & Yang,
operator. An image evolving according to above 2009) authors have proposed a Lattice Boltzmann
equation develops shocks where L=0. Main goal method for image de-noising.
of this method is to get as close as possible to the Some other PDE-based model includes: Im-
inverse heat equation. age Restoration from motion blurred image using
Alvarez et al. (1992) proposed an algorithm of PDEs formalism (Srivastava et al., 2009); PDE
image selective smoothing and edge detection. In based Unsharp Masking, Crispening and High
this case the image evolves according to Boost Filtering of Digital Images (Srivastava et
al., 2009); and Brownian Motion based image
modelling and its applications to image regular-
∂ϕ ∇ϕ
= g( G ∗ ∇ϕ ) ∇ϕ div( ), (47) ization (Srivastava et al., 2009).
∂t ∇ϕ
Discretization of Models for
where G is a smoothing kernel (e.g. Gaussian) Digital Implementations
and g () is a non increasing function which tends
to zero as t tends to infinity. For digital implementations the PDE based mod-
In paper (Beck, &Teboulle, 2009), authors els discussed in above section can be discretized
studied gradient-based schemes for image denois- using finite differences schemes (Press et al.,1992)
ing and de-blurring problems based on the dis- i.e. Finite Time Centered Scheme (FTCS), where
cretized total variation (TV) minimization model we consider equally spaced points along both the
with constraints. They derived a fast algorithm t-axes and x-axes and denote x j = x 0 + j ∆x ,
for the constrained TV-based image de-burring j=0,1,2,….,m and tn = t0 + n ∆t , n=0,1,2,….,n
problem. To achieve this task, they combined an where m×n is the size of image.
acceleration of the well known dual approach to The discretized form of anisotropic diffusion
the de-noising problem with a novel monotone based model, given by Equation (35), reads
version of a fast iterative shrinkage/ thresholding
algorithm (FISTA). The resulting gradient-based  
I n +1 (x , y ) = I n (x , y ) + ∆t.(∇.c∇I ) (48)
algorithm shares a remarkable simplicity together
with a proven global rate of convergence which

586
PDE-Based Image Processing

The second term in R.H.S. of Equation (48) RESULTS AND ANALYSIS


can be further discretized using centred difference
scheme as proposed in (Perona, & Malik, 1990). Case I: Anisotropic
The discretized form of the fourth order PDE Diffusion Based Model
based model, given by Equation (38), reads
The de-noising scheme defined by Equation (35)
n +1 n has been implemented in MATLAB. The both
I (x , y ) = I (x , y ) + ∆t .
(49) choices of diffusion coefficients (c1 and c2) as
2 n 2 n
[div(c( ∇ I (x , y ) )∇ I (x , y ))] defined in Equation (36) have been used. The
value of ∆t was set to 0.24 and value of conduc-
The discretized form of the nonlinear complex
tivity coefficient k was set to 50 and k may attain
diffusion based model given by Equations (40)
any value in between 20-100 (Perona and Malik,
and (45) reads
1990). The performance of the anisotropic diffu-
sion method for image de-noising defined by
I tn++11 (i, j ) = Equation (35) has been evaluated both qualita-

( )
I tn (i, j ) + ∆t.(∇. c(Im(I tn (i, j ))∇I nt (i, j ) tively and quantitatively in terms of various
(50) performance measurement metrics such as MSE,
RMSE, NMSE, SNR, PSNR and time for differ-
For the numerical scheme, given by above ent gray images of different resolutions for vary-
Equations (49-50) to be stable, the von Neumann ing amount of Gaussian noise variance. The mean
analysis (Press et al.,1992), shows that we require of noise is considered to be zero. From experi-
∆t mentation it has been tested that the scheme defined
2
< ¼. If the grid size is set to ∆x = 1 , then by Equation (35) converges to solution at 30-35
(∆x )
∆t < ¼ i.e. ∆t < 0.25. Therefore, the value of iterations for a noise variance of 0.02 for the
∆t is set to 0.24 for stability of Equations (48-50). sample image in consideration. For less variance
we obtain the processed de-noised image after
20-25 iterations. Figure 9 shows the visual results.
Performance comparisons are shown in Figures
10-11 for both choices of diffusion coefficients.
The execution time is less for option 2 of diffusion
coefficient (Figure 11). NMSE is less and SNR

Figure 9. Results of anisotropic diffusion after 35 iterations for noise variance 0.002 for sample image
lena.jpg (512x512)

587
PDE-Based Image Processing

Figure 10. Performance comparison of anisotropic diffusion: SNR vs noise variance; and NMSE vs noise
variance for both choices of conductivity coeffs (c1 and c2) for different iterations for the image lena.jpg

Figure 11. Performance comparison, Execution time vs noise variance for both choices of conductivity
coeffs (c1 and c2) for different iterations for the image lena.jpg

588
PDE-Based Image Processing

Sample MATLAB code: anisotropic diffusion based model

%K is the conductivity coefficient that varies between 20-100; I is the input


image ; and niter is the number of iterations required for the PDE to converge
to the solution.
function [Ianiso]=PMAnisoDiff (I,k,niter)
It=double(I);
dt=0.25;
t=1;
[x y]=size(I);
epsilon=0.0000000001; % epsilon is added to avoid divide by zero
% k=20-100
for t = 1:niter
[Ix,Iy]=gradient(It);
di=sqrt(Ix.^2+Iy.^2+epsilon);% norm (I)
c=1./(1.+(di./k).^2);
F=(c.*gradient(It));% F=delI/norm(DelI)
Fdiv=gradient(F);% divergence of F
It=It+dt.*Fdiv;
end
Ianiso=It;
End

is high for option 2 of diffusion coefficient (Fig- Case III: Comparative Study of PDE-
ure 10). Hence in terms of NMSE, SNR and ex- Based Filters for Additive Noise
ecution time option 2 of diffusion coefficient is Removal
performing better.
In this section, the comparative studies of four
Case II: Complex Diffusion Based PDE based filters in consideration are presented
Model in terms of MSE, PSNR, CP, and MSSIM (see
Table 1). The sample image is lena.jpg of size
See Figure 12 and Figure 13 for details about 200x200. Numbers of iterations for TV-based
Case II and the complex diffusion based model. model were chosen to 100 for acceptable quality

Figure 12. Original image (Left); Real part of the image (Middle); Imaginary part of the image show-
ing edges (Right)

589
PDE-Based Image Processing

Figure 13. Results of linear complex diffusion based filter, original noisy microscopic image (left),
restored real image (right)

of results. For anisotropic diffusion based PDE, MULTIPLICATIVE SPECKLE NOISE


fourth order PDE, and nonlinear complex diffu- REDUCTION FROM DIGITAL IMAGES
sion based PDE, the number of iterations were
set to 50 for the acceptable quality of images. Speckle Noise: Theoretical
For more than 50 iterations the performance Background
decreases. The value of diffusion coefficient k
for anisotropic diffusion based PDE, and fourth This section describes the general speckle noise
order PDE were set to 20 and that of nonlinear patterns present in various digital images such
complex diffusion was set to 1.5. Figures 14 and as ultrasound images and synthetic aperture ra-
15 give performance comparison of the schemes dar (SAR) images. The speckle noise occurs in
in consideration. coherent imaging of objects whenever surface
roughness of the image being imaged is of the

Table 1. Performance comparison of additive noise removal techniques (Sample image: lena.jpg, (200x200)

Method Noise Variance MSE PSNR [dB] CP MSSIM


Total Variation (TV) 0.0001 290.53 23.499 0.9132 0.5105
0.0025 290.11 23.517 0.91604 0.50941
0.0049 289.63 23.472 0.91668 0.510130.5106
0.0073 294.77 23.436 0.91552 0.51385
0.0097 291.08 23.491 0.91862
Anisotropic Diffusion 0.0001 339.66 22.82 0.93356 0.49915
0.0025 343 22.778 0.93562 0.49918
0.0049 339.82 22.818 0.93311 0.502
0.0073 346.65 22.732 0.93371 0.49723
0.0097 351.63 22.67 0.93374 0.49347
Fourth Order PDE 0.0001 286.74 23.556 0.93839 0.57417
0.0025 287.2 23.549 0.93959 0.57937
0.0049 291.24 23.488 0.94005 0.57761
0.0073 295.06 23.432 0.9399 0.5762
0.0097 300.73 23.349 0.93981 0.57484
Nonlinear Complex Diffusion 0.0001 260.21 23.978 0.935 0.58983
0.0025 259.4 23.991 0.93492 0.58979
0.0049 254.73 24.07 0.93719 0.59331
0.0073 260.33 23.976 0.93478 0.58851
0.0097 263.46 23.924 0.93466 0.58666

590
PDE-Based Image Processing

Figure 14. Performance comparison of additive noise removal filters; MSE(left), and PSNR (right)

Figure 15. Performance comparison of additive noise removal filters; CP(left), and MSSIM (right)

591
PDE-Based Image Processing

order of the wavelength of the incident radiation scatterers being imaged, and that are weighted in
(Jain, 2006). The presence of speckle noise in amplitude by the echogenicity map. This scattering
an imaging system reduces its resolution; espe- function convolved with a point spread function
cially for low contrast images and suppression to produce the RF echo data is shown in the lower
of speckle noise is an important consideration in left panel. The RF echo data is zero-mean and thus
the design of coherent imaging systems. For low does not show what is really of interest, i.e. a map
contrast images speckle noise is multiplicative in of local echogenicity, or local echo magnitude.
nature. Speckle noise may be present in ultrasound Envelope detection removes the carrier, producing
medical images, synthetic aperture radar (SAR) the desired image of echo magnitude in the lower
imagery and digital holographic images and in right panel. The differences between this image
other application images. An inherent character- and the original echogenicity map arise from
istic of coherent imaging, including ultrasound speckle noise. The statistics of fully-developed
imaging, is the presence of speckle noise which speckle (Anderson, 2009; Goodman, 1984) can
is a random, deterministic, interference pattern be described as follows: For the given stochastic
in an image formed with coherent radiation of a nature of speckle noise, this noise pattern can be
medium containing many sub-resolution scatter- described statistically. Each of the diffuse scat-
ers. The texture of the observed speckle pattern terers in the isochronous volume contributes a
does not correspond to underlying structure. The component to the echo signal in a sum is known
local brightness of the speckle pattern, however, as a random walk in the complex plane. If each
does reflect the local echogenicity of the underly- step in this walk is considered an independent
ing scatterers. Speckle has a negative impact on random variable, over many such walks we can
ultrasound imaging and other types of digital imag- apply the Central Limit Theorem to their sum.
ing e.g. SAR (Synthetic Aperture Radar) imaging Therefore, in fully developed speckle, this com-
and digital holography (Srivastava et al., 2009). plex radio-frequency echo signal from diffuse
scatterers alone has a zero mean, two-dimensional
Speckle Noise in Ultrasound Images Gaussian probability density function (PDF) in the
complex plane. The values of the magnitude of
Speckle noise pattern present in ultrasound im- r for many such scatterer populations follow the
ages are multiplicative in nature and distributed Rayleigh PDF. Envelope detection removes the
according to Rayleigh’s probability distribution phase component, creating a signal with Rayleigh
function. In ultrasound imaging, speckle leads amplitude PDF (Anderson, 2009; Goodman, 1984)
to reduction in contrast resolution which is re- which reads
sponsible for the poorer effective resolution of
ultrasound compared to x-ray and MRI. Speckle a a2
pA (a ) = exp(− ), a ≥ 0 (51)
is present in both RF data and envelope-detected σ2 2σ 2
data. Figure 14 conceptually demonstrates the
impact of speckle noise on information content
where a and σ 2 are amplitude and variance of the
(Anderson, 2009). The ultrasound image of a
back scatterer signal. Speckle brightness is
hypoechoic lesion of 5 mm diameter with -9 dB
greater if there are fewer, longer steps in the
contrast is considered. The echogenicity map
random walk than if there are many shorter steps.
corresponding to this object is shown in the top
This could be accomplished by improving the
left panel, opposite the corresponding scattering
spatial resolution of the system. If the scatterer
function in the top right panel. The scattering func-
tion represents the population of sub-resolution density is doubled, a 2 increase in brightness

592
PDE-Based Image Processing

results. When a coherent component is introduced where I0 is the observed SAR image, I is the
to the speckle, it adds a constant strong phasor to original image to be estimated or restored using
the diffuse scatterer’s echoes and shifts the mean some filtering technique and n is the multiplicative
of the complex echo signal away from the origin speckle noise. The image I can also be considered
in the complex plane. Due to this the Rayleigh as the radar scattering characteristics of a random
PDF changes into a Rician PDF. The Rician PDF ground object. The speckle noise in a SAR image
is defined by the following equation: follows the k-distribution (Iskander 1999) i.e.
radar clutter models are based on k-distribution.
a a2 + s2 as k-distribution which is also known as Rayleigh-
pA (a ) = exp(− )I 0 2 , a ≥ 0
σ 2
2σ 2
σ Gamma distribution and is widely used in optics
(52) and radar signal processing for modelling laser
speckles and sea echoes and is found to be use-
These PDFs are nonzero for a >= 0 only. The ful in describing the statistics of the echo. The
parameter s is the echo strength of the bright scat- statistics of a k-distributed random variable X are
terer, while σ is the standard deviation of the described by the following distribution function
complex Gaussian described above, i.e. both the (Iskander 1999; Iskander et al. 1999):
real part and the imaginary part have variances
of σ . I0 is the incomplete Bessel function of zero 2  x 
v +1
x 
order. The Rician PDF is parameterized by the FX (x ) = 1 −  
 K v +1   ,
a Γ(v + 1)  2a  a 
s
variable k, which is defined as . The Rician (55)
σ
PDF reduces to the Rayleigh PDF for the special
where x > 0 and v > -1.
case s = 0. The speckle noise in ultrasound imag-
The probability distribution function of the
ing is also approximated as multiplicative noise,
k-distribution is defined as:
given by following model:
v +1
g(x, y)= I(x, y∙ s(x, y), (53) 2  x  x 
fX (x ) =  
 K v   , (56)
a Γ(v + 1)  2a  a 
Here, g(x, y) is the recorded ultrasound image,
I(x, y) is the original ultrasound image and s(x,
where x > 0 and v > -1.
y) is the multiplicative speckle noise present in
Here Γ(.) is the gamma function, K v (.) is
ultrasound image.
the modified Bessel function of order v and a is
Speckle Noise Pattern a positive constant. The k-distribution is com-
in SAR Imagery pletely described by the shape parameter v and
the scale parameter a, which can be estimated
In synthetic aperture radar (SAR) images, the using method of moments or maximum likelihood
speckle noise is generally caused by the decline methods. If the observed image is I0 and the es-
of echo signals. The image observation model for timated or filtered image is I then the pdf given
SAR imagery is given by by Equation (56) reads

I0=I∙n, (54) v +1
2  I  I 
p(I / I ) = f (I ) = 
  Kv   .
0 I 0 a Γ(v + 1)  2a  a 
(57)

593
PDE-Based Image Processing

Since the acquisition noise on the SAR im- v(x , y ) = u(x , y )s(x , y ) . (60)
age is not Gaussian, a least square fit may not
the best choice for such an estimation process. The speckle noise model given by Equa-
Furthermore, independent pixel estimation does tion (60) is a multiplicative noise model. This
not reflect the spatial regularity of the diffusion multiplicative noise model can be converted to
function. Hence, it is proposed to tackle these additive noise by using homomorphic filtering
issues within a variational framework which is approach which is easier to implement. Other,
adapted to the noise distribution pattern in the effective filters may also be used for the reduction
image and is able to use valuable information of speckle noise.
given by the neighbour pixels. The speckle noise can be reduced by multi-
Similarly, speckle noise is also present in look processing, spatial filtering or homomorphic
digitally reconstructed holographic images which filtering (Jain, 2006). The multi-look processing
are multiplicative in nature and can be reduced is usually done during data acquisition stage and
by PDE-based image processing (Srivastava et. speckle reduction by spatial and homomorphic
al., 2009). filtering is performed on the image after its acqui-
sition. Irrespective of the methods used to reduce
the speckle noise from images, the ideal speckle
SPECKLE NOISE REDUCTION reduction method must preserve radiometric
TECHNIQUES information and the textural information i.e. the
edges between different areas and spatial signal
General Approaches variability. The spatial filters are of two types i.e.
adaptive and non-adaptive. Non-adaptive filters
In general the speckle noise (Jain, 2006) has take the parameters of the whole image signal into
complex amplitude given as consideration and leave out the local properties of
a(x , y ) = aR (x , y ) + jaI (x , y ) , where aR and aI the terrain backscatter or the nature of the sensor.
are zero mean, independent Gaussian random These kinds of filters are not appropriate for non-
variables for each (x, y) with some variance. The stationary scene signal. Fast Fourier Transform
intensity field of speckle noise is given as (FFT) is an example of such filters. The adaptive
filters accommodate changes in local properties
2 of the terrain backscatter as well as the nature of
s(x , y ) = a(x , y ) = a 2 + a 2 . (58)
R I the sensor. In adaptive filters, the speckle noise
is considered as being stationary but the changes
The general image observation model for in the mean backscatters due to changes in the
speckle noise reads type of target are taken into consideration. Adap-
tive filters reduce speckles while preserving the
v(x , y ) = u(x , y )s(x , y ) + η(x , y ) (59) edges and these filters modify the image based
on statistics extracted from the local environment
where v(x, y) is the observed noisy image, u(x, y) of each pixel. Adaptive filter varies the contrast
is the original image to be restored, s(x, y)is the stretch for each pixel depending upon the Digital
intensity of speckle noise and 𝜂(x, y)is detector Number (DN) values in the surrounding moving
noise which is additive in nature. Assuming de- kernel. A filter that adapts the stretch to the re-
tector noise to be zero, the general observation gion of interest produces a better enhancement.
model for speckle noise reads Examples of adaptive filters are: Mean, median,
Wiener filter (Jain, 2006), Lee filter (Lee, 1983),

594
PDE-Based Image Processing

Frost filter (Frost et al., 1982), Kuan filter (Kuan where mean = average of pixels in a moving
et al., 1987), Hybrid filter (Rajan et al., 2008) and window,
Speckle Reducing Anisotropic Diffusion (SRAD)
based filter (Yu, &Acton, 2002). In a recent paper var(x )
k= , (62)
(Juan et al., 2009), the authors investigate and (mean )2 σ 2 + var(x )
compile some of the techniques mostly used in
the smoothing or suppression of speckle noise in
ultrasound images. The basic concepts of these and
adaptive filters are as follows:
 σ + µ2 
var(x ) =  w 2 w 
 − µw .
2
(63)
Mean Filter  σ + 1 

The Mean Filter is a simple one and does not


remove the speckles but averages it into the data µw and σw are the mean and variances of pixels
and it is the least satisfactory method of speckle within chosen window. The Sigma filter is based
noise reduction as it results in loss of detail and on the probability of a Gaussian distribution. It
resolution. However, it can be used for applica- is assumed that 95.5% of random samples are
tions where resolution is not the first concern. within a two standard deviation range. This noise
suppression filter replaces the pixel of interest
Median Filter with the average of all DN values within the mov-
ing kernel that fall within the designated range.
The Median filter is also a simple one and removes
pulse or spike noises. Pulse functions of less than Frost Filter
one-half of the moving kernel width are suppressed
or eliminated but step functions or ramp functions The Frost filter (Frost et al., 1982) replaces the
are retained. pixel of interest with a weighted sum of the values
within the n × n moving kernel. The weighting
Lee-Sigma and Lee Filters factors decrease with distance from the pixel of
interest. The weighting factors increase for the
The Lee-Sigma and Lee filters (Lee, 1981; Lee, central pixels as variance within the kernel in-
1983) utilize the statistical distribution of the DN creases. This filter assumes multiplicative noise
values within the moving kernel to estimate the and stationary noise statistics and follows the
value of the pixel of interest. These two filters following formula:
assume a Gaussian distribution for the noise in
the image data. The Lee filter is based on the as- −α t
DN = ∑ k αe , (64)
sumption that the mean and variance of the pixel of n×n
interest is equal to the local mean and variance of
all pixels within the user-selected moving kernel.
4 σ2
The scheme for computing digital number output where α = ( )() . (65)
(DNout) is as follows: nσ2 I 2
Here k = normalization constant, I = local
(DNout)=[mean]+k[(DNin−mean] (61) mean, σ =local variance, σ = image coefficient
of variation value, t = X − X 0 + Y −Y0 , and
n = moving kernel size.

595
PDE-Based Image Processing

Kuan Filter the correct value for NumberofLooks should be


the effective number of looks of the radar image.
Kuan filter (Kuan et al., 1987) first transforms the It should be close to the actual number of looks,
multiplicative noise model into a signal-dependent but may be different if the image has undergone
additive noise model. Then the minimum mean re-sampling. The user may experimentally adjust
square error criterion is applied to the model. The the NumberofLooks value so as to control the ef-
resulting filter has the same form as the Lee filter fect of the filter. A smaller NumberofLooks value
but with a different weighting function. Because leads to more smoothing; a larger NumberofLooks
Kuan filter made no approximation to the original value preserves more image features.
model, it can be considered to be superior to the
Lee filter. PDE Based Filters
The resulting grey-level value R for the
smoothed pixel is: In recent years, several PDE based methods have
been developed for removal of additive noise from
R = I c *W + I m * (1 −W ) , (66) images (Perona, &Malik, 1990; Gilboa et al.,
2004; You, & Kaveh, 2004) which can be used by
homomorphic filters to reduce speckle noise. The
where:
basic idea behind PDE based noise removal based
on energy minimization technique as discussed in
W = (1 − C u2 C i2 ) / (1 + C u2 )
section image restoration using PDEs formalism.

1
Speckle Reducing Anisotropic
Cu = Diffusion (SRAD) Filter
NumberofLooks
In this paper (Yu, &Acton, 2002), the authors
S provides the derivation of speckle reducing
Ci = .
Im anisotropic diffusion (SRAD), a diffusion method
tailored to ultrasonic and radar imaging applica-
tions. SRAD is the edge-sensitive diffusion for
Ic= center pixel in filter window, Im = mean
speckled images, in the same way that conven-
value of intensity within window, and S = standard
tional anisotropic diffusion is the edge-sensitive
deviation of intensity within window.
diffusion for images corrupted with additive noise.
The Kuan filter is used primarily to filter
At first authors had shown that the Lee and Frost
speckled radar data. It is designed to smooth out
filters can be cast as partial differential equations,
noise while retaining edges or shape features in
and then SRAD filter is derived by allowing edge-
the image. Different filter sizes will greatly affect
sensitive anisotropic diffusion within this context.
the quality of processed images. If the filter is too
SRAD exploits the instantaneous coefficient of
small, the noise filtering algorithm is not effective.
variation, same as the Lee and Frost filters utilize
If the filter is too large, subtle details of the image
the coefficient of variation in adaptive filtering.
will be lost in the filtering process. A 7x7 filter
The instantaneous coefficient of variation is a
usually gives the best results. The NumberofLooks
function of the local gradient magnitude and
parameter is used to estimate noise variance and
Laplacian operators.
it effectively controls the amount of smoothing
applied to the image by the filter. Theoretically,

596
PDE-Based Image Processing

PDE-Based Homomorphic Filtering 3. Finally, the restored holographic image,


Approach for Speckle Reduction Irestored, can be obtained by taking the expo-
nentiation of output obtained in step ii.
Based on the discussions in section Speckle
noise: Theoretical background, it can be said that I restored = exp(I (x , y )). (69)
the speckle noise is multiplicative in nature for
ultrasound images and SAR images. Assuming
detector noise to be zero, the general observation
EXAMPLES
model for speckle noise reads
Non-Linear Complex Diffusion Based
v(x, y)=u(x, y)s(x, y). (67)
Homomorphic Filter for Speckle
Since, the direct reduction of multiplicative
Reduction from Digital Images
noise is a difficult task therefore the multiplica-
In this section (Srivastava et al., 2009), the nonlin-
tive noise is first converted in to additive noise
ear complex diffusion based homomorphic filter
by taking the logarithm of equation (67) i.e. by
is proposed to be used for speckle reduction from
applying homomorphic transform and reduction
ultrasound and synthetic aperture radar (SAR) im-
of this additive noise by some available filters and
ages. The performance of the nonlinear complex
finally taking the exponentiation of filter output
diffusion based homomorphic filter is evaluated
to produce the speckle reduced image.
in terms of MSE and PSNR and a comparative
The homomorphic filtering approach for
study of this scheme with other standard speckle
speckle reduction can be described as (Srivastava
reduction techniques such as Lee filter, Frost fil-
et al., 2009):
ter, Kuan filter and SRAD filter is presented. The
obtained results show that the nonlinear complex
1. Apply the logarithmic transform on equa-
diffusion based homomorphic filter outperforms
tion(67) to convert the multiplicative noise
all schemes in consideration.
in to additive one which reads

log v(x , y ) = log u(x , y ) + log s(x , y ) Methods and Models


(68)
⇒ w(x , y ) = I (x , y ) + ηs (x , y )
As discussed earlier, the nonlinear complex dif-
fusion based PDE reads
Where w(x , y ) = log v(x , y ) is the observed
hologram image in log domain, ∂I → →
I (x , y ) = log u(x , y ) is the noiseless image = ∇ .(D(Im(I ) ∇ I )) . (70)
∂t
in log domain that is to be recovered and
ηs (x , y ) = log s(x , y ) the amount the of the
speckle noise which is now an additive noise Expressing the above PDE in its tangent and
and is to be minimized. normal forms, the presented model reads
2. In this step, an additive noise removal filter
(e.g. Wiener filter, median filter, PDE based ∂I
= ∇2I (ξ, ξ ) + D(Im(I ))∇2I (η, η) (71a)
diffusion filters and other filters) is applied ∂t
to remove or minimize the additive noise
ηs (x , y ) .

597
PDE-Based Image Processing

I (x , y, 0) = I 0 (Initial Condition) (71b) sion in smooth region of image. The function D(.)
is chosen with values between 0 an 1 according
to Equation (73). When D(.) = 1, the two diffusion
The first term in equation (71a) diffuses the
terms in Equation (71a) have equal weights and
image, I, only in the orthogonal direction of the
the Equation (71a) converts to a 2D isotropic heat
gradient and the second term diffuses the image,
equation i.e. Laplacian. When D(.) = 0, only the
I, in the direction of gradient. The terms ξ and
diffusion in the orthogonal direction of the gradi-
η are unit vectors respectively orthogonal and in
ent operates. In homogeneous zones of the image,
the direction of the gradient vector ∇I of the
g (.) will have values near 1.
image I (t, ) .These unit vectors are defined
For simplicity of representation, suppose
(Perona, 1998) as follows
I ξξ = ∇2I (ξ, ξ )represents the diffusion along the
normal of the edge and I ηη = ∇2I (η, η) represents
∇I ⊥ I y , −I x
ξ= = and the diffusion along the tangent of the edge. The
∇I I x2 + I y2 diffusion along the normal diffuses across the
edges and diffusion along the tangent continues
∇I I x , Iy
η= = (72) along the edges. The blurring of edges of image
∇I I x2 + I y2 or area of high frequency content is mainly due
to diffusion along the normal. To control the dif-
The term D(Im(I ) is the complex diffusion
fusion the diffusion across the edges, I ηη , is
coefficient and Im(I ) is the imaginary part of the
image. The diffusion coefficient D(Im(I ) is de- lowered in comparison to the diffusion along the
fined as follows (Gilboa et al.,2004) tangent and this is achieved by varying the values
of the diffusion coefficient D(Im(I )) according
iθ to Equation (73). Further, I ξξ and I ηη are defined
D(Im(I )) = e (73) (Perona, 1998) as follows-
 Im(I )

1 +  
 λθ  I xx I x2 − 2 ∗ I xy I x I y + I yy I y2
I = (74)
ξξ
I x2 + I y2
Here λ is a threshold parameter and θ is the
small phase angle(θ << 1) . The imaginary value
of image I i.e. Im(I ) divided by θ is used for I xx I x2 + 2 ∗ I xy I x I y + I yy I y2
controlling the diffusion process. For small t, this I ηη = .
term vanishes and allows stronger diffusion to I x2 + I y2
reduce the noise; with time its influence increas- (75)
es and preserves the ramp features of the image.
The function D(Im(I ) , known as conductivity Discretization of the Model
coefficient, is a function of local image differen-
tial structure that depends on local partial deriva- For digital implementations, the Equation (71) can
tives (Gilboa et al., 2004). The conductivity coef- be discretized using forward time centered scheme
ficient controls the diffusion process within the (FTCS) and centered difference scheme. Here, we
image that can be varied according to image consider equally spaced points along both the t-
structure i.e lower the diffusion where edges oc- axes and x-axes and denote x j = x 0 + j ∆x ,
curs to preserve the edges and enhance the diffu- j=0,1,2,….,m and tn = t0 + n ∆t , n=0,1,2,….,n

598
PDE-Based Image Processing

where m×n is the size of image. Using the FTCS, win, j+1 = win, j + ∆t.[w + D(Im(w ))w ηη ] ,
ξξ
the discrete form of Equation (70) reads

I in, j+1 = I in, j + ∆t.[∇2I (ξ, ξ ) + D(Im(I ))∇2I (η, η)] with w(x , y, 0) = w
End
3. Finally the speckle reduced image, u, is
i.e. obtained by exponentiation of the output w
of step (2) i.e. u = ew .
I in, j+1 = I in, j + ∆t.[I + D(Im(I ))I ηη ] .
ξξ
(76a)
RESULTS AND PERFORMANCE
COMPARISON
I (x , y, 0) = I 0 (Initial Condition) (76b)
Test Case I: Speckle Reduction
For discretizing I ξξ and I ηη as defined in Equa- from Ultrasound Images
tion (74-75), centred difference scheme have been
The first category of images considered for speckle
used. For the numerical scheme given by Equation
noise reduction is ultrasound images. The proposed
(75) to be stable, the von Neumann analysis (Press
algorithm was applied on several ultrasound
∆t
et al.,1992) shows that we require < ¼. images and visual results for one sample image
(∆x )2 Gyn_normal_intrauterine_IUD.jpg (465x589)
If the grid size is set to ∆x = 1 , then ∆t < ¼ i.e. for varying amount of speckle noise variance
∆t < 0.25. Hence, ∆t = 0 − 0.25 for stability e.g. 0.001, 0.002, 0.004, 0.01, 0.02 and 0.04 are
of Equation (76) during implementation. The shown in Figure15 for speckle noise variance
basis for selection of the stopping time is that 0.04. The performance of the proposed scheme is
running the diffusion equation on an image I at also compared with other state of the arts methods
scale t is equivalent to convolution with Gaussian available in literature for speckle reduction as
kernel of standard deviation 2t . shown in Table 2 for test image Gyn_normal_in-
The final proposed algorithm reads as follows. trauterine_IUD.jpg. Figure16 shows performance
comparison of various speckle reduction methods
Algorithm- Complex Diffusion with the proposed scheme for the sample image
Based Homomorphic Filtering Gyn_normal_intrauterine_IUD.jpg (465x589) in
terms of speckle variance Vs RMSE and speckle
1. In first step, logarithm transformation is ap- variance Vs PSNR and respectively. From Fig-
plied on input given by equation (67) which ure16 it can be observed that the proposed homo-
reads morphic complex diffusion process for speckle
reduction is associated with minimum RMSE
log v(x , y ) = log u(x , y ) + log s(x , y ) and maximum PSNR in comparison to other

⇒ w(x , y ) = I (x , y ) + η(x , y ) techniques reported in literature for the sample
ultrasound image taken in to consideration and
the performance trend remained same for other
2. In second step, input image in log domain
images too. Therefore, the results show that the
is processed using complex diffusion based
proposed complex diffusion based homomorphic
filter to remove the additive noise. For niter
= 1 to n

599
PDE-Based Image Processing

Table 2. Performance comparison of speckle reduction methods (Sample ultrasound image: Gyn_nor-
mal_intrauterine_IUD.jpg, (465x589)

Method Speckle Variance RMSE PSNR [dB]


Lee Filter 0.001 4.0559 35.9691
0.002 4.3344 35.3922
0.004 4.8315 34.4491
0.01 5.9573 32.6298
0.02 7.0466 31.1712
0.04 7.9918 30.0779
Frost Filter 0.001 6.1928 32.2930
0.002 6.8972 31.3573
0.004 8.1355 29.9231
0.01 11.0139 27.2920
0.02 14.6570 24.8099
0.04 19.8619 22.1704
Kuan Filter 0.001 4.0580 35.9646
0.002 4.3325 35.3961
0.004 4.8404 34.4331
0.01 5.9526 32.6367
0.02 7.0544 31.1616
0.04 7.9911 30.0787
Hybrid Filter 0.001 7.0480 31.1695
(After 02 iterations) 0.002 7.5056 30.6231
0.004 8.3987 29.6465
0.01 10.5484 27.6671
0.02 17.4995 23.2703
0.04 17.5058 23.2672
SRAD (After 05 iterations) 0.001 9.0476 29.0001
(Anisotropic diffusion based 0.002 9.0810 28.9681
filter) 0.004 11.6356 26.8151
0.01 10.9302 27.3583
0.02 12.9925 25.8570
0.04 16.0786 24.0059
Proposed Method (After 10 0.001 3.3114 37.7305
iterations) 0.002 3.3804 37.5516
0.004 3.5280 37.1801
0.01 3.8814 36.3509
0.02 4.2837 35.4945
0.04 4.9452 34.2471

filtering scheme performs better than other exist- 0.004, 0.01, 0.02 and 0.04 are presented here.
ing scheme for ultrasound images. Figure17 shows the visual results for the sample
image for speckle noise variance 0.04. Figure18
Test Case II: Speckle Reduction from show performance comparison of various speckle
SAR Images reduction methods with the proposed scheme in
terms of speckle variance Vs RMSE and speckle
The second category of images considered for variance Vs PSNR, respectively. From Figure18 it
speckle noise reduction is synthetic aperture radar can be observed that the proposed homomorphic
(SAR) images. The proposed algorithm was ap- complex diffusion process for speckle reduction
plied on several SAR images and results for one is associated with minimum RMSE and maxi-
sample image, SAR.gif (516x416) for varying mum PSNR in comparison to other techniques in
amount of speckle noise variance e.g. 0.001, 0.002, consideration for the sample SAR image. Also,

600
PDE-Based Image Processing

Figure 16. Impact of speckle noise on information content for ultrasound images

this performance trend remained same for other both category of images. This method also per-
test images. Therefore, the results show that the forms better for digital holographic images (Sriv-
proposed complex diffusion based homomorphic astava et al., 2009).
filtering scheme performs better than other exist-
ing scheme for SAR images too.
Therefore, from above results and discussions FUTURE RESEARCH DIRECTIONS
it can be concluded that the nonlinear complex
diffusion based filter performs better in com- The various possible areas where PDE-based
parison to other speckle reduction methods for image modelling techniques can be applied

Figure 17. Original ultrasound image Gyn_normal_intrauterine_IUD.jpg (465x589) (top-left); Speckled


image with variance 0.04 (top-right); De-speckled image using proposed scheme (bottom)

601
PDE-Based Image Processing

Figure 18. Performance comparison of speckle reduction methods: Speckle Variance Vs RMSE (left),
Speckle Variance Vs PSNR(right) for sample ultrasound image: Gyn_normal_intrauterine_IUD.jpg,
465x589

include: Scale-space theory, image restoration, ematical morphology, object detection, optical
image sharpening, contrast enhancement, image flow, image compression, shape from shading, 3D
analysis and feature extraction, segmentation, reconstruction of views from 2D images, pattern
video processing, shape analysis, axiomatic recognition and vision related applications.
and theoretical approaches to image analysis, PDE-based image modeling techniques can
variational and PDE’s-from-energy approaches play an important role in analyzing the images
to image processing, curve and surface evolution, obtained from various application areas such as:
PDE’s in hybrid systems, comparison of models,
implementations and numerical analysis, math-

Figure 19. Original SAR image (Courtesy ESA-ESRIN), SAR.gif (top-left); Speckled image with variance
0.04 (top-right); De-speckled image using proposed complex diffusion based homomorphic filter (bottom)

602
PDE-Based Image Processing

Figure 20. Performance comparison of speckle reduction methods: Speckle Variance Vs RMSE (left),
Speckle Variance Vs PSNR (right) for sample image: SAR.gif, 516x416

Medical Imaging • Biometrics: Some open problems in this


domain include development of efficient
The efficient and accurate PDE-based image PDE-based models for segmentation, fin-
processing tools can be developed which can aid ger prints enhancement & restoration, fea-
in to accurate diagnosis of the disease. The some ture extraction, face detection, identifica-
problems of medical imaging that needs further tion etc.
attention include: • Optical Information Processing:
Restoration of optical images, speckle re-
• Medical image enhancement and duction and restoration of 3D holograms
de-noising and image processing of microscopic
• Rician noise removal from MRI images images.
• Speckle reduction from optical coherence • Other application areas: where the PDE-
tomography (OCT) images based image modeling can be applied are:
• Segmentation of an object of interest e.g. ◦◦ Biosignal processing (ECG etc)
Segmentation for Alzheimer’s disease de- ◦◦ Confocal Microscopy
tection, segmentation of Iris, segmentation ◦◦ Seismic analysis
of tissue. ◦◦ Geology and weather monitoring and
• Medical image registration etc. control
• Synthetic Aperture Radar (SAR) ◦◦ Robotic vision and many others.
Imaging and Remote Sensing: Some
open problems in this domain include de-
velopment of efficient PDE-based models
for speckle reduction & segmentation, en-
hancement, restoration etc.

603
PDE-Based Image Processing

CONCLUSION models in image processing and computer vision


which may form the basis for further research in
In this chapter, partial differential equation this area for other image processing tasks such as
(PDE)-based approaches for image modelling and image registration, shape from shading and 3D
processing was discussed for image restoration surface reconstruction.
task. The PDE-based approaches can be used in
various image processing and computer vision
applications such as restoration, segmentation, REFERENCES
shape from shading, inpainting, registration, 3D
reconstruction of surfaces from multiple views and Alvarez, L., Guichard, F., Lions, P. L., & Morel,
many more. This chapter is oriented towards the J. M. (1992). Image selective smoothing and
image restoration problem which is an important edge detection by nonlinear diffusion. SIAM
pre-processing task for many applications such Journal on Numerical Analysis, 29, 845–866.
as 3-D surface geometry and reconstruction and doi:10.1137/0729052
many more. The introduction part, discussed the Alvarez, L., Guichard, F., Lions, P. L., & Morel,
basic concepts of image processing and various J. M. (1993). Axioms and fundamental equations
image modelling techniques. General principles of image processing. Archives of Rational Me-
of PDE based image processing and concepts chanics, 23, 199–257. doi:10.1007/BF00375127
of curve and surface evolution were discussed.
Advantages and application areas of PDE-based Anderson, M. E., & Trahey, G. E. (2009). A semi-
image processing were presented. The various nar on k-space applied to medical ultrasound.
performance measurement metrics used for Retrieved June 10, 2009, from http://dukemil.
comparison purposes were also given. Further, egr.duke.edu/Ultrasound/k-space
various methods and techniques for removal and
Barbara, Z., & Jan, F. (2003). Image registra-
reduction of additive and multiplicative noises
tion methods: A survey. Image and Vision
were discussed. For additive noise removal vari-
Computing, 21, 977–1000. doi:10.1016/S0262-
ous popular PDE-based filters such as TV-based
8856(03)00137-9
approach, anisotropic diffusion based approach,
Fourth order PDEs, and complex diffusion based Beck, A., & Teboulle, M. (2009). Fast gradient-
approach were discussed. The performances of based algorithms for constrained total variation
some filters were also evaluated. The complex image de-noising and de-blurring problems.
diffusion based filter is considered to be more IEEE Transactions on Image Processing, 18(11),
efficient filter in comparison to other filters in 2419–2434. doi:10.1109/TIP.2009.2028250
consideration. For multiplicative or speckle noise,
Brown, L. G. (1992). A survey of image registra-
the speckle noise pattern in various imaging
tion techniques. [CSUR]. ACM Computing Sur-
modalities such as ultrasound imaging and SAR
veys, 24(4), 325–376. doi:10.1145/146370.146374
imaging are discussed. Further, the various meth-
ods for speckle reduction were also discussed and Caselles, V., Catte, F., Coll, T., & Dibos, F. (1997).
finally a comparative analysis of various methods A geometric model for active contours. Interna-
were presented for two cases which concludes tional Journal of Computer Vision, 22, 61–69.
that complex diffusion based homomorphic filter doi:10.1023/A:1007979827043
performs better in comparison to other methods
in consideration. The schemes discussed in this
chapter justify the applicability of the PDE-based

604
PDE-Based Image Processing

Caselles, V., Morel, J., & Sapiro, G. (1998). Intro- Gonzalez, R. C., & Wintz, P. (1987). Digital image
duction to the special issue on partial differential processing (2nd ed.). New York, NY: Academic
equations and geometry driven diffusions in image Press.
processing. IEEE Transactions on Image Process-
Goodman, J. W. (1984). Statistical properties of
ing, 7(3), 269–273. doi:10.1109/TIP.1998.661176
laser speckle patterns. In Dainty, J. C. (Ed.), Laser
Chambolle, A. (1994). Partial differential equa- speckle and related phenomena (pp. 9–75). Berlin,
tions and image processing. IEEE International Germany: Springer-Verlag.
Conference on Image Processing, Austin, TX.
Iskander, D. R. (1999). Estimation of the param-
Chan, T. F., & Shen, J. (2005). Variational image eters of the k-distribution using higher order and
inpainting. Communications on Pure and Ap- fractional moments. IEEE Transactions on Aero-
plied Mathematics, 58(5), 579–619. doi:10.1002/ space and Electronic Systems, 35(4), 1453–1457.
cpa.20075 doi:10.1109/7.805463
Chen, Q., Montesinos, P., Sun, Q. S., Heng, P. Iskander, D. R., Zoubir, A. M., & Boashash, B.
A., & Xia, D. S. (2009). Adaptive total variation (1999). A method for estimating the parameters
denoising based on difference curvature. Image of k-distribution. IEEE Transactions on Signal
and Vision Computing, 28(3). doi: 10.1016/j. Processing, 47, 880–884. doi:10.1109/78.752614
imavis. 2009.04.012
Jain, A. K. (1977). Partial differential equations
Dewaele, P., Wambacq, P., Oosterlinck, A., & and finite difference methods in image process-
Marchand, J. L. (1990). Comparison of some ing, part I: Image representation. Journal of
speckle reduction techniques for SAR images. Optimization Theory and Applications, 23, 65–91.
Geoscience and Remote Sensing Symposium, doi:10.1007/BF00932298
IGARSS ‘90, (pp. 2417-2422).
Jain, A. K. (1978). Partial differential equations
Evans, L. C., & Spruck, J. (1991). Motion of level and finite difference methods in image processing,
sets by mean curvature. International Journal of Part II: Image restoration. IEEE Transactions on
Differential Geometry, 33, 635–681. Automatic Control, 23(5), 817–834. doi:10.1109/
TAC.1978.1101881
Frost, V. S., & Stiles, J. A. (1982). A model for radar
Images and its application to adaptive digital filter- Jain, A. K. (2006). Fundamentals of digital image
ing of multiplicative noise. IEEE Transactions on processing. India: PHI.
Pattern Analysis and Machine Intelligence, 4(2),
Kass, M., Witkin, A., & Terzopoulos, A. (1988).
157–166. doi:10.1109/TPAMI.1982.4767223
Snakes: active contour models. International
Gabor, D. (1965). Information theory in elec- Journal of Computer Vision, 1, 321–331.
tron microscopy. Laboratory Investigation, 14, doi:10.1007/BF00133570
801–807.
Kimia, B. B., Tanenbaum, A., & Zucker, S. W.
Gilboa, G., Sochen, N., & Zeevi, Y. Y. (2004). (1990). Lecture Notes in Computer Science: Vol.
Image enhancement and denoising by complex 427. Toward a computational theory of shape: An
diffusion processes. IEEE Transactions on Pat- overview. New York: Springer-Verlag.
tern Analysis and Machine Intelligence, 25(8),
Koenderink, J. J. (1984). The structure of images.
1020–1036. doi:10.1109/TPAMI.2004.47
Biological Cybernetics, 50, 363–370. doi:10.1007/
BF00336961

605
PDE-Based Image Processing

Kuan, D. T., & Sawchuk, A. A. (1987). Adaptive Osher, S., & Rudin, L. I. (1990). Feature oriented
restoration of images with speckle. IEEE Transac- image enhancement using shock filters. SIAM
tions on Acoustics, Speech, and Signal Processing, Journal on Numerical Analysis, 27, 919–940.
35, 373–383. doi:10.1109/TASSP.1987.1165131 doi:10.1137/0727053
Lee, J. S. (1981). Speckle analysis and smooth- Osher, S., & Sethian, J. A. (1988). Fronts propa-
ing of synthetic aperture radar images. Com- gating with curvature dependent speed: Algo-
puter Graphics and Image Processing, 17, 24–32. rithms based on Hamilton-Jacobi formulations.
doi:10.1016/S0146-664X(81)80005-6 Journal of Computational Physics, 79, 12–49.
doi:10.1016/0021-9991(88)90002-2
Lee, J. S. (1983). Digital image smoothing and the
sigma filter. Computer Vision Graphics and Im- Perona, P. (1998). Orientation diffusions. IEEE
age Processing, 24, 255–269. doi:10.1016/0734- Transactions on Image Processing, 7, 457–467.
189X(83)90047-6 doi:10.1109/83.661195
Lucy, L. B. (1974). Image restoration of high Perona, P., & Malik, J. (1990). Scale space and
photometric quality. In R.J Hanisch & R. L. White edge detection using anisotropic diffusion. IEEE
(Eds.), Proceedings of the Restoration of HST Transactions on Pattern Analysis and Machine
Images and Spectra STSci, (pp. 79-85). Intelligence, 12, 629–639. doi:10.1109/34.56205
Mateo, J. L., & Fernández, C. A. (2009). Finding Press, H., Teukolsky, S., Vetterling, T., & Flan-
out general tendencies in speckle noise reduc- nery, B. (1992). Numerical recipes in C: The art
tion in ultrasound images. Expert Systems with of scientific computing (2nd ed.). Cambridge
Applications, 36, 7786–7797. doi:10.1016/j. University Press.
eswa.2008.11.029
Price, C. B. (1990). Image enhancement and
Milan, S., Vaclav, H., & Boyle, R. (2007). Image analysis with reaction diffusion paradigm. Pro-
processing, analysis and machine vision (2nd ed.). ceedings of the Institution of Electrical Engineers,
Washington, DC: PWS Publishing. 137, 136–145.
Morel, J. M., & Solimini, S. (1988). Segmentation Rajan, J., Kannan, K., & Kaimal, M. R. (2008).
of images by variational methods: A constructive An improved hybrid model for molecular image
approach. Revista Matematica de Universidad de-noising. Journal of Mathematical Imaging
Complutense Madrid, 1, 169–182. and Vision, 31, 73–79. doi:10.1007/s10851-008-
0067-4
Mumford, D., & Shah, J. (1989). Optimal ap-
proximations by piecewise smooth functions Romeny, B. ter H. (1994). Geometry driven dif-
and variational problems. Communications on fusion in computer vision. Boston, MA: Kluwer.
Pure and Applied Mathematics, 42, 577–685.
Rudin, L. I., Osher, S., & Fatemi, E. (1992). Non
doi:10.1002/cpa.3160420503
linear total variation based noise removal algo-
Osher, S., & Fedkiw, R. P. (2001). Level set meth- rithms. Physica D. Nonlinear Phenomena, 60,
ods: An overview and some recent results (pp. 259–268. doi:10.1016/0167-2789(92)90242-F
1–65). California, USA: IPAM GBM Tutorials.

606
PDE-Based Image Processing

Salinas, H., & Fernandez, D. C. (2007). Compari- Srivastava, R., Gupta, J. R. P., & Parthasarthy,
son of PDE-based nonlinear diffusion approaches H. (2009). Complex diffusion based speckle re-
for image enhancement and de-noising in optical duction from digital images. IEEE International
coherence tomography. IEEE Transactions on Conference on Methods and Models in Computer
Medical Imaging, 26(6), 761–771. doi:10.1109/ Science (ICM2CS-09), Delhi, India, (pp. 43-49).
TMI.2006.887375
Topiwala, P. N. (Ed.). (1998). Wavelet image and
Shen, T. F., & Vese, L. (2002). Variational PDE video compression. Kluwer Academic Publishers.
models in image processing. Joint Math Meeting,
Wang, Z., Bovik, A. C., Sheikh, H. R., & Celli,
San Diego.
E. P. S. (2004). Image quality assessment: From
Soman, K. P., & Ramachandran, K. I. (2004). error visibility to structural similarity. IEEE
Insight in to wavelets from theory to practice. Transactions on Image Processing, 13(4), 1–14.
India: PHI. doi:10.1109/TIP.2003.819861
Srivastava, R. (2011). A complex diffusion based Witkin, A. P. (1983). Scale space filtering. Interna-
nonlinear filter for speckle reduction from opti- tional Joint Conference on Artificial Intelligence,
cal coherence tomography images. International Germany, (pp. 1019-1023).
Conference on Communication, Computing &
You, Y. L., & Kaveh, M. (2000). Fourth order
Security (ICCCS-2011), Rourkela, India, (pp.
partial differential equations for noise removal.
259-264). ACM Press. ISBN-978-1-4503-0464-1
IEEE Transactions on Image Processing, 9,
Srivastava, R., & Gupta, J. R. P. (2010). A PDE- 1723–1730. doi:10.1109/83.869184
based nonlinear filter adapted to Rayleigh’s
Yu, Y., & Acton, S. T. (2002). Speckle reducing
speckle noise for de-speckling 2D ultrasound
anisotropic diffusion. IEEE Transactions on Im-
images. International Conference on Contempo-
age Processing, 11(11), 1260–1270. doi:10.1109/
rary Computing (IC3-2010), Communications
TIP.2002.804276
in Computer and Information Science (CCIS),
CCIS-94, India, (pp. 1-12). Berlin, Germany: Zhang, R., Tsai, P.-S., Cryer, J. E., & Shah, M.
Springer-Verlag (1999). Shape from shading: A survey. IEEE Trans-
actions on Pattern Analysis and Machine Intel-
Srivastava, R., Gupta, J. R. P., & Parthasarathy, H.
ligence, 21(8), 690–706. doi:10.1109/34.784284
(2009). Comparison of PDE based and other tech-
niques for speckle reduction from digitally recon-
structed holographic images. Optics and Lasers
KEY TERMS AND DEFINITIONS
in Engineering, 48(5), 626–635. doi:10.1016/j.
optlaseng.2009.09.012 Additive Noise: It follows Gaussian distribu-
Srivastava, R., Gupta, J. R. P., Parthasarathy, H., & tion.
Srivastava, S. (2011). An adaptive nonlinear PDE Homomorphic Filter: This filter is used for
based speckle reduction technique for ultrasound speckle reduction from digital images using filters
images. International Journal of Biomedical defined for additive noise removal in logarithmic
Engineering and Technology, 6(3). doi:10.1504/ domain. In logarithmic domain, the multiplicative
IJBET.2011.041468 noise converts in to additive noise.
PDE-Based Filters: Partial differential equa-
tion based filters derived using variational calculus

607
PDE-Based Image Processing

by minimizing the energy functional of the image Speckle Noise: A multiplicative noise that
defined in terms of gradient norm of the image. appears in digital holographic images, ultrasound
Performance Metrics: The various perfor- imaging, SAR imaging and many other applica-
mance measurement metrics are MSE, PSNR, tions. It degrades the visual quality of the image.
CP, MSSIM, speckle index and effective number This noise is introduced during the formation of
of looks. image. Speckle noise in ultrasound images follows
Restoration: This is one of the digital image Rayleigh’s distribution.
processing tasks that removes or reduces the noise Speckle Reduction: It deals with the methods
from images to improve its visual quality. to reduce speckle noise.

This work was previously published in 3-D Surface Geometry and Reconstruction: Developing Concepts and Applications,
edited by Umesh Chandra Pati, pp. 49-89, copyright 2012 by Information Science Reference (an imprint of IGI Global).

608
609

Chapter 36
Multicast over Location-
Based Services
Péter Hegedüs
Budapest University of Technology and Economics, Hungary

Mihály Orosz
Budapest University of Technology and Economics, Hungary

Gábor Hosszú
Budapest University of Technology and Economics, Hungary

Ferenc Kovács
Budapest University of Technology and Economics, Hungary

ABSTRACT
This chapter details the potential found in combining to different technologies. The two basically dif-
ferent technologies, LBSs in mobile communication and the well-elaborated multicast technology are
merged in the multicast via LBS solutions. As this chapter demonstrates, this emerging new area has a
lot of possibilities, which have not been completely utilized.

INTRODUCTION or a given group of people in order to process


location-dependent information. This use does
Currently, an important area of mobile communi- not cover the full range that is conceivable for
cation is ad-hoc computer networks, where mobile these services. This article introduces so-called
devices need base stations however they form an Zone Services as a new sub-category of LBSs. In
overlay without any Internet-related infrastruc- contrast to traditional LBSs, Zone Services collect
ture, which is a virtual computer network among information about persons currently located in a
them. In this case, the selective, location-related given geographic area. For these services, new
communication has not been solved completely. considerations regarding data collection, privacy,
Traditional Location-Based Services (LBSs) and efficiency have to be made. Hence, it has to
determine the current location of a given person be determined what techniques or mechanisms
common in traditional LBSs or in other areas like
DOI: 10.4018/978-1-4666-2038-4.ch036

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Multicast over Location-Based Services

databases or mobile communication systems can information by sending a request to a server that
be reused and what concepts have to be developed. provides location-dependent information.
One of the various communication models LBSs process information with respect to the
among software entities is the one-to-many data location of one or several persons, also referred to
dissemination, called multicast. The multicast as targets before presenting it to the user. In recent
communication over mobile ad-hoc networks years, LBSs have become increasingly important
has increasing importance (Hosszú, 2005). The and have helped accelerate the development
article described the fundamental concepts and towards ubiquitous computing environments.
solutions on the area of LBSs and the possible Traditional LBSs map targets to locations (e.g.,
multicasting over the LBS systems. This kind Where is person X located?), i.e., they find the
of communication is in fact a special case of the position of a specific person or group of people.
multicast communication model, called geocast, This type of LBS is denoted as Tracking Services.
where the sender disseminates data to a subset of There are a lot of location positioning meth-
the multicast group members that are in a specific ods and technologies, such as the satellite-based
geographical area. This chapter shows that this Global Positioning System (GPS) that is widely
special kind of multicast utilizes the advantages of applied (Hofmann-Wellenhof et al., 1997). The
LBSs, since multicast is based on location-aware location determination methods that do not use
information that is available in location-based the GPS can be classified into three categories:
solutions. Proximity, Triangulation (lateration), and Scene
The two basically different technologies, LBSs analysis or pattern recognition (Hightower &
in mobile communication and the well-elaborated Borriello, 2001). Signal strength is frequently
multicast technology are merged in the multicast applied to determine proximity. As a proximity
via LBS solutions. As the chapter demonstrates, measurement, if a signal is received at several
this emerging new area has a lot of possibilities, known locations, it is possible to intersect the
which has not been completely utilized. coverage areas of that signal to calculate a location
area. If one knows the angle of bearing (relative
to a sphere) and distance from a known point to
BACKGROUND the target device, then the target location can be
accurately calculated. Similarly, if somebody
The positioning technologies in the LBS solutions knows the range from three known positions
are based on the various distances of the commu- to a target, then the location of the target object
nication mobile from the different base stations. can be determined. A GPS receiver uses range
With advances in automatic position sensing and measurements to multiple satellites to calculate
wireless connectivity, the application range of its position. The location determination methods
mobile LBSs is rapidly developing, particularly can be server-based or client-based according
in the area of geographic, tourist and local travel to the place of computation (Hightower & Bor-
information systems (Ibach et al., 2005). Such riello, 2001).
systems can offer maps and other area-related LBSs utilize their ability of location-awareness
information. The LBS solutions give the capability to simplify user interactions. With advances in
to deliver location-aware content to subscribers on wireless connectivity, the application range of
the basis of the positioning capability of the wire- mobile LBSs is rapidly developing, particularly in
less infrastructure. The LBS solutions can push the field of tourist information systems - telematic,
location-dependent data to mobile users according geographic, and logistic information systems.
to their interest or the user can pull the required However, current LBS solutions are incompat-

610
Multicast over Location-Based Services

ible with each other since manufacturer-specific sender disseminates the data to a subset of the
protocols and interfaces are applied to aggregate multicast group members that are in a specific
various components for positioning, networking, geographical area.
or payment services. In many cases, these com-
ponents form a rigid system. If such a system has
to be adapted to another technology, e.g., moving MULTICASTING
from GPS based positioning to in-house IEEE
802.11a-based Wireless Local-Area Network The models of multicast communication differ in
(WLAN) or Bluetooth based positioning, it has the realization of the multiplication function in
to be completely redesigned (Haartsen, 1998). the intermediate nodes. In the case of Datalink-
As such, the ability of interoperation of differ- Level the intermediate nodes are switches, on
ent resources under changeable interconnection the Network-Level they are routers and on the
conditions becomes crucial for the end-to-end Application-Level the fork points are applica-
availability of the services in mobile environments tions on hosts.
(Ibach, & Horbank, 2004). The Datalink-Level based multicast is not flex-
Chen et al. (2004) introduces an enabling ible enough for new applications, which is why it
infrastructure, which is a middleware, in order to has no practical importance. The Network-Level
support LBSs. This solution is based on a Location Multicast (NLM), known as IP-multicast, is well
Operating Reference Model (LORE) that solves elaborated and sophisticated routing protocols
many problems of constructing LBSs, including are developed for it. However, it has not yet been
location modeling, positioning, tracking, location- widely deployed since routing among the Autono-
dependent query processing and smart location- mous Systems (AS) has not been solved perfectly.
based message notification. Another interesting The application level solution gives less efficiency
solution is the mobile yellow page service. compared to the IP-multicast, however, its deploy-
An interesting development is the Compose ment depends on the application itself and it has
project, which aims to overcome the drawbacks no influence on the operation of the routers. That
of the current solutions by pursuing a service inte- is why the Application-Layer Multicast (ALM) is
grated approach that encompasses pre-trip and on- currently increasing in importance.
trip services where on-trip services could be split There are a lot of various protocols and imple-
into in-car and last-mile services (Bocci, 2005). mentations of the ALM, some of which are suit-
The pre-trip service means the 3D navigation of able for communication over wireless networks,
the users in a city environment, and the on-trip which enhance the importance of the ALM. The
service means the in-car and the last-mile services reason for this is that in the case of mobile devices
together. The in-car service is a composition of the importance of ad-hoc networks is increasing.
an LBS and a satellite broadcasting/multicasting Ad-hoc is a network that does not need any infra-
method. In this case, the user has wireless-link structure. Such networks are Bluetooth (Haartsen,
access by Personal Digital Assistant (PDA) to 1998) and Mobile Ad Hoc NETwork (MANET),
broadcast or multicast. The last-mile service helps which comprise a set of wireless devices that can
the mobile user with PDAs to receive guidance move around freely and communicate in relaying
during the final part of the journey. packets on behalf of one another (Mohapatra et
The article focuses on the multicast solutions al., 2004).
over the current LBS solutions. This kind of com- In computer networking, there is a weaker
munication is in fact a special case of the multicast definition of this ad-hoc network. Ad-hoc is a
communication model, called geocast, where the computer network that does not need a routing

611
Multicast over Location-Based Services

infrastructure. It means that the mobile devices mesh members, the node either sends a join request
that use base stations can create ad-hoc computer toward a core or attempt to reach a group member
networks. In such situations, the usage of Applica- using an expanding-ring search process. Any mesh
tion-Level Networking (ALN) technology is more member can respond to the join request with a join
practical than IP-Multicast. In order to support this Acknowledgement (ACK) that propagates back to
group communication, various multicast routing the request originator.
protocols are developed for the mobile environ- Compared to the mesh-based routing protocols,
ment. The multicast routing protocols for ad-hoc which exploit variable topology, the so-called
networks differ in terms of state maintenance, gossip-based multicast routing protocols exploit
route topology and other attributes. randomness in communication and mobility.
The simplest ad-hoc multicast routing methods Such multicast routing protocols apply gossip
are flooding and tree-based routing. Flooding as a form of randomly controlled flooding to
is very simple, which offers the lowest control solve the problems of network news dissemina-
overhead at the expense of generating high data tion. This method involves member nodes to
traffic. This situation is similar to the traditional IP- talk periodically to a random subset of other
Multicast routing. However, in a wireless ad-hoc members. After each round of talk, the gossipers
environment, the tree-based routing fundamen- can recover their missed multicast packets from
tally differs from a wired IP-Multicast situation, each other (Mohapatra et al., 2004). Compared
where tree-based multicast routing algorithms to the deterministic approaches, this probabilistic
are obviously the most efficient ones, such as in method will better work in a highly dynamic ad
the Multicast Open Shortest Path First (MOSPF) hoc network because it operates independently of
routing protocol (Moy, 1994). Though tree-based network topology and its random nature fits the
routing generates optimally small data traffic on typical characteristics of the network.
the overlay in the wireless ad-hoc network, the
tree maintenance and updates need a lot of control
traffic. That is why the simplest methods are not THE LOCATION-AWARE MULTICAST
scalable for large groups.
A more sophisticated ad-hoc multicast rout- The geocasting can be combined with flooding.
ing protocol is the Core-Assisted Mesh Protocol Such methods are called forwarding zone meth-
(CAMP), which belongs to the mesh-based mul- ods, which constrain the flooding region. The
ticast routing protocols (Garcia-Luna-Aceves & forwarding zone is a geographic area that extends
Madruga, 1999). It uses a shared mesh to support from the source node to cover the geocast zone.
multicast routing in a dynamic ad-hoc environ- The source node defines a forwarding zone in the
ment. This method uses cores to limit the control header of the geocast data packet. Upon receiving
traffic needed to create multicast meshes. Unlike a geocast packet, other machines will forward
the core-based multicast routing protocol as the it only if their location is inside the forwarding
traditional Protocol Independent Multicast-Sparse zone. The Location-Based Multicast (LBM) is
Mode (PIM-SM) multicast routing protocol (Deer- an example for such geocasting-limited flooding
ing et al., 1996), CAMP does not require that all (Ko & Vaidya, 2002).
traffic flow through the core nodes. CAMP uses An interesting type of ad-hoc multicasting is
a receiver-initiated method for routers to join the geocasting. The host that wishes to deliver
a multicast group. If a node wishes to join the packets to every node in a certain geographical area
group, it uses a standard procedure to announce can use such a method. In this case, the position
its membership. When none of its neighbors are of each node with regard to the specified geocast

612
Multicast over Location-Based Services

region implicitly defines group membership. Ev- receiver. The real advantages of the method are
ery node is required to know its own geographical that geocasting services can be offered for all
location. For this purpose they can use the GPS. existing mobile phones without any additional
The geocasting routing method does not require device or infrastructure.
any explicit join or leave actions. The members
of the group tend to be clustered both geographi-
cally and topologically. The geocasting method FUTURE TRENDS
of routing exploits the knowledge of location.
The multicast communication over mobile ad-
hoc networks has increasing importance. The
LBM GEOCASTING AND article has described the fundamental concepts
IP MULTICASTING and solutions. It especially focused on the area
of LBSs and the possible multicasting over them.
Using LBM in a network where routers are in It was shown that a special kind of the multicast,
fixed locations and their directly connected hosts called geocast communication model utilizes
are within a short distance, the location of these the advantages of LBSs, since it is based on the
hosts can be approximated with the location of location-aware information made available in the
their router. These requirements are met by most of location-based solutions.
the GSM (Global System for Mobile Communica- There are two known issues of this IP level
tions), UMTS (Universal Mobile Telecommunica- geocasting. The first problem is the scalability, the
tion System), WIFI (Wi-Fi Certified), WIMAX flooding type of message transfer is less robust as
(Worldwide Interoperability for Microwave Ac- compared to multicast tree based protocols, but this
cess) and Ethernet networks, therefore a novel method is more efficient in a smaller environment
IP layer routing mechanism can be introduced. than using tree allocation overhead of multicast
This new method is a simple kind of the protocols. The second issue is that the source must
geocasting-limited flooding, extending the normal be connected directly to the router that is physically
Multicast Routing Information Base (MRIB) with in the center position in order to become source of
the geological location of the neighbor routers. a session. The proposed geocasting-limited flood-
Every router should know its own location, and a ing protocol should be extended to handle those
routing protocol should be used to spread location situations where the source of a session and the
information between routers. The new IP protocol target geological location are in different places.
is similar to the User Datagram Protocol (UDP)
protocol, but it extends it with a source location and
a radius parameter. The source location parameter CONCLUSION
is automatically assigned by the first router. When
a router receives a packet with empty source loca- The two basically different technologies, the
tion, it assigns its own location to it. The radius Location-Based Services in the mobile commu-
parameter is assigned by the application itself, or nication world and the well-elaborated multicast
it can be an administratively defined parameter communication technology of the computer
This method requires changes in routing networking are jointed in the multicast over LBS
operational systems, but offers an easy way to solutions. As it was described, this emerging new
start geocasting services on an existing IP infra- area has a lot of possibilities, which have not been
structure without using additional positioning completely utilized.
devices (e.g., GPS receiver) on every sender and

613
Multicast over Location-Based Services

As a conclusion it can be stated that despite Hosszú, G. (2005). Reliability Issues of the
the earlier predicted slower development rate of Multicast-based Mediacommunication. In Pagani,
the LBS solutions, nowadays the technical possi- M. (Ed.), Encyclopedia of Multimedia Technology
bilities and the consumers’ demands have already and Networking (pp. 875-881). Hershey, PA: Idea
met. The geospatial property of LBSs provides Group Reference.
technical conditions to apply a specialized type
Ibach, P., & Horbank, M. (2004). Highly-Available
of the multicast technology, called geocasting,
Location-based Services in Mobile Environments.
which gives an efficient and user group targeted
Paper presented at the International Service Avail-
solution for one-to-many communication.
ability Symposium 2004, Munich, Germany, May
13-14.
REFERENCES Ibach, P., Tamm, G., & Horbank, M. (2005).
Dynamic Value Webs in Mobile Environments
Bocci, L. (2005). Compose Project Web Site, Using Adaptive Location-Based Services. In
Retrieved from http://www.newapplication.it/ Proceedings of the 38th Hawaii International
compose Conference on System Sciences (9 pages). IEEE.
Chen, Y., Chen, Y. Y., Rao, F. Y., Yu, X. L., Li, Ko, Y.-B., & Vaidya, N. H. (2002). Flooding-
Y., & Liu, D. (2004). LORE: an infrastructure to Based Geocasting Protocols for Mobile Ad Hoc
support location-aware services. IBM Journal of Networks. Proceeding of the Mobile Networks and
Research and Development, 48(5/6), 601–615. Applications. Kluwer Academic, 7(6), 471-480.
Deering, S. E., Estrin, D., Farinacci, D., Jacobson, LocatioNet. (2006). LocatioNet and Ericsson En-
V., Liu, C-G., & Wei, L. (1996). The PIM architec- ter Into Global Distribution Agreement. Retrieved
ture for wide-area multicast routing. IEEE/ACM from http://www.locationet.com
Trans. on Networking, 4(2), 153-162.
Mohapatra, P., Gui, C., & Li, J. (2004). Group com-
Garcia-Luna-Aceves, J. J., & Madruga, E. L. munications in mobile ad hoc networks. Comput-
(1999, August). The core-assisted mesh protocol. er, 37(2), 52–59. doi:10.1109/MC.2004.1266296
IEEE Journal on Selected Areas in Communica-
tions, 1380–1394. doi:10.1109/49.779921 Moy, J. (March 1994). Multicast extensions to
OSPF. Network Working Group RFC 1584.
Haartsen, J. (1998). The universal radio interface
for ad hoc, wireless connectivity. Ericsson Re- Uppuluri, P., Jabisetti, N., Joshi, U., & Lee, Y.
view,3. Retrieved 2004 from http://www.ericsson. (2005). P2P Grid: Service Oriented Framework
com/review for Distributed Resource Management. In Pro-
ceedings of the IEEE International Conference
Hightower, J., & Borriello, G. (2001 Aug.). Loca- on Web Services, July 11-15, Orlando, FL, USA.
tion Systems for Ubiquitous Computing. IEEE
Computer, 57-65. Weiss, D. (2006). Zone Services — A New Ap-
proach of Location-based Services, Retrieved
Hofmann-Wellenhof, B., Lichtenegger, H., & from http://www.pervasive2006.org/
Collins, J. (1997). Global Positioning System:
Theory and Practice. Fourth Edition, Springer-
Verlag Wien, New York, NY.

614
Multicast over Location-Based Services

KEY TERMS AND DEFINITIONS towards the server. Usually the server provides
services that the client can request from the server.
Ad-Hoc Computer Network: Mobile devices Its alternative is the P2P model (see below).
that require base stations can create the ad-hoc Geocast: One-to-many communications
computer network if they do not need routing among communicating entities, where an entity
infrastructure. in the root of the multicast distribution tree sends
Application-Layer Multicast (ALM): A data to that certain subset of the entities in the
novel multicast technology, which does not require multicast dissemination tree, which are in a spe-
any additional protocol in the network routers, cific geographical area.
since it uses the traditional unicast (one-to-one) IP-Multicast: Network-level multicast tech-
IP transmission. Its other name: Application-Level nology, which uses the special class-D IP-address
Multicast (ALM). range. It requires multicast routing protocols in the
Application-Level Network (ALN): The network routers. Its other name: Network-Level
applications, which are running in the hosts, can Multicast (NLM).
create a virtual network from their logical con- Multicast Routing Protocol: In order to
nections. This is also called overlay network. The forward the multicast packets, the routers have
operations of such software entities are not able to to create multicast routing tables using multicast
understand without knowing their logical relations. routing protocols.
In most cases these ALN software entities use the Multicast Tree: A virtual graph, which gives
P2P model (see below), not the client/server (see the paths of sending multicast data from the source
below) for the communication. (the root of the tree) to the nodes of the tree. Its
Autonomous System (AS): A network with other name: Dissemination tree or Distribution
common administration; it is a basic building Tree.
element of the Internet. Each AS is independent Peer-to-Peer (P2P): It is a communication
from the others. model where each node has the same author-
Client/Server Model: It is a communicating ity and communication capability. They create
model, where one hardware or software entity a virtual network, overlaid on the Internet. Its
(server) has more functionalities than the other members organize themselves into a topology
entity (the client), whereas the client is responsible for data transmission.
to initiate and close the communication session

This work was previously published in Handbook of Research on Geoinformatics, edited by Hassan A. Karimi, pp. 239-245,
copyright 2009 by Information Science Reference (an imprint of IGI Global).

615
616

Chapter 37
A New Image Distortion
Measure Based on Natural
Scene Statistics Modeling
Abdelkaher Ait Abdelouahad Hocine Cherifi
University of Mohammed V-Agdal, Morocco University of Burgundy, France

Mohammed El Hassouni Driss Aboutajdine


University of Mohammed V-Agdal, Morocco University of Mohammed V-Agdal, Morocco

ABSTRACT
In the field of Image Quality Assessment (IQA), this paper examines a Reduced Reference (RRIQA)
measure based on the bi-dimensional empirical mode decomposition. The proposed measure belongs
to Natural Scene Statistics (NSS) modeling approaches. First, the reference image is decomposed into
Intrinsic Mode Functions (IMF); the authors then use the Generalized Gaussian Density (GGD) to
model IMF coefficients distribution. At the receiver side, the same number of IMF is computed on the
distorted image, and then the quality assessment is done by fitting error between the IMF coefficients
histogram of the distorted image and the GGD estimate of IMF coefficients of the reference image,
using the Kullback Leibler Divergence (KLD). In addition, the authors propose a new Support Vector
Machine-based classification approach to evaluate the performances of the proposed measure instead
of the logistic function-based regression. Experiments were conducted on the LIVE dataset.

INTRODUCTION sentially on human observers’ opinion. A popular


method for assessing image quality involves ask-
Performances of image processing algorithms can ing people to quantify their subjective impressions
be evaluated essentially based on the perceived by selecting one of the five classes: Excellent,
image quality. As a result, an image quality as- Good, Fair, Poor, Bad, from the quality scale
sessment method is required. Indeed, subjective (ITU, 2000), then these opinions are converted
measures can be involved. They are based es- into scores. Finally, the average of the scores is
computed to get the Mean Opinion Score (MOS).
DOI: 10.4018/978-1-4666-2038-4.ch037
However, subjective measures cannot be applied

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
A New Image Distortion Measure Based on Natural Scene Statistics Modeling

in real-time applications due to the time required known some success when tested on five types of
for subjective experiments. distortion, it suffers from some weaknesses. First,
Beside subjective measures, objective mea- steerable pyramids are a non-adaptive transform,
sures assess the quality automatically with no need and depend on a basis function. This later cannot
to human observers. But up to now, an objective fit all signals, and when this happens, a wrong
measure which can replace the human observer time-frequency representation of the signal is
does not exist. obtained. Consequently it is not sure that steer-
General-purpose objective measures belong able pyramids will achieve the same success for
to Full Reference (FR) approaches. Thanks to the other type of distortions. Furthermore, the wavelet
availability of the reference image, they can use transform provides a linear representation which
a pixel-to-pixel comparison like the well-known cannot reflect the nonlinear masking phenom-
Peak-Signal-to-Noise (PSNR) or introduced sta- enon in human visual perception (Foley, 1994).
tistical features-based similarity measure (Wang, A novel decomposition method was introduced
Bovik, Sheikh, & Simoncelli, 2004). Nevertheless, by Huang et al. (1998), named Empirical Mode
in real-world applications (like transmission) the decomposition (EMD). It aims to decompose non
reference image is not always available. In that stationary and non linear signals to finite number
case, No-Reference (NR) approaches are more of components: Intrinsic Mode Functions (IMF),
suitable. They attempt to assess the quality of a and a residue. It was first used in signal analysis,
processed image without any cue from its unpro- and then it attracted more researchers’ attention.
cessed version. For this goal, they need a prior A few years later Nunes, Bouaoune, Delechelle,
information on the distortion. As consequence, Niang, and Bunel (2003) proposed an extension
most of NR approaches are conceived for specific of this decomposition in the 2D case Bi-dimen-
distortion type and cannot be generalized to other sional Empirical Mode Decomposition (BEMD).
distortions (Wang, Sheikh, & Bovik, 2002). Re- A number of authors have benefited from the
duced Reference (RR) approaches provide a good BEMD in several image processing algorithms:
compromise between FR and NR, as only partial image watermarking (Taghia, Doostari, & Taghia,
information is involved. They rely on perceptually 2008), texture image retrieval (Andaloussi et al.,
relevant features that have to be extracted from 2009), and feature extraction (Wan, Ren, & Zhao,
the reference image. These features are used at the 2008). In contrast to wavelet, EMD is nonlinear
receiver side for detecting visual quality degrada- and adaptive method, it depends only on data
tion. Consequently, the measure proposed in this since no basis function is needed.
paper falls in the RR approaches. Motivated by the advantages of the BEMD,
Recently, a number of authors have success- and to remedy the wavelet drawbacks discussed
fully introduced RR methods based on: image above, here we propose the use of BEMD as a
distortion modeling (Gunawan & Ghanbari, 2003; representation domain. As distortions affects IMF
Kusuma & Zepernick, 2003), human visual system coefficients and also their distribution. The inves-
(HVS) modeling (Carnec, Le Callet, & Barba, tigation of IMF coefficients marginal distribution
2003, 2005), or finally natural image statistics seems to be a reasonable choice. In the literature,
modeling (Wang & Simoncelli, 2005; Ait Abde- most RR methods use a logistic function-based
louahad, El Hassouni, Cherifi, & Aboutajdine, regression method to predict mean opinion scores
2011). Wang and Simoncelli (2005) introduced from the values given by an objective measure.
a RRIQA measure called WNISM and based These scores are then compared in term of cor-
on Steerable pyramids (a redundant transform relation with the existing subjective scores. The
of wavelets family). Although, this method has higher is the correlation, the more accurate is the

617
A New Image Distortion Measure Based on Natural Scene Statistics Modeling

objective measure. In addition to the objective marginal distribution of IMF coefficients. This
measure introduced in this paper, an alternative implies the availability of IMF coefficient histo-
approach to logistic function-based regression is gram of the reference image at the receiver side.
investigated. It is an SVM-based classification, Using the histogram as a reduced reference raises
where the classification was conducted on each the question of the amount of side information
distortion set independently, according to the to be transmitted. If the bin size is coarse, we
visual degradation level. The better is the clas- obtain a bad approximation accuracy but a small
sification accuracy the higher is the correlation data rate while when the bin size is fine, we get
of the objective measure with the HVS judgment. a good accuracy but a heavier RR data rate. To
To verify the relevance of the adaptivity, we avoid this problem it is more convenient to assume
used the LIVE dataset which contains several a theoretical distribution for the IMF marginal
distortion types with different distortion levels. distribution and to estimate the parameters of the
We believe that with such dataset we can study distribution. In this case the only side informa-
the genericity of the proposed method. tion to be transmitted consists of the estimated
This paper is organized as follows. First, we, parameters and eventually an error between the
present the proposed IQA scheme. The BEMD empirical distribution and the estimated one. The
and its algorithm are presented afterwards. Then, GGD model provides a good approximation of
we describe the distortion measure, followed by a IMF coefficients histogram and this only with
section that explains how we conduct the experi- the use of two parameters. Moreover, we consider
ments and presents some results of a comparison the fitting error between empirical and estimated
with existing methods. Finally, we give some IMF distribution. Finally, at the receiver side we
concluding remarks. use the extracted features to compute the global
distance over all IMFs.

IQA PROPOSED SCHEME


THE BI-DIMENSIONAL EMPIRICAL
In this paper, we propose a new IQA scheme MODE DECOMPOSITION
based on the BEMD decomposition. This scheme
provides a distance between a reference image The Empirical Mode Decomposition (EMD)
and its distorted version as an output. This dis- has been introduced (Huang et al., 1998) as a
tance represents the error between both images driven-data algorithm, since it is based purely
and should have a good consistency with human on the properties observed in the data without
judgment. pre-determined basis functions. The main goal
The scheme consists in two stages as men- of EMD is to extract the oscillatory modes that
tioned in Figure 1. First, a BEMD decomposition represent the highest local frequency in a signal,
is employed to decompose the reference image while the remainders are considered as a residual.
at the sender side and the distorted image at the These modes are called Intrinsic Mode Functions
receiver side. Second, the features are extracted (IMF). An IMF is a function that satisfies two
from the resulting IMFs based on modeling natural conditions:
image statistics. The idea is that distortions make
a degraded image appearing unnatural and affect 1. The function should be symmetric in time,
image statistics. Measuring this unnaturalness can and the number of extrema and zero cross-
lead us to quantify the visual quality degradation. ings must be equal, or at most differ by one.
One way to do so is to consider the evolution of

618
A New Image Distortion Measure Based on Natural Scene Statistics Modeling

Figure 1. The deployment scheme of the proposed RRIQA measure

2. At any point, the mean value of the upper signal is designated as an IMF and then the re-
envelope and the lower envelope must be sidual will be considered as the input signal for
zero. the next IMF. The algorithm terminates when a
The so called “sifting process” works itera- stopping criterion or a desired number of IMFs
tively on the signal to extract each IMF. Let x (t ) is reached.
be the input signal, the algorithm of the EMD is After IMFs are extracted through the sifting
summarized as follows: process, the original signal x (t ) can be repre-
sented by:
Empirical mode decomposition algorithm
1. Identify all extrema of x (t ). n

2. Interpolate between minima (resp. x (t ) =∑Imf j (t ) +m (t ) (1)


maxima), ending up with some enve- j =1

lope emin (t ) (resp. emax (t ) ).


3.Compute the mean Where Imf j is the j th extracted IMF and n

m (t ) =
(e (t ) + e
min max )
(t ) is the total number of IMFs.
In two dimensions (BEMD), the algorithm
2
4. Extract the detail d (t ) = x (t ) − m(t ). remains the same as for a single dimension with
a few changes: the curve fitting for extrema in-
5. Iterate on the residual m(t ) .
terpolation will be replaced with a surface fitting;
this increases the computational complexity for
The sifting process consists in iterating from
identifying extrema and specially for extrema
step 1 to 4 upon the detail signal d (t ) until this interpolation.
later can be considered as zero mean. The resultant

619
A New Image Distortion Measure Based on Natural Scene Statistics Modeling

To extract IMFs, several two dimensions EMD creases. For example, the first IMF contains higher
versions have been developed (Linderhed, 2005; frequencies than the second one. Furthermore, in a
Damerval, Meignen, & Perrier, 2005; Bhuiyan, particular IMF the coefficients histogram exhibits
Adhami, & Khan, 2008). They differ mainly by a non Gaussian behavior, with a sharp peak at zero
the interpolation method they use. The most recent and heavier tails than the Gaussian distribution as
one proposed an interpolation based on statistical can be seen in Figure 3 (a). Such a distribution can
order filters. From a computational cost standpoint, be well fitted with a two parameters Generalized
it is a fast implementation. Indeed only no iteration Gaussian Density (GGD) model given by:
is required for computing each IMF. This a good
reason enough for using it in our experiments.   β 
β   x  
Figure 2 illustrates an application of the BEMD. p (x ) = exp −    (2)
The resulting IMFs from a BEMD retain the 1   α  
2αΓ( )    
highest frequencies at each decomposition level. β
The frequency content decrease as the order of
the IMF increases. For example, the first IMF ∞

contains higher frequencies than the second one. where Γ (z ) = ∫e dt , z > 0 represents the
−t z −1
t
0
Gamma function, α is the scale parameter that
DISTORTION MEASURE describes the standard deviation of the density,
and β is the shape parameter.
The resulting IMFs from a BEMD show the high- In the conception of an RR method, we should
est frequencies at each decomposition level, this consider a transmission context, where an image
frequencies decrease as the order of the IMF in- in the sender side with a perfect quality has to be

Figure 2. An example of image decomposition using the BEMD

620
A New Image Distortion Measure Based on Natural Scene Statistics Modeling

Figure 3. Histograms of IMF coefficients under various distortion types: (a) original image, (b) addi-
tive Gaussian noise contaminated image, (c) Gaussian blurred image, (d) transmission errors distorted
image. Solid curves: histogram of IMF coefficients. Dashed curves: GGD model fitted to the histogram
of IMF coefficients in the original image. The horizontal axis represents the IMF coefficients, while the
vertical axis represents the frequency of these coefficients.

transmitted to a receiver side. The RR method possible. Here, we propose a marginal distribution-
consists in extracting relevant features from the based RR method since the marginal distribution
reference image and uses them as a reduced de- of IMF coefficients changes from a distortion type
scription. However, the selection of features is a to another as illustrated in Figures 3 (b), (c), (d),
critical step. On one hand, extracted features and (e). Let us consider IMFO as an IMF from
should be sensitive to a large type of distortions the original image and IMFD its corresponding
to guarantee the genericity, and also be sensitive from the distorted image. To quantify the quality
to different distortion levels. On the other hand, degradation, we use the Kullback Leibler Diver-
extracted features should have a minimal size as

621
A New Image Distortion Measure Based on Natural Scene Statistics Modeling

gence (KLD) which is recognized as a convenient model cause we are not sure that the distorted
way to compute divergence between two Prob- image is still a natural one and consequently if
ability Density Functions (PDFs). Assuming that the GGD model is still adequate. Indeed the dis-
p (x ) and q (x ) are the PDFs of IMFO and IMFD tortion introduced by the processing can greatly
respectively, the KLD between them is defined modify the marginal distribution of the IMF coef-
as: ficients. Therefore it is more accurate to use the
empirical distribution of the processed image.
p (x )dx
d (p || q ) = ∫ p (x ) log (3) pm (x )dx
q(x ) d (pm || q ) = ∫ pm (x ) log q(x )
(6)

For this aim, the histograms of the original


image must be available at the receiver side. Even Then the KLD between p (x ) and q (x ) are
if we can send the histogram to the receiver side estimated as:
it will increase the size of the feature signifi-
cantly and causes some inconvenients. The GGD d(p || q ) = d (pm || q ) − d (pm || p 
) (7)
model provides an efficient way to get back coef-
ficients histogram, so that only two parameters
are needed to be transmitted to the receiver side. Finally the overall distortion between an origi-
In the following, we note pm (x ) the approxima- nal and distorted image is as it follows:

tion of p (x ) using a 2- parameters GGD model.  


1 
Furthermore, our feature will contains a third D = log2 1 +
 Do

K
k =1 (
d k p k || q k )  (8)

characteristic which is the prediction error defined
as the KLD between p (x ) and pm (x ):
where K is the number of IMFs, p k and q k are
pm (x )dx the probability density functions of the k th IMF
d (pm || p) = ∫ p (x ) log
m
p(x )
 (4) in the reference and distorted images, respec-

tively. d k is the estimation of the KLD between
In practice, this quantity can be computed as p k and q k , and Do is a constant used to control
it follows: the scale of the distortion measure.
The proposed method is a real RR one thanks
Pm (i ) to the reduced number of features used. The image
d (pm || p) = ∑ i =1 Pm (i ) log
L
dx (5) is decomposed into four IMFs and from each IMF
p (i )
we extract only three parameters {α, β, d (pm || p )}
so that 12 parameters in the total. Increasing the
Where P (i ) and Pm (i ) are the normalized number of IMF will increase the computational
heights of the i th histogram bins, and L is the complexity of the algorithm and thus the size of
number of bins in the histograms. the feature set. To estimate the parameters ( α, β )
Unlike the sender side, at the receiver side we we used the moment matching method (Van de
first compute the KLD between q (x ) and pm (x ) Wouwer, Scheunders, & Van Dyck, 1999), and for
extracting IMFs we used a fast and adaptive BEMD
(Equation (6)). We do not fit q (x ) with a GGD

622
A New Image Distortion Measure Based on Natural Scene Statistics Modeling

(Bhuiyan, Adhami, & Khan, 2008) based on sta- ence images shown in Figure 4 have very different
tistical order filters, to replace the sifting process textural characteristics, various percentages of
which is time consuming. homogeneous regions, edges and details.
To evaluate the performances of the proposed To score the images one can use either the
measure, we use the logistic function-based re- MOS or the Differential Mean Option Score
gression which takes the distances and provides (DMOS) which is the difference between “refer-
the objective scores. Another alternative to the ence” and “processed” Mean Opinion Score. For
logistic function-based regression is proposed LIVE database, the MOS of the reference images
and it is based on SVM classifier. More details is equal to zero, then the difference mean opinion
about the performance evaluation are given in score (DMOS) and the MOS are the same.
the next section. To illustrate the visual impact of the different
distortions, Figure 5 presents the reference image
and the distorted images. In order to examine how
EXPERIMENTAL RESULTS well the proposed metric correlates with the hu-
man judgement, the given images have the same
Our experimental test was carried out using the subjective visual quality according to the DMOS.
LIVE database (Sheikh, Wang, Cormack, & Bo- As we can see, the distance between the distorted
vik, 2005). It is constructed from 29 high resolu- images and their reference image is of the same
tion images and contains seven sets of distorted order of magnitude for all distortions.
and scored images, obtained by the use of five In Figure 6, we show an application of the
types of distortion at different levels. Set1 and measure in Equation (8) to five white noise con-
2 are JPEG2000 compressed images, set 3 and taminated images, as we can see the distance
4 are JPEG compressed images, set 5, 6 and 7 increases as the distortion level increases; this
are respectively: Gaussian blur, white noise and demonstrates a good consistency with human
transmission errors distorted images. The 29 refer- judgment.

Figure 4. The 29 reference images of the LIVE database

623
A New Image Distortion Measure Based on Natural Scene Statistics Modeling

Figure 5. An application of the proposed measure to different distorted images: (a) White noise, D =
9.36, DMOS =56.68), (b) Gaussian blur, D= 9.19, DMOS =56.17), (c) Transmission errors, D= 8.07,
DMOS =56.51)

Figure 6. An application of the proposed measure to different levels of Gaussian white noise contami-
nated images

624
A New Image Distortion Measure Based on Natural Scene Statistics Modeling

The tests consist in choosing a reference image γ1 − γ2


logistic (γ, D ) = + γ2 ,
and one of its distorted versions. Both images are  D −γ 3 
− 
 γ 
considered as entries of the scheme given in Fig- 1 +e 4

ure 1. After feature extraction step in the BEMD


domain a global distance is computed between where γ = (γ1, γ2 , γ 3 , γ 4 ) . Then,
the reference and distorted image as mentioned
in Equation (8). This distance represents an objec- MOS p = logistic (γ, D ) .
tive measure for image quality assessment. It Figure 7 shows the scatter plot of DMOS
produces a number and that number needs to be versus the model prediction for the JPEG2000,
correlated with the subjective MOS. This can be Transmission errors, White noise, and Gaussian
done using two different protocols: blurred distorted images. We can easily remark
how well is the fitting especially for the Transmis-
Logistic Function Based-Fegression sion errors and the white noise distortions.
Once the nonlinear mapping is achieved, we
The subjective scores must be compared in term obtain the predicted objective quality scores
of correlation with the objective scores. These (DMOSp). To compare the subjective and objec-
objective scores are computed from the values tive quality scores, several metrics were introduced
generated by the objective measure (the global by the VQEG. In our study, we compute the cor-
distance in our case), using a nonlinear function relation coefficient to evaluate the accuracy
according to the Video Quality Expert Group prediction and the Rank order coefficient to
(VQEG) Phase I FR-TV (VQEG, 2000). Here, we evaluate the monotonicity prediction. These met-
use a four parameter logistic function given by: rics are defined as follows:

Figure 7. Scatter plots of (DMOS) versus the model prediction for the JPEG2000, Transmission errors,
White noise and Gaussian blurred distorted images

625
A New Image Distortion Measure Based on Natural Scene Statistics Modeling

CC = As we can see, our method ensures better


( )(
MOS (i ) − MOS MOS p (i ) − MOS p )
N
∑ i =1
prediction accuracy (higher correlation coeffi-
cients), better prediction monotonicity (higher
(MOS (i ) − MOS ) ∑ (MOS )
2 2
(i ) − MOS
N N
∑ i =1 i =1 p p Spearman rank-order correlation coefficients)
(9) than the WNISM method, and this for the white
noise. While for the remaining distortion sets the
(MOS (i ) − MOS (i)) proposed measure supplies results comparable to
N 2
6∑ p
ROCC = 1 − i =1
the WNISM ones. Also comparing to PSNR which
N (N − 1) 2
is a FR method, we can observe a significant
(10) improvements for the blur and transmission errors
distortions.
where the index i denotes the image sample and We notice that we carried out other experi-
N denotes the number of samples. ments for using the KLD between probability
We planned two experiments to evaluate the density functions (PDFs) by estimating the GGD
performances of the proposed RRIQA measure. parameters at the sender and the receiver side, but
The first experiment aims to evaluate performanc- the results were not satisfying comparing to the
es in term of accuracy by the use of the Pearson proposed measure. This can be explained by the
linear correlation coefficient, while the second strength of the distortion that makes reference
experiment involves the Spearman rank order image lose its naturalness and then an estimation
correlation coefficient to evaluate performances of the GGD parameters at the receiver side is not
in term of monotonicity. In Table 1, we report suitable. To go further, we thought to examine
the results obtained for two RR metrics (BEMD, how an IMF behaves with a distortion type. For
WNISM) and two FR metrics (PSNR, MSSIM). this aim, we conducted the same experiments as
As the FR metrics use more information we can above but on each IMF separately. Table 2 shows
expect than they should be more performing than the results for three distortions from the LIVE
RR metrics. This is true for MSSIM but not for database.
the PSNR that perform poorly as compared to the As observed, the sensitivity of an IMF to the
RR metrics for all the types of degradation except quality degradation changes depending on the
for the noise perturbation. distortion type and the order of the IMF. For in-

Table 1. Performance evaluation for the quality measure using LIVE database

Dataset JP2(1) JP2(2) JPG1 JPG2 Noise Blur Error

Correlation Coefficient (prediction accuracy)


BEMD 0.8265 0.8695 0.9137 0.7866 0.9332 0.8405 0.9176
WNISM 0.9329 0.9502 0.9295 0.8613 0.8902 0.8874 0.9221
PSNR 0.9337 0.8948 0.9015 0.9136 0.7742 0.9866 0.8811
MSSIM 0.9675 0.9668 0.9646 0.9856 0.9706 0.9361 0.9439
Rank-Order Correlation Coefficient (prediction monotonicity)
BEMD 0.8285 0.8702 0.8704 0.7846 0.9068 0.8349 0.9065
WNISM 0.9356 0.9453 0.8623 0.8475 0.8699 0.9147 0.9210
PSNR 0.9231 0.8816 0.8907 0.8077 0.9855 0.7729 0.8785
MSSIM 0.9566 0.9677 0.9572 0.9441 0.9718 0.9421 0.9497

626
A New Image Distortion Measure Based on Natural Scene Statistics Modeling

Table 2. Performance evaluation using IMFs separatel

White Noise Gaussian Blur Transmission Errors

IMF1 CC= 0.91 ROCC= 0.90 CC= 0.74 ROCC= 0.75 CC= 0.87 ROCC= 0.87

IMF2 CC= 0.75 ROCC= 0.73 CC= 0.82 ROCC= 0.81 CC= 0.86 ROCC= 0.85

IMF3 CC= 0.85 ROCC= 0.87 CC= 0.77 ROCC= 0.73 CC= 0.75 ROCC= 0.75

IMF4 CC= 0.86 ROCC= 0.89 CC= 0.41 ROCC= 0.66 CC= 0.75 ROCC= 0.74

stance, the performance decreases for the “Trans- from images and trains a learning algorithm to
mission errors” distortion as the order of the IMF classify the images based on the feature extracted.
increases. Also, some IMFs are more sensitive for The effectiveness of this approach is linked to the
one set, while for the other sets it is not. A weight- choice of discriminative features and the choice
ing factor according to the sensitivity of the IMF of the multiclass classification strategy. One can
seems to be a good way to improve the accuracy ask, why Support vector machine classifier?
of the proposed method. The weights are chosen Nowadays, several classifiers exist. Among
in a way to give more importance for the IMFs these classifiers, the Support Vector Machine
which give better correlation values. To do so, the (SVM) have been shown to provide better results
weights have been tuned experimentally, since no than traditional techniques such as neural network.
emerging combination can be applied in our case. The SVM rely on two main keys:
Let us take the “Transmission errors” set for ex-
ample, if w1,w2 ,w 3 ,w 4 are the weights for the 1. The SVM uses a hyperplane or a set of
IMF1, IMF2 , IMF3 , IMF4 respectively, then we hyperplanes for the separation purpose.
Thus, a good separation is achieved with
should have w1 >w2 > w 3 >w 4 . We change the
the hyperplane that has the largest distance
value of wi ,i = 1, …, 4 until reaching better results. to the nearest training data of any class.
Some improvements have been obtained, but only 2. In order to discriminate sets which are not
for the Gaussian blur set as CC=0.88 and linearly separable in their original space, a
ROCC=0.87. This improvement around 5% is mapping into another space with higher or
promising as the weighing procedure is very rough. infinite dimension is used. The separation
One can expect further improvement by using a problem becomes then easier in that new
more refined combination of the IMF. Detailed space.
experiments on the weighting factors remain for
future work. However, SVM is a binary classifier. In other
words, it is designed to separate data into two
SVM-Based Classification classes; while real-world applications classifica-
tion problems are multiclass. Then, an extension
Traditionally, RRIQA methods use the logistic of the SVM to multiclass case is required. Two
function-based regression to obtain objective strategies were introduced. The first consists in
scores. In this approach one extracts features considering all data in a single optimization; it is

627
A New Image Distortion Measure Based on Natural Scene Statistics Modeling

named All In One (AIO). The second, consists give a better classification accuracy. The entries
in combining several binary SVM classifiers, it of the SVM are formed by the distances com-
is termed devide and combine. For the second puted in Equation (7). For the i th distorted image,
strategy we have three main methods: One Against X i = d1, d2 , d3 ,d4  represents the vector of fea-
One (OAO), Directed Acyclic Graph (DAG), One
tures (only four IMFs are used). Table 3 shows
Against All (OAA). The difference between these
the classification accuracy per set of distortion
methods comes from the number of SVM binary
As illustrated in Table 3, the classification
classifiers involved and the strategy used to com-
accuracy varies between 83.42 for the JPG2 dis-
bine them. Nevertheless, OAO, DAG, OAA pres-
tortion set and 96.55 for the white noise distortion
ent some limitations, since they do not consider
set. In the case of logistic function-based regres-
the full problem directly. Beside the devide and
sion, the top value of the correlation coefficient
combine strategy, the AIO approach seems to be
that we can obtain is equal to 1 as a full correla-
more natural one. Demirkesen and Cherifi (2008)
tion between objective and subjective scores while
have made a complete comparison of multiclass
for the classification case, the classification ac-
SVM classification methods. Results show that
curacy can be interpreted as the probability by
the AIO strategy outperforms the three methods
which we are sure that the objective measure
of devide and combine strategy. Convinced by
correlates well with the human judgment, thus a
the results of this study we have chosen the AIO
classification accuracy that equal to 100% is
strategy in our classification problem.
equivalent to a CC that equal to 1. This leads to
The 7 sets of distorted images from the LIVE
a new alternative of the logistic function-based
dataset are used. Set1 and set 2: JPEG2000 com-
regression with no need to predicted MOS. Thus,
pression, set 3 and set 4: JPEG compression, set
one can ask which one is more preferable? The
5: white noise, set 6: Gaussian blur, set 7: fast
logistic function-based regression or the SVM-
fading. The determination of the training and the
based classification. From the first view, the
testing sets has been realized thanks to the cross
SVM-based classification seems to be more pow-
validation (leave one out). Let us consider a spe-
erful. Nevertheless this gain on performances is
cific set (e.g., white noise). Since the DMOS
obtained at the price of an increasing complexity.
values are in the interval [0,100], this later was
On the one hand a complex training is required
divided into five equal intervals ]0,20], ]20,40],
before one can use this strategy. On the other hand
]40,60], ]60,80], ]80,100] corresponding to the
when this training step has been done the clas-
quality classes: Bad, Poor, Fair, Good Excellent,
sification is straightforward.
respectively. Thus the set of distorted images is
divided into five subsets according to the DMOS
associated to each image in the set. Then at each Table 3. Classification accuracy (%) for LIVE
iteration we trained a multiclass SVM (five database
classes) using the leave one out cross validation.
Type of distortion Classification accuracy
In other words each iteration involves using a
JP(1) 84.63
single observation from the original sample as
JP(2) 88.59
the validation data, and the remaining observations
JPG1 94.87
as the training data. This is repeated such that
JPG2 83.42
each observation in the sample is used once as
Noise 96.55
the validation data. The Radial Basis Function
Blur 89.55
(RBF) kernel was utilized and a feature selection
step was carried out to select its parameters that Error 93.10

628
A New Image Distortion Measure Based on Natural Scene Statistics Modeling

CONCLUSION Bhuiyan, S., Adhami, R., & Khan, J. (2008). A


novel approach of fast and adaptive bidimensional
A reduced reference method for image quality empirical mode decomposition. In Proceedings of
assessment is introduced, it’s a new one since it the IEEE International Conference on Acoustics,
is based on the BEMD, and also the classifica- Speech and Signal Processing, Huntsville, AL
tion framework is proposed as an alternative of (pp. 1313-1316).
the logistic function-based regression. This later
Carnec, M., Le Callet, P., & Barba, D. (2003). An
produces objective scores in order to verify the
image quality assessment method based on per-
correlation with subjective scores, while the
ception of structural information. In Proceedings
classification approach provides an accuracy
of the IEEE International Conference on Image
rates which explain how the proposed measure is
Processing, France (Vol. 3, pp. 185-188).
consistent with the human judgment. Promising
results are given demonstrating the effectiveness Carnec, M., Le Callet, P., & Barba, D. (2005).
of the method especially for the white noise dis- Visual features for image quality assessment with
tortion. As a future work, we expect to increase reduced reference. In Proceedings of the IEEE
the sensitiveness of the proposed method to other International Conference on Image Processing,
types of degradations to the level obtained for Paris, France (Vol. 1, pp. 421-424).
the white noise contamination. We plan to use an
Damerval, C., Meignen, S., & Perrier, V. (2005).
alternative model for the marginal distribution of
A fast algorithm for bidimensional EMD. IEEE
BEMD coefficients. The Gaussian Scale Mixture
Signal Processing Letters, 12(10), 701–704.
seems to be a convenient solution for this purpose.
doi:10.1109/LSP.2005.855548
We also plan to extend this work to other types of
distortion using a new image database. De Forges, J. R. O. (1999). Locally adaptative
method for progressive still image coding. In Pro-
ceedings of the IEEE International Symposium on
REFERENCES Signal Processing and its Applications, Brisbane,
Australia (Vol. 2, pp. 825-829).
Ait Abdelouahad, A., El Hassouni, M., Cherifi,
H., & Aboutajdine, D. (2011). Image quality as- Demirkesen, C., & Cherifi, H. (2008). A compari-
sessment based on IMF coefficients modeling. In son of multiclass SVM methods for real world
Proceedings of the International Conference on natural scenes. In J. Blanc-Talon, S. Bourennane,
Digital Information and Communication Technol- W. Philips, D. Popescu, & P. Scheunders (Eds.),
ogy and its Applications, Dijon, France (Vol. 166, Proceedings of the 10th International Conference
pp. 131-145). on Advanced Concepts for Intelligent Vision
Systems, Juan-les-Pins, France (LNCS 5259, pp.
Andaloussi, J., Lamard, M., Cazuguel, G., Tairi, 752-763).
H., Meknassi, M., Cochener, B., & Roux, C.
(2009). Content based medical image retrieval: Foley, J. (1994). Human luminance pattern mecha-
Use of generalized Gaussian density to model nisms: Masking experiments require a new model.
BEMD IMF. In Proceedings of the World Congress Journal of the Optical Society of America. A, Op-
on Medical Physics and Biomedical Engineering, tics, Image Science, and Vision, 11(6), 1710–1719.
Munich, Germany (Vol. 25, pp. 1249-1252). doi:10.1364/JOSAA.11.001710

629
A New Image Distortion Measure Based on Natural Scene Statistics Modeling

Gunawan, I. P., & Ghanbari, M. (2003). Reduced Taghia, J., Doostari, M., & Taghia, J. (2008). An
reference picture quality estimation by using local image watermarking method based on bidimen-
harmonic amplitude information. In Proceedings sional empirical mode decomposition. In Proceed-
of the London Communications Symposium, Lon- ings of the IEEE Congress on Image and Signal
don, UK (pp. 137-140). Processing (Vol. 5, pp. 674-678).
Huang, N. E., Shen, Z., Long, S. R., Wu, M. C., Van de Wouwer, G., Scheunders, P., & Van Dyck,
Shih, H. H., & Zheng, Q. (1971). …Liu, H. H. D. (1999). Statistical texture characterization
(1998). The empirical mode decomposition and the from discrete wavelet representations. IEEE
Hilbert spectrum for non-linear and non-stationary Transactions on Image Processing, 8(4), 592–598.
time series analysis. Proceedings - Royal Society. doi:10.1109/83.753747
Mathematical, Physical and Engineering Sci-
VQEG. (2000). Final report from the video qual-
ences, 454, 903–995. doi:10.1098/rspa.1998.0193
ity experts group on the validation of objective
International Telecommunications Union (ITU). models of video quality assessment. Retrieved
(2000). Recommendation BT. 500-10: Methodol- from http://www.vqeg.org/
ogy for the subjective assessment of the quality of
Wan, J., Ren, L., & Zhao, C. (2008). Image
television pictures. Geneva, Switzerland: Author.
feature extraction based on the two-dimensional
Kusuma, T. M., & Zepernick, H.-J. (2003). A empirical mode decomposition. In Proceedings
reduced-reference perceptual quality metric for of the Congress on Image and Signal Processing,
in-service image quality assessment. In Proceed- Harbin, China (Vol. 1, pp. 627-631).
ings of the Joint First Workshop on Mobile Future
Wang, Z., Bovik, A., Sheikh, C. H. R., & Simon-
and Symposium on Trends in Communications,
celli, E. P. (2004). Image quality assessment:
Nedlands, WA, Australia (pp. 71-74).
From error visibility to structural similarity.
Linderhed, A. (2005). Variable sampling of the IEEE Transactions on Image Processing, 13(4),
empirical mode decomposition of two dimen- 1624–1639. doi:10.1109/TIP.2003.819861
sional signals. International Journal of Wavelets,
Wang, Z., Sheikh, H. R., & Bovik, A. C. (2002).
Multresolution, and Information Processing, 3(3),
No-reference perceptual quality assessment of
435–452. doi:10.1142/S0219691305000932
JPEG compressed images. In Proceedings of the
Nunes, J., Bouaoune, Y., Delechelle, E., Niang, IEEE International Conference on Image Process-
O., & Bunel, P. (2003). Image analysis by bidi- ing, Rochester, NY (Vol. 1, pp. 477- 480).
mensional empirical mode decomposition. Im-
Wang, Z., & Simoncelli, E. (2005). Reduced-
age and Vision Computing, 21(12), 1019–1026.
reference image quality assessment using a
doi:10.1016/S0262-8856(03)00094-5
wavelet-domain natural image statistic model. In
Sheikh, H., Wang, Z., Cormack, L., & Bovik, Proceedings of the SPIE Conference on Human
A. (2005). LIVE image quality assessment data- Vision and Electronic Imaging, San Diego, CA
base. Retrieved from http://live.ece.utexas.edu/ (Vol. 5666, pp. 149-159).
research/quality

This work was previously published in the International Journal of Computer Vision and Image Processing (IJCVIP), Volume
2, Issue 1, edited by Muhammad Sarfraz, pp. 1-15, copyright 2012 by IGI Publishing (an imprint of IGI Global).

630
631

Chapter 38
Modern Navigation Systems
and Related Spatial Query
Wei-Shinn Ku
Auburn University, USA

Haojun Wang
University of Southern California, USA

Roger Zimmermann
National University of Singapore, Singapore

ABSTRACT
With the availability and accuracy of satellite-based positioning systems and the growing computational
power of mobile devices, recent research and commercial products of navigation systems are focusing
on incorporating real-time information for supporting various applications. In addition, for routing
purposes, navigation systems implement many algorithms related to path finding (e.g., shortest path
search algorithms). This chapter presents the foundation and state-of-the-art development of navigation
systems and reviews several spatial query related algorithms.

INTRODUCTION ability and accuracy of satellite-based positioning


systems and the growing computational power of
Navigation systems have been of growing interest mobile devices, recent research, and commercial
in both industry and academia in recent years. The products of navigation systems are focusing on
foundation of navigation systems is based on the incorporating real-time information for support-
concept of utilizing radio time signals sent from ing various applications. In addition, for routing
some wide-range transmitters to enable mobile purposes navigation systems implement many
receivers to determine their exact geographic algorithms related to path finding (e.g., shortest
location. Based on this precise location, mobile path search algorithms). An increasing number
receivers are able to perform location-based of useful applications are implemented based on
services (Shekhar, et al 2004). With the avail- these fundamental algorithms.

DOI: 10.4018/978-1-4666-2038-4.ch038

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Modern Navigation Systems and Related Spatial Query

MODERN NAVIGATION SYSTEMS oped by the U.S. Department of Defense in the


mid-1980s. Since it became fully functional in
A navigation system is an integration of position 1994, GPS has acted as the backbone of modern
and orientation devices, computation devices, navigation systems around the world.
communication hardware and software for guiding The GPS consists of a constellation of 24
the movement of objects (e.g., people, vehicles, satellites in circular orbits at an altitude of 20,200
etc.) from one location to another. In general, the kilometers (Leick, 1995). Each satellite circles
infrastructure of navigation systems can be classi- the Earth twice a day. Furthermore, there are six
fied into two subsystems: positioning signal trans- orbital planes with four satellites in each plane.
mission systems and positioning signal receivers. The orbits were designed so that at least four
The positioning signal transmission system allows satellites are always within line-of-sight from
the signal receiver to determine its location (lon- most places on the earth (Langley, 1991). The
gitude, latitude, and altitude) using timing signals. trajectory of the satellites is measured by five
Positioning signal receivers range from hand-held monitoring stations around the world (Ascension
devices, cellular phones, to car-based devices. Island, Colorado Springs, Diego Garcia, Hawaii,
These devices typically include some storage and Kwajalein). The master control station, at
of map data and the computing capabilities of Schriever Air Force Base, processes the monitor-
spatial operations, such as calculating directions. ing information and updates the onboard atomic
Additionally, in some novel geoinformatics ap- clocks and the ephemeris of satellites through
plications, the receiver also relies on some server monitoring stations (El-Rabbany, 2002).
components for various services, such as real-time Each GPS satellite repeatedly broadcasts radio
traffic information. In such a scenario, a server signals traveling by line-of-sight, meaning that
infrastructure is introduced which includes a Web they will pass through air but will not penetrate
server, a spatial database server, and an applica- most solid objects. GPS signals contain three
tion server to provide these services. The signal pieces of information (Hofmann-Wellenhof et
receiver communicates with the server via wired al, 1994): a pseudo random sequence, ephemeris
or wireless networking infrastructures. data, and almanac data. The pseudo random se-
quence identifies which satellite is transmitting the
Positioning Signal signal. Ephemeris data allows the GPS receiver
Transmission Systems to determine the location of GPS satellites at any
time throughout the day. Almanac data consists of
Positioning signal transmitters, such as satellites information about the satellite status and current
and base stations, broadcast precise timing signals time from the onboard atomic clock of the satellite.
by radio to receivers, allowing them to determine The GPS receiver calculates its location based
exact geographic locations and then dynamically on GPS signals using the principle of trilateration
display and update their current position on digital (Kennedy, 2002). First, the GPS receiver calcu-
maps. As of 2006, the Global Positioning System lates its distance to a GPS satellite based on the
(GPS) is the only fully functional satellite-based timing signal transmission delay from the satellite
positioning signal transmission system. to the receiver multiplied by the speed of radio
signals. After measuring its distance to at least
Global Positioning System four satellites, the GPS receiver calculates its
current position at the intersection of four abstract
The invention of GPS has had a huge influence spheres, one around each satellite, with a radius of
on modern navigation systems. GPS was devel- the distance from the satellite to the GPS receiver.

632
Modern Navigation Systems and Related Spatial Query

GPS Accuracy works for North America as of 2006. However,


the European Geostationary Navigation Overlay
As a positioning signal transmission system, Service (EGNOS) and the Multi-Functional Sat-
the accuracy of GPS is a very important issue. ellite Augmentation System (MSAS) are being
However, GPS was initially introduced with a developed in Europe and Japan, respectively. They
feature called Selective Availability (or SA) that can be regarded as variants of WAAS.
intentionally degraded the accuracy by introducing The Local Area Augmentation System (LAAS)
an error of up to 100 meters into the civil timing (United States Department of Transportation,
signals. Improved accuracy was available to the FAA, 2002) uses a similar approach where cor-
United States military and a few other users who rection messages are calculated, transmitted,
were given access to the undegraded timing sig- and broadcast via VHF data link within a local
nal. On May 1, 2000, SA was finally turned off, area where accurate positioning is needed. The
resulting in a substantial improvement of the GPS transmission range of these correction messages
accuracy (Conley, 2000). is typically about a 30 to 50 kilometer radius
Additionally, the accuracy of GPS can be af- around the transmitter.
fected by the atmospheric conditions (e.g., Iono-
sphere, Troposphere) as well as reflections of the GLONASS and Galileo
radio signal off the ground and the surrounding Positioning System
structures close to a GPS receiver. The normal
GPS accuracy is about 30 meters horizontally and The GLObal NAvigation Satellite System
52 meters vertically at the 95% probability level (Global’naya Navigatsionnaya Sputnikovaya
when the SA option is turned off (Kennedy, 2002). Sistema, GLONASS) is a satellite-based position-
There are several approaches that have been used ing signal transmission system developed by the
to improve the accuracy of GPS. Russian government as a counterpart to GPS in
Differential GPS (DGPS) (Kennedy, 2002) the 1980’s. The complete GLONASS consists
uses a network of stationary GPS receivers on the of 24 satellites in circular orbits at an altitude of
ground acting as static reference points to calculate 19,100 kilometers. Each satellite circles the Earth
and transmit correction messages via FM signals in approximately 11 hours, 15 minutes. The orbits
to surrounding GPS receivers in a local area. The were designed such that at least five satellites are
improved accuracy provided by DGPS is equal to always within line-of-sight at any given time.
0.5 m to 1 m near the reference point at the 95% Based on measurements from the timing signal of
probability level (Monteiro et al. 2005). Before four satellites simultaneously, the system is able
the SA option was turned off by the Department to offer location information with an accuracy
of Defense, DGPS was used by many civilian of 70 meters.
GPS devices to improve the accuracy. There were 17 satellites in operation by
The Wide Area Augmentation System (WAAS) December 2005 offering limited usage. With
(Loh, Wullschleger et al. 1995) has been widely the participation of the Indian government, it is
embedded in GPS devices recently. WAAS uses expected that the system will be fully operational
25 ground reference stations across the United with all 24 satellites by 2010.
States to receive GPS signals and calculate cor- GALILEO (Issle et al. 2003) is being developed
rection messages. The correction messages are by the European Union as an alternative to GPS
uploaded to a geosynchronous satellite and then and GLONASS. GALILEO is intended to provide
broadcast from the satellite on the same frequency positioning signals with a precision higher than
as GPS to the receivers. Currently WAAS only GPS to both civil and military users. Moreover, it

633
Modern Navigation Systems and Related Spatial Query

improves the coverage of satellite signals at high layers that can be analyzed for a wide variety of
latitude areas. The constellation of GALILEO purposes, such as calculating the route from the
consists of 30 satellites in circular orbits at an current position to a destination.
altitude of 23,222 kilometers. The GALILEO
system is expected to be fully operational by 2010.
NAVIGATION RELATED SPATIAL
Positioning Signal Receivers QUERY ALGORITHMS

Most positioning signal receiving devices are As mentioned earlier, many location-based end-
designed for the use with the GPS system. These user applications can be provided after a position-
devices have been manufactured in a wide variety ing signal receiver calculates the position of a user.
for different purposes, from devices integrated There are several spatial query algorithms which
into cars, personal digital assistants, and phones, are commonly utilized by modern positioning
to dedicated devices such as hand-held GPS signal receivers (e.g., GPS devices) for supporting
receivers. The most popular variants are used in location-based services and shortest path routing.
car-based navigation systems that visualize the We broadly categorize them into location-based
position information calculated from GPS signals query algorithms and shortest path search algo-
to locate an automobile on a road retrieved from rithms in this section. Table 1 summarizes the
a map database. symbolic notations used throughout this section.
In these car-based systems, the map database
usually consists of vector information of some Location-Based Query Algorithms
area of interest. Streets and points of interest are
encoded and stored as geographic coordinates. Point Query: The term point query (PQ) can
The client is able to find some desired places be defined as: given a query point q, find all the
through searching by address, name, or geographic spatial objects O which contain q.
coordinates. The map database is usually stored
on some removable media, such as a CD or flash PQ(q ) = {O | ∀oi ∈ O, q ∈ oi }
memory. A common approach is to have a base
map permanently stored in the ROM of GPS de-
The query processing efficiency of a point
vices. Additional detailed information of areas of
query can be improved by utilizing spatial indices,
interest can be downloaded from a CD or online
e.g., the R-tree (Guttman, 1984) or the Quadtree
by the user in the future.
(Samet, 1984). With a spatial index, all the spatial
Integrating the positioning data from a GPS
objects are represented by geometric approxi-
receiver with the Geographic Information Sys-
mations such as an MBR (Minimum Bounding
tem (GIS) involves data retrieval, data format
Rectangle). Consequently, determining whether
transformation, multi-layer data display, and
data processing. With GPS, it is possible to col-
lect the positioning data in either the real-time or Table 1. Symbolic notation
post-processed mode. The digital format of GPS
Symbol Meaning
data is then converted into a compatible format
q The location of a query point
used in the GIS applications (Steede-Terry 2000; O A set of spatial objects
Kennedy, 2002). Together with other spatially dist(a,b) The Euclidean distance between two objects a and
b
referenced data (e.g., the digital road map data), D A set of nodes
the GIS application consists of a collection of E A set of edges

634
Modern Navigation Systems and Related Spatial Query

the query point is in an MBR is less expensive reaches the leaf nodes of the index structure. The
than checking if the query point is in an irregular selected entries in the leaves are used to retrieve
polygon. After retrieving all the MBRs which the records associated with the selected spatial
overlap with the query point as candidates, the keys (Shekhar et al. 2004).
exact geometry of each element in the candidate
set is examined. Point queries can be applied to Shortest Path Search Algorithms
determine the overlapping regions (e.g., admin-
istrative divisions) of navigation system users. Dijkstra’s Algorithm: One important function
Nearest Neighbor Query: The term nearest of navigation systems is to find the shortest route
neighbor query (NNQ) can be defined as: given to a user specified destination. The well-known
a query point q and a set of spatial objects O, find Dijkstra’s algorithm (Dijkstra, 1959) provides an
the spatial object oi ∈ O which has the shortest ideal solution for finding single-source shortest
distance to q. paths in a graph of vertices connected through
edges. We present the algorithm, assuming that
NNQ(q ) = {oi | ∀o j ∈ O, dist (q, oi ) ≤ dist (q, o j )} there is a path from the vertex of interest to each
of the other vertices. It is a simple modification to
handle the case where this is not so. We initialize
R-trees and their derivatives (Sellis et al. 1987; a set of vertices D to contain only the node whose
Beckmann et al. 1990) have been a prevalent shortest paths are to be determined and assume the
method to index spatial data and increase query vertex of interest is v1. We also initialize a set E
performance. To find nearest neighbors, branch- of edges to being empty. First we choose a vertex
and-bound algorithms have been designed that vi that is closest to v1 and add it to D. In addition,
search an R-tree in either a depth-first (Rousso- we also add the edge < v1, vi > to E. That edge is
poulos et al. 1995) or best-first manner (Hjaltason clearly a shortest path from v1 to vi. Then we check
& Samet, 1999) to detect and filter out unqualified the paths from v1 to the remaining vertices that
branches. Both types of algorithms were designed allow only vertices in D as intermediate vertices.
for stationary objects and query points. They may A shortest of these paths is a shortest path. The
be used when moving objects infrequently pose vertex at the end of such a path is added to D and
nearest neighbor queries. the edge that touches that vertex is added to E.
Range Query: The term range query (RQ) can This procedure is continued until D covers all the
be defined as: given a query polygon q and a set vertices. At this point, E contains the edges for the
of spatial objects O, find all the spatial objects in shortest paths (Neapolitan & Naimipour, 1998).
O which intersect with q. Adaptive Shortest Path Search Algorithm:
Most existing shortest path searching algorithms
RQ(q ) = {O | ∀oi ∈ O, oi ∩ q ≠ ϕ} are executed based on static distance informa-
tion: pre-defined road segments with fixed road
conditions are used in the computation. However
Range queries can be solved in a top-down re- any real-time events (e.g., detours, traffic conges-
cursive procedure utilizing spatial index structures tions, etc.) affecting the spatial network cannot
(e.g., the R-tree). The query region is examined be reflected in the query result. For example, a
first against each branch (MBR) from the root. traffic jam occurring on the route to the computed
If the query polygon overlaps with any branch, destination most likely elongates the total driving
the search algorithm is employed recursively time. More drastically, the closure of a restaurant
on sub-entries. This process terminates after it which was found as the destination according to

635
Modern Navigation Systems and Related Spatial Query

its network distance might even invalidate a query ACKNOWLEDGMENT


result. In other words, finding the shortest path
in terms of travel time is more important than This article was made possible by the NSF grants
the actual distance. Therefore, we need adaptive ERC Cooperative Agreement No. EEC-9529152,
shortest path search algorithms which can integrate CMS-0219463 (ITR), and IIS-0534761. Any
real-time events into the search/routing procedure. opinions, findings and conclusions or recom-
Ku et al. (Ku et al. 2005) proposed a novel travel mendations expressed in this material are those
time network that integrates both road network of the authors and do not necessarily reflect those
and real-time traffic event information. Based on of the National Science Foundation.
this foundation of the travel time network, they
developed an adaptive shortest path search algo-
rithm that utilizes real-time traffic information to REFERENCES
provide adaptive shortest path search results. This
novel technique could be implemented in future Beckmann, N., Kriegel, H.-P., Schneider, R., &
navigation systems. Seeger, B. (1990). The R*-Tree: An Efficient and
Robust Access Method for Points and Rectangles.
Proc. ACM SIGMOD Int’l Conf. Management of
CONCLUSION Data (pp. 322–331). ACM Press.
Conley, R. (2000). Life After Selective Avail-
We have presented the foundation and state of ability. U.S. Inistitute of Navigation Newsletter,
the art development of navigation systems and 10(1), Berlin, Germany: Springer-Verlag.
reviewed several spatial query related algorithms.
GPS has been increasingly used in both military Dijkstra, E. W. (1959). A note on two problems in
and civilian applications. It can be forecast that connexion with graphs. Numerische Mathematik,
GPS will be extensively used and its applicability 1, 269–271. doi:10.1007/BF01386390
expanded into new areas of applications in the
El-Rabbany, A. (2002). Introduction to GPS: The
future. Meanwhile, additional civilian frequencies
Global Positioning System. Boston, MA: Artech
will be developed and allocated to ease the conges-
House.
tion of civil usage. GPS developers are anticipating
the advent of the European GALILEO system that Gibbons G. (2004). Compatible with the Future.
will introduce the birth of the Global Navigation GPS World, 15(4).
Satellite System (GNSS) infrastructure, which
Guttman, A. (1984). R-Trees: A Dynamic Index
combines the functionality of GPS and GALILEO
Structure for Spatial Searching. Proc. ACM
together (Gibbons 2004). The interoperation of
SIGMOD Int’l Conf. Management of Data (pp.
GPS and GALILEO will benefit the users with
47–57). ACM Press.
more signal availability, more signal power, and
improved signal redundancy around the world. Hjaltason, G. R., & Samet, H. (1999). Distance
In addition, several websites (e.g., MapQuest, browsing in spatial databases. ACM Transac-
Yahoo! Maps, etc.) have integrated shortest path tions on Database Systems, 24(2), 265–318.
search algorithms into on-line services. Users doi:10.1145/320248.320255
can conveniently search the shortest path to their
Hofmann-Wellenhof, B., Lichtenegger, H., & Col-
destinations by utilizing these services.
lins, J. (1994). Global Positioning System: Theory
and Practice. Berlin, Germany, Springer-Verlag.

636
Modern Navigation Systems and Related Spatial Query

Issler, J. L., Hein, G., Godet, J., Martin, J. C., Er- Sellis, T. K., Roussopoulos, N., & Faloutsos, C.
hard, P., Lucas-Rodriguez, R., & Pratt, T. (2003). (1987). The R+-Tree: A Dynamic Index for Multi-
Galileo frequency and signal design. GPS World, Dimensional Objects. Proc. of Very Large Date
14(6), 30–37. Base Conf., VLDB, (pp. 507–518).
Kennedy, M. (2002), The Global Positioning Shekhar, S., Vatsavai, R. R., Ma, X., & Yoo, J. S.
System and GIS: An Introduction, 2nd Edition. (2004). Navigation Systems: A Spatial Database
New York: Taylor and Francis. Perspective. Location-Based Services. Morgan
Kaufmann Publishers, (pp. 41–82).
Ku, W.-S., Zimmermann, R., Wang, H., & Wan,
C.-N. (2005). Adaptive Nearest Neighbor Queries Steede-Terry, K. (2000). Integrating GIS and
in Travel Time Networks. Proc. 13th ACM Int’l the Global Positioning System. Redlands, CA:
Symp. Geographic Information Systems (ACM- ESRI Press.
GIS 2005) (pp. 210–219). ACM Press.
United States Department of Transportation,
Langley, R. B. (1991). The orbits of GPS satel- Federal Aviation Administration, (2002). Local
lites. GPS World, 2(3), 50–53. Area Augmentation System Ground Facility,
Specification FAA-E-2937A.
Leick, A. (1995). GPS Satellite Surveying, 2nd
Edition. New York: John Wiley & Sons.
Loh, R., Wullschleger, V., Elrod, B., Lage, M., & KEY TERMS AND DEFINITIONS
Haas, F. (1995). The U. S. wide-area augmentation
system (WAAS). Navigation, 42(3). Ephemeris: Refers to the relative positions
of the planets, or satellites in the sky at a given
Monteiro, L. S., Moore, T., & Hill, C. (2005). What moment.
is the accuracy of DGPS? Journal of Navigation, Geosynchronous Satellite: A satellite whose
58, 207–225. doi:10.1017/S037346330500322X orbital track lies over the equator.
Neapolitan, R., & Naimipour, K. (1998). Foun- GIS: A system for creating, integrating, ana-
dations of Algorithms. Jones and Bartlett. (pp. lyzing and storing managing geographical data
234-241). and associated features. In general, GIS provides
users with an interface to query, retrieve, and edit
Roussopoulos, N., Kelley, S., & Vincent, F. the spatial data in an efficient way.
(1995). Nearest Neighbor Queries. Proc. ACM Shortest Path Search: Finding the shortest
SIGMOD Int’l Conf. Management of Data (pp. or least cost path through an underlay network.
71–79). ACM Press Trilateration: Computing the relative posi-
Samet, H. (1984). The quadtree and related hi- tions of an object using the geometry of sphere
erarchical data structures. ACM Comput. Surv. intersections. To accurately determine the relative
ACM, 16(2), 187–260. position of an object in 2D, trilateration uses at
least 3 reference points, and the measured distance
between the object and each reference point.

This work was previously published in Handbook of Research on Geoinformatics, edited by Hassan A. Karimi, pp. 285-292,
copyright 2009 by Information Science Reference (an imprint of IGI Global).

637
638

Chapter 39
Managing Schema Evolution
in a Federated Spatial
Database System
Xiaoying Wu Geoff West
Curtin University, Australia Curtin University, Australia

Jianhong (Cecilia) Xia Lesley Arnold


Curtin University, Australia Landgate, Australia
Bert Veenendaal
Curtin University, Australia

ABSTRACT
A Federated Spatial Database System (FSDBS) is the integration of multiple spatial data sources and
the realisation of effective spatial data sharing. These are becoming increasingly popular as more and
more spatial and non-spatial datasets are integrated, especially those across a number of independent
organisations. However, in a FSDBS environment, database schemas are subject to change and the
management of these changes is complex and inefficient. This is because schema changes in one local
database will invalidate applications built against this local schema, but also applications built against
the global schema.

The research is motivated by developments in the Shared Land Information Platform, built by Landgate
in Western Australia as a Spatial Data Infrastructure enabler that has been running since 2007. The
more than 350 datasets in SLIP are from and controlled by 20 organisations with queries built over the
different datasets. Changes in the various databases require schema updating to be streamlined.

In this chapter, an Automatic Schema Evolution Framework is explored and developed to more effec-
tively manage schema evolution in a FSDBS. This framework provides a Schema Element Dependency
Meta-Model, a set of Schema Change Templates, and incorporates view generation, view rewriting, and
query rewriting as solutions. These developed methods ensure applications can accommodate schema
changes and hence remain valid.

DOI: 10.4018/978-1-4666-2038-4.ch039

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Managing Schema Evolution in a Federated Spatial Database System

INTRODUCTION of semi-automatic methods that, in future work,


will lead to the development of fully automatic
The traditional database can be described by methods.
schemas that define relations, attributes, etc. A Of particular interest is the Shared Land In-
spatial database schema is an extension of the formation Platform (SLIP) built by Landgate (au-
traditional schema to include spatial descriptions thoritative agency for land information in Western
and behaviours required for a spatial database Australia) as a Spatial Data Infrastructure (SDI)
(Yeung & Hall, 2007). Spatial database schemas, enabler (2011). SLIP went into production in 2007
like traditional schemas, are subject to change or and consists of more than 350 datasets from 20
evolution due to changes in perception of reality different agencies. As such it is a federated database
and application requirements. Changes can also and queries are built over the different datasets.
occur as a consequence of integration with other Schema updating is needed because the datasets
systems, compliance to new regulations and the are agency owned and controlled allowing local
implementation of new security requirements. changes to occur.
The task of managing schema evolution is to For schema evolution, the mismatch between
ensure schemas, data and application consistency evolved schemas and queries need to be overcome.
and integrity after change. Schema changes often Queries here include queries generated by applica-
result in the applications built against the schemas tions and stored queries (views and spatial views)
being invalid. as seen by users. The ASE framework defines the
In a Federated Spatial Database System (FS- components and methods for automatic run time
DBS), in which spatial means geographically query and view rewriting, as well as new view
spatial, the spatial data are shared by multiple generation when schemas change.
organizations and applications across a number
of databases. A local database schema change
will affect not only the applications locally but BACKGROUND
also applications built on the federated schemas.
Effective management is problematic and there are Before explaining the proposed ASE framework,
several challenges to managing schema evolution it is necessary to define various aspects required
in a FSDBS including: for the ASE Framework, explore the various is-
sues and concepts involved in the framework, and
• How to evaluate the impact of schema critique the current state of the art in research into
changes on local and global databases; federated and spatial databases.
• How to propagate the schema changes
across databases in a federated environ- Spatial Database
ment; and, Management Systems
• How to automatically rewrite run time que-
ries, and rewrite and generate views (spatial In order to model the spatial aspects of real world
views and traditional views) so that the ap- objects, a number of different approaches have been
plications are immune to schema changes. proposed and developed. For example, the MADS
(Modeling Application Data with Spatio-Temporal
The objective is to explore and develop an features) model defines constructs and language
Automatic Schema Evolution (ASE) framework to describe an application schema from different
for managing schema evolution in a FSDBS. The aspects including thematic and spatial-temporal
main contribution of this chapter is the proposal data structures, and multi-representation aspects

639
Managing Schema Evolution in a Federated Spatial Database System

(Parent, Spaccapietra, & Zimányi, 2006). Spatial SQL implementations include the implementation
object types supported in MADS include both of primitive data types in the including the binary
feature and coverage types. ISO 19109—Rules data type; and support for extended geometry data
for application Schema (2005)—defines a general types(OGC, 2006a, 2006b). However, coverage
feature model to describe the feature object types objects are not included in the SFA specification
and their properties including attributes, operations so storage, query and management of coverage
and associations. Application schema developed data are not supported by the SQL interface.
by INSPIRE (2009) adopted and augmented ISO
19101(2002) and ISO 19109 (2005) standards Federated Spatial Database Systems
by defining a modelling language and introduc-
ing the coverage spatial object type. Application Increasing spatial data and service sharing is
schemas are conceptual schemas that are platform motivated by (1) cost savings on data collection,
independent. The value domain of a spatial attri- maintenance, and software; (2) improved spatial
bute in the conceptual spatial schemas belong to data usability and consistency; and (3) enhanced
defined spatial data types including points, lines inter-organisational relationships and the best use
and polygons. of scarce GIS resources.
With the development of database technolo- A Spatial Data Infrastructure (SDI) facilitates
gies, spatial object types and their properties de- spatial data sharing and can be built with advanced
scribed by the conceptual spatial schema can be relevant technologies and a set of standards,
implemented in a Spatial Database Management specifications and policies (Nebert, 2004). The
System (SDBMS). A SDBMS manages and ma- key point of a SDI is interoperability and integra-
nipulates spatial data including vector and raster tion that enables users from different areas such
data as well as non-spatial data (Rigaux, Scholl, as government, the private sector and the general
& Voisard, 2002). public to discover and access spatial data and
Spatial database systems are distinguished from services provided by different organisations. In
non-spatial database systems in two ways (Yeung the last decade, SDI has become the global trend
& Hall, 2007): in the Geographic Information Science (GIS)
industry. Examples include INSPIRE (2011) in
• Storage of spatial data objects: points, lines, Europe, the National Spatial Data Infrastructure
and polygons; and (NSDI) in the United State (FDGC, 2011) and the
• Functionality to process spatial data such as Australia Spatial Data Infrastructure (ASDI) in
spatial indexes, extended spatial query lan- Australia and New Zealand (ANZLIC, 2011). Of
guages and effective algorithms for spatial particular interest is the SDI enabler (SLIP) build
operations. by Landgate in WA that has been in operation since
2007. More than350 datasets from 20 different
The implementation of feature objects is sup- organisations are available via SLIP. Importantly
ported by contemporary DBMSs. The Simple each agency acts as custodian for their data and
Features Access (SFA) specification (also called will make changes to the database structures as
ISO 19125) defines the SQL implementation of the needed by their business requirements. Hence,
geometry object model that is consistent with ISO queries built on one or more of the datasets needs
19109—Rules for application Schema (2005)— to be aware of such changes and the queries modi-
and ISO 19107—Spatial Schema (2003). SFA spec- fied appropriately.
ifies the SQL schema for storage, query, retrieval A Federated Spatial Database System (FSDBS)
and update of spatial data via a SQL interface. integrates various geographically distributed spa-

640
Managing Schema Evolution in a Federated Spatial Database System

tial data sources and provides a unified data access analyses queries, discomposes them into
mechanism in order to facilitate spatial data sharing sub-queries, sends sub-queries to relevant
(Yeung & Hall, 2007). This important aspect of data sources, composes results from data
a SDI has been achieved through advancements sources and sends them back to the end users
in networks and communications, distributed and applications. The system catalog contains
computing technologies, and conformance with schema information for the federated data-
standards and policies. Federated database systems base as well as matching and mapping in-
have been the subject of active research for deal- formation between schemas;
ing with database heterogeneity (Litwin, Mark, 2. Spatial data sources: Which are autonomous
& Roussoupoulos, 1990; Sheth & Larson, 1990). and heterogeneous; and
A basic FSDBS shown in Figure 1 consists of: 3. Applications: Including desktop and web
applications. For example, the federated
1. AFederated Spatial Database Management server can work with a web feature server
System: An ordinary DBMS that includes a to provide web-mapping services.
server, a federated database and the system
catalog (Yeung & Hall, 2007). This virtual Building a FSDBS is more challenging than
system receives requests from applications, building a non-spatial DBMS because of the need

Figure 1. Architecture of a FSDBS

641
Managing Schema Evolution in a Federated Spatial Database System

to accommodate vector and raster spatial data as and the evolved schemas, and keeps consistency
well as non-spatial data. To integrate spatial data of the correspondences between schemas after
from various data sources, firstly a federated schema changes.
schema which integrates various structurally and Figure 2 is an example of a FSDBS taken from
semantically different source schemas needs to a Landgate database. There are two local schemas
be built along with schema level correspondences S1, S2, and a federated schema G. S1 consists of
between schemas; secondly, the instance level two feature object classes, ExistingMainRoads and
correspondences need to be detected by match- ProposedMainRoads, and one non-spatial object
ing the semantics of the features and matching class, RoadType. A map application A1 built on S1
their geometries (Goesseln & Sester, 2003). For displays all highways. S2 consists of one feature
example, the same real object may be represented object class, LocalRoads and one non-spatial object
differently in different datasets and needs to be class, FeatureNames. The federated schema G
matched in the FSDBS. includes one feature object class, RoadSegments.
In a FSDBS, the federated schema is dependent A web map application A2 displays all road seg-
on local schemas so when a local schema changes, ments including both main roads and local roads.
applications built on local schemas and federated Schema changes in S1 will affect application A1
schemas and applications built on the federated built on S1, as well as Schema G and application
schemas will be affected. Managing schema evolu- A2 built on G. For example, if ExistingMainRoads
tion overcomes the mismatch between applications and ProposedMainRoads in S1 are merged into

Figure 2. A simple example of a FSDBS from Landgate

642
Managing Schema Evolution in a Federated Spatial Database System

one feature object class MainRoads, applications Views and Spatial Views
A1 and A2 will both be affected.
In a relational DBMS, a view is defined as a stored
Query and Spatial Query query, which can be accessed as a derived table that
contains the resulting tuples for the query. Views
In a DBMS, applications generate queries that can be used to provide a flexible representation
are then sent to the server hosting the DBMS to of a database by providing users with only data
retrieve data. Simply speaking, a query is a ques- of interest and concealing the rest (Date, 2003).
tion imposed on data (Ramakrishnan & Gehrke, A spatial view extends a view by adding a spatial
2003). In a relational DBMS, a query is a function column. Spatial views are created the same way
that results a relation consisting of a set of tuples. as traditional views as stored spatial queries that
Structured Query Language (SQL) is used in contain a spatial column and spatial operations.
most commercial DBMSs. SQL is a declarative Spatial selection and spatial joins on the underlying
language in which a user only specifies what they spatial data retrieves the results for applications.
want and do not have to worry about how to get
it. The basic form of a SQL query is Select-From- Query Rewriting
Where.
A spatial query is more complex than a non- Query rewriting is the process of converting query
spatial query because it deals with data of two or statements into different expressions while still
three dimensions. A spatial query includes support keeping the logical structure of the query (Deutsch,
of geometry data types, and spatial operators and Popa, & Tannen, 2006). Equivalent query rewrit-
spatial joins on these data types. ing occurs when the result of a rewritten query
Spatial operators classified by the OGC (2006a, is equivalent to the results of the original query.
2006b) include basic, query and analysis operators. Query rewriting has been widely studied for differ-
Basic operators deal with the general properties of a ent applications such as query optimisation, data
geometric object. Examples are boundary, envelope integration, data exchange and schema evolution
and spatial reference. Query operators test spatial (Halevy, 2001). For example, rewriting queries
relationships including overlap, touch and cross with materialised views has attracted attention for
between geometric objects. Analysis operators are performance improvement of a database system.
used to perform spatial analysis such as distance, Different query rewriting algorithms have been
difference and buffering. A spatial join is similar developed to rewrite queries with views (Duschka
to a join in a regular relational query that compares & Genesereth, 1997; Levy, Rajaraman, & Ordille,
two or more feature classes in terms of their loca- 1996; Popa, 2001). Rewriting queries against
tion using spatial operators, such as overlay, cross one schema into equivalent ones against another
and difference (Yeung & Hall, 2007). schema has been adopted as a solution to schema
A spatial database uses Spatial SQL that ex- integration, schema transformation and schema
tends traditional SQL. While there is increased evolution (Curino, Moon, Ham, & Zaniolo, 2009;
functionality through the ability to deal with spatial Tatarinov & Halevy, 2004; Ullman, 2000).
objects and the incorporation of spatial operators
and spatial joins, spatial SQL still retains the Schema Mapping
simplicity of traditional SQL (Egenhofer, 1994).
Schema mapping specifies the correspondence
between schemas (Bernstein, Green, Melnik, &
Nash, 2008). More specifically, a schema mapping

643
Managing Schema Evolution in a Federated Spatial Database System

is a set of mapping rules between two schemas, In a FSDBS, schema mapping consists of (i)
which maps one schema to another. Schema mapping between local database schemas and the
mapping specifies the relationships between two federated schema, and (ii) mapping between dif-
schemas including both structural and semantic ferent schema versions of one system.
relationships.
Schema mapping is fundamental for query Related Work
rewriting. In an integrated system, the global
schema provides a unified interface to end users. Schema evolution has been widely studied (Bounif
A query imposed on the global schema is typi- & Pottinger, 2006; Zhou, Rundensteiner, & Shin,
cally decomposed into a number of sub-queries 1997). Research on the impact of schema changes
imposed on local schemas. The decomposition has generated different methodologies and tools
here depends on mapping information typically to identify the impact of schema evolution on ap-
stored as metadata. plications (Karahasanovic & Sjoberg, 2001; Maule,
In data integration, views are often used to Emmerich, & Rosenblum, 2008; Sjoberg, 1993).
represent the mapping between schemas. There Rather than rewriting applications, different
are two main approaches: Global-As-View (GAV) approaches have been developed to provide trans-
and Local-As-View (LAV) (Cali, Calvanese, Gia- parent evolution to users. For example, schema
como, & Lenzerini, 2002). GAV represents the versioning uses a version-control mechanism in
global schema in terms of local schemas while which different versions of schemas and their cor-
LAV represents local schemas in terms of the responding data exist concurrently so applications
global schema. against different and legacy schemas are left un-
The form of a GAV mapping is: ∀x(φ(x) touched and remain valid (Monk & Sommerville,
→P(x)), where φ(x) is a conjunction of atoms 1993; Roddick, 1995).
over source schemas and P(x) is the atom over A view mechanism, based on the concept of ver-
the target schema. sioning, is another approach that provides schema
The LAV mapping is of the form: ∀ x(Q(x) evolution transparency to end users (Bellashsene,
→∃y ψ (x,y)),where Q(x) is the atom over the 1996; Young-Gook & Rundensteiner, 1997). In
source schemas and ψ (x,y) is the conjunction of this approach, different views are created for dif-
atoms over the target schema. ferent applications. When a change is needed for
The advantage of GAV is simplicity as query an application, only the view corresponding to that
rewriting can be achieved simply by unfolding and application will be updated to reflect the changes
substituting. That is, each schema element of one required. Other views remain unchanged and thus
schema can be replaced by associated elements of other applications are also unchanged.
another schema. However, GAV doesn’t have same Curino (2009) automated schema evolution for
extensibility as LAV (Cali, Calvanese, Giacomo, the Wikipedia system by adopting a query rewriting
& Lenzerini, 2002). approach. A predictive approach has been proposed
GAV and LAV can be generalised into a mixed by Bounif (2006), which made a database schema
approach, Global-Local-As-View (GLAV) map- ready for evolution by predicting and anticipating
ping, also known as tuple-generating-dependency further possible change requirements before they
(Lenzerini, 2002). The form of GLAV is ∀x(φ(x) could occur.
→∃y ψ (x,y)) which has more expressive power In a federated database system, schema evolu-
than GAV and LAV. In fact, GAV and LAV are tion in one database will affect other databases as
special cases of GLAV. the schema mapping between the two schemas will

644
Managing Schema Evolution in a Federated Spatial Database System

become invalid. To overcome this inconsistency, a consequence, this chapter focuses on improving
a number of approaches have been developed. on manual methods by managing schema evolution
Version Management has been proposed in in a FSDBS in a semi-automatic manner.
a federated system to produce a new version
of mapping between local and federated sche-
mas (Schonhoff, Strassler, & Dittrich, 2001). A CONCEPTUAL AUTOMATIC SCHEMA
meta-model of Model Correspondence Assertions EVOLUTION FRAMEWORK
(MoCAs) has been developed which relates one
schema to others in a federated system (Busse & This section proposes and explains the conceptual
Pons, 2001). Schema evolution can be propagated Automated Schema Evolution (ASE) Framework
using the MoCAs model and evolution action and its component parts. The main task of the ASE
specifications. An incremental schema integration framework is to provide application independence
approach propagates change of the local schema to to schema changes in a FSDBS. The main compo-
the federated schema by identifying sub-schema nents shown in Figure 3 grouped into three layers:
affected that are then re-integrated (Motz, 2005). the database layer, the middle layer and the user
Most research on schema evolution has been interface, delineated by horizontal dashed lines.
theoretical. There has not been much development The database layer includes the global database
on semi-automatic and automatic procedures; and all local spatial databases participating in the
hence, schema evolution still requires a high degree federation. A metadata repository is also included
of manual work. In addition, very little research has to store all metadata needed for this framework.
been carried out on schema evolution for spatial The middle layer provides application immu-
database systems, and particularly for FSDBSs. As nity to changes that occur in the database layer.

Figure 3. The ASE framework for managing schema evolution in a FSDBS

645
Managing Schema Evolution in a Federated Spatial Database System

Two techniques are included: (1) view generation in the Landgate example of Figure 2 dis-
or query rewriting for applications built on base cussed previously, schema S1 changes to
tables to keep applications unaffected by database S2 produced mapping M12. Schema S2 then
schema changes; and (2) view redefinition against changes to S3 and produces M23. M12 and M23
the new schemas for applications built against a are combined into mapping M13 between S1
view to keep consistency between the view and and S3. Mapping combination also provides
the evolved database schemas. the mapping changes between the federated
The user interface layer mainly deals with schema and the evolved local schema.
processes involving interaction with the Data- 5. Mapping validation: Mappings between
base Administrator (DBA) who manages schema schema versions can be reviewed by users
changes by specifying the schema changes and for correctness and overridden if needed.
performing the schema change analysis. Once a 6. View rewriting: According to SED and
schema change is initiated, scripts for the schema schema mapping information, views that
change and data migration can be generated auto- have dependencies on the changed schema
matically. Schema mapping is generated automati- elements can be detected and rewritten into
cally according to the mapping rules defined by the equivalent expressions under the new
SCTs, and rewriting view definitions and update schema.
of SEDs is semi-automatic with interaction from 7. SED update: After a view is redefined, the
the DBA. dependency information is updated replacing
The steps to perform a schema change include: the old one stored in the metadata repository.
8. View generation: Generating a new view
1. Schema change analysis: By specifying might occur for certain kinds of schema
the schema change and consulting SED change such as renaming a table.
information, the DBA evaluates the impact 9. Query rewriting: For applications built on
of the schema change. base tables, schema changes may break the
2. Schema change design: If the DBA decides link between queries and the old version of
to move forward with the change planned, the schemas. For example, for a map applica-
they can design the implementation of the tion, this may result in the removal of features
changes using a sequence of SCTs. SCTs from the map. To solve this, run time query
are developed according to schema change rewriting is required. When a query against
taxonomy and represent different changes in an old version of the schema is processed,
the spatial database. the query will be translated into an equivalent
3. Schema change implementation: Scripts to query expressed against the new version ac-
implement schema changes can be generated cording to the schema mapping.
by specifying SCTs and participating schema
elements. Spatial Data Repositories
4. Schema mapping generation and com-
position: Each SCT has schema mapping Spatial Data Repositories include local spatial
rules. These rules and participating schema databases and the federated spatial database. Local
elements produce the schema mapping spatial databases contain base tables that store spa-
between the old and new versions of the tial and non-spatial data and views. The federated
schema when it changes. The schema map- spatial database also consists of tables and views
ping can be derived by combining the old as a virtual database built on the local databases.
and new schema mappings. For example,

646
Managing Schema Evolution in a Federated Spatial Database System

Schema Change Templates etc. For example, moving a column involves add-
ing a column in the destination table, copying the
A key objective is to identify the schema change data from the original table and deleting the column
scenarios that can occur in a spatial database en- in the original table. For Merge Tables, In order
vironment and generate the set of SCTs to satisfy to be able rewrite queries, a column is added as
different schema changes. These SCTs can be the condition when two tables are merged. For
specified at the conceptual spatial schema level Join Tables, to avoid data loss, an outer join is
and then mapped into schema changes on the needed.
underlying SQL schema (Relational or Object- To simplify schema changes, each SCT can
Relational DBMS) according to mapping rules generate scripts automatically. SCTs capture the
between these two data models. Using the Simple semantics of change and can specify whether or
Features Access (SFA) specification (ISO 19125) not a view should be generated. For example,
(OGC, 2006a, 2006b), Table 1 shows the mapping after renaming a table, a view with the same name
between the conceptual spatial schema and the SQL of the original table can be generated. In this re-
schema. See the section on the SED meta-model search, SCTs are the basis for schema mapping
for more details. because schema change is treated as the mapping
Table 2 lists the schema change taxonomy in as explained later in the section on Schema Map-
a conceptual schema and corresponding examples ping.
for the Landgate example shown in Figure 2.
According to the mapping rules listed in Table Metadata Repository
1, a set of SCTs can be generalised in the SQL
schema, as listed in Table 3 in which A, B, C, and Metadata is treated as the first class for schema
D are sets of attributes, a, b, c, and d are single evolution. Currently, the most common strategy
attributes, and R, T, and S are tables. Each SCT for metadata management is to create a repository
consists of one or more ordered schema changes. to store metadata that can store different types
Schema primitive changes include addition or of metadata such as structural, descriptive, and
deletion of columns, addition or deletion of tables administrative metadata.
A metadata repository provides a consistent
and united access mechanism to data to improve
Table 1. Mapping between the conceptual spatial the effectiveness of information management. It
schema and the SQL schema also performs other functions such as metadata
definition, transformation, mapping, synchroniza-
Conceptual Spatial SQL Schema
Schema
tion and versioning, as well as impact analysis of
Feature Object Class Table (one or more tables depend-
metadata change (Sen, 2004).
ing on the data types supported by There are four parts in the metadata reposi-
the DBMS) tory: (1) SEDs; (2) schema mapping; (3) schema
Non-Spatial Object Class Table change history; and (4) metadata from which the
Spatial Attributes Geometry Column or Geometry schema element dependency and schema mapping
table with reference to the feature
table are derived.
Non-Spatial Attribute Column
Relationship Foreign Key and Primary Key Schema Catalog
(1:1 or 1:M)
Table with two Foreign Keys The system catalog in a RDBMS stores metadata
(M:N)
related to the relational database. For each of the

647
Managing Schema Evolution in a Federated Spatial Database System

Table 2. Examples of conceptual spatial schema change taxonomy based on Figure 2

Conceptual Spatial Schema Examples


Feature Object Addition of a feature object class Add a Line feature object class Rails in S1
Class
Deletion of a feature object class Delete ProposedMainRoads in S1
Modification of Rename a Feature Object Class Rename LocalRoads to Roads in S2
a feature Object
Add an Attribute Add RoadCustodian in LocalRoads in S2
class 
Delete an attribute Delete RoadUsage in ProposedMainRoads in S1
Rename an attribute Change LaneCount to LaneNumber in S2
Non-spatial Addition of a non-spatial Object class  Add object class RoadName in S1
Object class
Deletion  of a non-spatial Object class  Delete FeatureName in S2
Modification Rename a non-spatial Object class  Rename FeatureNames to RoadNames in S2
of a non-spatial
Add an attribute Add attribute Creator in FeatureName in S2
Object class
Delete an attribute Delete FeatureType in FeatureNames in S2
Rename an attribute Rename Name to RoadName in FeatureNames in S2
Join feature object class with non-spatial object class Join LocalRoads with FeatureNames in S2
Split a feature object into a feature object class and non-spatial object class Slipt ExistingMainRoads into ExistingMainRoads and
RoadNames (new) in S1
Compose feature object classes into one  Compose ProposedMainRoads and Existing MainRoads
into MainRoads in S1
Decompose an object class into two or more object classes Decomposed FeatureNames into RoadNames and Other-
Names in S2
Decompose a feature object class into two or more feature object classes Decompose LocalRoads into ExistingLocalRoads and
ProposedLocalRoads in S2
Move an attribute Move Suburb from FeatureNames to LocalRoads in S2
Relationship Class Add a relationship Add relationship between ExistingMainRoads and
RoadType in S1
Delete a relationship Delete relationship between FeatureNames and Local-
Roads in S2
Modify the cardinality Modify the cardinality between FeatureNames and Loc-
alRoads in S2 from 1:1 to 1:n

Table 3. Schema change templates


local databases, the metadata contains informa-
Schema Operation Input Schema Output Schema
tion on the database schema including database
Add a column R(A) R(A,b)
tables, views, attributes, relationships, indexes,
Add a table T(A)
and security.
The federated spatial database system catalog
Rename a table R(A) T(A)
also stores metadata on the mapping between the
Merge tables R(A), S(A) T(A,b)
federated database schema and local database
Join tables R(a,B), S(a,C) T(a,B,C)
schema.
Split a table R(a,B,C) S(a,B), T(a,C)
Both types of catalog are needed for schema
Decompose a table R(A) S(A), T(A)
evolution in a FSDBS.
Move a column R(a,B,c), S(a,D) R(a,B), S(a,c,D)
Split a column R(a,B) R(c,d,B)
Merge columns R(a,b,C) R(d,C)

648
Managing Schema Evolution in a Federated Spatial Database System

The Schema Element non-spatial attributes, and relationships. A feature


Dependency Meta-Model object class represents a collection of vector objects
of the same type. A non-spatial object class rep-
The SED meta-model (illustrated in Figure 4 resents a category of objects with no spatial char-
using UML syntax) is used to represent column acteristics. An object class has a name and attributes
level schema element dependencies in a FSDBS. including spatial (for a spatial object class) and
This conceptual meta-model consists of three non-spatial attributes. Each attribute has its own
parts: the schema elements of a conceptual spatial properties including name, data type etc. Spatial
schema, the schema elements of a SQL schema, attribute data types are points, lines, or polygons.
and the SEDs. Object classes can have relationships between
The conceptual spatial schema is a conceptual them.
data model adapted from the generic feature The SQL schema is an implementation of
model from ISO 19109 (2005). Schema elements a conceptual spatial schema according to SFA
defined in the model include object classes consist- specifications (OGC, 2006a, 2006b). Schema ele-
ing of spatial feature and non-spatial object ments include tables and views that are supported
classes, attributes including spatial attributes and by contemporary relational or object-relational

Figure 4. SED meta-model

649
Managing Schema Evolution in a Federated Spatial Database System

DBMSs. A table contains the name of the table, between the source and the target elements that
names of all columns and data types for these are connected by the relationship. Depending on
columns. A view includes the name of the view the databases containing the source and target
and names of columns, required because a view elements, the dependency will be identified as an
is a virtual table that is derived from base tables internal or external dependency.
or views by means of stored queries. A view may In a DBMS, some schema elements may
also include other clauses in its definition such as refer to other elements when they are defined.
where and having. For example, a view is defined by a query that
There are three types of tables in a DBMS: refers to one or more tables or one or more views.
feature table, geometry table and non-spatial table. Normally, the element that is being referred to
A feature table stores a collection of features of is called a referenced element. The element that
the same type. Attributes of a feature object class references the other element is called a dependent
including the spatial attributes are columns of a element, because it depends on the other element
feature table. A geometry table stores the geometric (or referencing element).
information of the geometric objects. Tables with- If one schema element is changed, the depen-
out any spatial element are called non-spatial tables. dent elements may become invalid. For example,
The relationship between the conceptual spatial when a table is renamed, the views that are defined
schema and the SQL schema can be seen in Figure on this table will become invalid if the definitions
4. Object classes are stored as tables in the relational of the views are not updated to the new name of
database. More specifically, the implementation the corresponding table.
of a non-spatial object class is a non-spatial table There are two kinds of dependencies between
while the implementation of a feature object class database schema elements, direct dependency
varies depending on the data types supported by the and indirect dependency. Direct dependency is
underlying DBMS. For example, a feature object where the referenced element is referred by the
class uses a feature table and a geometry table, with dependent element directly. Indirect dependency
a key reference between them if the predefined data is where the referenced element is referred to by
types in the DBMS are used. However, only the an element that is referred to by another element.
feature table is needed if geometry data types are The dependencies derived from the metadata
supported by the DBMS. Attributes of an object according to the SED meta-model are direct depen-
class are stored as columns of the corresponding dencies. Indirect dependencies can be determined
table. A relationship between two object classes is iteratively based on the direct dependencies.
implemented as either a table or the primary key
and foreign key column of the two tables. Schema Mapping
Schema element dependencies consist of sche-
ma elements and the relationships between them. Schema mapping in the metadata repository con-
Schema elements are divided into Compound and sists of horizontal and vertical schema mapping.
Part. The Part component denotes atomic elements Horizontal mapping refers to mapping between
such as a column. The Compound component different systems such as the global schemas and
represents the elements that contain parts such as local schemas while vertical mapping refers to
tables and views. Relationships include Referential mapping between an old version and a new ver-
Integrity (RI), DerivedFrom and ReferencedBy. A sion of particular schema in one database system.
relationship connects two schema elements. The In this research describing schema change as
dependency information, termed a column level schema mapping is used. When a database schema
dependency, can be derived from the relationships changes, the schema change itself is treated as the

650
Managing Schema Evolution in a Federated Spatial Database System

schema mapping between the old and new ver- Roads, and, the second one is to split Main-
sion. Each change operation has a corresponding Roads into MainRoads and RoadNames.
mapping rule between schema versions before and Table 5 lists the schema changes and their
after the change. corresponding schema mappings.
The corresponding schema mapping of each
SCT is shown in Table 4 in which F(), F1(), and Before the schema change, the schema mapping
F2() are functions for splitting or merging columns. MSG between the federated schema G and the local
In order to avoid loss of information, deletion of schema S1 and S2 is:
schema elements is not allowed. Under such a
condition, the expressive power of GAV (Global- G.RoadSegments (n, g, lc, rt, sl)
as-View that represents the global schema in terms
of local schema as described in the section on ← (S2.LocalRoads (g, lc, rt, st, nID) ⋈
Schema Mapping) is enough to describe schema S2.FeatureNames (n, nID))
mapping. The language used to describe mapping
here is relational algebra but other representations ∪ ((S11. ExistingMainRoads (n, g, lc, sl,rtID) ⋈
could be used such as relational calculus. Schema S11.RoadType (tID, tn))
change can be achieved by applying one or more
SCTs. All the mapping rules from SCTs form the ∪ (S11.ProposedMainRoads (n, g, lc, sl, rtID) ⋈
S11.RoadType (tID, tn)))
schema mapping between the old and new versions
of the schema.
where n, g, lc, ru, sl, rs, rtID, nID and rn denote
Name, Geometry, LaneCount, RoadUsage, Speed-
Example 1. Suppose there are two schema
Limit, RoadStatus, RoadTypeID, NameID and
changes in schema S1, as shown in Figure
RoadName respectively.
5. The first change is to merge ExisingMain-
Schema evolution can result in the schema
Roads and ProposedMainRoads into Main-
mapping being invalid. For example, in Figure 5,
the schema mapping MSG becomes invalid when
Table 4. Schema change and corresponding S11 changes. In order to ensure the consistency
schema mapping of schema mapping, schema mapping combines
two schema mappings into one (Nash, Bernstein,
Schema change Schema Mapping & Melnik, 2007). This approach can be applied
Add a Column π R(A) ← π R(A,b) to vertical and horizontal mapping. When a data
Add a Table source schema changes, a new schema mapping
Rename a Table R(A) ← T(A) between the global schemas and local schemas
Merge Tables
R(A) ← σ (T(A,b)), S(A) ← can be derived from the original mapping and the
σ(T(A,b))
change schema mapping. Likewise, for the local
R(a,B)← π T(a,B,C)
Join Tables
S(a,C)← π T(a,B,C)
schemas, schema mapping between older versions
and the current one can be derived from the schema
Split a Table R(a,B,C)←S(a,B) ⋈ T(a,C)
changes and the old schema mapping.
Decompose a Table R(A) ← S(A) ∪ T(A)
In Example 1, mapping M13 between S11 and
R(a,B,c) ← π σ (R(a,B) ⋈ S(a,c,D))
Move a Column
S(a,D) ← π S(a,c,D)
S13 can be derived from M12 and M23. Mapping
Split a Column R(a) ←F(R(c,d))
MSG between the global schema G and local
R(a) ←F1(R(d)),
schema S1 and S2 has to be adapted to keep con-
Merge Columns sistency of the mapping. Similarly, MSG′ and MSG″
R(b) ←F2(R(d))

651
Managing Schema Evolution in a Federated Spatial Database System

Table 5. Schema changes and corresponding schema mapping in Example 1

Mapping Name Schema Change Schema Mapping


M12 (mapping be- merge ExisingMainRoads S11.ExistingMainRoads (n, g, lc, ru, sl, rtID) ← σ rs =
tween S11 and S12) and ProposedMainRoads into ’Existing’ (S12.MainRoads (n, g, lc,ru, sl, rs, rtID))
MainRoads
S11.ProposedMainRoads (n, g, lc,ru, sl,rtID) ← σ rs =
’Proposed’ (S12.MainRoads (n, g, lc,ru, sl, rs, rtID))
M23(mapping be- split MainRoads into Main- S12.MainRoads (n, g, lc,ru, sl, rs, rtID) ← S13.MainRoads (nID, g, lc,ru, sl, rs,
tween S12 and S13) Roads and RoadNames rtID) ⋈ S13.RoadNames (nID, rn)
n, g, lc, ru, sl, rs, rtID, nID and rn denote Name, Geometry, LaneCount, RoadUsage, SpeedLimit, RoadStatus, RoadTypeID, NameID, and
RoadName, respectively.

can be derived from schema mapping (M12 and the history records if any errors occur can reverse
M23) and the original mapping (MSG and MSG′ schema changes.
later). Table 6 lists mappings that are generated
by mapping combination for Example 1. Query/View Rewriting and
View Generation
Schema Change History
For a spatial database system, applications such as
In the metadata repository, the schema change map viewing access data stored in the underlying
history keeps a record of each schema change database. The users can build queries against the
providing clear information of schema versions base tables or views (spatial views if there is a
and how the schema evolves. The semantics of the spatial column).
schema change can be determined and examining

Figure5. Example of schema changes

652
Managing Schema Evolution in a Federated Spatial Database System

Table 6. Schema mapping derived from combination in Example 1

Mapping Name Mapping Combination Derived Mapping


M13 (mapping between S11 M13 = M12 • M13 S11.ExistingMainRoads (n, g, lc,ru, sl,rtID) ← σ rs =’ Existing’(S13.Main-
and S13) Roads (nID, g, lc,ru, sl, rs, rtID) ⋈ S13.RoadNames (nID, rn))
S11.ProposedMainRoads (n, g, lc,ru, sl,rtID) ← σ rs =’ Proposed’(S13.
MainRoads (nID, g, lc,ru, sl, rs, rtID) ⋈ S13.RoadNames (nID, rn))
MSG′ (mapping between MSG′ = MSG • M12 G.RoadsSegments (n, g, lc, rt, sl) ← (S2.LocalRoads (g, lc, rt, st, nID) ⋈
G and S1 and S2 when S1 S2.FeatureNames (n, nID)) ∪(σ rs =’ Existing’∪ rs =’ Proposed’ ((S12.
changes from S11 to S12) MainRoads(n, g, lc, sl,rtID) ⋈ S12.RoadType (tID, tn)))
MSG″ (mapping between MSG″ = MSG′ • M23 G.RoadsSegments (n, g, lc, rt, sl) ←(S2.LocalRoads (g, lc, rt, st, nID) ⋈
G and S1and S2 when S1 S2.FeatureNames (n, nID)) ∪ (σ rs =’ Existing’∪ rs =’ Proposed’(S13.
changes from S12 to S13) MainRoads (nID, g, lc,ru, sl, rs, rtID) ⋈ S13.RoadNames (nID, rn) ⋈ S13.
RoadType (tID, tn)))
n, g, lc, ru, sl, rs, rtID, nID and rn denote Name, Geometry, LaneCount, RoadUsage, SpeedLimit, RoadStatus, RoadTypeID, NameID, and
RoadName, respectively.

In order to manage schema evolution in a when they are processed. Secondly, view rewrit-
spatial database system, two kinds of scenarios ing must ensure the view schema is unchanged,
have to be considered: query using base tables and as the change of the view schema will invalidate
query using views. For a query using views, the the applications built on that view. For example,
views need to be redefined with the change of the the view name and its column names should not
schemas. When a query is built against the base be changed. Nonetheless, query rewriting only
tables, two methods can be used in order to support needs to provide equivalent results.
old queries expressed against old schema. They Different algorithms have been developed
are (1) query rewriting and (2) the generation of for query rewriting such as inverse rules, bucket
a new view. Query rewriting is used to ensure the algorithm and the Chase and BackChase (C&B)
queries generated from the applications against the algorithm (Bai, Hong, McTear, & Wang, 2006;
old schema can be translated into queries against Deutsch, et al., 2006; Duschka & Genesereth,
the new schema. By doing this, applications will 1997b). The C&B algorithm has attracted most
be still valid. View generation does not apply to all attention and was first developed to rewrite que-
schema changes. Only certain changes can gener- ries under a set of constraints in order to improve
ate a view to keep the application impervious to query optimisation (Deutsch, Popa, & Tannen,
schema change such as the renaming of a table. 1999; Deutsch, et al., 2006).
If an application is built on views, changes In this research, query rewriting is simply
of schemas might invalidate the view definition. achieved by unfolding and substituting. As de-
Therefore, view rewriting is used to redefine and scribed in the section on Schema Mapping, GAV
recompile the view after schema changes. Since a represents the global schemas in terms of local
view is a stored query, view rewriting is very similar schemas. The advantage of GAV is the simplicity
to query rewriting. However, there are differences for query rewriting and the combination of two
between them. Firstly, view rewriting is a one-off GAV mappings is still a GAV mapping. Therefore,
operation. It only occurs when schema changes schema mapping in this research is restricted to this
occur, whereas query rewriting happens every form of GAV mapping. Under such circumstance,
time the query is processed. This is because any a query can be rewritten by unfolding (Cali, 2003).
affected view can be detected by SEDs while the With substitution, queries expressed against the
queries from the application can only be detected global schema can be rewritten as equivalent ones

653
Managing Schema Evolution in a Federated Spatial Database System

against the source schemas. Similarly, queries Schema element dependencies provide the ba-
expressed against old schemas can be rewritten sis for impact analysis and determine which view
against the new schemas. definition needs to be redefined when the schema
changes. SCTs are used to define a rich bounded
Example 2. Suppose application A1 in Figure set of schema change scenarios in a spatial data-
2 is the map application that displays all base environment and provide the mapping rules
highways from S1 and Q1 is the query gen- for each change.
erated by A1. When S1 changes from S11 to By incorporating the mapping composition,
S12, as shown in Figure 5, by substituting schema mappings between the older versions and
ExistingMainRoads(n, g, lc, sl,rtID) and the new version of a local schema can be derived.
ProposedMainRoads(n, g, lc, sl, rtID) accord- Schema mapping adaptation between the global
ing to schema mapping M12, Q1 against S11 and the local schema can be achieved as well. SCTs
can be rewritten against S12. Table 7 shows also provide the solution as to whether or not a
the query Q1 (expressed in relational algebra) view should be generated. Based on the schema
before and after rewriting. mapping information, run time queries and views
that are expressed against an old schema version
can be rewritten into expressions in terms of the
CONCLUSION current schema by unfolding and substituting. The
end result is that applications will be immune to
Schema evolution is inevitable in a FSDBS schema changes.
because of the likelihood of its dependency on Effective management of schema evolution in
databases owned and maintained by other or- a FSDBS is an integral part of an SDI. It ensures
ganisations. Managing schema evolution in such discovery and access to spatial data and services
an environment is a significant challenge. This is continuous even though schema changes have
research aims to develop the methodologies to occurred. It also provides schema evolution trans-
semi-automatically manage schema evolution in parency to end-users and minimises modification of
a FSDBS environment so that schema changes applications thus saving time and money. Effective
will be transparent to applications and users. To management of schema evolution also increases
achieve this, view generation/rewriting and query the reusability of spatial data; ensures the longevity
rewriting are proposed as the solution. of spatial data and services well into the future;
In order to implement the methodology, a SED improves spatial data and service management;
meta-model and a set of SCTs have been developed. and engenders collaboration and interoperability
The SED meta-model defines the constructs to between data custodians and application develop-
build column level dependencies in a FSDBS. The ers and users.
schema element dependencies can then be derived Future developments include methods to deal
from the SED meta-model and the metadata of the with federated fully functional spatial databases
spatial database. as well as developing the semi-automatic methods

Table 7. Query Q1 before and after rewriting

Before Rewriting (against S11) After Rewriting (against S12)


σ tn =’highway’((S11.ExistingMainRoads(n, g, lc, sl,rtID) ∪ S11. σ tn =’highway’ ((σ rs =’Existing’ ∪ σ rs =
ProposedMainRoads (n, g, lc, sl, rtID)) ⋈ S11.RoadType (tID, tn)) ’Proposed’ (S12.MainRoads (n, g, lc,ru, sl, rs, rtID) ⋈ S12.Road-
Type (tID, tn))

654
Managing Schema Evolution in a Federated Spatial Database System

described in this chapter to be fully automatic. Cali, A., Calvanese, D., Giacomo, G. D., & Len-
Semi-automatic and fully automatic methods zerini, M. (2002). On the expressive power of
will significantly advance federated databases data integration systems. Paper presented at the
and move away from the more community-based Proceedings of the 21st International Conference
methods. on Conceptual Modeling. New York, NY.
Curino, C., Moon, H., Ham, M., & Zaniolo, C.
(2009). The PRISM workwench: Database schema
REFERENCES
evolution without tears. Paper presented at the
ANZLIC. (2011). Australian spatial data infra- the 2009 IEEE International Conference on Data
structure. Retrieved from http://www.anzlic.org. Engineering. New York, NY.
au /infrastructure_ASDI.html. Date, C. J. (2003). An introduction to database
Bai, Q., Hong, J., McTear, M. F., & Wang, H. system (8th ed.). Boston, MA: Addison Wesley.
(2006). A bucket-based approach to query re- Deutsch, A., Popa, L., & Tannen, V. (1999). Physi-
writing using views in the presence of inclusion cal data independence, constraints, and optimiza-
dependencies. Journal of Research and Practice tion with universal plans. Paper presented at the
in Information Technology, 38(3), 251–266. 25th International Conference on Very Large Data
Bellashsene, Z. (1996). Views for schema evolu- Bases. New York, NY.
tion in object-oriented database systems. IEEE Deutsch, A., Popa, L., & Tannen, V. (2006). Query
Transactions on Knowledge and Data Engineer- reformulation with constraints. Paper presented at
ing, 9(4), 600–624. the SIGMOD Rec. New York, NY.
Bernstein, P. A., Green, T. J., Melnik, S., & Nash, Duschka, O. M., & Genesereth, M. R. (1997).
A. (2008). Implementing mapping composition. Answering recursive queries using views. In Pro-
The VLDB Journal, 17(2), 333–353. doi:10.1007/ ceedings of the Sixteenth ACM SIGACT-SIGMOD-
s00778-007-0059-9 SIGART Symposium on Principles of Database
Bounif, H., & Pottinger, R. (2006). Schema re- Systems. Tucson, AZ: ACM.
pository for database schema evolution. Paper Egenhofer, M. J. (1994). Spatial SQL: A query
presented at the the 17th International Conference and presentation language. IEEE Transactions on
on Database and Expert Systems Applications. Knowledge and Data Engineering, 6(1), 86–95.
New York, NY. doi:10.1109/69.273029
Busse, S., & Pons, C. (2001). Schema evolution FDGC. (2011). National spatial data infrastruc-
in federated information system. Datenbanksys- ture. Retrieved from http://www.fgdc.gov /nsdi/
teme in Büro [BTW]. Technik und Wissenschaft, nsdi.html.
9, 26–43.
Goesseln, G., & Sester, M. (2003). Semantic and
Cali, A. (2003). Reasoning in data integration geometric Integration of geoscientific data sets
systems: Why LAV and GAV are siblings. Berlin, with ATKIS - Applied to geo-objects from geology
Germany: Springer. and soil science. Paper presented at the ISPRS
Commission IV Joint Workshop “Challenges in
Geospatial Analysis, Integration and Visualization
II”. Stuttgart, Germany.

655
Managing Schema Evolution in a Federated Spatial Database System

Halevy, A. Y. (2001). Answering queries using Maule, A., Emmerich, W., & Rosenblum, D.
views: A survey. The VLDB Journal, 10(4), S. (2008). Impact analysis of database schema
270–294. doi:10.1007/s007780100054 changes. ICSE, 10-17. Retrieved from http://
research.microsoft.com /pubs/118211/Andy%20
INSPIRE. (2009). INSPIRE generic conceptual
Maule%20-%20Thesis.pdf.
model. Geneva, Switzerland: INSPIRE Infrastruc-
ture for Spatial Information in Europe. Monk, S., & Sommerville, I. (1993). Schema evolu-
tion in OODBS using class versioning. SIGMOD
INSPIRE. (2011). About INSPIRE. Retrieved
Record, 22(3), 16–22. doi:10.1145/163090.163094
from http://inspire.jrc.ec.europa.eu /index.cfm/
pageid/48. Motz, R. (2005). Dynamic maintenance of an
integrated schema. In From Integrated Publica-
ISO. (2002). ISO 19101: Geographic information
tion and Information Systems to Information and
- Reference model. Geneva, Switzerland: ISO.
Knowledge Environments (Vol. 3379, pp. 21–30).
ISO. (2003). ISO 19107: Geographic information Berlin, Germany: Springer. doi:10.1007/978-3-
-Spatial schema. Geneva, Switzerland: ISO. 540-31842-2_3
ISO. (2005). ISO 19109: Geographic informa- Nash, A., Bernstein, P. A., & Melnik, S. (2007).
tion -- Rules for application schema. Geneva, Composition of mappings given by embedded
Switzerland: ISO. dependencies. ACM Transactions on Database
Systems, 32(1), 4. doi:10.1145/1206049.1206053
Karahasanovic, A., & Sjoberg, D. (2001). Visu-
alizing impacts of database schmea changes - A Nebert, D. D. (2004). Developing spatial data
controlled experiment. Paper presented at the the infrastructure: The SDI cookbook. The Global
IEEE 2001 Symposia on Human Centric Comput- Spatial Data Infrastructure (GSDI). Retrieved
ing Languages and Environments (HCC 2001). from http://www.gsdi.org.
New York, NY.
OGC. (2006a). OpenGIS® implementation specifi-
Lenzerini, M. (2002). Data integration: A theoreti- cation for geographic information - Simple feature
cal perspective. In Proceedings of the Twenty-First access - Part 1: Common architecture. Retrieved
ACM SIGMOD-SIGACT-SIGART Symposium on from http://www.opengeospatial.org.
Principles of Database Systems. Madison, WI:
OGC. (2006b). OpenGIS® implementation specifi-
ACM.
cation for geographic information - Simple feature
Levy, A. Y., Rajaraman, A., & Ordille, J. J. (1996). access - Part 2: SQL option. Retrieved from http://
Querying heterogeneous information sources using www.opengeospatial.org.
source descriptions. Paper presented at the 22nd
Parent, C., Spaccapietra, S., & Zimányi, E. (2006).
International Conference on Very Large Data
Conceptual modeling for traditional and spatio-
Bases. New York, NY.
temporal applications, the MADS approach.
Litwin, W., Mark, L., & Roussoupoulos, N. (1990). Berlin, Germany: Springer.
Interoperability of multiple autonomous data-
Popa, L. (2001). Object/relational query optimiza-
bases. ACM Computing Surveys, 22(3), 267–293.
tion with chase and backchase. PhD Disseration.
doi:10.1145/96602.96608
Philadelphia, PA: University of Pennsylvania.

656
Managing Schema Evolution in a Federated Spatial Database System

Ramakrishnan, R., & Gehrke, J. (2003). Database Ullman, J. (2000). Information integration
management systems (3rd ed.). New York, NY: using logical views. Theoretical Computer
McGraw-Hill. Science, 239(2), 189–210. doi:10.1016/S0304-
3975(99)00219-4
Rigaux, P., Scholl, M., & Voisard, A. (2002).
Spatial database - With application to GIS. New Yeung, A. K. W., & Hall, G. B. (2007). Spatial
York, NY: Morgan Kaufmann. database systems - Design, implementation and
project management. Berlin, Germany: Springer.
Roddick, J. F. (1995). A survey of schema ver-
sioning issues for database systems. Informa- Young-Gook, R., & Rundensteiner, E. (1997). A
tion and Software Technology, 37(7), 383–393. transparent schema evolution system based on
doi:10.1016/0950-5849(95)91494-K object-oriented view technology. IEEE Transac-
tions on Knowledge and Data Engineering, 9(4),
Schonhoff, M., Strassler, M., & Dittrich, K. R.
600–624. doi:10.1109/69.617053
(2001). Version propagation in federated data-
base systems. Paper presented at the International Zhou, L., Rundensteiner, A. E., & Shin, G. K.
Databases and Applications Symposium (IDEAS (1997). Schema evolution of an object-oriented re-
2001). Grenoble, France. al-time database system for manufacturing automa-
tion. IEEE Transactions on Knowledge and Data
Sen, A. (2004). Metadata management: Past, pres-
Engineering, 9(6), 956. doi:10.1109/69.649319
ent and future. Decision Support Systems, 37(1),
151–173. Retrieved from http://www.sciencedirect.
com /science/article/B6V8S-47X1YN5-1/2/ d75b-
KEY TERMS AND DEFINITIONS
b1ab996404d75a0203f7e97c6041 doi:10.1016/
S0167-9236(02)00208-7 Federated Spatial Database System: An
Sheth, A. P., & Larson, J. A. (1990). Feder- integrated spatial database system. It integrates
ated database systems for managing distrib- multiple autonomous spatial data sources into a
uted, heterogeneous, and autonomous databases. single federated database and provides a unified
ACM Computing Surveys, 22(3), 183–236. data access mechanism to applications.
doi:10.1145/96602.96604 Spatial Query: A query deals with data of 2 or
3 dimensions by supporting spatial data types, spa-
Sjoberg, D. (1993). Quantifying schema evolu- tial operators, spatial operations and spatial joins.
tion. Information and Software Technology, 35(1), Spatial View: A stored query that contains a
35–44. doi:10.1016/0950-5849(93)90027-Z spatial column.
SLIP. (2011). Welcome to SLIP enabler. Retrieved Schema Mapping: Specification of the cor-
from https://www2.landgate.wa.gov.au /web/ respondences between schemas.
guest. Schema Mapping Composition: To compose
two schema mappings into one mapping.
Tatarinov, I., & Halevy, A. (2004). Efficient query
reformulation in peer data management systems.
Paper presented at the 2004 ACM SIGMOD In-
ternational Conference on Management of Data.
Paris, France.

This work was previously published in Discovery of Geospatial Resources: Methodologies, Technologies, and Emergent
Applications, edited by Laura Díaz, Carlos Granell and Joaquín Huerta, pp. 56-77, copyright 2012 by Information Science
Reference (an imprint of IGI Global).

657
658

Chapter 40
Map Matching Algorithms for
Intelligent Transport Systems
Mohammed A. Quddus
Loughborough University, UK

ABSTRACT
Map matching algorithms integrate positioning data with spatial road network data to support the
navigation modules of intelligent transport systems requiring location and navigation data. Research
on the development of map matching algorithms has significantly advanced over the last few years. This
article looks at different methods that have been adopted in map matching algorithms and highlights
future trends in map matching and navigation research.

INTRODUCTION Services. Two essential components required for


the navigation module of an ITS are: (1) a device
A map matching algorithm integrates positioning to determine the geometric position of the vehicle
data with spatial road network data for the iden- and (2) a spatial road network database for spatial
tification of the correct link on which a vehicle is (physical) reference of the vehicle location.
travelling and to determine the physical location The most common geometric positioning
of the vehicle on the link. This algorithm could devices used to obtain inputs to a map matching
be used as a key component in the navigation algorithm are Dead Reckoning (DR) motion sen-
modules of intelligent transport systems (ITS) sors, global navigation satellite systems (GNSS)
that require locational data of a relatively high such as the Global Positioning System (GPS),
quality. Such ITS applications include navigation and integrated navigation systems such as the
and route guidance, bus priority at junctions, fleet integration of GPS and DR. The review of the
management, road user charging, accident and literature suggests that the integrated GPS and
emergency management, and Location-Based DR system increases the performance of a map
matching algorithm if all other factors are equal,
DOI: 10.4018/978-1-4666-2038-4.ch040 especially in the case for dense and built-up urban

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Map Matching Algorithms for Intelligent Transport Systems

areas. However, even with good sensor calibration METHODOLOGIES USED IN MAP
and sensor fusion technologies, inaccuracies in MATCHING ALGORITHMS
the GPS/DR system are inevitable. This is mainly
due to the errors in both GPS and DR sensors. The general purpose of a map matching algorithm
The spatial road network data are the other is to identify the correct road segment on which the
important component of a map matching algo- vehicle is travelling and to determine the vehicle
rithm. Since land vehicles primarily travel on location on that segment. The parameters used to
known road networks, the spatial road network select a precise road segment are mainly based
data are used as a spatial reference to locate them. on the proximity between the position fix and the
This, for example, assists drivers to relate their road, the degree of correlation between the vehicle
observed positions obtained from the navigation trajectory derived from the position fixes and
system with a physical location in the real-world the road centreline, and the topology of the road
and hence can guide them along a pre-calculated network. Orthogonal projection of the position fix
route. However, the spatial road network data also onto the selected road segment is normally used
contain errors arising mainly from the processes to calculate the vehicle location on the segment.
of creation and digitization of maps. The errors Figure 1 shows a general map matching process
can be estimated using either the scale of the map (see Quddus, 2006 for details) which takes inputs
or field experiments. from an integrated GPS/DR such as easting (E),
The performance of a map matching algorithm northing (N), speed (v), and heading (θ) and the
can be improved in two ways: (1) by controlling the error variances associated with them. The map
errors in both positioning sensors and spatial road matching process also takes inputs from a spatial
network data and (2) by improving the technique digital road network database. The outputs of the
used in the map matching process. A number of algorithm are the correct link on which the vehicle
map matching algorithms have been developed by is travelling and the location of the vehicle () and
researchers around the world using different tech- the error variances associated with them.
niques such as topological analysis of spatial road Approaches for map matching processes found
network data, probabilistic theory, Kalman filter, in the literature can be categorised into three
fuzzy logic, and belief theory. The performance of groups: geometric, topological and advanced.
these algorithms has improved over the years due These are briefly discussed below.
to the application of advanced techniques in the
map matching processes and to the improvements Geometric Map Matching Algorithms
of the quality of both positioning data and spatial
road network data over time. The purpose of this A geometric map matching algorithm makes use
article is to present an in-depth literature review of the geometric information of the digital road
on map matching algorithms and to suggest the network by considering only the shape of the links
future trends in map matching research. (Greenfeld, 2002). It does not consider the way
The rest of the article is structured as follows. links are connected to each other. The most com-
The next section describes methodologies used in monly used geometric map matching algorithm is
map matching algorithms. This is followed by a a simple search algorithm. In this approach, each
description of future trends in the development of the positioning fixes matches to the closest
of map matching algorithms. Finally, the conclu- ‘node’ or ‘shape point’ of a road segment. This is
sion is given. known as point-to-point matching (White et al.,
2000). This approach is both easy to implement
and very fast. However, it is very sensitive to

659
Map Matching Algorithms for Intelligent Transport Systems

Figure 1. A map matching algorithm

the way in which the network was digitised and et al., 2000; Phuyal, 2002). This approach first
hence has many problems in practice. That is, identifies the candidate nodes using point-to-
other things being equal, arcs with more shape point matching. Then, given a candidate node, it
points are more likely to be properly matched. In constructs piecewise linear curves from the set
a straight arc with two end nodes, all positioning of paths that originate from that node. Secondly,
points that fall above the arc only match with the it constructs piecewise linear curves using the
arc’s end nodes. vehicle’s trajectory i.e., the positioning points,
Another geometric map matching approach and determines the distance between this curve
is point-to-curve matching (White et al., 2000). and the curve corresponding to the road network.
In this approach, the position fix obtained from The road arc which is closest to the curve formed
the navigation system is matched onto the closest from positioning points is taken as the one on
curve in the network. Each of the curves is com- which the vehicle is apparently travelling. This
prised of a set of piecewise linear line segments. approach is quite sensitive to outliers and depends
Distance is calculated from the position fix to each on point-to-point matching, sometimes giving
of the line segments. The line segment which gives unexpected results.
the smallest distance is selected as the one on which
the vehicle is apparently travelling. Although this Topological Map Matching
approach gives better results than point-to-point Algorithms
matching, it does have several shortcomings that
make it inappropriate in practice. For example, it In GIS, topology refers to the relationship be-
gives very unstable results in urban networks due tween entities (points, lines, and polygons). The
to the high road density. Moreover, the closest link relationship can be defined as adjacency (in the
may not always be the correct link. case of polygons), connectivity (in the case of
The other geometric approach is to compare lines), or containment (in the case of points in
the vehicle’s trajectory against known roads. This polygons). Therefore, a map matching algorithm
is also known as curve-to-curve matching (White which makes use of the geometry, connectivity, and

660
Map Matching Algorithms for Intelligent Transport Systems

contiguity of the links is known as a topological Kim and Kim 2001, Syed and Cannon, 2004,
map matching algorithm (Greenfeld, 2002, and Quddus et al., 2006), or the application of Bayes-
Meng et al., 2003). ian interference (Pyo et al., 2001). The details of
Greenfeld (2002) reviews several approaches these algorithms can be found in Quddus (2006).
for solving the map matching problem and pro- However, the probabilistic and fuzzy logic-based
poses a weighted topological algorithm. This is algorithms are briefly discussed below.
based on a topological analysis of a road network The probabilistic approach to a map match-
and uses only coordinate information on observed ing algorithm was first introduced by Honey et
positions of the user. It does not consider any al. (1989) for the case of a navigation solution
heading or speed information determined from obtained from a DR sensor. Zhao (1997) extended
GPS. This method is very sensitive to outliers as this concept for the case of a navigation solution
these can lead to inaccurate vehicle heading cal- obtained from GPS. Other studies which used a
culations. The other deficiency of this algorithm probabilistic approach or a part of this approach
is not to take into account the vehicle speed as are, for example, Kim et al. (2000) and Pyo et
GPS position fixes are scattered randomly when al. (2001. The probabilistic algorithm requires
the speed is less than 3.0 m/sec (Taylor et al., 2001 the definition of an elliptical or rectangular
and Ochieng et al., 2004). confidence region around a position fix obtained
An enhancement of the topological map match- from a navigation sensor. The error region is then
ing algorithm is proposed by Quddus et al. (2003). superimposed on the road network to identify a
The basic characteristic of this algorithm is the use road segment on which the vehicle is travelling.
of navigation data from either GPS or an integrated If an error region contains a number of segments,
GPS/DR system that includes position, speed, and then the evaluation of candidate segments are
heading. Data on the historical trajectory of the carried out using heading, connectivity, and
vehicle (i.e., data on earlier map-matched posi- closeness criteria. Indeed, there are many other
tions) is used to avoid sudden switching of mapped parameters e.g., speed of the vehicle, distance to
locations between unconnected road links. The the downstream junction, etc. that can be used to
topological aspects of the road network such as improve the probabilistic map matching process.
link connectivity and shape points (representing Ochieng et al. (2004) develops an enhanced
link curvature) are used to improve the perfor- probabilistic map matching algorithm. In this
mance of the algorithm. Heading and speed data algorithm, the elliptical error region is only
from the positioning sensors are also employed constructed when the vehicle travels through a
to further improve the algorithm’s performance, junction, in contrast to constructing it for each po-
especially at junctions. sition fix as suggested by Zhao (1997). Therefore,
there is no need to create the error region when
Advanced Map Matching Algorithms the vehicle travels along a link. This makes the
algorithm faster as there are a number of processes
Advanced map matching algorithms are referred to involved in the creation of the error region and
as those algorithms that use more refined concepts hence the identification of the correct link. The
such as a probabilistic theory (Zhao, 1997, Ochieng method is more reliable as the construction of an
et al., 2004), a Kalman Filter (e.g., Tanaka et al., error region in each epoch may lead to incorrect
1990, Kim et al., 2000), the Dempster-Shafer’s link identification.
theory (also known as Belief Theory) (e.g., Najjar Fuzzy logic based map matching algorithms
and Bonnifait, 2003; Yang et al., 2003), a fuzzy have been developed by a number of researchers
logic model (e.g., Zhao, 1997; Kim et al., 1998, (e.g., Zhao, 1997; Kim et al., 1998, Fu et al., 2004,

661
Map Matching Algorithms for Intelligent Transport Systems

Syed and Cannon, 2004) over the last few years. ing algorithms are not capable of satisfying the
The limitations of these algorithms include: (a) requirements of all ITS applications and services.
ignoring most of the available inputs to the fuzzy For instance, bus priority at junctions require a
logic model, (b) overlooking the connectivity positioning accuracy of 5 m (95%) and none of
among road links and the historical trajectory of the existing algorithms can meet this positioning
the vehicle, which can enhance the performance requirement, especially, within dense urban areas.
of map matching algorithms, (c) ignoring the error
sources associated with the spatial road network
and the navigation sensor, and (d) failing to vali- FUTURE TRENDS
date the map matching algorithms to assess their
performance. Further improvements in map matching algorithms
Quddus et al. (2006) developed an improved are essential as existing algorithms are not always
fuzzy logic-based map matching algorithm that capable of satisfying user requirements. The per-
took account of these limitations. Specifically, a formance and speed of a map matching algorithm
number of new input variables (at no extra cost) largely depends on the technique used in the algo-
were included in the fuzzy logic map matching rithm. For instance, it has been found that the fuzzy
algorithm. These were: the speed of the vehicle, logic based map matching algorithms provide better
the connectivity among road links, the quality performance compared with other methods for the
of position solution, e.g., Horizontal Dilution same inputs. Therefore, future research on map
Of Precision (HDOP), and the position of a fix matching algorithms will focus on the technique
relative to a candidate link. These inputs were used in the algorithm. Other potential techniques
incorporated into the rules in order to improve would be to employ a pattern recognition approach,
the performance of the algorithm. The output of a hieratical fuzzy inference system optimised by a
the fuzzy inference system (FIS) was the link on genetic algorithm (GA), or a hybrid method. Other
which the vehicle is travelling. The membership potential areas of research would be to improve the
functions were then optimised using an accurate inputs to a map matching algorithm. For instance,
input/output dataset. This dataset was obtained the deployment of European Galileo and the mod-
from high accuracy GPS carrier phase observa- ernization of GPS over the next few years will have
tions. Moreover, three sets of knowledge-based a profound impact on future GNSS receiver design
fuzzy rules were formulated when the navigation which will enhance positioning data. The accuracy
data came from GPS/DR. The first set (six rules) of spatial road network data is also continuing to
was for an initial map-matching process (IMP). improve over time. It is therefore expected that
The second set (thirteen rules) was for subsequent future map matching algorithms together with the
map-matching on a link (SMP-Link). The third set use of these improved inputs will be able to sup-
(four additional rules) was for subsequent map- port the navigation function of ITS services with
matching at a junction (SMP-Junction). The results a horizontal accuracy of 5 m (95%) even in dense
showed that the algorithm is able to identify 99.2% and built-up urban areas.
of the links correctly with horizontal accuracy of
5.5m (95%) in suburban area of London.
In summary, it can be said that different algo- CONCLUSION
rithms have different strengths and weaknesses.
Some algorithms may perform very well within Map matching algorithms play a key role to
suburban areas but may not be appropriate for support the navigation module of an ITS. The
urban areas and vice versa. Existing map match- performance of a map matching algorithm largely

662
Map Matching Algorithms for Intelligent Transport Systems

depends on the quality of the spatial road network Kim, W., Jee, G., & Lee, J. (2000). Efficient use
data, the quality of the positioning data, and of digital road map in various positioning for
the methodologies used in the algorithm. This ITS. IEEE Symposium on Position Location and
article has mainly discussed techniques used by Navigation, San Deigo, CA.
map matching algorithms. It has been revealed
Meng, Y. (2006). Improved Positioning of Land
that the performance of a fuzzy logic based map
Vehicle in ITS Using Digital Map and Other Ac-
matching algorithm is better when compared to
cessory Information. PhD Thesis, Department of
other algorithms. However, existing map match-
Land Surveying and Geoinformatics, Hong Kong
ing algorithms cannot meet the requirements for
Polytechnic University.
some applications, especially in built-up urban
areas and hence further improvement is necessary. Najjar, M. E., & Bonnifait, P. (2003). A roadmap
This article then discusses the future trends in the matching method for precise vehicle Localization
development of map matching algorithms. It is using belief theory and Kalman filtering. The 11th
envisaged that a future map matching algorithm International Conference in Advanced Robotics,
will also emphasize the reliability of the position- Coimbra, Portugal, June 30 - July 3.
ing solution.
Ochieng, W. Y., Quddus, M. A., & Noland, R.
B. (2004). Integrated positioning algorithms for
transport telematics applications. In proceedings
REFERENCES
of the Institute of Navigation (ION) annual confer-
Fu, M., Li, J., & Wang, M. (2004). A hybrid map ence, 20-24 September, California, USA.
matching algorithm based on fuzzy comprehen- Phuyal, B. (2002). Method and use of aggre-
sive Judgment, IEEE Proceedings on Intelligent gated dead reckoning sensor and GPS data for
Transportation Systems, 613-617. map matching. In proceedings of the Institute
Greenfeld, J. S. (2002). Matching GPS observa- of Navigation (ION) annual conference, 20-27
tions to locations on a digital map. In proceedings September, Portland, OR.
of the 81st Annual Meeting of the Transportation Pyo, J., Shin, D., & Sung, T. (2001). Development
Research Board, January, Washington D.C. of a map matching method using the multiple
Honey, S. K., Zavoli, W. B., Milnes, K. A., Phil- hypothesis technique. IEEE Proceedings on
lips, A. C., White, M. S., & Loughmiller, G. E. Intelligent Transportation Systems (pp. 23-27).
(1989). Vehicle navigational system and method, Quddus, M. A. (2006). High integrity map match-
United States Patent No., 4796191. ing algorithms for advanced transport telematics
Kim, S., & Kim, J. (2001). Adaptive fuzzy-net- applications. PhD Thesis, Centre for Transport
work based C-measure map matching algorithm Studies, Department of Civil and Environmental
for car navigation system. IEEE Transactions Engineering, Imperial College London.
on Industrial Electronics, 48(2), 432–440. Quddus, M. A., Ochieng, W. Y., & Noland,
doi:10.1109/41.915423 R. B. (2006). A high accuracy map match-
Kim, S., Kim, J.-H., & Hyun, I.-H. (1998). De- ing algorithm for road transport. Journal of
velopment of a map matching algorithm for car Intelligent Transportation Systems: Technol-
navigation system using fuzzy Q-factor algorithm. ogy, Planning, and Operations, 10(3), 103–115.
In proceedings of the World Congress Intelligent doi:10.1080/15472450600793560
Transport System, October, Seoul, Korea.

663
Map Matching Algorithms for Intelligent Transport Systems

Quddus, M. A., Ochieng, W. Y., Zhao, L., & KEY TERMS AND DEFINITIONS
Noland, R. B. (2003). A general map matching
algorithm for transport telematics applications. Accuracy: Accuracy is defined as the nearness
GPS Solutions, 7(3), 157–167. doi:10.1007/ of a measurement to the standard or true value i.e.,
s10291-003-0069-z a highly accurate navigation system will provide
measurements very close to the standard, true or
Syed, S., & Cannon, M. E. (2004). Fuzzy logic- known values. A rigorous statement of accuracy
based map matching algorithm for vehicle naviga- includes statistical measures of uncertainty and
tion system in urban canyons. In proceedings of the variation. Accuracy is generally represented by
Institute of Navigation (ION) national technical standard deviation of errors (difference between
meeting, 26-28 January, California, USA. measurements on the map and the true value).
Tanaka, J., Hirano, K., Itoh, T., Nobuta, H., & Dead-Reckoning (DR): DR is the process of
Tsunoda, S. (1990). Navigation system with estimating the position of a vehicle (or a moving
map-matching method. Proceeding of the SAE object) with respect to a known position using
International Congress and Exposition (pp 40-50). information on heading, speed, time, and distance
travelled by the vehicle.
Taylor, G., Blewitt, G., Steup, D., Corbett, S., Fuzzy Logic: Fuzzy logic is a superset of con-
& Car, A. (2001). Road Reduction Filtering for ventional (Boolean) logic that has been extended
GPS-GIS Navigation. Transactions in GIS, ISSN to handle the concept of partial truth i.e., truth
1361-1682, 5(3), 193-207. values between “completely true” and “com-
White, C. E., Bernstein, D., & Kornhauser, A. pletely false”. It was introduced in the 1960’s by
L. (2000). Some map matching algorithms for Zadeh (1965). It is suitable to deal with problems
personal navigation assistants. Transportation involving knowledge expressed in vague, and
Research Part C, Emerging Technologies, 8, linguistic terms.
91–108. doi:10.1016/S0968-090X(00)00026-7 Global Positioning System (GPS): GPS is
a satellite-based radio-navigation, positioning,
Yang, D., Cai, B., & Yuan, Y. (2003). An improved and time-transfer system. It is designed, financed,
map-matching algorithm used in vehicle naviga- and deployed by the US Department of Defense
tion system. IEEE Proceedings on Intelligent (US DoD) and operated jointly by the US DoD
Transportation Systems, 2, 1246–1250. and the Department of Transportation (US DoT).
Zadeh, L. A. (1965). Fuzzy sets. Information Intelligent Transportation Systems (ITS):
and Control, 8, 338–353. doi:10.1016/S0019- ITS can be defined as the integrated application of
9958(65)90241-X advanced sensors, computers, electronics, naviga-
tion, and communication technologies to vehicles
Zhao, L., Ochieng, W. Y., Quddus, M. A., & No- and roadways that increase safety, reduce conges-
land, R. B. (2003). An Extended Kalman Filter tion, enhance mobility, minimize environmental
algorithm for Integrating GPS and low-cost Dead impact, increase energy efficiency, and promote
reckoning system data for vehicle performance economic productivity for a healthier economy.
and emissions monitoring. Journal of Navigation, Map Matching (MM): MM is a process to
56, 257–275. doi:10.1017/S0373463303002212 match the positioning data (i.e., longitude and
Zhao, Y. (1997). Vehicle Location and Navigation latitude or Easting and Northing) onto the digital
System. Artech House, Inc., MA. road map data.

664
Map Matching Algorithms for Intelligent Transport Systems

Topology: Topology is the relationship be- case of polygons), connectivity (in the case of
tween entities (points, lines, and polygons). The lines), or containment (in the case of points in
relationship can be defined as adjacency (in the polygons).

This work was previously published in Handbook of Research on Geoinformatics, edited by Hassan A. Karimi, pp. 302-309,
copyright 2009 by Information Science Reference (an imprint of IGI Global).

665
666

Chapter 41
Describing and Selecting
Collections of Georeferenced
Media Items in Peer-to-Peer
Information Retrieval Systems
Daniel Blank
University of Bamberg, Germany

Andreas Henrich
University of Bamberg, Germany

ABSTRACT
The ever-increasing amount of media items on the World Wide Web and on private devices leads to a
strong need for adequate indexing and search techniques. Trends such as personal media archives, social
networks, mobile devices with huge storage space, and networks with high bandwidth capacities make
distributed solutions and in particular Peer-to-Peer (P2P) Information Retrieval (IR) systems attractive.
On the other hand, when designing effective media search applications, various search criteria have to
be addressed. Hereby, geospatial information is frequently used as well as other criteria, such as text,
audio or visual media content, and date and time information.

In this chapter, the authors outline how collections of georeferenced media items can be indexed and
searched in P2P IR systems. They discuss different types of P2P IR systems and focus in detail on an
approach based on collection description and selection techniques. This approach tries to adequately
describe and select collections of georeferenced media items. Finally, the authors discuss its broad ap-
plicability in various application fields.

DOI: 10.4018/978-1-4666-2038-4.ch041

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Describing and Selecting Collections of Georeferenced Media Items

INTRODUCTION P2P IR systems offer the benefit that media items


can remain on individual devices since there is no
In recent years, the availability—and with it the need for storing them on remote servers hosted
usefulness—of geospatial metadata has increased by third party service providers. Crawling which
dramatically. Digital cameras and mobile phones consumes large amounts of web traffic (Bockting
are nowadays often equipped with GPS sensors & Hiemstra, 2009) can thus be avoided. In addi-
at affordable cost. Hence, such devices are able tion, dependency from service providers acting
to capture georeferenced information in the per- as informational gatekeepers can be reduced,
sonal lives of millions of people from all over the because they are no longer able to decide which
world. In addition, geo-tagging tools with rich user information can be retrieved or accessed and which
interfaces have emerged in different domains and cannot. In times of a strong market concentration
large geo-tagging initiatives try to georeference in internet search and social network applications
textual resources such as in case of Wikipedia. as well as public debates addressing the privacy
As a consequence, an increased importance of of data, P2P IR could offer some benefits.
geospatial information in the context of search As our primary use case, georeferenced images
can be recognized. are administered in a P2P IR system. The images
Obviously, geospatial information is not the of a certain user are stored locally on the user’s
only search criterion. When searching for media personal device(s) and a scalable P2P IR protocol
items other criteria such as textual content, time- is applied in order to facilitate retrieval. An image
stamps, and (low-level) audio or visual content can hereby be described by various criteria: textual
information can be used as well—often in an metadata, (low-level) visual content features, a
integrative way. A combination of these criteria timestamp, and a geographic coordinate. Personal
can allow for the effective retrieval of text, image, media collections containing multiple images can
audio, and video documents. thus be represented by corresponding collection
As we all know, the amount of media items descriptions allowing for efficient and effective
on the World Wide Web and on private devices collection selection when processing a given query.
steadily increases. Service providers such as Flickr, We assume in the following that at least some of
YouTube, or Facebook (Beaver, et al., 2010) have the images of a peer are geo-tagged. A resource
to maintain huge hardware infrastructures in order description capturing the geographic footprint of
to keep up with the tremendous increase in data an image collection can thus be generated from
volumes. So far, it is unclear if existing server- the set of georeferenced images. In this chapter,
centered solutions will also suit our needs in the we focus on geospatial query processing. In par-
future. Hence, a need for alternative indexing and ticular, we address geospatial k nearest neighbor
search techniques might arise. (k-NN) queries—finding the k closest media items
Peer-to-Peer (P2P) Information Retrieval (IR) according to a given query location.
systems consist of computers from all over the In literature, many approaches for P2P IR can
world. These computers can act as both clients be found. The following section entitled Peer-to-
and servers. By applying a scalable P2P IR pro- Peer Information Retrieval Systems for Geospatial
tocol, a “service of equals” for the administration Search will give an overview on different types of
of media items can be established in contrast to P2P IR systems and outline how georeferenced me-
existing client/server-based solutions. No expen- dia items can be indexed in a P2P setting in general.
sive infrastructure has to be maintained and idle We additionally describe how a comprehensive
computing power in times of inactivity can be indexing of the abovementioned search criteria
used to maintain, analyze, and enrich media items. (geospatial, textual, date and time, and audio or

667
Describing and Selecting Collections of Georeferenced Media Items

visual content information) can be achieved. In Unstructured Topologies


addition, we discuss associated consequences for
query processing in a distributed scenario. Data-independent: Main protocols in this group
A P2P IR system which might use the above- are PlanetP (Cuenca-Acuna, et al., 2003) and its
mentioned criteria and which is based on resource extension Rumorama (Müller, et al., 2005). In
(i.e. collection) selection is presented in detail in Rumorama, a peer sees the network as a single,
section Resource Selection based on Descriptions small PlanetP network (called subnet) with con-
of Geospatial Footprints. Resource selection nections to other peers, which see other PlanetP
techniques are applied to determine a ranking subnets. Each peer can choose the size of its
of promising resources based on descriptions of subnet according to local processing power and
their geospatial footprint. Peers are contacted in bandwidth capacity. Within a subnet, a peer knows
ranked order to retrieve the media items with the data summaries of all other peers in the subnet.
closest locations according to a user-given query Gossiping techniques are used to disseminate the
location. It is important to note that the core of data summaries. In a subnet, summary-based re-
our P2P IR system—its resource description and source selection allows for semantic query routing.
selection techniques addressing georeferenced Additionally, a peer maintains a small set of links
media items—is not limited to P2P IR systems. pointing to neighboring peers in other subnets in
Different application fields and usage scenarios of order to be able to forward queries outside the
these resource description and selection techniques boundaries of its own subnet.
are therefore discussed at the end of this chapter. In its original form, peers are assigned to
In a Future Work section, we will focus on subnets arbitrarily, i.e. independent of the peers’
different open aspects w.r.t. geospatial resource content. However, Rumorama can be easily
description and selection before we end with a extended by a grouping of peers—similar to the
Conclusion section. content-dependent overlays described in the fol-
lowing—for example by content or by geospatial
proximity of the host, which operates the peer.
PEER-TO-PEER INFORMATION A further benefit of the resource descriptions is
RETRIEVAL SYSTEMS FOR their possibility of being visualized. Thus, they
GEOSPATIAL SEARCH might be applied for interactive retrieval, e.g. by
providing—with low bandwidth requirements—a
P2P IR systems are mostly classified as being visual overview of peer data for a large number
structured or unstructured overlay networks. In of peers.
addition, we introduce the distinction of data- Routing indexes in various forms represent
independent and data-dependent overlays as a aggregated information in an unstructured P2P IR
secondary classification criterion to reflect if, for system maintained at a peer for all its neighboring
example, a peer’s content or query profiles affect peers in order to decide in which direction queries
overlay generation. This distinction provides should be forwarded. Initially designed for one-
helpful insights in order to pinpoint different dimensional values in order to avoid network
characteristics in a more organized way. In the flooding in the early days of P2P computing,
following subsections, we will briefly discuss they have for example been extended to allow for
unstructured and structured as well as hybrid and multi-dimensional queries. Here, bounding boxes
super-peer approaches. are used to summarize the content of neighboring
peers (for references cf. Doulkeridis, et al., 2009).

668
Describing and Selecting Collections of Georeferenced Media Items

Data-dependent: Many Semantic Overlay the COIs. Since many approaches are based on
Networks (SONs; for references and a detailed a similarity measure between resources’ content,
description cf. Doulkeridis, et al., 2010) are data- different criteria can be integrated when determin-
dependent, unstructured P2P networks. Here, the ing the similarity of peers in order to group them
content of a peer’s data or information about past together. Alternatively, multiple overlays might be
queries defines a peer’s place in the network. Thus, maintained, i.e. one overlay per search criterion.
summaries of a peer’s content or query profiles are
needed. Two types of links are usually maintained Structured Topologies
in such systems: short links grouping peers with
similar content or query profiles into so called Data-independent: Structured P2P IR systems
“Clusters of Interest” (COIs) and long links which are based on distributed indexing structures.
are established between different COIs. During Distributed Hash Tables (DHTs) represent the
query processing, the query has to be forwarded most prominent class member. Every peer in
to the most promising COI(s). Clustering, clas- the network is usually responsible for a certain
sification, and gossiping techniques can be applied range of the feature space. Thus, when entering
in order to form COIs. the network or updating local content, index data
Indexing of Multiple Criteria in Unstruc- has to be transferred to remote peers according
tured Topologies: Many of the unstructured, to the peers’ responsibilities. In case of data-
data-independent P2P IR systems are based on independent, structured P2P IR systems, terms (cf.
resource descriptions which summarize a single Bender, et al., 2005) or high-dimensional feature
peer’s content, summarize the content of multiple vectors for content-based image retrieval (cf.
peers reachable when following a certain direc- Novak, et al., 2008; Lupu, et al., 2007; Vu, et al.,
tion, or for example summarize information about 2009) are usually mapped to one-dimensional or
past queries. In general, it is possible to apply one multi-dimensional keys. They can be indexed in a
summary type and a corresponding resource selec- classical DHT such as Chord (Stoica, et al., 2001)
tion technique per feature type. Feature-specific or CAN (Ratnasamy, et al., 2001) respectively. It
peer rankings can be combined by applying an has to be noted that there is a large variety of such
algorithm for the merging of ranked lists (Belkin, P2P protocols. Very detailed information with
et al., 1995; Ilyas, et al., 2008). As an alternative references pointing to relevant research articles
for creating independent resource descriptions per can be found in Shen et al. (2010).
search criterion, summaries and resource selection Data-dependent: SONs—as described
algorithms integrating multiple feature types are above—can also be implemented on top of a DHT
also possible (cf. Hariharan, et al., 2008). in order to enhance query routing (Doulkeridis,
In section Resource Selection based on De- et al., 2010). Various forms of clustering, clas-
scriptions of Geospatial Footprints we will focus sification together with gossiping techniques can
on resource description and selection techniques be applied in order to establish links to peers with
for geospatial information. The design of resource similar content.
description and selection techniques for textual Indexing of Multiple Criteria in Structured
data is for example addressed in Cuenca-Acuna Topologies: In structured, data-independent sys-
(2003). Summaries for high-dimensional feature tems, correlations between different criteria are
vectors in order to summarize content-based media difficult to exploit when indexing multiple feature
features are outlined in Blank et al. (2007). types (e.g. geospatial and image content informa-
In data-dependent unstructured networks mul- tion). If we for example assume an image from
tiple search criteria can be addressed when forming the Sahara Desert with shades of beige sand and

669
Describing and Selecting Collections of Georeferenced Media Items

blue sky, different peers might be responsible for capacity, processing power, or available network
indexing the geospatial and the image content in- bandwidth. Often, concepts known from “true”
formation. Thus, when distributing the index data P2P IR systems are extended and transferred to
of the Sahara image, querying for it, or removing super-peer architectures. Also within super-peer
it from the network, (at least) two different peers networks the convergence of different approaches
have to be contacted. Within SONs, the simulta- can be seen. Doulkeridis et al. (2009) for example
neous indexing of multiple criteria would again apply multi-dimensional routing indexes on a
require the definition of a similarity between peers’ super-peer level and additionally group similar
content by combining for example geospatial and super-peers close together in order to allow for
image content information. Alternatively, multiple improved query routing.
overlays might be maintained.

Hybrid and Super-Peer Approaches RESOURCE SELECTION


BASED ON DESCRIPTIONS OF
In unstructured P2P IR systems, a peer only ad- GEOSPATIAL FOOTPRINTS
ministers index data of media items, which belong
to its user. Thus, when entering the network or Geospatial Resource Description
updating media items, full index data does not and Selection in Rumorama-
have to be transferred to remote peers. Peer au- Like P2P IR Systems
tonomy is better respected compared to structured
networks (Doulkeridis, et al., 2010). On the other It is important to note that the resource selection
hand, structured systems offer query processing techniques presented in the following are not re-
with logarithmic cost. In order to reduce the load stricted to data-independent, unstructured P2P IR
imposed on the network when inserting new systems such as Rumorama. The summaries can
media items in structured systems, super-peer also be applied in data-dependent, unstructured
architectures (Papapetrou, et al., 2007) as well as P2P IR systems when forming COIs and in struc-
DHT-based indexing of compact data summaries tured networks in order to be indexed for example
instead of full index data have been proposed (cf. in a DHT. In addition, summaries can be used by
Lupu, et al., 2007). super-peers for selecting “normal” peers or other
In general, there is a convergence of structured super-peers. Here, it might be possible to relax the
and unstructured P2P IR systems with many hybrid strict compliance of some design parameters. A
approaches. We have for example evaluated an system based on super-peers might for example
approach where index data is transferred amongst allow for less space efficient resource descriptions
peers in certain rounds in order to make peers more in order to achieve better retrieval performance.
focused and—as a consequence—summaries In this section, we will present four different
more selective. More selective summaries with resource description and corresponding selection
peers having specialized on a certain range of techniques for geospatial information. Further ap-
the feature space lead to more efficient resource plication fields of the analyzed geospatial resource
selection (Eisenhardt, et al., 2008). description and selection techniques outside the
There is plenty of work addressing super-peer P2P IR context are also possible. We will focus
architectures (for references cf. Doulkeridis, et al., on this aspect in the next section which is entitled
2009). They are designed in order to overcome Resource Descriptions and Selection Techniques
some limitations of “true” P2P IR systems and in different Application Fields.
make use of increased capabilities such as storage

670
Describing and Selecting Collections of Georeferenced Media Items

In this chapter, we outline and extend earlier box over the geospatial coordinates of its image
work from Blank and Henrich (2009, 2010) mainly collection (refer to. Figure 1). We encode a lati-
according to two major directions. First, we ana- tude/longitude-pair (for short: lat/lon-pair) with 8
lyze peer ranking selectivity in more detail by not bytes, 4 for latitude and 4 for longitude. Therefore,
only providing the average fraction of contacted we require 16 bytes of raw data for the bounding
peers as the main performance measure. We ana- box (i.e. two lat/lon-pairs, e.g. the lower left and
lyze the distribution of the fraction of contacted upper right corner).
peers over the set of queries, which gives additional Peer ranking is performed as follows. If a peer
insights. Second, we show that also on a global pa contains the query location in its bounding box
scale (i.e. based on a data collection of geo-tagged whereas peer pb does not, peer pa is ranked
Flickr images from different areas of the world), higher than peer pb and vice versa. In case the
it is sufficient in our scenario to approximate query location lies in the bounding box of both
distances by the use of Euclidean distance when peers pa and pb, the size of a peer (i.e. the number
searching for the 20 nearest neighbors according of documents a peer administers) is used as an
to a given query location. additional criterion. Peers with more documents
We use the terms image and document inter- are ranked higher. In earlier studies, several al-
changeably in order to refer to concrete media ternatives have been evaluated (cf. Blank &
items. This is due to the fact that our experiments Henrich, 2010) and this strategy turned out to be
in Blank and Henrich (2009, 2010) and also in beneficial. If neither peer pa nor peer pb contain
this very chapter are based on image documents. the query within its bounding box, the peer with
Nevertheless, we believe that our approach can the smaller minimum distance from the query
be extended to other media types. Of course, by location to its bounding box is preferred. We as-
restricting our analysis on images, we implicitly sume a spherical model of the earth with a radius
allow only a single lat/lon-coordinate per media of 6371 km. If not stated otherwise, we use Ha-
item. This might be different in case of textual versine distance (Sinnott, 1984) to compute the
documents for example where several locations distance between two points on the sphere.
could be referenced within a single document. A Of course, it is also possible to represent
detailed analysis of these and related challenges the geospatial footprint of a peer by multiple
could be part of future work. bounding boxes. Becker et al. (1991) present
an algorithm for summarizing a set of bounding
Bounding Boxes (BB) boxes by two bounding boxes which minimize
the area that is covered. Chen et al. (2006) pro-
When using bounding boxes as resource descrip- pose several threshold-based algorithms to split a
tions, every peer computes a minimum bounding single bounding box into several smaller ones in

Figure 1. Visualizing summary creation for BB (left), GRID4 (middle), and HFS8/UFS8 (right: ■ cor-
responds to reference points). Four documents indicated as x are geo-tagged.

671
Describing and Selecting Collections of Georeferenced Media Items

order to reduce the space within a bounding box based on a single cell; in the second round it is in
which is not covered by any index data. These most cases based on 1+8=9 cells and in the third
approaches demonstrate that there is optimization round on 1+8+16=25 cells and so on. This is not
potential w.r.t. the representation of geo-regions always the case since there might be no neighbor-
in geospatial indexing. However, the mentioned ing cells in a certain direction, e.g. as soon as a
approaches stick with bounding boxes and the ap- cell in the north or south is reached. Of course,
proaches presented later on in this chapter might be at the 180° meridian we assume that there is no
interesting alternatives in this application field. A boundary and neighborhood relations are valid
detailed comparison might be part of future work. in both directions. The ranking criterion in every
round is the number of grid cells containing one
Grid-Based Summaries (GRIDr) or more image location(s)—the more the better.
Of course, there are alternative approaches in
In a second approach, the lat/lon-coordinates are literature for mapping the spherical coordinates
mapped to a grid (see. Figure 1). A parameter r to a grid for example in order to achieve uniform
represents the number of rows of the grid. The grid cell sizes on the sphere (Putman, et al., 2007).
number of columns is twice the number of rows In various domains, cubed grids (for references cf.
since longitude range is twice as big as latitude Putman, et al., 2007), or triangular meshes (Szalay,
range. The range of a grid cell (in degrees) is de- et al., 2005) have been proposed. These techniques
termined by 180° / r = 360° / (2r) in the latitude might also be considered in future work, although
and longitude domain respectively. This simplified we do not expect large improvements compared
view is for example also applied in Dolin et al. to our naïve approach—neither in terms of rank-
(1997) and results in non-uniform grid cell sizes ing selectivity nor in terms of reduced summary
on the sphere. We gain selectivity and retrieval sizes. We gain ranking selectivity by increasing
performance by increasing the number of grid the number of grid cells. On the other hand, this
cells at the price of additional storage overhead increase does only lead to a sublinear increase
partially compensated through compression tech- of resource description sizes since compression
niques (cf. section Experiments). Every grid cell is techniques are applied.
represented by a single bit. If one or more image
locations fall into a certain cell, the corresponding Highly Fine-Grained Summaries (HFSn)
bit is set to 1. Otherwise, it remains 0. Bit posi-
tions in the summary are determined horizontally This approach is based on resource descriptions
from left to right and from bottom to top. Effects originally designed for summarizing visual con-
of alternative strategies on compression could be tent information of images, for example the color
evaluated in future work. distribution or texture of an image (Blank, et al.,
During peer ranking the grid cell containing 2007). A set of n predetermined image locations
the query location is determined. If peer pa has an are used as reference points. This set of reference
image in this cell whereas peer pb has not, peer pa is points is known to all peers and built from external
ranked higher than peer pb and vice versa. We also sources (a Gazetteer is used in our case). How to
consider neighboring grid cells. If either both or adequately obtain the reference points is outlined
none of peer pa and peer pb have an image located in the remainder of this chapter. So far, it is suffi-
within the cell containing the query location, the cient to note that every image location of a peer’s
GRID approach considers the neighboring cells local image collection is assigned to the closest
recursively until a ranking decision can be made. reference point according to Haversine distance
So, in the first round the ranking decision is always (cf. Figure 1, right). Hereby, a cluster histogram is

672
Describing and Selecting Collections of Georeferenced Media Items

computed counting how many image locations of subtree is similar to the resource selection prob-
a peer’s collection are closest to a certain reference lem. Summaries in the P2P context correspond for
point, i.e. cluster center cj (1 ≤ j ≤ n). example to aggregations maintained in the nodes
Peer ranking is performed as follows. Refer- of a tree, for example bounding boxes in the case
ence points cj are sorted in ascending order ac- of an R-tree (Guttman, 1984). We therefore point
cording to Haversine distance to the query. The the interested reader to Samet (2006).
first element of the sorted list L corresponds to the
cluster center being closest to the query. Peers with Experiments
more documents in this so called query cluster are
ranked higher than peers with fewer documents in Data Collections
the query cluster. If two peers administer the same
amount of documents in the analyzed cluster, the Two data collections of geo-tagged images are
next element out of L is chosen and both peers used in our experiments:
are ranked according to the number of documents
within the very cluster. This procedure continues 1. Geoflickr: During the year 2007 a large
until either a ranking decision can be made, which amount of publicly available images was
favors one peer over the other, or the end of L crawled which had been uploaded to Flickr
is reached. In the latter case, a random decision (http://www.flickr.com). In our scenario, ev-
would be made. ery Flickr user operates a peer of its own. We
therefore assign images to peers by means of
Ultra Fine-Grained Summaries (UFSn) the Flickr user ID. All of the crawled images
are geo-tagged. After some data cleansing
In contrast to HFS, UFS are based on a bit vector the Geoflickr collection consists of 406450
with the bit at position j indicating if center j is geo-tagged images from 5951 different users/
the closest center to one or more of a peer’s image peers.
locations. Therefore, we obtain a bit vector of size 2. Geograph: Geograph (http://www.geo-
n. Of course, there is some loss of information graph.org.uk/) “aims to collect geographi-
when switching from HFS to UFS with n staying cally representative photographs and infor-
constant. However, UFS have the potential of mation for every square kilometre of Great
resulting in more space efficient resource descrip- Britain and Ireland.” We downloaded the
tions. Potentially, this allows for more reference geo-tagged images and distributed them to
points being used, which might result in similar peers again in a user-centric approach. In
or even improved retrieval performance compared our scenario every Geograph participant
to HFS. Among other aspects, this is evaluated operates a single peer: 2609 peers administer
in Blank and Henrich (2010) and will be briefly 246937 images and thus image locations in
summarized in the following experimental sec- total.
tion. Before doing so, we will describe the data
collections we use. The distribution of the number of images per
The abovementioned resource description peer is displayed for both collections in Figure
techniques as well as variations and combinations 2. For both collections the distribution of the
of them (e.g. combining grid and bounding box number of images per peer is very skew which is
based representations) are also used in the context typical for many P2P settings (Cuenca-Acuna, et
of multidimensional and metric index structures al., 2003). Few peers administer large amounts of
(Samet, 2006). The decision of choosing the best the collection. On the other hand, there are many

673
Describing and Selecting Collections of Georeferenced Media Items

Figure 2. Number of images per peer for the Geoflickr and Geograph collection

peers which store only few images. A more detailed underlying data collection. The query locations are
analysis of the distribution of the peer sizes, i.e. visualized in Figure 3. For HFS and UFS where
the number of images per peer, can be found in the outcome of the experiments is affected by the
Blank and Henrich (2010). selection of reference points, we run at least ten
A visualization of the geographic distribution experiments with the 200 queries each. Since we
of the image locations can be found in Blank and do not remove the image with the query location,
Henrich (2010), too. The Geoflickr collection it is—on average—more likely that a big peer
consists of photos taken in various parts of the contributes to the retrieval result than a small peer
world with hotspots in North America, Europe, because—on average—it is more likely to choose
and Japan. In contrast, images of the Geograph the query from a big peer than from a small peer.
collection are limited to the UK and Ireland with An additional strategy for selecting the queries is
images more densely located around urban areas analyzed in Blank and Henrich (2010).
such as London. Space efficiency of different resource descrip-
tions is measured by analyzing average summary
Experimental Settings sizes. For compressing the summaries we apply
Java’s gzip implementation with default param-
In our experiments, we use 200 image locations eter values. Our measurements include serializa-
as queries. These are randomly selected from the tion overhead necessary in order to distribute the

Figure 3. Image locations used as queries for Geoflickr

674
Describing and Selecting Collections of Georeferenced Media Items

resource descriptions in the network. The product Empirical Analysis of Approaches


of average summary size times the number of
peers indicates the network load, which is imposed Peer ranking selectivity for both collections is
by a single round of gossiping (i.e. every peer displayed in Figure 4 with the help of boxplots
sends its summary once to all other peers in the including minimum and maximum values. More
PlanetP-like subnet). experimental results with a focus on the average
In earlier studies (Blank & Henrich, 2010), number of peers, which are contacted during query
in order to measure peer ranking selectivity we processing, can be found in Blank and Henrich
determine the fraction of peers which needs to be (2010). For reasons of brevity, we focus on a set-
contacted on average to retrieve a certain fraction ting with n = 8192 for UFS and HFS here. For the
of the top-k image locations (k = 20). In this chap- GRID approach we partition the data space in 32
ter, we take a closer look at ranking selectivity rows and 64 columns. Thus, in an uncompressed
by analyzing the distribution of the fraction of way, GRID will also result in a bit vector of size
contacted peers over all queries. 8192.
The top-k geo-locations are computed using From Figure 4 can be observed that HFS per-
Vincenty distance (Vincenty, 1975). Since we are forms slightly better than UFS. This seems reason-
interested in the selectivity of the resource descrip- able since HFS encodes frequency information
tion and selection techniques, we analyze all of a which can be beneficial for peer ranking. If we
peer’s image locations as soon as it is contacted, assume that the query is closest to a certain refer-
because the top-k image locations of a peer de- ence point c*, then it is obvious to contact peer
termined using Haversine distance might differ pa before peer pb if peer pa has assigned more
from the top-k image locations computed using image locations to c* than peer pb. In case of UFS,
Vincenty distance. In a real-world application, this information gets lost. Here, peer pb with
only the top-k image locations will be transferred fewer image locations assigned to c* might be
(together with some additional information such selected before peer pa. Although there might be
as peer ID, etc.) cases where such a strategy will lead to better
ranking selectivity, these cases will be excep-
tional.

Figure 4. Ranking selectivity for different approaches

675
Describing and Selecting Collections of Georeferenced Media Items

For the Geoflickr collection (Figure 4, left), and UFS reference points are chosen on a global
both HFS and UFS perform better than BB and scale and are not restricted to the boundaries of
GRID. Interquartile ranges of UFS and HFS are the UK. They are selected from a Gazetteer and
much smaller than for GRID and especially BB. correspond to various locations from all over the
GRID offers better ranking selectivity than BB. world.
In addition to the average number of contacted Summaries based on a single bounding box
peers (cf. Blank & Henrich, 2010), also median per peer lack peer ranking selectivity and seem
and 75th percentile of GRID are clearly below to be too coarse in order to be able to compete
corresponding values for BB. Also interquartile with HFS or UFS, but, bounding boxes can be
range is smaller. Nevertheless, at least for some represented in a very space efficient way. They
queries, it is very difficult for GRID to offer require only 45 byte per resource description as
adequate ranking selectivity as indicated by the can be observed from Figure 5.
maximum value of almost 18%. This is the case For both collections, HFS affords—on aver-
for queries, which lie in a very populated grid cell, age—approximately twice as big resource descrip-
i.e. a grid cell where many peers assign documents tions as UFS. If we take a closer look at the
to. We therefore looked at queries, which offer Geoflickr collection, average summary sizes of
poor ranking selectivity for GRID in more detail. UFS are slightly bigger than in case of GRID. On
It can be observed that many of these queries lie the other hand, further experiments show that
in cells where many of the documents reside, i.e. even GRID with more cells (for example 96 rows
cells, which contain metropolises such as London and 192 columns) cannot outperform UFS in terms
for example. of ranking selectivity. GRID96 contacts on average
In case of Geograph (Figure 4, right), the grid more peers than UFS8192 in order to retrieve the
is not adapted to the boundaries of the United Top-20 with a bigger average summary size.
Kingdom (of course this can easily be done). We A more detailed analysis of ranking selectivity
did not adapt it in order to show the effects of a for HFS and UFS is displayed in Figure 6. It visu-
skew distribution of geospatial image locations alizes the number of contacted peers for different
on a global scale. HFS and UFS are better suited values of n. Medians in case of HFS are always
for such scenarios than GRID, because they bet- smaller than in case of UFS. It can be confirmed
ter adapt to the data which is used. Also for HFS that HFS outperforms UFS. But, performance

Figure 5. Average summary sizes for different approaches

676
Describing and Selecting Collections of Georeferenced Media Items

gaps decrease with increasing values of n. Dif- proportionally select the number of reference
ferences in peer ranking selectivity diminish with points from a certain country. Reference points are
increasing n, since the corresponding histograms selected amongst all populated places of a certain
become more and more similar with many zeros country at random. So, for example, if x% of the
and some summary bin values set to 1. Of course, world’s GDP comes from a certain country, x% of
for HFS, the values of some summary bins might the reference points are randomly chosen amongst
still be bigger than 1, but with increasing n this all populated places of the specific country.
becomes rarer and rarer. Also interquartile ranges Figure 7 shows ranking selectivity in terms of
for HFS and UFS become more and more similar the average fraction of peers which are contacted
when increasing n. This indicates that ranking in order to retrieve the top-20 image locations.
selectivity of HFS and UFS equal more and more. We plot UFS results for different values of n. An
In the following, we will limit ourselves to the analysis of HFS offers similar characteristics and
analysis of UFS since this approach seems to of- is omitted here for reasons of brevity. We can see
fers a good compromise between ranking selectiv- that the strategy based on GDP performs best.
ity and summary size. In order to further trade-off This confirms the finding in Blank and Henrich
these two factors under the influence of other (2010) where only small values of n were ana-
aspects, we refer to the general cost model pre- lyzed. It should be noted here that we used the
sented in Blank and Henrich (2010) for a more GDP-based approach in the preceding experiments
detailed analysis. of this chapter.

Obtaining the Reference Points Analysis of Distance Measures

For obtaining the reference points, we employ In our experiments, the 20 closest image locations
Geonames gazetteer (http://www.geonames.org/) according to a given query location are always
as well as United Nations’ per country statistics computed using Vincenty distance (Vincenty,
obtained through Worldmapper (http://www. 1975) in order to achieve good accuracy, since
worldmapper.org/). Within this chapter, we focus we are interested in the “true” nearest neighbors.
on statistics about men’s income, Gross Domestic On the other hand, in our algorithms we have
Product (GDP), population and WWW usage. used Haversine distance (Sinnott, 1984) which
Various other statistics are evaluated in Blank is computationally more efficient than Vincenty
and Henrich (2010). Based on the statistics we distance at the price of computing less accurate

Figure 6. Ranking selectivity of HFS and UFS for different values of n

677
Describing and Selecting Collections of Georeferenced Media Items

Figure 7. Choosing reference points for UFS from external sources

distance values. Whereas Vincenty distance is IR, where all the online resources of a person are
based on an ellipsoidal shape of the earth, Ha- queried. Web users frequently administer various
versine distance assumes a spherical model. In e-mail accounts, bookmarks of web pages, image
addition to Haversine and Vincenty distance, collections, databases, etc. These resources are
we analyzed two versions of Euclidean distance typically heterogeneous in size, media type and
(the use of lat/lon-pairs as 2-dimensional vectors update frequency (Thomas & Hawking, 2009) pos-
as well as 3-dimensional vectors resulting from sibly requiring space efficient and at the same time
a projection of the lat/lon-coordinates onto a selective (geospatial) resource descriptions. Thus,
sphere). These distances might be used in order to applications might be built providing a unified
further approximate distance values and speed-up search service (similar to a meta-search engine)
query processing. As for Haversine distance, we over all these resources. This could prevent web
use a rather coarse earth radius of approximately users from the time consuming task of querying
6731 km. We observed in experiments that all all resources “manually.”
approaches offer similar retrieval performance. Spatial Data Infrastructures (SDIs; for a general
Hence, it is sufficient to apply Euclidean distance. description see for example Nogueras-Iso et al.
In the following section, we will describe [2005]) might also be an interesting application
how geospatial resource description and selection field for geospatial resource description and selec-
techniques, which were presented and evaluated in tion techniques. Chen et al. (2010) identify three
the preceding sections, could be applied in differ- critical problems with current solutions which are
ent application fields. By doing so we will show for example based on the Catalogue Service for
that their use is not limited to the P2P IR domain. the Web (CSW; http://www.opengeospatial.org/
standards/cat). First, server-based solutions offer
Resource Descriptions and the problem of a single point of failure. Second, it
Selection Techniques in is the task of the users who search for services to
Different Application Fields identify the proper portal. Third, users who want
to provide data have to find a suitable portal to
In addition to P2P IR, geospatial resource summa- register to, which might also be a time consum-
rization and selection techniques can also be used ing task. In order to overcome these issues, it is
in traditional distributed IR applications. Personal broadly recognized that P2P technology might
meta-search is a novel application of distributed be beneficial for the discovery of geospatial web

678
Describing and Selecting Collections of Georeferenced Media Items

services and the publishing of geographic data Lupu et al. (2007) present an approach for
(Xiujun, et al., 2006; Chen, et al., 2010). What information sharing in mobile ad hoc networks.
remains is the question for the best P2P technology When people meet at certain events or places only
in the context of SDIs. Due to the high maintenance for a limited amount of time, there might not be
costs of structured solutions with nodes joining or enough time to transfer full index data when a
leaving the network, Chen et al. (2010) argue for person for example searches for media items in
a hybrid solution. Initially nodes join an unstruc- the proximity of a given query location.
tured network and only if the node stays in the Compact geospatial resource descriptions
network for a longer time period, it can move to might also be valuable for focused crawling
the structured network. While the administration of (cf. Ahlers & Boll, 2009). If a service provides
lat/lon-coordinates in a structured network might summaries of the geospatial extend of a certain
for example be based on a quadtree-like (Chen, website or media archive, a crawler could esti-
et al., 2010) or R+-tree-like (Xiujin, et al., 2006) mate the potential usefulness of this resource for
structure, or a space-filling curve based mapping its focused crawling task before actually visiting
to one dimensional values (Memon, et al., 2009), the source. This way, crawl efficiency can be im-
the unstructured part of the network might use proved by preventing the crawler from analyzing
a resource description and selection based ap- too many irrelevant pages. Web traffic imposed
proach for web service discovery. Here, rather by downloading large irrelevant data volumes can
than minimum bounding boxes such as in case thus be avoided.
of (Xiujin, et al., 2006), grid-based or Voronoi- Distributed IR techniques can also be used
based space partitioning schemes as presented in for vertical selection within aggregated search
this chapter might also be beneficial to represent a (Arguello, et al., 2009). Vertical selection is the
geospatial web service profile. Such profiles might task of identifying relevant verticals, i.e. focused
be generated by different resources/peers by the search services such as image, news, video or
invocation of the getCapabilities() method which shopping search. A user issuing the textual query
is defined by many OGC standards such as Web “Beatles Hamburg” might also be interested in
Map Service (WMS; http://www.opengeospatial. music videos captured in Hamburg and thus the
org/standards/wms), Web Feature Service (WFS; results of video search or small previews should
http://www.opengeospatial.org/standards/wfs), be integrated in result presentation of classical
or Web Coverage Service (WCS; http://www. web search if video files match the geospatial
opengeospatial.org/standards/wcs). Currently, for restriction. In this context, a vertical can be in-
example in case of WCS and WFS, Voronoi-based terpreted as a resource and the task of selecting
coverage summaries are not intended. relevant verticals is similar to resource selection
Resource description and selection techniques in distributed IR requiring adequate features, i.e.
might also be applied within sensor networks resource descriptions, and corresponding selec-
(Elahi, et al., 2009). In sensor networks, limited tion mechanisms.
processing power, bandwidth, and energy capaci- Space efficient geospatial resource descriptions
ties necessitate aggregation techniques which are might also be beneficial in the context of recom-
based on local information with a clear focus on mender systems and social search for example in
space efficiency. One might think of in-situ or order to compute the similarity between different
remote sensing applications, which aggregate users of social network sites. Similar users can
and summarize gathered data before transmitting be determined not only based on having the same
them in the network. friends, using the same tags, bookmarking the

679
Describing and Selecting Collections of Georeferenced Media Items

same media items, etc. (Guy, et al., 2010), but It might also be interesting to further analyze
also depending on the similarity of geospatial ranking selectivity when very remote places on
footprints which are obtained from the media earth are used as query locations. If they repre-
items a user administers. sent places in certain countries with low GDP,
For many scenarios, it might be necessary to this might lead to a loss in ranking selectivity for
enhance the basic description and selection tech- UFS and HFS. In these cases, other strategies for
niques presented in this chapter in order to sum- obtaining the centroids might be used. In addition,
marize for example trajectories or more complex BB or GRID might be better suited in such cases.
objects such as polygons. Also in this context, the As mentioned before, we also plan to use UFS
literature on multidimensional and metric index and HFS to enhance centralized index structures.
structures provides a good starting point for further In general, there is a need for resource descrip-
studies (cf. Samet, 2006). Many multidimensional tion and selection techniques based on local image
access methods are capable of administering lines features. Here, image content is represented by
and polygons, which are aggregated for example several feature vectors per image. Local features
in the inner nodes of tree-based access methods are also used in video retrieval. A further research
in an adequate way. direction might be to apply HFS and UFS in this
context.
Since resource selection techniques for text
FUTURE RESEARCH DIRECTIONS and media content information are available, one
might take a closer look at combined queries.
So far, our heuristic approach lacks the avail- If a user queries for “Beatles Hamburg” it is
ability of an adequate stopping criterion. In our not sufficient to rank peers high, which contain
experiments we analyze how many peers need documents addressing the Beatles or the city of
to be contacted in order to retrieve the 20 closest Hamburg. A user might expect documents which
locations according to a given query location. Of match both criteria.
course, we could use these empirical findings in
order to derive a mechanism which tries to guar-
antee that for example in 90% of the queries all CONCLUSION
20 nearest neighbors can be found. Nevertheless,
we doubt if such an approach adapts to differ- In this chapter, we have outlined different P2P ap-
ent media types and collection sizes. Thus, it is proaches for the large-scale administration of media
necessary to design algorithms for k-NN query content. We identified geospatial information as an
processing which can successfully prune peers important search criterion and discussed how it can
from query processing if they do not contribute be applied in the various P2P approaches. Later in
relevant documents. the chapter, we presented and evaluated different
Another interesting aspect might be to further resource description and selection techniques for
optimize the GRID and BB approaches. For BB, geospatial k-NN queries and identified UFS as a
techniques, which use more than one bounding promising approach. Here, binary histograms are
box per resource, might be promising (cf. Becker, used which capture if a certain reference location
et al., 1991; Chen, et al., 2006). For the GRID is closest to one or more of a resources media loca-
approach, triangular partitions as well as a parti- tions. We also outlined how UFS can be applied in
tioning which results in cells of equal size might a P2P scenario. In addition, we qualitatively argued
be considered (cf. Szalay, et al., 2005; Putman, that geospatial resource description and selection
et al., 2007). techniques such as UFS might also be promising
in various other application fields.

680
Describing and Selecting Collections of Georeferenced Media Items

REFERENCES Blank, D., El Allali, S., Müller, W., & Henrich, A.


(2007). Sample-based creation of peer summaries
Ahlers, D., & Boll, S. (2009). Adaptive geospa- for efficient similarity search in scalable peer-to-
tially focused crawling. In D. Cheung, I. Song, peer networks. In R. Lienhart, A. R. Prasad, A.
W. Chu, X. Hu, J. Lin, J. Li, & Z. Peng (Eds.), Hanjalic, S. Choi, B. P. Bailey, & N. Sebe (Eds.),
Proceedings of the 18th ACM International Con- Proceedings of the 15th International Confer-
ference on Information and Knowledge Manage- ence on Multimedia, (pp. 143-152). Augsburg,
ment, (pp. 445-454). Hong Kong, China: ACM. Germany: ACM.
Arguello, J., Diaz, F., Callan, J., & Crespo, J.-F. Blank, D., & Henrich, A. (2009). Summariz-
(2009). Sources of evidence for vertical selection. ing geo-referenced photo collections for image
In M. Sanderson, C. Zhai, J. Zobel, J. Allan, & retrieval in P2P networks. In Proceedings of the
J.-A. Aslan (Eds.), Proceedings of the 32nd In- International Workshop on Geographic Infor-
ternational ACM SIGIR Conference on Research mation on the Internet, (pp. 55-60). Retrieved
and Development in Information Retrieval, (pp. from http://georama-project.labs.exalead.com /
315-322). Boston, MA: ACM. workshop/GIIW-proceedings.pdf.
Beaver, D., Kumar, S., Li, H. C., Sobel, J., & Blank, D., & Henrich, A. (2010). Description
Vajgel, P. (2010). Finding a needle in haystack: and selection of media archives for geographic
Facebook’s photo storage. In Proceedings of the nearest neighbor queries in P2P networks. In
9th USENIX Conference on Operating Systems A. R. Doherty, C. Gurrin, G. J. F. Jones, & A. F.
Design and Implementation (OSDI 2010). Berke- Smeaton (Eds.), Proceedings of Information Ac-
ley, CA: USENIX Association. cess for Personal Media Archives Workshop, (pp.
Becker, B., Franciosa, P. G., Gschwind, S., Ohler, 22-29). Received from http://doras.dcu.ie/15373/.
T., Thiemt, G., & Widmayer, P. (1991). An optimal Bockting, S., & Hiemstra, D. (2009). Collection
algorithm for approximating a set of rectangles selection with highly discriminative keys. In
by two minimum area rectangles. In H. Bieri & Proceedings of the 7th International Workshop on
H. Noltemeier (Eds.), Proceedings of the Inter- Large-Scale Distributed Systems for Information
national Workshop on Computational Geometry, Retrieval. Retrieved from http://lsdsir09.isti.cnr.
(pp. 13-25). Berlin, Germany: Springer. it/ lsdsir09-1.pdf.
Belkin, N. J., Kantor, P., Fox, E. A., & Shaw, J. Chen, S., Liang, S., & Wang, M. (2010). A local-
A. (1995). Combining the evidence of multiple ity-aware peer-to-peer approach for geospatial
query representations for information retrieval. web services discovery. In Canadian Geomatics
Information Processing & Management, 31(3), Conference. Retrieved from http://www.isprs.
431–448. doi:10.1016/0306-4573(94)00057-A org /proceedings/XXXVIII/part1/13/ 13_01_Pa-
Bender, M., Michel, S., Weikum, G., & Zimmer, per_65.pdf.
C. (2005). The minerva project: Database selec- Chen, Y.-Y., Suel, T., & Markowetz, A. (2006).
tion in the context of P2P search. In G. Vossen, Efficient query processing in geographic web
F. Leymann, P. C. Lockemann, W. Stucky (Eds.), search engines. In S. Chaudhuri, V. Hristidis, &
Proceedings of Datenbanksysteme in Business, N. Polyzotis (Eds.), Proceedings of the 25th In-
Technologie und Web, 11: Fachtagung des GI- ternational Conference on Management of Data,
Fachbereichs “Datenbanken und Informations- (pp. 277-288). Chicago, IL: ACM.
systeme”, (pp. 125-144). Karlsruhe, Germany:
DIS.

681
Describing and Selecting Collections of Georeferenced Media Items

Cuenca-Acuna, F., Peery, C., Martin, R. P., & Guttman, A. (1984). R-trees: A dynamic index
Nguyen, T. D. (2003). PlanetP: Using gossiping structure for spatial searching. In B. Yormark
to build content addressable peer-to-peer informa- (Ed.), Proceedings of ACM SIGMOD Conference,
tion sharing communities. In Proceedings of IEEE (pp. 47-57). Boston, MA: ACM.
International Symposium on High Performance
Guy, I., Jacovi, M., Perer, A., Ronen, I., & Uziel,
Distributed Computing, (pp. 236-246). Seattle,
E. (2010). Same places, same things, same people?
WA: IEEE Press.
Mining user similarity on social media. In Pro-
Dolin, R., Agrawal, D., Abbadi, A. E., & Dil- ceedings of the 22nd International Conference
lon, L. K. (1997). Pharos: A scalable distributed on Computer Supported Cooperative Work, (pp.
architecture for locating heterogeneous informa- 41-50). Savannah, GA: ACM.
tion sources. In F. Golshani & K. Makki (Eds.),
Hariharan, R., Hore, B., & Mehrotra, S. (2008).
Proceedings of the 6th International Conference
Discovering GIS sources on the web using sum-
on Information and Knowledge Management, (pp.
maries. In R. Larsen, A. Paepcke, J. L. Borbinha,
348-355). Las Vegas, Nevada: ACM.
& M. Naaman (Eds.), Proceedings of the 8th ACM/
Doulkeridis, C., Vlachou, A., Nørvåg, K., Kotidis, IEEE Joint Conference on Digital Libraries, (pp.
Y., & Vazirgiannis, M. (2009). Multidimensional 94-103). Pittsburgh, PA: ACM Press.
routing indices for efficient distributed query
Ilyas, I. F., Beskales, G., & Soliman, M. A. (2008).
processing. In D. Cheung, I. Song, W. Chu, X.
A survey of top-k query processing techniques in
Hu, J. Lin, J. Li, & Z. Peng (Eds.), Proceedings
relational database systems. ACM Computing Sur-
of the 18th ACM International Conference on
veys, 40(4), 1–58. doi:10.1145/1391729.1391730
Information and Knowledge Management, (pp.
1489-1492). Hong Kong, China: ACM Press. Lupu, M., Li, J., Ooi, B. C., & Shi, S. (2007).
Clustering wavelets to speed-up data dissemina-
Doulkeridis, C., Vlachou, A., Nørvåg, K., & Vazir-
tion in structured P2P manets. In Proceedings of
giannis, M. (2010). Distributed semantic overlay
the 23rd International IEEE Conference on Data
networks. In Shen, X., Yu, H., Buford, J., & Akon,
Engineering, (pp. 386-395). Istanbul, Turkey:
M. (Eds.), Handbook of Peer-to-Peer Networking.
IEEE Press.
Berlin, Germany: Springer. doi:10.1007/978-0-
387-09751-0_17 Memon, F., Tiebler, D., Dürr, F., Rothermel, K.,
Tomsu, M., & Domschitz, P. (2009). Scalable
Eisenhardt, M., Müller, W., Blank, D., El Allali, S.,
spatial information discovery over distributed hash
& Henrich, A. (2008). Clustering-based, load bal-
tables. In Proceedings of the Fourth International
anced source selection for CBIR in P2P networks.
ICST Conference on COMmunication System
International Journal of Semantic Computing,
softWAre and middlewaRE (COMSWARE 2009).
2(2), 235–252. doi:10.1142/S1793351X08000439
New York, NY: ACM.
Elahi, B. M., Römer, K., Ostermaier, B., Fahr-
Müller, W., Eisenhardt, M., & Henrich, A. (2005).
mair, M., & Kellerer, W. (2009). Sensor ranking:
Scalable summary based retrieval in P2P networks.
A primitive for efficient content-based sensor
In A. Chowdhury, N. Fuhr, M. Ronthaler, H.-J.
search. In Proceedings of the 8th International
Schek, & W. Teiken (Eds.), Proceedings of the
Conference on Information Processing in Sensor
14th ACM International Conference on Informa-
Networks, (pp. 217-228). Washington, DC: IEEE.
tion and Knowledge Management, (pp. 586-593).
Bremen, Germany: ACM Press.

682
Describing and Selecting Collections of Georeferenced Media Items

Nogueras-Iso, J., Zarazaga-Soria, F. J., & Muro- Stoica, I., Morris, R., Karger, D., Kaashoek, F.,
Medrano, P. R. (2005). Geographic informa- & Balakrishnan, H. (2001). Chord: A scalable
tion metadata for spatial data infrastructures: peer-to-peer lookup service for Internet applica-
Resources, interoperability and information tions. In Proceedings of the Annual Conference
retrieval. Secaucus, NJ: Springer-Verlag. on Applications, Technologies, Architectures, and
Protocols for Computer Communication, (pp.
Novak, D., Batko, M., & Zezula, P. (2008). Web-
149-160). San Diego, CA: ACM.
scale system for image similarity search: When
the dreams are coming true. In Proceedings of Szalay, A., Gray, J., Fekete, G., Kunszt, P., Ku-
6th International Workshop on Content-Based kol, P., & Thakar, A. (2005). Indexing the sphere
Multimedia Indexing, (pp. 446-453). London, with the hierarchical triangular mesh. Technical
UK: IEEE. Report: MSR-TR-2005-123. Retrieved from
http://research.microsoft.com /pubs/64531/tr-
Papapetrou, O., Siberski, W., Balke, W.-T., &
2005-123.pdf.
Nejdl, W. (2007). DHTs over peer clusters for
distributed information retrieval. In Proceedings Thomas, P., & Hawking, D. (2009). Server selec-
of the 21st International Conference on Advanced tion methods in personal metasearch: A compara-
Information Networking and Applications, (pp. tive empirical study. Information Retrieval, 12(5),
84-93). Niagara Falls, Canada: IEEE. 581–604. doi:10.1007/s10791-009-9094-z
Putman, W. M., & Lin, S.-J. (2007). Finite-volume Vincenty, T. (1975). Direct and inverse solutions
transport on various cubed-sphere grids. Jour- of geodesics on the ellipsoid with application of
nal of Computational Physics, 227(1), 55–78. nested equations. Survey Review, 22(176), 88–93.
doi:10.1016/j.jcp.2007.07.022
Vu, Q. H., Lupu, M., & Wu, S. (2009). Simpson:
Ratnasamy, S., Francis, P., Handley, M., Karp, Efficient similarity search in metric spaces over
R., & Schenker, S. (2001). A scalable content- P2P structured overlay networks. In H. Sips,
addressable network. In Proceedings of the An- D. Epema, & H.-X. Lin (Eds.), Proceedings of
nual Conference on Applications, Technologies, the 15th International Euro-Par Conference on
Architectures, and Protocols for Computer Com- Parallel Processing, (pp. 498-510). Delft, The
munication, (pp. 161-172). San Diego, CA: ACM. Netherlands: Springer.
Samet, H. (2006). Foundations of multidimen- Xiujun, M., Gang, L., Kunqing, X., & Meng, S.
sional and metric data structures. San Francisco, (2006). A peer-to-peer approach to geospatial
CA: Morgan Kaufmann Publishers Inc. web services discovery. In Proceedings of the 1st
International Conference on Scalable Information
Shen, X., Yu, H., Buford, J., & Akon, M. (2010).
Systems (InfoScale 2006). New York, NY: ACM.
Handbook of peer-to-peer networking. Berlin,
Germany: Springer. doi:10.1007/978-0-387-
09751-0
KEY TERMS AND DEFINITIONS
Sinnott, R. (1984). Virtues of the haversine. Sky
and Telescope, 68(2), 158. Peer-to-Peer (P2P) System: A P2P system is
made up of distributed resources (i.e. computing
devices) which cooperate in order to provide and
consume certain services in a decentralized fash-

683
Describing and Selecting Collections of Georeferenced Media Items

ion. In contrast to traditional client/server systems, Structured P2P IR Systems: P2P IR systems
a peer can act as both, a client and a server. which are based on distributed index structures
Peer-to-Peer Information Retrieval (P2P with distributed hash tables (DHTs) being the
IR) System: A P2P system with a focus on the ad- most prominent class member. In structured P2P
ministration and retrieval of media items. Hereby, IR systems, every peer is responsible for a certain
content-based information retrieval techniques are region of the feature space.
applied in order to search for media items. Summary: The term summary also refers
Resource Description: An important task in to resource descriptions. It emphasizes the need
distributed IR which provides the basis for query for space efficiency of the resource descriptions
routing. The resources have to describe their con- in certain application contexts, e.g. unstructured
tent in an adequate way—optimizing the trade-off P2P IR systems.
between selectivity and space efficiency of the Unstructured P2P IR Systems: In contrast
data summaries (also called resource descriptions). to structured P2P IR systems, connections in un-
Resource Selection: The process of determin- structured P2P IR overlays do not emerge from
ing the order in which distributed resources should a distributed index structure and are thus formed
be contacted to fulfill a certain information need. more arbitrarily. Here, every peer usually admin-
Besides knowing the order in which resources isters the index data of its own media objects.
should be contacted it is also important to deter-
mine when it is no longer beneficial to contact
further resources.

This work was previously published in Discovery of Geospatial Resources: Methodologies, Technologies, and Emergent
Applications, edited by Laura Díaz, Carlos Granell and Joaquín Huerta, pp. 1-20, copyright 2012 by Information Science
Reference (an imprint of IGI Global).

684
685

Chapter 42
Cognitive Mapping and
GIS for Community-Based
Resource Identification
Lyn Kathlene
Colorado State University, USA

ABSTRACT
This chapter describes and analyzes the effectiveness of two methodological techniques, cognitive
mapping and geographical information systems (GIS), for identifying social service resources. It also
examines the processes used to integrate hand-drawn map information into geocoded data points and
provides recommendations for improving efficiency and precision. As a first step to integrate Jefferson
County social service delivery into community-based child welfare “systems of care” (SOC), both formal
and informal services had to be identified. Cognitive mapping, a process by which participants draw
visual representations of geographical areas, was conducted with 247 participants in Jefferson County,
Colorado. Over 3,500 resources were identified and entered into a GIS to analyze the availability, capac-
ity, and distribution of social services in the county and within communities. Identification of community
resources via cognitive mapping and GIS analysis provide: (1) a comprehensive database of existing
services; (2) a basis to build communication networks and cooperation among government and com-
munity providers; (3) the ability to create an efficient system that avoids duplication of efforts; (4) an
understanding of the geographical distribution of resources; (5) the identification of resources lacking in
the county and specific communities; and (6) knowledge differences among diverse participant groups.

INTRODUCTION Division of Human Services, to conduct a re-


source identification analysis. The project was
In December, 2003, the Colorado Institute of one component in the first year of a five-year
Public Policy (CIPP) at Colorado State University Health and Human Services — Children’s Bureau
was contracted by Jefferson County, Colorado, grant to create a “systems of care” (SOC) in child
welfare social service delivery.1 Jefferson County,
DOI: 10.4018/978-1-4666-2038-4.ch042 Colorado, was one of eight pilot sites awarded an

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Cognitive Mapping and GIS for Community-Based Resource Identification

SOC grant. The CIPP component was to identify resource database. Over 3,800 unique resources
services available at the community-level and were identified and entered into a GIS — Arc-
discover services that were lacking. Map, a component of ArcView — to analyze the
SOC is a major paradigm shift in social service availability, capacity, and distribution of social
delivery. It removes the locus of authority away services in the county and within communities.
from one individual, the social service worker, Census data was overlaid to identify high-need
and replaces it with a group of service providers, areas and ethnic enclaves.
family, and community members to develop col- Here, a novel application of GIS for designing
lectively a comprehensive plan to move the child improved social service delivery systems is de-
and family out of crisis. The provision of services scribed. The chapter also discusses complications
are to be coordinated, community-based, cultur- involved in working with human service agencies,
ally competent and individualized (Stroul, 1986). and reconsiders the processes developed to merge
To integrate Jefferson County social service cognitive mapping information into ArcMap.
delivery into community-based comprehensive
child welfare SOC, both formal and informal Cognitive Mapping: Origin and Uses
services had to be identified. Informal services
are of particular interest since these are likely the Cognitive mapping has not been comprehensively
least well-known (there was no official directory) integrated with GIS to the degree discussed in this
and serve populations at a community SOC level project, although elementary integration has been
(rather than county-wide). For definition purposes, done in some previous projects (Fulton, Horan, &
informal services were identified for participants Serrano, 1997; Kathlene, 1997; Kathlene & Horan,
as private or not-for-profit programs, including 1998; Horan, Serrano, & McMurran, 2001).2 The
services such as church soup kitchens, non-profit potential usefulness to the human services sectors
agencies providing transportation services for the through projects that use the combined method-
elderly, and in-home daycare providers not regis- ologies is substantial. To better understand this
tered with the county. Formal services are public “fit,” a brief review of cognitive mapping follows.
programs at the state, county, and local level, such Cognitive mapping did not originate from
as Jefferson County Mental Health Services, Title research on humans. Rather, the term “cognitive
XX daycare providers, public schools, public map” was originally used to describe the mental
transportation, and park and recreation programs. representations that rats develop as they navigate
To identify existing resources at the com- the same maze multiple times (Tolman, 1948).
munity level, cognitive mapping, a process by Quickly, researchers became interested in using
which participants draw visual representations the information from cognitive maps in the human
of geographical areas, was conducted with 247 context, and the resulting collection of methods
participants in Jefferson County, Colorado. Par- became known as cognitive mapping. Later, the
ticipant groups in the mapping included social term expanded to include mental depictions of
service, non-profit, and faith-based providers, more abstract entities, like ideas or chains of
social service clients (Temporary Assistance for events. Since its inception, cognitive mapping
Needy Families (TANF) recipients, youths, foster has been used as an approach to a number of
care providers, and adoptive parents), residents real-world issues.
and ethnic/racial enclaves (Latino, Eastern Euro- Cognitive mapping’s first practical application
pean, Native American, and African American). was in the field of urban planning when Kevin
In addition, all resources listed in the Jefferson Lynch (1960) found that certain places or elements
County resource guides were included in the in a city generated a positive emotional reaction

686
Cognitive Mapping and GIS for Community-Based Resource Identification

among its residents. This type of research, he to draw the area they self-identified as their most
felt, could be used to better design cities so as to recognizable; therefore, they had high levels of
make them more memorable. Jack Nasar (1988) familiarity. The maps required particularly little
extended Lynch’s study by including a model drawing skill as participants were only asked to
of how city attributes influenced the affective draw boundaries (for which they are provided a
responses of residents and visitors. ruler), mark their home with an “X,” and identify
Cognitive mapping is also used to evaluate landmarks with squares. Since the objective in the
mental representations of smaller areas. Map- study was to identify resources of importance to
ping of specific neighborhoods (Quaiser-Pohl, the participant; having the maps drawn to scale
Lehmann, & Eid, 2004; Uzzell, Pol, & Badenas, was irrelevant because the exact geographic co-
2002), college campuses (Hardwick, Wooldridge, ordinates for each location are determined during
& Rinalducci, 1983; Holahan & Dobrowolny, the coding process. Coding the maps involved
1978; Sholl, 1987), and buildings (Moeser, 1988; recording objective location information, thereby
O’Laughlin & Brubaker, 1998; O’Neill, 1991) removing the problem of coder subjectivity. In
adds to knowledge about how spatial abilities short, information gleaned from the maps was
develop and are utilized, the skills related to map not dependent upon the drawing skills of the
creation, and the mapping abilities of various participants.
groups. Numerous overlying maps of these smaller
areas have been used to program autonomous
mobile robots (Yoshino, 1991). ADAPTATION OF MAPPING
Almost anyone can successfully complete a METHODS FOR RESOURCE
mapping exercise. Studies have found the quality IDENTIFICATION
of female and male’s maps are similar (Evans,
1980; Magana & Norman, 1980; O’Laughlin & Each use of mapping described above required a
Brubaker, 1998); nearly all ages can successfully unique adaptation of the method. Participants in
participate (Quaiser-Pohl, Lehmann, & Eid, 2004); these studies were asked to sketch maps, arrange
and map quality is not influenced by drawing skill pictures of pieces of paper according to their loca-
(Evans, 1980; Hardwick, Wooldridge, & Rinalduc- tion or importance, locate points on a map, circle
ci, 1983). The only criteria that affects map quality areas on a map, recognize features on an aerial
is familiarity with the target area, where quality photograph, or label features on a blank map.
increases with familiarity (Evans, 1980; Fridgen, The literature effectively suggests that as many
1987; Unger & Wandersman, 1985). Familiarity, methods of mapping exist as purposes for using it.
however, can distort maps. For example, locations In this project, participants attending a map-
of importance to the participant are often drawn ping workshop were first provided background
larger or more centrally than other map elements on the overall goals of the SOC project as well
(Holahan & Dobrowolny, 1978; Kathlene, 1997). as their role as a mapper in achieving those goals.
Finally, at the coding stage, the use of multiple The hope was that understanding the importance
coders without knowledge of the study hypothesis of the project would encourage conscientious
is commonly used (Daniels & Johnson, 2002); participation. Then, participants were provided
yet, studies have found nearly perfect agreement with a large blank sheet of paper and asked to
among multiple coders (Quaiser-Pohl, Lehmann, indicate the street or physical boundaries within
& Eid, 2004). which they most commonly travel. Street maps
For this project, the above issues did not present of the area were available to help them in this and
problems. Each of the participants was directed later tasks. Workshops consisted of three to sixteen

687
Cognitive Mapping and GIS for Community-Based Resource Identification

people, and at each session they were encouraged GIS ANALYSIS TECHNIQUE
to work cooperatively with their fellow workshop
participants. Following a visualization exercise The GIS for this cognitive mapping project pro-
to assist them in accessing non-verbal memories, vides aggregated spatial analysis of the maps.
they were asked to fill in details such as streets, GIS combines the thousands of data points col-
natural features, and resources they commonly lected from the participant-drawn maps, which
use. It was explained that these maps were not greatly enhances the value of cognitive mapping
meant to be artistic works; simple boxes and lines information. In addition, each distinct physical
drawn with their rulers would suffice to represent address has dozens of descriptors that identify
objects and locations. Once the maps were com- not only who provided the information but also
plete, participants were asked to label various secondary data such as the participants’ percep-
types of resources with colors corresponding to tions of a particular resource or service (Longley,
commercial businesses, community resources, Goodchild, Maguire, & Rhind, 2001). The data
government agencies, community strengths, and is accessible in Excel-type tables that can be eas-
community weaknesses. They also completed a ily queried and displayed on a digital street map
survey asking for detailed description of the im- (Longley, Goodchild, Maguire, & Rhind, 2001).
portant resources and features on their maps and This analysis allows for preliminary interpretation
to provide any additional information regarding of the data set, such as the dispersion or clustering
resources lacking in their area. Each map was of resources in geographic areas (Heagerty & Lele,
then entered into a GIS database. Figure 1 is an 1998; Reich & Davis, 2003). GIS can incorporate
example of a cognitive map drawn for the Jef- external data sources, such as census data, which
ferson County Project. can be layered with the cognitive mapping data.

Figure 1. Example of hand drawn cognitive map

688
Cognitive Mapping and GIS for Community-Based Resource Identification

Integrating the cognitive mapping data with GIS Structure of the GIS for the Project
software creates nearly limitless ways to analyze
the data and allows for easily-interpreted visual The structure of the GIS system for this project
results. was based on three principles:

Technical Capacity for the Project 1. The structure needed to be flexible and al-
low for the maximum amount of data to be
For this project the particular GIS capabilities stored and easily accessed.
were: 2. The structure needed to be relatively simple
such that an inexperienced GIS user can un-
• Visual representation of addresses and derstand the sorting, searching, and selecting
areas via data points, data lines, and data potential of the system.
shapes. 3. The GIS system needed to be able to organize
• The ability to layer data points for sorting, data from a variety of sources and be easily
categorizing, and selection purposes. updated over time.
• Accompanying tables of descriptive vari-
ables for each data layer with the ability The first step to organizing the data was to
to search and sort individual data points create basic social service or resource categories
based on particular variable(s). (in GIS terms these categories are referred to as
• The ability to export the data to conduct layers). Each layer needed to be broad enough to
advanced spatial analysis. encompass a variety of unique data points (iden-
tified resources) but narrow enough to create a
The cognitive mapping process was greatly en- good classification system for a beginning search
hanced, especially for analysis purposes, by using and find procedure. These basic layers were the
GIS to create unique, specialized parameters (for essential framework of the GIS system and the
a theoretical model, see Jordan, Raubal, Gartrell, search capabilities of the system that providers
& Egenhofer, 1998). The potential outcomes of could use; therefore, they had to match either the
such a union include: basic social service field breakdown of resource
types and/or be intuitively organized.
• The capability to export a large quantity Ultimately, 24 layers were created for the proj-
of data in an Excel-type format that can ect including Housing, Mental Health Services,
be searched and sorted based on any given Transportation, Health Care, Emergency Services,
number of criteria. Domestic Violence, and Victim Services. Within
• The ability to organize and sort data in a each layer were key fields with similar categories
spatial manner (i.e. in the form of maps). across all the layers. For example, the data field
• The opportunity to create an easily-updat- “Resource Type” exists in each of the 24 layers and
ed database for organizing additional data contains only one of five values: Governmental,
gathered by the cognitive mapping process Community Non-Profit, Faith-Based, Private, or
and through other resource identification Unknown. This allowed queries and linkages to
processes. be made across layers. As an illustration, a user
could search for all the community non-profit
sponsored resources or services in two or more of
the layers and combine these results on one map.

689
Cognitive Mapping and GIS for Community-Based Resource Identification

Search or selection functions work on a layer by Respondent=Yes.” In the table above, this
by layer level in ArcMap. Within the Housing would select only the last row.
layer there were attributes such as Resource Each row represents one unique data point on
Name, Resource Address, and Resource Hours the GIS map and therefore represents one unique
of Availability. These attributes are easily sorted. resource. Resources can be identified and placed
For example, there should only be one Lakewood on the map multiple times, as in the above ex-
Housing Authority (LHA), but if there are two ample, with the first and second row both being
locations for LHA, they can be distinguished from the Resource Type “LHA.” What distinguishes
one another in the data table due to the different the two data points, however, is not the location
values entered in the Resource Address cell. How- on the map but the multiple attributes in the table
ever, not all searches are done based on a known listed above. Here, LHA was identified once by
name or location; therefore, a more extensive list a provider and once by a client, information that
of attributes must exist to properly describe each indicates common knowledge. Alternatively, if
resource. Some examples of other attribute identi- only clients identify the Allison Care Center, a
fiers were Resource Type (government-sponsored, knowledge gap in Jefferson County Human Ser-
private, faith-based, or community non-profit) vices providers and resource manuals is revealed,
or Funding Source. Another type of attribute is a which can be easily remedied. Creating a com-
binary descriptor such as Used by Respondent Yes/ prehensive database with separate rows of mul-
No. This would apply to data points identified by tiple criteria allows for flexibility in the analysis
clients and useful in various types of qualitative of resources ranging from a simple resource search
analysis. Below is a simplified example of a layer engine to a complex spatial analysis tool to nu-
table structure and associated attributes. merically describe the layout of resources and
It is easy to see in Table 1 the large number of services in Jefferson County.
possible searches based on only seven attributes.
For example, a provider might want to find all
government-sponsored housing resources. The GIS ANALYSIS OF
search would use “Resource Type=Government.” COMMUNITY RESOURCES
The first two rows would be selected from the
search. If housing resources provided by com- To do the GIS analysis, the resources identified
munity non-profits is of interest, the search would by the mapping and focus group participants were
be written as “Respondent Category=Client and verified using the 2004 Denver Metro phone book
Resource Type=Community Non-Profit and Used (www.dexonline.com) and/or MapQuest (www.
mapquest.com). The addresses were entered into

Table 1. Housing layer attribute table

Survey ID Respondent Resource Resource type Hours of Funding Type Used by


Category Name Availability Respondent
0105 Client LHA Government 9am-5pm Government No
0431 Provider LHA Government 9am-5pm Government N/A
0302 Community Lakeside Apart- Private N/A N/A Yes
Member ments
0746 Client Allison Care Community Non- N/A Private Yes
Center Profit

690
Cognitive Mapping and GIS for Community-Based Resource Identification

an Excel spreadsheet, the type of resource and rate. In practical terms, full saturation cannot be
service was coded into one or more of the 24 reached; but more importantly, a trade-off must be
types of resources, and the participant group and made between additional information and cost to
demographics were entered.3 The spreadsheet was acquire the information. In this project, there was
imported into ArcMap and geo-coded to place the an average of 20 resource points per map. Each
identified resources with known addresses on a map took approximately four hours to process
common geographical coordinate system. (from address coding the points to data entry
The 24 layers of resource and service types into ArcMap). An 80% threshold repeat rate was
were created by: (1) working with participants chosen as a break-even level. At this rate, only one
in four pilot sessions to understand how they additional new resource would be identified per
conceptualized services; (2) presenting the pilot one hour of coding time. Less than one additional
workshop information to Jefferson County Human new resource per hour of work was deemed an
Service employees to add categories that aligned inefficient use of resources.
with their current resource guides and experiences;
and (3) aligning with the Aires taxonomy used by
Colorado 2-1-1 system.4 The resulting twenty-four SELECTED RESULTS
layers allowed for some resources/services to be
coded into more than one layer, thereby increas- While the cognitive mapping process identified
ing the ability of information seekers to find the formal and informal resources, without the abil-
desired resources. For example, a service dedicated ity to do aggregated and stratified analyses the
to providing transportation to elderly individuals collected data would only have provided a list
would be included in Aging and Adult/Senior of resources known by the mappers. To create a
Services as well as Transportation Services. The comprehensive SOC child welfare system requires
dual coding in this project and used by the 2-1-1 detailed knowledge about the geographical distri-
system produces a comprehensive searchable da- bution of the density of community resources and
tabase that can reach a specific resource through the types available in the community. Additionally,
multiple avenues. to understand and correct the community-level
At the onset of the project, the research team information gaps among providers, community
decided that a method for determining when the members and clients, a geographical analysis
mapping process had reached a saturation level stratified by resource categories and information
of redundant information was needed. A novel sources (e.g., county resource guides, social ser-
method, a “repeat rate,” was created. The repeat vice workers, community providers, faith-based
rate was set at 80% based on the time and cost providers, and clients) was needed. GIS is able
involved in identifying additional resources. The to provide such geographical analysis. Impor-
repeat rate estimates the projected number of new tantly, data presented in visual form facilitated
points to be expected from each additional map. information dissemination among a wide variety
So, for example, on the first map every identified of stakeholders.
resource is unique and a new point in the database, Seven Jefferson County cities were the focus
the repeat rate is zero. By the 100th map, if 50% of the mapping project. Clients and community
of the points collected were previously identified resident groups were drawn from Lakewood,
on any of the previous 99 maps, the non-repeated Arvada, Wheat Ridge, Golden, Littleton, and the
points for that map would be 50%. At some point, mountain communities of Conifer and Evergreen.
theoretically, no new information will be gleaned A total of 42 workshops were held at locations
from additional maps, resulting in a 100% repeat in each of these communities. Special sessions

691
Cognitive Mapping and GIS for Community-Based Resource Identification

Figure 2. Latino population growth in Jefferson


tion. Because of these difficulties, all complete
County, 1990-2000
client files with zip codes in the study area were
mailed invitations by Jefferson County to par-
ticipate in the study. Fortunately, the clients,
foster, and adoptive parents who participated were
distributed throughout the study area. The follow-
ing map shows the home location cross-streets of
the participants and the extrapolated hypothetical
boundaries of their identified communities, rang-
ing from one-third mile to one mile around their
residence. The buffer area was based on actual
community boundaries drawn on the maps, which
ranged from larger than one mile to a handful that
were smaller than one-third mile. Figure 3 indi-
cates that the mapping participants who were
current or previous users of Jefferson County
social services had collective knowledge of
nearly all of the study area (see Appendix A for
demographic information of the participants).
Figure 4 shows the 3,845 resources identified
and mapped. The number of unique points is 1,819
after removing the repeated information. Identified
were held with members of the dominant ethnic resources are distributed across all the Jefferson
groups, identified using census data imported into County study areas with a scattering throughout
ArcMap. The groups included Native American, Denver County and a dense clustering along the
Russian, and Latino community residents,5 with main arterial east-west street, Colfax.
Latinos being the largest ethnic population in Jef- Next are selected categories of resources to
ferson County. To determine the areas in Jefferson illustrate how GIS was used to inform SOC plan-
County with the highest growth rate of Latinos, ning efforts.
1990 and 2000 census data was imported into Ar-
cMap and the percent change was calculated. The • Childcare: One hundred and eleven dif-
highest concentrations of the Latino population ferent childcare providers were identified.
were found in the central eastern area of Jefferson As Figure 5 shows, there is very little over-
County (Lakewood and Arvada) on the border of lap in knowledge between the Jefferson
Denver County. Figure 2 shows the Latino growth County resource guides and the partici-
patterns from 1990 to 2000. Numbers on the map pants in the mapping workshops. This
are the percentage of the area’s Latino population could indicate: (1) clients and providers
in the 2000 Census. are not familiar with the childcare resourc-
Current and past users of Jefferson County es which Jefferson County has identified;
social services were identified by Jefferson and (2) Jefferson County is not effectively
County. Since the database was confidential, Jef- disseminating this information. Many of
ferson County would not allow CIPP to pull a the childcare resources identified in the
stratified sample. In addition, over half the client Jefferson County resource guides are be-
records were missing complete address informa- fore-school and after-school care programs.

692
Cognitive Mapping and GIS for Community-Based Resource Identification

Figure 3. Client home locations and range boundaries

In contrast, the cognitive mapping partici- housed with a specific organization rather
pants identified pre-school childcare facili- than listed within Jeffco’s resource guides;
ties rather than before and after-school pro- however, a consolidated database will fa-
grams. Based on this analysis, the current cilitate resource integration, information
guides are lacking in pre-school childcare referral, and client access.
resources. It is likely this information is

Figure 4. All Jefferson County, Colorado, resources identified by participants and resource guides

693
Cognitive Mapping and GIS for Community-Based Resource Identification

• Education: There were 366 education ticipants and the Jefferson County resource
resources identified. This is the category guides. As seen in the Figure 7, however,
in the Jefferson County guides with the the Jefferson County resource guides also
most resources. As would be expected, the have information about several health care
Jefferson County Public Schools Resource services in Denver. Most of the Denver and
Guide has all the primary and secondary Jefferson County services were also iden-
public schools, as well as a few private tified by the cognitive mapping process.
schools in the county. Figure 6 shows there Information from the cognitive mapping
is very little additional information provid- process increases the resource base by
ed by the mapping sessions with regard to two-thirds, adding significantly to health
education resources in Jefferson County; care knowledge of Jefferson County.
however, the mapping identified a signifi-
cant number of resources in Denver that Three of the 24 resource categories have been
are known and used by Jefferson County provided above for sake of brevity. However,
residents. The Jefferson County education there were several trends that appeared in almost
resources not identified in the resource all of the resource categories. First, the Jefferson
guides were typically services such as County resource guides failed to provide adequate
breast-feeding classes located at hospitals, information. Except in education and childcare,
ESL classes available at community cen- the resource guides fell far short of the number
ters and/or libraries, and other educational of resources identified by the mapping participants.
services available at libraries, these impor- While there are many childcare resources in the
tant educational services are missing in Jefferson County information guides, there is
Jefferson County’s guides. little overlap between the childcare listed in the
• Health care resources: Of the 190 health Jefferson County guides and the childcare identi-
care resources, 140 were identified by the fied by the cognitive mapping. The GIS maps
cognitive mapping process with only one- effectively demonstrate such knowledge gaps.
third of the services identified by both par-

Figure 5. Childcare resources identified by participants vs. Jefferson County resource guides

694
Cognitive Mapping and GIS for Community-Based Resource Identification

Figure 6. Education resources: Cognitive mapping vs. resource guides

Second, there are a significant number of Third, opposite of the previous trend, Jefferson
resources in Denver County (east of Jefferson County resource guides provide mainly Denver
County) that providers and clients identify. Rea- locations for some types of resources, even though
sonable accessibility to Denver County, as well as the same resources exist in numerous places in
lack of availability of the resources in Jefferson Jefferson County. Available resources closer to
County, likely accounts for this trend. Building Jefferson County residents are a fundamental
a community-based SOC will require Jefferson component of SOC and, in this trend, require only
County to find ways to offer some of these services disseminating the information effectively, which
locally, a challenge that will require developing is a low-cost method to improve community-based
community partnerships to overcome the financial service delivery.
constraints which the County faces.

Figure 7. Health care resources: Cognitive mapping vs. resource guides

695
Cognitive Mapping and GIS for Community-Based Resource Identification

Finally, there is a large disparity in knowledge consequently, also lack the understanding to work
between clients and providers. With the exception effectively and efficiently with the researchers.
of 3 of the 24 categories, Education, Recreation, Furthermore, because social service agencies
and Commercial Resources, the providers and typically do not have a GIS analyst on staff, data
clients did not overlap significantly in knowledge and map files have limited usefulness beyond the
about resources. Providers know more about initial analysis as presented in the final report. Fi-
traditional resources such as other agencies or nally, human service agencies have organizational
governmentally-supported social services, while procedures that create significant barriers in imple-
clients know about resources of a less traditional menting research projects, barriers that need to be
nature, such as churches, motels, and parks where addressed in the project planning stages (Ventura,
teenagers gathered to socialize and engage in rec- 1995). Jefferson County Human Services suffered
reational sports activities. Although these informal from all three impediments and was exacerbated
resources are not referral services that providers by the high turnover of the staff. In the first year,
typically pass along to clients, they are important two-thirds of the project staff left. By the middle
community-based resources to share with clients. of the second year, only one person out of nine key
In creating a community-based SOC, providers project staff remained. Those who left included
need to be aware of the alternative methods cli- the project manager and the principal investigator,
ents use to meet their needs. In some instances, both of who had been replaced twice. Within 18
this new information will lead to the creation of months, none of the people who conceptualized
government/community partnerships to more and wrote the HHS grant were involved in the
effectively and efficiently deliver services. In project. Institutional memory was wiped clean and
other circumstances, the additional knowledge of new staff was unfamiliar and wary of many com-
resources will provide clients with options and/ ponents laid out in the grant proposal, including
or fill gaps in needs that traditional government the untraditional resource identification method.
and community providers cannot meet. Higher administrative support for the innovative
project waned, and “business as usual” reasserted
itself as the dominant paradigm. It became clear
LESSONS LEARNED that the resource database developed through the
mapping process would not be updated on a regular
Several problems directly and indirectly related basis and, perhaps, not disseminated throughout
to the GIS component of the project became ap- the organization if left to Jefferson County. The
parent and required adjustments to the procedures CIPP sought out a more stable organization to
or accommodations to the expected output. These house the resource data, Colorado 2-1-1, with the
include research procedures that are incompatible permission of the first project manager.
with social service agencies’ capacity, issues of Second, human service agencies as well as
client confidentiality, repeat rates, incomplete and/ educational institutions cannot share client/stu-
or inaccurate databases for coding resource loca- dent data. This presents a significant research
tions, coding protocols, and mapping accuracy. barrier when the project requires participation of
First, as has been found before, many county these populations. Ideally, individuals within the
and local agencies lack leadership that under- organizations would have both the access to the
stands the value of GIS in policy decision-making data and sophistication to manipulate the data in
(Greene, 2000; Nedovic-Budic, 1996; Ventura, accordance with standard research protocols. This
1995; Worrall & Bond, 1997). Hence, many agen- is unlikely to be the case in institutions which
cies lack the technical ability to employ GIS and, are financially strapped and lack the vision or

696
Cognitive Mapping and GIS for Community-Based Resource Identification

political will to invest in trained personnel and the most errors, as discovered when the address
needed research tools. To ameliorate these condi- was imported into ArcMap. A cross-check with
tions, project planning must include agreed-upon the phone books corrected most of these errors.
protocols for effectively and efficiently handling Nine percent of the mapping points were
confidential data. unidentifiable due to a combination of missing
Third, unique to this project was the creation information in MapQuest and the phone book, and
of a “repeat rate” to set a standard for data density. poor location information on the hand drawn maps.
The 80% repeat rate was selected for efficiency of The latter accounted for a greater proportion of
resources, based on an extrapolation of the aver- the unidentified points, especially resources such
age number of points per map and time needed to as neighborhood parks and unnamed resources
code and enter the data for each map. Unknown such as “soup kitchen.” Rather than rely solely
was how many participants/maps were needed to on participants naming the nearest cross streets
reach the 80% repeat rate in each of the 24 cat- to such resources, the closest known commercial
egories. Initially, the CIPP recommended target entity should be identified. This redundancy will
was 450 participants. This number was revised reduce missing data due to participant error in
downward by Jefferson County Human Services naming streets.
to a maximum of 250 participants. From the 247
actual participants, the 80% repeat rate was reached
in only two of the 24 resource categories. The FUTURE TRENDS
average repeat rate was 55% across all categories,
indicating that more than 250 participants were While this project was limited to identifying
needed to reach 80%. Whether 450 participants resources, spatial patterns of resource locations,
were ultimately required is unknown. More im- and knowledge gaps, the collected data can be
portantly, did the lower repeat rate significantly mined further. More specific uses can be created,
affect the quality of the project? Certainly, fewer such as a searchable Web-based provider resource
resources were identified at the 55% rate; but 1,480 database and the identification of physical and/or
resources not in Jefferson County resource guides service areas with inadequate resources in relation
were identified; not an insignificant contribution to socio-economic deprivation areas. The latter
to building a more comprehensive social services. allows providers to demonstrate specific needs,
Fourth, in the process of coding the maps important for several reasons, including the pursuit
and sorting the data to find repeated addresses of future programmatic funding. These specific
or groupings by type of provider, and so forth, it uses are described in greater detail as follows:
was discovered that precise alphanumeric coding
was critical. With the large number of data fields • Provider resource database: In the future,
(attributes) assigned to each participant, there were the Web-based database can be converted
inconsistencies in some of the categories. The data into a tool for social service providers to
cleaning was more extensive than anticipated. identify available resources and the most
Future projects should utilize numeric coding in accessible locations for clients (Worrall &
attributes to the fullest extent possible and develop Bond, 1997). The end user (a case-worker)
strict alphanumeric standards for addresses, orga- would be able to search for particular re-
nizational names, and other alpha fields. sources based on any number of criteria
Finally, to find resource addresses, MapQuest or a combination of criteria. For example,
and the Denver metro area phone book were used. one might enter necessary criteria such as
MapQuest was the most efficient method but had Rental Assistance Housing Resource lo-

697
Cognitive Mapping and GIS for Community-Based Resource Identification

cated within three miles of a given loca- duplication of efforts; (4) an understanding of
tion that also caters to Spanish-speaking the geographical distribution of resources; (5) the
clientele. After these attributes or criteria identification of resources lacking in the county
are entered into the appropriate locations and specific communities; and (6) knowledge
on the Webpage, a list of all the resources differences among diverse participant groups.
or providers that fit the criteria could be The addition of 1,480 resource locations within
retrieved, similar to the business name the seven study areas (only a portion of Jefferson
search feature available through a site such County) nearly tripled the number of resources
as MapQuest. Finally, digital maps could and services listed in the Jefferson County guides.
be generated with driving directions for the Ultimately, service delivery in SOC is about
case-worker to print out for the client. It is building partnerships across the multiple services
also possible to map the public transporta- and bringing in new, even sometimes untraditional,
tion routes to services. community partners. Family involvement is the
• Needs assessments: The database can be key in this collaborative arrangement. Similar to
used to conduct comprehensive, quantifi- untraditional community partners and resources,
able, and defensible needs assessments. families as partners do not fit easily within cur-
A social service provider administrator or rent social service delivery structures, values,
grant writer could search the data described and beliefs. Recognizing, valuing, and partnering
above in conjunction with Census data and with resource providers identified by clients and
the County’s client locations to reveal ar- community members is one important step toward
eas of need or areas of excess (Bond & shifting practices. Cognitive mapping with GIS
Devine, 1991; Worrall & Bond, 1997).6 A provides a tool for taking the first critical steps.
strategic plan could be developed to deter-
mine where a new office or access point for
a particular resource should be located to ACKNOWLEDGMENT
serve the greatest number of clients. This
type of spatial analysis based on quantifi- The author would like to thank April Smith,
able numbers and distances can be used to Department of Psychology, Colorado State Uni-
justify a particular course of action either versity, and Mary Tye, Department of Psychol-
for internal/external accountability or to ogy, Colorado State University, for running the
acquire funding for various projects aimed workshops and coding the data; David Wallick,
at community resource and social service Colorado Institute of Public Policy, Colorado
distribution. State University, for conducting the GIS analysis;
and Juliana Hissrich for providing administrative
support to the project.
CONCLUSION

Cognitive mapping combined with GIS analysis REFERENCES


is a powerful method for identifying community
resources by providing: (1) a comprehensive Bond, D., & Devine, P. (1991). The role of
database of existing services; (2) a basis to build geographic information systems in survey
communication networks and cooperation among analysis. The Statistician, 40, 209–215.
government and community providers; (3) the doi:10.2307/2348493
ability to create an efficient system that avoids

698
Cognitive Mapping and GIS for Community-Based Resource Identification

Daniels, K., & Johnson, G. (2002). On trees and Holahan, C. J., & Dobrowolny, M. B. (1978).
triviality traps: Locating the debate on the con- Cognitive and behavioral correlates of the
tribution of cognitive mapping to organizational spatial environment: An interactional analysis.
research. Organization Studies, 23(1), 73–81. Environment and Behavior, 10(3), 317–333.
doi:10.1177/0013916578103003
Evans, G. W. (1980). Environmental cogni-
tion. Psychological Bulletin, 88(2), 259–287. Horan, T., Serrano, K., & McMurran, G. (2001).
doi:10.1037/0033-2909.88.2.259 GIS for livable communities: Examiniation of
community perceptions of assets, liabilities and
Fridgen, J. D. (1987). Use of cognitive maps to
transportation improvements. San Jose, CA: San
determine perceived tourism regions. Leisure
Jose University, Mineta Transportation Institute,
Sciences, 9(2), 101–117.
College of Business.
Fulton, W., Horan, T., & Serrano, K. (1997). Put-
Jordan, T., Raubal, M., Gartrell, B., & Egenhofer,
ting it all together: Using the ISTEA framework to
M. J. (1998, July). An affordance-based model of
synthesize transportation and broader community
place in GIS. In Eighth International Symposium
goals. Claremont Graduate University, University
on Spatial Data Handling ’98 Conference Pro-
Research Institute, Claremont, CA.
ceedings, Vancouver, BC, Canada (pp. 98-109).
Greene, R. W. (2000). GIS in public policy: Us-
Kathlene, L. (1997). 29th street greenway corridor
ing geographical information for more effective
citizen survey panel: Results of mapping exercise,
government. Redlands, CA: ESRI Press.
phase 3. Minneapolis, MN: University of Min-
Hardwick, D. A., Wooldridge, S. C., & Rinalducci, neapolis, Humphrey Institute of Public Affairs.
E. J. (1983). Selection of landmarks as a correlate
Kathlene, L., & Horan, T. (1998). GIS survey of
of cognitive map organization. Psychological
29th street corridor, Minneapolis, MN. Minneapo-
Reports, 53(3), 807–813.
lis, MN: University of Minneapolis, Humphrey
Heagerty, P. J., & Lele, S. R. (1998). A composite Institute of Public Affairs.
likelihood approach to binary spatial data. Journal
Longley, P. A., Goodchild, M. F., Maguire, D. J.,
of the American Statistical Association, 93(443),
& Rhind, D. W. (2001). Geographic information
1099–1111. doi:10.2307/2669853
systems and science. New York: John Wiley and
Hjortso, C. N., Christensen, S. M., & Tarp, P. Sons, LTD.
(2005). Rapid stakeholder and conflict assessment
Lynch, K. (1960). The image of the city. Cam-
for natural resource management using cognitive
bridge, MA: MIT Press.
mapping: The case of Damdoi Forest Enterprise,
Vietnam. Agriculture and Human Values, 22, Magana, J. R., & Norman, D. K. (1980). Meth-
149–167. doi:10.1007/s10460-004-8275-z odological inquiry into elicitation procedures:
Cognitive mapping and free listing. Perceptual
Hobbs, B. F., Ludsin, S. A., Knight, R. L., Ryan, P.
and Motor Skills, 51(3), 931–934.
A., Biberhofer, J., & Ciborowski, J. J. H. (2002).
Fuzzy cognitive mapping as a tool to define Moeser, S. D. (1988). Cognitive mapping in a
management objectives for complex ecosys- complex building. Environment and Behavior,
tems. Ecological Applications, 12, 1548–1565. 20(1), 21–49. doi:10.1177/0013916588201002
doi:10.1890/1051-0761(2002)012[1548:FCMA
Nasar, J. L. (1988). The evaluative image of the
AT]2.0.CO;2
city. Thousand Oaks, CA: Sage Publications.

699
Cognitive Mapping and GIS for Community-Based Resource Identification

Nedovic-Budic, Z., & Godschalk, D. R. (1996). Uzzell, D., Pol, E., & Badenas, D. (2002). Place
Human factors in adoption of geographical infor- identification, social cohesion, and environmental
mation systems: A local government case study. sustainability. Environment and Behavior, 34(1),
Public Administration Review, 56, 554–567. 26–53. doi:10.1177/0013916502034001003
doi:10.2307/977254
Ventura, S. J. (1995). The use of geographi-
O’Laughlin, E. M., & Brubaker, B. S. (1998). cal information systems in local government.
Use of landmarks in cognitive mapping: Gender Public Administration Review, 55, 461–467.
differences in self report versus performance. doi:10.2307/976770
Personality and Individual Differences, 24(5),
Worrall, L., & Bond, D. (1997). Geographical
595–601. doi:10.1016/S0191-8869(97)00237-7
information systems, spatial analysis, and public
O’Neill, M. J. (1991). Evaluation of a con- policy: The British experience. International
ceptual model of architectural legibility. En- Statistical Review, 65, 365–379.
vironment and Behavior, 23(3), 259–284.
Yoshino, R. (1991). A note on cognitive maps: An
doi:10.1177/0013916591233001
optimal spatial knowledge representation. Jour-
Quaiser-Pohl, C., Lehmann, W., & Eid, M. (2004). nal of Mathematical Psychology, 35, 371–393.
The relationship between spatial abilities and rep- doi:10.1016/0022-2496(91)90053-V
resentations of large-scale space in children — a
structural equation modeling analysis. Person-
ality and Individual Differences, 36(1), 95–107. ENDNOTES
doi:10.1016/S0191-8869(03)00071-0
1
The project was supported by grant
Reich, R. M., & Davis, R. (2003). Spatial sta- #90CA1715/01, CFDA #93.570 from the
tistical analysis of natural resources (Tech. Rep. Federal Department of Health and Human
No. NR512). Fort Collins, CO: Colorado State Services through Jefferson County, Colo-
University. rado.
Sholl, M. J. (1987). Cognitive maps as orienting
2
The term cognitive mapping is used for a vari-
schemata. Journal of Experimental Psychol- ety of techniques, including “fuzzy cognitive
ogy. Learning, Memory, and Cognition, 13(4), mapping,” a technique that builds mental
615–628. doi:10.1037/0278-7393.13.4.615 maps of perceptions from focus-group and
interviews (Hjortso, Christensen, & Tarp,
Stroul, B. (1996). Profiles of local systems of care. 2005; Hobbs et al., 2002). In this project,
In B. A. Stroul and R. M. Friedman (Eds.), Systems cognitive mapping means hand-drawn
of care for children’s mental health (pp. 149-176). maps of tangible community resources and
Baltimore: Paul H. Brookes Publishing Co. locations, a geographical data collection
Tolman, E. C. (1948). Cognitive maps in rats and technique new to GIS.
men. Psychological Review, 55(4), 189–208.
3
Nine percent of the mapping points could
doi:10.1037/h0061626 not be accurately located and were dropped
from the analysis. Of the remaining 89%, two
Unger, D. G., & Wandersman, A. (1985). The possible location errors could occur in trans-
importance of neighbors: The social, cognitive, ferring the cognitive map information into a
and affective components of neighboring. Ameri- database for ArcMap. First, multiple coders
can Journal of Community Psychology, 13(2), could use different alphanumeric codes,
139–169. doi:10.1007/BF00905726 thereby making the same resource appear

700
Cognitive Mapping and GIS for Community-Based Resource Identification

as a different resource. To correct this error, mation and referral (I&R). 2-1-1 is an easy-
the data was cleaned by conducting sorts on to-remember and universally-recognizable
multiple columns in the excel spreadsheet to number that makes a critical connection
reveal unknown duplicates. For example, a between individuals and families in need and
search on “Research Name” might find the the appropriate community-based organiza-
same resource with inconsistent address tions and government agencies. Housing the
codes. If the address did not match exactly data with 2-1-1 allows statewide access to
(e.g., one was coded with “St.” and another resources and bi-annual updating to keep the
coded with “Street,” the coding was corrected information current. Colorado 2-1-1 system
to be consistent. Similar searches were done is the depository for the resources collected
on other categories such as street address, in this project. Web searchable database of
street name, and zip code. The data was resources can be found at http://211colorado.
cleaned accordingly. The second error was org/
from incorrect addresses in the MapQuest 5
CIPP provided Jefferson County with the
and/or Dex directory. The Dex directory is ethnic enclave areas based on the 2000 Cen-
the official metropolitan phone and address sus. The Asian communities fell outside the
directory and should have a high level of project boundaries set by Jefferson County
reliability; however, the actual reliability (see Figure 1) and, unlike Russians, Latinos,
rate is unknown. To correct for possible and Native Americans, Jefferson County did
errors, all identified social services not in not request mapping with the Asian com-
the Jefferson County resource guides (e.g., munity.
soup kitchens, English as a Second Language 6
For example, it might be found that 65% of
courses, support groups, etc.) were called all users of a certain type of resource (this
to verify the address. It was assumed that data would be collected by cognitive map-
the Jefferson County resource guides had ping alone) live “x” number of miles away
accurate information. (analysis performed by the GIS system) from
4
All identified resources were provided a particular needed or frequently-accessed
to Colorado’s 2-1-1 system, which is the resource (gathered through cognitive map-
national abbreviated dialing code for free ping and other sources).
access to health and human services infor-

701
Cognitive Mapping and GIS for Community-Based Resource Identification

APPENDIX

Only forty percent of the participants provided demographic information, which limits the ability to
determine the gender, age, and ethnicity/race of the participants. However, there is no way to determine
the representativeness of the sample on these traditional demographics since the population character-
istics are unknown. Even among the clients, the demographics are not available because most of the
client records were incomplete. Unlike many social research projects, demographic representation is
less of a concern. For the identification of resources, a cross-section of the types of people who use or
provide services and the geographical distribution of their knowledge was most important, of which
both criteria were met.

Table 2. Demographics of participants (n=100)

Demographic All participants Providers Clients Community


characteristic (n=100) (n=19) (n=72) Residents (n=9)
Number and percent female 85% 90% 82% 100%
Average age 34.39 39.75 31.86 44.83
Number and percent Caucasian 62% 68% 64% 33%
Number and percent Latino 19% 5% 24% 11%
Number and percent African American 6% 0% 4% 33%
Number and percent Native American 9% 21% 4% 22%
Number and percent Other 4% 5% 3% 0%

This work was previously published in Emerging Spatial Information Systems and Applications, edited by B. Hilton, pp. 326-
350, copyright 2007 by IGI Publishing (an imprint of IGI Global).

702
703

Chapter 43
Complementary Part
Detection and Reassembly
of 3D Fragments
Vandana Dixit Kaushik
Harcourt Butler Technological Institute, India

Phalguni Gupta
Indian Institute of Technology Kanpur, India

ABSTRACT
This chapter presents an algorithm for identifying complementary site of objects broken into two parts.
For a given 3D scanned image of broken objects, the algorithm identifies the rough sites of the broken
object, transforms the object to a suitable alignment, registers it with its complementary part which
belongs to the same object, and finds the local correspondence among the fragmented parts. The pre-
sented algorithm uses multiple granularity descriptors and edge extraction to detect the exact location
of multiple cleavage sites in the object. It greatly reduces the amount of information to be matched and
also helps in identification of the parts; as a result it reduces the computational time in the processing.
It is also applicable to all triangulated surface data even in the presence of noise.

INTRODUCTION 2D image processing, it is essential to have the


good quality of image which is dependent on the
Recent advances in the field of computer vision, image acquisition system and the environment.
image processing and pattern recognition have Generally, 2D methods are found to be efficient
led to substantial research in object identifica- under controlled environment where the input
tion and feature extraction from 2D images for images are acquired in almost fixed orientation,
different applications. In order to design efficient distance, camera calibration, illumination, fixed
methods for solving problems in the field of atmospheric condition etc. However, in reality it
is expected that the acquired images which are to
DOI: 10.4018/978-1-4666-2038-4.ch043 be processed do not possess good quality and fixed

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Complementary Part Detection and Reassembly of 3D Fragments

environment. Further, information available in 2D • Arbitrary number of fractured faces per


images may not be sufficient to generate needful fragment
result for a given application. This has motivated • Missing fragments
people to explore the possibility of analyzing 3D • Probably flawed or weathered surfaces
images. The most obvious difference between a • Non-existence of strict assemblage rules
3D image and a 2D image of a particular object
is its depth information. The 2D image provides Solutions to any assembly problem should
only monocular depth information while a 3D be able to handle above mentioned challenges.
image consists of both monocular and binocular The problem considered in this chapter can be
depth information. This binocular depth informa- formally defined as follows: Consider a 3D im-
tion should also be considered for the study on age consisting of multiple objects scanned using
images. Moreover, a 2D image is very sensitive to a 3D laser scanner and each object is broken
changes due to illumination or lighting conditions into two pieces. The problem is to detect the
and change in orientation of image being captured. complement of a broken part of a given object.
Since any 3D image considers the overall geom- Detection of complementary parts in the multiple
etry of the image being captured, therefore, it is broken objects is an important phase during the
insensitive to the above mentioned changes. How- reassembly. Here, the rough site/ cleavage site in
ever, there are many challenges associated with any fragment is identified and then an attempt is
3D images; out of which handling large volume made to detect its complementary rough site in
of data, high computational cost to process these the other fragment. The detected complementary
data, inconvenient 3D acquisition systems and the parts are aligned for reassembly. For example
digital archiving of 3D image data (Terry, 1995) Figure 1 shows the multiple broken fragments.
are most important. There exists a large number Fragment A and A’ belong to one object. This
of 3D scanners (Boehler, & Marbs, 2002) that chapter presents an algorithm for the detection
can be used to scan any type of objects - from a of A and A’ which are the complementary parts
small-sized object to a life-sized artifact (Levoy, of same object. The result of detection is shown
2000). Features are extracted from this scanned in Figure 2 (a) and (b) respectively.
data for further processing.
This chapter deals with the problem of detect-
ing the complementary part and to reassemble the BACKGROUND
broken 3D fragments. It has many applications in
the fields like archeology where multiple broken There have been lot of studies in the area of reas-
fragments are obtained from the sites and parts sembly and reconstruction of broken 3D objects
belonging to the same object are to be identified.
Along with this, automatic reassembly is required
in art restoration, forensics, computer-aided de- Figure 1. Multiple broken fragments
sign, chemistry, medicine etc. (Barequet, & Sharir,
1997). Generally, the reconstruction of arbitrary
objects from their fragments may have many
constraints. Some of these are mentioned below.

• Arbitrary shapes of parts (fragments)


• Unknown shape and number of the final
objects

704
Complementary Part Detection and Reassembly of 3D Fragments

Figure 2. (a) Fragment A (b) Fragment A’ [Complementary of A]

due to advances in the field of 3D scanners and as mesh simplification (Bolin, & Meyer, 1998,
3D imaging devices. In the last decade there have Lindstrom, & Turk, 2000, Nicolas, et. al. 2002,
been significant technological advances in the Scopigno, et. al., 1998, Williams, et. al., 2003) and
design of tools for digitizing and modeling 3D perceptually guided rendering. Mesh simplifica-
shape of objects. Modeling of 3D shapes also tion aims at reducing the number of vertices of a
includes regeneration and reassembly of broken polygonal mesh model while preserving as much
objects. Given multiple pieces of some 3D scanned as possible its visual appearance. In general, the
objects, the problem considered in this chapter is simplification process is driven by a similarity
to detect all complementary parts of each broken metric that measures the impact of changes of the
object and to reassemble these detected parts. model after each simplification step. Hausdorff
In order to identify and reassemble the comple- distance is often used as a metric to evaluate the
mentary parts, it considers the property of broken distortions due to mesh simplification. Nicolas et.
fragments which leave the cleavage site rougher al. have proposed two tools for geometric mesh
as compared to remaining surface and based on comparison (Nicolas, et. al. 2002). This chapter
this property, the rough site of each broken frag- takes triangular meshes representing the surfaces
ment is determined. in terms of triangles [known as facets] as inputs
The work described here is mainly based on an and determines all rough facets [obtained through
efficient surface inspection algorithm for cleavage/ some threshold value] which form the cleavage
rough site detection in broken 3D objects. The site of the fragment. This cleavage site is used to
algorithm captures differences in the geometric detect the complementary part of a given broken
structure in different parts of the 3D object rep- object. The meshes representing surfaces at dif-
resented by polygon meshes. Most of the work ferent levels are shown in Figure 3(a-d)
in the field of roughness analysis of 3D meshes Wu, et. al. have obtained the roughness of a
has been done in context of visualization, finding single facet using a dihedral angle based approach
surface defects to assess the visual quality of water- (Wu, et. al., 2001). In this work, some conclusion
marked 3D objects. Rogowitz & Rushmeier have can be drawn on cleavage sites on the surfaces
proposed the perceptual metric which is based on by assigning a roughness level to each facet. This
the estimation of roughness of the surface of the metric does not take into account the scale of
watermarked 3D objects (Rogowitz, & Rushmeier, roughness. In other words, The proposed metric
2001). Perceptual considerations have been used measures ‘bumpiness’ of a surface with granular-
in Computer Graphics for various operations such ity close to the size of single facet and this is one

705
Complementary Part Detection and Reassembly of 3D Fragments

Figure 3. Meshes at different levels


the area of neighboring facets. It does not take
into account the direction of neighbouring facets
which is very important as this may lead to wrong
detection of smooth surface as rough surface. In
order to explain it more clearly, let us consider
Figure 5 representing a smooth surface and a
rough surface. Corsini et. al. have ignored the
direction of neighbouring facets to determine the
roughness value and it provides same roughness
value for both the cases. Since both the metrics
discussed above focus on perceptual quality of
3D meshes, they may not be suitable for cleavage
site detection. However, Kaushik, et. al have
proposed a metric which makes uses of roughness
detection at single facet level along with different
granularity levels (Kaushik, et. al., 2007). Further,
it also takes into account the direction of neigh-
bouring facets thereby assigning a different rough-
ness values to a smooth surface and a rough
major drawback of this metric. Corsini et. al. have
surface. Therefore, it is found to be suitable for
proposed a new metric which takes into account
cleavage site detection.
multi-scale roughness along with the roughness
The detected cleavage sites in 3D fragments
at single facet level (Corsini, Gelasca, Ebrahimi,
help to reassemble 3D broken objects. This chap-
2005). Multi-scale roughness refers to bump size
ter first determines cleavage site or sites in broken
in a rough surface. For example, bumps or grains
3D scanned fragments, and then identifies comple-
in a brick are larger than those in a ceramic mate-
mentary parts in two broken objects at the cleaved
rial. Figure 4 illustrates the rough surfaces at
sites which reduce the area to be matched con-
various levels of granularity.
siderably and finally proposes a new process to
Generally, roughness of a facet can be obtained
register these complementary parts for reassembly.
as a function of the dihedral angles made with
It can be noted here that registration plays an
neighboring facets, variation of these dihedral
important role in 3D model acquisition, object
angles and the area of facets under consideration.
recognition, and geometry processing. For the
In order to obtain the roughness of a facet, the
two sets of points representing two scanned frag-
above said metric only considers the area of
ments, the registration process tries to find suitable
facet whose roughness is being calculated and not

Figure 4. Roughness at various levels of granu- Figure 5. Different surfaces


larity

706
Complementary Part Detection and Reassembly of 3D Fragments

Figure 6. Registration process

transformation composed of a rotation matrix and graphics applications. Since the scanner can scan
a translation vector that aligns a scanned fragment the objects having maximum height of 16 inch and
to get best match with another one. Figure 6(a) diameter of 10 inch, we have selected fragments
shows two sets P and Q of points. After applying accordingly. It has a door to insert objects to be
the registration process the set P of points is aligned scanned to place on a rotating plate. It scans the
to Q as shown in Figure 6(b). object and creates a 3D point cloud data. Figure
There exist several well known algorithms for 7 shows the image of the scanner.
registration. Most of them are based on Iterative
Closest Points [ICP] (Besl, & McKay, 1992).
Masuda, & Yokoya have used Least Median of ROUGHNESS DETECTION
Squares (Rogowitz, & Rushmeier 2001) in the DESCRIPTOR
ICP based algorithm to overcome the original
method’s intolerance to outlier data (Masuda, & An efficient algorithm for detecting the rough
Yokoya, 1995). Even though the above algorithms site of a broken fragment has been presented. It
along with ICP can work for registration of also considers the direction of the facets properly
similar surfaces, but they may not work for and therefore the desired rough site is obtained
complementary surfaces. Another drawback with efficiently. The proposed algorithm consists of the
the ICP is that it is very slow to determine the
correspondence between two sets of points and
may not give real time results. Also, the ICP based Figure 7. Roland LPX-600 3D laser scanner
algorithm involves processing of all the points in
the set which makes the processing time ex-
tremely high.

SCANNING OF FRAGMENTS

In order to scan fragments of objects, we have used


Roland LPX-600 3D laser scanner which is found
to create accurately digital information from an
object for 3D computer aided design and computer

707
Complementary Part Detection and Reassembly of 3D Fragments

following major steps. Firstly, the roughness factor a cleavage site in a broken object obtained from
for each facet both at single granularity and mul- an archaeological site.
tiple granularity level is calculated. Secondly, the A rough surface is the surface with high con-
facet’s final roughness factor is computed which centration of ‘bumps’ of different sizes (Corsini,
can be defined as the maximum value of rough- Gelasca, Ebrahimi, 2005). It is difficult to define
ness factors of the facets at different levels. Then, a suitable metric that can help to differentiate
using a roughness histogram a dynamic roughness between roughness due to natural texture and
threshold is obtained to mark portions of surface as roughness due to cleavage. Since most fragments
rough. In this process due to the presence of noise, obtained from an archeological site are weathered
a smooth portion may be incorrectly marked as and eroded, it becomes difficult to ascertain
rough. Therefore, in order to avoid such wrongly whether roughness is due to the natural texture
detected rough portions from the broken fragment, or not. However, there exist fundamental differ-
a roughness consolidation algorithm along with ences in roughness generated through natural
a noise removal algorithm is used. The detailed texture and roughness due to cleavage. Following
description to calculate roughness factor at each are the observations about rough surfaces obtained
step is given below. through experiments.

Measuring Roughness 1. Composition of materials in the fragment and


the effect of external agents in the weathered
Roughness can be defined in terms of dihedral fragments are the major reasons for natural
angles (Wu, et. al., 2001), stochastic models (Jafari, roughness of a surface. It can be noted that
2003, Waechter, et. al., 2004) and fractals (Arrault, weathering agents have a smoothing effect
Arnéodo, Davis, & Marshak, 1997) which are all on surface texture.
based on perceptual quality of a mesh or facet. 2. Bumps of different sizes follow Gaussian
However, since primary aim of cleavage detection distribution.
algorithm is to decrease computations, perfect 3. In natural unbroken surfaces, over a long
roughness modeling is not required. Keeping this measurement range, most of the smaller
issue in mind, the dihedral angle based approach bumps average out each other while the
to determine roughness factor has been modified larger bumps are retained. This is shown in
which is found to be very efficient. Figure 8 shows Figure 9.
4. Natural rough surfaces obey fractal property
which is computationally complex and is
Figure 8. Cleavage site in a broken object difficult to use.
(Kaushik, et. al., 2007) 5. In case of a cleaved surface, high roughness
both at lower and at higher granularity levels
is visible.

Computation of Roughness of Single


Facet

Consider a regular triangular mesh M. Let F be


the facet or the triangle under consideration. An
adjacent/ adjoining facet of F is the facet which

708
Complementary Part Detection and Reassembly of 3D Fragments

Figure 9. Example of single and multilevel roughness (Kaushik, et. al., 2007)

Figure 10. Adjoining Facets (Kaushik, et. al., way to estimate the roughness factor of a facet is
2007) by considering areas of the facet and its adjoining
facets along with their corresponding dihedral
angles. Thus, if R(Fi ) is the roughness factor of
the facet Fi, then R(Fi ) can be expressed as

R(Fi ) = fn{ ωij * ( AF . ; AF . )}


i j

where AF . is the area of facet Fi under consider-


i

ation and AF . is the area of facet Fj, ∀ j = 1, 2, 3.


j

It may be observed that the value of ωij can


be very small. Hence, even though there exists
a roughness of a facet with respect to its neigh-
shares one edge with the facet F and is shown in bouring facets, but the difference between the
Figure 10. roughness factors of a smooth facet and a rough
Let θ be the dihedral angle between two facets facet may be very small. Thus, to highlight the
Fi and Fi. Finding the value of the angle θ involves cleavage site consisting of such type of rough
inverse trigonometric function which is compu- facets, we need to use proper multiplicative factor
tationally expensive. However, the value of θ can to upgrade the roughness factor R(Fi).
be found through the angle function which is In this work, we have considered the rough-
given by (1 - cos θ). Let N F . and N F be the ness of facet Fi, R(Fi), as the product of M(Fi)
i j

normals of the facets Fi and Fj respectively. Then and V(Fi) as


ωij is given by
R(Fi) = M (Fi) ∗V (Fi) (2)
 N N 
 Fi . Fj 
wij = 1 −   (1) where M(Fi) and V(Fi), can be computed by
 N Fi . | . | N Fj . 
1
The value of the dihedral angle ωij is lying be-
M(Fi ) = ∑ (ωij ) * (AFi + AFj)
3 Fj ∈A(Fi )
(3)

tween 0 and 1. Smaller the value of ωij, smoother


is the surface (i.e. roughness of a facet with respect
to its adjoining facets is near zero). One simple

709
Complementary Part Detection and Reassembly of 3D Fragments

1 1
V(Fi ) = ∑ (ωij.(AFi +AFj ) − M (Fi))2 R N (Fi) = N ∑
| S F | Fi ∈SFN
(−1)1+Zij R(Fj ) (6)
Fj ∈A(Fi ) 3
(4)
where | S FN | is the number of facets in (N,N)-ring
It can be observed that value of R(Fi ) is not
and the Boolean variable Zij indicates whether the
fixed. Thus, to apply a threshold for roughness,
facet is downward sloping or upward sloping, i.e.,
there is a need to normalize the value of R(Fi ) to
the interval [0,1]. This can be done by
1 if the facet Fj is upward sloping with respect to Fi
Zij = 
0 if the facet Fj is downward sloping with respect to Fi

R(Fi) − min(R(Fi))
R '(Fi) = (5)
max(R(Fi)) − min(R(Fi))
Zij can be computed by

Multi-Granularity  NFi.(CFj − CFi) 


Roughness Estimation Zij = 1 + (7)
 | NFi | . | (CFj − CFi) |

Corsini et. al. have defined the rough surface as
the surface having high concentration of steep where ⌊.⌋ is the floor function, CFi and CFj are the
crests and troughs (Corsini, Gelasca, Ebrahimi, centroid of facet Fi and Fj respectively. It can be
2005). If one considers multi-scale roughness noted that the complementary (upward and down-
i.e. bump size which is exceeding the dimension ward sloping) surfaces local bumps, as shown in
of single facet, then the effects of local crest and Figure 11, may cancel each other’s effect and this
an adjacent local trough cancel each other by leads to contribute higher scale roughness to the
making minimum effect on overall bumpiness bump. As a result, it leads to provide larger scale
of the considered collective facets. Thus, the idea roughness to the bump.
of roughness estimation can be extended from Also, any surface, which is looking smooth to
roughness of single facet to N levels of rough- the naked eye, may often be found to be rough
ness estimation which is known as estimation of under microscope. This type of surface is an ex-
multi-granularity roughness. ample of low scale roughness. Again, the surface
Let a (N,1)-ring neighborhood of a facet F of sandstone is an example of high scale rough-
comprise of facet F and its adjoining facets. One
can create a (N,2)-ring neighborhood from the
(N,1)-ring by adding all of the facets sharing a Figure 11. Bumps with different scales (Kaushik,
common edge with the (N,1)-ring facets. If the et. al., 2007)
process is repeated by K times, a set of all facets
within the (N,K)-ring neighborhood can be cre-
ated. The value K is called the clustering radius
of (N,K)-ring. Suppose, S FN is the set of all facets
within the (N,N)-ring neighbourhood where N is
the radius of (N, N)-ring for the facet F. Roughness
of (N,N)-ring or estimation of multi-granularity
roughness at level N is given by

710
Complementary Part Detection and Reassembly of 3D Fragments

ness. However, it can be noted that this scale is minimum value. A facet is marked as rough if its
also dependent on the resolution of the scanner. roughness value is greater than the threshold H.
Figure 14 shows the roughness histogram of
Locating Rough Site at the object considered in Figure 13. Note that the
Multiple Roughness Levels thick line in the center represents threshold value
H which is used to determine the cleavage site.
The values of R(F) which are obtained using It is shown in the highlighted area of Figure 12.
Equation (5) and Equation (6) provide the rough-
ness factors of the single granularity and multi- Consolidation of Roughness
granularity level respectively. It can be noted Patches
that the value of R(F) is directly related with the
roughness, i.e. more the value of R(F), more is the Due to the presence of noise, some small spuri-
roughness. Thus the cluster of faces which is hav- ous patches on smooth area can be formed. It is
ing the values of roughness more than roughness important to remove such type of noise from the
threshold can be found in the surface and that can
be labeled as rough area. Hence the cleavage site
can be determined which is shown in Figure 12. Figure 13. Source surface used to generate Figure
12 (Kaushik, et. al., 2007)
Roughness Histogram

These roughness values of all facets can be used


to compute its histogram. For a particular rough-
ness granularity, the roughness distribution in
that fragment can be seen from the computed
histogram. It can be noted that roughness value is
always found to be the least at and near the centre
position. The roughness threshold H is selected
from the roughness level corresponding to this
Figure 14. Roughness histogram (Kaushik, et.
al., 2007)
Figure 12. Roughness values of facets and high-
lighting cleavage site using threshold as 0.5914
(Kaushik, et. al., 2007)

711
Complementary Part Detection and Reassembly of 3D Fragments

cleavage site. Suppose, there are Ψ rough facets in algorithm is O(n) where n is the number of facets
an (N,K)-ring having T facets. If the value of Ψ is in the cleavage site.
very small then one can assume that all T facets in
that ring are smooth. In other words, all T facets
of an (N,K)-ring in a fragment are considered to CLEAVAGE SITE DETECTION
be smooth if for a predefined threshold value u,
0<u<1, it satisfies the following: During the detection of the rough surface, the
edge facets are also detected as rough because of
Ψ<T*u. their higher dihedral angle and angle function. A
cleavage site from a rough surface of a given 3D
Again, it can be observed that for any given broken object is determined by removing these
threshold value H obtained from roughness histo- edge facets. This approach can be used to find all
gram, the roughness detection algorithm may not cleavage sites of a 3D broken object.
produce continuous roughness areas in an (N,K)-
ring; i.e. some scattered roughness patches, say Ψ, Edge Descriptor
can be found in (N,K)-ring through this algorithm
as shown in Figure 10. A continuous rough area A hypothetical ball of (N,N)-ring diameter is
which forms a cleavage site can be obtained by moved over the entire surface of the object and the
merging these scattered patches. All T facets of number of facets lying within it is used to extract
an (N,K)-ring can be marked as rough if for some edges. The ball is centered at the midpoint of every
predefined threshold v lying between 0 and 1, it facet and the area lying within the ball is used for
satisfies the following edge detection. The edge descriptor, E(Fi) for the
facet Fi can be defined mathematically as follows
Ψ>T*u.
E (Fi) = ∑ A(Fj ) ∀j ∋ Dist(Mid (Fj ), Mid (Fi))< R
Based on the above discussion the algorithm (8)
for roughness detection of a single facet F of a
broken surface has been given in Algorithm 1. where R is the radius of the ball, A(Fj) is the area
It can be noted that the time complexity of the of an adjacent facet of Fi, Mid(Fi) is the midpoint
of facet Fi and Dist(X,Y) is the distance between

Algorithm 1. Roughness-Detection (F)

Step 1: Compute the roughness factor at single level R(Fi) using the formula
R(Fi) − min(R(Fi))
R '(Fi) =
max(R(Fi)) − min(R(Fi))
where R(Fi ) is defined in Equation (2)
Step 2: Compute the roughness factor at multiple levels using the following formula
1
R N (Fi) = N ∑
| S F | Fi ∈SFN
(−1)1+Zij R − (Fj )

Step 3: Generate the roughness histogram.


Step 4: Select threshold roughness value H.
Step 5: Mark facets having roughness more than computed threshold.
Step 6: Consolidate all rough patches and remove noise to detect rough surface.

712
Complementary Part Detection and Reassembly of 3D Fragments

point X and Y. If E(Fi) > T, then Fi is an edge facet register the corresponding complementary parts
where T is the predefined threshold. of the objects in the registration phase.
The cleavage site is detected by removing all Let Si be a rough surface and Fj be a facet in Si
detected edge facets from its corresponding rough then the distance between two rough surfaces can
surface. Figure 15(a) shows the broken 3D object be defined as the Euclidean distance between their
while Figure 15(b) shows the detected rough sur- closest facets. The rough surfaces with distance
face of applying roughness detection algorithm less than the predefined threshold form a cluster
and Figure 15(c) gives the rough surface after Cm. Thus, a cluster Cm can be defined as
removing its edge facets.
Cm = U kn=0Sk : : ∀Si ∃ Sj ∋ Dist(Si, Sj) < threshold
Clustering where 0 ≤ i,j ≤ n (9)

Clustering is used to obtain the multiple cleavage


These clusters Cm represent different cleavage
sites (if any) in the broken 3D object. The process
sites in the broken 3D object. Figure 16(a) shows
of grouping a set of physical or abstract objects
the broken object with two cleavage site while
into classes of similar objects is called cluster-
Figure 16(b) shows the result of clustering on a
ing. More clearly, a cluster is a collection of data
broken object.
objects that are similar to each other within the
same cluster and are dissimilar to the objects in
other clusters. That is, a cluster of data objects
COMPLEMENTARY PART
can be treated collectively as one group.
DETECTION
Advantages of clustering are that it is adaptable
to changes and helps to identify useful features
The complementary part detection problem is to
that distinguish different groups. In this case
identify parts of P and Q of a given 3D object that
groups are multiple cleavage sites and the feature
represents the same physical part of the scanned
that distinguishes them is the minimum distance
object. The correspondence descriptor is used to
between them. The Euclidean distance among all
obtain the complementary parts from multiple
closest pairs of rough facets is used to determine
broken objects.
the distance between two cleavage sites. If the
Assume that the objects are broken into two
distance is lying within some predefined threshold,
parts, P and Q, and parts are point-sampled. Let
both the cleavage sites are considered to be a single
parts P and Q consist of n points {p1, p2,.…., pn}
cluster. This process creates a number of clusters
and {q1,q2 …..,qn} respectively. A pair of points
representing each cleavage site in the broken part
from P and Q is in correspondence by assigning to
of the object. These clusters are then processed to
pi a corresponding point qi. This descriptor, termed

Figure 15. Detection of cleavage site from broken 3D object

713
Complementary Part Detection and Reassembly of 3D Fragments

Figure 16. Detected cleavage sites after clustering

as correspondence descriptor, is the mean of the facets with their corresponding area. Let there
Euclidean distances of facets from the center of be n facets in the cleavage site. Then average of
the cleavage site which is given by normals is given by

Cd =
∑E F
n

∑N A k k
n N = k =1
(11)
n

where n is the total number of facets in the cleavage


∑A k
k =1
site and EF is the Euclidean distance of all facets
from the center of a given cleavage site. where Nk and Ak are the normal and the area of
the kth facet in the cleavage site. Once the plane
is fitted, it needs to be aligned parallel to a refer-
REGISTRATION ence plane which in this case is the ZX-plane, as
shown in Figure 18.
After detection of complementary cleavage sites,
these complementary parts need to be aligned so Aligning Cleavage Sites
that they can be registered to regenerate the original
object. This phase registers the cleavage sites of Cleavage sites which are in a common orienta-
the detected complementary parts by realignment tion, i.e. parallel to ZX-plane, further needs to
and translation of the broken parts. The steps for be aligned such that one cleavage site is above
registration are discussed here. the ZX-plane while other one is below it. This is
done by considering two points having maximum
Changing to Common Orientation and minimum values of the Y-coordinate on each
cleavage site. Figure 19 shows these coordinates.
The initial orientation of the complementary The cleavage sites are aligned first by translating
cleavage sites is random; hence they need to be the minimum Y-coordinate values of both the
transformed to a common orientation. The regis- cleavage sites with respect to the origin and then
tration phase first aligns the cleavage sites of the by matching the corresponding maximum values
broken fragments parallel to the ZX-plane. This of Y-coordinate so that these two aligned cleavage
is done by fitting a plane on the cleavage site as sites can be registered. Aligned cleavage sites are
shown in Figure 17. The plane is fitted by taking shown in Figure 20.
weighted average of the normals of the rough

714
Complementary Part Detection and Reassembly of 3D Fragments

Figure 17. Initial orientation of fragments

Figure 18. Two cleavage sites in common orientation with respect to ZX-plane

Registration of Cleavage Sites registration of two complementary cleavage sites


as shown in Figure 21.
To register the aligned complementary cleavage Based on the above discussion the algorithm
sites, the corresponding maximum Y-coordinates for registration of complementary cleavage sites
are matched. This is achieved by rotating one of of broken 3D fragments surface has been given
the objects about Y-axis with its axis perpendicular in Algorithm 2. It can be noted that the time
to the plane fitted on the cleavage site passing complexity of the algorithm is O(n) where n is
through the minimum Y-coordinates. the number of facets in the cleavage site.
Consider a vector for each cleavage site which
is formed by joining the maximum and minimum
Y-coordinates. Then the angle between the pro- EXPERIMENTAL RESULTS
jections of the two vectors of the corresponding
cleavage sites on the ZX-plane (by the dot prod- The proposed roughness detection algorithm has
uct method) is found. Further, one cleavage site been tested on more than 50 scanned fragments.
is rotated by that angle about the Y-axis so that All the fragments are scanned using 3D laser LPX
the corresponding maximum Y-coordinates are 600 Roland scanner. The results have shown that
matched with each other. This process makes the the proposed algorithms are well suited for detect-

715
Complementary Part Detection and Reassembly of 3D Fragments

Figure 19. Cleavage sites with maximum and minimum Y-coordinate

Figure 20. Aligned cleavage sites

Figure 21. Registered cleavage sites

716
Complementary Part Detection and Reassembly of 3D Fragments

Algorithm 2. Registration

Step 1: Detect the cleavage sites of broken 3D fragments following the steps given below
          Step 1.1: Use Roughness Detection algorithm as discussed above

          Step 1.2: Determine cleavage site from a rough surface by removing these edge facets as discussed above

          Step 1.3: Use clustering to obtain all multiple cleavage sites in the broken 3D object using the method discussed

Step 2: Determine the complementary parts of the detected cleavage sites using the correspondence descriptor given by

Cd =
∑E F

n
Step 3: Bring both complementary cleavage sites to common orientation by using the following steps

          Step 3.1: Fit a plane on each cleavage site by taking weighted average of the normals of the rough facets with their corresponding
area which is given by
n

∑N A k k

N = k =1
n

∑A k
k =1

          Step 3.2: Using these planes fitted on the cleavage sites, align them parallel to the ZX-plane

Step 4: Align the cleavage sites such that one cleavage site is above the ZX-plane while other one is below it using

          Step 4.1: Consider two points having maximum and minimum values of the Y-coordinate on each cleavage site

          Step 4.2: Translate the minimum Y-coordinate values of both the cleavage sites with respect to the origin

Step 5: Register the aligned complementary cleavage sites by following the steps given below

          Step 5.1: Form a vector for each cleavage site by joining the maximum and minimum Y-coordinates

          Step 5.2: Compute the angle between the projections of the two vectors of the corresponding cleavage sites on the ZX-plane

          Step 5.3: Rotate one cleavage site by the angle computed in the above step about the Y-axis in order to match the corresponding
maximum Y-coordinates

ing cleavage sites in broken objects in a much values more than H are highlighted which form
accurate and efficient manner. To illustrate the a rough site as shown in Figure 12.
performance of the algorithm, the original frag- In order to illustrate the consolidation of
ment’s surface as marked in Figure 13 has been roughness patches, another example is considered.
considered. The calculated roughness values of a First the value of clustering radius K is chosen
single facet as well as multiple facets are shown according to the size of model and the values
in Figure 12. Using these roughness values of all of u and v have been experimentally estimated
the facets, a roughness histogram is constructed for consolidation and noise removal as shown
as shown in Figure 14. The minimum value near in Table 1. The values of u and v depend on the
the center is chosen as the threshold value with type of material. Table 1 depicts these values for
H = 0.5914. Now all the facets having roughness half baked clay fragments of different sizes. Since
values for a particular material are applicable for

717
Complementary Part Detection and Reassembly of 3D Fragments

Table 1. Values of u and v for varying clustering


The roughness is calculated at (N, N)-ring radius
radii (K)
of 2.
K V U From each of the above two earthen-ware
3 0.44 0.22
fragments, only one cleavage site is detected.
4 0.50 0.16
There may be more than one cleavage site in a
broken fragment. As shown in Figure 24 there are
5 0.60 0.10
two cleavage sites in the scanned fragment. The
detected cleavage sites are shown in Figure 25.
To detect the complementary parts, each of 50
all materials having similar texture, so these values
scanned 3D objects has been broken into two parts
need not be found each time.
and the cleavage site of each of these fragments
Figure 22 shows two original earthen-ware
is identified. Sample of broken objects is shown
fragments. The roughness values at single facet
in Figure 26. Table 2 shows some results of the
level and multiple facet level are computed. Cor-
broken objects after applying the proposed algo-
responding fragments after roughness consolida-
rithm. The algorithm could detect all the comple-
tion and facet de-noising are shown in Figure 23.
mentary parts among the objects successfully.

Figure 22. Two original fragments

Figure 23. Roughness detected fragments

718
Complementary Part Detection and Reassembly of 3D Fragments

Figure 24. Fragment with two cleavage sites

Figure 25. Corresponding cleavage sites of fragments shown in Figure 24

Registration of these detected complementary


parts is done in order to reassemble them at their
Table 2. Broken objects and their complementary
cleavage sites. In order to illustrate the results of
parts
registration, four experiments are given.
3D Broken Object Complementary Part
Experiment 1

The first object considered for our experiment is


an earthenware rectangular object having height
2.0”, width 0.5” and length 1” approximately.

Figure 26. Multiple broken fragments

719
Complementary Part Detection and Reassembly of 3D Fragments

Figure 27. First fragment with its cleavage site

Figure 28. Second fragment with its cleavage site

The object is broken into two fragments. Figure object is broken into two fragments. Figure 30
27 and Figure 28 show the fragments with their shows the first fragment with its detected cleavage
corresponding detected cleavage site. Figure 29 site while the other fragment with its cleavage site
depicts both the fragments after registration. is given in Figure 31. Both the fragments with their
respective detected cleavage sites are registered
Experiment 2 and its result is shown in Figure 32.

Next object considered in our experiment is a clay Experiment 3


pottery which has the size of approximately 1.5
inches in height and 0.5 inches in diameter. This In this experiment, we have also considered a clay
pottery. It is approximately 2.5’’ in height and
Figure 29. Registration of both fragments 1’’ diameter. This object is also broken into two
fragments. Both the fragments along with their
respective cleavage sites are shown in Figure 33
and Figure 34. Registered fragments are shown
in Figure 35.

Experiment 4

The object considered in this experiment is a piece


of chalk. The chalk is broken into two fragments.
It can be noted that this chalk piece is a good

720
Complementary Part Detection and Reassembly of 3D Fragments

Figure 30. First fragment with its cleavage site

Figure 31. Second fragment with its cleavage site

Figure 32. Result after registration example of multi-granularity roughness. Frag-


ments with their respective detected cleavage sites
are shown in Figure 36 and Figure 37 while the
object obtained from fragments after registration
is depicted in Figure 38.

FUTURE WORK

The work discussed in this chapter opens up some


new directions of research which are ranging
from theoretical issues to the latest cutting edge

Figure 33. First fragment with its cleavage site

721
Complementary Part Detection and Reassembly of 3D Fragments

Figure 34. Second fragment with its cleavage site

Figure 35. Registration of fragments


for metallic objects and crystalline solids because
the roughness in case of metallic broken objects
is difficult to obtain.
Cleavage site detection algorithm is found
to reduce the area to be matched drastically by
identifying the portions where we should attempt
to fit complementary fragments. This algorithm
can further be enhanced by taking into account
the sharp edges found on cleavage site boundar-
ies. But it requires careful evaluation because all
fragments may not have sharp cleavage bound-
aries. Further, the algorithm for registration can
technologies. The problem studied is found to be be improved by taking into consideration the
very useful for archeologists who obtain broken boundaries of the obtained cleavage sites and ap-
fragments especially in case of potteries at the plying the 3D image correlation on it. This may
time of excavations. Automatic reassembly of further improve the accuracy of the registration
these fragments can be done through the algo- of the broken complementary pieces.
rithm proposed in this chapter. The algorithm is
found to be very effective for ceramic materials
and earthen-ware. However, it is not well suited

Figure 36. First fragment with its cleavage site

722
Complementary Part Detection and Reassembly of 3D Fragments

Figure 37. Second fragment with its cleavage site

Figure 38. Object after registration


reassembly of 3D thin walled objects. However, the
problem described in this chapter can efficiently
handle thick walled objects as well. In this work,
not only few existing algorithms for roughness
detection were enhanced, but the generated data
was consolidated and cleaned to make it suit-
able for entirely new application of cleavage site
detection. As such 3D fragment reassembly is a
complex problem, involving matching and regis-
tration of complementary surfaces. Cleavage site
detection algorithm reduces the area to be matched
drastically by identifying the portions where we
CONCLUSION
should attempt to fit complementary fragments.
To validate and optimize, the proposed rough-
The chapter has explored a problem for deter-
ness detection algorithm has been tested on more
mining the complementary part of a fragment of
than 50 scanned fragments. All the fragments are
an object and of reassembling them to form the
scanned using 3D laser LPX 600 Roland scanner.
object. It has proposed an efficient surface inspec-
The results have shown that the proposed algo-
tion algorithm which detects the corresponding
rithms are well suited for detecting cleavage sites
cleavage sites of fragments and registers them
in broken objects in a much accurate and efficient
so that the object can be formed from the given
manner. To detect the complementary parts, each
fragments. The algorithm captures differences in
of 50 scanned 3D objects has been broken into
the geometric structure in different parts of the 3D
two parts and the cleavage site of each of these
object represented by polygon meshes. These dif-
fragments is identified. Further, results obtained by
ferences are used to find most probable cleavage/
applying the proposed algorithm for complemen-
fracture sites. It is applicable to all triangulated
tary part detection well suited for identification
surface data even in presence of noise. The algo-
of complementary parts from the multiple broken
rithm is found to be very effective on objects of
objects in a much accurate and efficient manner.
ceramic material and archeological artifacts. The
After the detection of complementary parts, final
work already done in the field of reconstruction
registration of these complementary parts is done
of ceramic material or archeological artifacts is
in order to reassemble them at their cleavage sites.
through reassembly is either on 2D reassembly or

723
Complementary Part Detection and Reassembly of 3D Fragments

REFERENCES Lindstrom, P., & Turk, G. (2000). Image-driven


simplification. ACM Transactions on Graphics,
Arrault, J., Arnéodo, A., Davis, A., & Marshak, 19(3), 204–241. doi:10.1145/353981.353995
A. (1997). Wavelet based multifractal analysis of
rough surfaces: Application to cloud models and Masuda, T., & Yokoya, N. (1995). A robust method
satellite data. Physical Review Letters, 79(1), for registration and segmentation of multiple range
75–78. doi:10.1103/PhysRevLett.79.75 images. Computer Vision and Image Understand-
ing, 61(3), 295–307. doi:10.1006/cviu.1995.1024
Barequet, G., & Sharir, M. (1997). Partial
surface and volume matching in three dimen- Nicolas, A., Santa-Cruz, D., & Ebrahimi, T. (2002).
sions. IEEE Transactions on Pattern Analysis Mesh: Measuring error between surfaces using
and Machine Intelligence, 19(9), 929–948. the Hausdorff distance. In IEEE International
doi:10.1109/34.615444 Conference on Multimedia and Expo (ICME):
Vol. I (pp. 705–708). IEEE Press.
Besl, P. J., & McKay, N. D. (1992). A method for
registration of 3D shapes. IEEE Transactions on Rogowitz, B., & Rushmeier, H. (2001). Are image
Pattern Analysis and Machine Intelligence, 14(2), quality metrics adequate to evaluate the quality
239–256. doi:10.1109/34.121791 of geometric objects? In Rogowitz, B. E., &
Pappas, T. N. (Eds.), SPIE on Human Vision and
Boehler, W., & Marbs, A. (2002). 3D scanning Electronic Imaging VI (Vol. 4299, pp. 340–348).
instruments. In CIPA WG6 International Workshop SPIE. doi:10.1117/12.429504
on Scanning for Cultural Heritage Recording
(CIPA2002) (pp. 9-12). Scopigno, R., Cignoni, P., & Rocchini, C. (1998).
Metro: Measuring error on simplified surfaces.
Bolin, M. R., & Meyer, G. W. (1998). A perceptu- Computer Graphics Forum, 17(2), 167–174.
ally based adaptive sampling algorithm. In ACM doi:10.1111/1467-8659.00236
SIGGRAPH (pp. 299–309). ACM Press.
Terry, W. (1995). The challenge of 3D digitizing.
Corsini, M., Gelasca, E. D., & Ebrahimi, T. Computer Graphics World, 18(11), 21–25.
(2005). A multi-scale roughness metric for 3D
watermarking quality assessment. In Workshop Waechter, M., Riess, F., Schimmel, T., Wendt,
on Image Analysis for Multimedia Interactive U., & Peinke, J. (2004). Stochastic analysis of
Services, (pp. 13-15). different rough surfaces. The European Physi-
cal Journal B- Condensed Matter and Complex
Jafari, G. R. (2003). Stochastic analysis and regen- Systems, 41(2), 259-277.
eration of rough surfaces. Physical Review Letters,
91(22), 1–4. doi:10.1103/PhysRevLett.91.226101 Williams, N., Luebke, D., Cohen, J. D., Kelley,
M., & Schubert, B. (2003). Perceptually guided
Kaushik, V. D., Singh, A. K., Varshney, J., Pandey, simplification of lit, textured meshes. In Sympo-
P., Rao, K. P., & Pathak, V. K. (2007). Cleavage sium on Interactive 3D Graphics (pp. 113-121).
site detection in broken 3D objects. In Interna- ACM Press.
tional Conference on Advanced Computing and
Communications (ADCOM 2007) (pp. 339–344). Wu, J., Hu, S., Tai, C., & Sun, J. (2001). An ef-
fective feature-preserving mesh simplification
Levoy, M., Pulli, K., Curless, B., Rusinkiewicz, scheme based on face constriction. In 9th Pacific
S., Koller, D., & Pereira, L. … Fulk, D. (2000). Conference on Computer Graphics and Applica-
The digital Michelangelo project: 3D scanning of tions (pp. 12 - 21). IEEE Press.
large statues. In ACM SIGGRAPH (pp.131-144).
ACM Press.

724
Complementary Part Detection and Reassembly of 3D Fragments

KEY TERMS AND DEFINITIONS Fragment: Each broken piece of a 3D object


is known as fragment.
Cleavage Site: It consists of all rough facets Multi-Level Granularity: It is the roughness
in a fragment. having more than one level in a fragment.
Complementary Part Detection: The method Registration: Alignment of complimentary
of obtaining the corresponding fragments belong- parts of fragments.
ing to the same 3D object. Roughness Factor: A metric to measure the
Facet: If a surface is represented by triangular roughness of a surface of a 3D object.
meshes, then each triangle is it is termed as facet.

This work was previously published in 3-D Surface Geometry and Reconstruction: Developing Concepts and Applications,
edited by Umesh Chandra Pati, pp. 314-337, copyright 2012 by Information Science Reference (an imprint of IGI Global).

725
726

Chapter 44
Biometric Authentication
in Broadband Networks for
Location-Based Services
Stelios C. A. Thomopoulos
National Center of Scientific Research “Demokritos,” Greece

Nikolaos Argyreas
National Center of Scientific Research “Demokritos,” Greece

ABSTRACT
Broadband communication networks have begun to spread rapidly over fixed networks, with wireless
networks following at close distance. The excess capacity allows the offering of broadband services
at competitive rates. Location-based services (LBS) over wireless broadband networks are becoming
mainstream in an emerging ambient intelligence society. For LBS over broadband and, in particular,
peer-to-peer networks, such as ad hoc networks, unambiguous user authentication is of paramount im-
portance to user trust and safety, thus ultimately to the success of such service. Biometric authentication
is an approach to providing irrefutable identity verification of a user, thus providing the highest level
of security. This chapter addresses some of the issues associated with the use of biometric ID for user
and apparatus authentication over broadband wireless networks (e.g., GPRS, UMTS, WiFi, LANs) and
narrow band local networks (e.g., bluetooth, Zigbee, PANs, BANs).

INTRODUCTION enhanced GPS infrastructures, such as EGNOS,


Galileo …, with broadband wireless networks,
The spreading of broadband networks stimulates provide the necessary infrastructure for delivering
a wealth of Internet services over fixed and wire- high quality and versatile location-based services
less networks with stationary and mobile devices. (LBSs) ranging from travel information to enter-
Combining accurate location information from tainment, to crisis and incident management, to
services on demand, to health care and peer-to-peer
DOI: 10.4018/978-1-4666-2038-4.ch044 communications, to mention just a few.

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Biometric Authentication in Broadband Networks for Location-Based Services

In all these services, the common thread is the identification and verification include fingerprint
ability to unambiguously identify and authenticate identification, iris scan, face and voice recogni-
the mobile user and customer to the LBS provider. tion, signature recognition, hand geometry, and
Different LB services may have different authenti- combinations thereof (Reisman & Thomopoulos,
cation requirements. However, no matter what the 1998; Thomopoulos & Reisman, 1993).
application is all such services, the unambiguous The use of biometric ID imposes certain re-
authentication of the user is paramount to gaining strictions and technological challenges that need
the trust of the end user and thus achieving the to be addressed before biometric authentication
success of the services. Unambiguous user au- becomes widely used as an enabling technology
thentication is paramount to the parties involved for irrefutable user authentication in LBS and
in an LB service and the trust upon which the other broadband services.
service is built. If for example the LBS refers to
the provision of transport services on demand,
the ability to correctly identify and authenticate BIOMETRIC ID
both parties involved in the transaction, that is
the passenger (i.e., the user) and the driver (i.e., A. Requirements for
the service provider) build mutual trust and can Biometric ID Usage
be proved life-saving in the case of a car-jacking,
criminal activity, or fraud. Biometric ID is the mathematical encoding of
User identification and authentication can be certain bodily features that are considered unique
performed by a variety of means, ranging from a for each human being and differ enough from
simple alphanumerical password to a more secure person to person so that this difference can be
digital signature, to the ultimate in security bio- used safely enough to tell apart one person from
metric ID. Although a digital signature produced another. For example, in the case of fingerprints,
by an electronic device provides the convenience biometric features are the characteristic points
of a self-contained identification instrument, it that are formed from the endings or bifurcations
does not prevent fraudulent use of a user ID. of the finger ridges and/or the pattern of the ridges
Since there is no unique and inherited connection themselves. In the case of the iris scan, biometric
between the user and the digital ID, any holder features are the radial patterns of the iris. In the
of the electronic device that produces the digital case of the face, biometric features are the relative
ID can produce a fraudulent authentication. The location of the eyes, mouth, nose, and so forth.
only means to eliminate such possibility is the The mathematical encoding of the biometric
use of biometric ID. features constitutes what is referred to as biometric
Biometric ID is a digital signature generated “template” or biometric ID (Thomopoulos, Reis-
from the measurement of some bodily human man, & Papelis, 1996). No matter which biometric
characteristics that are unique, or different enough is used, the biometric features constitute unique
to be considered unique, from user to user. This and personal human characteristic and as such,
Biometric ID, encoded properly, constitutes a they are protected by the Personal Information
unique signature for each user that cannot repli- Protection Act (PIPA) (Personal Information
cated by an impostor. This biometric ID can be Protection Act, S.A. 2003, c. P-6.5). This protec-
used to meet the stringent requirements imposed tion imposes a number of issues, concerns, and
by LB services and the necessary trust required restrictions with the extraction (or capturing)
by users and operators of such services alike. of biometric features, their (electronic) stor-
Examples of biometrics commonly used for user age, encoding into a biometric template and the

727
Biometric Authentication in Broadband Networks for Location-Based Services

subsequent retrieval and use of this template for seemingly universal architecture involves the use
user authentication. In addition to privacy, other of a smart card as a memory device for storing
concerns with the use of biometric features relate the user biometric ID template and a local secure
to the medical information that may be contained processor for capturing the biometric information,
in and revealed by biometric features, the safety processing it to extract the biometric features, and
of the process and devices used in the extraction comparing it against the stored template. Alterna-
and verification process, and the protection of tive storage technologies include memory chips or
the user’s privacy (ANSI:The American National tokens. In this case, the biometrics data processor
Standards Institute, 2005; also ISO: International is part of the memory device, in a size that is not
Organization for Standardization). larger than a conventional memory stick. Using
A biometric “template” constitutes a non- either technology, the PICA requirements can
reversible mathematical transformation of the be met. The biometric template is always at the
extracted biometric features (Thomopoulos, Reis- possession of the user, any processing is done
man, & Papelis, 1996). In that respect, it cannot locally, and over links and processors that are
be used outside the specific device that is used to securely isolated from the WWW. However, in
extract it (and subsequently verify it) as a means to the process of enforcing PICA, one of the ma-
identify one’s identity without his or her consent, jor advantages of biometrics and biometric ID,
thus violating the PICA. However, it can be used namely the convenience not to carry any type of
indirectly through the use of the specific device ID other than one’s self, is being lost. This is the
and process used to extract it to coerce evidence trade-off between the convenience offered by the
about one’s identity without his or her consent. card-less biometric identification, and the restric-
In that regard, even the biometric template, albeit tions imposed by PICA that translate in the need
its mathematical irreversibility, constitutes private to carry and use a plastic card in order to store the
and personal data and as such it is mandatory that biometrics template in a user-controlled memory.
it is protected by the Private Information Protec- The cartoon displays in Figures 1 and 2 show
tion Act (PIPA, S.A. 2003, c. P-6.5). the typical registration and authentication pro-
The restrictions imposed by the PICA reflect on cesses for biometric ID. These processes have
the technological specifications for the processes been implemented in the VeriEasy biometrics
and devices used for biometric ID verification. access control system that was developed in the
Hence, the requirement that any biometric infor- context of the Bioathletics1 project. The project
mation, including the biometric template, consti- aims at implementing a distributed biometrics ID
tutes private data and requires that the biometric system to allow secure access to spectators, VIPs
data be stored in a memory device that is only and the press in athletic installations and events
accessible by the owner of this data, either in the using their biometric ID only.
form of raw biometric data or a processed bio-
metric template (Stapleton, 2003). Furthermore,
any retrieval of such information from the storage BIOMETRICS IN ATHLETICS
device (memory) must be done over secure and
cryptographically protected communication links, Athletic events represent an ideal field of appli-
and any subsequent processing of this informa- cation and showcase test beds for biometrics as
tion must be done by a secure processor. These they combine a usually large crowd, the need for
restrictions lead to almost uniquely defined archi- speedy but secure authentication, electronic tick-
tectures for biometric verification (The BioAPI™ eting, and overall security concerns by potential
Consortium, 2005). The most readily available and troublemakers. VeriEasy™ was developed in the

728
Biometric Authentication in Broadband Networks for Location-Based Services

Figure 1. Registration process of the PICA-compliant smart card based biometrics (fingerprint) VeriEasy
access control system for athletic events and installations

context of the funded research program Bioathlet- stallations reader in a smart card that stays with
ics to provide a unified BioAPI compliant platform the template’s owner at all times. This way the
for addressing the concerns of the athletic events PICA requirements are fulfilled in terms of both
industry in secure physical access to stadium for privacy and protection against eavesdropping,
athletes, spectators, and VIPs. In the VeriEasy™ since the template stays with his or her owner at
system, the biometrics (fingerprint) template is all times and processing is done locally over a
stored directly from the card Verification and trusted link. The user smart card serial number is
Access Process as implemented in the VeriEasy then linked to a back office legacy system data-
access control system for athletic events and in- base that contains demographic (non-private and

Figure 2. PICA-compliant smart card based biometrics (fingerprint)

729
Biometric Authentication in Broadband Networks for Location-Based Services

sensitive) information about the user and access data communication from the distributed access
rights and athletic event attendance information. points to the centralized back offices database or
During an athletic event, the user accesses databases that hold the athletic event schedules,
the stadium or the equivalent athletic installation ticket information, event schedules, athletic instal-
using his or her biometric ID smart card. The lations layout, access rights, and authorizations.
stored user’s biometric ID is compared against Finally, if the biometric authentication devices
his or her fingerprint live scan locally using the communicate with the local Access Rights Con-
same secure local processor and communication troller via a Bluetooth or zigbee wireless link,
link. If the authentication is positive, the system this wireless link must be secured as well. Both
is connected via a broadband connection to the bluetooth and zigbee protocols provide a security
back office athletic events database to acquire the layer but the level of security of these layers is
appropriate access permissions for the specific still not widely tested.
event the user is attempting to gain access to. If Hence, if a Bluetooth or zigbee wireless link
the user has purchased the appropriate ticket for is used in the biometrics ID system, it may be
the event or has access rights to it, an authenti- required to use encryption on top of their security
cation acknowledgment signal is returned to the layer until their security is extensively tested and
device to enable access to the athletic premise. proven.
Alternative, a positive acknowledgment message Figure 3 depicts the VeriEasy verification
can be displayed on a monitor, if the system oper- process as a generic secure verification process
ates with a human attendant. The verification and applicable to any biometrics ID authentication
access process for VeriEasy is shown in a cartoon system operating over any broadband communi-
way in Figure 2. cation network. Hence, the biometrics authenti-
Another restriction that is imposed on bio- cation system architecture of VeriEasy, and thus
metrics ID systems is the encryption of any bio- Bioathletics, provides: a trusted and secure means
metrics data. Encryption in the VeriEasy system of biometrics ID extraction and processing at the
is achieved using a private key that is encrypted local level, compliant with the PICA specifica-
in the smart card. Other encryption methods are tions for personal data protection and privacy;
possible and can be used in conjunction with the a globally secure network architecture for data
smart card encryption. Another link that must be exchange and information communication; and a
secured in the Verification and Access process locally secure wireless network for the intercon-
is the physical electric connection from the door nection and networking of devices and signaling.
controller to the electric door latch. Any biometrics authentication system since
This link can be secured by using an encryption- the biometrics field is continually evolving and
decryption hardware pair to encrypt the electric no standard has been reached yet.
signal that commands the opening and closing of The Bioathletics paradigm can be general-
the gate. This way the entire local biometric au- ized to other broadband applications that require
thentication-access process is secured. It remains, biometrics ID authentication. A number of useful
however, to secure all long haul communication conclusions drawn from the Bioathletics project
network links. are summarized in Table 1. These conclusions are
In order to make the biometrics ID registra- relevant and applicable to the implementation of
tion-verification-access process entirely secure any PICA-compliant biometrics ID authentication
globally, all long hauls fixed or wireless commu- system that may be required for the unambiguous
nication network links must be secured as well. and trusted user authentication in broadband ap-
One way to achieve this is use VPN tunneling for plications, with LBS being one of them.

730
Biometric Authentication in Broadband Networks for Location-Based Services

Figure 3. Securing all communication links, fixed and wireless, either via a VPN, bluetooth, or zibgee,
the security layer provides a globally secure and trusted, PICA-compliant smart card based biometrics
(fingerprint) authentication for any broadband application with emphasis to LBS applications

In addition to the security requirements im- late all manufacturer dependent biometric and
posed on a biometrics ID authentication system, algorithm processes in callable DLLs. In this way,
the system must also be Bio-API compliant. the core structure of the biometrics ID authentica-
Compliance with the Bio-API allows to encapsu- tion system remains fixed, whereas new biomet-

Table 1. Recommendations for the implementation of a biometrics ID authentication system

PICA
Process Device Communication Link Security level PICA requirements
compliant
Biometrics feature Biometrics reader Local and secure. Data Secure and Personal data pro-
extraction and with local processor encryption if necessary trusted tection Yes
template creation
Biometrics tem- Local with the Directly in a smart card or Private and secure Privacy and protec-
plate storage user: Smart Card or memory stick tion of personal data
Memory Stick with Yes
or w/out processor
and built-in memory
Biometrics ID Biometrics reader Local. Encryption of data Secure and Privacy and protec-
authentication with local proces- may be required from smart trusted tion of personal data
Yes
sor & Local Card or card or memory stick to local
Memory Stick reader processor.
Access of back Secure Server VPN with or without ad- Secure N/A unless it carries
office permission ditional data encryption private data info – to N/A
rights db be avoided
Local devices and Biometric readers, Wireless local links: blue- bluetooth or Secure personal data
command signal- controllers, actuators tooth or zigbee zigbee Security transfer if inevitable
ing via wireless protocol with or Yes
links w/out additional
data encryption

731
Biometric Authentication in Broadband Networks for Location-Based Services

ric feature extraction, encoding, and verification compliant applications and biometric devices is
modules can be easily integrated in the system speculated to significantly increase in the next two
with minor modifications. This modularity is years (The BioAPI Consortium; and The BioAPI
extremely important as it allows adaptation to Consortium: BioAPI™ Specification version 1.1).
new biometrics as time progresses. This adapta- A BioAPI™ compliant application consists
tion is keen to the success of any biometrics au- of at least two different modules, the application
thentication system since the biometrics field is module, which could be seen as the “top” level
continually evolving and standards have yet to be application (Level H) and the BSP (Biometric
defined. Service Provider) module, which could be seen
as the “bottom” level application (Level L).
BioAPI™ and BSP Implementation The application module would be the biometric
application and the BSP module would be the
The BioAPI™ Specification version 1.1 0 is an service provider module, which should control
ANSI 0 (INCITS 358-2002) standard and cur- the biometric hardware. The BSP could be seen
rently under ISO standardization process (ISO/IEC as a black box from the side of the application
JTC1/SC37) specifying an Application Program- developer, since the technical details of a given
ming Interface (API), which was introduced to device will not concern him.
facilitate the implementation of Biometric applica- Access to the biometric mechanisms is through
tions (The BioAPI Consortium; and The BioAPI a set of standard interfaces defined by BioAPI™.
Consortium: BioAPI™ Specification version The approach taken by the BioAPI™ is to hide,
1.1). BioAPI is a standard biometric technology to the degree possible, the unique aspects of in-
interface, which is intended to provide a high-level dividual biometric technologies, and particular
generic biometric authentication model; one suited vendor implementations, products, and devices,
for any form of biometric technology. BioAPI while providing a high-level abstraction that can
supports, among other lower level functions, the be used within a number of potential software
basic functions of Enrollment, Verification, and applications.
Identification, and includes a database interface One of the most important terms of the Bio-
to allow a biometric service provider (BSP) to API™ standard is the BIR. The term Biometric
manage the Identification population for opti- Identification Record (BIR) refers to any biometric
mum performance in an application independent data that is returned to the application; including
manner. There is a reference implementation raw data, intermediate data, and processed sample
for Microsoft® Windows® (early alpha) publicly ready for verification or identification, as well as
available by the BioAPI™ Consortium 0, as well enrollment data.
as two internally released beta implementations; Theoretically, BSPs that are supplied by
one for Java (under Windows®, via JNI) by (Gens vendors and conform to the BioAPI™ interface
Software Ltd); and one for Unix™ / Linux™ by specification, can be used by any application
SAFLINK. There are several commercial imple- developed using the BioAPI™ framework.
mentations to date—mainly by members of the The BioAPI™ Consortium claims, among
BioAPI™ Consortium—either in the form of other things, that the BioAPI™ will enable rapid
BioAPI™ framework SDK or in the form of BSP development of application, flexible deployment
Biometric Service Provider (BSP) SDK or specific between platforms, and simple application inter-
BSP implementations for the vendors’ Biometric faces modularity of biometric applications.
Devices. However, market acceptance is still For an application to be compliant with the
immature and market penetration of BioAPI™ BioAPI™ specification, a software application

732
Biometric Authentication in Broadband Networks for Location-Based Services

must perform that operation consistent with the a. A new device is plugged in the system.
BioAPI™ specification for each BioAPI™ func- b. An existing device is unplugged from the
tion call made. That is, all input parameters must system.
be present and valid. There is no minimum set of c. An existing device doesn’t work properly.
functions that must be called. d. A biometric source is presented to one of
For a BSP to claim compliance to the BioAPI™ the biometric devices in the system, or is
specification, it must implement mandatory func- removed from it.
tions for their category. They are categorized as
either a Verification or Identification BSP. BSPs The application can command the BSP with
must accept all valid input parameters and return commands such as BioAPI_Enroll(…) or BioA-
valid outputs. Additionally, they must provide all PI_Verify(…). The idea of BioAPI™ includes the
required module registry entries. Entries to the idea that any attached device should be handled
module registry must be performed upon BSP from the BSP so that different devices can be
installation. Biometric data generated by the used if they provide a suitable BSP. However,
BSP must conform to the data structures defined as existing biometric devices may not be fully
in BioAPI™ Specification. A BioAPI compliant compliant to BioAPI™, there might be cases
BSP must support all Module Management and that it may be deemed necessary to command the
Handle operations. device to perform something that is not foreseen
When an application is connected to a given by the BioAPI™ without violating the BioAPI™
BSP, the process initializes the BioAPI™ Frame- standard. Otherwise, the application will no longer
work, loads the BSP desired, and then attaches be BioAPI™ compliant, since no other de vice
a module of the given BSP to the application. is expected to work with the same command and
Once the application is connected to a BSP, it the idea of the BSP module as a black box is lost.
can perform BioAPI™ calls for several biometric Another problem exists with the data exchange
operations as Capture, Enrollment, Verification, between the application and the BSP. The API gives
and Identification. the application the right to use a payload as input
The BioAPI™ concept is very interesting, but on Enrollment and Create Template, or as output
some things in the implementation may get con- on Verification and Verify Match (Identification).
fusing. For instance, data exchange between the In any other case the API does not provide the
BSP and the application can be an issue. There is application the right to exchange data with the
no asynchronous way to pass data from the BSP BSP. Two questions are raised then:
to the application. Also when it comes to com-
munication of the application and the BSP, the a. What happens in the case that the application
rules of communication can be quite restraining. needs to send data to the BSP asynchronously
In the given implementation of BioAPI™, and vise-versa?
when it comes to the BSP wanting to notify the b. What should the payload data be, if the data
application that some event has occurred, the API that is exchanged through payload are not
defines that the notifications that can be made are specified by the API?
“on insertion” or “on removal” of a biometric
device, “on fault” of a device and “on presence” The data exchange through payload might be
or “on removal” of a biometric source (e.g., pres- anything, but since the API does not define what
ence of a finger on a fingerprint device). In other this data should be, then it is almost impossible
words an application can only be notified if: to find two different BSPs passing the same data
through payload and on the same form. This

733
Biometric Authentication in Broadband Networks for Location-Based Services

means that the application is highly impossible MULTIFUNCTIONAL BSPs


to be able to function with the BSP of a different
device, thus a need for BSPs for devices from Using more than one biometric technology may
different manufacturers and vendors. be a highly unprofitable venture, because of the
cost of different kinds of devices coming from
Expanding the Capabilities different hardware distributors. But even more, the
of VeriEasy™ maintenance of such a system could be proved a
constant headache for our administration person-
In many applications, in particular in high security/ nel (different contracts of maintenance). It is here
high authentication fidelity cases, more than one where VeriEasy™ comes to provide the winning
biometrics technology is required to exist and solution--the multifunctional BSP. That means
operate under different authentication scenarios that the same device can operate in many different
in the same environment. Such a heterogeneous ways so it is up to us to choose the proper way for
biometric devices network may consist of fin- the specific device, whenever we want to.
gerprint scanners, voice recognition devices, Hence, setting up the most complex biometric
and face and iris scanners, all working in various devices network to meet the security needs of an
combinations for ID verification under various organization is only one click away! Moreover, if
authentication scenarios. Using the VeriEasy™ for security concerns changes are required in the
client-server architecture, we can build BSPs security network design at run time (!), VeriEasy™
for each type of biometrics device we intend allows on the fly reconfiguration of the security
to use in the security network and control them network design without service interruption!
via a single application! This way the highest
security standards are achieved without losing Biometrics Authentication
any flexibility. For example, in a heterogeneous in Broadband Networks and
biometrics network, the Server BSP can grab the Location-Based Services
proper fingerprint template from the smartcard or
request to the application for the right one when a Advances in broadband, and in particular wire-
centralized Template Database is in place. less broadband networks, enable the provision
The obvious advantage of this architecture is of personalized multimedia services, as well as
that it defines a middle layer between the applica- location-based services (LBS’s). Those services
tion and the native BSPs, so that the native BSP extend from electronic maps and navigation, to
programmer needs to know about his device and location sensitive tourist guidance, transportation,
nothing more, the middle layer programmer (if car-pooling, ad hoc networking, and ubiquitous
any) needs to know about the biometric network computing, to incident management, emergency
and nothing more, and the application programmer services, to make reference to just a few. In all
needs to know nothing at all about biometrics! LB services, the sensitive information is the user’s
Furthermore we can use plug and play BSPs location and the user’s identity. Both these two
(e.g., have a BSP installed, as a client one), and pieces of information must be protected by the
let the Server BSP to reconfigure itself in order provided service or infrastructure the service is
to handle the new device. running on, and authenticated when necessary
by means directly controlled by the user. Simply
imagine how critical it is to authenticate the user’s
ID unambiguously when the LBS is a car pooling
service provided over an ad hoc (peer-to-peer)

734
Biometric Authentication in Broadband Networks for Location-Based Services

network. In such an application where verification Personal Information Protection Act, S.A. 2003, c.
of the true identity of the user may be a life and P-6.5. (n.d.). Retrieved from http://www.psp.gov.
death situation, it becomes imperative to have a ab.ca/index.cfm?page=legislation/act/index.html
bullet-proof biometrics authentication system to
Reisman, J. G., & Thomopoulos, S. C. A. (1998).
provide unambiguous user ID verification.
Data fusion architecture for automated finger-
Such a system should protect the user ID from
print identification for very large databases. In
theft, forgery, and misuse.
Proceedings SPIE (Vol. SPIE-3374). SAFLINK
Corporation. (n.d.). Retrieved from http://www.
saflink.com
CONCLUSION
Stapleton, J. (2003, June 23-26). KPMG, state of
The architecture and design of a distributed bio- biometric standards. Presentation at the Biomet-
metric ID verification system has been presented. ricTech Conference, New York.
The VeriEasy™ system provides protection
The BioAPI™ Consortium. (2005). Retrieved
of the user privacy and ID, compliant with the
from http://www.bioapi.org
sensitive personal data protection guidelines and
legislations, and is BioAPI compliant using BSP The BioAPI™ Consortium: BioAPI™ Specifica-
compliant interfaces. tion version 1.1. (n.d.). Retrieved from http://
www.bioapi.org
Thomopoulos, S. C. A., & Reisman, J. G. (1993).
ACKNOWLEDGMENT
Fusion-based, high volume Automatic Fingerprint
Identification System (AFIS). In Proceedings of
The research for the development of a BioAPI™/
SPIE 93, Innsbruck, Austria (Vol. SPIE-2093).
BSP compliant biometrics authentication system
for ID verification in athletics events have been Thomopoulos, S. C. A., Reisman, J. G., & Papelis,
sponsored by the General Secretariat of Research Y. E. (1996). Ver-i-Fus: An integrated access con-
& Technology, of the Greek Ministry of Devel- trol and information monitoring and management
opment, under Contract No. ΣΠ/ΑΘ/17+32, Op- system. Proceedings of the Society for Photo-
erational Program “Competitiveness” Oct. 2003. Instrumentation Engineers, SPIE-2934, 191–200.

REFERENCES ENDNOTES

ANSI. American National Standards Institute. 1


The project “Bioathletics” has been funded
Retrieved from http://www.ansi.org by the General Secretariat of Research &
Technology (GSRT) of the Greek Ministry of
Gens Software Ltd. (n.d.). Retrieved from http://
Development under the Contract AΘ 17+32
www.gensoft.com
/ 2-10-2003.
ISO. International Organization for Standardiza-
tion. Retrieved from http://www.iso.ch

This work was previously published in Secure E-Government Web Services, edited by A. Mitrakas, P. Hengeveld, D. Polemi,
and J. Gamper, pp. 84-96, copyright 2007 by IGI Publishing (an imprint of IGI Global).

735
736

Chapter 45
Multi-Criteria Decision
Analysis for Identifying
a Suitable Location for
Groundwater Pumping Wells
D. Mimoun
Ecole Nationale Supérieure des Mines, France

S. Gaur
Ecole Nationale Supérieure des Mines, France

D. Graillot
Ecole Nationale Supérieure des Mines, France

ABSTRACT
The paper presents the methodology for the combined use of GIS-based multi-criteria analysis and
simulation-optimisation modelling for management of the groundwater resources of the Dore river basin
in France. The study identifies the suitable location and maximum discharge for the new groundwater
pumping wells. The multi-criteria analysis (MCA), with the help of GIS-based geospatial analysis,
was performed to identify those areas suitable for pumping wells by considering different criteria,
such as hydraulic conductivity, land use, river aquifer exchange, depth to water, and geomorphology.
Different criteria were selected with the help of regional experts and stakeholders. For the study area,
the groundwater flow model was developed. Further, new pumping wells in the suitable zones, those
identified by MCA, were considered and a simulation-optimisation technique was used to identify the
maximum discharge from those wells. Finally, the results obtained from both the methods were to finalise
the potential zone. The developed methodology proves to be a more realistic approach to identifying
new locations for pumping wells.

DOI: 10.4018/978-1-4666-2038-4.ch045

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Multi-Criteria Decision Analysis for Identifying a Suitable Location for Groundwater Pumping Wells

1. INTRODUCTION (2007), Lin and Kao (1998), Allen et al. (2002),


and Kontos et al. (2005). In a preliminary, GIS was
The demand for water at the industrial, agricultural normally involved for employing a set of criteria
and domestic levels is continuously increasing and, in order to catalogue an area into defined classes
correspondingly, limited fresh water resources are by creating buffer zones around the geographic
shrinking. The European Union has established a features to be protected. All map layers were then
community framework for water protection and intersected so that the resulting composite map
management. In this framework, member states contains two suitable and unsuitable areas. As per
must analyse the characteristics of each river basin the literature review, the authors did not find the
district, the impact of human activity on water study on the combined use of the MCA approach
resources and the economic analysis of water with the genetic algorithm model in conjunction
use. The management plans of recommendations with groundwater model. Moreover, the present
should be executed in order to protect and enhance work augments the simulation-optimisation ap-
groundwater resources, protect them from pollu- proach with the use of MCA method coupled with
tion and ensure a balance between installing new geographical information system.
pumping wells and designing protected areas. The In this paper, first the detail of the study area
above-mentioned objectives must be achieved no is presented. Then, the development of methodol-
later than 2015. ogy is explained with a detailed explanation of
Selection of a suitable location for the devel- multi-criteria analysis, groundwater simulation-
opment of new pumping wells is a challenging optimisation modelling, objective function and
task when taking of pollution and potentiality of constraints. Finally, the results and conclusion
the area into consideration. With the advance- of the study are presented.
ment of GIS technology, a multi-criteria analysis
method is becoming a powerful tool for decision
makers to analyse and solve the problems which 2. METHODOLOGY
are based on multiple criteria. MCA methods are
used to assist decision makers in either ranking a 2.1. Location of Groundwater
known set of alternatives for a problem or mak- Extraction Wells
ing a choice among this set while considering
the conflicting criteria (Sumathi, 2007; Keeney Figure 1 shows the flow chart for the proposed
& Raiffa, 1976; Zeleny, 1982; Figueira et al., methodology for the evaluation of a suitable
2005).The integration of MCA techniques with location for groundwater pumping. In the first
GIS has significantly advanced the map overlay step, the Intrinsic Potentiality Index (IPI) was
analysis for site suitability analysis (Carver, 1991; calculated using five parameters that control the
Banai, 1993 R. Banai, Fuzziness in geographic groundwater potentiality, i.e., depth to water,
information systems: contributions from the ana- hydraulic conductivity, aquifer river exchange,
lytic hierarchy process, International Journal of geomorphology and land use. The name of the
Geographical Information Systems 7 (1993), pp. criteria and their weight and rating are summarised
315–329. Full Text via CrossRef | View Record in Table 1. Two exclusion criteria were also con-
in Scopus | Cited By in Scopus (41)Eastman, sidered to keep the suitable location away from
1997; Malczewski, 1999). An integrated approach a potential polluted area. These criteria include
incorporating the application of GIS and MCA the distance from the treatment site and the vi-
methods have been employed for the suitability cinity to an industrial centre. Various buffers to
analysis of landfills in an urban matrix by Sumathi these areas were developed and used as a mask

737
Multi-Criteria Decision Analysis for Identifying a Suitable Location for Groundwater Pumping Wells

Figure 1. Methodological scheme developed for identify the suitable location for groundwater pumping
wells

layer. All described parameters were developed identified in the first step. The groundwater flow
in the raster format with a 20 m grid resolution model was run by considering the 2 and 3 pump-
and map algebra was performed using ArcMap ing wells in the zones and the maximum dis-
GIS (ESRI, 99). charges from those wells were computed. The
Finally, simulation-optimisation modelling MGO module of Visual MODFLOW 4.2 were
was performed to identify the maximum discharge used to perform the optimisation. The most suit-
from the given number of wells at the locations able location was identified with the help of results

Table 1. Patterns and grading values of no-exclusionary criteria used to evaluate relative suitability of
areas selected by GIS analysis

Criterion Pattern of Rat- Grading Value


ing Curve
Hydraulic Geometric 0% = 0.06*10-3 m/s; 10% = 0.1*10-3 m/s; 20%=0.5*10-3 m/s; 50%=10-3 m/s; 70%=3*10-3
Conductivity Geometric m/s; 90%=6*10-3 m/s; 100%=8*10-3 m/s
Geomorphology Geometric 0%= outside the zone of recent lateral migration of a meander; 40%=inside the zone of
River Aquifer ex- -- historic lateral migration of a meander; 60%=in the abandoned meander; 100%= inside the
change Linear abandoned meander and near the growing meander
Land Use 0%=low channel dynamics in bed incised river; 20%=low channel dynamics in bed and
Depth to Water riprap bank of the river; 50%=low channel dynamics in bed river; 70%=low channel dy-
namics in bed river and zone of erosion; 80%=moderate channel dynamics; 100%=moder-
ate channel dynamics and zone of erosion
100%=Forest and semi-natural zones; 50%=Aquatic area (swamps) or Pastures and agro-
forest zones; 30%=Discontinuous urban zones or Permanent cultures; 25%=Artificial
covered zones, green zones, continuous urban zones; 20%=gravel pit; 10%=Irrigated and
non irrigated annual culture ; 0%=Industrial discharge, landfill
100%= >9m; 90%=7.5-9m; 80%=6-7.5m; 70%=4.5-6m; 40%=3-4.5m; 10%=1.5-3m;
0%=<1.5m

738
Multi-Criteria Decision Analysis for Identifying a Suitable Location for Groundwater Pumping Wells

achieved by multi-criteria analysis and ground- and efficiently be used to obtain globally (or, at
water modelling. least near globally) optimal solutions to these
groundwater management problems. Mantoglou
2.2. Simulation-Optimisation in and Papantoniou (2008) developed a method
Groundwater Management for the optimal design of pumping networks in
coastal aquifers using genetic algorithms. Two
Groundwater management problems are typically models were developed: first, for the simultane-
solved by researchers through the simulation- ous optimisation of wells’ location and pumping
optimisation approach. During the last two de- rates using GA; the second model followed two
cades, several studies have been carried out to stages where the first stage was concerned with
address groundwater management problems using the optimisation of wells’ locations and the sec-
simulation-optimisation approaches (Lefkoff & ond stage was concerned with the optimisation
Gorelick, 1986; Ahlfeld et al., 1986; Zheng & of pumping rates. The last method used GA for
Wang, 2002; Shamir et al., 1984; Willis & Finney, searching the well positions and at each generation
1988; Finney & Samsuhadi, 1992; Emch & Yeh, of GA, a simpler optimisation with respect to the
1998). Extensive literature reviews on this topic pumping rates was performed using non-linear
have been done by Gorelick (1983), Das and programming. Finally, the model was applied
Datta (2001), Willis and Yeh (1987), and Qin et effectively to determine the optimal design of a
al. (2009). In recent years, applications of global pumping network and assess pumping rates in a
search methods have been growing and these coastal aquifer.
methods have been applied to solve different kinds
of groundwater quantity and quality management 3. STUDY AREA
problems. The global search methods are efficient
at finding a global solution to the problem. The The model has been applied to fulfill the water
constraints of the global optimisation method demand of the city of Thiers. This is one of the
can be incorporated into the formulation and do major cities in the Loire region of France. The total
not require derivatives with respect to decision population of Thiers is 20,000 and the population
variables as in non-linear programming (McKen- in the vicinity of the region is 23,500. Due to an
ney & Lin, 1994). A genetic algorithm has been increasing population and a decreasing quality of
applied by Wang and Zheng (1998), Goldberg surface water, groundwater is the best alternative
(1989), Wang (1991), Rogers and Dowla (1992), to satisfy the area’s demand for water. Therefore,
and McKinney and Lin (1993, 1994). Ritzel et al. water authority officials (municipality, water agen-
(1994), Cieniawski et al. (1995), and Johnson and cies) are trying to identify potential locations for
Rogers (1995) used global search methods in their new pumping wells with the required discharge.
studies and found that these global search methods The Dore river catchment, which is situated in
are more efficient and robust in comparison to the eastern part of the Massif-Central in France
other methods like linear, non-linear and mixed (Figure 2), has been considered for establishing
integer. McKenney and Lin (1994) developed the pumping wells.
genetic algorithms (GA) based on groundwater The catchment has a moderate climate with
simulation models to solve three groundwater four seasons. The low flow period in the river
management problems: (1) maximum pumping occurs in summer but can be extended up to No-
from an aquifer, (2) minimum cost water supply vember. The average annual rainfall recorded is
development and (3) minimum cost aquifer reme- 780 mm. The Dore River is an important tributary
diation. They found that the GA could effectively of the Allier River. The average annual flow of

739
Multi-Criteria Decision Analysis for Identifying a Suitable Location for Groundwater Pumping Wells

Figure 2. Location of Dore basin in Massif Central (France)

the river Dore is 20.2 m3 s-1. In the high flow of different hydrological features and other re-
period, the average flow is 50 m3 s-1 and de- quired data was extracted from the geological
creases to 5 m3 s-1 in the dry period. The study maps 1/50 000 provided by BRGM (National
area exhibits solid geology where bedrocks are Service for Geological Survey). A total of 12
exposed on the surface in the form of structural piezometric measurements are available in the
hills. Fluvial quaternary sediments underlain by study area, which are used to calibrate the ground-
marls and clay cover the major part of the area. water flow model. The water level in the river
The quaternary alluvium is composed of gravels, was observed at 11 different locations and was
sand and pebbles with silt. The impervious sub- used to develop the model.
stratum is composed of clay and sand (Oligocene
period). Hydraulic conductivity varies from 1×10-3
to 8×10-3 m s-1 in different parts of the aquifer. 4. MULTI-CRITERIA ANALYSIS
The thickness of the aquifer varies from 12 to 15
m. Elevation of the impervious bottom layer of The suitable location map for the groundwater
aquifer varies from 240 to 258 m, which is mea- pumping wells was produced using a modified
sured from mean sea level (MSL). The location version of the SINTACS model, applied in a GIS

740
Multi-Criteria Decision Analysis for Identifying a Suitable Location for Groundwater Pumping Wells

environment. It is an adaptation of the DRASTIC flow: drainage of water during low flow periods
method for the hydrogeological, climatic and and recharge during high flow periods. Therefore,
impact settings typical for the Italian territory it is necessary to take into account the water ex-
and the Mediterranean basin. This methodology change between the existing water table and the
includes five non-exclusion criteria i.e., hydraulic river in the evaluation of a suitable location for
conductivity of the aquifer, geomorphology, river groundwater pumping wells. These exchanges
aquifer exchange, land use and depth of water table. are governed by the nature and condition of the
Recharge by precipitation and topography were banks of streams and by fluvial dynamics. The
suppressed since they were considered uniform in phenomenon of river bed incision limits the water
the study area. The weighting scheme presented exchange.
here is based on local expertise and a preliminary The Land use map and the depth to water
sensitivity analysis was performed. table are a sensitive parameter that influences the
The Intrinsic Potentiality concept is used, vulnerability of groundwater, since it provides a
which is related to two important controlling greater chance for natural attenuation to occur. As
features that determine the accessibility of ground- the study area has a dense agriculture activity, a
water to accommodate the pumping wells, i.e., shallow groundwater depth can increase the risk
aquifer properties evaluated at the thickness of of contamination propagation to the water table.
the saturated layer and protection against pollut- The Land use map was developed on the basis
ants. The first attribute is related to the hydraulic of CORINE Land Cover. The classification con-
conductivity, the river-aquifer exchange and the siders the nature and the property of top surface.
geomorphology. In assessing groundwater poten- Other factors such as aspect, elevation and soil
tiality based on an index approach, the physical type were not considered. A total of 7 land use
and conceptual mechanisms take place along the classes were considered, according to the nature
attempt made to identify the groundwater poten- of their top surface. The land use factor, ranging
tial zones. Therefore, the dependent parameters from 0 to 100, is assigned to each class of land.
for this method (Table 1) were carefully chosen. Therefore, the dimension values vary from 0 to
For example, hydraulic conductivity is the 100 to represent the most favourable to the least
most important hydrogeological feature, since it favourable class. The weights were assigned to
provides a great chance to extract groundwater the classes by the SINTACS method. SINTACS
through pumping wells. A high value of hydrau- is an intrinsic vertical vulnerability method
lic conductivity helps extract a higher amount developed by Civita (2001). Data related to the
of discharge through pumping wells, whereas a depth of the water table was obtained from data
lower hydraulic conductivity value reduces the on 180 borehole logs. The use of the geostatistical
amount of discharge. Available pumping test data method, implemented in ARC Info, was neces-
at several locations in the study area were accused sary to characterise the spatial structure of these
and interpolated to calculate the hydraulic con- regionalised variables.
ductivity value on each cell of the grid. Hydraulic Exclusion criteria were defined by creating
conductivity data was obtained from five aquifer the buffer zone around the potential contaminant
pumping test databases. Geomorphology takes into location. Further, these buffer zones are used to
account the evolution of the river. Former chan- prepare the mask layer. Buffer zone values for
nels make aquifers; therefore, it represents areas different area are as follows: domestic waste treat-
of high aquifer potential. River aquifer exchange ment site = 300 m buffer zone; industrial centre =
is also an important hydrologic feature. The Dore 500 m buffer zone and hazardous waste landfill
River plays an important role in the groundwater = 500 m buffer zone.

741
Multi-Criteria Decision Analysis for Identifying a Suitable Location for Groundwater Pumping Wells

4.1. Aggregation Analysis In the procedure of multi-criteria evaluation, it


was necessary for the assigned factor weights to
The Analytic Hierarchy Process (AHP) method is add up to 1. In the study, weight was defined as a
most suitable for the establishment of a platform value assigned to an evaluation, which indicates
organisation and covers the topics necessary to the importance of one criterion related to another.
identify sites suitable for the location of wells. In order to perform a pair-wise comparison, the
This method allows for data to be gathered and relative importance of the row variable to its
organised in a single model considering the system corresponding column variable was considered
as a whole, thereby establishing levels of priority on the basis of following values:
between the criteria. The multi-criteria decision
analysis methods ELECTRE (ELimination and 9: the criterion is extremely more important; 7: the
Choice Expressing REality), also widely used criterion is very strongly more important; 5: the
for solving problems with respect to decision criterion is strongly important; 3: the criterion is
support environment (ROY 85), does not apply moderately more important; 1: equal importance
to this problem. This does not choose the best site of two criteria; 1/3: the criterion is moderately
from a given set of sites, but rather identifies all less important; 1/5: the criterion is strongly less
potential sites. important; 1/7: the criterion is very strongly less
Appropriate weights have been calculated by important; 1/9: the criterion is extremely less
AHP method (Satty, 1980) where pair-wise com- important.
parisons are made between the criteria, acSaaty,
T.L., 1980. The Analytic Hierarchy Process, The result of this pair-wise comparison for 5
McGraw-Hill, New York. Saaty, T.L., 1980. The non-exclusion criteria is shown in Table 2. The
Analytic Hierarchy Process, McGraw-Hill, New calculated coherence ratio was 9.14%, which was
York. according to preference and importance within the permissible limit, i.e., 10%. The rat-
of the considered criteria. For every new factor, ing curve was developed for each criteria, using
weights were assigned according to the importance a series of arbitrary relationships between the
of each factor. A pair-wise comparison was ap- quantity of the criterion and a suitability score
plied in order to determine the relative importance (0 -100). The mathematical formulation for the
of each new criterion. To make the processes of assignment of the five suitability scores required
assigning factor weights more realistic, criteria integration of the weights and scores. This ag-
considered one at a time were carried out by differ- gregation consisted of multiplying the weight and
ent experts working on the project. The technique score obtained from the rating curves and then
of pair-wise comparison was developed by Saaty adding up the product to calculate the suitability
(1977) in the context of decision-making process score SI so that:
known as the AHP. AHP is characterised by two
fundamental aspects: hierarchical decomposition 5
SI = ∑Wi X i (1)
of the problem of comparison of complex objects i =1
and estimation of utility functions values from
paired comparisons. In this case, the hierarchy
where Wi is the weight of the considered criteria
consisted of two levels (objects and criteria).
calculated by pair-wise comparison and Xi is the
A pair-wise comparison was done in order to
normalised score obtained from rating curves. SI
determine the relative importance of each cri-
designates the suitability score of the suitable
terion. Weights were evaluated with the help of
location for groundwater discharge. Table 2 shows
stakeholder and experts working on the project.

742
Multi-Criteria Decision Analysis for Identifying a Suitable Location for Groundwater Pumping Wells

Table 2. Relative importance of non-exclusionary


the relative importance of non-exclusion criteria
criteria based on the 5-point rating system. These
based on the 5-point rating system. Figure 3 il-
value obtained based on a pair-wise comparison
lustrates the location of suitable zones for ground-
of all criteria using the weighted linear combina-
water pumping after considering the different
tion described by Saaty
parameters explained above. Potentialities of
Criterion Weight suitable zones are defined in four ranges, i.e., low,
Hydraulic conductivity of the 0.463
moderate, high and very high.
aquifer 0.256
Geomorphology 0.195
River aquifer exchange 0.043
Land use 0.041 5. GROUNDWATER MODELLING
Depth to water

A regional groundwater flow model of the Dore


basin, constructed and calibrated to steady-state

Figure 3. Suitable location map for groundwater pumping wells

743
Multi-Criteria Decision Analysis for Identifying a Suitable Location for Groundwater Pumping Wells

time-averaged conditions for the period 1976, the simulated and the observed data. The median
developed in a previous study (CETE, 1976), was values of hydraulic conductivity range from 0.003
used in this study. Groundwater modelling was m/s for the older sediment to 0.008 m/s for the
performed using the “modular three dimensional actual sediment. Many assumptions are neces-
finite-difference groundwater flow model” MOD- sary to simplify a real hydrogeological system
FLOW developed by The United States Geological represented by a mathematical model. A relatively
Survey (McDonald & Harbaugh, 1988). The active simple modelling concept is developed on the
cells of the model grid cover 60 km² area of the basis of two types of external model boundaries:
Dore basin. The y-axis of the model is oriented to physical and artificial. The contact between the
the north to align with the predominant direction alluvial aquifer and the Dore River is simulated
of groundwater flow. The finite-difference model by a head-dependent flux boundary. The rate of
of the basin was constructed by dividing the al- flow is calculated from the difference of hydraulic
luvial plain into tree models. Each model was head between the boundary cell and the adjacent
subdivided by a rectilinear grid, which consists aquifer cells (McDonald & Harbaugh, 1988). The
of 2,400 active cells that have a uniform grid-cell lateral boundary at the eastern and western parts
spacing of 50 m (a cell area of 2,500 m²) and a of the model between the alluvial and the first
variable thickness. Grid size was taken as uniform terrace was represented by equipotential lines
as available data are evenly distributed over the with known hydraulic head. These boundaries
model domain and each part of the domain was were chosen far enough from the area of interest
considered as equal interest. to avoid the influence on the future flow pattern
created by a proposed suitable location of ground-
5.1. Hydrological Units water pumping wells.
Some assumptions were made to design a
The overviews of the geology and hydrogeology groundwater model that can limit the scope of
of the Dore basin, defined by Mimoun (2009), application of the model and the hydrological
was used in this model. The groundwater flow questions that can reasonably be addressed. The
system of the basin, the thickness of the aquifer, model uses a steady-state simulation for the dry
including its boundary, hydraulic characteristics seasons, including climate, land use and water
and components of recharge and discharge were use. As the model was not calibrated for transient
also described in Mimoun (2009). The Dore al- conditions, the model cannot be used to predict
luvial plain is filled with quaternary alluvium and the transient response of the system. Also, the
surrounded by tertiary sedimentary rocks. The fill boundary conditions involve considerable sim-
consists of sandy gravel, sand, silt and clay. The plification of the hydrological system and can
alluvium forms a single unconfined aquifer where generate substantial effects on model results. The
groundwater flows from south to north. lateral boundary of the Dore river basin model has
The hydraulic characteristics of sediments been specified as a “constant head boundary” so
that form the aquifers control the direction and that the cells represent an inexhaustible source
the velocity of groundwater movement within the of water regardless of changes at the boundary
system. The horizontal hydraulic conductivity has and in the aquifer. For example, a cell boundary
been estimated from the pumping well test data with known fixed head may dry out because of
and the published data. Thus the obtained data the nearby pumping well, and the constant head
have been used as initial distributions that were cells simulating it will continue to supply water to
subsequently modified during calibration of the the aquifer. For many purposes, these assumptions
numerical model to achieve the best fit between do not impose serious limitations on the use of

744
Multi-Criteria Decision Analysis for Identifying a Suitable Location for Groundwater Pumping Wells

the model. However, interpretation of the results problems related to wells optimization. It is not
should be done carefully, as changes in any of these always guaranteed that gradient-based methods
conditions can influence the suitable location of will find out the global optimal solution in the
the groundwater pumping wells and the discharge. case of highly non-linear optimization problems
(Huang & Mayer, 1997). On the other side, global
5.2. Model Run search methods are efficient to find out the global
solution of the problem. Constraints in global
The model was calibrated by changing the initial optimization method can be incorporated into
values of recharge and hydraulic conductivity. the formulation and do not require derivatives
The recharge values of different zones were with respect to decision variables as in nonlinear
increased from 15% to 20% and the influence programming (McKenney & Lin, 1994). In this
was observed on the groundwater head. Values study, the genetic algorithm (GA) has been used
of hydraulic conductivity were also modified for as an optimisation model. First pioneered by John
the domain. The values of groundwater head, Holland in the 1960s, genetic algorithms have
calculated by the model, and observed head values been widely studied, experimented with and ap-
were analysed at a 95% confidence level. The plied in different fields of engineering. The GA
groundwater head in the middle part of the basin is a search technique used in computing to find
was found very close to the observed values of exact or approximate solutions to optimisation
head. The observation points close to the bound- and search problems. Genetic algorithms are a
ary of the model domain were not found within particular class of evolutionary algorithms (EA)
the range of the 90% confidence level due to the and are categorised as global search heuristics. The
complex geological formation, presence of hills GA techniques are inspired by such evolutionary
and inselbergs in that area. Therefore, there is a biology elements as inheritance, mutation, selec-
scope to refine the values of these parameters by tion, and crossover.
taking more field samples. The basic working steps of GA are as follows:

5.3. Optimisation Model 1. Randomly generate an initial population


and Objective Function IP(0)
2. Compute and save the fitness u(m) for each
Mathematically, a typical groundwater manage- individual m in the current population IP(t)
ment problem can be defined using three main 3. Define selection probabilities p(m) for each
components: decision variables, objective function individual m in IP(t) so that p(m) is propor-
and constraints (Ahlefeld & Mulligan, 2000). Opti- tional to u(m)
misation models are used to maximise or minimise 4. Generate IP(t+1) by probabilistically se-
an objective function without violating the given lecting individuals from IP(t) to produce
set of constraints. The objective function is typi- offspring via genetic operators i.e., crossover
cally defined by one or more decision variables and mutation.
and the optimisation models determine the values 5. Repeat step 2 until satisfying solution is
of these decision variables. Most of the traditional obtained.
optimization techniques are gradient-based and
the solutions from those methods produce local To evaluate the value of fitness function, pen-
optimal values rather than global optimal solution. alty function approach has been adopted. In this
Therefore, application of gradient based optimi- process, specific penalty is used after the violation
zation techniques may be difficult in realistic of constraints. This penalty varies linearly accord-

745
Multi-Criteria Decision Analysis for Identifying a Suitable Location for Groundwater Pumping Wells

ing to the magnitude of constraint violation. The The first constraint was taken to limit the
next generation population of chromosomes is drawdown of groundwater under the permissible
selected from the current generation with selec- limit. After the discussion with technical people
tion probability based on the evaluation score of from (the) water agencies, minimum 2 m water
each chromosome. column from the bottom impervious layer is
In this study, the objective function is to proposed, thus
determine the maximum discharge, which can
be withdrawn from the aquifer through a given hi > 258 m (3)
number of wells. Therefore,
where the average level of the aquifer bottom is
 Nw
 assumed at 256 m and hi is the minimum water
Maximize ∑ Qi − β1P (h ) (2) head on the periphery of the ith well and i = 1,
 i =1 
  2…., Nw.
The second constraint is defined to ensure the
Subject to minimum water discharge from all wells to fulfil
the minimum water demand of the area, and hence
Qi,min<Qi<Qi, max (2a)
Qtot > 10 000 m3 day (4)
hi>hi,min (2b)
The third constraint prescribes the maximum
(xi,yi)≠Ai (2c) and minimum discharge limit of a single well.
On the basis of aquifer properties and the avail-
(x i − x j )2 + (yi − i j )2 ≥ Sw ,min (2d) ability of pumps, the discharge limit was selected
as follows

hi,min − hi if hi < hi, min 2000 m3 day < Qi < 4000 m3 day (5)
P(h)=  (2e)
0 if hi ≥ hi,min

6. RESULTS AND DISCUSSION
where Qi,min and Qi,max are the minimum and maxi-
mum discharge limit for ith well, hi,min = minimum The suitable map values vary from 0 to 100 in
allowable head of groundwater at ith well, Sw,min the case study and it falls into four classes cor-
= minimum distance between any pair of wells; responding to four degrees of vulnerability. The
xj and yj are co-ordinates of remaining well, i.e., generic suitable map which has been established
i ≠ j; P(h) = penalty term, which varies linearly shows four potentiality classes: low, moderate,
with the magnitude of constraint violation, and high and very high. The weakly potential zones
β1 = weighting factor, which can be selected ac- occupy 39% of the total area and the moderately
cording to the problem. potential zones occupy 25%. In the MCA, two dif-
Constraints of the problems were finalised ferent locations in the study area have been found
with the help of water authority’s officials and suitable for establishing the new pumping wells.
stakeholders. The penalty function approach was After calibration, the model was used to identify
used to incorporate the constraints in the problem the maximum discharge from the wells at the suit-
solution. The following three constraints were able location derived by the multi-criteria analysis.
considered in the optimisation problem. The set of 3 and 4 wells was used to identify the

746
Multi-Criteria Decision Analysis for Identifying a Suitable Location for Groundwater Pumping Wells

maximum discharge at this location. Results show particle-tracking programme will be used to map
that maximum groundwater can be withdrawn recharge areas of the groundwater flow system.
from the set of 3 wells with a discharge of 4522 Future works could focus on complementary
m3/sec and from set of 4 wells with a discharge sensitivity analysis. In this study criterion weights
of 5941 m3/sec. Groundwater modelling results are based on the preferences given by experts with
depict that this location is able to give a greater the help of stakeholders. As they can change in time
amount of groundwater and is less vulnerable to and space, the methodology could include these
contamination. It also shows that extracting more variations. Fuzzyfication of the criteria valuation
groundwater from this zone lowers the stress could also bring more subjectivity to interpret the
on the aquifer, thereby resulting in an increased results with buffer around boundaries of the zones
extraction of groundwater in the future. which are recommended to locate pumping wells.

7. CONCLUSION ACKNOWLEDGMENT

The methodology developed aids in the evalua- The authors thank anonymous reviewers and the
tion and management of groundwater resources. editor for their critical comments which substan-
The combined use of GIS-based mapping and tially helped to improve the original manuscript
groundwater modelling acts as a powerful tool for publication.
for understanding groundwater behaviour and
identifying the best management practices at
watershed level, which can be used judiciously REFERENCES
for agricultural and drinking water purposes. The
multi-criteria analysis is also relevant to the evalu- Ahlfeld, D. P., Mulvey, J. M., & Pinder, G. F.
ation of relative suitability of the areas selected (1986). Designing optimal strategies for contami-
by GIS analysis for new pumping wells. nated groundwater remediation. Adventures in
This study also suggests that zones falling Water Resources, 9(2), 72–84.
in the potential zones can give a good amount Allen, A., Brito, G., Caetano, P., Costa, C., Cum-
of groundwater for fulfilling the water demand mins, V., Donelly, J., et al. (2002). Procedure for
of the area. The overall stress on the watershed the location of landfill sites using a GIS model.
can be reduced by using these wells. Hence, the In Proceedings of the 9th Congress of the Inter-
developed methodology can be applied in other national Association of Engineering Geology and
regions for the evaluation and management of the Environment, Durban, South Africa (p. 100).
groundwater resources.
The methodology for identifying the ground- Carver, S. J. (1991). Integrating multi-criteria
water potential zones is based on an overlapping evaluation with geographical information
of criteria and not on an analysis of pollutant mo- systems. International Journal of Geo-
tion. These methods ignore the dynamics of the graphical Information Systems, 5(3), 321–339.
groundwater flow system, as the results depend doi:10.1080/02693799108927858
on the weights assigned to each criterion and do
Cieniawski, S. E., Wayland, J., & Ranjithan, S.
not take the flow into account. The identification
(1995). Using genetic algorithms to solve mul-
of recharge is also one of the most important
tiobjective groundwater monitoring problem.
elements of a vulnerability assessment. The
Water Resources Research, 31(2), 399–409.
groundwater flow model in conjunction with the
doi:10.1029/94WR02039

747
Multi-Criteria Decision Analysis for Identifying a Suitable Location for Groundwater Pumping Wells

Civita, M. (2001). Groundwater vulnerability Keeney, R. L., & Raiffa, H. (1976). Decisions with
mapping in the Lauria Mountains (Southern Italy) multiple objectives. Cambridge, UK: Cambridge
using the SINTACS method. In Sciences et tech- University Press.
niques de l’environnement. Bensançon, France:
Kontos, T. D., Komilis, D. P., & Halvadakis, C.
University of Franche Comté.
P. (2005). Sitting MSW landfills with a spatial
Das, A., & Datta, B. (2001). Application of opti- multiple criteria analysis methodology. Waste
mization techniques in groundwater quantity and Management (New York, N.Y.), 25, 818–832.
quality management. Sadhana: Academy Pro- doi:10.1016/j.wasman.2005.04.002
ceedings in Engineering Sciences, 26(4), 293–316.
Lefkoff, L. J., & Gorelick, S. M. (1986). Design
Emch, P. G., & Yeh, W. G. (1998). Management and cost analysis of rapid aquifer restoration
model for conjunctive use of coastal and surface systems using flow simulation and quadratic
water and groundwater. Journal of Water Re- programming. Ground Water, 24, 777–790.
sources Planning and Management, 124, 129–139. doi:10.1111/j.1745-6584.1986.tb01694.x
doi:10.1061/(ASCE)0733-9496(1998)124:3(129)
Lin, H., & Kao, J. (1998). Enhanced spatial
Figueira, J., Greco, S., & Ehrogott, M. (2005). model for landfill sifting analysis. Journal of
Multiple criteria decision analysis: State of the Environmental Engineering, 125(9), 845–851.
art surveys international series in operations doi:10.1061/(ASCE)0733-9372(1999)125:9(845)
research and management science. New York,
Malczewski, J. (1999). GIS and multicriteria deci-
NY: Springer.
sion analysis. New York, NY: John Wiley & Sons.
Finney, B. A., & Samsuhadi, W. R. (1992).
Mantoglou, A., & Papantoniou, M. (2008).
Quasi-three-dimensional optimisation model for
Optimal design of pumping networks in coastal
J-kata basin. Journal of Water Resources Plan-
aquafers using sharp interface models. Jour-
ning and Management, 118, 18–31. doi:10.1061/
nal of Hydrology (Amsterdam), 361, 52–63.
(ASCE)0733-9496(1992)118:1(18)
doi:10.1016/j.jhydrol.2008.07.022
Goldberg, D. E. (1989). Genetic algorithms in
McKinney, D. C., & Lin, M. D. (1993). Ground-
search, optimization and machine learning. Read-
water optimization using genetic algorithms. Paper
ing, MA: Addison-Wesley.
presented at the SIAM Conference on Computa-
Gorelick, S. M. (1983). A review of distributed tional Issues in the Geosciences, Houston, TX.
parameter groundwater management modelling
McKinney, D. C., & Lin, M. D. (1994). Genetic
methods. Water Resources Research, 19(2),
algorithm solution of groundwater management
305–319. doi:10.1029/WR019i002p00305
models. Water Resources Research, 30, 1897–
Huang, C., & Mayer, A. S. (1997). Pump-and- 1906. doi:10.1029/94WR00554
treat optimization using well locations and
Mimoun, D. (2009). Connaissance et préserva-
pumping rates as a decision variable. Wa-
tion des ressources en eau de la nappe alluviale
ter Resources Research, 33(5), 1001–1012.
dans la basse vallée de la Dore: Rapport d’étude
doi:10.1029/97WR00366
pour le Parc Naturel Régional Livradois Forez
Johnson, V. M., & Rogers, L. L. (1995). Loca- (p. 118). Saint-Etienne, France: Ecoles Nationale
tion analysis in ground-water remediation using Supérieure des Mines de Saint-Etienne.
neural networks. Ground Water, 33(5), 749–758.
doi:10.1111/j.1745-6584.1995.tb00021.x

748
Multi-Criteria Decision Analysis for Identifying a Suitable Location for Groundwater Pumping Wells

Qin, X. S., Huang, G. H., & He, L. (2009). Simula- Wang, M., & Zheng, C. (1998). Ground water man-
tion and optimization technologies for petroleum agement optimization using genetic algorithms
waste management and remediation process and simulated annealing: Formulation and com-
control. Journal of Environmental Management, parison. Journal of American Water Resources,
90(1), 54–76. doi:10.1016/j.jenvman.2008.07.002 34(3), 519–530. doi:10.1111/j.1752-1688.1998.
tb00951.x
Ritzel, B. J., Eheart, J. W., & Ranjithan, S. (1994).
Using genetic algorithms to solve a multiple objec- Wang, Q. J. (1991). The genetic algorithm and
tive groundwater pollution containment problem. its application to calibrating conceptual rainfall
Water Resources Research, 30(5), 1589–1603. runoff models. Water Resources Research, 27(9),
doi:10.1029/93WR03511 2467–2471. doi:10.1029/91WR01305
Rogers, L. L., & Dowla, F. U. (1992). Groundwater Willis, R., & Finney, B. A. (1988). Planning model
remediation optimization with artificial neural for optimal control of saltwater intrusion. Jour-
networks and the genetic algorithm [abstract]. nal of Water Resources Planning and Manage-
Eos, Transactions, American Geophysical Union, ment, 114, 163–178. doi:10.1061/(ASCE)0733-
73(43), 186. 9496(1988)114:2(163)
Saaty, T. L. (1980). The analytic hierarchy process. Willis, R., & Yeh, W. W. G. (1987). Groundwater
New York, NY: McGraw-Hill. systems planning and management. Upper Saddle
River, NJ: Prentice Hall.
Shamir, U., Bear, J., & Gamliel, A. (1984). Opti-
mal annual operation of a coastal aquifer. Water Zeleny, M. (1982). Multiple criteria decision
Resources Research, 20, 435–444. doi:10.1029/ making. New York, NY: McGraw-Hill.
WR020i004p00435
Zheng, C., & Wang, P. P. (2002). A field demon-
Sumathi, V. R., Natesan, U., & Sarkar, C. (2008). stration of the simulation optimization approach
GIS-based approach for optimized sifting of for remediation system design. Ground Water,
municipal solid waste landfill. Waste Manage- 40(3), 258–266. doi:10.1111/j.1745-6584.2002.
ment (New York, N.Y.), 28(11), 2146–2160. tb02653.x
doi:10.1016/j.wasman.2007.09.032

This work was previously published in the International Journal of Agricultural and Environmental Information Systems
(IJAEIS), Volume 3, Issue 1, edited by Petraq Papajorgji and François Pinet, pp. 72-85, copyright 2012 by IGI Publishing (an
imprint of IGI Global).

749
750

Chapter 46
Last Online Deposits
Spatial Data in the Web
Mohamed-Khireddine Kholladi
University Mentouri of Constantine, Algeria

ABSTRACT
The posting of interactive mapping is essential for the dissemination of information to the general pub-
lic and in all areas. All the elements constituting a spatial object are represented by agreements with
symbols. Each element is represented at the level of reality. The posting of maps on the Internet can take
many forms. It can be static maps, as a picture. Interactions with the user can be included on the maps
produced. These are movements and functions of the zoom presentation (display information, change
of scale, global view). At this level, it is also possible to examine, by selection of the objects represented
on the map (common facilities). This level is commonly known as Web mapping. The dynamic mapping
is used when information is to be renewed or if the geographical extent of the area is large. In this case,
a server handles in real-time updated database to provide users answers to their complaints. The func-
tions proposed in this case are close to those of GIS software (acquisition, manipulation, management
and processing of geographical data). In this chapter we will explore the possibility of integrating a
dynamic mapping on the Web.

INTRODUCTION picture. As a result, the site or any part thereof


is not connected to a database. The pages are
For over thirty years, the cards are offered as a already built and are displayed (e.g. PDF) on the
faithful reproduction of a geographical space ob- user as they were originally created. Interactions
ject. All its components are represented by agree- with potential users can be included on the maps
ments with symbols. Each is represented on the produced. These are movements, and zoom func-
scale of reality. The maps posting on the Internet tions as the display presentation of information,
can take many forms. It can be fixed cards, as a the change of scale, the global view, and so on.

DOI: 10.4018/978-1-4666-2038-4.ch046

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Last Online Deposits Spatial Data in the Web

It is also possible to examine, by selection necessary calculations, if you want to perform the
of geographical space on the card, such as mu- topographic location of objects by using measures.
nicipalities, schools, etc. This level is commonly The physical figure which best represents the shape
known as Web-mapping. The dynamic mapping is of the earth called geoids. This is the form of the
applied when the information is to be constantly mean sea level. After the location of each point
updated or if the coverage of the area is great. in space measurements of angles and distances,
In this case, allows a server to query real-time calculations of the projections means from the
database updates to give users answers to their three-dimensional reality to the representation
complaints (De Blomac F, 2009), (Dupasque X, plane (Dupont G, 2004).
2005), (Dupont G, 2004) and (Mauvière E, 1999).
The functions proposed in this case approximate
those of a geographical information system (GIS) ONE POINT ON THE TERRESTRIAL
for the acquisition, handling, management and SURFACE LOCALIZATION
processing of geographic data. The publication
of the geographical information as numeric The altimetry permits to determine the height of
cards takes more and more interest because of one point that is expressed by the difference be-
the progress reached in the domain of the new tween the point and a surface of reference as the
technologies of information and the communica- sea level. The geoids are the surface of reference
tion and the arrival to maturity of the software of for the observations of altimeter. The measure of
spatial representation of the data or geographical the heights called levelling makes itself by lines,
information system (GIS) [De Blomac F, 2001]. whose intermediate points are marked with the help
In this chapter, we will focus on the possibility of ankles implanted in the facades of the buildings.
of integrating a dynamic mapping on the Web. Our The set of these reference marks constitutes the
aim is to provide an overview of the potential of network of levelling of the studied geographical
the Internet for GIS and to give what he wishes zone. All information on these reference marks
to put their GIS dynamic web to choose between is listed by the direction of the land-registry of
different opportunities (AFIGEO, 1998). the wilaya.
The plane coordinates of one point are cal-
culated near to the centimetre with the help of
STATE OF CARTOGRAPHY ART triangles, where we measured the angles and the
distances with tachometers. To orient their obser-
The mapping allows the use of techniques for vations, the geometers determined the coordinates
achieving the cards. It is used to better understand of the mosques minarets or the water castles that
and define the space, territories and landscapes. represent very visible reference marks from afar.
It is also used in related sciences, demography The administration of the land-registry and the
and economy in order to propose a reading of topography manages the geodesic basis network
the geo-phenomena. How can we be a part of the composed of score thousand points. The system of
irregular shape of the spherical earth on a globe? positioning by GPS satellites (Global Positioning
To be able to give a reliable description of a ter- System) permits the precise determination of the
ritory and to locate objects in space, it is essential coordinates of one point quickly on earth. To be
to know the exact shape of the earth. That is the able to produce a card or a plan from the measured
purpose of surveying and mapping work. The data, we conduct calculations of a mathematical
ellipsoid is the reference surface to represent the projection. Several geodesic systems exist. An
best shape of the earth. It is useful to perform the

751
Last Online Deposits Spatial Data in the Web

important system is the WGS4 that is the basis of ments. At the time of the restitution photogram-
the GPS system (Laurini R. & Thompson D, 1992). metric, the aerial photos are installed by couples
in a device named restitutor, which permits to
re-establish a model of the relief and to seize all
THE CURVES OF LEVEL visible elements numerically to stock them in a
data base. The restitution represents an important
The level curve is a polyline that passes by all phase in the chain of production of a card.
points of the land situated to the same altitude. Our main challenges are the seizure of all
The earth relief is represented on a plane card by sought-after elements, the seized interpretation
level curves and points sides. If one represents a correct of the objects and the guarantee of the
mountain cut in horizontal plates of equal thick- geometric precision required. The result of the
ness, the contours of these plates, seen from top, restitution must be controlled therefore and must
draw the curves of level that follow the relief be completed by information of descriptive,
shape. The disposition of the level curves on the nominative and administrative nature, where the
card informs on the shape of the relief (stiff slope nature of a building (of dwelling, agricultural,
or soft slope). The value of the heights curve is commercial or industrial), its attributes (school,
registered on the level curve. The orientation in- town hall, reservoir, etc.), the names of the streets
forms of slope direction. The numbers are always and the toponyms (localities, inhabited places) and
orienting upstream (Kholladi M. K, 1999b). the administrative and other limits. The scale of a
card expresses the report of reduction between the
reality and the representation on the card. A big
PRODUCTION OF A CARD scale of card permits to represent a lot of details,
when a card has a small scale. It gives us a better
The photogrammetry is a cartographic technique general preview on a whole region. Different ap-
that permits to measure all visible elements plications require some cards to different scales
from aerial photos of the territory. From a plane (Kholladi M. K, 1999a; [Kholladi M. K, 1999b;
equipped with a special camera, some aerial and 9Laurini R. & Thompson D, 1992).
photos are taken to very short intervals, so that The topographic card is a faithful representa-
two successive photos cover themselves to about tion of the reality. To arrive there, the cartographer
60% in direction of the skimming. Thanks to the restores the information in a legible and compre-
principle of the stereoscopy, we can measure and hensible cartographic picture. Its instruments are
reconstitute all elements situated inside the 60% of the symbols, the signatures, the colours and the
cover from a couple of aerial photos. The height texts. The graphic elements used in the card are
of the skimming depends on the scale of the card regrouped and are explained in the legend of the
that we want to produce. Today the planes of pho- card. The choice of the symbols and colours is
togrammetry are equipped with the GPS system. guided by the principle to bring closer for example
A relation between the earth and the aerial to best the graphic representation to the reality,
photo is established by the signalling of the points the use of the bruise for the rivers, of the green
knew on the land. These points can be detected for vegetation and symbols for the camping, the
on the photo, what permits to situate all points of sports, etc. An analysis of the different editions
the photo in the system of coordinates wanted. To of a card permits to retrace so much in detail the
re-establish the three-dimensional relief from the evolution of the regional development spatial
aerial photos, the cartographer uses a grant natural such economic and social. The information on the
of the man: the faculty to see in three measure- date and the shape of the update of the edition are

752
Last Online Deposits Spatial Data in the Web

Figure 1. Example of thematic layers


indicated in the legend of the card, what denotes
the actuality of the card. Available until now as
cards and plans, the geographical information
presents itself henceforth as computer data base,
which permits exchanges of more efficient data,
as well as exploitation and a lot more rational
update (AFIGEO, 1998).

GEOGRAPHICAL INFORMATION
SYSTEM (GIS)

A Geographical information system is a computer


tool permitting to represent and to analyze all
geographical objects as well as all events that
produce themselves of it [Demegre J, 2005]. The
GIS offer all possibilities of the data bases (as ask simple questions as (Kholladi M. K, 2004a)
requests and statistical analyses) and that, through and (Laurini R. & Thompson D, 1992):
a unique visualization and geographical analysis
clean to the cards. • To whom does this parcel of land belong?
Many other domains as research and the de- • Which of these two zones is the most
velopment of new markets, the survey of a con- distant?
struction impact, the regional development, the • Where are the exploitable lands for a given
management of the networks of water, electricity, economic activity?
gas or transportation, the follow-up in real time
of vehicles, the civil protection, the follow-up of Some of the questions integrating an analysis,
the evolution of the pollution of a territory, the as for example:
follow-up of the desertification, etc. are concerned
as directly by the power of the GIS to create
some cards, to integrate all type of information,
Figure 2. The two models of the reality representa-
to visualize the different scripts better, to present
tion: vector and raster
the ideas better and to measure the impact of the
possible solutions better.
A GIS permits the storage of the information
concerning the world under the shape of bound
thematic layers the some to the other by geogra-
phy (as on the Figure 1) (Kholladi M. K, 2004a;
Kholladi M. K, 2004b; Laurini R. & Thompson D,
1992 and Laurini R. & Millert-Raffort F, 1993).
The Systems of Geographical information
exploit two different types of geographical mod-
els: the vector model and the model raster as on
the Figure 2. If one has a GIS and data, one can

753
Last Online Deposits Spatial Data in the Web

• What are the virgin lands to construct new To answer these questions, the GIS has calcu-
houses? lations algorithms named “buffering” in order to
• What are soils adapted to the plantation of determine the relations of proximity between the
a type of tree? objects.
• If I construct a freeway here, what will be
the density of the traffic of it?
SPATIAL ANALYSIS
The GIS arranges many and powerful tools
of analysis, but two among them appear as par- The integration of data by different information
ticularly essential: the proximity analysis and the layers permits to do a spatial analysis. This analysis
spatial analysis. by crossing of information, if it can take place
visually (to the tracings identical superimposed the
some to the other) requires the crossing often with
PROXIMITY ANALYSIS alphanumeric information. The crossing of the soil
nature, of its declivity, of the present vegetation
How much does existing houses in a zone of 80 with the owners and the taxes paid is an example
meters on both sides of a road to big circula- of sophisticated analysis that permits the GIS use
tion? (See Figure 3). What is the total number of (as on the Figure 4) (Kholladi M. K, 2004a) and
customer in a ray of 5Km around a big surface? (Laurini R. & Milleret-Raffort F, 1993).
The card is a much better tool of synthesis and
presentation of information. The cards created
with a GIS can integrate the reports, the three-
Figure 3. A road neighbourhood request dimensional views, the photographic pictures and
all sorts of elements multimedia henceforth eas-
ily. The capacity of the GIS concerning the data
research and the geographical requests execution
allowed numerous enterprises or administrations
to save the hundreds of millions of Algerian Dinars.
A realtor can use a GIS to find houses of a certain
surface, with three rooms and including a garage.
It gets a list of the houses answering these criteria

Figure 4. The thematic layers superposition

754
Last Online Deposits Spatial Data in the Web

Figure 5. Request of a house research with criteria


of the cards via the Internet network is a domain
in full expansion thanks to the development of
the solutions Open Source. Following the GNU
philosophy that allows the copy, the diffusion of the
software and the modification of the code source,
these generally free programs and of emergent
free use at a rhythm sustained.
The concept of the Web cartography consists
in three components: geography, information
and the Web. The approach of the Web Mapping
is adapted to a large public no specialist: simple
as well as their geographical localization thus interface, oriented “cartographic atlas” and on line
simultaneously as on the Figure 5. GIS, consultation by connection to weak debit,
This request can be refined thereafter with the fast and independent solution of the navigators,
help of two new criteria: a renting lower to 5000 without Plug in (De Blomac F, 2009) and (Soussain
DA per month and situated unless about hundred G, 2001). It permits a complete integration in an
meters of a school. The result of this new request existing site and the possibilities of interactions:
will be displayed in the same conditions as on the the dual brightness of the entities, the comple-
Figure 6 (Kholladi M. K, 1999a]) and (Sohm J, mentary information, the display of the themes,
2003). etc. The cartographic functionalities are:

• Navigation: inside a same document.


WEB MAPPING • Global view: seen permitting to localize
the zone of visualization on a general pre-
The needs to communicate some information in view of the document.
the domains of management of the risks, of the • Zoom: the user has the possibility to zoom
regional development, or more environment, in the size of the visualization zone.
the publication of cartographic data on Internet • Icons: presence of interactive symbols in
is means of indispensable communication (De the visualization.
Blomac F, 2001), (De Blomac F, 2005) and (Sous- • Zones of click: interactive zones in the vi-
sain G, 2003). The Web Mapping or the diffusion sualization triggering some events (pop up,
info bubbles, etc.).
• Management of layers: possibility to vi-
Figure 6. Composed spatial requests sualize several layers (tracings).
• Requests user: Semantics and/or geomet-
ric, thematic, predefined or no.
• Update: interface of update of the
spatial data and/or attributes without
programming.

755
Last Online Deposits Spatial Data in the Web

GENERAL PRINCIPLE or ASP. The cartographic server leans on these


OF A CARTOGRAPHIC elements to receive some requests and to send
WORKING SERVER back the pictures and data. Client side, a navigator
Web is sufficient, accompanied by a viewer, to
The cartographic server is the automatic ticket display the card (cf. Figure 7) (Durand H, 2003)
window where we make a call to display some and (Pewe B, 1997).
cards on its station of work. The architecture is The data can be managed by specific software,
of type client-server. The user, from his terminal, the SGBDR, such PostgreSQLs, MySQL, Oracle,
spear of the requests to ask for a particular card among others, that can be installed directly on the
display. The cartographic server the interprets and server containing the cartographic server or on
send back the card under the shape of a matrix another server, distant. PostgreSQL and its exten-
picture (png, jpg, etc.) or vectorial (svg, swf, sion spatial PostGIS is the management system
etc.). The cartographic motor can be controlled of relational data bases the more succeeded in the
by languages of script as PHP, java script, Python domain of the free software. PostGIS is included
or Perl that allow him to generate a card dynami- henceforth in the official distribution of Post-
cally in answer to the user’s request (Thomazeau GreSQL. It offers functionalities deepened to
J., 1997) and (Soussain G., 2003). stock and to treat geographical reference of the
The cartographic server can look for the infor- spatial objects.
mation necessary to the realization of the card in The different software bricks have been tested
its own resources, but also on servers of distant with the systems of exploitations Windows XP
data. The diffusion of the on line data requires an and Linux Mandrake 10 to value their interoper-
installation side server with software as Apache ability. To simulate the situation of exchange of
(project Open Source) or IIS (Internet Information data via Internet, MapServer 4.4, PHP/MapScript,
Services, of Microsoft) that turn in task of bottom the SGBDR extended PostgreSQL 7.5 of PostGIS
and give access to the cards servers to the intranet 0.9.1 with its interface graphic phpPgAdmin has
and to the Internet. These servers often see their been compiled on a dedicated server Linux. The
functions spread by interpreters of scripts as PHP user will have permissions of reading and writing

Figure 7. The cartographic server

756
Last Online Deposits Spatial Data in the Web

in the arborescence of the indexes of the machine THE DM SOLUTIONS PRODUCTS


server, for the storage of the solutions side server
while the solutions client side will be placed on These are MAPLAB, CHAMELEON, KA-MAPS
the local machine in a Windows XP pro environ- and CARTOWEB3. Figure 9 summarizes the
ment (De Blomac F, 2001). features of each of them.

Other Solutions
SOLUTIONS CLIENT SIDE
On the Web, we can find a big number of other
The software tested client Open Source, QGIS, available solutions in downloading. They are
UDIG, THUBAN and JUMP, are software GIS counted on the sites of FreeGIS and Maptools,
in full evolution having functionalities advanced and are of easiness of access and use variables.
from afar for the consultation and the geographical In general the simple and easily adaptable solu-
data edition (see Figure 8). tions for its own geographical data are viewers as
In practice, these applications are not always Veremap or Pmapper. This last proposes a set of
located entirely side server because they can re- tools for example interesting Java scripts as the
quire the downloading of a light plug-in by the zoom by cursor, the legend in hierarchical tree
customer (for example the Rosa applet). Of basis, and the impression in pdf. Then comes the more
the solutions side server Open Source brings the complex solutions and evolved like Mapbender or
possibility from a classic navigator Internet to MapLink. On-line, it is possible to select a polygon
visualize the generated dynamically geographical and to modify its geometry (see on the Figure 9)
layers. Currently, new more advanced function- (Marshall J, 2001) and (Mauvière E, 1999).
alities appear as the possibility to add or to Actually the solutions client-server can be
modify on line information. completely complementary, one (server) permit-
ting to do some treatments clean to the problematic
of the GIS (crossings of layers, requests, analysis
thematic), the other (customer) through the inter-
mediary of the mapfile displaying the card in an

Figure 8. Solutions server side

757
Last Online Deposits Spatial Data in the Web

Figure 9. Web mapping solutions

accessible interface on Internet. Otherwise, we see requests and analyses. The diagram of the present
the modules Open Source that transplants them on Figure 10 how MapServer integrates in a system
the big commercial GIS as Amein maintained by server Internet (Gifford F, 1999).
Territories for ArCGIS 8 and Wortomap developed The main libraries used by MapServer are:
by usable Tydac with MapInfo 7.8 that allows him
to export the mapfile of the GIS to display the • GDAL and OGR: Interpretation and con-
same card in an interface Map Server. version of the spatial data.
• Proj.4: Reproduction, transfers and distor-
Web Carto as Solution of tion of the spatial data.
Web Dynamic GIS • GD: Generation and modification of pic-
tures to the formats bitmap of the Web
The Web Carto product is an interactive Web-GIS, (Herself will possibly have need of the
based on the motor cartographic Open Mapserver JPEG libraries and PNG).
source. It permits the presentation, but also the • FreeType: Management of the fonts.
acquirement of geographical data through the
Internet. It is a tool permitting to generate some We can also mention the libraries that are going
cards from various spatial data, according to to allow the generation of cards particular formats:
the instructions contained in what we call some
Mapfile, of the files “.map”, regrouping the pa- • MING for the format flash.
rameters describing the way whose cards must • PDFlib for the PDF format.
be presented. We can beside use a model html
(commonly named a template), that will manage MapServer accepts a big number of formats
the layout of the display of the data generated. data raster, there again thanks to the library of
The most often, it is used on a server Internet GDAL functions. Without this library only some
to generate some pictures in Web pages, and so formats geographical reference rasters is sup-
to permit the display but also the questioning, ported: GeoTIF, hair *, gif *, png *, jpeg *, and
the modification, of cartographic pictures on an Erdas Lan. The possibilities raster of your instal-
Internet site. We can also use MapServer in local, lation of MapServer goes therefore essentially to
on its computer, to generate the cards, to do the depend on the capacities of GDAL that have been

758
Last Online Deposits Spatial Data in the Web

Figure 10. MapServer in a server Internet

specified at the time of its compilation. To add wms_srs EPSG:42304 EPSG:42101


a layer raster in a Mapfile, here is the syntax of EPSG:4269 EPSG:4326 EPSG:42304
the LAYER block: wms_name
prov_bound
LAYER wms_server_version 1.1.0
NAME Scan wms_formatlist image/gif,image/
DATES Constantine.tif png,image/jpeg,image/wbmp
RASTER TYPE wms_format image/gif
STATUS ONE END
END

THE CARTOWE3 PRESENTATION


SERVERS TO THE NORM WEB MAP
SERVICE (WMS) OF THE OGC Most solutions of interactive cartography proposed
on the market are descended of the world of the
The use of a WMS source requires the presence GIS and don’t often answer fully the needs of the
of a block PROJECTION in the MAP block, as dynamic Internet sites. Camptocamp its developed
well as the use of a METADATA block that is CartoWeb thus, a Web application of Geographi-
going to specify the request made to the server. cal information System (GIS) interactive, while
We have for example: especially respecting the specific needs of the
web. It is also conceived especially to integrate
CONNECTION http://www2.dmsolutions. easily in nearly all types of sites dynamic internet.
ca/cgi-bin/mswms_gmap? It permits to put easily in evidence the spatial as-
CONNECTIONTYPE WMS pects of the data presented notably, even though
METADATA these are not part of a GIS (Jongler D. & Lestrat

759
Last Online Deposits Spatial Data in the Web

I., 2001), (Jongler D. & Lestrat I., 2002) and • Compatibility with the applications GIS
(Kholladi M. K., 2008). standards of the market.
The proposed solution is especially interest- • Compatibility with the GPS and the PDA.
ing since it permits the visualization (diffusion) • Respect of the norms and standards (OGC
and the seizure (collection) of the geographical WMS, OGC SFS, GML, XML, XHTML,
objects (lines, points, surfaces) through different and SOAP).
Web sites. It permits to benefit from a solution
of web-GIS to very attractive costs, since this
solution doesn’t require the purchase of licenses ADVANTAGES OF USE
commercial GIS. However, CartoWeb integrates
without any problem in an environment existing We will mention in some points the advantages
GIS, since the formats supported by this system of the CartoWeb system:
are compatible with most formats of the GIS ap-
plications of the market. • Enhancement of the spatial information in
a Web site.
• Easy setting up, little programming.
SPECIFICATIONS • Weak cost of implementation (not of ex-
pensive license - federation of the costs of
The Web Carto product is an interactive Web-GIS, acquirement of the cartographic data).
not only permitting the presentation, but also the • Possibility to seize and to modify geo-
acquirement and the geographical data manage- graphical information via Internet.
ment through the Internet and more especially the • Compatibility and integration with the GIS
Web. The Web Carto has been developed while customer server of the market.
respecting the specific needs of the Web in the • Possibility to rent the access to functional-
GIS domain, to know (De Blomac F, 2005): ities (ASP fashion).

• Spatial visualization of the data in the Web CartoWeb will be used in particular for [De
sites (Webmapping). Blomac F, 2009]:
• Edition and seizure of data through the
Web (interactive Web-GIS). • To distribute the cards or plans on the Web
• Integration of the interactive Web-GIS in a with specific thematic information and all
site web, existing or in creation, simplified functionalities of navigation (zoom, dis-
to the maximum. placement, etc.).
• Interaction facilitated of the geographical • To bind spatial information to the informa-
functionalities with the relational data bas- tional content of the data bases.
es (out GIS) of the site web. • To seize and to modify the on line geo-
• Possibility to insert the cartography direct- graphical information, by the edition of
ly in any page of a site in the content (har- objects of type points, lines or surfaces.
mony between the site and the GIS tools).
• Possibility to share and/or to rent geograph-
ical information (cartographic funds). ARCHITECTURE OF WEB CARTO
• Weak cost of acquirement and
implementation. The Web Carto is an efficient solution to the needs
concerning treatment and diffusion by Internet

760
Last Online Deposits Spatial Data in the Web

of managed data or produced by a GIS classic CREATION OF A PROJECT OF WEB


client-server. A strong point of Web Carto is the MAPPING
easiness of interaction between present data in
external data bases and the geographical data. It The Gait Describes the Project Configuration.
permits the presentation, but also the acquirement
of geographical data through internet. Project Installation
It has been developed while having to the mind
the specific needs of the Web in the domain of the In a line of order the message of solicitation (DOS,
GIS. It is generally used as application standalone. shell), to throw the manuscript of installation of
However Web Carto is constructed on an archi- cw3setup.php.
tecture multi third that permit, if in the need be,
to establish it as Web Service. It is then possible <PHP-INTERPRETER>cw3setup.php to in-
for several Web sites to use the same geographi- stall
cal server. It possesses two fashions of working --base-URL http://localhost/car-
in direct or in SOAP (sees Figures 11 and 12). toweb3/htdocs /
Functionalities of the CartoClient are activated --development of profile
on simple passage of parameters (see Figure 13). --<nameOfProject> of project
The development of CartoWeb respects the main
condition for a perennial computer application, to So no mistake occurred we must be able to
know an efficient separation between the data, the reach the project by http://localhost/ cartoweb3/
treatment, the presentation and the language. There htdocs/client.php of typing for the navigator and
is not a code HTML mixed to the code of PHP to choose the project from the slim drop-down
treatment. The texts of the interface also sepa- of the projects.
rated. It permits to put a multilingual site in place
therefore easily. It assures the application evolutiv-
ity, since the central functionalities are not touched DATA OF LOADING
at the time of a change of graphic charter or the
addition of a peripheral functionality. As we can see it, the project that we created is
empty. Thereafter, we will load the spatial data
Figure 11. Direct Mode of the project. To add the next one in the Mapfile
(project_name.map): To PUT THE TYPE THE
COLOR NAMED 240 of region of NAME of
MODEL of class region of ttt of CALIBER of
reg_algerie of POLYGON DATA 240 240 OUT-
LINECOLOR 255 EXTREMITY of extremity of
CODE id_attribute_string of 165 96 of MÉTA-
GIVE extremity of END
Figure 12. SOAP Mode And in the file of layers.ini (server_conf/
project_name /) to add what follows:

• layers.region.className = layers.region.
label layer = Regions
• layers.region.msLayer = region

761
Last Online Deposits Spatial Data in the Web

Figure 13. The architecture Composition

• And to add the new identification of layer CONCLUSION


in the list of children of layer of the root:
• layers.root.children = region The realization of dynamic cards on Internet is
• To publish besides the file of project_name. essential when we want to distribute geographi-
ini in a publisher of text in order to place cal information. Often used in the localization of
some initial mapstates by default as chosen the places by their addresses, in the calculation
layers. of itineraries or for the geographical marketing,
• To add what follows: its uses are increasing and enlarge toward new
◦◦ mapInfo.initialMapStates.default. sectors. Descended of the data processing applied
layers.region.selected = rectify to geography, this tool stands to the crossing of
◦◦ In your navigator, the click on the techniques allying the relational data manage-
button of reset_session and one must ment, the representation and the spatial analysis,
see the regions of Algeria that one the programming in script language (Kholladi
wants to display. M. K., 1999a).
• To add our data, one must simply proceed Actually, we find either of the reusable
from the following manner: components (Chameleon), of the interfaces web
◦◦ to get our available data, by default in (CartoWeb, Ka-Map), either of the more complete
the shirt of data solutions of help to the management and to the
◦◦ to modify the Mapfile size, realization of Web applications (MapLab). As
◦◦ to modify the size defined in the ini- bending on the construction of a card on line,
tial Mapstateses (project_name.ini), only QGIS in customer and MapLab thanks to
◦◦ to add the definitions of layers in the its MapEdit tool side server permits to publish
mapfile (project_name.map), graphically and dynamically a mapfile. In this
◦◦ to add the definitions of layers in the domain many progress remain last one has to be
layers.ini. done because this last is not flexible enough in
◦◦ to hand to zero the session in the nav- its syntax and in its adaptability to the different
igator, and to get our data to display. environments (Jongler D. & Lestrat I., 2001).

762
Last Online Deposits Spatial Data in the Web

REFERENCES Kholladi M. K, (1999a, November 15-17). Mé-


canismes de raisonnement spatiaux sur des objets
AFIGEO, (1998, May). L’information géographiques, Séminaire International sur les
géographique française dans la société de SIGs, « AL-SIG’99 » organisé par le Conseil
l’information: état des lieux et propositions National de l’Information Géographique « CNIG
d’action, p. 35. », Alger, p. 9.
De Blomac, F. (2001). L’open source, véritable Kholladi M. K, (1999b, November 15-17).
alternative pour les SIG? p. 4. Modèle Numérique de Terrain par la Méthode
De Blomac F, (2005). Les systèmes d’information de Triangulation de Delaunay avec Contraintes,
géographique territoriaux-La mise en ligne des Séminaire International sur les SIGs, « AL-SIG’99
données géographiques: principes et expériences, » organisé par le Conseil National de l’Information
Cybergeo: European Journal of Geography, n°318, Géographique « CNIG », Alger, p. 21.
p. 123. Kholladi M. K, (2004a, September). Raisonnement
De Blomac F, (2009, June 01). Publier des cartes et Manipulation Spatial sur des Objets Urbains «
sur Internet: ces solutions qui nous viennent du RMSOU », CISC’2004: Conférence Internationale
graphique. SIG La Lettre, n°108, p. 3. des Systèmes Complexes, Jijel, p. 13.

Demegre, J., & Salge, F. (2004). Que sais-je? Kholladi, M. K. (2004b, December). Geographical
Les SIGs, p. 84. Information System for Urban Spatial Analysis,
ACIT’2004: The International Arab Conference
Dupasque X, (2004, April). Etats des lieux des on Information Technology, Mentouri Univer-
solutions Internet de diffusions des données sity of Constantine, Algeria, pp. 188-193, ISSN:
géographiques, Géo événement. 1812-0857.
Dupont G, (2004, January 23). L’IGN cartographie Kholladi, M. K. (2008, October 15-16). Carto
la France en numérique, Journal: Le Monde, p. 2. Web Solution for the Web Mapping, IACET: Inter-
national Arab Conference of e-Technology, Arab
Durand H, (2003, April). Du SIG à la donnée sur
Open University, Amman, Jordan, pp. 105-113.
internet: Quel mode de diffusion pour quel public.
Journée professionnelle SIG-LR. Laurini, R., & Milleret-Raffort, F. (1993). Les
bases de données en géomatique. Paris, France:
Gifford, F. (1999). Internet GIS Architecture -
Editions Hermès.
Which Side Is Right for You? p. 11.
Laurini, R., & Thompson, D. (1992). Fundamen-
Jongler D. & Lestrat L, (2001, February). Dossier
tals of Spatial Information Systems, the APIC.
technique: les SIG et Internet, Signature n°20,
Series Academic Press.
CERTU, p. 11.
Marshall, J. (2001). Developing Internet-Based
Jongler, D., & Lestrat, L. (2002). Cartographie
GIS Applications, p. 11.
sur Internet, Fiche technique du CNIG (Aide à
la Maîtrise d’Ouvrage) n°64, p. 8. Mauvière, E. (1999). La cartographie sur le web.
p. 40.

763
Last Online Deposits Spatial Data in the Web

Pewe, B. (1997). GIS online, p. 125. WEB SITES TO VIEW


Pottier, P. (2000). La sémiologie et communica- http://www.camptocamp.com/
tion cartographique, Fiche technique du CNIG
n°40, p. 6. http://www.cartoweb.org/

Sohm, J. C. (2003). Image et graphique pour le http://www.commentcamarche.net


web, p. 5. http://www.esrifrance.fr/
Soussain, G. (2001). Publier des cartes sur inter- http://www.geoscopies.net
net: Les repères. p. 64.
http://www.ieti.fr/Panorama_SIG_WEB.pdf
Soussain, G. (2003). Cartographie sur Internet:
synthèse à l’usage d’une maîtrise d’œuvre. p. 7. http://www.mysql.com

Thomazeau, J. (1997). Le Web support http://www.opensource.org


d’applications client/serveur, une voie suivie par
http://www.ors-idf.org/etudes/pdf/carto-web.pdf
la DSI pour le système d’information de gestion
du CNRS, p. 12. http://www.portailsig.org
http://www.sitewww.ch/dossier_solution_car-
tographie.php
http://www.webrankinfo.com/actualites/tags/
cartographie-sur-internet.htm

This work was previously published in Handbook of Research on E-Services in the Public Sector: E-Government Strategies
and Advancements, edited by Abid Thyab Al Ajeeli and Yousif A. Latif Al-Bastaki, pp. 182-196, copyright 2011 by Information
Science Reference (an imprint of IGI Global).

764
765

Chapter 47
Spatial Modeling of Natural
Phenomena and Events
with Artificial Neural
Networks and GIS
Silke Noack Alexey Irkhin
Beak Consultants GmbH, Germany Beak Consultants GmbH, Germany

Andreas Barth Evelyn Bennewitz


Beak Consultants GmbH, Germany Beak Consultants GmbH, Germany
Frank Schmidt
Beak Consultants GmbH, Germany

ABSTRACT
Artificial neural networks (ANN) are used for statistical modeling of spatial events in geosciences. The
advantage of this method is the ability of neural networks to represent complex interrelations and to
be “able to learn” from known (spatial) events. The software advangeo® was developed to enable GIS
users to apply neural network methods on raster geodata. This statistic modeling can be displayed in
a user-friendly way within the ESRI ArcGIS environment. The complete workflow is documented by the
software. This paper presents three pilot studies conducted to illustrate the possibilities of spatial pre-
dictions with the use of existing raster datasets, which described influencing factors and the selection
of known events of the phenomenon to be modeled. These applications included (1) the prognosis of soil
erosion patterns, (2) the prediction of mineral resources, and (3) vulnerability analysis for forest pests.

INTRODUCTION dence of a phenomenon on several main influenc-


ing factors) is rarely feasible. Typically, a model
Natural phenomena and events are usually caused only refers to some aspects of the phenomenon in
by a complex of interacting factors. An exact question. Due to the lack of data and knowledge
mathematical formulation of a geo-scientific task, of details of many geo-processes, mathematical
however, (with equations describing the depen- models cannot be successfully defined and applied
with reasonable amounts of effort. As a result,
DOI: 10.4018/978-1-4666-2038-4.ch047

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Spatial Modeling of Natural Phenomena and Events with Artificial Neural Networks and GIS

the exploration of artificial intelligence (artificial RATIONALE AND OBJECTIVES


neural networks, ANN, e.g., Haykins, 1998) as a OF THE STUDY: SELECTION OF
means to provide a reliable tool to analyze causal AN APPROPRIATE MODELLING
relationships and to make the knowledge available METHOD FOR SPATIAL EVENTS
for predictive tasks was initiated. This approach
differs from traditional methods in that viable The location of a natural event is determined by a
results may be obtained with reasonable efforts complex network of influential causes and subse-
invested in data processing, model design and quent effects. Hence, the relationships between the
computational time. parameters are usually characterized by qualitative
Results for three different case studies are descriptors rather than quantifiable means, if they
discussed in this paper, following an introduc- can be described at all in a reasonable amount of
tion on the theoretical background of the neural time. This hampers the (mathematical) modeling
network approach. The outcomes, predictive maps and spatial prediction of natural phenomena. In
illustrating the probabilities of occurrence for a general, spatial pattern of events can be modeled
given event of interest, form an important basis by two different approaches:
for planning activities. The case studies deal with
various tasks in applied earth sciences and dem- 1. By conducting detailed studies of physical,
onstrate the rather universal applicability of the chemical and other relations to establish an
coupled ANN/GIS approach that is presented in accurate quantitative model of the processes
this paper. In particular, the following objectives with mathematical-analytical methods (e.g.,
applied to the case studies presented here: using finite elements to model slope stabil-
ity). With this approach, equations are used
1. Spatial prediction of soil erosion channels to to parameterize and model natural processes.
localize damage on arable land and to infra- Equation calibration is usually accomplished
structure and to model effects of mitigation by adjusting “constants”, based on the com-
measures; parison of modeled results and measured
2. Prediction of the most promising locations data.
for mineral exploration, based on available 2. By a statistical approach, whereby poten-
geological data within a gis environment; tially influential factors are evaluated by
3. Spatial prediction of the most vulnerable multivariate methods, based on statistical
forest stands in case of spreading bark beetle correlations. The influence of several inde-
infections. pendent factors on the dependent variables
(e.g., the development of a geo-hazard such
Although the value of the application of neural as slope failure) is investigated.
network technologies in GIS environments was
recognized in the past, actual application of this The calibration of analytical models is usually
remained a challenge for standard users due to the connected with time-consuming and cost-intensive
lack of user-friendly tools. Thus, the overall goal field studies. As a result, it is not uncommon that
of the three case studies was to study the quality complex natural processes cannot be adequately
of model outputs and the general applicability of described with derived physical equations for more
the ANN/GIS approach included in the software extensive areas of interest, even given reasonable
advangeo®. efforts to do so.

766
Spatial Modeling of Natural Phenomena and Events with Artificial Neural Networks and GIS

The statistical approach has the advantage of METHOD: ARTIFICIAL


being able engender usable results with much NEURAL NETWORKS
lower amounts of effort in terms of data capture
and calibration. Well-known and widely used Artificial Neural Networks (ANN) (Haykins,
multivariate methods include regression, cluster 1998) can be used for the statistical modeling
and discriminant analyses. of spatial events. This method has proven to be
The ANN approach is a multivariate analysis capable of representing complex interactions. In
method. With this method, it is possible for users addition, it can “learn” by analyzing the combina-
to model complex, non-linear relations without tion of input data at a given location with known
prior or initial knowledge of the relationships events. Following training with such known
amongst the input layers. These relations are events, ANN applies the “knowledge” to more
investigated by the ANN software during a extensive areas under similar boundary conditions.
“learning process” in which a sizable amount ANN help to detect the controlling factors for
of input data can be analyzed at various scales. analyzed events and can also return a probability
However, type of data layers selected and the value, which describes the likelihood of event
manner in which they are prepared by the user occurrence for a particular location. These results
will influence the quality of outputs generated. are of vital importance, as they can inform about
Therefore, a general understanding of the baseline the nature of geo-hazards and aid in the planning
processes to assess the quality of the modeling and implementation of mitigation measures. In
results produced. the case of mineral exploration, the analysis can
Various authors have successfully applied help prospectors to minimize their efforts and
ANN to forecast the occurrence and extent of expenditures by focusing resources and work on
spatial events. Examples of applications have the most promising locations.
included the accurate identification of slope It should be noted that all forms of approaches
failure processes (Fernandez-Steeger, 2002), the have their own sets of precautions and shortcom-
modeling of spatial air pollution patterns (Lin et ings. The ANN approach is no exception; however,
al., 2004), the propagation of flood waves (Pe- identification of these challenges and restrictions
ters et al., 2006) and the prediction of mineral allows users to pre-process input data more me-
deposits (Redford et al., 2004). However, none ticulously and to gain a better understanding of
of the previous studies involved the integration possible inconsistencies in the generated results.
of ANN into a geographic information system, One such challenge relates to the quality of the
whereby data processing and the presentation of predicted results with respect to the way the ANN
results could be streamlined to better facilitate learns: if the training data, which represent the
the storage management and display of generated known events, is only available as incidental points
information. The goal of this project was initi- on the raster layer instead of complete coverage,
ated to address this with the development of a the ANN may return an incomplete statistical
software product. This product can be applied by model of the phenomenon.
a GIS user within the standard GIS environment, The main principle of the ANN approach is
to facilitate the production of prediction maps built on the concept of how networks of biological
for planning purposes, following the investiga- neurons are organized and function by imitating the
tion of influential factors identified for a given properties of a real nervous system. These systems
assessment. are composed of many chemically interconnected
neurons (nerve cells), which receive and process
signals from other neurons. When these signals

767
Spatial Modeling of Natural Phenomena and Events with Artificial Neural Networks and GIS

reach a certain threshold, sets of nerve cells (closest The network topology describes the way the
to the originating signals) are activated and forward neurons are interconnected. The direction of sig-
the information to other connected neurons. Learn- nal propagation (forward or backward), the level
ing processes result from the constant adaptation and the type of connection (e.g., completely con-
of the interconnections amongst neurons, where nected or with shortcuts) are thus defined. Figure
the growth and strength of connections are directly 1 shows the scheme of a completely connected,
related to their frequency of use and degenerate feed-forward network with 4-3-1 topology. On
when rarely accessed. the right-hand side, a processing unit (neuron) is
The simulation of these biochemical processes sketched with its output function.
in an ANN is done by artificial neurons, known The weights increase or decrease the strength
as processing units. The connections are realized of the incoming signal and are adaptable during
by (directed) interconnection weights wij (e.g., the learning process. By repeating the input of
Backhaus et al., 2003; Kriesel, 2008). In general, the training pattern, the intensity of the connec-
an ANN may be described as an adaptive system, tions is modified, such that the deviation (or error)
which changes its structure based on the flow of between expected and actual outputs (i.e., between
external or internal signals through the network. modeled and real data) are minimized.
Artificial neurons are usually organized in There are different network paradigms, such
layers, comprised of: as the multi-layer perceptron (MLP), the radial
basis function networks (RBF), Hopfield networks
• An input layer that receives input values and self-organizing maps (SOM). The MLP
(i.e., independent variables and crucial paradigm is of particular interest for applications
controlling factors related to the event to that involve supervised learning. The information
be modeled); processing that occurs is characterized as “feed
• Hidden layer(s) that process the signals forward”. This network paradigm is an extension
and calculate the resultant value(s) by of earlier developments on the perceptron, which
transforming them with the application of incorporates higher requirements for the model-
an activating function; and ing of complex relationships. A MLP consists of
• An output layer, created from the depen- an input layer, at least one hidden layer and an
dent variable to be modeled. output layer (Figure 1).

Figure 1. General scheme of a feed forward network

768
Spatial Modeling of Natural Phenomena and Events with Artificial Neural Networks and GIS

MLP are usually trained with a “back propa- ent application“. The “application meta database”
gation” algorithm (e.g., Rumelhart et al., 1986; manages the project data, which includes calcula-
Rios, 2010). “Back propagation of error” is a tions, embedded data, parameterized models and
training algorithm, which adapts the weights (wij) scenarios. Metadata are stored in a Microsoft SQL
by considering the degree of error from the output Server database (Figure 2).
signal so that it may be minimized. The “project database” contains all of the
Another important network parameter is the geodata and feature data for the build-up of the
activation function, which is used to calculate the statistical models.
output status of the neuron. Differentiable and The Data and Model Explorer (a stand-alone
non-linear functions are required for this purpose. solution) provides support to the user as new
Generally, sigmoid functions are applied. projects are set up and assists in the systematic
parameterization of the models. The workflow is
fully documented. The additional ArcGIS exten-
INTEGRATION OF THE sion provides all of the necessary GIS function-
METHOD INTO GIS alities in a user-friendly interface to facilitate the
geodata with the ANN. Besides the availability
ANN has already been used for some time, as of various functions to pre-process geodata, this
an alternative to other multivariate, statistical extension also supports the automatic creation of
methods and has been proven to produce adequate output map layouts.
results. Although some ANN-based commercial The setup of a consistent and reproducible
products and class libraries are currently available, statistical model for geodata analysis consists of
they usually require the manual generation of input a number of procedures, which formed the basis
data in a proprietary format. Some products have for the design and development of this software.
import interfaces, for instance, Microsoft Excel This included multiple stages, which involve:
spreadsheet structures.
GIS users aiming to analyze numerous input 1. The exact formulation of the modeling objec-
data layers are faced with the organization and tive and the delineation of the study area;
assessment of a sizable number of individual 2. Describing and compiling base data, which
records. The identification of the spatial relation represent the potentially influential factors
of the records (i.e., data layers) enables the GIS with which predictions will be made in
to derive information at any defined point and, consideration of,
thus, to define training data for the ANN across 3. Technical data processing (from the sci-
all data layers for the same location. entific and it point of view) and statistical
Up to now, the use of ANN to analyze spatial pre-processing;
data has not been widespread in geosciences, due 4. The setup and calculation of model scenarios
to the lack of user-friendly software that runs for training, testing and applications;
together with a standard GIS environment. Based 5. Statistical post-processing and presentation
on this observation, a software application was of prediction results.
developed and used in this study which provides
users with tools to evaluate their spatial data with The Data and Model Explorer is the primary
ANN within the ESRI ArcGIS environment. user interface, where the main object types may
This software, called advangeo®, is based on a be defined for modeling, organized with their
client/server architecture and is comprised of two respective metadata sets, modified and visualized.
components, namely “data management” and “cli- The processing of geodata and the presentation of

769
Spatial Modeling of Natural Phenomena and Events with Artificial Neural Networks and GIS

Figure 2. System Architecture of advangeo ®

subsequent results are supported by a GIS exten- adaption, scale transformations or data partition-
sion with a special toolbar and a project tree view ing. The “processed source data” are transformed
option. The system architecture is illustrated in and saved as spatially compatible raster layers.
Figure 2. Taking the coverage extents of the project area
Project definition is the first step of the mod- into consideration, the data are transformed to
eling process, and appears at the top of the Data “model input data” layers. This process is required
Explorer hierarchy. The levels are displayed in a to ensure a spatially consistent set of base data.
tree view structure for easy navigation. By defin- Following the data processing procedures, a
ing a project, the user sets the technical boundary parameterized model is set up and used for the
of the intended task. The study, training and test calculation. Model input data, training and model
areas are delineated subsequently. parameters are then determined in a training sce-
The basis of every modeling task is founded nario (Figure 3). Currently, the MLP approach is
upon the ability to use various source data types. used in advangeo®, along with various adaptable
However, data are generally unusable without prior training algorithms and parameters. This proce-
processing involving the conversion to a standard dure was realized with the open source library
or recognizable format for a given application. FANN (Nissen, 2009).
The original data are therefore unlikely to be used The user may opt to retain the standard default
directly for calculations, but can be included in values at the beginning of the model parameter-
the project for documentation or reference pur- ization stage. The model calculation results in a
poses. Necessary processing procedures prior trained network within the defined training area.
to modeling can include the calculation of field The initial run returns a preliminary estimation
values, error identification and elimination, code of the model quality. The GIS extension provides

770
Spatial Modeling of Natural Phenomena and Events with Artificial Neural Networks and GIS

Figure 3. Training scenario with advangeo ®

further functions for the statistical validation of In both areas, precipitation events character-
model results. ized by long durations and/or high intensities lead
The trained network can then be applied to to the dislocation of soil material, especially in
other study areas. The calculations may also be agricultural fields. Topsoil, which contains the
displayed in a semi-automatically generated pre- greatest amount of nutrients, is most vulnerable
diction map within the GIS environment. to dislocation; seeds may also be carried away,
The special new and innovative aspects ad- thereby significantly reducing the potential yield
vangeo® introduces in ANN processing are the of the land. These types of adverse effects are
integrated GIS data management in conjunction known as “on-site damages”. “Offsite-damages”,
within the standard GIS environment of ESRI on the contrary, occur where soil is deposited or
ArcGIS and various tools to pre-process data, accumulated and can spoil water bodies and cause
tools to assess the model quality and a customized damages to human infrastructure.
map printing function (Figure 9). The prediction of the probabilities of erosion
event occurrences, along with a quantitative es-
timation of soil dislocation, forms an important
CASE STUDY 1: EROSION basis for the planning of preventative or mitiga-
PATTERN PREDICTION tive measures. Analytical equations to predict the
amount of dislocated soil have been derived and
The prediction of erosion caused by rainfall was used for many years (e.g., Wischmeier & Smith,
conducted to evaluate the quality of modeling 1978; de Vries et al., 1998). Other soil parameters
results on two different sites, the Limpopo region have already been predicted successfully with
in South Africa and the Weisseritz catchment in ANN and the multiplayer perceptron in particu-
the South of Eastern Germany. lar recently by Behrens et al. (2005), Alavi et al.

771
Spatial Modeling of Natural Phenomena and Events with Artificial Neural Networks and GIS

(2010), and, in case of soil contamination pattern, The combination of slope and flow accumula-
by Hertwig et al. (2010). tion explained most of the observed erosion pat-
The goal of this case study was to test the ac- terns. Other terrain attributes (derived from the
curacy of the prediction of rill and gully erosion DEM) did not enhance the results significantly
as a result of intense rainfall with ANN. whereas top soil composition and land use (arable
land, pastures, forest, urban areas) contributed to
Source Data and Pre-Processing enhanced results.
The second study site was located in the
The first project area was located in the Weis- northeastern region of South Africa. There exist
seritz catchment in Saxony, Germany. The main severe problems for agriculture and transport
controlling factors governing the soil erosion infrastructures due to the formation of deep ero-
prediction are well known for the study area, due sion gullies. Comparable data sources to the ones
to a number of existing algorithms developed for used in the first study site were also used in this
the parameterization of mathematical models. second analysis of causal relationships, especially
Factors include: amongst multiple datasets, namely geological data,
field data on existing gullies, remote sensing and
• Terrain attributes (slope and flow accumu- flow accumulation data (derived from the DEM).
lation; to a lesser extent aspect, profile and The land cover datasets were determined to be
plan curvature and flow length); unsuitable for modeling upon visual inspection
• Percent composition of top soil (i.e., soil as the raster was only available as a 30 m grid.
particle size – fine, medium, coarse); and The DEM was interpolated from a 90 m SRTM
• Land use (e.g., arable land, pastures, for- (Shuttle Radar Topography Mission) raster. Vari-
ests, urban areas). ous terrain attributes were calculated and tested.
For an overview of standard DEM derivations,
Data sources comprised of a digital elevation please see Wilson and Gallant (2000). The data
model (DEM) with 20 m raster size (DGM 25), sources, which were selected for the (definition
the soil concept map of the Free State of Saxony of the) final network, included a geological map,
(scale 1:25,000) and ATKIS land use data in the
same scale. These data layers were pre-processed
Figure 4. Training area, field validation
with selected GIS tools to create consistent raster
layers. In order to include the influence of linear
elements such as roads and hedgerows, which
function as barriers to erosional processes, an
“apparent enhancement” of the spatial resolution
was realized by resampling the data to a 5 m grid.
Metric data layers were then normalized to fall
between ranges of [0…1] or [-1…1]. Nominal
scales for layers containing data on the presences
of forests, roads and geological units were reclas-
sified as binary layers.
Training data were manually digitized from
scanned orthophotos, which were taken after the
severe rainfalls of August 2002. The accuracy
of the data was also partially verified during site
visits (Figure 4).

772
Spatial Modeling of Natural Phenomena and Events with Artificial Neural Networks and GIS

elevation, slope angle and flow accumulation. the network and is an important parameter for the
Figure 5 illustrates the types of input data used assessment of the modeling result. The trained net-
and the neural network calculation scheme. work was then applied on a test site and validated.
The network accurately identified approximately
Network Parameterization 90% of the test records at Weisseritz catchment.
The threshold for the decision “erosion/no ero-
The pre-processed data were used as input data sion” was set to p > 0.75 and was derived from
for the model calculation. The models for both statistical calculations of confirmed erosion sites.
areas were defined with the procedures described This result was confirmed in the field. Figure 6
previously and trained with existing influencing shows water logging after high precipitation as
factors. Training was completed in a part of the well as erosion channels at locations of dark cell
defined study area. For modeling, a MLP approach values (high erosion probability).
was used. It was trained with various learning For the Limpopo study site (Figure 8), visual
algorithms, activation functions and parameters. validation was conducted, based on satellite im-
Acceptable results were attained with the Resil- ages which clearly showed existing erosion gullies.
ient Back Propagation (RPROP) algorithm, in The satellite data were not included as a part of
combination with a sigmoid activation function. the training data. Relevant gullies were still iden-
tified at the correct location despite the rather low
Validation spatial resolution of 30 m.

The training was repeated several times with the Planning of Preventative Measures
same parameters. Comparable results (i.e., with and Presentation of Results
only slight deviations, acquired following a simi-
larly defined training process) were observed. This The trained network was also used to plan for
confirmed the reproducibility and robustness of the introduction and implementation of erosion

Figure 5. Neural network calculation scheme for erosion gully prediction

773
Spatial Modeling of Natural Phenomena and Events with Artificial Neural Networks and GIS

Figure 6. Visual field inspection of predicted results


the elevation contours. The subsequent network
run confirmed the successful mitigation of the
predicted erosion processes. With this approach,
prevention measures could be optimized (number
and location of barriers), thereby facilitating the
conservation of funds.
The integrated function for automatic map
generation helped to display the prediction results
in a user-friendly way. Figure 9 is an example of
a prediction map with the following content: ero-
sion channel probability (in color) on top of the
DEM, hillshade display (top left), and on top of
the aerial image (top right). The bottom line rep-
resents detailed maps on the same background
data (as indicated in the overview map). The
right-hand side diagrams explain the used factors
of influence and display the model calibration
error (training function). The overall erosion
probability is given in a color scale and values
prevention measures, especially at Weisseritz from 0 – 1.
study site. Additional barriers (landscape elements
such as hedgerows) were added in the model on
slopes that were especially susceptible to erosional CASE STUDY 2: PREDICTION OF
processes. This was accomplished by modifying MINERAL RESOURCES
the input layers. Figure 7 shows that the erosion
probability (dark cell values, dominated by the The ability of ANN to analyze complex, non-linear
factors flow accumulation and slope on arable combinations of multiple (raster) input data was
land) is significantly reduced by introducing three tested for the interpretation of mineral occurrences,
barriers to cut of the major flow lines parallel to in order to generate high quality prediction maps

Figure 7. Mitigation of erosion processes by landscape elements (barriers) on a southwest exposed slope

774
Spatial Modeling of Natural Phenomena and Events with Artificial Neural Networks and GIS

Figure 8. Visual comparison of erosion gullies on satellite imagery and with ANN predicted gully loca-
tion (image authorized by Council of Geoscience, South Africa)

of mineral resources. Such maps are crucial tools, to significant amounts of financial resources. The
especially for developing countries, where positive successful prediction of mineral occurrences plays
results may be used to attract investment interest a key role in the selection of further exploration
in their respective mineral sectors. The identifi- tasks and defines a basis for which investment
cation of exploration targets is one of the most decisions can be made in many countries.
ambitious tasks in geology, as it is instrumental The calibration of quantitative analytical
to the strategic development of companies and models generally involves substantial temporal re-
sometimes even countries and is generally tied sources and cost-intensive field or laboratory work.

Figure 9. Prediction Map for soil erosion probability (original in color, indicating high erosion prob-
ability in red lines/areas)

775
Spatial Modeling of Natural Phenomena and Events with Artificial Neural Networks and GIS

Currently, analytical models cannot adequately and 1:200,000, respectively. For modelling pur-
describe the complex interactions of geological, poses, a raster cell size of 50 m was estimated to
tectonic, geochemical, geo-mechanical and other be sufficient, resulting in approximately 4,400,000
factors in the timescale that is required to form cell values within the Kosovo administrative
mineral deposits. Only single aspects of it can boundaries.
be modeled with acceptable levels of precision The structural and litho logic relationships
when there are fixed boundary conditions (e.g., within the lead/zinc deposits were presented by
how ore minerals attach to geochemical barriers). Anković et al. (2003) and further explored in a
In this case study, input data layers were study by Legler et al. (2008). The main controlling
selected based on existing knowledge about in- factors for the localization of ore deposits included:
fluencing factors of resource genesis, which was
derived from geological maps. The study area of • The striking tectonic elements in the NNW
11.000 km² was located in Kosovo in southeastern – SSE direction;
Europe. Promising targets for in-depth exploration • The proximity of Cenozoic vulcanite
could be identified at this scale of investigation. rocks; and
In addition to the known deposits of lead, zinc, • The presence of highly reactive host rocks,
bauxite, chromium, nickel and coal, the results such as marble and calcareous schist.
also indicated significant potential for gold and
platinum metals. Selected influencing factors were derived from
the geological map and transformed to a 50 m grid
Source Data and Pre-Processing (see example of tectonic data preparation in Figure
10). In the cases involving the consideration of
Between 2003 and 2007, existing geo-scientific lineaments/faults and Cenozoic vulcanite rocks,
data were compiled, reviewed, completed and the distance from each cell to the nearest relevant
transformed into a consistent set of digital maps, structure was calculated, which functions as a
under the supervision of the “Independent Com- viable potential indicator of deposits.
mission for Mines and Minerals” (ICMM). All To include high-resolution aero-geophysical
necessary data layers for the prediction of mineral data into the assessment, the following base da-
resources (especially geology, tectonics, known tasets were identified to be crucial for the gen-
deposits, metallogeny, aero-geophysics) were eration of the prediction map:
made to be available, in the scales of 1:100,000

Figure 10. Pre-processing of tectonic data – calculation of distances to potentially relevant lineaments/
faults

776
Spatial Modeling of Natural Phenomena and Events with Artificial Neural Networks and GIS

• Known deposits and their occurrences, The training results were further assessed by
• Tectonic data (lineaments and faults), an iterative process that involved the consideration
• Aero-geophysical data (gamma spectrom- of a greater number of input layers. In this case,
etry of elements u, th and k, magnetics, and further investigation began with 3 layers (aero-
electromagnetics), and radiometric layers Uranium (U), Thorium (Th)
• Lithological data. and Potassium (K)) and increased to the use of 6
layers, which included volcanic, tectonic and
Network Parameterization electromagnetic data. For modeling, the MLP
approach was applied.
The training stage was completed for the test area
illustrated in Figure 11. For the cross-validation Validation
stage, a subset of the known deposits within the
test area was used for a second training scenario, Training was repeated several times with the same
while the remaining known deposit locations, parameters, which produced almost identical
which had been removed beforehand, were used results to previous training runs. These results
to validate the trained results. gave insight into how robust and reproducible
the network-based training runs were. The trained

Figure 11. Training area, used layers (aero-radiometric surveys of Uranium, Thorium, Potassium;
distance to volcanic rocks) and known deposits (dots)

777
Spatial Modeling of Natural Phenomena and Events with Artificial Neural Networks and GIS

network was then applied to the full extent of and the mass reproduction of the aforementioned
Kosovo, with the exception of the northern and beetle species. Various approaches and models
northwestern border where no geophysical data have been available for the analysis and predic-
were available. Some known deposits, identified tion of the spread of I. typographus, as described
on the prediction map with probabilities of less in Baier et al. (2007), Seidl et al. (2007), and
than 0.5, were not used for the training exercise. Netherer and Nopp-Mayr (2005).

Presentation of Results Source Data and Pre-Processing

The results engendered from the modeling stage The study site was located in Saxony, Germany.
were semi-automatically transformed to the GIS, Source data comprised of a digital elevation model
where a prediction map could be made. This kind (DEM) in 20 m resolution and ecological forest
of map can be used to display the modeled results site map at 1:10,000 with soil and soil water data,
in a manner that can be easily used by potential and forest stand data on species composition,
investors in the mineral sector or other interested stand age, and height. For the ANN training,
parties (Mining Journal, 2009). known locations of I. typographus infections
from the years 2003 and 2008 were available as
field data for the site of interest. The data layers
CASE STUDY 3: FOREST PEST were pre-processed based on the knowledge on
VULNERABILITY ANALYSIS influential factors (e.g., elevation and terrain at-
tributes such as slope, aspect, curvature and the
Mass reproduction of the European Spruce Bark topographic wetness index) and were subjected to
Beetle (Ips typographus) can lead to severe for- a scale transformation. Soil data were processed to
est dieback, since their larvae feed on the inner generate maps differentiating between 1) fine and
bark of trees at different life stages and in various coarse soil particle sizes and 2) wetness classes.
degrees of deterioration. Knowledge of control- A statistical evaluation of 16 site parameters
ling factors pertaining to the reproductive cycle and three forest stand parameters was conducted
of I. typographus is crucial for successful forest prior to the processing. Histograms were produced
management and for the planning of mitigation at the end of the evaluation, which helped to de-
measures. The beetles mainly affect Norway lineate data ranges and were used to estimate the
Spruce (Picea abies), and especially older ho- potential influence of aforementioned factors, with
mogenous stands. Significant influencing fac- the consideration of previous stand infections by
tors for the (mass) dispersion of this bark beetle I. typographus in 2003 and 2008.
include climate parameters such as temperature Highly correlated and potentially influential
sequences along with wind strength and direction, factors were identified to reduce the number of fac-
as well as site parameters such as the percentage tors for subsequent training stages. For example,
of spruce present, stand age and density. In addi- flow length and flow accumulation were found
tion, topographical attributes such as terrain and to be directly correlated (r=0.92). Therefore, they
soil conditions can influence the development of were not used simultaneously as input layers, in
I. typographus populations. The goal of this study order to avoid distortions or bias in the results.
was to test, if the ANN approach with advangeo® is Furthermore, a logarithmic function was applied to
suitable for the spatial prediction of how probable the flow accumulation, which was notably skewed
a specific site is susceptible to future infections to the left, to approximate a standard distribution

778
Spatial Modeling of Natural Phenomena and Events with Artificial Neural Networks and GIS

of the range of its values. Data normalization The correlation between predicted and known
to ranges [0..1] and [-1..1] was carried out for values at infected locations raised some questions
most layers. Ordinal and nominal scales were about the modelling quality. It was observed
transformed to binary grids. This is exemplified that 60-70% of the infected locations retained
in the extraction of soil type data from maps and an infection probability at 0.5. There were few
their conversion into single binary layers, where probabilities > 0.8 at all. However, by visual
values of 1 or 0 were assigned for the presence inspection, the general spatial pattern showed a
or the non-existence of certain attributes (e.g., more accurate prediction trend. Results for 2003
gley: yes/no or podsol: yes/no). This means of (a rather dry year) were significantly better than
reclassifying the data facilitated the evaluation of those for 2008.
their significance as potential factors of influence. The low rates of matches between the two
datasets were caused by the validity and ac-
Network Parameterization cessibility of data on the positional accuracy of
the training points (i.e., infected stands). This
The training algorithm was executed for the in turn, was primarily influenced by different
“Tharandter Wald” located in Saxony, as well scales at which data were collected and made to
as within parts of the study area. The model was be available for use. Additional factors that could
systematically set up, with various input layers significantly enhance the model quality include
representing factors of influence by using MLP information about previous infections and forest
coupled with a RPROP algorithm and a sigmoid management measures, more detailed forest stand
activation function. parameters, information about neighboring stands
and wetness. Another aspect was the absence of
Validation important climatic parameters such as duration
of insolation, wind direction and precipitation,
To validate the results, several parameters were which may explain the differences of the results
evaluated. First, the rate of matches was calculated obtained between 2003 and 2008.
(i.e., the percentage of predicted sites to known A common parameter set (in terms of potential
sites of infection not used for training). In addition, factors of influence) for the 2003 and 2008 data
the “prediction error sum of squares” (PRESS) turned out to be inefficient. The different envi-
was calculated. The determination of the quadratic ronmental conditions, particularly differences in
difference of the values (i.e., known infections and the climate, notably influenced the behavior of the
predicted results) at infected locations described beetle. It is hypothesized that the period of drought
the quality of the predictions. in 2003 extended the range of beetle dispersion
In addition, a fourfold cross-validation as- into previously healthy forest stands (i.e., stress
sessment was conducted for the model. For this caused by droughts led to higher susceptibility
purpose, the training data were separated into two to infections).
exclusionary subsets (training and test subsets) In the beginning of this study, a grid with an
and used as input data on a rotational basis, until overall resolution of 5 m was used. Later on, a
each element was a part of the test subset for at resolution of 20 m was determined to be sufficient,
least one time. The results showed that the model considering the quality of data that was available.
outputs were notably similar, which indicates that This change in grid size was reflected in the cal-
a successful learning process occurred during the culation time and the output quality.
network training stage.

779
Spatial Modeling of Natural Phenomena and Events with Artificial Neural Networks and GIS

Presentation of Results selected as the best (most representative) input


datasets:
In order to compare the model output, the same
classes were used to describe the probability of • Percentage of spruce (species composition
infection for each model run, where: of the forest stand)
• Forest stand age
• No susceptibility: 0 – 0.5 • Soil composition
• Low susceptibility: >= 0.5 – 0.6 ◦◦ Percentage of coarse soil
• Average susceptibility: >= 0.6 – 0.7 ◦◦ Percentage of silt percentage of sand
• High susceptibility: >= 0.7 – 0.8 • Appearance of gley soils
• Very high susceptibility: >= 0.8 (dark val- • Soil moisture levels (soil type)
ues on the color scale) • Slope angle
• Slope position
Figure 12 shows one the model output with the • Aspect
susceptibility (probability of infection) as a mean
value of 15 model runs. Dark values indicate the CONCLUSION
highest of 5 probability classes (>= 0.8) and al-
low a quick overview to target actions in case of The case studies demonstrated that the multilayer-
starting mass reproduction of the beetles. White perceptron approach (MLP) is suitable for the
areas represent other land use classes (villages, analysis and prediction of various applications
pastures, arable land, streams and forest roads. of spatially-dependent natural phenomenon,
Finally, a quantitative prediction was calcu- including the assessment of soil erosion, the
lated (in m³/infected location). Using the input identification of potential mineral occurrences
layers for 2008, the following parameters were and the spread of forest pests. The stepwise

Figure 12. Mean values of 15 model runs with 5 probability classes (from very high to no susceptibility
to infections by I. typographus)

780
Spatial Modeling of Natural Phenomena and Events with Artificial Neural Networks and GIS

generation of model scenarios with advangeo® Africa and the Staatsbetrieb Sachsenforst (State
supported the derivation of information while Agency for Forestry of Saxony) in Germany for
taking the sensitivity of controlling factors into their expressed authorization to present proprietary
consideration. However, one disadvantage to this project data and figures in this article. The research
approach is its inadequate ability to explain results project was supported by the Federal Ministry of
(i.e., comparable to a “black box” system), where Economics and Technology, Germany (project
the specific influences of single parameters cannot identification number IW072061).
be directly retraced.
All interrelations amongst factors of influence
and the final result were revealed through the REFERENCES
course of the training process. With this process,
“erroneous learning” cannot be excluded. As a Alavi, A. H., Gandomi, A. H., Mollahassani, A.,
result, it remains a challenge to define the bound- Heshmati, A. A., & Rashed, A. (2010). Modeling
aries of neural network applications. To evaluate of maximum dry density and optimum moisture
the quality of individual components of training content of stabilized soil using artificial neural
goals and ability to generalize the input events, networks. Journal of Plant Nutrition and Soil Sci-
“expert knowledge” and the means to critically ence, 173, 368–379. doi:10.1002/jpln.200800233
assess the outputs of the modelling process are Anković, S., Jelenković, R., & Vujić, S. (2003).
required, in addition to the generation of statisti- Mineral resources and potential prognosis of
cal figures on the proximity of predicted and metallic and nonmetallic mineral raw materials
measured values alone. in Serbia and Montenegro at the end of the 20th
Overall, the ANN defined within the advan- century. Belgrade, Serbia: Engineering Academy
geo® software seemed to be capable of predict- of Serbia and Montenegro.
ing the occurrence of real spatial phenomena.
Further potential fields of applications, which the Backhaus, K., Erichson, B., Plinke, W., & Weiber,
developers intend to test in near future, include R. (2003). Multivariate analysemethoden – eine
the assessment of slope stability, the prediction anwendungsorientierte Einführung. Berlin, Ger-
of market trends, and the evaluation of traffic many: Springer-Verlag.
accidents, in addition to the mapping of spatial
Baier, P., Pennerstorfer, J., & Schopf, A. (2007).
events in forestry, agriculture and mining.
PHENIPS – A comprehensive phenology model
The most significant advantage of the ANN is
of Ips typographus (L.) (Col., Scolytinae) as a tool
the relatively small amount of effort required for
for hazard rating of bark beetle infestation. For-
data pre-processing, model calibration and short
est Ecology and Management, 249(3), 171–186.
model run times. As such, this new tool enables
doi:10.1016/j.foreco.2007.05.020
standard GIS users to apply ANN approaches that
are fully integrated with their already familiar Behrens, T., Förster, H., Scholten, T., Steinrücken,
work environment. U., Spies, E., & Goldschmitt, M. (2005). Digital
soil mapping using artificial neural networks.
Journal of Plant Nutrition and Soil Science, 168,
ACKNOWLEDGMENT 21–33. doi:10.1002/jpln.200421414

The authors sincerely thank the Independent


Commission for Mines and Minerals (ICMM) in
Kosovo, the Council of Geoscience (CGS) in South

781
Spatial Modeling of Natural Phenomena and Events with Artificial Neural Networks and GIS

Fernandez-Steeger, T. M. (2002). Erkennung von Nissen, S. (2009). Fast artificial neural network
hangrutschungssystemen mit neuronalen netzen library (FANN). Retrieved from http://leenissen.
als grundlage für georisikoanalysen. Unpublished dk/fann/
doctoral dissertation, Universität Karlsruhe, Karl-
Penning de Vries, F. W. T., Agus, F., & Kerr, J.
sruhe, Germany.
(Eds.). (1998). Soil erosion at multiple scales:
Haykins, S. (1998). Neural networks: A compre- Principles and methods for assessing causes and
hensive foundation (2nd ed.). Upper Saddle River, impacts. Wallingford, UK: CABI Publishing.
NJ: Prentice Hall.
Peters, R., Schmitz, G., & Cullmann, J. (2006).
Hertwig, T., Müller, I., & Zeißler, K.-O. (2010). Flood routing modelling with artificial neural
Management of contaminated soils in urban networks. Advances in Geosciences, 131–136.
areas in the ore mountains (Germany). Paper doi:10.5194/adgeo-9-131-2006
presented at the 11th International Conference on
Redford, S., Lipton, G., & Ugalde, H. (2004). Pre-
Management of Soil, Groundwater and Sediment,
dictive ore deposit targeting using neural network
Salzburg, Austria.
analysis. Society of Exploration Geophysicists
Journal, M. (2009). Kosovo ready to roll. Mining Expanded Abstracts, 23, 1198.
Journal, 17(7), 14–15.
Rios, D. (2010). Neural networks: A requirement
Kriesel, D. (2009). A brief introduction to neural for intelligent systems. Retrieved from http://www.
networks. Retrieved from http://www.dkriesel. learnartificialneuralnetworks.com/backpropaga-
com tion.html
Legler, C., Knobloch, A., & Barth, A. (2008). Map Rumelhart, D. E., Hinton, G. E., & Williams, R.
of minerals – Metallogenic / minerogenic map J. (1986). Learning internal representations by
(1: 200,000) - Final report – Map description. error propagation. In D. E. Rumelhart & J. L. Mc-
Freiberg, Germany: Beak Consultants GmbH, Clelland (Eds.), Parallel distributed processing:
Prishtinë. Explorations in the microstructure of cognition,
volume 1: Foundations (pp. 318-362). Cambridge,
Lin, S.-W., Sun, C.-H., & Chen, C.-H. (2004).
MA: MIT Press.
Temporal data mining using genetic algorithm
and neural network: A case study of air pollutant Seidl, R., Baier, P., Rammer, W., Schopf, A.,
forecasts. Geospatial Information Science, 7(1), & Lexer, M. J. (2007). Modelling tree mortal-
31–38. doi:10.1007/BF02826674 ity by bark beetle infestation in Norway spruce
forests. Ecological Modelling, 206, 383–399.
Netherer, S., & Nopp-Mayr, U. (2005). Predis-
doi:10.1016/j.ecolmodel.2007.04.002
position assessment systems (PAS) as supportive
tools in forest management – Rating of site and Wilson, J. P., & Gallant, J. C. (Eds.). (2000). Ter-
stand-related hazards of bark beetle infestation rain analysis: Principles and applications. New
in the High Tatra Mountains as an example for York, NY: John Wiley & Sons.
system application and verification. Forest Ecol-
Wischmeier, W. H., & Smith, D. D. (1978). Pre-
ogy and Management, 207, 99–107. doi:10.1016/j.
dicting rainfall erosion losses – a guide to conser-
foreco.2004.10.020
vation planning. USDA Agriculture Handbook,
537, 1–58.

This work was previously published in the International Journal of Applied Geospatial Research (IJAGR), Volume 3, Issue 1,
edited by Donald Patrick Albert, pp. 1-20, copyright 2012 by IGI Publishing (an imprint of IGI Global).

782
783

Chapter 48
Stereo-Vision-Based Fire
Detection and Suppression
Robot for Buildings
Chao-Ching Ho
National Yunlin University of Science and Technology, Taiwan

ABSTRACT
A stereo-vision-based fire detection and suppression robot with an intelligent processing algorithm for
use in large spaces is proposed in this chapter. The successive processing steps of our real-time algorithm
use the motion segmentation algorithm to register the possible position of a fire flame in a video; the
real-time algorithm then analyzes the spectral, spatial, and motion orientation characteristics of the fire
flame regions from the image sequences of the video. The characterization of a fire flame was carried
out by using a heuristic method to determine the potential fire flame candidate region. The fire-fighting
robot uses stereo vision generated by means of two calibrated cameras to acquire images of the fire flame
and applies the continuously adaptive mean shift (CAMSHIFT) vision-tracking algorithm to provide
feedback on the real-time position of the fire flame with a high frame rate. Experimental results showed
that the stereo-vision-based mobile robot was able to successfully complete a fire-extinguishing task.

INTRODUCTION tional point-type detectors are installed on walls


or on a ceiling. The delays in the activation of fire
Fire incidents can cause loss of lives and damage detection sensors and sprinklers in large spaces
to property. Damage due to fire has always been a are a major problem. Hence, the monitoring ca-
major area of concern for museums, warehouses, pabilities of point-type sensor devices are limited
and residential buildings. Conventional fire de- to a certain distance, and they are ineffective for
tection sensors (e.g., ionization and photoelectric monitoring large areas. These devices are not suf-
detectors) and fire sprinkler systems monitor only ficiently flexible to detect fire incidents, and many
particular points in space. In most cases, conven- fire-detection sensors and sprinklers are required
to be installed very close to the monitoring areas.
DOI: 10.4018/978-1-4666-2038-4.ch048 Comparatively, the video camera is a volume

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Stereo-Vision-Based Fire Detection and Suppression Robot for Buildings

sensor, and potentially monitors a larger area 2005). Then, the image difference method based
and has a much higher probability of successful on chromatics is used to remove spurious fire-like
early detection of fire flames. Video surveillance regions such as objects with similar fire colors or
technology is suitable for early detection of fires areas reflected from fire flames. A fuzzy-based
due to its low detection delay, good resolution, dominant flame color lookup table is created, and
and high localization accuracy. Early detection of fire regions are automatically selected (Wang,
fires can certainly expedite fire-fighting efforts, 2006). However, either the solution does not
and consequently, fires can be extinguished be- consider the temporal variation of flames or the
fore they spread to other areas. To monitor large approach is too complicated to process in real time.
spaces, the use of a mobile fire-fighting robot is Fire suppression systems usually use water to
a more flexible alternative than installing a large extinguish fires due to its good ability to suppress
number of detectors and sprinklers. When a fire fire. Chen et al. (2004) developed a water-spraying-
is detected, the fire-fighting robot can move to the based fire suppression system. The fire searching
position of the fire flame and safely evacuate an method is realized based on computer vision theory
object from the fire area. Stereo vision systems using one CCD camera that is installed at the end
can provide the robot with precise depth informa- of a fire monitor chamber. However, it is necessary
tion about a target. Hence, the use of two cameras to calculate the changes in the space coordinates
instead of one increases the suppression efficiency of the fire with displacement and the pivot angle
and adaptability of the robot while detecting and of the CCD camera in the fire searching process.
evacuating a burning object. Ho (2009) proposes a fire-tracking scheme based
on CAMSHIFT. The CAMSHIFT algorithm is
applied to track the trajectory and compute the
BACKGROUND 2D positions of the specified moving fire-fighting
robot in real time with the aid of a vision system.
In recent times, research on the detection of fire Yuan (2010) adopted the computer vision tech-
flames using surveillance cameras with machine niques to extract color and motion characteristics
vision has gained momentum. The image process- for real-time fire detection. However, the system
ing approach involves the extraction of the fire was designed to move a water gun along a fixed
flame pixels from a background by using frame path using computer-based control. Hence, the
difference technologies. Healey et al. (1993) monitor ranges of the scene are limited and not
presented a fire detection algorithm using a color sufficiently flexible.
video input with a pre-partitioning scheme under
some restricted conditions, without rejecting the
similar fire-like alias. Phillips et al. (2002) and FIRE DETECTION AND
Celik et al. (2007) conducted studies on computer SUPPRESSION ROBOT
vision by using spectral analysis and the flicker-
ing property of fire flame pixels to recognize the Issues, Controversies, Problems
existence of fires at a scene. Hue and saturation
are adopted as feature vectors to extract the fire The basic problems associated with conventional
pixels from the visual images (Chen, 2003). Fire fire detection sensors are that they are not suffi-
flame features based on the HSI(hue, saturation, ciently reliable due to the time delay between the
intensity) color model are extracted, and regions start of the fire and nonzero input in the detection
with fire-like colors are roughly separated from sensor (Podržaj, 2008). A fire-fighting robot,
the image by the color separation method (Horng, equipped with vision-based fire detection technol-

784
Stereo-Vision-Based Fire Detection and Suppression Robot for Buildings

ogy, is capable of eliminating the conventional pattern matching, which accounts for the handling
sensor delay factor and providing fire detection of the disturbances mentioned above (Sumi, 1995;
coverage for a larger area with minimized cost. The Di Stefano, 2003). Due to the high cost of im-
overall objective of the mobile robot system is to age capturing equipment and image processing
develop an autonomous fire detection technology components, the need for substantial computing
that is capable of identifying fires at early stages, resources by visual servoing algorithms is not
alerting fire staff, and extinguishing the fire. often taken into consideration. Despite expected
To enable fire detection by the robot, the main improvements on the performance of visual servo
objective here is to develop a color vision-based with image processing and control components
fire detection system that is accurate and per- operating on separate processors, the additional
forms efficiently. The image processing approach cost involved with the extra components presents
involves the extraction of the fire flame pixels yet another problem.
from a background by using frame difference
technologies. To achieve the segmentation of fire
features, color processing is a better alternative SOLUTIONS AND
than gray-scale processing. Color processing RECOMMENDATIONS
can avoid the generation of false alarms due to
the variations in lighting conditions (e.g., natural Image Capturing
background illumination) better than gray-scale
processing. However, vision-based fire flame Image capturing and processing are the two major
detection still has great technical challenges. Fire challenges in the construction of a stereo visual
flames are non-rigid objects and do not exhibit tracking system. The main goals of these respec-
primitive image features and variability in density, tive processes are to provide visual feedback
lighting, etc. (Ho, 2009). and identify the objects one wants to track (in
For robot navigation and fire extinguishing, this case a burning target), while simultaneously
robots with machine vision have been attracting determining their 3D positions. In the proposed
substantial attention recently. A visual feedback stereo tracking system, the video-signal-capturing
mechanism commonly used in a robot system is process is conducted by stereo sensors, which are
the machine vision measuring system capable two low-cost CMOS cameras with pixel resolu-
of non-contacting measurement; this type of tions of 320 x 240. The captured synchronized
measurement is advantageous because the actual concurrent video frames are then transmitted
environment does not always allow contact with via USB to a PC, the image processor, and then
the surface at which the measurement is performed. buffered in the PC’s system memory. The frames
One common problem with the acquired images are eventually displayed in the capture windows
is the accompanying noises, namely, disturbances using the PC’s windows driver model (WDM)
caused by irrelevant objects in the background or functions. The windows driver model can reach
foreground or by substandard illumination. As a a high video frame rate, while the two CMOS
visual tracking system must be capable of record- cameras can also achieve a frame rate of 30 im-
ing image features in real time, another problem ages per second; therefore, real-time tracking can
with most computer-based vision systems is their be achieved.
limited performance due to inadequate comput-
ing power for motion tracking after processing
resources, including the memory, have been allo-
cated to the tasks of feature extraction and template

785
Stereo-Vision-Based Fire Detection and Suppression Robot for Buildings

Calibration of Image-Based System coordinate systems of the camera and the concrete
world, including a 3D translation vector t that
It might be expected that after the raw images are provides the translational components and a rota-
obtained, the next step would be to process them. tion matrix R that provides the rotational compo-
However, camera calibration and robot hand-eye nents. For stereo camera systems, it is also neces-
transformation are two fundamental steps that need sary to obtain the relative extrinsic parameters
to be taken into consideration before any reliable between the two cameras. The combination of a
image processing or even further visual servoing camera’s, or an array of cameras’, intrinsic and
can be performed. To obtain the 3D position of an extrinsic parameters presents the full set of data
object portrayed in 2D images, the relationship needed to locate, in the outside world, the cor-
between the coordinates of the points in a 3D responding position of a point in images taken by
frame, in this paper the tank, and the coordinates the camera(s). M.-C. Villa-Uriol (2004) and
of their corresponding points projected onto the 2D Hutchinson (2006) provided a complete review
imagery plains must be found. The first step toward on the fundamentals and techniques of camera
this goal is recognizing the basic characteristics calibration.
of the cameras used to retrieve the images. These In this work, static pattern calibration is ap-
characteristics, generally grouped into intrinsic plied to attain the cameras’ two sets of parameters
and extrinsic parameters, are shown in Figure 1. and empower automatic edge detection. The
Camera calibration can be used to obtain the key to this calibration process is the use of a flat
intrinsic and extrinsic parameters of the camera. checkerboard with a known geometric pattern.
The intrinsic parameters, which are independent The dimensions of each square on the board are
of a camera’s position in the physical environment, predetermined. Points on the board model plane
describe the camera’s focal length (fx, fy), princi- and their projections in digital images taken by the
pal point (Cx, Cy), and distortion coefficients. On to-be-calibrated cameras are passed as parameters
the other hand, the extrinsic parameters offer to the intrinsic calibration routine.
information on the transformation between the

Figure 1. The robot is calibrated to recognize the intrinsic and extrinsic parameters of the camera

786
Stereo-Vision-Based Fire Detection and Suppression Robot for Buildings

The extrinsic calibration methods used here Bobick, 2001; Bradski, 2002). This moving history
also employ digital checker-patterned board im- representation can be used to determine the cur-
ages. Assume that for every point M in the physical rent movement of the object and to segment and
world reference frame we have m = RM + t, where measure the motions induced by the object (e.g.,
m is the 3D coordinates of the point’s counterpart fire flame) in a video scene. MHI representations
in the camera reference frame, while R and t are have the following advantages: a range of times
the rotational and translational matrices, respec- from frame to frame to several seconds may be
tively, for the coordinate transformation between encoded in a single image, direct recognition of
the physical world and camera reference frames. the motion itself is possible, motion recognition
The extrinsic parameters of the to-be-calibrated is not computationally taxing and real-time imple-
cameras can be calculated linearly with the help mentation is possible, and the motion within the
of one array that stores the coordinates of chosen detecting scene can be monitored. An MHI is used
reference points in the digital checkerboard images to represent how the fire flame is moving, since
and another array that contains the correspond- the outward boundaries of the fire flame are less
ing points in the physical world. The collection prone to misdetection than the source regions of
of reference points is selected on the checker- fire flame. In an MHI, the pixel intensity is a func-
patterned model plane for the two cameras used tion of the motion history at that location; in the
in this chapter. MHI, brighter values correspond to a more recent
motion. It should be noted that the final motion
Vision-Based Fire Detection locations appear brighter in the MHI.

The real-time fire-fighting robot is guided by a Correlation of Spectral Characteristics


vision-based CAMSHIFT tracking surveillance
system and is equipped with a water gun to ex- The first step in detecting possible fire flame pixel
tinguish the burning targets. The vision-based fire candidates is to transform the color space into HSI
flame detection algorithm consists of five steps: color space and then carry out analysis. The HSI
(1) moving pixels or regions in the current frame color system projects the standard red-green-blue
of a video are determined with the motion history (RGB color model) color space along its principle
image (MHI); (2) the HSI colors of moving pixels diagonal in terms of white to black shades to avoid
are checked; (3) if the histogram of moving pixels the influence of lighting changes (Castleman,
is correlated with the fire flame color histogram, 1996). Hue is the dominant color (red, green, and
then the disordered measurement and temporal blue) of an area, and saturation is the colorfulness
analysis are performed to determine if fire flame of an area in proportion to its brightness. Intensity
colored pixels flicker or not; and (4) Back-Pro- is related to the color luminance, e.g., human skin
jection and CAMSHIFT are applied to track the occupies a small portion of the H and S spaces. The
fire flame region; (5) the distance between the advantages of the HSI space are the intuitiveness
robot and the burning target is calculated through of the components and the explicit discrimination
binocular stereo. between luminance and chrominance. The hue,
saturation, and intensity components of the HSI
Moving Motion Segmentation with model are normalized into the following ranges:
Motion History Image (MHI) 0° ≤ hue ≤ 360°, 0 ≤ saturation ≤ 255, and 0 ≤
intensity ≤ 255. The computed fire flame spectral
The MHI is a scalar-valued image where intensity histogram correlation coefficient is measured by
is a function of the recency of motion (Davis, 1999; the compare correlation analysis. The template

787
Stereo-Vision-Based Fire Detection and Suppression Robot for Buildings

of fire flame spectral histogram, which is based candidate regions are checked to see whether they
on empirical analysis results for the fire flames continuously appear and disappear over time. The
with colors from red to yellow, was created to level crossing rate LCR is utilized for validating
detect the flame-colored pixels (Horng, 2005). these extracted fire flickering regions. Temporal
Hue is an attribute of the pure color of the image variation for each pixel is computed by finding
scene, and it was demonstrated that it can be used the level crossing rate of the most likely fire flame
in assessing the prospect of numerical labelling candidate region above the heuristic threshold
of the flame colors (Huang, 2008). Hence, the value among consecutive frames. The heuristic
detection of flame pixels is carried out using the threshold is determined based on the fire flame
hue channel histogram correlation analysis with models in recorded video sequences.
the fire flame template, which maps the hue value
of general flames to be distributed from 0° to 60°. Back-Projection

Chaotic Spatial Structure Analysis If a visual servoing system is applied to natural


backgrounds, color data usually provides more
The moving object regions with disordered ratios reliable and flexible information than mono-
of perimeter to area for the extracted fire flame chrome data (Kim, 1996; LeGrand, 1996). The
region Ω are defined as: P/A, where P represents CAMSHIFT tracking engine is based on the his-
the perimeter of the region and A represents the togram projection algorithm (Swain, 1990), which
area of the region. As the complexity of a shape is a useful technique for color object recognition,
increases (i.e., the perimeter increases with re- especially for object identification in complex
spect to the area), the value associated with the background surroundings. Histogram back-pro-
disordered ratio Ω increases. The chaotic and jection is a primitive operation that finds and
turbulent nature of a region can be detected by identifies the association between pixel values in
relating the extracted spatial features to the fire a grabbed image and the values in a particular
flame likelihood region and the smoke likelihood histogram bin. Histogram and back-projection
region (Chen, 2004). The likelihood that a flame- performed on any consecutive frame would gen-
like region is a flame region is highly correlated erate a probability image on which the value of
with the parameter Ω. each pixel represents the probability of the exact
same pixel from the input belonging to the target
Temporal Analysis histogram that was used. Given that m histogram
bins are used, we can define n image pixel loca-
It is not always sufficient to detect fire flame tions. Thus, we have histograms {ŷu}, u=1,…, m
correctly based on color information. There are and pixel locations {xi}, i=1,…, n. Let us also
many objects, with similar color properties as the define a function c:R2→{1,…, m} that associates
fire flame spectrum. The key to distinguishing a pixel at location xi* with a histogram bin index
between the flame and flame-colored objects is the c(xi*). Then, the histograms can be computed with
nature of their motion. The flames in a fire dance n

around, so any particular pixel will only see fire  ( )


the equation ŷu = ∑ δ c x i∗ − u  . In all cases,

i =1
for a fraction of the time. This kind of temporal the values in the histogram bins are rescaled to
periodicity is commonly known as flickering. The fit within the discrete pixel range of the possible
flicker of fire flame causes the spectral values in the output 2D probability distribution image with the
fire flame region to fluctuate in time. The flicker function
in fire is also used as additional information. The

788
Stereo-Vision-Based Fire Detection and Suppression Robot for Buildings

   derived from color histograms. The center and


  UPPER 
pu = min  yˆu ,UPPER . size of the targeted object region is computed
  max (yˆ) 

 u =1...m and used as settings for the search window on the
following frame of the video sequence. Figure
That is, the values in the histogram bins, which 2 shows the images of the tracked object in the
originally lie in the range [0, max(ŷu)], now lie in digital pictures that have been recognized and
the new range [0, UPPER]. In the end, the input processed, and the green bounding ellipse presents
pixels with the highest probability of being in the fire flame region tracked by the CAMSHIFT
the sample histogram will be mapped onto a 2D algorithm. The calculation of the color probability
histogram back-projection image with the highest distribution is not performed on the entire image,
visible intensities. but only on limited regions surrounding the cur-
rent CAMSHIFT window, which includes images
CAMSHIFT Tracking of the specified object that are transformed into
a discrete probability image. This tends to result
The CAMSHIFT algorithm is a non-parametric in a large reduction in the computational costs.
technique that can track a specified target’s 2D The CAMSHIFT algorithm can be summarized
position efficiently across a series of images. When by the following steps:
tracking the 2D position of a colored object, in our
case a fire flame colored region, the CAMSHIFT • Step 1: A region of interest (ROI) window
operates on a color probability distribution image is selected to be the sample image for fu-

Figure 2. The image at the 169th frame is detected as a flame by searching the candidate regions, which
are above the LCR threshold. The CAMSHIFT tracking algorithm is employed to track the movement
of flame pixels, and the flame region is bounded by the green ellipse.

789
Stereo-Vision-Based Fire Detection and Suppression Robot for Buildings

ture color probability distribution compu- the two images obtained by each of the two stereo
tation. In tracking procedures, this window cameras—to create a set of stereo pairs. Then, for
is placed over the targeted object. every pair, each point is envisioned as a ray. The
• Step 2: A mean shift search window is ini- two rays would intersect in the actual 3D world.
tially centered at the first frame’s data point Now suppose the 3D coordinates of the two points
position. in a stereo pair are P1 and P2; the rotation matrices
• Step 3: The color distribution of the region and translation vectors for the transformation
centered at the mean shift search window between the two camera coordinate systems and
is calculated, producing a discrete prob- the stereo coordinate system are R1, R2 and t1, t2.
ability image. The mean location (the cen- Therefore, given a point M in the stereo coordinate
troid) of the discrete probability image can system, we have P1 = R1M + t1 and P2 = R2M + t2.
be found within the search window by first Finally, the point where the two rays intersect is
obtaining moment values. Given that I(x, found to be the point M that produces the P1 and
y) is the pixel value function for the inten- P2 closest to the rays (represented by straight lines)
sity of the discrete probability image at in their respective camera coordinate systems.
point (x, y) in the search window, one can This approach transforms the coordinate systems
compute the zeroth moment for that point. of the two cameras into absolute coordinates. In
• Step 4: The mean shift algorithm (step 3) this work, the real-time 3D depths of the burning
is iterated, replacing (x, y) with the corre- target, tracked with the presently discussed 3D
sponding (xc, yc), until the centroid of the point measuring method involving the intersecting
search window region’s generated proba- rays, are rendered using Open Graphics Library
bility image converges to a constant point. (Wright, 2000). The specified corner positions of
This point should be at the center of the the chessboard are processed and analyzed with
tracked target. The zeroth moment (distri- the calibrated intrinsic and extrinsic parameters
bution area) and the mean location (the and the absolute coordinates are calculated via
centroid) are stored. binocular stereo.
• Step 5: The size of the search windows
is set as a function of the zeroth moment a. LCR probability distribution image of flame.
found in Step 4 to match the size of the b. The moving flame pixels are separated.
tracked object, and the center of the search c. The CAMSHIFT tracking algorithm is
window is placed on the following frame at employed.
the mean location found in Step 4. The pro-
cess is then repeated, beginning at Step 3.
STEREO-VISION-BASED FIRE
Depth Calculated through EXTINGUISHING MOBILE ROBOT
Binocular Stereo
The real-time stereo-vision-based wheeled mobile
The purpose of using two cameras instead of one robot (WMR) is composed of a stereo-vision-
to track a burning target provides data on the based CAMSHIFT tracking algorithm, two CMOS
burning target’s position in a third dimension. cameras (with stereo vision) for capturing images,
The calculation of depth via binocular stereo is and a water gun for extinguishing the burning tar-
a common way to extend one’s knowledge of a get. To equip a mobile robot with 3-dimensional
scene from 2D to 3D (Jain, 1995). First, feature range detection capability, the calibration of the
points are grouped into teams of two—one each in binocular vision system should be carried out.

790
Stereo-Vision-Based Fire Detection and Suppression Robot for Buildings

The inside geometry information of and spatial five fuzzy regions: left far (LF), left near (LN),
relationship between the two cameras used can be zero (ZE), right near (RN), and right far (RF). For
acquired by calibrating their intrinsic and extrinsic simplicity, these five standard triangular member-
parameters. The binocular image processing sys- ship functions for the fuzzy region variables {LF,
tem first finds the same feature points of objects, LN, ZE, RN, and RF} are used. The value of Td
then compares them to see if they are the same consists of three fuzzy regions: near (N), middle
based on the epipolar geometry constraint. The (M), and far (F), which represent the target’s depth
next step is to calculate the 3-dimensional coordi- distance relative to the mobile robot.
nate of the object using the triangular perspective The rotating signal θr at the fuzzy controller
theory, and to compute the distance between the output consists of five singletons: left large (LL),
robot and the object. left small (LS), zero (ZE), right small (RS), and
A fuzzy controller is used to control the robot’s right large (RL). With the values of the four
rotating direction to track the target on the front singletons in hand, the controller output u can
side. The robot follows the commands of the then be calculated using the defuzzifier formula.
fuzzy reasoning module to manipulate the mobile The fuzzy rules are listed in Table 1, which also
rotation. As soon as the tracked target is followed represent the fuzzy associative matrix. The lines
without colliding with obstacles, the fuzzy reason- and columns correspond to the target’s relative
ing visual system orders the mobile robot to cease depth distances and horizontal offset position
rotating. In this work, the two main input variables values, respectively (inputs to the fuzzy reasoning
for the fuzzy controller are the sum of the target’s system), while the values of the matrix correspond
horizontal offset position from the stereo image to a robot-rotating signal (output of the fuzzy
pairs Xs = Xl + Xr, as shown in Figure 3, and the reasoning system). Overall, the fuzzy reasoning
followed target’s depth position relative to the system is governed by the min/max inference
robot Td. The values of Xs and Td are rescaled to technique and the center of gravity for the de-
fit the range [–1, 1]. The value of Xs consists of fuzzification step. The two cameras of the proposed

Figure 3. Two main input variables for the fuzzy controller are Xs (Xl + Xr) and Td

791
Stereo-Vision-Based Fire Detection and Suppression Robot for Buildings

Table 1. Fuzzy associative matrix for mobile


the mobile robot for the best effective view of the
robot steering
targeted object. As shown in Figure 4, the robot
Td follows the commands of the fuzzy reasoning
θr
N M F
module to track the burning target with the fire
LF LS LS LL
detection function enabled and hence, the robot
LN
rotates to approach the fire flame on the front
LS LS LS
side. Demo video clips of the experiment are
Xs ZE ZE ZE ZE
available on the Web site (http://www.youtube.
RN RS RS RS
com/watch?v=dgTSsX2ezMU and http://www.
RF RS RS RL
youtube.com/watch?v=JkgC-Q7ogE8).

a. Frame 100, first camera


visual servoing system track the target indepen-
b. Frame 100, second camera
dently of each other with their CAMSHIFT track-
ing engine.
FUTURE RESEARCH DIRECTIONS
EXPERIMENTAL RESULTS Designing a visual tracking system to avoid the
in-path obstacles is a complex task because a
The main purpose of this work is to design a
large amount of video data must be transmitted
wheeled mobile robot with stereo machine vision
and processed in real time. The main task for the
to extinguish a burning target. The stereo visual
target tracking without hitting the obstacles is
system is used to analyze the images to get the
obstacle detection, which is essential for a safe
3D coordinate position of the burning target on
autonomous mobile robot. Detecting obstacles
the ground, mark the region and its gravity center,
requires an active perception of the surround-
and then the mobile robot is driven to extinguish
ings. Laser scanners have the great advantage
the burning object with a water gun. The position
of providing accurate depth information that has
and orientation of the stereo cameras with respect
to be computed from calibrated stereo images if
to the real world’s coordinate system could be
cameras are used for the same task (Ho, 2009). The
obtained through extrinsic calibration routines.
real-time object tracking and collision avoidance
Both cameras are mounted at the front part of

Figure 4. Position deviations between the robot and the fire and viewpoint from the robot’s first and
second camera after applying the visual servoing rules

792
Stereo-Vision-Based Fire Detection and Suppression Robot for Buildings

method for mobile robot navigation in indoor and CONCLUSION


outdoor environments using stereo vision fused
with laser sensors is an emerging trend. This chapter proposes a stereo-vision-based
Multi-sensor fusion is necessary to cut down wheeled mobile fire detecting and fighting system
on the number of false alarms, since it can reduce that has been successfully implemented and shown
the effects of errors in measurements. A variety to work in non-ideal real-world residential build-
of multi-sensors were fused together in sensor ings. Spectral, spatial, and temporal motion fea-
packages and evaluated based on a set of cost tures and a heuristic-based classifier are employed
and performance criteria. Vision-based real-time to extract real fire flame data and are adopted for
detection for early fire flame detection can be fused helping the validation of that fire flame. Stereo
with the multiple sensors in order to have a more vision tracking can be achieved by applying the
robust video-based fire detection system. The CAMSHIFT algorithm and using two low-cost,
development of a more sophisticated algorithm, calibrated USB cameras, which enable high-speed
versus the simple threshold rule, for multi-sensor image capturing. A computationally efficient and
detectors is currently under investigation. robust implementation of the visual measurement
In addition, multiple fires may break out si- and servo mobile robot can be used to obtain reli-
multaneously, and hence the conducting research able real-time online 3D positioning of a particular
on tracking multiple fire regions concurrently and burning object. Moreover, the fire-fighting system
extinguishing dynamically by the behavior of the provides more safety in fire fighting and is very
flame is great technical challenges. economical when incorporated with other fire
Significant progress has been made in visual alarm systems for use in large spaces. Intelligent
servoing during the last few years. Several robust and automatic control of the fire-fighting robot
tracking algorithms have been developed, which improves its detection efficiency and suppres-
can track objects in real time in simple scenarios. sion adaptability. Experimental results show that
The proposed framework can find further applica- real-time fire flame detection and suppression is
tions in versatile fields like automated surveillance, achieved even under non-ideal lighting conditions.
human computer interaction, video retrieval, traf-
fic monitoring, and vehicle navigation. Further,
motion estimation is a very active area of research ACKNOWLEDGMENT
in which new solutions are continuously being
developed. One challenge in tracking is to develop This research was partially supported by the
robust algorithms suitable for tracking objects via National Science Council in Taiwan under Grant
hardware logic IP (system-on-chip technology). NSC 98-2218-E-224-003, NSC 100-2221-E-224-
The combination of different sensors employed in 016 and Industrial Technology Research Institute.
visual servoing (e.g., audio and force sensors) is Special thanks are given to anonymous reviewers
also a new direction for further development. Such for their helpful suggestions.
hybrid sensor-based servoing provides additional
information that can be used in conjunction with a
video-based tracker to solve problems like severe
occlusion or estimating tracking more robustly.

793
Stereo-Vision-Based Fire Detection and Suppression Robot for Buildings

REFERENCES Ho, C.-C. (2009, 16-19, August). Machine vision


based 3D scanning system. Paper presented at the
Bobick, A., & Davis, J. W. (2001). The represen- International Conference on Electronic Measure-
tation and recognition of action using temporal ment & Instruments, Beijing.
templates. IEEE Transactions on Pattern Analy-
sis and Machine Intelligence, 23(3), 257–267. Ho, C.-C. (2009). Machine vision-based real-time
doi:10.1109/34.910878 early flame and smoke detection. Measurement Sci-
ence and Technology, 20(4), 045502(045513pp).
Bradski, G. R., & Davis, J. W. (2002). Motion seg-
mentation and pose recognition with motion his- Huang, H. W., & Zhang, Y. (2008). Flame colour
tory gradients. Machine Vision and Applications, characterization in the visible and infrared spec-
13(3), 174–184. doi:10.1007/s001380100064 trum using a digital camera and image process-
ing. Measurement Science & Technology, 19(8),
Castleman, K. R. (1996). Digital image process- 085406. doi:10.1088/0957-0233/19/8/085406
ing. Upper Saddle River, NJ: Prentice Hall Press.
Hutchinson, T. C., Kuester, F., Doerr, K. U., &
Celik, T., Özkaramanlı, H., & Demirel, H. (2007). Lim, D. (2006). Optimal hardware and software
Fire and smoke detection without sensors: Image design of an image-based system for capturing
processing based approach. Proceedings of 15th dynamic movements. IEEE Transactions on In-
European Signal Processing Conference (pp. strumentation and Measurement, 55(1), 164–175.
1794-1798). doi:10.1109/TIM.2005.860872
Chen, T., Yuan, H., Su, G., & Fan, W. (2004). An Jain, R., Kasturi, R., & Schunck, B. G. (1995).
automatic fire searching and suppression system Machine vision. McGraw-Hill.
for large spaces. Fire Safety Journal, 39(4),
297–307. doi:10.1016/j.firesaf.2003.11.007 Kim, K. I., Oh, S. Y., Kim, S. W., Jeong, H., Lee,
C. N., Kim, B. S., et al. (1996). An autonomous
Chen, T. H., Kao, C. L., & Chang, S. M. (2003). land vehicle PRV III. Paper presented at the IEEE
An intelligent real-time fire-detection method Intelligent Vehicles Symposium.
based on video processing. Paper presented at the
IEEE 37th Annual 2003 International Conference LeGrand, R., & Luo, R. C. (1996, 22-28 April).
on Carnahan. Position estimation of selected targets. Paper pre-
sented at the International Conference on Robotics
Chen, T. H., Wu, P. H., & Chiou, Y. C. (2004). and Automation, Minneapolis, MN
An early fire-detection method based on image
processing. Paper presented at the International Podr aj, P., & Hashimoto, H. (2008). Intelligent
Conference on Image Processing. space as a framework for fire detection and evacu-
ation. Fire Technology, 44(1), 65-76.
Davis, J. (1999). Recognizing movement using
motion histograms. Sumi, K., Hashimoto, M., & Okuda, H. (1995).
Three-level broad-edge matching based real-time
Di Stefano, L., Mattoccia, S., & Mola, M. (2003). robot vision. Paper presented at the IEEE Interna-
An efficient algorithm for exhaustive template tional Conference on Robotics and Automation.
matching based on normalized cross correlation.
Paper presented at the 12th International Confer- Swain, M. J., & Ballard, D. H. (1990). Indexing
ence on Image Analysis and Processing. via color histograms. Paper presented at the Third
International Conference on Computer Vision.

794
Stereo-Vision-Based Fire Detection and Suppression Robot for Buildings

Villa-Uriol, M. C., Chaudhary, G., Kuester, F., Davis, J. W., & Bobick, A. (1997, June). The
Hutchinson, T., & Bagherzadeh, N. (2004). Ex- representation and recognition of action using
tracting 3D from 2D: Selection basis for camera temporal templates. Paper presented at the Pro-
calibration. Paper presented at the 7th Interna- ceedings of IEEE Conference on Computer Vision
tional Conference on Computer Graphics and and Pattern Recognition, Puerto Rico.
Imaging (CGIM).
Fang, J., Jie, J., Hong-Yong, Y., & Yong-Ming, Z.
Wang, S. J., Tsai, M. T., Ho, Y. K., & Chiang, C. (2006). Early fire smoke movements and detec-
C. (2006). Video-based early flame detection for tion in high large volume spaces. Building and
vessels by using the fuzzy color clustering algo- Environment, 41(11), 1482–1493. doi:10.1016/j.
rithm. Proceedings of International Computer buildenv.2005.05.036
Symposium, 3, 1179–1184.
Ho, C.-C. (2010). Image processing algorithm for
Wright, R. S. Jr, & Sweet, M. R. (2000). OpenGL tracking and picking an object for an open plat-
superBible (2nd ed.). Waite Group Press. form mobile robot. Key Engineering Materials,
419, 569–572. doi:10.4028/www.scientific.net/
Yuan, F. (2010). An integrated fire detection and
KEM.419-420.569
suppression system based on widely available vid-
eo surveillance. Machine Vision and Applications, Ho, C.-C. (2010). A stereo vision based target
21(6), 941–948. doi:10.1007/s00138-010-0276-x tracking and obstacle avoidance. Key Engineer-
ing Materials, 419, 565–568. doi:10.4028/www.
scientific.net/KEM.419-420.565
ADDITIONAL READING
Ho, C.-C., Chen, M.-C., & Lien, C.-H. (2011).
The references to materials directly cited in the Machine vision-based intelligent fire fighting
chapter have been included in the reference sec- robot. Key Engineering Materials, 450, 312–315.
tion. However, there are many additional readings doi:10.4028/www.scientific.net/KEM.450.312
relevant to stereo vision, some of which are listed Ho, C.-C., & Kuo, T.-H. (2009). Real-time video-
in this section. based fire smoke detection system. Paper presented
Bradski, G. R. (1998). Computer vision face at the IEEE/ASME Conference on Advanced
tracking for use in a perceptual user interface. Intelligent Mechatronics.
Intel Technology Journal, 2(2), 12–21. Ho, C.-C., & Shih, C.-L. (2007). Real-time track-
Bradski, G. R., & Davis, J. W. (2002). Motion seg- ing and stereo vision-based control of a goldfish-
mentation and pose recognition with motion his- catching system. Paper presented at the Tthe 35th
tory gradients. Machine Vision and Applications, International MATADOR Conference.
13(3), 174–184. doi:10.1007/s001380100064 Ho, C.-C., & Shih, C.-L. (2008). Machine vision
Chen, T. H., Yin, Y. H., Huang, S. F., & Ye, Y. T. based tracking control of a ball-beam system.
(2006, Dec. 2006). The smoke detection for early Key Engineering Materials, 381-382, 301–304.
fire-alarming system base on video processing. doi:10.4028/www.scientific.net/KEM.381-
Paper presented at the International Conference 382.301
on Intelligent Information Hiding and Multimedia Ho, C.-C., & Shih, C.-L. (2008). A real-time visual
Signal Processing. servo architecture for a golden fish catching sys-
tem. Measurement and Control, 41(5), 151–154.

795
Stereo-Vision-Based Fire Detection and Suppression Robot for Buildings

Ho, C.-C., & Shih, C.-L. (2009). A real-time Schroeder, D. (2004). Evaluation of three wildfire
fuzzy reasoning based control system for catch- smoke detection systems. [FERIC]. Advantage-
ing a moving goldfish. International Journal of Forest Engineering Research Institute of Canada,
Control. Automation and Systems, 7(5), 755–763. 5(24), 8.
doi:10.1007/s12555-009-0508-x
Toreyin, B. U., Dedeoglu, Y., & Cetin, A. E.
Horng, W.-b., Peng, J.-w., & Chen, C.-y. (2005, (2005). Flame detection in video using hidden
March 19-22). A new image-based real-time markov models. Paper presented at the Interna-
flame detection method using color analysis. tional Conference on Image Processing.
Paper presented at the International Conference
Toreyin, B. U., Dedeoglu, Y., & Cetin, A. E. (2005).
on Networking, Sensing and Control, Tucson,
Wavelet based real-time smoke detection in video.
Arizona, USA.
Paper presented at the The 13th European Signal
Kang, K. (2007). A smoke model and its applica- Processing Conference, Antalya, Turkey.
tion for smoke management in an underground
Toreyin, B. U., Dedeoglu, Y., & Cetin, A. E. (2006).
mass transit station. Fire Safety Journal, 42(3),
Contour based smoke detection in video using
218–231. doi:10.1016/j.firesaf.2006.10.003
wavelets. Paper presented at the The 14th European
Ko, B. C., Cheong, K. H., & Nam, J. Y. (2008). Signal Processing Conference, Florance, Italy.
Fire detection based on vision sensor and support
Toreyin, B. U., Dedeogˇlu, Y., Gudukbay, U., &
vector machines. Fire Safety Journal.
Cetin, A. E. (2006). Computer vision based method
Kopilovic, I., Vagvolgyi, B., & Sziranyi, T. (2000, for real-time fire and flame detection. Pattern
09/03 - 09/07). Application of panoramic annular Recognition Letters, 27(1), 49–58. doi:10.1016/j.
lens for motion analysis tasks: surveillance and patrec.2005.06.015
smoke detection. Paper presented at the 15th In-
Wieser, D., & Brupbacher, T. (2001, March 25-28).
ternational Conference on Pattern Recognition,
Smoke detection in tunnels using video images. Pa-
Barcelona, Spain.
per presented at the 12th international conference
Lai, C. L., Yang, J. C., & Chen, Y. H. (2007, May). on automatic fire detection, Gaithersburg, USA.
A real time video processing based surveillance
Xiong, Z., Caballero, R., Wang, H., Finn, A. M.,
system for early fire and flood detection. Paper
Lelic, M. A., & Peng, P.-Y. (2007). Video-based
presented at the The 24th IEEE Instrumentation
smoke detection: possibilities, techniques, and
and Measurement Technology Conference.
challenges. Paper presented at the Fire Suppres-
Liu, C.-B., & Ahuja, N. (2004). Vision based fire sion and Detection Research and Applications,
detection. Paper presented at the The 17th Interna- Orlando, FL.
tional Conference on Pattern Recognition, Urbana.
Yuan, F. (2008). A fast accumulative motion ori-
Phillips, W. III, Shah, M., & da Vitoria Lobo, N. entation model based on integral image for video
(2002). Flame recognition in video. Pattern Rec- smoke detection. Pattern Recognition Letters,
ognition Letters, 23(1-3), 319–327. doi:10.1016/ 29(7), 925–932. doi:10.1016/j.patrec.2008.01.013
S0167-8655(01)00135-0

796
Stereo-Vision-Based Fire Detection and Suppression Robot for Buildings

KEY TERMS AND DEFINITIONS Video Based Fire Flame Detection: Analyze
the spectral, spatial, and motion orientation char-
3D Tracking: Position based tracking algo- acteristics of the fire flame regions from the image
rithm to provide feedback on the real-time posi- sequences of the video to detect fire.
tion of the specified target with a high frame rate. Video Based Smoke Pattern Recognition:
Fire-Fighting Robot: The mobile robot is able Analyze the spectral, spatial, and motion orien-
to successfully complete a fire-extinguishing task. tation characteristics of the fire smoke regions
Fire Surveillance System: Video based fire from the image sequences of the video to detect
surveillance technology is suitable for early detec- fire smoke.
tion of fires due to its low detection delay, good Visual Servoing: A visual tracking system to
resolution, and high localization accuracy. track an object and video data must be transmitted
Stereo Vision: 3D position can be generated and processed in real time in order to feedback
by means of two calibrated cameras. to the system.

This work was previously published in Depth Map and 3D Imaging Applications: Algorithms and Technologies, edited by
Aamir Saeed Malik, Tae Sun Choi and Humaira Nisar, pp. 418-432, copyright 2012 by Information Science Reference (an
imprint of IGI Global).

797
Section 4
Utilization and Application

This section discusses a variety of applications and opportunities available that can be considered by practitioners
in developing viable and effective Geographic Information Systems programs and processes. This section includes
16 chapters that review topics from case studies in Africa to best practices in Asia and ongoing research in the
United States. Further chapters discuss Geographic Information Systems in a variety of settings (environment,
city planning, disease/disaster management, etc.). Contributions included in this section provide excellent cover-
age of today’s IT community and how research into Geographic Information Systems is impacting the social
fabric of our present-day global village.
799

Chapter 49
Children’s Maps in GIS:
A Tool for Communicating Outdoor
Experiences in Urban Planning

Kerstin Nordin
Swedish University of Agricultural Sciences, Sweden

Ulla Berglund
Swedish University of Agricultural Sciences, Sweden

ABSTRACT
Since 2002 the authors have successively developed “Children’s Maps in GIS”, a method for children’s
participation in spatial planning. Their studies show that 10-15 year-olds are capable of reading maps
and using a GIS-application for communicating their interests in a stable and useful manner. The purpose
of this article is to discuss the first stages of implementation in a real world project, in relation to ICT.
The authors report experiences from a Swedish municipality using Children’s Maps in GIS in a survey
with over 600 children as part of a comprehensive planning process and give examples of how data can
be visualized. A significant digital divide between different parts of the administration is noted. In the
ongoing development into an Internet version of the method the authors aim to increase the access to
the GIS-application and develop standard procedures for categorizing and analyzing data.

INTRODUCTION in effect implemented, by most countries around


the World. The local environment is such an issue
It is considered that children (0-18 years) should be concerning children, this is identified in the UN
afforded the opportunity to influence those matters action plan Agenda 21 (United Nations, 1993)
that affect or concern them, as outlined in Article where it is stated that young people’s participation
12:1 of the UN Convention of The Rights of the is vital to the realization of a sustainable society.
Child (UNICEF, 1990), a convention signed, and Therefore there is an expectation for young

DOI: 10.4018/978-1-4666-2038-4.ch049

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Children’s Maps in GIS

people’s experiences and wishes to be included criticized for lacking strategies for handling of
in local-authority planning. biased power relations. This means that less
In the background section of this article we powerful stakeholders, and especially those whose
report on our theoretical framework and describe perspectives deviate from existing policies, are at
the method Children’s Maps that we have devel- risk of being unable to achieve the influence that
oped through our research. In the result section their arguments call for (Flyvbjerg, 2001; Sager,
we document how this method was used outside 1994; Sager, 2006).Children and youth tend to
the research-context in a municipal as a part of a fall into this category, putting their participation
comprehensive planning process. In the discus- at risk of being reduced to pseudo-democratic
sion section we comment on the results and make practices, such as manipulation, decoration or
comparisons with experiences from previous tokenism, the lower steps in the “Ladder of Young
research-pilots. Finally we draw conclusions on People’s Participation” (Hart, 1997). Case studies
further development of the method. in various European countries highlight tendencies
towards superficial participation resulting in little
real influence (Rogers, 2006; Tonucci, Prisco,
BACKGROUND & Horelli, 2004) and unwillingness or inability
from decision-makers to take children’s interests
This paper reports on research carried out within into account has been noted by many researchers
the discipline of Landscape architecture although (Chawla, 2002a; Lynch, 1977; Matthews, 1998;
the theoretical framework shows an interdis- Wilhjelm, 1999; Woolley et al., 1999).
ciplinary approach including planning theory, Conversely, literature shows a strong sup-
social theory as well as theory of environmental port for the idea that planning and design of the
psychology, geography and GIScience. physical environment attracts the interest of young
people and it is therefore possible to include them
Children, Planning and Participation in society’s democratic processes (Horelli, 1998;
Percy-Smith & Malone, 2001; Urban Green Spaces
In today’s construction of childhood, children are Taskforce, 2002; Norsk form, 2005.) Furthermore,
acknowledged as active social and cultural actors following our personal experiences, there is a de-
(Holloway & Valentine, 2003; Christensen & sire among many planners to engage children and
Prout, 2002). According to Christensen & O’Brian youth to a larger extent. We, as well as Freeman
(2003, p. 2) they are also recognized as informants & Aitken-Rose (2005) have noticed a growing
and participants in research, having “emerged interest toward child-focused and youth-focused
as key source for understanding their everyday methodologies for working with adolescents.
life”. Christensen further stresses the relevance There is an obvious need to find solutions to
for planning of children’s “emplaced knowledge the dilemma of how to incorporate young peoples’
[…] full with personal and social meaning, built perspectives into local planning processes (e.g.,
up through their everyday encounters” with their Christensen & O’Brian, 2003; Driskell, 2002).
local environment” (Christensen, 2003, p.16). The A model for enhancing children’s participation
overall idea of governance using communicative in decision-making is the “Pathways to par-
planning (e.g., Healey, 1997; Healey, 1999) with ticipation”, developed by Shiers (2001). Shiers,
reference to Habermas’ theories on communica- drawing on Hart (1997), identifies five levels of
tive rationality and communicative action (e.g., participation. The model can be used as a tool for
Habermas, 1984) today is widely practiced in planning for participation. Level 3, “Children’s
developed countries. However, this practice is views are taken into account” is the minimum

800
Children’s Maps in GIS

you must achieve if you endorse the UN Con- can collect, store, analyze and visualize data that
vention on the Rights of the Child. The higher is relevant and trustworthy to all participants in
levels of participation require that children are a planning process. The first step is then to make
involved in decision making processes (level 4) data concerning children’s use and perceptions
and children share power and responsibility for of the outdoor environment available in a plan-
decision-making (level 5). A method aiming to ning decision system in a way that is true to the
enhance children’s participation should be useful child’s own views.
at least at level 3.
Several initiatives concerning the empower- Geographic Information
ment of children within spatial planning and related
issues have been influenced by the UN Convention Geographic information, often represented in the
of the Right of the Child. Among these are the form of maps, plays an important role in planning
“Town of Children Project” started in 1991 which practice; today almost all of this information is
has engaged more than 200 Italian municipalities digital and part of a geographic information sys-
(Baraldi, 2003), the UNESCO Project “Growing tem, GIS. Although the use of GIS has increased
up in Cities” (Chawla, 2002b; Driskell, 2002), in Sweden in the public sector since 2003, there is
and the “Children as Community Researchers”, still a gap between those who can and those who
initiated by Roger Hart and launched by UNI- cannot use GIS for more advanced applications.
CEF (UNICEF, 2001).These varied approaches To look at static digital maps is still the most
highlight the diversity of methodologies available common use of GIS (ULI, 2008).
for children’s participation in planning of cities. MacEachren (2004, pp. iii-iv) points out that
The changes brought about by globalization the map is evolving as an interactive interface to
and the diffusion of information and communica- geospatial information, not just a static representa-
tion technology (ICT) is gradually opening up new tion of information. He states that understanding
opportunities for the involvement of young people the processes of map-making and the perceptual-
in planning and development of cities (Horelli & cognitive processes through which these repre-
Kaaja, 2002). One such example is a tool in the sentations are understood are of importance in
form of an Internet-based design game “The Ad- decision making and the enhancement of the role
venture Forest” invented by Finnish researchers of maps in collaborative activities.
(Kyttä, Kaaja, & Horelli, 2004); other example is Maps and cartography has been critically dis-
the sharing of knowledge on the Internet site “The cussed since the 1990´s with relation to linking
FreeChild Project” (The FreeChild Project, 2008); geographic knowledge with power (Crampton
There are several reports concerning the insti- & Krygier, 2006). They note drawing on Pickles
tutional constraints of adapting new methods and (1995) that “Maps are active: they actively con-
technologies within the planning practice (Horelli struct knowledge, they exercise power and they can
& Kaaja, 2002; Vonk, Geertman, & Shot, 2005). be powerful means of promoting social change”
An assumed reason for this is that most current (Crampton & Krygier, 2006, p.15).
tools are too inflexible and incompatible for most Geovisualization of data enhance visual
planning tasks, being oriented towards techno- thinking in exploring data and in collaboration
logical problems rather than planning problems processes (Jiang, Huang, & Vasek, 2003). Children
(Vonk et al., 2005) tend to develop a visual competence in an early
A method for communicating children’s age. Of special interest is specifically children’s
outdoor experiences into every-day planning understanding of a map as a representation of the
practices should be an easy-to-handle method that real world. Plester, Blades, and Spencer (2006),

801
Children’s Maps in GIS

describe how children during the first school years from this age group are supposedly developed,
learn to master correspondence and to interpret providing the researcher with multiple options
direction and distance, i.e. skills needed to under- concerning methodology.
stand the representation of an environment in an School is a frequently used environment for
aerial photograph or map. research with children. In school you find an
If we focus on GIS as a data collecting tool, is organization that can assist with information and
it useful when collecting information from chil- the permissions needed for research that concerns
dren? When it comes to drawing on digital maps children. If the aim and the methods of the research
the competence of children above 10 years of age can make it fit into the curriculum at school, a
concerning interpretation of symbols, scale etc win-win situation could be created. If this is not
can be expected to be as good (or bad) as that of possible it might be regarded unethical to ask for
adults (Plester et al., 2006). Positive experiences of the time of teachers and pupils.
the use of digital maps with children are reported According to the report “Benchmarking Ac-
by for example Kyttä et al. (2004) and Berglund cess and Use of ICT” (Korte & Hüsing, 2006),
(2008). Holloway and Valentine (2003) in their all Swedish schools have access to computers
study of children and ICT comment that children and Internet, but the access is inconsistent across
on the whole tend to be more technically compe- schools, with students and teachers at independent
tent in the use of computers than their parents are. schools having more access to ICT than students
Elwood (2006) points out that GIS has a his- and teachers in municipal schools. This includes
tory of being an expert tool in a rational, expert- hardware as well as connection to broadband and
driven planning process. Information that can be software (Skolverket, 2009). Furthermore, most
displayed visually is privileged as are the use of teachers in Sweden do not feel that that they have
quantitative techniques for spatial analysis. This sufficient knowledge of ICT, and roughly half of
may implicate that other forms of knowledge may the teachers do not believe that ICT encourages
be excluded from processes in which GIS is used. learning. That puts Sweden almost at the bot-
The widespread use of Internet and mobile GIS tom of the list for ICT use in schools in Europe
makes it important to discuss the digital divide (Skolverket, 2009).
with aspect to access to GIS and to spatial data
(Elwood, 2006). So why use GIS? “Elwood an- The Method Children’s Maps
swers (2006, p. 693) “In spite of its limitations
and challenges, GIS is tremendously important Since 2002 we have in our research successively
because it is such a powerful mediator of spatial developed “Children’s Maps in GIS”, a method
knowledge, social and political power …” The for communicating young people’s outdoor experi-
use of such an expert tool as GIS, can give the ences into urban planning. The method Children’s
information from the children some legitimacy Maps consist of several activities: contact with
(Elwood, 2002). school and parents, information to children, the
use of a GIS-application and feed-back to the
Research with Children in School children and teachers (Berglund, 2008; Berglund,
Nordin, & Eriksson, 2009). In the following we
There is a significant amount of literature regard- refer to the method as “Children’s Maps” and
ing pupils of 11-12 years of age. In our judgment depending on the context, we use young people as
this is due not only to their experience of place but well as children and youth, students and children
also to their writing and reading abilities and in for labeling our target group.
some cases the fact that technical skills of children

802
Children’s Maps in GIS

We aimed at developing a method that was easy found no special complications regarding the use
to manage within the school context, and stimulat- of the method in the different research-contexts.
ing and fun for children to participate with. We Our studies show that 10-15 year-olds are capable
tested the method with and adapted it for children of using a GIS-application for communicating
from 10 to 15 years of age. As we were aware of their experiences and interests into the local urban
that GIS technology may empower or disempower planning process in a stable and useful manner.
actors and institutions as described in Elwood We found it easy to fit the project into schools and
(2002) we wanted to build a GIS-application that to get some teachers to run a special teacher’s ap-
should not demand technical skills or expensive plication. The development and tests are described
software or databases. The result should be trust- in detail in Berglund (2008) and supplemented in
worthy and relevant to a town district level in the Berglund et al. (2009).
planning practice. The GIS-application can be described as a
We have in several research pilots (Berglund computerized questionnaire. Each student an-
& Nordin, 2007; Berglund, 2008; Berglund et al., swers seven questions individually. The student
2009) tried to answer the following questions: answers the question by making a sketch map
Can we attain location bound information that is directly on the screen, and when required, the
relevant to planning and suitably reliable, with student includes text information as an attribute
the help of children? Will this information be ac- to the geographical object created in the sketch
cepted by planning authorities? Are children able map. An overview of the procedure is presented
and willing to document their movements using in Table 1. The exercise generally takes about 15
GIS on digital maps that are used within today’s minutes and is carried out during class hours. The
planning? Can the children’s school become a software is an application built in ESRI ArcView
mediator between the children and the planners; 3. Laptops are used to store the application as well
can it be the place for developing and implement- as the data collected.
ing the method? The teachers participating in a study are using
During the period 2003-2006, the method an application that works in the same way as the
was tested with children and teachers in several children’s application. They indicate routes and
research-pilots in different locations in Sweden. places that are used for teaching by themselves
Several tests were carried out in Stockholm; in a or used by other teachers of their school.
high-status inner-city district as well as in a low-
status outer-city district with a high proportion
of immigrant families within an interdisciplinary APPLICATION OF THE METHOD
research project “Children and Open spaces in the IN REAL-WORLD USE IN TÄBY
City – Accessibility, Use and Influence”. In 2006
we got an opportunity to test the method in a rural We have in the text above described the method
context (a village, Örbyhus) as we got funding Children’s Maps through our research-pilots.
from the Road Administration. In that project What was still lacking was the evaluation of the
we adjusted the method to better catch questions implementation into planning practice. In 2008,
concerning traffic safety. In all our studies, we the municipal of Täby wanted to use the method
supplied computer, software and supervision for “Children’s Maps” to get information about how
the children participating, and did not ask for children and young people perceive and use the
any IT-qualifications from the teacher. We have outdoor environment in Täby. The project was
through diverse means, evaluated the function called “Place to grow” (our translation) and a part
and the trustworthiness of the method. We have of the comprehensive planning process. This gave

803
Children’s Maps in GIS

Table 1. Questions included in the survey and data inputs that are possible to make. The questions are
directly translated from Swedish and not adapted to English speaking children

Questions on the screen Geographic data Attribute


A call to register a new student by School, grade, number and sex
writing a unique number
1.Where do you live? Point object(s) No attributes
2.What routes do you use to school? Poly-line object(s) Means of travel: walk, cycle, walk/cycle to public transporta-
tion
Comments: Free wording
3.What routes do you use in your Poly-line object(s) Movements: walk, run, walk the dog, walk/cycle to public
free-time? transportation
Destination: friends, activity or else
Comments: Free wording
4.Where do you go when you are Polygon(s) Common activities all year around: meet friends, play, play
out-doors? boll, training, sun-bath, barbeque, skate
Common activities in winter time:
Ice-skating, skiing, play, sliding
Comments: Free wording
5.Are there places that are hazardous Point object (s) Comments: Free wording
or make you feel uncomfortable?
6.Do you have a favorite place? Point object (s), max 3 objects Comments: Free wording
7.Do you have any suggestions for Point object (s) Comments: Free wording
improvements?

us an opportunity to test if the method Children’s arenas. The topography is varied, and green areas
Maps could be transferred from research to prac- with woods and lakes are situated close to the
tical use. We decided to use the version that we residential areas. For an overview, please, look
had developed and tested in 2006 with special at http://maps.google.se/maps, search for Täby.
focus on traffic safety (Berglund et al., 2009). The initiative for a co-operation came from
Our conclusions from the research-pilots are that Technical Office at the municipal administration
this version works well as an over all tool and that in Täby. Technical Office is divided into five
traffic safety has proved to be of high interest in units: Building Permits Unit, Surveying Division,
local planning with a child perspective. Street and Park Unit, Water and Sewage Unit and
By participating in the project we wanted to Urban Planning Unit. The project leader and the
get some answers on the following questions: project members came from the Street and Park
Unit. The unit works with developing and main-
• How can the method work outside the re- tenance of streets, roads, parks and forests. The
search context? aims of the project “Place to Grow” were (Täby
• How can the information be accessed, han- kommun, 2009b):
dled and valued within a local authority?
• To make the voices of children and youth
Täby has about 62 000 inhabitants and is situ- heard in urban planning in Täby,
ated 15 km north of Stockholm city. The residential • To gather the results into a planning-friend-
areas consist partly of block of flats and partly ly tool for all who plan and design the pub-
of areas with private homes. In the centre of the lic places and out-door environments in the
municipal are a big shopping mall and some sports municipality,

804
Children’s Maps in GIS

• To get concrete maps concerning places exceptions. Besides the spatial information the
and routes that children and youth use, and project “Place to grow” wanted to get as much
in what way they use them, verbal comments from the children as possible
• To get some general information about about their perceptions of dangerous or unpleas-
what children and youth see as good or bad ant places (question 5). The supervisor clarified
about the neighborhoods of Täby, that boring or weary places could be marked as
• To draw conclusions from the answers well. The supervisors also reminded the children
about which places and routes ought to be to think about the winter aspect (part of question
protected, developed or changed. 4) (Täby kommun, 2009b).
This project did not prioritize the participation
During 2008, surveys were carried out in 28 of the teachers. However, 16 teachers from 11
out of 42 schools in Täby that have students in the schools were given the opportunity to take part.
age between 11-18 years. The schools were chosen All schools, teachers and children participated
to give a good geographical representation of the voluntarily.
municipality. In all a total of 635 children from The data was transferred from the lap-tops into
34 classes participated. 54% of the children were the data server administrated by the IT-department
11-12 years, 37% were 15-16 years and 9% were of the local government. Before storing the data,
17-18 years old (Täby kommun, 2009b). In Table some harmful comments on individuals were
2 there is an overview of the schools participating taken away by the project team. A copy of the
in the project. Schools with children with special data set was sent to us for research purposes and
needs did not participate. back-up. The administration of data showed to
The Surveying Division of Täby provided the be a time-consuming activity when using the ap-
digital maps with detailed and updated informa- plication developed for small-scale research in a
tion. As data were in MapInfo–format (tab) they large scale survey.
had to be exported to ESRI ArcView format (shape)
and then loaded into the Children’s Maps GIS
application on laptop-computers. REPRESENTATION OF DATA

The Use of the Method The project team had full access to data through
Children’s Maps in Täby an internal digital support system. The team
represented data in several maps and charts pre-
The method used in Täby is very much like the sented it in a report (Täby kommun, 2009b). The
method described earlier in this article, with some project also produced additional information as

Table 2. Numbers of schools participating sorted on level and school form

Total number of schools municipal independent Not participating


Compulsory schools 24 participating of 37 16 participating of 19 8 participating of 18 7 schools with children
Age: 11-16 years possible possible possible with special needs, 6 of
them are independent
schools
Upper secondary 4 participating of 7 pos- 1 participating of 2 3 participating of 5 1 school for students
schools sible possible possible with special needs, 2
Age: 16-18 schools with vocational
years education

805
Children’s Maps in GIS

field notes, analysis and presentations that have as dangerous or insecure places. The explanation
not been published. given is that these are places the children know
A summary of the results is a part of the com- well, and therefore have a lot of information about.
prehensive plan (exhibited for consultation until An example is the central shopping mall that has
September, 15, 2009), under the heading Public most marks for favorite place and as a place that
Interests – important out-door places for children can be perceived as insecure. A closer look at the
and youth. The comprehensive plan highlights the comments shows that the indoor environment,
following conclusion: with a lot of people and activities is perceived as
secure, and the parking lot outside is perceived
“[…] many popular places have some qualities as empty no mans land and insecure. Also, the
in common – space, variation, people and vicin- children have made some comments on rebuild-
ity. […] The biggest problem for children in the ing of parks that in one case has been successful
local environment is traffic, followed by lack of according to the comments, in another case the
illumination and worries about other people’s comment from the children are “it was better
behavior. Bad maintenance is another factor that before”. Some new places for skating and biking
makes places difficult to use.” (Täby kommun, are asked for in the survey, and that information
2009a, p. 39). is also highlighted in the report.
The participants of the seminar expressed their
The results were also presented on a seminar appreciation of the information of children’s use
April 14, 2009. Participants were politicians and and perceptions on the local environment and saw
employees representing the Technical Office, the connections to prioritized issues like on school-
Child Care and Elementary School and Culture and road safety and security issues in the outdoor
Leisure. In the seminar and in the report “Place environment, as well as a basis for prioritizing
to Grow” (Täby kommun, 2009b), the following maintenance of streets and parks.
aspects were brought forward. A follow-up meeting was held on May 18,
In general there are no big differences noted 2009 with one researcher and people in the local
between how boys and girls are moving around administration that were most likely to use the
except in the 9th grade (15-16 years), where there information from the children (the GIS-expert
are some notable differences. The boys report more from Surveying Division and employees from the
physical activities than the girls. The girls report Street and Park Unit and from the Urban Planning
that they “hang around”, looking at activities, Unit together with Child Care and Elementary
talking to each other. When it comes to safety/ School and Culture and Leisure departments).
security, girls 15 years and older, have made The question was: what part of the dataset is the
more comments than the boys on places that they most interesting, and how do you want to access
perceive as insecure. Such places can be where it? The following alternatives were discussed: a)
people are gathering at nights, empty places or plain thematic maps (Figure 1), b) map visualizing
badly illuminated places like woods and parks. some attributes e.g. places for meeting friends
Places to meet friends are a favorite in all ages. (Figure 2), c) summarized maps (Figure 3), and
Many children have marked places indoors as the d) text information connected to geographical
shopping mall, training grounds or at a friend’s object (Figure 4).
home as a favorite place. Most comments in all The participants from the administration an-
ages about insecure places concerns traffic, lack of ticipated to use the information in urban planning
illumination and suspicious and unsavory people. and in the detailed planning of projects concern-
Places that are often used also have most marks ing traffic safety, green-structure planning and

806
Children’s Maps in GIS

Figure 1. A thematic map showing all the places marked in a part of Täby without editing afterwards.
The attributes are not visualized. Symbolization of the basic map is different from what the children saw
on the screen when drawing the sketch map.

Figure 2. A map showing places for meeting friends. It is one of the multiple-choice options that are
given in question number 4. Each bar represents one annotation from one child.

Figure 3. A representation using the centroids of each polygon to construct a density map. When there
are many centroids of the polygons close to each other, the color gets darker.

807
Children’s Maps in GIS

Figure 4. A map showing places that are perceived as hazardous or dangerous. Connected to the symbol
are text annotations made by the children. The examples on the right are directly translated from Swedish.

maintenance of open spaces. The Technical Office seen in our research pilots; schools tend to use
had access to data from the survey, through the other areas than the students do themselves, and
internal digital support system while the Child with less concentration towards central parts of
Care and Elementary School unit did not have the neighborhoods.
access to the digital information. Because of the
character of the information, officials from the
Child Care and Elementary School unit expressed DISCUSSION
their interest in having access as well.
In the report “Place to Grow” there is no By reporting on this initial test of the method
presentation of the information given by the Children’s Maps in a real-world planning project
teachers although there is some information in the we want to discuss problems and possibilities with
dataset. Figure 5 shows the difference between implementation into a local government context.
the information given by the children and the We also reflect on empowerment of children and
information given by the teachers of one of the school and on the digital divide within the local
schools. It shows the same pattern that we have authority administration.

Figure 5. Focus in “Place to grow” has been on children’s mapping. Some teachers have contributed
with information as in this example.

808
Children’s Maps in GIS

On a general level we could ask if the project further communication within the planning pro-
“Place to Grow” shows that Children’s Maps can cess. This must be regarded as a weakness when
function as a vehicle to facilitate young people’s considering the empowerment of children, making
influence on the local environment. One conclu- it less likely to positively influence their attitude
sion is that the project allows the opportunity for to future contributions to society. Feed-back is,
children to participate to a degree that corresponds however, prescribed by us in the steps for the use
with level 3 “Children’s views are taken into ac- of Children’s Maps. This is based on numerous
count, and given due weight in decision making” studies of participation projects with children stat-
(Shier, 2001, p. 111). A weakness of the method ing that being listened to and getting feed-back
is that young people are delivering data, yet have are cornerstones needed for a fair involvement
neither ownership nor control over it. The maps of children in a participation process (Chawla,
produced are adapted to the conventional planning 2002b). In this sense the “Place to Grow” project
process, which we judge crucial for the information is incomplete, as well as our own research-pilots.
to be taken into account in real-world planning. As reported, some of the results from the
The power of the children is embedded in the fact project “Place to grow” have been integrated in
that their knowledge and perceptions are visual- the consultation for the comprehensive plan yet
ized, and have the same technical significance it is too early to assess the real impact on the lo-
as other forms of planning information. At this cal environment or on the empowerment of the
point, the deliberation between different aspects children. If the administration can not assimilate
becomes political issue. the information in its different activities, it may
be another case of tokenism (Hart, 1997).
Lessons Learned by Stepping The supervisors came from the Street and Park
Out of the Research Context Unit of Täby Municipality. The result indicates
that the supervisors have encouraged the children
The method has shown to function outside the to make comments on favorite places (question 6)
research context as a tool for collecting and storing and suggestions for improvements (question 7),
information. Information from over 600 children besides the expressed aim of gathering as much
concerning their use and perceptions of the local verbal information as possible about unpleasant
environment has been collected within the local and dangerous places. The extent of the comments
administration. On the whole the results are con- from Täby is larger than from the research-pilots.
sistent with the results from the research–pilots. The influence from the supervisors can imply
The seven questions (see Table 1) in our research that the children are used as a transmitter for the
studies are tested in different social contexts and supervisors own views, which could lead others to
in different age-groups. The development and tests assess the information as less trustworthy. On the
are described in detail in Berglund (2008) and other hand, the Children’s Maps can be seen as a
supplemented in Berglund et al. (2009). By using tool for getting on speaking terms with children
the same pre-formulated questions in every survey and youth, and thus gaining information that is
we assert that the questions are not biased by any otherwise difficult to attain. This is important to
ongoing planning debate or by personal agendas. consider when choosing persons to instruct the
The “Place to Grow” project had no procedures children.
to communicate back to the children (and teachers) One bias that the Täby case highlights is the
on the impact of their participation. Even if their choice of schools participating in mapping. If the
information is taken into account, they may not case is just a small project, concerning one small
be aware of it and the project did not provide for village this might not be an issue. When the method

809
Children’s Maps in GIS

is used for gaining information about a whole mu- The project “Place to grow” shows that there
nicipality, there needs to be some consideration on is a great demand for information from children
how the schools and classes are chosen. In Täby, and from various administrations. In Täby there
schools and classes with students with special were demands of information from different units
needs were excluded. We have not reflected or within the Technical Office as well as from the
tested how the method can or cannot be used by Child Care and Elementary School and from Cul-
children with different kind of disabilities. But to ture and Leisure. The most obvious demands came
exclude these students from the use of Children’s from the park unit, from the traffic department
Maps is certainly not our intention. and from people involved in “Safe in Täby” (our
In the report “Place to grow”, it is noted that translation), a project within the local government.
most of the children were able to orientate on As we have stated earlier in this article, maps
the map quite well, except from one district with (and GIS) represent power and this means that
a very hilly topography, were the slopes are not parts of the administration that cannot express
shown in the basic orientation map (Täby kom- their interests in maps (or GIS) have a weaker
mun, 2009b, p. 16). This is an aspect that we position when it comes to maintaining their
have not encountered earlier and, of course, it interests in urban planning. Child Care and El-
is important that the basic map used for orienta- ementary School Unit have until now had little
tion represents features in the landscape that are interest in geographic information. This may be
important for children. a result of different approaches toward different
Another point to observe is the low number sources of information and technologies within
of teachers participating in the mapping. The an administration.
project leader was reluctant to include teachers’ The seminars showed that the written com-
mapping in the project as the focus was on how ments were of high interest. As our focus is on
children used and perceived the outdoor environ- the spatial information, we have not considered
ment. However, we know from our own pilots that how to analyze or categorize the written infor-
teachers might be reluctant to use the computer mation. The participants wanted the information
themselves to input information. Some have will- to be categorized for easy access. Qualitative
ingly done so while others have preferred to be analyzes of the text information calls for time
helped by the supervisor. This observation is in and competence to carry it out in a qualified way.
line with the reporting from the Swedish National One solution is to have standard categories in the
Agency for Education (Skolverket, 2009). We questionnaire, yet if we restrict the children to
have not engaged ourselves in developing the placing information into predefined categories,
teacher’s graphical interface to the same degree we are afraid that this will produce biased and
as the children’s version. The interface that we incomplete results. By this we mean that it is
have used has still to prove that it functions. As better to let children express themselves in their
the teacher does not report a personal use but own words in relation to personal feelings such
functions as a representative for the school’s use as favorite places, dangerous places and wishes
of the outdoor environment, it is not considered for change. It is then possible to make different
to be a significant problem that not all teachers interpretations while the source material is kept
are involved. The school’s use and need for good intact. This ambition may lead to a somewhat
environments and safe routes for transportation less administration-friendly representation, yet
can be reported by those teachers who are willing the strength of the written text is that it provides
and able to do so. an easier medium to hear the voice of the child,
than by simple markings on the map.

810
Children’s Maps in GIS

Digital Divide each case and the administration of data. From the
very beginning we aimed at an Internet solution,
The geographical databases produced by local we now have funding for transferring the GIS-
government in Täby are of high quality regarding application into Internet so we will address issues
scope, scale of details and actuality. It is mainly of unequal access to hardware between different
used as a background map for the internal digital schools as mentioned by the Swedish National
system, containing a large amount of different Agency for Education (Skolverket, 2009). We
information used in administration. Geographic will also develop standard procedures for catego-
information was not accessible for Child Care rizing and analyzing data, making the threshold
and Elementary School units within the internal as small as possible for the user to interactively
digital support system. The preliminary finding explore the data.
from Täby tells us that the use of GIS is still in- Parallel to this, we will write a manual de-
consistent across the administration, and that it is scribing, among other things, how the supervi-
still a technique that demands training to master. sors should conduct themselves when using the
In Täby, as in many municipalities in Sweden, method and how we consider it could be combined
there are just a couple of people engaged in the with other means for gaining deeper information
technical support of the data system that have the and higher levels of participation. We will also
mandate to set up and decide the functionalities develop guidelines concerning feed-back to chil-
that can be obtained by users in administration. dren and teachers and further explain the role of
The GIS-expert plays a crucial role in deciding the teacher’s map.
what is possible to do or not.
The full potential of the internal digital system
with interactive maps seems not to have been used CONCLUSION
or even requested. The same situation is reported
from research on bottlenecks with implementation The information that young people and teachers
of planning support system (Vonk et al., 2005). If can provide is asked for in many contexts, not
the information stored in digital information sys- only in urban planning. The use of GIS as an
tem is not relevant for you, why use the systems? interactive tool for analyses and presentations is
Interactive maps and multimedia representation not well established in Swedish local authorities.
of data helps the user to construct knowledge. If In Täby it is still an expert tool, and especially
important information is stored in files, charts and the social administration has not integrated the
archives or unreadable static maps they have very technique in the daily work. Here the representa-
little influence. If relevant information, such as tion on static maps still plays an important role in
children’s experiences of the outdoor environment, communication processes between the municipal
is accessed through an interactive information administrations as well as with the public.
system, this will enhance the use of the informa- We look forward to transferring Children’s
tion, and maybe even transform it into knowledge. Maps to Internet. We do not aim for the application
to be open for use by everyone, only accessible
Need for Improvements within projects, to secure that the information
on Children’s Maps given by children and youth is not misused. Be-
sides, local authorities, a school, or an organization
Problems with the GIS-application in its cur- with children’s wellbeing on the agenda could get
rent stage are; access to the GIS application, the access to the tool. As the use of Google maps and
adaptation of the maps in the GIS-application to Web 2.0 is spreading, we think it is more important

811
Children’s Maps in GIS

than ever that the knowledge and the interests of Christensen, P., & O’Brian, M. (2003). Children
children and youth are made visible and accessible. in the city: Introducing new perspectives. In
We believe that the presence of Children’s Maps Christensen, P., & O’Brian, M. (Eds.), Children
on Internet will contribute to this and in the end in the city: Home neighbourhood and com-
hopefully be helpful in giving young people the munity (pp. 1–12). London: RoutledgeFalmer.
influence on local planning and maintenance of doi:10.4324/9780203167236
their neighborhoods that they have the right to.
Christensen, P., & Prout, A. (2002). Work-
ing with ethical symmetry in social research
with children. Childhood, 9(4), 447–497.
REFERENCES
doi:10.1177/0907568202009004007
Baraldi, C. (2003). Children’s social participa- Crampton, J. W., & Krygier, J. (2006). An in-
tion in the town of adults. In Christensen, P., & troduction to critical cartography. ACME An
O’Brian, M. (Eds.), Children in the city: Home International E-Journal for Critical Geographies,
neighbourhood and community (pp. 184–205). 4(1), 11-33. Retrieved Mach 29, 2009 from http://
London: RoutledgeFalmer. www.acme-journal.org
Berglund, U. (2008). Using children’s GIS maps Driskell, D. (2002). Creating better cities with
to influence town Planning. Children. Youth and children and youth. London: Earthscan.
Environments, 18(2), 110–132.
Elwood, S. (2002). GIS use in community plan-
Berglund, U., & Nordin, K. (2007). Using GIS ning: A multidimensional analysis of empower-
to make young people’s voices heard in urban ment. Environment & Planning A, 34, 905–922.
planning. Built Environment, 33(4), 469–481. doi:10.1068/a34117
doi:10.2148/benv.33.4.469
Elwood, S. (2006). Critical issues in participatory
Berglund, U., Nordin, K., & Eriksson, M. (2009). GIS: Deconstructions, reconstructions, and new
Barnkartor i GIS och trafiksäkerhet (Children’s research directions. Transactions in GIS, 10(5),
maps in GIS and traffic safety) (Tech. Rep.). Up- 693–708. doi:10.1111/j.1467-9671.2006.01023.x
psala, Sweden: Rapporter Institutionen för stad
och land. Flyvbjerg, B. (2001). Making social science
matter: Why social inquiry fails and how it can
Chawla, L. (2002a). Toward better cities for succeed again. Cambridge, UK: Cambridge
children and youth. In Chawla, L. (Ed.), Growing University Press.
up in an Urbanizing World (pp. 15–34). London:
Earthscan. Freeman, C., & Aitken-Rose, E. (2005). Future
shapers: Children, young people, and planning in
Chawla, L. (Ed.). (2002b). Growing up in an New Zealand local government. Environment and
urbanising world. London: Earthscan. Planning. C, Government & Policy, 23, 227–256.
Christensen, P. (2003). Place, space and knowl- doi:10.1068/c0433
edge: Children in the village and the city. In Habermas, J. (1984). The theory of communica-
Christensen, P., & O’Brian, M. (Eds.), Children tive action (Vol. 1). Boston, MA: Beacon Press.
in the City: Home neighbourhood and com-
munity (pp. 13–28). London: RoutledgeFalmer.
doi:10.4324/9780203167236

812
Children’s Maps in GIS

Hart, R. A. (1997). Children’s participation: The Lynch, K. (Ed.). (1977). Growing up in cities:
theory and practice of involving young citizens in Studies of the spatial environment of adolescence
community development and environmental care. in Cracow, Melbourne, Mexico City, Salta, Toluca,
London: Earthscan. and Warszawa. Cambridge, MA: MIT Press.
Healey, P. (1997). Collaborative planning: Shap- MacEachren, A. (2004). 1995). How maps work:
ing places in fragmented societies. Basingstoke, Representation, visualization, and design (pp.
UK: Palgrave Macmillan. iii–iv). New York: Guilford.
Healey, P. (1999). Institutionalist analysis, com- Matthews, H., Limb, M., & Taylor, M. (1999).
municative planning, and shaping places. Jour- Young people’s participation and representation
nal of Planning Education and Research, 19(2), in society. Geoforum, 30, 135–144. doi:10.1016/
111–121. doi:10.1177/0739456X9901900201 S0016-7185(98)00025-6
Holloway, S., & Valentine, G. (2003). Cyberkids: Norsk Form. (2005). Barnetråkk: Barns tilgjen-
Children in the information age. London: Rout- gelige uteareal (Children’s tracks: Children’s
ledgeFalmer. accessible outdoor environments). Retrived
June 20, 2009 from http://www.norskform.
Horelli, L. (1998). Creating child-friendly envi-
no/?V_ITEM_ID=1282
ronments: Case studies on children’s participation
in three European countries. Childhood, 5(2), Percy-Smith, B., & Malone, K. (2001). Making
225–239. doi:10.1177/0907568298005002008 children’s participation in neighbourhood settings
relevant to the everyday lives of young People.
Horelli, L., & Kaaja, M. (2002). Opportunities
PLA Notes, 42, 18–22.
and constraints of internet-assisted urban plan-
ning with young people. Journal of Environ- Pickles, J. (Ed.). (1995). Ground truth: The social
mental Psychology, 22, 191–200. doi:10.1006/ implications of geographic information systems.
jevp.2001.0246 New York: Guilford.
Jiang, B., Huang, B., & Vasek, V. (2003). Geovi- Plester, B., Blades, M., & Spencer, C. (2006).
sualisation for planning support systems. In Geert- Children’s understanding of environmental repre-
man, S., & Stillwell, J. (Eds.), Planning support sentations: Aerial photographs and model towns.
system in practice (pp. 177–191). Berlin: Springer. In Spencer, C., & Blades, M. (Eds.), Children and
their environments: Learning, using and designing
Korte, W. B., & Hüsing, T. (2006). Benchmark-
spaces (pp. 42–56). New York: Cambridge Univer-
ing Access and Use of ICT in European Schools
sity Press. doi:10.1017/CBO9780511521232.004
2006. Retrieved from http://empirica.com/pub-
likationen/documents/No08-2006_learnInd.pdf Rogers, P. (2006). Young people’s participation in
the renaissance of public space: A case study of
Kyttä, M., Kaaja, M., & Horelli, L. (2004).
Newcastle upon Tyne, UK. Children. Youth and
An internet-based design game as a media-
Environments, 16(2), 105–130.
tor of children’s environmental visions. En-
vironment and Behavior, 36(1), 127–151. Sager, T. (1994). Communicative Planning Theory.
doi:10.1177/0013916503254839 Avebury. UK: Avebury Press.

813
Children’s Maps in GIS

Sager, T. (2006). The logic of critical com- ULI. (2008). Lägesbild GI Sverige: en lägesbild
municative planning: Transaction cost al- av användandet av geografiska informations-
teration. Planning Theory, 5(3), 223–254. sytem i Sverige, resultat från delundersökningen
doi:10.1177/1473095206068629 offentliga sektorn (Situational awareness GI
Sweden: Study on use of geographic informa-
Shiers, H. (2001). Pathways to participation: Open-
tion and geographic information technology in
ings, opportunities and obligations. Children &
Sweden, public sector). Retrieved from http://
Society, 15, 107–117. doi:10.1002/chi.617
www.geoforum.se/page/158/332
Skolverket. (2009). IT-användning och IT-kom-
UNICEF. (1990). Convention on the rights of the
petens i förskola, skola och vuxenutbildning (IT
child. In First Call for Children (pp. 43–79). New
usage and IT skills in preschool, school and adult
York: UNICEF.
education) (Dnr 75-2007:3775). Retrieved from
http://www.skolverket.se/publikationer?id=2192 UNICEF. (2001, March). The children as com-
munity researchers. Retrieved November 15, 2009
Täby kommun. (2009a). Det nya Täby: Översikt-
from http://www.unicef.org/teachers/researchers/
splan 2010-2030, utställningshandling maj 2009,
allmänna intressen (The new Täby: Comprehen- United Nations. (1993). [Earth summit - the United
sive plan 2010-2030, consultation May 2009, Nations programme of action from Rio. New York:
public interests). Retrieved June 20, 2009 from United Nations Dept. of Public Information.].
http://www.taby.se Agenda (Durban, South Africa), 21.
Täby kommun. (2009b). Plats att växa (Place to Urban Green Spaces Taskforce. (2002). Green
grow). Retrieved June 13, 2009 from http://www. spaces, better place: Final report. London: De-
taby.se/Miljo-natur-halsa/Planeringsdokument/ partment for transport, local government, and
Plats-att-vaxa/ the regions.
The FreeChild Project. (2008). Retrieved No- Vonk, G., Geertman, S., & Shot, P. (2005). Bottle-
vember 15, 2009 from http://www.freechild.org necks blocking widespread usage of planning
support systems. Environment & Planning A,
Tonucci, F., Prisco, A., & Horelli, L. (2004). A
37, 909–924. doi:10.1068/a3712
comparison of the models for children’s participa-
tion in decision-making, in Rome and Helsinki. Wilhjelm, H. (1999). Hvor har du vært? - ingen
In L. Horelli & M. Prezza (Eds.), Child-Friendly steder: Miljøtilknyttende infrastruktur og barns
Environments: Approaches and lessons (pp. hverdagsliv - en kunnskapsoversikt (Where have
83-93). Espoo, Finland: Helsinki University of you been? nowhere: Environmental infrastructure
Technology. and children’s everyday life- a review). Trond-
heim, Norway: Norsk senter for barneforskning.
Woolley, H., Spencer, C., Dunn, J., & Rowley, G.
(1999).Thechildascitizen. JournalofUrbanDesign,
4(3), 255–282. doi:10.1080/13574809908724451

This work was previously published in the International Journal of Information Communication Technologies and Human
Development (IJICTHD), Volume 2, Issue 2, edited by Susheel Chhabra and Hakikur Rahman, pp. 1-16, copyright 2010 by
IGI Publishing (an imprint of IGI Global).

814
815

Chapter 50
Landscape Biodiversity
Characterization in Ecoregion
29 Using MODIS
Nitin Kumar Tripathi
Asian Institute of Technology, Thailand

Aung Phey Khant


Asian Institute of Technology, Thailand

ABSTRACT
Biodiversity conservation is a challenging task due to ever growing impact of global warming and cli-
mate change. The chapter discusses various aspects of biodiversity parameters that can be estimated
using remote sensing data. Moderate resolution satellite (MODIS) data was used to demonstrate the
biodiversity characterization of Ecoregion 29. Forest type map linked to density of the study area was
also developed by MODIS data. The outcome states that remote sensing and geographic information
systems can be used in combination to derive various parameters related to biodiversity surveillance
at a regional scale.

1. INTRODUCTION of natural selection to fit in the constellation of


certain environmental factors or niches. In the
A natural environment is self-renewing, self- process, they help to sustain others, each species
perpetuating and stable one, in which every or- controlling its own population growth and at the
ganism contributes in some way, however, small same time limiting of other species, so that a
to the overall stability. In natural ecosystems, reasonable ecological balance may be achieved
the plants and animals have evolved at their own and maintained for hundreds of years The satellite
pace and in their own way under the influence remote sensing can identify the important param-
eters for biodiversity characterization like size,
DOI: 10.4018/978-1-4666-2038-4.ch050 fragmentation, porosity, patchiness, interspersion

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Landscape Biodiversity Characterization in Ecoregion 29 Using MODIS

and juxtaposition at the landscape level. The role Geographical Information System (GIS) have
of remote sensing is emphasized in quick ap- proven to be very effective tools to analyze the
praisal of regional biodiversity surveillance. This landscape patterns for biodiversity characteriza-
becomes of high importance in present context of tion at various levels.
biodiversity loss due to climate change.
There is an urgent need to inventory and
monitor indicators of biological diversity such as 2. STUDY AREA
species richness and habitats. Remotely sensed
data provide a means to accomplish part of this Kayah-Kayin and Tenisserim ecoregion (Ecore-
task, but there has been no comprehensive scien- gion 29) are the richest in species in mainland
tific framework to guide its effective application Southeast Asia, for this area is the cross road to
(Stoms & Estes, 1993). Most of the discussion exchange species among different geographic
concerning potential roles for remote sensing in regions of Holartic, Oriental and Greater Sundas
biodiversity assessment has come from conserva- Island. On the other hand, this ecoregion forming
tion biologist and ecologists (Soule & Kohm, 1989; a juncture of the Indo-Chinese, Indo-Burmese, and
Noss, 1990; Lubchenco et al., 1991). The remote Malaysian floral and faunal elements. Formerly,
sensing community has had little involvement to it is divided into the Kayah-Karen Montane
date in supporting biodiversity research, largely Rain Forests (ecoregion 51: Table 1) and the
concentrating instead in the global change domain Tenasserim-South Thailand Semi-evergreen Rain
(Stoms & Estes, 1992). Very little quantitative Forests (ecoregion 53: Table 2). In the year 2000,
analysis has been accomplished to determine the World Wide Fund For Nature (WWF) scientists
actual value of remote sensing and geographic team combined these two regions and defined
information systems in biological research. as ecoregion (29). The flora and fauna in this
Remote sensing provides spatial data, which region is distinct and includes several endemic
are less used but they are the powerful source to species. Among the ecoregions of Indochina, this
acquire accurate, up-to-date information essential ecoregion contains some of the highest diversity
for conserving biodiversity and wildlife habitat of both bird and mammal species found in the
mapping. Although technically complex, the Indo-Pacific region
remote sensing techniques have revolutionized This ecoregion encompasses the mountainous,
the process of data gathering and map making. semi-evergreen rain forests of Thailand, Myanmar
The combinations of Remote Sensing (RS) and and Malaysia, and includes the extensive lowland

Table 1.­

Ecoregion Name : Kayah-Karen Montane Rain Forests


Bioregion : Indochina
Major Habitat Type : Tropical and subtropical moist broadleaf forest
Ecoregion Number : 51
Political Unit(s) : Myanmar, Thailand
Ecoregion Size : 119,200 km2
Biological Distinctiveness : Globally outstanding
Conservation Status : Relatively intact
Conservation Assessment : III

816
Landscape Biodiversity Characterization in Ecoregion 29 Using MODIS

Table 2.­

Ecoregion Name : Tenasserim-South Thailand Semi-evergreen Rain Forests


Bioregion : Indochina
Major Habitat Type : Tropical and subtropical moist broadleaf forest
Ecoregion Number : 53
Political Unit(s) : Myanmar, Thailand, Malaysia
Ecoregion Size : 96,900 km2
Biological Distinctiveness : Globally outstanding
Conservation Status : Relatively intact
Conservation Assessment : III

plain that lie between the peninsular mountains zone between the subtropical broadleaf evergreen
and which until recent decades supported exten- forests in the north and the southern tropical and
sive lowland forest. The southern margin of this dry deciduous forests in Tenesserim and southern
ecoregion is defined by the Kangar-Pattani flo- Thailand (Figure 2). Tropical hardwood trees in
ristic boundary (Whitmore & Sayer, 1992). Figure the family Dipterocarpaceae dominate forests
1 shows the geographic extent of the study area. throughout the ecoregion. Forests to the east are
dominated, especially at the lower elevations,
2.1. Vegetation trees that have a drought-deciduous phenology,
while the west-facing slopes are a mixture of de-
The vegetation of this ecoregion includes both ciduous and evergreen species. At low elevation
tropical and subtropical moist broadleaf forest, (<= 1000 m) on the east side of the Tenasserim
montane forests, lowland rainforest; and a higher Hills, the potential vegetation consists of drought
proportion of evergreen broad-leaved species. This deciduous forest or savanna woodland. Higher
ecoregion represents the semi-evergreen forests of elevations support much richer broad-leaved
the Kayah-Karen Mountains in the broad transition forest communities with a mixture of evergreen

Figure 1. Map of Ecoregion 29

817
Landscape Biodiversity Characterization in Ecoregion 29 Using MODIS

Figure 2. Land use map of study area


biological surprises. However, after Thailand
banned timber exploitation in its forests in 1988,
Myanmar granted large logging concessions to
Thai companies, and illegal timber extraction in
Myanmar by Thai loggers has become common
in recent years (WWF & IUCN 1995).

2.2. Fauna

This ecoregion contains one of the most intact


vertebrate faunas of Indochina, including one of
the richest assemblages of mammals in Asia. The
fauna is also distinctive, with characteristics of
the islands of the Malay Archipelago as well as
the mountains of China and India. The relatively
intact and contiguous hill and montane habitat
has potential for conserving large landscapes
that will provide adequate habitat to maintain
a viable population of Asia’s largest carnivore,
the tiger (Panthera tigris), and Asian elephant
(Elephas maximus). This ecoregion lies within a
high priority, Level (I) Tiger Conservation Unit
(Dinerstein et al. 1997). This range of forests in
conjunction with the Kayah-Karen Mountains
and deciduous species. Forests of teak Tectona represents some of the best landscapes for Asian
grandis represent the climax vegetation at low elephant conservation in Indochina. Numerous
elevation in the absence of fire, but today the other mammals are of conservation significance,
teak forests are nearly extirpated in Thailand primarily the elusive and endemic Fea’s muntjac
and also declining rapidly in Myanmar. At 800- (Muntiacus feae). The population of the Malayan
1200m, a well-developed undestroyed shrub tapir (Tapirus indicus), the only Old World tapir
grows beneath a tall, closed forest canopy that representative, has been drastically reduced. It
includes some very large, buttressed trees that survives in the hill and montane protected areas
share an affinity with tropical Asia together with of this ecoregion and scattered pockets throughout
temperate tree taxa in the families Magnoliaceae peninsular Malaysia and Sumatra. More than 25
and Lauraceae (IUCN, 1991). Although fire is pairs have been found in some of the last remain-
frequent today, there is little consensus as to the ing forest, and that forest is now contained within
historical frequency of fire or its importance in Bang Kram wildlife sanctuary (Stewart-Cox,
this ecosystem. An important unresolved question 1995). Several primate species are found in these
is whether fire (mostly anthropogenic) or pre- forests, and include the threatened banded langur
monsoon drought stress (non-anthropogenic) is (Trachypithecus melalophus) and slow loris (Loris
primarily responsible for limiting species diversity nycticebus), a small, nocturnal prosimian. Other
in these places. This ecoregion remains relatively species of conservation concern include the Dyak
unexplored scientifically, especially those parts fruit bat (Dyacopterus spadiceus), the endangered
that lie in Myanmar, it will very likely yield more clouded leopard (Pardofelis nebulosa), common

818
Landscape Biodiversity Characterization in Ecoregion 29 Using MODIS

leopard (Panthera pardus), sun bare, binturong significance. Several of Myanmar and Thailand’s
(Arctictis binturong), gaur (Bos gaurus), and largest and most intact wildlife reserves lie within
banteng (Bos javanicus) (Stewart-Cox, 1995). this ecoregion, including Myinmo-let-khat Na-
The wide diversity of habitats within this tional Park (proposed) and Huai Kha Khaeng
ecoregion, from deciduous forests in the north to Wildlife Sanctuary (2,575 km2) and several other
seasonal evergreen forests in the south, habitats protected areas with which it forms a contigu-
lowland to montane, make it one of the richest in ous network. Huai Kha Khaeng is prized for the
bird species for the entire Indo-Pacific. A total of high diversity of cat species it supports, and it’s
560 bird species have been recorded there. The relatively intact vertebrate communities and intact
ecoregion is the fourth richest in the Indo-Pacific lowland dipterocarp forests. Moister habitats on
region for mammals with 168 known species. the Myanmar side of the Tenasserim ranges also
These include one ecoregional endemic species, include significant amounts of intact habitat,
the tiny Kitti’s hog-nosed bat (Craseonycteris probably still in better condition overall than the
thonglongyai). Some of the other mammals of con- forest on the eastern (Thailand) side of the range.
servation importance include several threatened However it is difficult to assess ecological condi-
species such as the tiger (Panthera tigris), Asian tions in the forests of eastern Myanmar at this time.
elephant (Elephas maximus), gaur (Bos gaurus), The existing (50) protected areas that cover
banteng (Bos javanicus), wild water buffalo (Buba- 32% (35,030 km2) of the ecoregion and most of
lus arnee), southern serow (Naemorhedus suma- these protected areas are located in Thailand. Large
traensis), clouded leopard (Pardofelis nebulosa), blocks of intact seasonal evergreen forest habitats
Malayan tapir (Tapirus indicus), wild dog (Cuon are still remaining in Myanmar, but these are
alpinus), Asiatic black bear (Ursus thibetanus), mostly not protected. Some protected areas have
Assamese macaque (Macaca assamensis), stump- been designated in the portion of this ecoregion
tailed macaque (Macaca arctoides), smooth- that lies within Myanmar, but their effectiveness
coated otter (Lutrogale perspicillata), great Indian is difficult to assess at this time due to the political
civet (Viverra zibetha), and Particoloured flying instability of the region.
squirrel (Hylopetes alboniger). Sumatran rhinoc-
eros is believed to have inhabited remote regions 2.3. Agriculture
of the Tenasserim Hills in recent years, but this
critically endangered species is now thought to Shifting cultivation is the main cause of defor-
have been extirpated from this ecoregion. Hunt- estation throughout the region. But in areas like
ing has decimated most of the large mammal northern Thailand, where previously nomadic
populations, such as elephant, banteng, gaur and tribal peoples have been settled, pressure exists to
tiger (IUCN, 1991). On the other hands, many convert forest into more intensive agricultural land
species are declining to unsafe population levels, devoted to cash crops like cabbage, coffee, and
important habitats are destroyed, fragmented and lychee. Opium replacement efforts, in Thailand,
degraded, and the ecosystems are destabilized have compelled local people to grow alternative
through climate change, population, invasive crops that require more cultivated land area and
species, and human impact. higher pesticide inputs. The land requirements
The relatively intact, contiguous habitat has of an increasing population have forced itinerant
potential to conserve large landscape that will farmers to reduce the cycle of cultivation-fallow
provide adequate habitat to maintain a viable periods, and have pushed them deeper into the
population of Asia’s largest carnivore, the tiger, forest and into more marginal areas.
as well as other species of critical conservation

819
Landscape Biodiversity Characterization in Ecoregion 29 Using MODIS

Lowland area in Peninsular Thailand (Bang 2.6. Climate


Kram) still supports a significant amount of late
successional forest. Other areas support vast Annual precipitation increases southward as
tracts of rubber plantation, monocultures of a fast- the length of the dry season and the magnitude
growing, short-lived tree species native to South of pre-monsoon drought stress declines. The
America, and plantations of oil palm. Pineapple southern mountain ranges receive rain from both
is grown there as a rotation crop following the the northeast and southwest monsoons so that,
removal of senescent rubber trees. Paddy is also unlike mountain ranges further north, there is
grown in some lowland areas. Unfortunately, no significant rainshadow. The entire region has
none of these crops with the possible exception of a monsoon climate with warm, moist summers
paddy provide any significant support for natural and mild winters that tend to be dry. Overall an-
biological diversity. Hill slopes support more nual rainfall average is 1,500 mm to 2,000 mm.
native forest than the lowland areas, and the hill Although this ecoregion lies within the Tropic of
forests of Southern Thailand are relatively intact, Cancer, winter temperatures can be quite cool,
although swidden (slash and burn) agriculture is especially at the higher elevations where frost
still practiced in hill areas in the northern part has been recorded from the northern part of the
of the ecoregion. Mature forest cut for swid- ecoregion. West-facing slopes (Myanmar side)
den agriculture is generally succeeded in this face the Bay of Bengal and receive more precipi-
ecoregion by a grassy sub-climax that supports tation; and East-facing slopes (Thailand side) lie
far fewer species than the mature forest (IUCN, within a partial rainshadow and tend to be drier.
1991). General land-use map of the study area is This climatic difference is clearly reflected in the
shown in Figure 2. vegetation. The Köppen climate system placed
this ecoregion in the Tropical Wet Climate Zone
2.4. Geology (Townsend & Cohoon, 1999).

Ecoregion (29) consists of hills of Paleozoic lime-


stone that have been much dissected by chemical 3. DATA USED
weathering. The overhanging cliffs, sinkholes, and
caverns characteristic of tropical karst landscapes Fortunately, enough data is available with institu-
are all-present in this ecoregion. tions and agree to share the information. These
are Biogeographical zone map, protected areas
2.5. Landscape Structure networks and species list from WWF-Thailand,
topographic/ vegetation/ LU & LC/ geology/ soil
Terrain throughout much of this ecoregion is maps from Department of Forestry (Myanmar and
rugged and intricately folded. Hillsides tend to Thailand); and annual rainfall and temperature for
be steep and ridges exceed 2,500 m elevation. ten years from Department of Geography. Lists of
Valley bottoms are narrow but fertile and tend to data used for this study are as follows:
lie at about 300 m elevation. The western slopes
drain into the Salween River that flows through • MODIS satellite data for study area with
Myanmar and entered Gulf of Martaban in the the spatial resolution of 250 m.
Indian Ocean. The eastern slopes drain into the • Digital map of entire ecoregion with the
Chao Phraya River that drains into the gulf of scale of 1: 250000
Thailand. • Topographic/ Vegetation/ LU&LC/
Geology/ Soil map

820
Landscape Biodiversity Characterization in Ecoregion 29 Using MODIS

Figure 3. Kayah-Kayin and Tenisserim ecoregion


• Annual rainfall and temperature for 10
(Ecoregion 29) in MODIS image
years
• Other ancillary data

MODIS image in Figure 3 shows the overview


of the ecoregion 29 and surrounding. This image
was utilized to extract the biodiversity character-
istic of the ecoregion.

4. BIODIVERSITY RICHNESS
ESTIMATION MODEL

Growing concerns over the loss of biodiversity


has spurred land managers to seek better ways of
managing landscapes at a variety of spatial and
temporal scales. A number of developments have
made possible the ability to analyze and manage
entire landscapes to meet multi-resource objec-
tives. The developing field of landscape ecology
has provided a strong conceptual and theoretical
basis for understanding landscape structure, func-
tion, and change (Forman & Godron 1986; Urban
et al., 1987; Turner, 1989). Growing evidence that
habitat fragmentation is detrimental to many spe-
cies and may contribute substantially to the loss
of regional and global biodiversity (Saunders et ments. Advanced computation and massive data
al., 1991; Harris, 1984) has provided empirical archives would produce quantitative improvement
justification for the need to manage entire land- in analysis of biodiversity through more accurate
scapes, not just the components. and efficient solution. Visualization and interac-
As growing numbers of researchers and re- tivity would produce greater understanding and
source managers rely on digital geographic data, clarity on the status of the environment and effort
and look to remotely sensed imagery as a source to maintain it.
of data for their GIS, issues of geographic data A landscape is composed typically of several
models and remote sensing scene models (Good- types of landscape elements. Of these, the matrix is
child, 1992, 1994; Strahler et al., 1986), as well the most extensive and most connected landscape
as image processing algorithm development, are element type, and therefore plays the dominant role
central to research on the integration of remote in the functioning of the landscape. As landscape
sensing and GIS. patterns are complex and heterogeneous, variety
A rotational approach to the management of of matrices are needed to measure different aspect
biological resources requires advanced technol- of patterns to characterize biodiversity. Digital
ogy in many areas of information processing. satellite data provides a consistent and complete
The conceptual framework is quite complex, with spatial data for a very large area and may be con-
many interactions between species and environ- sidered very useful tool for measuring landscape

821
Landscape Biodiversity Characterization in Ecoregion 29 Using MODIS

patterns (patch, fragmentation, terrain complexity, 4.1 Indices for Biodiversity


porosity, interspersion and juxtaposition) to esti- Characterization
mate biodiversity richness together with species
diversity index (Shannon and Simpson). And also, Species Diversity Index
the landscape spatial configurations influence
ecological processes such as biodiversity, habitat Species diversity can be described as the number
or animal population dispersal and abundance. of species in a sample or habitat per unit area. The
Landscapes are distinguished by spatial rela- indices are as follow:
tionships among component parts. A landscape
can be characterized by both its composition Shannon’s diversity index (log base 10 and natu-
and configuration, and these two aspects of a ral log)
landscape can independently or in combination
affect the ecological processes and organisms. H = -Σ(Pilog[Pi])
The difference between landscape composition
and configuration is analogous to the difference H = -Σ(Piln[Pi])
between floristic and vegetation structure that are
commonly considered in biodiversity studies at Simpson’s diversity index (D)
the within-patch scale.
Spatial data layers are generated using digital D = Σ(Pi2)
satellite data through digital image processing
and classification. In addition to that spatial and D = 1-Σ(Pi2)
non-spatial data from other ancillaries data sources
are collected. Database is generated using GIS Where,
software and analyzed. Then biodiversity char-
acterization was done analyzing the landscape Pi = the total number of species
parameters of patch, fragmentation, porosity, H = Shannon’s index value
interspersion and juxtaposition, and using ground D = Simpson’s index value
base data on species richness, terrain complexity
and diversity index. Fragmentation
First, and more importantly, subsets of par-
ticular landscape have to be analyzed in order to Forest fragmentation occurs when large, continu-
evaluate the distribution of spatial characteristics ous forests are divided into smaller blocks, either
(to detect the high diversity or where the habitat by roads, clearing for agriculture, urbanization,
is more endangered because of the loss of con- or other human development.
nectivity).
N
Therefore, it is necessary to know how the F = ∑ Di
extent of analyzed pattern is going to influence the i=1
results of estimation and whether the obtained val-
ues are reliable to the entire region under analysis. Where,

F = Index value of Fragmentation


Di = dissimilarity value for the ith boundary be-
tween adjacent cells

822
Landscape Biodiversity Characterization in Ecoregion 29 Using MODIS

N = number of boundaries between the adjacent Porosity


cell
Porosity is the measure of number of patches or
Generally, forest type map is classified into density of patches within a landscape, considers
two classes such as forest and non-forest. A grid only number of patches, not size.
cell (n * n) is convolved with the spatial data layer
with a deriving number of forest patches within n
PO = ∑ Cpi
the grid cell. This will be repeated by moving l=1
the grid cell through the entire spatial layer. The
output layer with patches was derived.
Where, Cpi = number of closed patches of ith
cover class
Patchiness
Interspersion
Patchiness is a measure of the density of patches
of all types or number of clusters in given mask.
Interspersion is count of dissimilar neighbors with
It is a measurement of number of polygons over
respect to central pixel or measurement of the
a particular area.
spatial intermixing of the vegetation types. It can
also be used to represent the landscape diversity.
Pj = ((∑Di) / Nj ) * 100
n
Where, I = ∑ (∑ Fi / n)
l=1

N = Nj is the number of boundaries between


adjacent intervals along Transect j Where, Fi is shape of factor
Di = Dissimilarity value for the i th boundary For the determination of interspersion a
between adjacent Cells ‘Convolution’ will be used with the forest type
map to compute the number of dissimilar pixels
In landscape ecology, patches are spatial in the nearest neighborhood. The computation
units at the landscape scale. Patches are areas is performed in an interactive mode through the
surrounded by matrix, and may be connected entire spatial layer to derive an output intersper-
by corridors. The geomorphology of the land sion layer. Calculation of interspersion gives the
interacting with climate factors, along with the magnified view of resistance; the central pixel or
other factors such as the establishment of flora class has with respect to its surroundings.
and fauna, soil development, natural disturbances,
and human influences work to determine patch Juxtaposition
size, shape, location, and orientation (Forman &
Godron, 1986). Juxtaposition is defined as measure of proximity
Five major types of patches: of the vegetation of vegetation. It’s measurement
includes relative weight assigned by the impor-
(1) spot disturbance patch tance of the adjacency of two cover types for the
(2) remnant patch species in question. It is species-specific measure-
(3) environmental resource patch ment i.e. edge between cover and quality of edge.
(4) introduced patch
(5) ephemeral patch J = ∑ Di (Ji) / Jmax

823
Landscape Biodiversity Characterization in Ecoregion 29 Using MODIS

I=1 PD equals the number of patches of the correspond-


ing patch type (NP) divided by total landscape
Where, area, multiplied by 10,000 and 100 (to convert
to 100 hectares).
Di = the shape desirability weight for each cover
type combination Landscape Shape Index (LSI)
Ji = the length of edge between combination of
cover types on either side of an edge m

Jmax = the average total weighted edge per habitat ∑e ik

unit of good habitat LSI = k-1



2 À_A
A grid cell of size 5 (e.g. 5*5) is convolved
with the derived layer in an interactive manner When LSI = 1, the landscape consists of a single
by assigning higher weight to natural vegetation patch of the corresponding type and is circular
and lower weight to unnatural vegetation. The (vector) or square (raster); LSI increases without
juxtaposition helps in characterizing the param- limit as landscape shape becomes more irregular
eter porosity with respect to natural or unnatural and/or as the length of edge within the landscape
(manmade) vegetation type. Since porosity can be of the corresponding patch type increases.
understood as one of the important factors influ- There are a number of landscape configuration
encing the disturbance index, the added weightage metrics to characterize the biodiversity spatially.
through juxtaposition gives the right perspective The spatial statistics output table created from a
in ultimately deriving disturbance index. landscape pattern analysis contains all the results
of the analysis (Class area, Total landscape area,
4.2 Landscape Configuration Metrics number of patches, Total edges, Edge density,
for Biodiversity Characterization Mean shape Index, Fractal Dimension, Mean Near-
est Neighbor Distance, Mean Proximity Index,
Landscape Similarity Index (LSIM) Interspersion and Juxtaposition Index, Shannon’s
Diversity Index, Shannon’s Evenness Index).
n Mean Nearest Neighbor (MNN), Measure
∑a ij of patch isolation, means the nearest neighbor
j=1
LSIM = Pi = (100) distance of an individual patch is the shortest
A
distance to a similar patch (edge to edge). The
mean nearest neighbor distance is the average of
LSIM equals total class area (m2) divided by total
these distances (meter) for individual classes at
landscape area (m2), multiplied by 100 (to convert
the class level and the mean of the class nearest
to a percentage); in other words, LSIM equals
neighbor distances at the landscape level.
the percentage of the landscape comprised of the
corresponding patch type.
5. RESULTS AND DISCUSSIONS
Patch Density (PD)
The improvement in spectral, spatial and temporal
ni resolutions and better digital image processing of
PD = (10, 000)(100)
A remotely sensed data has kept pace with the infor-
mation needs of the characterizing biodiversity.

824
Landscape Biodiversity Characterization in Ecoregion 29 Using MODIS

The geographic information system (GIS) tool has Patch shape and orientation also play an im-
developed fast with greater facility of large area, portant ecological role. An ecologically optimum
multiple spatial and non-spatial data integration patch shape usually has a large core with some
and analysis. curvilinear boundaries and narrow lobes (Forman,
1995). This shape may allow both interior species
5.1 Results Discussion and edge species to flourish. Patch shapes also
on Biodiversity Richness determine the edge length.
Estimation Model Porosity is very important parameter for
conservation aspect at landscape and regional
In the biodiversity richness estimation model, the level. The porosity (1) provides an over all clue
most significant landscape patterns, which can be to the change of species isolation present and to
extracted from the digital satellite data were used the potential genetic variability present within
for biodiversity characterization. population of animals and plants in regional
For meaningful estimation and characterization level, (2) sometime indicates the presence of high
of biodiversity, the degree of variation of each conservation value and lower interaction among
index is related to landscape patterns that are landscape elements, homogeneity and low frag-
used for biodiversity richness estimation model mented habitats, (3) higher porosity value indicates
and not to artifacts derived from the methodologi- the higher interaction among landscape elements,
cal problems involved in the measurement. This heterogeneous and high fragmented habitats, and
uncertainty associated with the index estimated (4) disturbance in natural landscape increases
is arguably one of the major limitations of this patch density and decrease matrix connectivity.
kind of quantitative analysis for biodiversity Interspersion Juxtaposition Index, Measure of
characterization. And it is also greatly related to patch adjacency, the result will be zero when the
the scale such as spatial resolution of satellite data distribution of unique patch adjacencies becomes
and the spatial extent (i.e. total area). uneven and 100 when all patch types are equally
The biodiversity richness estimation model can adjacent. At the landscape level it is a measure of
provide variety of landscape value and allowing the interspersion of the each patch in the landscape.
obtaining wide range of patterns with intermediate The dispersal ability of species depends on
level of spatial differences, in which patchiness spatial organization of landscape. The fragile
and fragmentation is generated. or rare species occur only in highly connected
The use of these indices and landscape con- landscape mosaic. Higher value of interspersion
figuration matrices instead of real world landscape
data is preferable in this study because this ap-
proach is possible to generate and isolate different
factors affecting the landscape to characterize the Table 3. Indices for Biodiversity characterization
biodiversity. of study area
The influences of fragmentation, patchiness,
No. Index name Result value
porosity, interspersion and juxtaposition can be
1 Patchiness 60.98
adequately separated, and thus the biodiversity
2 Fragmentation 2.009
richness estimation using landscape patterns and
3 Porosity 2,962
configuration metrics can be specifically analyzed
4 Interspersion/ Juxtaposition 49.015
avoiding confusion with other land cover data
5 Landscape similarity 48.07
(Table 3).
6 Patch fractal dimension 33

825
Landscape Biodiversity Characterization in Ecoregion 29 Using MODIS

means dispersal ability of the central class will is increasingly becoming recognized as a major
be low or reduce. cause of declining biodiversity.
Landscape Similarity Index Matrices ap- The use of diversity measures in community
proaches 0 when the corresponding patch type ecology has been heavily criticized because diver-
(class) becomes increasingly rare in the landscape. sity conveys no information on the actual species
LSIM equal to 100 when the entire landscape con- composition of a community. Species diversity is
sists of the corresponding patch type; that is, when a community summary measure that does not take
the entire image is comprised of a single patch. into account the uniqueness or potential ecological,
Fractal dimension is a measure of shape com- social, or economical importance of individual
plexity that can be computed for each patch and species. A community may have high species
then for landscape, or it can be computed from diversity yet be comprised largely of common
the landscape as a whole. or undesirable species. Conversely, a community
These diversity measures are influenced by may have low species diversity yet be comprised of
two components: (1) richness and (2) evenness. especially unique, rare, or highly desired species.
Richness refers to the number of patch types Although these criticisms have not been discussed
present and evenness refers to the distribution explicitly with regards to the landscape ecological
of area among different types. Richness and application of diversity measures, these criticisms
evenness are generally referred to as the compo- are equally valid when diversity measures are
sitional and structural components of diversity, applied to patch types instead of species. In addi-
respectively. Simpson’s diversity index is more tion, these diversity indices combine richness and
sensitive to richness than evenness. Thus, rare evenness components into a single measure, even
types have a disproportionately large influence though it is usually more informative to evaluate
on the magnitude of the index. This diversity richness and evenness independently (Tables 4
index has been applied to measure one aspect of and 5, Figures 4, 5 and 6).
landscape structure and landscape composition Therefore, a landscape with many habitats will
to characterize biodiversity. Thus, the higher the be richer than a less heterogeneous one. How-
value the greater the likelihood that any randomly ever, if habitat patches become too fragmented
drawn patches would be different patches that is and disjunctive, as typically results from human-
greater in diversity. induced land use conversion, regional richness
In many biodiversity applications, the pri- declines.
mary interest is in the amount and distribution
of a particular patch. Forest fragmentation is a 5.2 Forest Type Distribution
landscape-level process in which forest tracts
are progressively sub-divided into smaller, geo- The potential of remote sensing and Geographic
metrically more complex, and more isolated forest Information Systems is displayed in providing
fragments as a result of both natural processes and accurate and timely information on forest density
human land use activities. This process involves map. The forest types are classified base on the
changes in landscape composition, structure, and dominant species composition. As a result, in the
function and occurs on a backdrop of a natural study area various forest types were classified and
patch mosaic created by changing landforms mapped with reasonable amount of accuracy. The
and natural disturbances. Forest fragmentation vegetation type map of ecoregion 29 is classified
is the prevalent trajectory of landscape change into nine types: (1) Deciduous Broadleaf Forest,
in several human dominated forest regions, and (2) Semi-deciduous Broadleaf Forest, (3) Ever-
green Needleleaf Forest, (4) Lowland Evergreen

826
Landscape Biodiversity Characterization in Ecoregion 29 Using MODIS

Table 5. Overall species richness estimation

No. of plots Estimation

Summarized Index
1 31.6
2 45.4
3 55.4

9.997
0.785
1.179
1.502

0.110
4 59.4
5 61.6
Plot 12

6 64.4
0.703
0.195
1.058
1.505

5.12
7 66.2
8 67.2
Plot 11

0.096
10.45
1.196
1.519
0.788

9 67.8
10 68.4
Plot 10

11 68.6
0.735
0.147
6.826
1.107
1.505

12 69
Plot 9

7.727
0.676
0.129
1.052
1.556

Broadleaf Rain Forest, (5) Semi-evergreen Moist


Plot 8

10.575
0.799
0.095
1.192
1.491

Broadleaf Forest, (6) Upper Mountain Forest,


(7) Lower Mountain Forest, and (8) Mangrove
Plot 7

11.686

Forest (Table 6).


0.808
0.086
1.228
1.519

The degradation activities such as extensive


Table 4. The results of Shannon’s and Simpson’s species richness index.

shifting cultivation, excessive logging, increasing


Plot 6

16.228
1.431
0.887
0.062
1.27

in human population, development of infrastruc-


ture, and conversion of forest habitats to agricul-
Plot 5

10.383
1.415
0.841
0.096

tural land have altered the natural landscape to


1.19

great extent in ecoregion 29. Because of these


Plot 4

10.008
1.232
1.58
0.78
0.1

Table 6. Forest density distribution


Plot 3

14.046
0.848
0.071
1.287
1.519

Percentage
No. Forest Type Area (km2)
(%)
Plot 2

9.577
0.789
0.104
1.229
1.556

Deciduous/semi-
1 deciduous broadleaf 26,886.55 12.45
forest
Plot 1

0.136
7.343
1.101
1.431
0.77

Disturbed natural
2 20,438.47 9.46
forest
3 Evergreen forest 4,597.43 2.13
Simpsons Diversity (1/D)
Simpsons Diversity (D)
Shannon Hmax Log10.

Semi-evergreen moist
4 55,246.34 25.58
Shannon H’ Log 10.

broadleaf forest
Index

5 Lower montane forest 22,093.60 10.29


Shannon J’

6 Upper montane forest 1,327.88 0.67


7 Mangrove 839.82 0.38
8 Others 84,521.71 39.04
Total 215,951.79 100.00

827
Landscape Biodiversity Characterization in Ecoregion 29 Using MODIS

Figure 4. Graph of Shannon’s species richness index

Figure 5. Graph of Simpson’s species richness index

Figure 6. Graph of overall species richness estimation

828
Landscape Biodiversity Characterization in Ecoregion 29 Using MODIS

increased anthropogenic activities, the natural tion of natural habitats and have brought in lots
landscape have become fragmented. In addition of biodiversity. Figures 7 and 8 show the status
to forest cover lost, fragmentation has a great of forest in ecoregion 29.
impact on biodiversity. Bio-habitat degradation
due to fragmentation is also considered to be one
of the major threats to biodiversity and often re- 6. CONCLUSION
duced genetic diversity, consequently.
The fragmentation of forest communities Study demonstrates that MODIS data can be
leads to high patch density and also affected to used to characterize biodiversity and help the
various ecological processes such as species dis- understanding of landscape diversity and ter-
tribution and degradation of habitats of keystone restrial ecosystem functions at the regional or
and indicator species. For core habitat zonation, landscape level.
as well as mapping of habitat for endangered The assessment to calculate the biodiversity
species deals with these transformations. These richness using important indices (Fragmentation,
land transformations have resulted in the altera- Patch, Terrain complexity, Porosity, Interspersion

Figure 7. General status of Ecoregion 29

Figure 8. Forest density distribution

829
Landscape Biodiversity Characterization in Ecoregion 29 Using MODIS

and Juxtaposition, Species richness, etc.) includ- Dinerstein, E. (1997). A framework for identifying
ing landscape configuration metrics generated high priority areas and actions for the conserva-
important quantitative information about the status tion of tigers in the wild. World Wildlife Fund-US.
of different resources and landscape patterns of
Foreman, R. T. T. (1995). Some general prin-
Ecoregion 29. The use of these index indicate the
ciples of landscape and regional ecology. Land-
status of entire landscape and, therefore, can be
scape Ecology, 10(3), 133–142. doi:10.1007/
assumed as essential tools for biodiversity appli-
BF00133027
cation towards conservation strategies to prevent
biodiversity loss. Foreman, R. T. T., & Gordon, M. (1986). Land-
As known, the biodiversity is a reflection of scape ecology. New York, NY: John Wiley & Sons.
several biotic, abiotic and climatic factors, there-
Goodchild, M. F. (1994). Integrating GIS and
fore the ecological relationships was considered
remote sensing for vegetation analysis and model-
as the most important underlying factor for their
ing: Methodological issues. Journal of Vegetation
in situ conservation.
Science, 5(5), 615–626. doi:10.2307/3235878
Earlier study lack using moderate resolution
satellite data such as MODIS towards biodiversity Harris, L. D. (1984). The fragmented forest: Is-
mapping. This data is regularly available and is land biogeography theory and the preservation
well suited for regional level biodiversity surveil- of biotic diversity. Chicago, IL: University of
lance (inventory and monitoring) to extract the Chicago Press.
data and information of landscape patterns, land
IUCN. (1991). Caring for the Earth. The World
cover, and vegetation. Conventional methods for
Conservation Union, Gland.
biodiversity mapping are very costly and time
consuming. Since biodiversity loss and ecological Lubchenco, J., Olson, A. M., Brubaker, L. B.,
imbalance are the major concern of climate change Carpenter, S. R., Holland, M. M., & Hubbell, S.
impact on biological richness of ecoregion 29, the P. (1991). The sustainable biosphere initiative:
surveillance based on MODIS satellite data and An ecological research agenda: A report from
limited ground observations will facilitate the the Ecological Society of America. Journal of
prioritization for conservation. Ecology, 72(2), 371–412. doi:10.2307/2937183
The results from biodiversity richness estima-
Noss, R. F. (1990). Indicators for monitor-
tion model indicate that the overall biodiversity
ing biodiversity: A hierarchical approach.
richness is still high except the southern part of
Conservation Biology, 4(4), 355–364.
Thailand.
doi:10.1111/j.1523-1739.1990.tb00309.x
Saunders, D. A., Hobbs, R. J., & Margules, C. R.
REFERENCES (1991). Biological consequences of ecosystem
fragmentation: A review. Conservation Biol-
David, M. S., Davis, F. W., & Hollander, A. D. ogy, 5(1), 18–32. doi:10.1111/j.1523-1739.1991.
(1994). Hierarchical representation of species tb00384.x
distribution for biological survey and monitoring.
In Goodchild, M., Parks, B. O., & Steyaert, L. Soule, M. E., & Kohm, K. A. (Eds.). (1989).
(Eds.), Environmental modeling: Progress and Research priorities for conservation biology.
research issues (pp. 445–449). Fort Collins, CO: Washington, DC: Island Press.
GIS World Books.

830
Landscape Biodiversity Characterization in Ecoregion 29 Using MODIS

Stewart-Cox, B. (1995). Wild Thailand. Cam- Townsend, P. A., & Cohoon, K. P. (1999). Sen-
bridge, MA: MIT Press. sitivity of distributional prediction algorithms
to geographic data completeness. Ecological
Stoms, D. M., & Estes, J. E. (1993). A re-
Modelling, 117(1), 159–164. doi:10.1016/S0304-
mote sensing research agenda for mapping
3800(99)00023-X
and monitoring biodiversity. International
Journal of Remote Sensing, 14, 1839–1860. Turner, M. G. (1989). Landscape ecology: The
doi:10.1080/01431169308954007 effect of pattern on process. Annual Review
of Ecology and Systematics, 20(1), 171–197.
Strahler, A. H., Woodcok, C. E., & Smith, J. A.
doi:10.1146/annurev.es.20.110189.001131
(1986). On the nature of models in remote sensing.
Remote Sensing of Environment, 30(2), 121–139. Urban, D. L., O’Neill, R. V., & Shugart, J. R.
doi:10.1016/0034-4257(86)90018-0 (1987). Landscape ecology. Bioscience, 37,
119–127. doi:10.2307/1310366
Whitemore, T. C., & Sayer, J. A. (1992). Tropi-
cal deforestation and species extinction. IUCN,
Gland.

This work was previously published in Land Use, Climate Change and Biodiversity Modeling: Perspectives and Applications,
edited by Yongyut Trisurat, Rajendra P. Shrestha and Rob Alkemade, pp. 133-149, copyright 2011 by Information Science
Reference (an imprint of IGI Global).

831
832

Chapter 51
The Spatially Interactive
Literature Analysis
System Study Tool:
A GIS-Based Approach to Interpreting
History in the Classroom

Alyssa K. Moore
University of Redlands, USA

Lillian I. Larsen
University of Redlands, USA

Diana Stuart Sinton


University of Redlands, USA

ABSTRACT
The integration of spatial technologies with humanistic approaches to scholarship is expanding and
creating new opportunities and challenges for students. Documenting information about source material
is a fundamental step of the historical research process. Traditional geospatial data documentation is
maintained within standardized metadata forms, but these are not well suited for documenting historical
sources and are cumbersome for people new to GIS and metadata. The Spatially Interactive Literature
Analysis System Study Tool (SILAS.ST) addresses this and other usage issues for students in undergradu-
ate humanities courses. Through customized toolbars and map templates designed to be used with Esri
software, SILAS.ST lowers the barriers to understanding the parameters of spatial datasets while at the
same time models the processes of historical inquiry, documentation, and communication of research
results. The use of natural language and focused prompts help students begin to understand complex
mapping topics such as authorship, relevance, purpose, and uncertainty. This prototype tool aids in the
introduction of digital mapping technologies to humanities students.

DOI: 10.4018/978-1-4666-2038-4.ch051

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
The Spatially Interactive Literature Analysis System Study Tool

INTRODUCTION students ask robust and interesting questions when


analyzing spatial patterns, allowing them to more
Maps and mapping have long held the interest of deeply probe spatial relationships and to investi-
historians and humanities scholars. The where of gate causes for observed patterns (Gregory & Ell,
geography and the when of history are intrinsically 2007; Kwan & Ding, 2008; Schultz et al., 2008).
linked, making place and location central concepts Humanities-oriented GIS applications have
for both disciplines (Jessop, 2008; Mostern, 2010; particularly captured the imagination of teaching
Staley, 2007). For humanities scholars, maps have faculty at many undergraduate institutions (Brown,
traditionally served as visual texts that effectively 2008; Lloyd, 2001; Robinson, 2010; Sinton, 2009;
reveal patterns, support arguments, and illustrate Sinton & Lund, 2007). Instructors in these settings
narratives (Ackerman & Karrow, 2007; Black, realize that simply mapping in “layers” can greatly
1997; Edney, 1997; Sparke, 1995; Staley, 2007). enhance students’ understanding of history and
Maps and globes in digital form also serve as place (Mostern, 2010; Staley, 2007). Even the
research tools for exploring historical questions rudimentary process of manually overlaying clear
(Holdsworth, 2003; Schultz et al., 2008; Univer- plastic sheets, printed with different themes and
sidad Francisco Marroquín, 2009). map elements, reinforces the point “that places
The focus of humanistic map use is increas- are complex, and that mapping spatial phenom-
ingly shifting towards maps generated through ena can reveal important, sometimes surprising
geographic information systems (GIS). Through juxtapositions” (Knowles, 2000, p. 29).
its flexible tools, GIS offers the capacity to or- At the same time, the role of GIS in humanities
ganize, archive, analyze, and display prodigious scholarship and instruction is complex. Because
amounts of information in novel ways (Bailey & the tools were not created with historical sources
Schick, 2009; Bodenhamer et al., 2010; Brennan- in mind, their analytical potential is more readily
Horley et al., 2010; Goodchild & Janelle, 2010; applicable to data rich fields within the natural and
Gregory & Healey, 2007; Jessop, 2004; Knowles, social sciences, less so in the humanities. Because
2002). Large scale research projects on China (Bol, historical data are often inherently imprecise and
2008), the Holocaust (Knowles et al., 2009), the incomplete, measures of uncertainty and gener-
Salem Witch Trials (Ray, 2002), the Dust Bowl alization must also be taken into account when
(Cunfer, 2002), racially-based housing discrimi- interpreting historically informed spatial trends
nation (Hillier, 2003), the development of the and patterns (Gregory & Ell, 2007; Knowles,
railway system (Schwartz, 2010), and the Civil 2002; Rumsey & Williams, 2002). The quality
War (Knowles, 2008) have been undertaken in of historical data is difficult to evaluate with
efforts to glean new insights from applying spatial respect to many of the categories foundational
analysis to historical questions. to GIS (such as error and accuracy), since the
The growing availability of digital geographic data are rarely formulated in ways conducive to
tools has also spurred increased interest in edu- the exploration and manipulation typically used
cation from a spatial perspective (Brown, 2010; with GIS software (Gregory & Ell, 2007; Pearce
Sinton & Bednarz, 2007). New generations of & Louis 2008; Plewe, 2002). Such realities often
superior mapping tools and data resources have thwart attempts to confidently analyze and inte-
significantly facilitated spatial analysis and, as grate historical data within GIS (Brown, 2010).
importantly, qualitative learning (Mostern, 2010). Including historical data on digital maps
This is due not solely to advances in mapping compels scholars to address issues of geographic
technology. Rather the combination of intellectual data quality and completeness. For example, in
and technical skills has helped researchers and an ancient frame, even data sets derived from the

833
The Spatially Interactive Literature Analysis System Study Tool

most reliable sources, may be deduced from in- member, Lillian Larsen, in the Religious Studies
herently nebulous archeological remnants (Reed, department. With a focus on history and critical
2007). Such sources are susceptible to historical reading of primary texts, these non-theological
inaccuracy, and some place locations will never religion courses have proven ideal undergraduate
be known with certainty. This ambiguity is rarely “laboratories” for humanities students to explore
depicted in published maps. In turn, contemporary geospatial technologies. Conceiving of maps not as
digital maps, whether they are addressing histori- “an end product; [but as] a research tool” (Gregory
cal themes or not, often fail to cite their sources & Ell, 2007, pp. 10-11), these interfaces have been
of information. This undermines and stymies designed to allow students to visually overlay
critical inquiry by students who tend to trust the information from various geographic sources and
visual representation of mapped data (Schultz et store their own notes and observations for individual
al., 2008). Other formats of map publications, layers. Default exploratory and drawing tools are
such as atlases, incorporate additional information simplified and grouped, making the most common
that may facilitate the interpretation of ambiguous visualization tasks more accessible. In addition,
historical geographic data. Yet as a design format, customized templates are provided so that students
atlas pages are not often used by default in typical can generate layouts resembling an atlas page, a
layout templates for map production or within format that accommodates maps, text, images, and
classrooms exploring GIS-based instruction. other graphics. This combination allows students
Humanities scholars and students alike also to integrate maps with the other range of media
find that the use of GIS poses initial challenges they more typically use in their scholarship. The
(Brown, 2010; Jessop, 2004, 2008; Mostern, additional explanatory text and documentation as-
2010). The GIS software most often available on sociated with each atlas page provides information
university campuses, Esri’s ArcGIS, is industrial that aids in qualifying and interpreting the history
strength and designed for professional applica- behind and within the included maps.
tions. It is not ideally suited to casual or educa-
tional use. Acquiring and processing data can be
cumbersome and learning to be an independent MAPPING AS A CLASSROOM
user of the software can be an overwhelming ACTIVITY
task. To make GIS an effective scholarly tool
for humanities students, it may be necessary to Initially, simple paper and pencil exercises were
guide and simplify its use. Furthermore, inquiry used to introduce maps into the undergraduate
into the factors behind the spatial patterns is more classroom. These exercises have proved an effective
consistently the final learning achievement, rather preliminary step in generating insights. For ex-
than software expertise for its own sake (Gregory ample, in a course on Christian Scripture, religious
& Ell, 2007; Kerski, 2003; Mostern, 2010). studies students were tasked with representing the
With these challenges in mind, graduate students geographical information included in the letters of
and faculty at the University of Redlands (California, Paul, and the Acts of the Apostles, on two separate
USA) have built a series of GIS-based interfaces and maps. While there is broad scholarly consensus
tools designed to support undergraduate humanities that Acts is a well-fashioned literary reconstruction
students as they critically investigate ancient texts of events being reported up to a full century after
and contexts (Benzek, 2008; King, 2009; Larsen, Paul lived and wrote, and general agreement that
2009). The interest to develop these has emerged only seven of the thirteen letters ascribed to Paul in
from partnerships between graduate students in the fact derive from Paul’s hand, biblical geographers
University’s Master’s of GIS program and a faculty have traditionally assigned these sources equal

834
The Spatially Interactive Literature Analysis System Study Tool

historical truth and weight and woven a seamless Larsen, 2009). SILAS.ST is comprised of a col-
map of the “three missionary journeys of Paul” lection of customized toolbars, built to work with
(Larsen, 2009). In creating their own maps, students ArcGIS 9.3, desktop software that was available
quickly recognized the disparities in data derived to University of Redlands students in computer
respectively from the Acts of the Apostles, and the labs. One toolbar provides access to a set of forms
undisputed letters in Paul. Their student-generated within which students can document important
maps differed significantly from the traditional map information for individual data layers. A second
of Paul’s missionary journeys. Creating maps that set of toolbars facilitates both data exploration and
visually differentiated between discrete historical design of maps as atlas pages. This platform also
data layers, equipped students to develop their own supports data creation and editing, processes that
criteria for critically assessing the relative historicity were harder to manage with the original web-based
of ancient sources (and, derivatively, the maps used version of SILAS. During classroom activities
to elucidate these sources). As importantly, students involving GIS, students can add the SILAS.ST
gained a new appreciation for the value of asking toolbars to an ArcMap document, highlight any
historical questions through a geographical lens. data set or map layer, and display the customized
With even more ambitious educational goals in application for viewing or editing of pertinent map
mind, the Spatially Interactive Literature Analysis information (Figure 1). The custom SILAS.ST
System (SILAS) was constructed as a web ser- metadata-like forms behave as a regular metadata
vice that first addressed issues of GIS usability editor, storing the information along with the dataset
(Benzek, 2008; Larsen, 2009). Using ArcGIS for future access and editing by students working
Server, the tool allowed students to add data lay- independently or in groups.
ers from a pre-existing ArcSDE database and the The SILAS Study Tool was designed to encour-
user interface supplied simple annotation tools. age and support classroom learning objectives:
Historic roads, populated places, and topography critical engagement with historical texts and con-
were provided as an initial collection of layers. texts and exploration of complex cartographic
Additionally, a scripture search web application topics such as authorship, relevance, purpose,
was accessible through the interface, enabling a uncertainty, and accuracy. The prototype was de-
keyword search of biblical texts (Benzek, 2008). signed during the spring of 2009, concurrent to the
A step up from paper mapping, GIS afforded an offering of a course on the Historical Search for
even richer laboratory for facilitating learning. Jesus. In this course, students used a variety of
While the gains were significant for introduc- media including digital historical maps, to explore
tory GIS classroom exercises, the scope of the and construct a conceptual image of the historical
initial SILAS project did not extend into other Jesus as recorded and stereotyped through time.
pedagogically-important areas of historical inter- For young researchers and introductory GIS users,
est, including the support of deeper data inquiries. we sought to create a venue for contemplating the
In particular, the complex, fragmentary, incom-
plete, and uncertain nature of historical data, of
great interest to scholars, was not readily explor- Figure 1. Final SILAS.ST Map Information tool-
able through the SILAS web service. bar. Accessed within ArcMap (9.3), this toolbar
To address these issues, a second project phase, gives students easy access to the customized
the Spatially Interactive Literature Analysis System metadata-like forms while avoiding ArcCatalog.
Study Tool (SILAS.ST), focused on three areas:
data documentation, data discovery and exploration,
and cartographic design and layout (King, 2009;

835
The Spatially Interactive Literature Analysis System Study Tool

cause of patterns and discrepancies among data iterative version of the tool was further refined
layers as information from multiple sources was with continued classroom use.
juxtaposed. Students overlaid and critiqued data
layers representing historical and modern biblical
sources, a process that challenged them to evaluate METHODS OF TAGGING MAP
the presumed authority of printed texts and maps. DATA WITH KNOWLEDGE
Students were also encouraged to interrogate the
causes and generative forces that have contributed Enabling the systematic and thoughtful consider-
to data being represented as they are. Working with ation of historical mapped data was the primary
the Study Tool gave students greater understanding motivation behind SILAS.ST. Its customized
of the historical relevance and uncertainty inherent forms blend portions of the traditional structure
in the maps they create and study. Students simul- of metadata with a more open-ended documen-
taneously became more critical consumers of the tation approach. The metadata-like forms were
texts and map layers they encountered while in- modeled on two sets of form and function: source
vestigating historical topics. Applying these ap- documentation standards and common evaluative
proaches of historical inquiry to maps not only methods used in humanistic fields. We considered
helped student avoid “crude spatial-pattern spot- several existing schemes for their ability to docu-
ting” (Gregory & Ell, 2007, p. 118), but also cul- ment, elucidate, and evaluate information. The
tivated their senses of intellectual curiosity. structure was then organized around different
SILAS.ST is not a replacement for conven- categories of information evaluation using the
tional and standardized geospatial metadata, same type of terminology that a librarian might
which is still available for students as they are employ in teaching students to question and as-
using ArcGIS. However, in our experience, the sess the sources and information they encounter.
categories that populate traditional metadata We focused primarily on the identification and
standards are opaque and rarely accessible to qualification of data, attentive to the nature of
students in humanities courses. In designing the the information we expect students to evaluate.
content of the metadata tool, we made careful use The organizational structure of SILAS.ST
of natural language, avoided disciplinary jargon, draws largely from an information literacy docu-
and included gentle “prompts” to direct and guide ment known as the CRAAP (Currency, Relevance,
student responses to questions. We deliberately Authority, Accuracy, and Purpose) Test (Meriam
embedded open-ended, subjective, and qualitative Library, 2010). Designed with students in mind, this
parameters to model the lines of scholarly inquiry sequence of evaluative questions encourages critical
being pursued in broader humanities courses. assessment of the quality and original purpose of a
Student input and feedback were essential given body of information, an important component
components of the SILAS.ST development pro- of the research process. While it was originally
cess from the beginning. When students did not geared towards assessing information found on the
understand any particular component, or were Internet, where many students automatically begin
unclear about how they were to use the tools in their data searches, its value extends well beyond
class, the design scheme was revisited to ensure this immediate function. Focusing the students’ at-
as much transparency as possible. Likewise, while tention on the currency of information, the sequence
both SILAS.ST and its predecessor, SILAS, were begins by asking students to note when a source
initially coupled with Religious Studies curri- was published and last updated. Relevance targets
cula, their functionality was designed to support the appropriateness of the source by determining
learning in many other disciplines as well. Each the intended audience, which may influence the

836
The Spatially Interactive Literature Analysis System Study Tool

language and scholarly integrity of included infor- own contextualizing fields and redefining Dublin
mation. The named authority of the information is Core’s Coverage to better document spatial and
an important qualifier, as the author represents a set temporal attributes, ECAI has made their database
of credentials and qualifications that speak of their searchable by location and time, within the frame
ability to adequately contribute reliable information. of Dublin Core formatting (Gregory & Ell, 2007;
Students are also asked to assess the accuracy of Lancaster & Bodenhamer, 2002).
a source, whether it is supported by a substantive Further complicating the objective documenta-
body of evidence, or compromised by bias. Last tion of historical sources is how to best recognize
but not least, the purpose of the information sheds that the recorded location of a feature can be
light on the larger contextual picture for why the subject to multiple interpretations. The ArcGIS
information exists. Is the information’s purpose Citation Data Model (CDM) was designed to
to persuade? Entertain? Sell? (Meriam Library, connect bibliographic information to geographic
2010). By inviting students to assess a GIS data features in ArcGIS (Frye, 2008). The CDM enables
layer, such as a scanned historical map, using the scholars to document why a given feature has been
same criteria that would be applied to any other digitally located where it has, and why others may
historical source underscores the textual aspect of disagree. The interface links such information to
mapped information. It likewise equips students single geographic features by utilizing a database
with a familiar rubric on which they can base their schema that systematically joins and relates tabu-
initial discussion and research. lar data. Each object within a feature class can
In adapting the framework of the CRAAP reference up to three corroborating sources, as
Test’s sequenced components, other sources were well as one conflicting source, should one exist.
considered for basic objective metadata categories. This referencing system ensures efficient use of
Of these, the metadata standard from the Electronic sources and establishes their fidelity.
Cultural Atlas Initiative (ECAI) and the ArcGIS Confidence in cartographic representations can
Citation Data Model were particularly useful. be established by contextualizing map sources or
The ECAI Metadata Clearinghouse has its own even the cartographers themselves (Frye, 2008;
metadata standard for the purpose of cataloging Harley, 1988). The CDM includes a notes field in
historical spatial data. As a collaborative venture, both the source table and feature layer table, inviting
ECAI’s information base draws on a worldwide insights and information that might help qualify a
network of scholars working to create a digital atlas source or feature. By applying the CDM to GIS
of cultural information, tagged with spatial and research, a variety of data sources can be incor-
temporal data (University of California, 2007). The porated, validated, easily maintained and shared.
ECAI documentation standard is composed of the
Dublin Core Standard. This widely used metadata
format documents both spatial and temporal ele- SILAS.ST FOR DATA RESEARCH
ments in one field, called Coverage. For use in a AND DOCUMENTATION
spatial database comprised of historical sources, a
single field documenting both time and place may By design, SILAS.ST incorporates elements from
not be satisfactory, so ECAI extended the Coverage each of the aforementioned schemes and standards.
field to include the bounding latitude and longitude Those most suitable for evaluative documentation
points, the relevant historical or archaeological and objective metadata were selected and modified
periods, resolution of a map or digital position so as to better align with our pedagogical purposes.
recording, and the geography by which the data These elements were arranged on digital forms
was aggregated (Johnson, 1998). By adding their under five different tabs: Quick Facts, Origin &

837
The Spatially Interactive Literature Analysis System Study Tool

Authority, Relevance & Purpose, Accuracy & (Goodchild & Janelle, 2010). What is the best way
Reliability, and Notes. The content of each tab to inform Google Earth-era students about scale,
was then organized with a mind to supporting when the digital mapping tools with which they
critical and creative historical inquiry. are most familiar have a notable absence of meta-
The Quick Facts section covers the basic ele- data? What does it take for students to appreciate
ments that a student would initially need to know why the coastline of a country will appear different
about a map or data layer. This tab integrates the within a GIS if it was originally digitized from
ECAI notion of “coverage” of time and space, maps at 1:10,000 or 1:1,000,000 scale? If we
but separates location and time into two differ- limit students’ ability to view data above a certain
ent categories (Figure 2). Instead of identifying scale, with the scale-dependent display functional-
precise bounding coordinates for any particular ity available in GIS, what lessons are they learning?
location, students are asked to provide a more When one zoomed in very close to the ground with
general place and time period. This encourages Google Earth or Google Maps imagery, a message
classification of locations and time using descrip- previously indicated that “imagery is not available
tors that are inherently meaningful to the student at this resolution.” In a GIS interface, no such
user. Likewise, if the details of time or place are message will appear. At very large scales, Google’s
ambiguous or conflicted, students can state this images now convert from an overhead view to the
using qualitative description. “Street View” and one suddenly appears to stand
The notions of scale and resolution were com- on the sidewalk (if you have zoomed to an urban
plex and important ones to address in designing area, for example) and can look down the street.
SILAS.ST. These are difficult concepts for many What lessons about digital image resolution does
students to grasp, especially as quantitative values this impart to students?

Figure 2. Final SILAS.ST Map Information design for the ‘Quick Facts’ tab. Fields are methodically
grouped to promote comprehension of scale and resolution.

838
The Spatially Interactive Literature Analysis System Study Tool

Thus the growing familiarity with these online “published” in the ways that a contemporary map
maps and virtual globes, and their ability to zoom might be. Through this section students can see
and preserve resolution at each fixed extent level, how historic maps may represent a certain school of
undermines their understanding of how digital data thought or tradition of map making. They are also
in GIS can differ. For example, during one class invited to consider a cartographer’s motivation for
session a student zoomed in on a junction of two making maps for different purposes. Vernaculars
vector roads that, from afar, appeared to intersect. of geographers and cartographers vary depending
However, as the student continued to zoom in, the on social and cultural background (Harley, 1989).
roads no longer connected. This situation launched Historical maps often have a distinct lineage and
a rich conversation about the limits of generating a may have undergone many revisions and updates
spatial dataset and how errors enter the digitizing (Black, 1997; Rumsey & Williams, 2002; Wilford,
processes. Students gained a new awareness of 2000). When researching the cartographer and the
how the original scale of a map affects the nature map, students are prompted to consider how the
of the data, knowledge otherwise absent following information was gathered, and in turn which other
independent forays with virtual globes. maps might have been known or consulted. They
Understanding the differences among scale, can assess the map maker’s level of familiarity
resolution and map size is a challenge, but link- with the geography represented, as these elements
ing the topics may facilitate learning (Frye, per- affect the relative reliability of a map.
sonal communication, 2009; Rumsey & Williams, In our experience with visual overlay of maps,
2002). Here, the issue lies in maintaining a balance students were eager to single out the differences
between mathematically specific definitions and they saw in change over time, or differences between
more open-ended, conceptual parameters. Thus, two or more maps produced within a single time
we added a field where one could indicate the period. When students encounter these discrepan-
dimensions of the original source map that had cies, SILAS.ST offers prompts in its Purpose and
been used to generate the GIS layer the student Relevance section to consider and note what the
now observed, so that students could envision differences might mean and why they might exist,
how the data were now being represented in a such as the results of different map-making prac-
dramatically different size. Knowing the method tices or the intentions of the cartographers (Figure
and purpose of preservation is also relevant in this 4). This subjective and evaluative section guides
case, and we provided a tag to indicate the quality students to critically evaluate the map layer as an
and resolution of the original scan. Ultimately, we information source. By design, completing this
included a combination of both subjective and section is likely to require further independent
objective tags to reflect these ideas and metrics. research by the student. Students are encouraged
One of the goals of SILAS.ST is to move beyond to leave their name and document the references
the when and where of historical sources and involve they use for this research, so that future students
students in thinking about how a source may have can add to and accrue information over time. In
been used and what factors drove its creation in the this way, the tool retains the potential to serve as
first place (including student generated maps and an ongoing evaluative document for students, who
data layers). Thus the Origin and Authority section can revisit, revise and rethink its informational
prompts students to document the owner, distribu- content through consecutive semester courses.
tor, publisher, or cartographer of the maps and data The notions of uncertainty and accuracy have
layers they are studying (Figure 3). By breaking long been central to mapping and GIS (Coucle-
down the various forms of map ownership, students lis, 2003; MacEachren, 1992; Plewe, 2002; Unwin,
begin to understand that not all historic maps were 1995; Zhang & Goodchild, 2002). Even the am-

839
The Spatially Interactive Literature Analysis System Study Tool

Figure 3. Final SILAS.ST Map Information design for the origin and authority

Figure 4. Final SILAS.ST Map Information design for the relevance and purpose

840
The Spatially Interactive Literature Analysis System Study Tool

biguity associated with the words themselves of the Holy Land, in particular, clearly show the
lends rich complexity to these ideas. Students great effort expended in detailing particular fea-
bring with them a prior conceptual knowledge of tures relevant to the desired audience (Long, 2002).
“uncertainty” and “accuracy” but the depth and Prior to the invention of precision instruments,
breadth of how those ideas translate into mapping maps were simply a mental construct of space,
and GIS is new to them. Moreover, students are limited by the observer’s (or primitive instru-
also learning how to apply these concepts as junior ment’s) ability to measure. Early observances and
scholars of history. Many enter the classroom records may not be adequately preserved, or fall
without appreciating the importance of critical victim to bias. Thus a large challenge to overcome
thinking in addressing historical texts, much less before using historical sources is adequate research
maps (Robinson, 2010). The challenge lies in to compensate for the place names, geographic
introducing both sets of ideas to students so that features and measurement standards that have
they can begin to appreciate complexity but not changed or have not been properly recorded.
be overwhelmed by a perceived burden of trying One quantitative measure where accuracy can
to “fix” something. We debated the balance be- be applied is in georeferencing (Rumsey & Wil-
tween quantitative and qualitative measures of liams, 2002). Georeferencing involves the careful
uncertainty and accuracy, designing an approach placement of a digital image, often an historical
to accommodate both in the Accuracy and Reli- map, over the GIS-based representation of that
ability section of SILAS.ST (Figure 5). same geographic extent on the Earth’s surface.
In historical records, problems of incomplete- Once the image is “tacked” down, it becomes
ness, incoherency, and ambiguity are part of a layer within the GIS, aligned with the other
documenting a narrative and interpreting it over layers in the vertical stack of data. How well the
time (Gaddis, 2002; Plewe, 2002). Historical maps georeferencing placement process occurs is some-

Figure 5. Final SILAS.ST Map Information design for the accuracy and reliability tab

841
The Spatially Interactive Literature Analysis System Study Tool

thing that can be measured; the root mean square courses. This involved reorganizing and renaming
error (RMSE) is one calculation of how far the existing ArcMap functionality for the support of
“tacks” have had to stretch the image to fit where historical inquiry.
it has been placed. With explanation and practice, To facilitate map navigation, the Explore Data
students can learn not only to georeference maps toolbar contains the common Pan and Zoom func-
(and record their RMSE values within SILAS. tions for initial exploration of a dataset, along with
ST) but also to understand when and why some Bookmarks, so students can save extents they
maps can never be georeferenced with acceptable wish to revisit later (Figure 6). For map overlay
measures of accuracy. visualizations, the Transparency and Swipe tools
Storing information on data lineage and pro- were also included. The Add Layers command
cessing is also a component of the Accuracy and opens to the ArcSDE database, containing the
Reliability section. Ideally completed by a more original SILAS data layers and any others that
experienced GIS user who may have processed the students may have contributed. The Search Bible
data, in class settings this information may be used functionality opens up the SILAS Scripture Search,
to facilitate discussion of GIS data creation and in which students can perform a keyword search
conversion techniques. Such conversations may, of names or places included in the biblical text.
in turn, shed light for the students on the merits To encourage sketching as a form of inquiry
and limitations of GIS for historical research. At and data creation, the Hand Draw toolbar contains
the same time, it is important that the documenta- common drawing functionality that supports the
tion of data lineage and processing be written so use of graphic elements (Figure 7). Drawing al-
that students less familiar with GIS techniques lows students to generate point, lines and polygon
can understand how the original information has features directly on the map, and provide their
already been manipulated in order to be viewed own annotation. We have found this particular
within GIS. functionality very important in our classes as we
often ask students to trace the routes of historic
figures, based on information they are extracting
SILAS.ST FOR DATA DISCOVERY from the texts as they read. By engaging students
AND REPRESENTATION in the tracking of historical journeys, reading
comprehension is improved because students are
In addition to the metadata-like forms avail- required to read for geographical indicators (Oden,
able within SILAS.ST, we also created several 1992). Students have the option to save their
other customized toolbars designed to support drawings as a layer for future use outside of their
classroom-usage of GIS within undergraduate current map document.

Figure 6. Final SILAS.ST Explore Data Toolbar is comprised of common map exploratory functions

Figure 7. Final SILAS.ST Hand Draw Toolbar helps students edit graphic features and attributes

842
The Spatially Interactive Literature Analysis System Study Tool

Lastly, we encourage students to frame their previous mapping exercises (Figure 9). The or-
use of maps within an atlas format, one that blends ganizational choices reflect student interest in
the use of maps, text, images, and other graphics comparing maps of similar geographic extents,
to document and share their historical scholarship. or using a single map to illustrate a journey.
Not only does this model the use of multiple
sources and formats of information to document
historical inquiry, but it facilitates the creation of NEXT STEPS AND CONCLUSION
professional, well-designed final research prod-
ucts. The Create Atlas (Figure 8) toolbar links to As a prototype tool, SILAS.ST will undoubtedly
customized map templates that include preset base continue to evolve as it is used in more classroom
maps with ArcSDE database connections, map settings, with new data sets and new learning
layout schemes, and customized historical fonts objectives. Within this frame, the capacity for
and symbols. ongoing student involvement in shaping and re-
The final map templates were modeled on shaping SILAS.ST is significant. In developing
layouts that students themselves had designed in both SILAS and SILAS.ST, student needs were

Figure 8. Final SILAS.ST Create Atlas Toolbar eases map layout for print

Figure 9. One of several map layout templates available to students

843
The Spatially Interactive Literature Analysis System Study Tool

anticipated and then directly evaluated during classroom GIS use. Objective and subjective
in-class sessions. Student feedback was system- scrutiny of historical sources is helping students
atically solicited and incorporated as part of the more fully comprehend the complexities of
work flow of tool development. history. This juxtaposition allows humanities-
Diverse opportunities for student feedback oriented researchers and students to ask better
and lesson integration continue to be integral to all and more nuanced questions, whether students are
future development and modifications. Though analyzing patterns, probing spatial relationships,
the use of SILAS.ST has been limited to courses investigating causes, or formulating queries
in the Religious Studies department to date, and (Gregory & Ell, 2007). As a prototype, SILAS.
even with those classes its deployment has been ST helps students appreciate the processes of
sporadic, we expect this to expand significantly historical scholarship and encourages critical
in the near future. Our active campus-wide ini- thinking and inquiry. As students seek to bring
tiative to promote the use of maps, mapping historical data sources into conversation with
and spatial perspectives into our undergraduate 21st century technology platforms, they begin
curricula has recently focused on the themes of to view both ancient and modern categories in
mapping people and mapping migrations, topics new and productive ways.
that are clearly of broad interest to humanities Within historical GIS, the nature of the project
and social science faculty (LENS, 2010). In and data sources must influence the documenta-
the future, there are plans to explore the use of tion method. Many projects focus on the pres-
SILAS.ST with multiple courses in history and ervation of history, with an eye to utilizing GIS
archaeology as well. These additional use cases database management for information efficiency.
will undoubtedly provide important feedback to These GIS projects share similar challenges to
the on-going development of this prototype tool. those met by SILAS.ST. Documenting coverage,
Software upgrades have had to be accom- scale and accuracy, and the thoughtful incorpora-
modated with each new phase. For example, tion of customized metadata standards are issues
accessing the customized SILAS.ST forms that any historical GIS must address. Metadata
from a toolbar within ArcMap 9.3 required the and source documentation have been considered
use of VB.Net, and this function may behave from a technical and scholarly perspective for
differently in more recent versions of ArcGIS. many years, but our intention was to design a
Software enhancements create new hurdles but tool that prioritized pedagogical objectives. Ul-
also new opportunities and possibilities. With timately, by making these topics more relevant
new versions, we can now reconsider the idea for historians and researchers, we increase the
of enabling students to edit data and work with likelihood that scholars and students alike will
SILAS.ST over web-based maps. benefit from the powerful resources of GIS.
In developing SILAS.ST, we found that Interest in and use of these tools will be one
blending mapping technologies with intellectual measure of our success. A second measure will
contexts and technical skills yields a rich learn- be SILAS.ST’s effectiveness in validating data
ing environment. Of the many possible ways of sources for exploratory use in the humanities
documenting sources, SILAS.ST’s capacity for classroom, while encouraging students to ques-
accommodating both factual and more ambigu- tion both conventional readings and geographical
ous information has broken new ground. The representations of history.
balance it affords will be particularly valuable in
the interdisciplinary and collaborative classroom
settings that are often home to undergraduate

844
The Spatially Interactive Literature Analysis System Study Tool

REFERENCES Cunfer, G. (2002). Causes of the Dust Bowl. In


Knowles, A. (Ed.), Past time, past place: GIS for
Ackerman, J. R., & Karrow, R. W. Jr. (2007). history (pp. 93–104). Redlands, CA: ESRI Press.
Maps: Finding our place in the world. Chicago,
IL: University of Chicago Press. Edney, M. H. (1997). Mapping an empire. Chicago,
IL: University of Chicago Press.
Bailey, T. J., & Schick, J. B. M. (2009). Enabling
the collision of history and geography. Social Frye, C. (2008). The ArcGIS citation data model
Science Computer Review, 27(3), 291–296. and method. Paper presented at the Meeting of the
doi:10.1177/0894439308329757 Social Science History Association, Miami, FL.

Benzek, S. (2008). Spatially Interactive Literature Gaddis, J. L. (2002). The landscape of history:
Analysis System (SILAS): An interactive tool to How historians map the past. New York, NY:
study the narrative landscape of ancient texts Oxford University Press.
(Unpublished master’s thesis). University of Goodchild, M., & Janelle, D. (2010). Toward criti-
Redlands, Redlands, CA. cal spatial thinking in the social sciences and hu-
Black, J. (1997). Maps and history: Construct- manities. GeoJournal, 75(1), 3–13. doi:10.1007/
ing images of the past. New Haven, CT: Yale s10708-010-9340-3
University Press. Gregory, I. N., & Ell, P. S. (2007). Historical GIS:
Bodenhamer, D. J., Corrigan, J., & Harris, T. M. Technologies, methodologies and scholarship.
(2010). The spatial humanities. Bloomington, IN: Cambridge, MA: Cambridge University Press.
Indiana University Press. doi:10.1017/CBO9780511493645

Bol, P. (2008). Creating a GIS for the history Gregory, I. N., & Healey, R. G. (2007). Historical
of China. In Knowles, A., & Hillier, A. (Eds.), GIS: Structuring, mapping and analysing geogra-
Placing history: How maps, spatial data, and GIS phies of the past. Progress in Human Geography,
are changing historical scholarship (pp. 27–60). 31(5), 638–653. doi:10.1177/0309132507081495
Redlands, CA: ESRI Press. Harley, J. B. (1988). Maps, knowledge, and power.
Brennan-Horley, C., Luckman, S., Gibson, C., & In Cosgrove, D., & Daniels, S. (Eds.), The ico-
Willoughby-Smith, J. (2010). GIS, ethnography, nography of landscape (pp. 277–312). Cambridge,
and cultural research: Putting maps back into UK: Cambridge University Press.
ethnographic mapping. The Information Society, Harley, J.B. (1989). Deconstructing the map.
26(2), 92–103. doi:10.1080/01972240903562712 Cartographica: The International Journal for
Brown, J. (2008). Using GIS to drive research Geographic Information and Geovisualization,
in undergraduate history classes: Five years of 26(2), 1–20. doi:10.3138/E635-7827-1757-9T53
experimentation with the modern world history Hillier, A. (2003). Redlining and the home owners
survey course. Journal of the Association for loan corporation. Journal of Urban History, 29(4),
History and Computing, 11(2). 394–420. doi:10.1177/0096144203029004002
Brown, P. (2010). Exploring historical space and Holdsworth, D. W. (2003). Historical geogra-
environments in the history/social studies class- phy: New ways of imaging and seeing the past.
room: Discussion and reflections. Journal of the Progress in Human Geography, 27(4), 486–493.
Association for History and Computing, 13(1). doi:10.1191/0309132503ph440pr

845
The Spatially Interactive Literature Analysis System Study Tool

Jessop, M. (2004). The visualization of spatial data Lancaster, L., & Bodenhamer, D. (2002). The
in the humanities. Literary and Linguistic Com- electronic cultural Atlas Initiative and the North
puting, 19(3), 335–350. doi:10.1093/llc/19.3.335 American Religion Atlas. In Knowles, A. (Ed.),
Past time, past place: GIS for history (pp. 163–
Jessop, M. (2008). The inhibition of geographi-
177). Redlands, CA: ESRI Press.
cal information in digital humanities scholarship.
Literary and Linguistic Computing, 23(1), 39–50. Larsen, L. (2009). Implementing SILAS: The fu-
doi:10.1093/llc/fqm041 ture of using GIS to study the past. Paper presented
at the Meeting of the Association of American
Johnson, I. (1998). ECAI Metadata Manual
Geographers, Las Vegas, NV.
(Version 1.1). Retrieved from http://www.ecai.
org/documentation/ecai_metadata_standard.html LENS. (2010). Learning spatially at the Uni-
versity of Redlands. Retrieved from http://lens.
Kerski, J. J. (2003). The implementation and ef-
redlands.edu
fectiveness of geographic information systems
technology and methods in secondary education. Lloyd, W. (2001). Integrating GIS into the
The Journal of Geography, 102(3), 128–137. undergraduate learning environment. The
doi:10.1080/00221340308978534 Journal of Geography, 100(5), 158–163.
doi:10.1080/00221340108978443
King, A. (2009). The spatially interactive litera-
ture analysis system study tool: A GIS tool for Long, B. O. (2002). Imagining the Holy Land:
interpreting history (Unpublished master’s thesis). Maps, models, and fantasy travels. Bloomington,
University of Redlands, Redlands, CA. IN: Indiana University Press.
Knowles, A. K. (2000). A case for teaching geo- MacEachren, A. M. (1992). Visualizing uncer-
graphic visualization without GIS. Cartographic tain information. Cartographic Perspective, 13,
Perspectives, 36, 24–37. 10–19.
Knowles, A. K. (Ed.). (2002). Past time, past Meriam Library, California State University,
place: GIS for history. Redlands, CA: ESRI Press. Chico. (2010). Evaluating information -- apply-
ing the CRAAP test. Retrieved from http://www.
Knowles, A. K. (2008). What could Lee see at
csuchico.edu/lins/handouts/eval_websites.pdf
Gettysburg? In Knowles, A., & Hillier, A. (Eds.),
Placing history: How maps, spatial data, and GIS Mostern, R. (2010). Putting the world in world
are changing historical scholarship (pp. 235–265). history. Journal of the Association for History
Redlands, CA: ESRI Press. and Computing, 13(1).
Knowles, A. K., Beorn, W., Cole, T., Gigliotti, S., Oden, P. (1992). Geography is everywhere in chil-
Giordano, A., & Holian, A. (2009). Geographies dren’s literature. The Journal of Geography, 91(4),
of the Holocaust. Geographical Review, 99(4), 151–158. doi:10.1080/00221349208979503
563–574.
Pearce, M. W., & Louis, R. P. (2008). Mapping
Kwan, M.-P., & Ding, Q. (2008). Geo-narrative: indigenous depth of place. American Indian
Extending geographic information systems for Culture and Research Journal, 32(3), 107–126.
narrative analysis in qualitative and mixed-method
Plewe, B. (2002). The nature of uncertainty in
research. The Professional Geographer, 60(4),
historical geographical information. Transac-
443–465. doi:10.1080/00330120802211752
tions in GIS, 6(4), 431–456. doi:10.1111/1467-
9671.00121

846
The Spatially Interactive Literature Analysis System Study Tool

Ray, B. (2002). Teaching the Salem Witch Trials. Sinton, D., & Lund, J. (2007). Understanding
In Knowles, A. (Ed.), Past time, past place: GIS place: GIS and mapping across the curriculum.
for history (pp. 19–33). Redlands, CA: ESRI Press. Redlands, CA: ESRI Press.
Reed, J. L. (2007). The HarperCollins visual guide Sparke, M. (1995). Between demythologizing
to the New Testament: What archaeology reveals and deconstructing the map: Shawnadithit’s
about the first christians. New York: HarperOne. New-found-land and the alienation of Canada.
Cartographica, 31(1), 1–21. doi:10.3138/WW47-
Robinson, B. (2010). The integration of GIS-based
6X0N-475Q-7231
information mapping into an enthnohistorical
seminar. Journal of the Association for History Staley, D. (2007). Finding narratives of time and
and Computing, 13(1). space. In Sinton, D., & Lund, J. (Eds.), Under-
standing place: GIS and mapping across the cur-
Rumsey, D., & Williams, M. (2002). Historical
riculum (pp. 35–48). Redlands, CA: ESRI Press.
maps in GIS. In Knowles, A. (Ed.), Past time,
past place: GIS for history (pp. 1–18). Redlands, Universidad Francisco Marroquín. (2009).
CA: ESRI Press. Quauhquechollan, El Lienzo de la Conquista (The
Lienzo). Retrieved from http://www.lienzo.ufm.
Schultz, R. B., Kerski, J. J., & Patterson, T. C.
edu/cms/en/home
(2008). The use of virtual globes as a spatial
teaching tool with suggestions for metadata stan- University of California. (2007). Electronic
dards. The Journal of Geography, 107(1), 27–34. Cultural Atlas Initiative. Retrieved from http://
doi:10.1080/00221340802049844 www.ecai.org/
Schwartz, R. (2010). Rail transport, agrarian cri- Unwin, D. (1995). Geographical information sys-
sis, and the restructuring of agriculture: France tems and the problem of ‘error and uncertainty.’
and Great Britain confront globalization, 1860– Progress in Human Geography, 19(4), 549–558.
1900. Social Science History, 34(2), 229–255. doi:10.1177/030913259501900408
doi:10.1215/01455532-2009-026
Wilford, J. N. (2000). The mapmakers (Revised
Sinton, D. (2009). Roles for GIS within higher edu- ed.). New York, NY: Knopf.
cation. Journal of Geography in Higher Education,
Zhang, J., & Goodchild, M. (2002). Uncertainty in
33(1), 7–16. doi:10.1080/03098260903034046
geographic information. Boca Raton, FL: Taylor
Sinton, D., & Bednarz, S. (2007). About that G in and Francis. doi:10.4324/9780203471326
GIS. In Sinton, D., & Lund, J. (Eds.), Understand-
ing place: GIS and mapping across the curriculum
(pp. 19–34). Redlands, CA: ESRI Press.

This work was previously published in the International Journal of Applied Geospatial Research (IJAGR), Volume 3, Issue 3,
edited by Donald Patrick Albert, pp. 70-85, copyright 2012 by IGI Publishing (an imprint of IGI Global).

847
848

Chapter 52
Geographic Visual
Query Languages and
Ambiguities Treatment
Arianna D’Ulizia
Consiglio Nazionale delle Ricerche, IRPPS, Italy

Fernando Ferri
Consiglio Nazionale delle Ricerche, IRPPS, Italy

Patrizia Grifoni
Consiglio Nazionale delle Ricerche, IRPPS, Italy

ABSTRACT
The main issues of spatial databases and Geographic Information System (GIS), concern the representa-
tion, the management and the manipulation of a large and complex number of spatial objects and spatial
relationships. In these systems many concepts are spatial and, therefore they are intrinsically related
with a visual representation, which makes also easier to formulate queries by non-expert users. The
main problems in visual query languages for spatial databases concern imprecision, spatial integrity and
ambiguities in query formulation. Our concern in this chapter is with the ambiguity of visual geographi-
cal queries. In particular, a review of existing visual query languages for spatial databases and their
classification on the grounds of the methodology adopted to resolve the ambiguity problem are provided.

INTRODUCTION Visual queries for spatial databases can be


expressed using one of the following four ap-
Spatial databases and Geographic Information proaches. The first approach uses predefined
System (GIS) represent, manage and manipulate icons to retrieve pictorial information. Examples
a large and complex number of spatial objects of languages that use this approach are: Cigales
and spatial relationships. (Calcinelli and Mainguenaud, 1994), the language
defined by Lee and Chin in (1995) and the card-
based language proposed by Ju et al. (2003). The
DOI: 10.4018/978-1-4666-2038-4.ch052 second approach specifies spatial relationships by

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Geographic Visual Query Languages and Ambiguities Treatment

freehand drawing. Sketch (Meyer, 1993) Spatial- use of actions in query formulation by modifying
Query-By-Sketch (Egenhofer, 1997; Blaser and the query semantics. The iconic language defined
Egenhofer, 2000), VISCO (Wessel and Haarslev, by Lee and Chin (1995) belongs to this group.
1998), GeoPQL (Ferri and Rafanelli, 2005; Ferri A third group of languages tries to increase the
et al., 2004) and, finally, the language proposed user’s ability to formulate more complex queries
by Erwig and Schneider (2003), belong to this by the use of several operators without facing the
approach. The third approach uses symbolic im- ambiguity problem. Languages that belong to this
ages for representing a set of objects and a set group are Cigales and LVIS. A fourth group of
of spatial relations among them. Languages that languages proposes approximate solutions as well
belong to this approach are Pictorial Query By as the exact answer to the query enabling the user
Example (Papadias and Sellis, 1995), SVIQUEL to select what he/she requires. Sketch and Spatial
(Kaushik and Rundensteiner, 1997), and the lan- Query By Sketch are part of this category. Finally,
guage proposed by Rahman et al. (2005). Finally, a fifth group of languages, such as GeoPQL,
the fourth approach combines text and sketching in resolves ambiguities by introducing special new
a hybrid solution, such as the language proposed operators to manage them.
by Szmurlo et al. (1998). This chapter is structured as follows. Section
The main problems in visual query languages 2 gives a brief overview of the approaches used
for spatial databases concern imprecision, spatial for the definition of visual querying for spatial
integrity (Favetta & Laurini, 2001) and ambi- databases. Section 3 illustrates problems about
guities in query formulation. Some authors have ambiguity treatment in these kinds of visual lan-
proposed solutions to resolve ambiguities. For guages. In section 4 a classification of different
example, Favetta and Aufaures-Portier (2000) languages on the grounds of methodology adopting
proposed a taxonomy for classifying different to resolve the problem of ambiguity is proposed.
types of ambiguity during query formulation. Section 5 presents some future perspectives on the
They state that the best solution for ambiguities growth of visual languages for spatial databases
is a hybrid language (textual and visual) with a and conclusions.
more intensive dialog between user and system.
Lbath et al. (1997) have proposed to resolve
ambiguities through a standard for semantics using VISUAL QUERY LANGUAGES
specific menus. They argue that it is possible to FOR SPATIAL DATABASES
define the interpretation for the query and suggest
a hybrid visual language named Aigle-Cigales, in Several proposals of visual languages for geo-
which the system works with default semantics. graphic data exist in the literature. The following
Details should be explicitly mentioned through discussion expressed in detail the classification
a specific contextual menu or by textual format. of the languages presented in the introduction.
Since ambiguity can represent a restriction for To conceptually represent geographic objects,
visual languages, it is very interesting to analyze different visual query languages consider three
several language proposals and classify them types of symbolic graphical objects (SGO): point,
according to the methodology used to resolve polyline and polygon.
the problem of ambiguity. A first group of lan- The first approach uses predefined icons for
guages, such as Pictorial Query By Example and retrieving pictorial information. The shortcoming
SVIQUEL, faces ambiguity by allowing the use of this method is that the predefined icons do not
of a few operators and/or spatial relationships. A have strong expressive power, and the consequent
second group disambiguates language through the query capability is limited.

849
Geographic Visual Query Languages and Ambiguities Treatment

Among languages based on this approach, there The Spatial-Query-By-Sketch language


is Cigales (Calcinelli and Mainguenaud, 1994), (Egenhofer, 1997; Blaser and Egenhofer, 2000),
which allows the user to draw a query. It is based similarly to Sketch!, is based on a formal model for
on the idea of expressing a query by drawing the topological spatial relations and a computational
pattern corresponding to the result desired by model for the constraints relaxation. Each query
the user. To achieve this it uses a set of icons that produces a set of candidate interpretations as result
model the geometric objects, polyline and polygon and the user selects the correct one.
(point is not considered), and the operations carried Another language based on the second ap-
out on these objects (e.g., intersection, inclusion, proach is VISCO (Wessel and Haarslev, 1998)
adjacency, path and distance). Symbolic graphical which considers geometric as well as topological
objects and icons that conceptualize the operators constraints as drawn by the user. The system parses
are predefined. the geometry of query sketches and supports the
Another language, defined by Lee and Chin annotation of meta information for specifying
(1995), allows the user to compose a query utilizing relaxations which are additional constraints, or
the three symbolic graphical objects: rectangle, “don’t cares”, that define the query interpretation.
line and point. These SGOs can compose an iconic Among the languages based on the second ap-
sentence whose meaning is due to topological proach there is the GeoPQL (Ferri and Rafanelli,
relations among the icons of the query. In this 2005; Ferri et al., 2004) that is based on twelve
language the user draws a new SGO and can set operators. The nine traditional topological opera-
the state of all of the previously drawn SGOs to tors, the distance operator and finally two further
the foreground or background. SGOs for which operators, ALIAS and ANY, devoted to solving
the relationships with the new SGO have to be ambiguities in query interpretation. Only the last
considered, must be placed in the foreground, two operators need to be explicitly expressed,
while those SGOs whose relationships have not while topological operators are automatically de-
been considered are placed in the background. duced from the visual representation of the query.
The card-based language, proposed by Ju et Finally, the proposal of Erwig and Schneider
al. (2003), uses the card iconic metaphor for rep- (2003) consists in a language devoted to analyz-
resenting both complex spatial objects and spatial ing two-dimensional traces of moving objects
relationships between them. The user can describe to infer a temporal development of their mutual
his/her query requirements in a visual environment spatial relationships.
by selecting the appropriate cards and putting The third approach uses symbolic images for
them into the proper query boxes. The result of representing a set of objects and a set of spatial
the query is also displayed in graphical form. relations among them. For instance, Pictorial
The second approach specifies spatial relation- Query-by-Example (PQBE) (Papadias and Sellis,
ships by a freehand drawing. First of all there is 1995) uses symbolic images to find directional
Sketch! (Meyer, 1993), that allows the user to relationships. This language considers a symbolic
draw a visual representation of his/her query, as image as an array that could correspond to visual
if on a blackboard, without explicit references to scenes, geographical maps or other forms of spa-
operators to be applied to geographical objects tial data. The main limitation of PQBE is that it
involved in the query. In fact, the spatial query is considers directional relationships only.
expressed through a sketch on the screen, which is An evolution of PQBE is SVIQUEL (Kaushik
later interpreted by the system. It means that spatial and Rundensteiner, 1997) that also includes topo-
operators are directly derived from the sketch. logical operators which consider 45 different types
of primitives, allowing to represent topological

850
Geographic Visual Query Languages and Ambiguities Treatment

and directional relationships between two SGOs default semantics and details are mentioned ex-
of type polygon. plicitly through a specific contextual menu or by
Finally, the proposal by Rahman et al. (2005), textual format.
allows elicit information requirements through Moreover, Carpentier and Mainguenaud (2002)
the interactive choice of wireless web services. distinguish two ambiguities: visual ambiguity that
To define a visual spatial query language this ap- appears when a given visual representation of a
proach combines text and sketching in a hybrid query corresponds with several interpretations,
solution. Using this method, users can draw spatial and selection ambiguity that appears when sev-
configurations of the objects they would like to eral metaphors correspond to a given selection.
retrieve from the GIS, while the textual part permits To reduce these problems the authors propose
the specification of the geographical semantics. a drawing process with a grammar for interac-
An example of this approach is represented by the tion: the more powerful the grammar, the more
language proposed by Szmurlo et al. (1998). This important the level of ambiguity. A compromise
language allows the user to draw the configura- needs to be found.
tion of the objects he/she is interested in, and thus Visual languages offer an intuitive and incre-
defines spatial constraints between these objects. mental view of spatial queries, but often times
Any geographic object can be classified as a zone, they may offer different interpretations of the
a line or a point. Moreover, constraints written in same query. Among the reasons for multiple
natural language are collected in labels that are query interpretations, the most important is that
graphically connected to the object they refer to. the user has a different intention in formulating
his/her query with respect to the analysis that the
system makes of it. Moreover, when the user draws
QUERY’S AMBIGUITY PROBLEM two icons for representing different objects of a
sentence, he/she can avoid defining one or more
The main problems of visual query languages spatial relations between them. Then the system
for spatial databases concern imprecision, spa- can formulate different interpretations to obtain
tial integrity and ambiguities in query formula- the required result.
tion. Favetta and Aufaure-Portier (2000) details For example, suppose the user formulate the
problems due to topological imprecision and following query: “Find all the regions which pass
integrity in spatial relations. Some researchers through a river and overlap a forest”. The user
have proposed solutions to resolve ambiguities. is not interested in the relationship between the
For example, Favetta and Aufaures-Portier (2000) river and the forest and the absence, in natural
have suggested a taxonomy for classifying dif- language (NL) formulation, of explicit relation-
ferent kinds of ambiguity that can be produced ships between them produces an ambiguity. The
during the query formulation. They state that the different visual queries of Figure 1 represent the
best solution to resolve ambiguities is a hybrid query in natural language.
language (textual and visual) with a dialog more To remove the ambiguity, the complete natu-
intense between user and system. ral language query, “Find all the regions which
Another proposal to resolve the ambiguities are passed through by a river and overlap a forest,
has been introduced by Lbath et al (1997). They irrespectively of the topological relationships
proposed standard semantics, through specific between the river and the forest”, could be con-
menus, that make interpretation of the query sidered. However, when the user draws an SGO
possible. They propose a hybrid visual language, representing a forest and another representing a
Aigle-Cigales, in which the system works with

851
Geographic Visual Query Languages and Ambiguities Treatment

Figure 1. Visual queries for the same NL query

river he/she cannot avoid representing a topo- defined by Lee and Chin (1995) belongs to this
logical relationship between them. group. In this language it is possible to remove
Since ambiguity can constitute a restriction undesired relationships among drawn symbolic
of visual language, in the following section we graphical objects or impose an a priori restrictive
analyze methodologies adopted by different lan- interpretation using the foreground/background
guages to resolve this problem. metaphor. The relationships of a new symbolic
graphical object depend on the state (foreground
or background) of the previously drawn symbolic
VISUAL SPATIAL QUERY graphical objects. To interpret a query the parser
LANGUAGES AND THE must consider both the visual representation and
AMBIGUITY PROBLEM the drawing process. In this manner some proce-
dural steps influence the semantics of the query
In section 2 we presented visual query languages they do not influence its representation. Queries
for spatial databases that have been proposed in having the same representation may have differ-
the literature. Now we classify these languages ent semantics. Another language that belongs to
on the grounds of methodologies they adopt to the second group is VISCO. This language offers
resolve the problem of ambiguity. tools that specify meta-information resembling the
A first group of languages handles the am- user’s idea of the interpretation. This approach
biguity by allowing the use of few operators or demands more skill from the user but makes the
spatial relationships, such as Pictorial Query By intended interpretation explicit.
Example and SVIQUEL. Considering only limited A third group of language tries to increase a
kinds of spatial relations (directional relations) user’s possibility of formulating a more complex
PQBE avoids multiple interpretations of the query query by the use of several operators without facing
but reduces the possibility of formulating more explicitly the ambiguity problem. A language that
complex queries that involve topological rela- belongs to this group is Cigales. In this language the
tionships. The SVIQUEL language also includes system is not able to give a unique interpretation
topological operators. However, it avoids multiple of the visual query representation. Two possible
interpretations by limiting the number of objects solutions proposed to reduce ambiguity of Cigales
involved (to just two) and provides a tool with a are: to introduce various interactions (feedback)
low expressive power for specifying the relative with the user and to increase the complexity of the
spatial positions. resolution model. However, different obstructions
A second group disambiguates the language may arise so that the semantics of the query are
by the use of actions in query formulation that fully user dependent and complex queries with
modify query semantics. The iconic language numerous basic objects, are not expressible. An-

852
Geographic Visual Query Languages and Ambiguities Treatment

other language of the third group is that proposed Finally, a fifth group of languages resolves the
by Szmurlo et al. (1998). This language allows the ambiguity problem by introducing new special
user to draw the configuration of the objects and operators that serve to manage the ambiguity.
thus to define spatial constraints between these Among the languages belonging to this group there
objects and thematic constraints that each object is GeoPQL, which allows the user to represent only
has to respect. However, as this language allows the the desired relationships. The system interprets
user to have great freedom, many ambiguities may the query considering all relationships between
arise due to the incoherencies between the object symbolic graphical objects of the sketch, and it
and the thematic constraints, between constraints is possible to remove or modify some undesired
for the same object or between constraints for relations using ad hoc operators introduced in the
different objects. A solution to ambiguity consists language. Figure 2 shows the few examples of
of detection of incoherencies and proposal of a visual queries represented by using some of the
possible solutions to the user. languages introduced in this article.
A fourth group of languages proposes to give
query approximation solutions to the user and the
user then selects what he/she requires. Sketch and CONCLUSION
Spatial Query By Sketch are part of this category. In
particular, Spatial-Query-By-Sketch resolves the The number of applications using spatial or geo-
ambiguity problem by considering and proposing graphic data has been ever increasing over the
to the user both the exact solution of the query, if last decade. New small-scale GISs, often called
possible, and other approximate solutions obtained desktop GIS, are gradually becoming available and
by relaxing some relationships. In this manner the people are becoming familiar with using the web
language includes multiple interpretations in the to access remote information. An important future
result, and the user selects the representation that direction is to provide the existing system through
provides a correct interpretation of his/her query. the web by using standards created by the Open-

Figure 2. The same visual query represented with different languages

853
Geographic Visual Query Languages and Ambiguities Treatment

GIS consortium such as WMS (Web Map Server) Calcinelli, D., & Mainguenaud, M. (1994).
and GML (Geographic Markup Language). In Cigales, a visual language for geographic infor-
particular, GML is an open standard for encoding mation system: the user interface. Journal of
geographic information in an eXtensible Markup Visual Languages and Computing, 5(2), 113–132.
Language (XML). It is not related to any specific doi:10.1006/jvlc.1994.1006
hardware or software platform. Any data encoded
Carpentier, C., & Mainguenaud, M. (2002).
using it can be easily read and understood by any
Classifying ambiguities in visual spatial lan-
programming language and software system able
guages. GeoInformatica, 6(3), 285–316.
to parse XML streams. Moreover, there are com-
doi:10.1023/A:1019770003896
mercial and open source tools that translate GML
data into the Scalable Vector Graphics (SVG) Egenhofer, M. J. (1997). Query processing in spa-
format in order to display maps. tial-query-by-sketch. Journal of Visual Languages
Consequently, the World Wide Web diffusion and Computing, 8(4), 403–424. doi:10.1006/
and the increasingly widespread usage of mobile jvlc.1997.0054
devices has made possible to query and to access
Erwig, M., & Schneider, M. (2003). A visual
geographical databases available online from mo-
language for the evolution of spatial relation-
bile devices. There are mainly two drawbacks in
ships and its translation into a spatio-temporal
using GML for mobile devices. First, it is memory
calculus. Journal of Visual Languages and
and bandwidth-consuming for storage and transfer
Computing. Elsevier, 14, 181–211. doi:10.1016/
respectively. Moreover, maps described with it
S1045-926X(02)00057-5
have to be projected and scaled before being plot-
ted. These characteristics make GML not directly Favetta, F., & Aufaure-Portier, M. (2000). About
accessible with small devices. It is necessary to Ambiguities in Visual GIS Query Languages: a
reduce GML size in order to make cartographic Taxonomy and Solutions. Proceedings of the 4th
data accessible from mobile devices. International Conference on Advances in Visual
Another future perspective concerns the repre- Information Systems. LNCS Springer-Verlag Pub-
sentation of dynamic phenomena that change over lications, LNCS 1929, (pp. 154-165).
space and time in GISs. While current GIS can
Favetta, F., & Laurini, R. (2001). About Preci-
provide snapshot views at discrete time intervals,
sion and Integrity in Visual Query Languages
they fall short in providing an ability to link the
for Spatial Databases. Proceedings of the 7th
process models with data from multiple sources
International Conference on Database Systems
or simulate scenarios of change for users.
for Advanced Applications (DASFAA 2001). IEEE
Computer Society, (pp. 286-293).
REFERENCES Ferri, F., Grifoni, P., & Rafanelli, M. (2004).
XPQL: A pictorial language for querying geo-
Blaser, A. D., & Egenhofer, M. J. (2000). A Vi- graphic data. Databases and Expert Systems
sual Tool for Querying Geographic Databases. Applications (DEXA 2004). LNCS 3180, Springer-
Advanced Visual Interfaces – AVI 2000. Palermo, Verlag Publications (pp. 925-935).
Italy: ACM Press (pp. 211-216).

854
Geographic Visual Query Languages and Ambiguities Treatment

Ferri, F., & Rafanelli, M. (2005). GeoPQL: A Geo- Szmurlo, M., Gaio, M., & Madelaine, J. (1998).
graphical Pictorial Query Language that resolves The geographical Anteserver: a Client/Server
ambiguities in query interpretation. Journal of Architecture for GIS. EOGEO’98 workshop.
Data Semantics. LNCS 3534, 3, 50-80. Springer- Salzburg, Austria, (pp. 78-85).
Verlag Publications.
Wessel, M., & Haarslev, V. (1998). VISCO:
Ju, S., Guo, W., & Hernández, H. J. (2003). A Bringing Visual Spatial Querying to Reality.
Card-based Visual Query System for Geographi- Proceedings of the IEEE Symposium on Visual
cal Information Systems. Scandinavian Research Languages. IEEE Computer Society, Halifax,
Conference on Geographical Information Science Canada, (pp. 170-179).
(pp. 62-74).
Kaushik, S., & Rundensteiner, E. (1997).
KEY TERMS AND DEFINITIONS
SVIQUEL: A Spatial Visual Query and Explora-
tion Language, Databases and Expert Systems eXtensible Markup Language (XML): is
Applications (DEXA 1998), LNCS 1460, Springer- a W3C-recommended general-purpose markup
Verlag Publications (pp. 290-299). language for creating special-purpose markup
Lbath, A., Aufaure-Portier, M., & Laurini, R. languages, capable of describing many different
(1997). Using a Visual Language for the Design kinds of data. In other words, XML is a way of
and Query in GIS Customization. International describing data.
IEEE Conference on Visual Information Systems. Geographical Database: A database in which
San Diego, CA. (pp. 197-204). geographic information is store by x-y coordi-
nates of single points or points which identify the
Lee, Y. C., & Chin, F. (1995). An iconic query boundaries of lines (or polylines, which sometimes
language for topological relationship in GIS. In- represent the boundaries of polygons). Different
ternational Journal of geographical. Information attributes characterize the objects stored in these
Systems, 9(1), 25–46. databases. In general the storing structure consists
Meyer, B. (1993). Beyond Icons: Towards New of “classes” of objects, each of them implemented
Metaphors for Visual Query Languages for Spatial by a layer. Often a geographic databases include
Information Systems. International Workshop raster, topological vector, image processing, and
on Interfaces to Database Systems. Glasgow, graphics production functionality.
Scotland, (pp. 113-135). Geographical Information System (GIS):
A computerized database system used for the
Papadias, D., & Sellis, T. (1995). A pictorial query- capture, conversion, storage, retrieval, analysis
by-example language. Journal of Visual Lan- and display of spatial objects.
guages and Computing, 6(1), 53–72. doi:10.1006/ Geography Markup Language (GML): is the
jvlc.1995.1004 XML grammar defined by the Open Geospatial
Rahman, S. A., Bhalla, S., & Hashimoto, T. (2005). Consortium (OGC) to express geographic features.
Query-By-Object Interface for Information Re- GML serves as a modeling language for geographic
quirement Elicitation Implementation. Fourth systems as well as an open interchange format for
International Conference on Mobile Business geographic transactions on the Internet.
(ICMB2005). IEEE Computer Society, Sydney, Icon: Small pictures that represent commands,
Australia, (pp. 667-670). files, objects or windows.

855
Geographic Visual Query Languages and Ambiguities Treatment

Metaphor: Figurative language that creates an Visual Query language: A language that
analogy between two unlike things.  A metaphor allows the user to specify its goals in a two-(or
does not make a comparison, but creates its anal- more)-dimensional way with visual expressions
ogy by representing one thing as something else. - spatial arrangements of textual and graphical
Scalable Vector Graphics (SVG): SVG is a symbols.
language for describing two-dimensional graphics
and graphical applications in XML.

This work was previously published in Handbook of Research on Geoinformatics, edited by Hassan A. Karimi, pp. 340-348,
copyright 2009 by Information Science Reference (an imprint of IGI Global).

856
857

Chapter 53
Multi Depot Probabilistic
Vehicle Routing Problems
with a Time Window:
Theory, Solution and Application

Sutapa Samanta
Maryland State Highway Administration, USA

Manoj K. Jha
Morgan State University, USA

ABSTRACT
Vehicle Routing Problems (VRPs) are prevalent in all large pick up and delivery logistics systems and are
critical to city logistics operations. Of notable significance are three key extensions to classical VRPs: (1)
multi-depot scenario; (2) probabilistic demand; and (3) time-window constraints, which are considered
simultaneously with VRPs in this paper. The issue then becomes a Multi Depot Probabilistic Vehicle
Routing Problem with a Time Window (MDPVRPTW). The underlying complexities of MDPVRPTW
are analyzed and a heuristic approach is presented to solve the problem. Genetic algorithms (GAs) are
found to be capable of providing an efficient solution to the so-called MDPVRPTW. Within the GAs, two
modification operators namely, crossover and mutation, are designed specially to solve the MDPVRPTW.
Three numerical examples with 14, 25, and 51 nodes are presented to test the efficiency of the algorithm
as the problem size grows. The proposed algorithms perform satisfactorily and the limiting case solutions
are in agreement with the constraints. Additional work is needed to test the robustness and efficiency of
the algorithms as the problem size grows.

DOI: 10.4018/978-1-4666-2038-4.ch053

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Multi Depot Probabilistic Vehicle Routing Problems with a Time Window

INTRODUCTION will start from a set of depots to cover larger areas.


Each node or point has a demand, which varies
The Vehicle Routing Problem (VRP) has received stochastically and must be picked up by any of the
widespread attention in recent years because of vehicles. The customers specify a period of time
its applicability in many pickup and delivery in which the pick up must occur. This particular
systems. Multiple depots become necessary for time period is called the time window. The vehicles
systems consisting of larger networks, such as have specific limited capacities. The total demand
for the operation of larger logistics systems of picked up by each vehicle should not exceed its
distribution companies. When the vehicle routing corresponding capacity. The objective is to find
problem involves multiple depots, it is called a out a set of routes formed by the fleet of vehicles
Multi Depot Vehicle Routing Problem (MDVRP). in order to cover these customers in such a way
In such problems the number of vehicles can be that total travel cost becomes minimum while sat-
equal to or exceed the number of depots. In most isfying the capacity and time window constraints.
of the real-life VRPs, demands at the customer
nodes vary due to various factors, such as loca- Previous researchers have approached vari-
tional and temporal seasonal factors. Hence, the ous components of MDPVRPTW separately.
stochasticity has to be considered for the demands The stochasticity in demand, the time window
at the customer nodes. The vehicle routing prob- constraint, and the multidepot scenario is either
lem which considers stochastic demands of the addressed individually or in the combination
customers, is called Probabilistic Vehicle Routing of two in most of the cases. As the number of
Problem (PVRP). Another important criterion variants increase with the introduction of each
for any logistics system is to provide the service component, it becomes difficult to devise an
within a specified time period. Hence, the time analytical approach to obtain an optimal solution.
window concept is associated with the VRPs, This might be the reason for the MDPVRPTW to
which is coined as Vehicle Routing Problem with be unsolved, to date. The earlier works in the area
Time Window (VRPTW). have not addressed all the three components of the
The real life vehicle routing problem with a vehicle routing problem together, but in real life
larger network can have all the issues mentioned problems, they exist together in most cases. The
above, such as the nodes can have variable or sto- attempt in this study is to develop an innovative
chastic demand, the services have to be provided algorithm, which is capable of handling these
within fixed time periods at the nodes and multiple three components simultaneously and is able to
depots are needed to cover the network. Hence all produce reasonable results. The algorithm can be
the three components are important for a vehicle modified and improved further in future; neverthe-
routing problem The proposed MDPVRPTW is less, it provides a good start to handle the different
complex in nature as it combines all the three prob- components of the VRP that make the problem
lems, such as, VRPTW, MDVRP, and PVRP. There relatively complex to solve. The new heuristic-
are many applications of this type of problem like based algorithm to solve the MDPVRPTW is
the pick-up of children by school buses, garbage developed after carefully scrutinizing significant
collection, delivery of laundry, and delivery by earlier works and the underlying complexities
United Postal Services (UPS). The problem in all associated with the problem.
these cases can be described as follows: In the next section, we briefly review the
VRP literature. The third section describes the
A set of customers in an area has to be provided formulation and the proposed methodology to
with a service by a fleet of vehicles. The vehicles solve the MDPVRPTW. The last section presents

858
Multi Depot Probabilistic Vehicle Routing Problems with a Time Window

three numerical examples of growing complexity. require the time window constraint. Kohl and Mad-
Sensitivity of the results on the problem domain is sen (1997) used Lagrangian Relaxation for VRP
examined by relaxing or restricting the time win- with time windows. The demand was considered
dow constraint so that the results can be validated deterministic in these problems. They solved some
by justifying them within the specified boundaries. previously unsolved problems and obtained the
results with competitive computation time. Jung
and Haghani (1999, 2001) solved a vehicle rout-
LITERATURE REVIEW ing problem and a pickup-delivery problem with
time windows. They used a genetic algorithm to
This section reviews various methodologies solve these problems. It worked better than the
developed by the researchers to solve different exact algorithms for smaller problems. Bent and
components of the MDPVRPTW at different Hentenryck (2004) solved VRP with time windows
times. A selective list of literature review has been with a two-stage hybrid local search approach.
presented here. The various studies available in the They tried to minimize the number of routes in
literature can be broadly categorized as follows: the first stage using simulated annealing and tried
to minimize the total travel cost by using a large
• Vehicle routing problem neighborhood search method in the second stage.
• Vehicle routing problem with time window The hybrid algorithm improves or matches with
• Vehicle routing problem with stochastic the solutions of most of the Solomon Benchmark
demand problems (2009) though the approach to minimize
• Multi depot vehicle routing problem the number of vehicles does not guarantee better
• Multi depot vehicle routing problem with results for minimizing the total cost. Cordeau et
stochastic demand al. (2001) attempt to solve vehicle routing prob-
• Vehicle routing problem with time win- lem with time windows using tabu search. Their
dows and stochastic demand method, however, does not seem efficient for all
the cases presented. For example, their approach
The vehicle routing problem is a classical does not necessarily minimize the number of
optimization problem. Jaillet and Odoni (1998) vehicles, and yet they suggest their algorithm to
present a comprehensive survey of some advanced be an efficient one. Two optimization algorithms,
methods used to solve VRPs. The methods include generalized K-tree algorithm and variable splitting
Integer programming, Lagrangian relaxation algorithm were developed by Fisher et al. (1997)
method, genetic algorithm, tabu search, etc. These to solve vehicle routing problems with time win-
methods assume deterministic demand and don’t dows. They were able to solve problems optimally
consider the time window constraint. Laporte et al. for up to 100 customers. Desaulniers et al. (2008)
(2000) presented classical and modern heuristics used tabu search, partial elimentarity and k-path
to solve vehicle routing problems. inequalities to solve the vehicle routing problem
with time windows. Braysy and Gendreau (2005,
Literature Considering Time Window 2005) presented metaheuristics and local search
algorithms for the vehicle routing problems with
Some researchers identify the vehicle routing time windows. Ropke and Cordeau (2009) solved
problem with time windows as a separate class the pickup delivery problem with time windows
of problem. As time window is an important con- by Branch and Cut, and Price method.
straint for most of the vehicle routing problems,
an emphasis is put on optimizing the routes that

859
Multi Depot Probabilistic Vehicle Routing Problems with a Time Window

Literature Considering independent. The author applied Neuro-Dynamic


Stochasticity in Demand programming in order to tackle the complexity of
the problem. While all of the above mentioned
Stochastic demand is an important consideration models were capable of handling stochastic de-
in real-world VRPs. A number of research works mand in VRPs, they did not take the multi depot
involving stochastic demand were found in the case and time window constraint into account.
literature. Jaillet and Odoni (1998) discussed the Aykagan and Erera (2007) solved vehicle rout-
probabilistic vehicle routing problem, its applica- ing problem using a tabu search heuristic. The
tions, and some heuristics to solve it. Bertsimas et uniqueness of their strategy was to use vehicles
al. (1990) solved the vehicle routing problem with in pairs instead of a single vehicle for nodes. Liu
an a-priori optimization strategy. They considered and Lai (2009) solved vehicle routing problem
the demand to be stochastic. They approximated considering uncertain demand at nodes.
the solutions as the problem became complex.
Bertsimas et al. (1992) solved a stochastic vehicle Literature Considering
routing problem in the Euclidean plane. The main the Multi Depot Case
objective of that work was to minimize the total
waiting time, not the total travel cost. The problem Nagy and Salhi (2005) solved a multi depot
involved a queuing phenomena and considered pickup-delivery problem. They found better results
the demand to follow a Poisson distribution. They by using their heuristics over the benchmark pro-
solved a VRP with stochastic demand and proposed cedures available in the literature. Salhi and Sari
two different strategies: (1) a re-optimization (1997) developed heuristics to solve a multi depot
strategy when the demands are known and, (2) vehicle fleet mix problem. The heuristics were
an a-priori strategy when the demand is known also applied on the multi-depot vehicle routing
only when the customer reaches at the node. They problem and produced solutions which were about
fixed the upper and lower boundaries of the de- 1% better than the best solutions available in the
mand based on the general probabilistic demands literature, with lesser computation time. Renaud
and optimized the routes within those limits. The et al. (1996) solved a multi depot vehicle routing
accuracy of the method depends on making the problem with a tabu search algorithm. The algo-
boundaries tight, which is not always guaranteed. rithm clusters the nodes initially and then improves
Dror et al. (1989) and Dror (1993) modeled the solution by intensification and diversification
vehicle routing with stochastic demand. They methods. They obtained better solutions to all 23
handled the problem by applying a stochastic benchmark problems on which the algorithm was
programming model and the Markov Decision employed. Laporte et al. (1988) solved multi-depot
Process model. Gendreau et al. (1995, 1996) vehicle routing and location routing problems with
solved vehicle routing problem with stochastic an exact algorithm and could obtain comparable
demands by an exact algorithm and with a tabu results within relatively short computing time for
search method, and obtained satisfactory results. the large multi depot vehicle routing problems of
Gendreau et al. (1996) presented a comprehensive upto 80 nodes. Tangiah and Salhi (2001) solved a
review of stochastic vehicle routing problems and multi depot vehicle routing problem using genetic
summarized selected cases. Secomandi (2000) clustering. In the optimal solution they obtained
solved a vehicle routing problem with stochastic lesser number of vehicles but larger travel times
demand by formulating it as a Markov decision for most of the problems on which they employed
problem. The demand distribution was known the developed algorithm. Crevier et al. (2007)
beforehand and it was assumed to be discrete and solved a multi depot vehicle routing problem in

860
Multi Depot Probabilistic Vehicle Routing Problems with a Time Window

which vehicles may be replenished at intermedi- demand, and the time window constraint, the
ate depots, using a tabu search algorithm. The need to develop a heuristic to solve real-world
method is shown to be robust and fast enough for MDPVRPTWs within a reasonable computation
larger problem sizes. Wu et al. (2002) provided time arises.
heuristics solutions for multidepot location routing Baker and Ayechew (2003) solved vehicle
problems. They developed an approximate reason- routing problems with a GA and proved that the
ing algorithm and improved sweeping algorithm method is superior to Simulated Annealing or tabu
to solve the problem. search methods. Chakroborty and Mandal (2005)
obtained comparatively good results efficiently
Literature on the Simultaneous while solving single vehicle routing problems
Consideration of Multi with a GA. Ombuki et al. (2006) solved a vehicle
Depot and Stochasticity routing problem with time windows using a multi-
objective GA. They used Pareto-optimal ranking
Chan et al. (2001) solved the problem of multi to determine the number of vehicles to minimize
depot vehicle routing and location routing with sto- the cost. The obtained results were superior to the
chastic demand by formulating it as a binary integer best solutions reported by other researchers. Tong
program. They used space filling curves (SFC) et al. (2004) solved vehicle routing problem with
for the stochastic model and the Clark-Wright time window and unknown number of vehicles
Savings method for the deterministic instances using a GA and proved that GAs can efficiently
of the problem. They provided a validated way search for the optimal solution. Hwang (2002)
to estimate the probable demand using a queuing developed a Graphical User Interface (GUI)-
network. They did not however, incorporate the program based on a GA to solve the Traveling
time window constraint in the VRP. Salesman Problem (TSP) and obtained efficient
results for problems containing small and large
Literature on the Simultaneous number of nodes. Berger and Barkaoui (2004)
Consideration of Time developed a new hybrid genetic algorithm to solve
Window and Stochasticity VRPs with time windows. Though the algorithm
does not seem be any superior to other established
There are limited studies reported that considered heuristics, the obtained results were comparable to
time windows and stochastic demand in VRPs. those obtained from other heuristics. Alvarenga et
Bramel and Simchi-Levi (1996) solved a VRP with al. (2007) proposed a two-phase approach using a
time windows and stochastic demand. Chan et al. GA and a set partitioning method for a VRP with
(2001) solved a vehicle routing and scheduling time window. The method was proved to perform
problem with time windows and stochastic demand better than other established heuristics or exact
using a tabu search based heuristic algorithm. algorithms. The above literature review makes the
Chen et al. (2006) developed a heuristic to solve GA a viable candidate to solve the MDPVRPTW.
the vehicle routing problem with time windows
for a real time and time dependent travel time and
a real time demand scenario. MODEL DEVELOPMENT
From the literature review, it is observed that
there are no studies available, which cover all The MDPVRPTW problem can be formulated as a
the components of MDPVRPTW. As most of the Mixed Integer Programming (MIP) and Constraint
practical vehicle routing problems contain all three Programming (CP) problem. Since MIP and CP
components, multi depot condition, stochastic problems are non-convex, they can be solved

861
Multi Depot Probabilistic Vehicle Routing Problems with a Time Window

either by exact solution methods or by some kind M = {1,2,….m} set of vehicles


of systematic exhaustive search algorithm. For a (ai, bi) = Time window for customer node i
MIP, memory and solution time increase exponen- qi = Demand of customer i
tially with the increase in integer variables. In this Qm= Capacity of vehicle m
paper, a genetic algorithm (GA) based heuristic is cij = Cost per unit travel distance
proposed for solving the MDPVRPTW problem Dmax= Maximum distance which the vehicles may
as unlike many conventional search methods, cover in a tour
GAs are not restricted by such assumptions as Coordinates of the nodes
linearity, convexity, and differentiability (Jong &
Schonfeld, 2003; Kim et al., 2005). GAs search Notations for decision variables:
from a population rather than a single point, reach-
ing to a globally optimum solution eventually. 1, if arc ij is part of route k
GAs generate candidate solutions which satisfy x ijk = 
0, otherwise
the integer constraints. Initially the solutions are 
far from optimal, but they transform the existing
solutions into new candidate solutions through
operators, such as crossover and mutation, which dij = Distance between customer i and j
improve the solution over successive search gen-
C w = Penalty for the early arriving at the cus-
erations. Thus, GAs handle far more variants in
a larger and complex search space than the clas- tomer node i
sical optimization algorithms, thereby reducing C d = Penalty for late arriving at the customer
the chance of earlier convergence of the solution. node i
The objective of developing the GA-based ti = Actual arrival time at node i
heuristic is to obtain an optimal solution which wi = Waiting time at the customer node i
consists of a number of routes traveled by a number di = Delay time at the customer node i
of vehicles within a fixed time window. The problem q(m) = Total demand picked up by vehicle m
includes more than one depot. Each vehicle has a pi = Probability value of the demand at node i
given capacity. The demands at the customer nodes
are considered to be stochastic in nature. The total expected cost of the routes formed
from multiple depot is calculated considering
Mathematical Formulation factors, such as stochastic demand, time window,
of the Problem and shortest routes.

As noted earlier, the problem is formulated as a Multiple Depots


mixed-integer programming and constraint pro-
gramming problem. The objective function is to It is assumed that the depots are located around
minimize the total travel cost with the constraints the customer nodes such that the customers should
of time windows and vehicle capacity. The nota- be efficiently served. Depending on the stochastic
tions are given as follows: demand and time window constraints for pickups
and deliveries, the natural tendency is to assign
Notations for parameters: vehicles to customer nodes from the nearest depot.
Hence the nearest neighborhood method has been
N = {1,2,…..n} set of customers used to assign the customer nodes to the depot.
K = {1,2, ….k} set of depots

862
Multi Depot Probabilistic Vehicle Routing Problems with a Time Window

But, distance is not the only governing factor for demand (Equation 2b below) distributed over the
the assignment; demands of the customer nodes time of a day is shown in Figure 1. The probability
and the capacities of the vehicles play important factor (pi) of demand at any customer node over
roles during the assignment. The customer nodes the time period of a day is given by
are divided in the following two sets during the
assignment. pi = ps ⋅ qi (2)

U: Set of nodes allocated to other depots


where,
V: Sets of available nodes to be allocated to the
kth depot
f (z )
U ∪V ⊆ N ; ps = ; 0 ≤ k <1 (2a)
A k : {q }, q ∈ V , k ∈ K ; Ak represents ∑ f (z )
∀z
the set of nodes allocated to the kth depot.
V where,
B k: k , k ∈ K ; Bk represents the set of
Ak
rest of the nodes available for the allocation u
−(z −T /2)

to the (k+1)th depot. f (z ) = × m; 0 < u < 1 (2b)


u −T /2
Travel Time Cost
The demand varies over the days of a week
The shortest route set is developed using the GA- (Figure 2). The demands for the weekdays are as-
based heuristic discussed in Section 3.4. The travel sumed to be more than those on weekends. The
time cost is calculated by multiplying the total probability factor (pk ) is determined from Figure
travel time by the unit travel time cost for each 2 depending on the day of a week the vehicle ar-
route formed. Total travel time cost is estimated rives at a particular node. The functions for ps and
for all the routes and for all the depots pk are designed randomly for this study and is
replaceable by another function as specified by the
Ttotal = ∑∑ ∑ cij ⋅ dij ⋅ x ijk (1) user. The expected value of the total cost of the
k ∈M i ∈N ∪K j ∈N ∪K
route due to the stochastic demands is given by

Cost Due To Stochastic Demand


Figure 1. Function showing the distribution of
probability factor of demand over the time of a day
We assume that the exact demand at the customer
nodes is known only when the vehicle arrives at
the node. The demand at a customer node varies
stochastically for various reasons, e.g., the time
in a day or the day in a week when the vehicle
is arriving at the customer node, the proximity
of the node to important activity areas, etc. A
vehicle will decide to pick up the delivery from
the customers depending on the stochastic demand
at the node and available capacity of the vehicle.
The function depicting the probability factor of

863
Multi Depot Probabilistic Vehicle Routing Problems with a Time Window

Figure 2. Probability factor (pk) for demand over


where,
a week
q(m ) = ∑ piqi (7)
i ∈N

∑ ∑ dij ⋅ x ijk ≤ Dmax (k ∈ M ) (8)


i ∈N ∪K j ∈N ∪K

∑∑ x ijk = 1 (j ∈ N ) (9)
k ∈M i ∈N ∪K

∑∑ x ijk ≥ 1 (j ∈ K ) (10)
C (x s ) =
k ∈M i ∈N ∪K
∑ ∑ pi ⋅ dij ⋅ x ij (3)
i ∈N ∪K j ∈N ∪K

∑ x ijk = ∑ x jik (j ∈ N ∪ K , k ∈ M )
i ∈N ∪K i ∈N ∪K
Penalty Due to the Time Window (11)
Violation
w i ≥ a i − ti (i ∈ N ) (12)
The customer deliveries have to be picked up
from a particular node within a time window.
Penalties are imposed for both the actions of early di ≥ ti − bi (i ∈ N ) (13)
arriving and/or late arriving from the specified
time window. The objective (Equation 5) is to minimize the
The cost incurred due to the early arrival or total travel cost embedded with the penalty for
delay at ith node is given by: the delay and waiting at the node. Constraints (6)
and (7) are the maximum capacity of the vehicles
C (Tx ) = C w × wi or , = C d × di (4) and the maximum distance (Secomandi, 2000),
respectively. Equation (8) represents the total
The objective function and constraints are demand which consists of probabilistic values
given as follows: of the customer demand. Constraints (9) and (10)
impose that every customer belongs to one and
Minimize only one route or vehicle.
Z =∑ ∑ Constraint (11) implies that every customer
∑ C ij ⋅ dij ⋅ x ijk + C w × ∑ wi
is serviced by the same vehicle. Constraints (12)
k ∈M i ∈N ∪K j ∈N ∪K i

+ C d × ∑ di + ∑ ∑ ps ⋅ dij ⋅ x ij and (13) are the time window constraints (Jung
i i ∈N ∪K j ∈N ∪K
& Haghani, 2001).
(5)
Upper and Lower Boundaries
subject to of the Objective Function

q(m ) ≤ Qm (m ∈ M ) (6) The boundaries of the objective function are


defined based on the time windows assigned to

864
Multi Depot Probabilistic Vehicle Routing Problems with a Time Window

the customer nodes. The upper boundary of the when


objective function, i.e., the maximum value of
the cost is estimated by relaxing the time window ti ⊂ (ai , bi )
constraint.
Proposition 1: The total expected cost, i.e.,
Proof: The time window is maintained most
the value of the objective function (Z), will be
stringently; the vehicles arrive within the time
maximum when none of the nodes satisfy the
window at all customer nodes. Hence, the delay
time window constraint.
and waiting time become 0, which leads the total
cost to be minimum, other constraints remaining
Z u = Z max
constant.

when (di + wi ) = 0;

ti ⊄ (ai , bi )
as

Proof: As the time window constraint is ti ⊂ (ai , bi ) ;


relaxed, the vehicles may arrive at the customer
nodes anytime in which case, the arrival time
Hence,
might not lie within the upper and lower limit
of the arrival times. Hence, the delay time and
C (Tx ) ≤ C (Tx )* ,
waiting time become maximum which leads the
cost due to delay or early arrival to be maximum.
as
C (Tx ) ≥ C (Tx )* ,
(di + wi ) ≤ (di + wi )* → Z ≤ Z * ,
as
where (⋅) corresponds to the optimum value,
*

(di + wi ) ≥ (di + wi ) → Z ≥ Z ,
* *

which is minimum total cost, in this case. This


particular condition of the arrival time of the
where (⋅) corresponds to the optimum value,
*
vehicle yields the lower boundary ( Zl ) for the
which is minimum total cost, in this case. The total expected cost for MDPVRPTW, expressed
specific case of arrival of the vehicles without by Z * ≥ Zl .
considering the time window constraint yields the
upper boundary ( Z u ) for the total expected cost Assumptions
for MDPVRPTW, expressed by Z * ≤ Z u .
Two important assumptions to the proposed model
Proposition 2: The lower boundary of the
are as follows:
objective function is estimated by satisfying the
time window constraint for all the nodes.
• No individual demand is greater than the
vehicle capacity.
Zl = Z min

865
Multi Depot Probabilistic Vehicle Routing Problems with a Time Window

• One node can only be assigned to one results. In the proposed algorithm the modified
depot. genetic operators used are reproduction, crossover
and mutation. The steps of the algorithm can be
The GA-Based Heuristic summarized as:

A GA-based heuristic is developed to solve the 1. Generation of initial set of solutions


above MDPVRPTW problem. The overview of 2. Evaluation
the heuristic is shown by a flow-chart in Figure 3. Modification
3. GA is a heuristic search algorithm based on
the principle of natural selection and genetics. It Two special modification operators, namely
has been extensively applied in many engineering Crossover-mutation and Mutation-crossover are
and logistics applications involving optimization. developed for the modification scheme, to suit
The proposed algorithm consists of the following the nature of the problem.
steps. First, initial set of solutions are generated.
Then they are modified to produce improved

Figure 3. Overview of the proposed methodology

866
Multi Depot Probabilistic Vehicle Routing Problems with a Time Window

1. Generation of initial set of solutions: The Nearest Neighborhood Method


first step presents the procedure for gen-
erating the initial solutions. First a proper The nearest neighborhood method is the simplest
representation scheme is designed for the method used to order the nodes. In this method,
solution of the problem. Each solution set once a node is selected, the node which has the
will consist of a number of routes equal to minimum travel time from the previously selected
the number of vehicles. A typical solution node is selected as the next node from the set of
set (nth population) will be represented as unselected nodes.
the following: As for example, if there are 4 nodes and node
2 is selected randomly as the first node to form
M X1 D1A B C D1 X2 D2E F D2 X3 D3G H a route. The travel times from node 2 to node
I….D3 …… XMDM P Q R DM 1, node 3 and node 4 have to be evaluated. The
node with the least travel time is node 4 and will
where, be selected as the next node. The ravel time from
node 4 to node 1 and node 3 will be evaluated and
M = Number of routes or vehicles the least time is for mode 3 and will be chosen
K = number of depots to be the next node. Hence the route formed by
Xi = number of nodes for ith route traveled by ith Nearest Neighborhood method will be 2-4-3-1.
vehicle
Dk = kth depot Step 3: The deterministic value of the demand is
A, B, C are the customer nodes visited by the first known. The probabilistic demand is calcu-
vehicle from first depot lated based on the probability distribution
E, F are the customer nodes visited by the third function. The nodes are associated with the
vehicle from second depot stochastic demands.
G, H, I are the customer nodes visited by the third Step 4: As a vehicle keeps on adding the nodes,
vehicle from third depot the capacity of the vehicle gets exhausted
P,Q,R are the nodes visited by the Mth vehicle gradually by adding up the demands of the
from Mth depot picked up nodes.
Step 5: Once the capacity of the vehicle is reached,
For the present study, the number of vehicles i.e., it can no longer serve any more nodes,
is considered to be the same as the number of it returns to the depot. The time window
depots, which is M = K. constraint has to be validated while forming
The steps for constructing the initial feasible these routes. If the total time for the route
solution are given below: set does not fall within the time window, the
total delay time or waiting times are calcu-
Step 1: Initial feasible solution set will consist of lated and associated with the evaluation of
the routes formed by the vehicles from the the solution set.
depots. Vehicles of varying capacities are Step 6: Repeat steps 1 to 5 for the remaining de-
assigned to each of the depots randomly. pots using the method mentioned for multi
Different sets of solutions are generated. depot condition in section 3.1.2.
Step 2: Each vehicle starts from a depot. It picks
up the nodes one by one following a nearest By following the above steps, one initial set
neighborhood method. of solution is generated. Similarly, k initial set of
solutions are generated randomly, given by:

867
Multi Depot Probabilistic Vehicle Routing Problems with a Time Window

Si : nodes, of which, 3 is the depot node and 11, 2, 7,



{m x i 1 D1Ai BiC i D1x i 2 D2 Ei Fi D2 x i 3 D3 Gi H i I i D3 } 8 and 13 are the nodes traveled by 2nd vehicle
starting from depot node 3 and returning to the
same depot. The 3rd route consists of 5 nodes of
Sj : which 14 is the depot node and the nodes traversed

{m x j 1 D1Aj B jC j D1x j 2 D2 E j Fj G j D2 x j 3 D3 H j I j D3 } are 12 and 5.

2. Evaluation: After generation of the initial so-


Sk : lution sets, the fitness values of the solutions

{m x k 1 D1Ak BkC k D1x k 2 D2 Ek Fk D2 x k 3 D3 Gk H k I k D3 } are determined. Fitness value is calculated
by the total travel time cost and the penalty
values incurred due to the waiting and delay
The initial set of solutions is represented by: time while picking up the deliveries at the
customer nodes. The fitness function is given
by Equation (11).
I : {S i , S j ,...Sk }

A sample problem has been presented to TC = ∑ ∑ dij × cij + C w × ∑ wi


demonstrate the mechanism of different steps of (11)
the algorithm. A problem of 14 nodes is shown + C d × ∑ di + ∑ ∑ pi ⋅ dij ⋅ qi
in Figure 4. An initial set of routes (Figure 5) for
the problem may be represented as,
3. Modification: The initial set of solutions is
3 5 1 4 6 9 10 1 6 3 11 2 7 8 13 3 312 5 14 14 modified to result into improved solutions.
Various modification schemes such as,
where, the first route is given by 1 4 6 9 10 1 Reproduction, Crossover and Mutation are
where, the depot node is 1 and the 1st vehicle used for this purpose. Two unique operators
traverses 4 nodes which are 4, 6, 9 and 10 other are designed specially in accordance with
than the depot 1. The second route consists of 6 the nature of the problem. The operators are
i) Crossover-mutation, which is applied at

Figure 4. Position of the customer nodes and the


depots shown in the problem
Figure 5. An example of initial route set

868
Multi Depot Probabilistic Vehicle Routing Problems with a Time Window

the end of the crossover and ii) Mutation- route of another solution. The
crossover, which is applied during mutation. exchange takes place between
a. Reproduction: Tournament Selection the randomly chosen sub routes
is applied for the reproduction opera- of two route sets.
tion. The intent of this operation is to Step 3: The selected segments are ex-
duplicate good solutions and eliminate changed provided the constraint
bad ones from the population. The steps of vehicle capacity does not get
to apply this operator are given below: violated in either case.
Step 1: The value of the tournament
selection of size is assumed to The process is described below:
be p for this purpose.
Step2: The m better solutions are cho- Sm : {m x m 1 D1Am BmC m D1x m 2 D2 Em Fm D2 x m 3

sen from the initial set of solutions D3Gm H m I m D3 }
based on the fitness Step function
values. The m selected solutions
are copied n times to develop Sn : {m x n 1 D1An BnC n D1x n 2 D2 En Fn D2 x n 3
the mating pool of population
D3 Gn H n I m D3 }
size mxn.
Step 3: From mxn population size, p
solutions are selected.
Step 4: The best of p solutions are The parts to be exchanged are selected ran-
selected and copied p times and domly from the two sets of solutions chosen for
kept in the mating pool. Thus, the the crossover, which are given as:
new mating pool will consist of
 m × n  pm : {BmC m } ⊂ Sm
 
 p  sets of solutions, each
of which contain p number of pn : {BnC n } ⊂ Sn
solution sets.
b. Crossover: Crossover takes place in
After the crossover the new sets of locations
a set of two solutions. New solutions
generated are:
are created by exchanging the segments
of any two solutions from the mating
Sm' : {m x m 1 D1Am BnC n D1x m 2 D2 Em Fm D2 x m 3
pool. So, there will be p number of D3 Gm H m I m D3 }

2
crossover operations in a mating pool
of p solutions. The steps to apply the
crossover operator are given below: Sn' : {m x n 1 D1An BmC m D1x n 2 D2 En Fn D2 x n 3
Step 1: Two solutions are picked up
D3 Gn H n I m D3 }
randomly.
Step 2: A few nodes of one route of
one solution set are selected
randomly and interchanged with Crossover-mutation operator: A new operator
the same number of nodes of a called Crossover-Mutation operator is de-
signed for the modification scheme while

869
Multi Depot Probabilistic Vehicle Routing Problems with a Time Window

solving the MDPVRPTW with the GA heu- Q ~ [β ]


ristic. It is a post-crossover operator. After the
nodes of the sub routes of two solutions get and
exchanged in the crossover process, the sub
routes except the participating sub routes in P ∪ [α ] → [{Smp ~ [β ]} ∪ [α ]] ∩ [{Svq ~ [α ]} ∪ [β ]
the crossover get mutated in order to satisfy
the constraint that each node can be visited
only once. The already existing nodes which For example, we consider two populations
become identical with the newly imported and after the crossover the two offspring (Figure
nodes due to crossover are eliminated and 6) obtained are:
the exported nodes are appended to the
same sub route. Thus, a modified sub route Population 1: 3 6 1 11 12 8 9 6 1 5 3 4 10 13 7
is generated due to the sudden change made 3 3 3 14 5 2 14
after the crossover. Population 2: 3 7 1 4 6 9 10 13 8 1 5 3 12 11 7
2 3 2 14 5 14
Offspring 1: 3 6 1 4 10 8 9 6 1 5 3 12 11 13 7 3
T
P : Sp  3 14 5 2 14
 
Offspring 2: 3 7 1 6 9 13 8 12 11 1 5 3 4 10 7 2
3 2 14 5 14
T
Q : S q 
 
c.. Mutation: Application of the mutation op-
erator is an occasional phenomenon where
[α ] : {α1, α2 ,...αm } ⊆ Snq random modification takes place within the
solution. A few of the nodes of one route are
and exchanged with some other nodes of another
route in the same solution set. A completely
[α ] : {α1, α2 ,...αn } ⊆ Snq new route set is developed by this process.

Parent:
[β ] : {β1, β2 ,...βn } ⊆ Svq Sn :
{m x n 1 D1An BnC n D1x n 2 D2 En Fn D2 x n 3 D3Gn H n I m D3 }

and Offspring:
Snnew :
{m x n 1 D1An BnnewC n D1x n 2 D2 En Fn D2 x n 3 D3 Gn H n I m D3 }
[β ] : {β1, β2 ,...βm } ⊆ S
p
u

In the above mentioned example in the first


The crossover is explained as follows:
route node Bn has been replaced by a new node,
P ~ [α ] Bnnew which gives rise to a new route set.

Mutation-Crossover Operator
and
The mutation takes place through intra-solution
P ∪ [β ] → [{Smp ~ [α ]} ∪ [β ]] ∩ [{Sup ~ [β ]} ∪ [α ] crossover, hence the name of the operator is

870
Multi Depot Probabilistic Vehicle Routing Problems with a Time Window

Figure 6. Development of new route sets by


Population 1: 3 6 1 8 13 10 4 7 1 5 3 9 6 5 2 3 4
crossover operator
14 12 11 14
Offspring: 3 5 1 8 9 6 7 1 6 3 13 10 4 5 2 3 4
14 12 11 14

The new solution sets are evaluated. The al-


gorithm is run for a number of generations until
the improvement in objective function becomes
negligible. The optimum set of routes which in-
curs the minimum cost is obtained. The program
terminates either when the solution converges or
the algorithm runs for a pre-specified number of
generation gmax.

ANALYTICAL RESULTS

Three artificial instances are generated randomly


and the heuristic is applied on them. The coordi-
nates and the demand of the nodes are generated
proposed as mutation-crossover. The nodes of randomly for the first two instances of 14 and
two sub routes within a solution are exchanged 25 nodes (P14 and P25), respectively. The third
to give rise to two new sub routes. The exchange
of information does not take place between two Figure 7. Development of a new route set by
solutions; rather it remains restricted within the mutation operator
same solution. The assignment of the nodes to
the depots changes within the same solution. So,
the operator can be presented as a special type of
mutation operator.

T
P : S 
 

[α ] : {α1, α2 ,...αm } ⊆ Sm ;

[β ] : {β1, β2 ,...βn } ⊆ Sn

P ' : [{Sm ~ [α ]} ∪ [β ]] ∩ [{Sn ~ [β ]} ∪ [α ] ]

Figure 7 shows an example of mutation and the


representation of the new chromosomes, which
are given as follows:

871
Multi Depot Probabilistic Vehicle Routing Problems with a Time Window

instance of 51 nodes (P51) is a modification from Population size for GA=30x2


the problem Eil51 (capacitated vehicle routing A - indicates data not applicable
problem) obtained from TSPLIB95 (TSPLIB
2001). The single depot problem has been modi- The optimal travel times obtained for MDVRP,
fied to a multi depot problem by assigning a few MDPVRP and MDPVRPTW for the 3 instances
nodes as depot randomly. The demands of the are given in Table 5. Figures 8a through 8c show
depot nodes have been changed to zero for the the results obtained for the P14 problem for the
problem while the demands of the other nodes three cases, such as, MDVRP, MDPVRP and
and the coordinates of the other nodes remain MDPVRPTW. Similarly, Figures 9a through 10c
the same. The heuristic is run 10 times on each of show the results for the P25 and P51 problems,
the instances. The description of the problems is respectively.
given in Tables 1 through 3, respectively. The time For all the three instances the lower boundar-
windows are set based on the near-optimal travel ies of the objective functions are set by establish-
times obtained for MDPVRP. The input values ing the time windows in such a way that all the
used for the three instances are shown in Table 4. nodes satisfy the time window constraints. Hence,
The input parameter values used in the algo- there is no need for adding any penalty to the
rithm are given below: total cost due to waiting or delay at the correspond-
ing nodes.
Vehicle speed=60 mph On the other hand, the coefficients of waiting
Penalty coefficient or waiting and delay param- (C w ) and delay (C d ) are set to very low values
eters (time window) Cw=0.2 and Cd=0.1 which are 0.002 and 0.001, respectively, to obtain
Parameter values for the probability function for the upper boundaries of the objective functions.
stochastic demand m=10, u=0.85 Setting the coefficients to very low values allow
Number of generations for GA=100 the vehicles to travel with the violation of the time

Table 1. Input values for P14 (14-node problem)

Node Type of node Coordinates of the node Demand (qi) Time window
(AM)
1 Depot 1.5, 8 - -
2 Customer 2.5, 10 30 9:54-10:02
3 Depot 4.5, 11 - -
4 Customer 8, 10.5 10 9:34-9:39
5 Customer 12, 11.5 00 10:16-10:20
6 Customer 6, 8.5 20 9:42-9:46
7 Customer 4.5, 7 30 9:45-9:52
8 Customer 6.5, 5 20 9:25-9:30
9 Customer 7.5, 7.5 30 9:31-9:36
10 Customer 10, 7.5 20 10:12-10:15
11 Customer 3, 2 10 9:10-9:14
12 Customer 5, 1 20 9:15-9:20
13 Customer 11, 4 10 10:03-10:08
14 Depot 8, 5 - -

872
Multi Depot Probabilistic Vehicle Routing Problems with a Time Window

Table 2. Input values for P25 (25-node problem) Table 3. Input values for P51 (51-node problem)

Node Type of Coordinates Demand Time Node Type of Coordinates Demand Time
node of the node (qi) window node of the node (qi) window
(AM) (AM)
1 Customer 2, 2 10 9:07- 1 Customer 37, 52 7 10:32-
9:12 10:36
2 Customer 4, 1 15 9:10- 2 Customer 49, 49 30 9:20-9:25
9:14
3 Customer 52, 64 16 10:02-
3 Customer 6, 2 20 9:00- 10:08
9:05
4 Customer 20, 26 9 10:48-
4 Depot 9, 1 - - 10:55
5 Customer 10, 2 20 9:04- 5 Customer 40, 30 21 9:18-9:23
9:10
6 Customer 21, 47 15 9:48-9:55
6 Customer 1, 4 10 9:03-
7 Customer 17, 63 19 9:25-9:32
9:09
8 Customer 31, 62 23 9:05-9:10
7 Depot 3, 4 - -
9 Customer 52, 33 11 9:44-9:50
8 Customer 5, 4 12 9:05-
9:08 10 Customer 51, 21 5 9:34-9:38
9 Customer 9, 4 15 9:01- 11 Depot 42, 41 - -
9:05
12 Customer 31, 32 29 9:40-9:46
10 Customer 7, 5 10 9:07-
13 Customer 5, 25 23 10:05-
9:10
10:14
11 Customer 11, 5 20 9:05-
14 Customer 12, 42 21 9:19-9:25
9:07
15 Customer 36, 16 10 10:20-
12 Customer 7, 6 16 9:04-
10:28
9:11
16 Customer 52, 41 15 9:35-9:44
13 Depot 8, 6 - -
17 Customer 27, 23 3 10:40-
14 Customer 5, 7 10 9:10-
10:45
9:17
18 Customer 17, 33 41 9:27-9:36
15 Customer 2, 8 15 9:05-
9:12 19 Customer 13, 13 9 10:37-
10:44
16 Customer 10, 8 10 9:17-
9:23 20 Depot 57, 58 - -
17 Depot 4, 10 - - 21 Customer 62, 42 8 9:23-9:30
18 Customer 6, 10 12 9:01- 22 Customer 42, 57 8 10:22-
9:05 10:29
19 Customer 11, 10 20 9:22- 23 Customer 16, 57 16 9:31-9:36
9:27
24 Depot 8, 52 - -
20 Customer 5, 11 10 9:02-
25 Customer 7, 38 28 9:45--9:53
9:06
26 Customer 27, 68 7 9:13-9:18
21 Customer 10, 11 - -
27 Customer 30, 48 15 10:45-
22 Customer 2, 12 00 9:10-
10:52
9:14
28 Customer 43, 67 14 10:10-
23 Customer 8, 12 15 9:05-
10:18
9:10
29 Customer 58, 48 6 9:16-9:24
24 Customer 7, 13 10 9:03-
9:08 30 Customer 58, 27 19 9:40-9:50
25 Customer 11, 13 20 9:18- 31 Depot 37, 69 - -
9:25
continued on following page

873
Multi Depot Probabilistic Vehicle Routing Problems with a Time Window

Table 3. Continued Table 4. Input parameter values

Node Type of Coordinates Demand Time Problem Number of vehicles Vehicle


node of the node (qi) window in the fleet capacity
(AM)
P14 3 80
32 Customer 38, 46 12 9:35-9:42
P25 5 80
33 Customer 46, 10 23 10:08-
P51 8 95
10:16
34 Depot 61, 33 - -
35 Customer 62, 63 17 9:55-9:62 Figure 8. a. Near-optimal route sets for MDVRP
36 Customer 63, 69 6 9:35-9:43 for P14 b.Near-optimal route set for MDPVRP for
37 Customer 32, 22 9 10:34- P14 c. Near-optimal route set for MDPVRPTW
10:38
for P14
38 Customer 45, 35 15 10:03-
10:08
39 Customer 59, 15 14 9:55-10:05
40 Customer 5, 6 7 10:02-
10:08
41 Depot 10, 17 - -
42 Customer 21, 10 13 9:44-9:50
43 Customer 5, 64 11 9:04-9:12
44 Customer 30, 15 16 10:27-
10:34
45 Depot 39, 10 - -
46 Customer 32, 39 5 9:56-10:12
47 Customer 25, 32 25 9:33-9:41
48 Customer 25, 55 17 9:35-9:45
49 Customer 48, 28 18 9:49-10:00
50 Customer 56, 37 10 9:27-9:38
51 Depot 30, 40 - -

Table 5. Results for the instances

Problem Number of depot Travel time (min) Travel time Travel time
MDVRP) (min) (min)
(MDPVRP) (MDPVRPTW)
P14 3 57.60 54.64 81.85
P25 5 90.68 86.69 113.69
P51 8 804.85 827.95 903.97

874
Multi Depot Probabilistic Vehicle Routing Problems with a Time Window

Figure 9. a. Near-optimal route sets for MDVRP Figure 10. a. Near-optimal route sets for MDVRP
for P25 b. Near-optimal route set for MDPVRP for P51 b.Near-optimal route set for MDPVRP for
for P25 c. Near-optimal route set for MDPVRPTW P51 c.Near-optimal route set for MDPVRPTW
for P25 for P51

window constraints as much as possible and hence function from the initial solution sets by 22.7%
incurring larger penalty cost, which is evaluated for the P14 problem, 15.7% for the P25 problem,
using the original values of the coefficients after and 18.8% for the P51 problem. The average
formation of the final route set. The final objective computational time for all three examples was 30
function values and the lower limit (LL) and up- minutes.
per limit (UL) of the objective function are shown
in Table 6.The final values of the objective func-
tions remain within the boundaries for all the three CONCLUSION
instances. Hence, they validate the optimality
conditions for the instances randomly generated This paper presented a new heuristic for solv-
for the study. The heuristic improves the objective ing the multi depot probabilistic vehicle routing

875
Multi Depot Probabilistic Vehicle Routing Problems with a Time Window

Table 6. Lower and Upper boundaries of the


ACKNOWLEDGMENT
objective functions

Problem LL UL Objective This work was completed at the Center for


function value Advanced Transportation and Infrastructure
P14 54.64 104.50 81.85 Engineering Research (CATIER) at the Morgan
P25 86.69 135.31 113.79 State University.
P51 827.95 1117.85 903.97

REFERENCES
problem with time window (MDPVRPTW). The
MDPVRPTW represents a real-world situation Alvarenga, G. B., Mateus, G. R., & Tomi, G.
and has many applications, such as the pick-up (2007). A genetic and set partitioning two-phase
of children by school buses, garbage collection, approach for the vehicle routing problem with time
delivery of laundry, and delivery by the United windows. Computers & Operations Research,
Postal Service (UPS). Past studies only solved 34, 1561–1584. doi:10.1016/j.cor.2005.07.025
various components of MDPVRPTW separately Aykagan, A., & Erera, A. L. (2007). A paired-
since it was difficult to devise a solution procedure vehicle recourse strategy for the vehicle-routing
for a vehicle routing problem by simultaneously problem with stochastic demands. Transpor-
considering multi-depot, probabilistic demand, tation Science, 41(2), 222–237. doi:10.1287/
and time window constraints. In our work pre- trsc.1060.0180
sented here we were able to develop a customized
genetic algorithm based heuristic to solve the Baker, B. M., & Ayechew, M. A. (2003). A ge-
MDPVRPTW. We think this will open up the pos- netic algorithm for the vehicle routing problem.
sibility for the implementation of the developed Computers & Operations Research, 20, 787–800.
approach in many real-world applications. doi:10.1016/S0305-0548(02)00051-5
The GA-based heuristic exhibits robust nature Bent, R., & Hentenryck, P. V. (2004). A two-stage
and efficiency in terms of computing time for the hybrid local search for the vehicle routing problem
MDPVRPTW problem. The optimal objective with time window. Transportation Science, 38(4),
function values lie within the specified limits for 515–530. doi:10.1287/trsc.1030.0049
all the three instances. Hence, we can infer that
the proposed heuristic provides a feasible solu- Berger, J., & Barakoui, M. (2004). A parallel
tion. While there is no rigours proof to show the hybrid genetic algorithm for the vehicle routing
optimality of the solutions, it is at least intuitively problem with time windows. Computers & Op-
evident that the solutions are optimal as they have erations Research, 31, 2037–2053. doi:10.1016/
been compared in their limiting cases with classical S0305-0548(03)00163-1
VRPs by relaxing the time-window and stochastic Bertsimas, D. J. (1992). A vehicle routing problem
restitutions. It is also found that on the average, with stochastic demand. Operations Research,
there is about 20% improvement from the initial 40(3), 574–585. doi:10.1287/opre.40.3.574
solution sets for the three instances. Additional
tests need to be performed to test the efficiency Bertsimas, D. J., Jaillet, P., & Odoni, A. R. (1990).
of the heuristic on larger problems. A priori optimization. Operations Research, 38(6),
1019–1033. doi:10.1287/opre.38.6.1019

876
Multi Depot Probabilistic Vehicle Routing Problems with a Time Window

Bramel, J., & Simchi-Levi, D. (1996). Probabi- Desaulniers, G., Francois, L., & Ahmed, H. (2009).
listic analyses and practical algorithms for the tabu search, partial elimentarity, and generalized
vehicle routing problem with time windows. Op- K-path inequalities for the vehicle routing prob-
erations Research, 44(3), 501–509. doi:10.1287/ lem with time windows. Transportation Science,
opre.44.3.501 42(3), 387–404. doi:10.1287/trsc.1070.0223
Braysy, O., & Gendreau, M. (2005). Vehicle rout- Dror, M. (2003). Modeling vehicle routing with
ing problem with time windows, part I: Route uncertain demand as a stochastic program: Prop-
construction and local search algorithms. Trans- erties of the corresponding solution. European
portation Science, 39(1), 104–118. doi:10.1287/ Journal of Operational Research, 64, 432–441.
trsc.1030.0056 doi:10.1016/0377-2217(93)90132-7
Braysy, O., & Gendreau, M. (2005). Vehicle Dror, M., Laporte, G., & Trudeau, P. (1989). Ve-
routing problem with time windows, part II: hicle routing with stochastic demands: Properties
Metaheuristics. Transportation Science, 39(1), and solution framework. Transportation Science,
119–139. doi:10.1287/trsc.1030.0057 23(3), 166–175. doi:10.1287/trsc.23.3.166
Chakroborty, P., & Mandal, A. (2005). An asexual Fisher, M. L., Jornsten, K. O., & Madsen, O. B. G.
genetic algorithm for the general single vehicle (1997). Vehicle routing with time windows: Two
routing problem. Engineering Optimization, optimization algorithm. Operations Research,
37(1), 1–27. doi:10.1080/03052150410001721 45(3), 488–492. doi:10.1287/opre.45.3.488
468
Gendreau, M., Laporte, G., & Seguin, R. (1994).
Chan, Y., Carter, W. B., & Burnes, M. D. (2001). A A tabu search heuristic for the vehicle routing
multiple-depot, multiple vehicle, location routing problem with stochastic demands and custom-
problem with stochastically processed demands. ers. Operations Research, 44(3), 469–477.
Computers & Operations Research, 28, 803–826. doi:10.1287/opre.44.3.469
doi:10.1016/S0305-0548(00)00009-5
Gendreau, M., Laporte, G., & Seguin, R. (1996).
Chen, H., Hsueh, C., & Chang, M. (2006). The An exact algorithm for the vehicle routing problem
real-time time dependent vehicle routing prob- with stochastic demands and customers. Trans-
lem. Transportation Research Part E, Logistics portation Science, 29(2), 143–155. doi:10.1287/
and Transportation Review, 42(5), 222–237. trsc.29.2.143
doi:10.1016/j.tre.2005.01.003
Gendreau, M., Laporte, G., & Seguin, R.
Cordeau, J.-F., Laporte, G., & Mercier, A. (2001). (1996). Stochastic vehicle routing. European
A unified tabu search heuristic for vehicle routing Journal of Operational Research, 88, 3–12.
problems with time windows. The Journal of doi:10.1016/0377-2217(95)00050-X
the Operational Research Society, 52, 928–936.
Hwang, H. (2002). An improved model for ve-
doi:10.1057/palgrave.jors.2601163
hicle routing problem with time constraint based
Crevier, B., Cordeau, J.-F., & Laporte, G. (2007). on Genetic Algorithm. Computers & Industrial
The multi depot vehicle routing problem with Engineering, 42, 361–369. doi:10.1016/S0360-
interdepot routes. European Journal of Op- 8352(02)00033-5
erational Research, 176, 756–773. doi:10.1016/j.
ejor.2005.08.015

877
Multi Depot Probabilistic Vehicle Routing Problems with a Time Window

Jaillet, P., & Odoni, A. R. (1998). The probabi- Liu, S., & Lai. (2009). The vehicle routing problem
listic vehicle routing problem. Vehicle Routing: with uncertain demand at nodes. Transporta-
Methods and Studies, 2, 125–247. tion Research Part E, Logistics and Transpor-
tation Review, 45(4), 517–524. doi:10.1016/j.
Jong, J.-C., & Schonfeld, P. (2003). An evolution-
tre.2008.11.002
ary model for simultaneously optimizing 3-di-
mensional highway alignments. Transportation Nagy, G., & Salhi, S. (2005). Heuristic algorithms
Research Part B: Methodological, 37(2), 107–128. for single and multiple depot vehicle routing
doi:10.1016/S0191-2615(01)00047-9 problems with pickups and deliveries. European
Journal of Operational Research, 162, 126–141.
Jung, S., & Haghani, A. (1999). A genetic algo-
doi:10.1016/j.ejor.2002.11.003
rithm for pick-up and delivery problem with time
windows. Transportation Research Record, 35, Ombuki, B., Ross, B. J., & Hanshar, F. (2006).
123–126. Multi-objective genetic algorithm for vehicle
routing problem with time windows. Applied
Jung, S., & Haghani, A. (2001). A genetic al-
Intelligence, 24, 17–30. doi:10.1007/s10489-
gorithm for the time-dependent vehicle routing
006-6926-z
problem. Transportation Research Record, 1771,
164–171. doi:10.3141/1771-21 Renaud, J., Laporte, G., & Boctor, F. F. (1996). A
tabu search heuristic for the multi-depot vehicle
Kim, E., Jha, M. K., & Son, B. (2005). Improving
routing problem. Computers & Operations
the computational efficiency of highway alignment
Research, 23(3), 229–235. doi:10.1016/0305-
optimization models through a stepwise genetic
0548(95)O0026-P
algorithms approach. Transportation Research
Part B: Methodological, 39(4), 339–360. Ropke, S., & Cordeau, J. (2009). Branch and Cut
and Price for the pick up and delivery problem with
Kohl, N., & Madsen, O. B. G. (1997). An optimi-
time windows. Transportation Science, 43(3),
zation algorithm for the vehicle routing problem
267–286. doi:10.1287/trsc.1090.0272
with time windows based on Lagrangian relax-
ation. Operations Research, 45(3), 395–405. Salhi, S., & Sari, M. (1997). A multi-level com-
doi:10.1287/opre.45.3.395 posite heuristic for the multi-depot vehicle fleet
mix problem. Operations Research, 103, 95–112.
Laporte, G., Gendreau, M., Potvin, J. Y., & Semet,
F. (2000). Classical and modern heuristics for the Secomandi, N. (2001). Comparing neuro-dynamic
vehicle routing problem. International Transac- programming algorithms for the vehicle routing
tions in Operational Research, 7(4-5), 285–300. problem with stochastic demands. Comput-
doi:10.1111/j.1475-3995.2000.tb00200.x ers & Operations Research, 27, 1201–1225.
doi:10.1016/S0305-0548(99)00146-X
Laporte, G., Nobert, Y., & Taillefer, S. (1988).
Solving a family of multi depot vehicle routing Thangiah, S. R., & Salhi, S. (2001). Ge-
and location routing problems. Transportation Sci- netic clustering: An adaptive heuristic for the
ence, 22(3), 161–172. doi:10.1287/trsc.22.3.161 multi depot vehicle routing problem. Ap-
plied Artificial Intelligence, 15, 361–383.
doi:10.1080/08839510151087293

878
Multi Depot Probabilistic Vehicle Routing Problems with a Time Window

Tong, Z., Ning, L., & Debao, S. (2004). Genetic Wu, T., Low, C., & Bai, J. (2002). Heuristic solu-
algorithm for vehicle routing problem with time tions to multi-depot location-routing problems.
window and uncertain vehicle number. In Pro- Computers & Operations Research, 29(10), 1393–
ceedings of the 5th World Congress on Intelligent 1415. doi:10.1016/S0305-0548(01)00038-7
Control and Automation, Hongzhou, China.

This work was previously published in the International Journal of Operations Research and Information Systems (IJORIS),
Volume 2, Issue 2, edited by John Wang, pp. 40-64, copyright 2011 by IGI Publishing (an imprint of IGI Global).

879
880

Chapter 54
Estimating Potential Woody
Biomass in Communal Savanna
Woodlands from Synthetic
Aperture Radar (SAR)
Charles Paradzayi
University of Johannesburg, South Africa & Midlands State University, Zimbabwe

Harold J. Annegarn
University of Johannesburg, South Africa

ABSTRACT
Recent developments in Synthetic Aperture Radar (SAR) technologies have shown their potential for as-
sessing and quantifying above-ground biomass (AGB) at landscape levels in different biomes. This paper
examines the application of full polarimetric data to retrieve information related to potential woody bio-
mass in sparse communal savanna woodlands in southern Africa using the Advanced Land Observation
Satellite’s Phased Array L-band Synthetic Aperture Radar (ALOS PALSAR). Woody vegetation classes
were obtained from the unsupervised entropy/alpha Wishart classification of the full polarimetric ALOS/
PALSAR data. A combination of Differential GPS and conventional surveying techniques was used for a
field inventory survey to estimate plot-level biomass densities in Welverdiend communal woodlands of
South Africa. Regression analysis was used to derive the logarithmic relationship between the sampled
plot AGB densities and the mean backscatter intensity of the microwave signal, which is transmitted in
the horizontal plane and received in the vertical plane (HV). The AGB density for each woody vegeta-
tion class is estimated by solving the logarithmic equation after extracting the mean HV backscatter
intensity for the particular vegetation class. The potential woody biomass is estimated from the derived
AGB densities and the areal extent of the respective woody vegetation classes.

DOI: 10.4018/978-1-4666-2038-4.ch054

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Estimating Potential Woody Biomass in Communal Savanna Woodlands from Synthetic Aperture Radar (SAR)

INTRODUCTION southern Africa (Scholes & Archer, 1997; Twine,


Moshe, Netshiluvhi, & Siphugu, 2003; Whitlow,
Population growth and persistent poverty are 1979). The predominant source of bioenergy in
increasing pressure on forest and woodland most low-income rural and urban households is
resources, thereby threatening their sustainable woody biomass in the form of charcoal and fu-
utilisation. In some electrified communities, the elwood (Banks, Griffin, Shackleton, Shackleton,
high cost of electricity has generally prevented & Mavrandonis, 1996; Wamukonya & Jerkins,
a move away from dependency on bio-energy 1995). In Mozambique, biomass (charcoal and
(Madubansi & Shackleton, 2007). At the same firewood) provides up to 80% of energy consump-
time, intensive harvesting practices and demands tion while 87% of rural households in Zambia use
for arable farming land are contributing to the con- fuelwood for cooking (Brouwer & Falcão, 2004;
tinued decrease in the availability of tree species Chidumayo, Masialeti, Ntalasha, & Kalumiana,
preferred for fuelwood and/or charcoal. Indicators 2002). Quantitative information about available
of energy poverty such as increased distance of biomass resources required to meet domestic
preferred species from households are beginning energy demands is lacking in most developing
to emerge in certain communal woodlands that economies. In addition, quantitative information
are not subject to rigorous management regimes on woody biomass resources is becoming critical
(Howells et al., 2003; Madubansi & Shackleton, for national inventories of woody biomass stocks
2007). This is resulting in the emergence of both for carbon sequestration and accounting purposes
micro- and macro-level woody biomass trading as well as for meeting the reporting requirements
within and among villages, and from rural to urban for international conventions and protocols such
areas respectively. In Mozambique, Tanzania and as the Kyoto Protocol on Climate Change and the
Zambia, charcoal from communal woodlands is Reduction of Emissions from Deforestation and
exported to urban centres for sale to low-income forest Degradation (REDD) initiatives (Gibbs,
urban dwellers. However, there is a general paucity Brown, Niles, & Foley, 2007; Woodhouse, Cas-
of quantitative data on available woody biomass sells, Mitchard, & Tembo, 2009).
resources to provide critical input into energy
planning models at the landscape level in most Biomass Estimation in
rural economies (Mabowe, 2006). The objective Savanna Woodlands
of this paper is to investigate the potential of using
the intensity of the backscatter signal of polari- Conventional field inventory and optical remote
metric synthetic aperture radar to quantify above sensing approaches have been used in the past to
ground biomass density and then map the spatial estimate available above ground biomass (Chidu-
distribution of available above ground biomass mayo, 1997; Ellegård, Chidumayo, Malimbwi,
resources in savanna woodlands. Pereira, & Voss, 2002; Grundy, 1995). Field
inventory measurements provide essential data
Utilisation of Woody for validating remotely sensed biomass estimates.
Biomass Resources in Low- However, such approaches are tedious and time-
Income Communities consuming. They are also constrained by ac-
cessibility problems, thus limiting the sampling
Savanna woodlands provide a traditional source approaches to areas where measuring equipment
of woody biomass and other non-timber forestry can be delivered (JICA, 2005; Mabowe, 2006;
products for a large percentage of households in Netshiluvhi & Scholes, 2001). In savanna wood-
both rural and urban low-income communities in lands, allometric equations relating measurable

881
Estimating Potential Woody Biomass in Communal Savanna Woodlands from Synthetic Aperture Radar (SAR)

tree parameters to above ground biomass quantities training plots were characterised by heterogeneous
were established in previous studies (Netshiluvhi distribution of tree species and only trees with trunk
& Scholes, 2001). The total tree biomass (TB) is circumferences greater than eight centimetres were
estimated using the equation: sampled. A Total Station (surveying instrument
with integrated direction and distance measuring
TB = 0.035 * DAH2.5 (1) devices) was used to measure the geographic
location and heights of sampled trees from the
where DAH is the diameter at ankle height. established control points because GPS degrades
Recent developments in the airborne and when operated under tree canopies (Paradzayi,
spaceborne synthetic aperture radar technologies Annegarn, Matsika, & Erasmus, 2008). Tree basal
have shown great promise in obtaining standing diameter at ankle height (DAH) and the tree height
woody biomass estimates in boreal, temperate parameters were also recorded during the field
and tropical forests (Mette, 2007; Papathanas- inventory surveys. The relatively short vegetation
siou, Lee, Kugler, & Hajnsek, 2008; Prakoso, structure in the savanna woodlands of this region
2006). Studies in these biomes have shown that makes it impractical to measure the conventional
the HV polarization of the L-band is sensitive diameter at breast height to use with allometric
to biomass densities up to 200 t ha–1 (Imhoff, equations (Mabowe, 2006).
1995; Viergever, Woodhouse, & Stuart, 2007;
Wallington, Viergever, Stuart, Moss, & Wood- ALOS/PALSAR Quadpol Dataset
house, 2006; Watanabe et al., 2006). In sparse
and heterogeneous woodlands, the backscatter This study analysed the relationship between the
intensity from the HH and VV polarisation has normalised HV backscatter intensity and above-
shown positive correlation with increasing woody ground biomass density using full polarimetric
biomass densities (Lucas et al., 2006; Saatchi, ALOS/PALSAR [ALPSRP148026690] dataset.
Halligan, Despain, & Crabtree, 2007; Santoro, The data were acquired over the communal sa-
2003; Santos et al., 2003). Integrated approaches vanna woodlands around Welverdiend Village
that combine terrestrial and spaceborne techniques (24° 35’S, 31° 20’E) in South Africa under wet
have been used for assessing biomass resources at conditions on 3 November 2008. The data were
the landscape level (Avitabile et al., 2008; Moss, obtained at an incidence angle of 23.1°. The
Stuart, Viergever, & Wallington, 2006; Narvaes, PALSAR data have a pixel spacing of 9.4 m in
Silva, & Santos, 2007). azimuth, resulting in a ground range resolution of
23.88 m. The pixel spacing in the range direction
is 3.5 m. The ground range resolution is used to
METHODS determine the ratio between the azimuth and range
pixels to produce square pixels in the SAR prod-
Field Inventory ucts. This process is referred to as multi-looking.
A multi-look ratio of 7:1 was adopted for this
The field inventory survey involved GPS mea- work to give square pixels of 25 m x 25 m. The
surements using the OmniLOGGER Differential quadpol data were processed using a combina-
GPS system (from OmniSTAR, South Africa) to tion of open source Polsarpro® and commercial
establish control points with known coordinates GAMMA® SAR software. Polsarpro was used to
for the subsequent detailed topographic surveying decompose the ALOS/PALSAR data into entropy,
of trees in sampled plots. A total of 24 plots were anisotropy and alpha angle elements. The terrain
sampled, ranging between 0.05 – 0.32 ha. The classes were generated by running the unsuper-

882
Estimating Potential Woody Biomass in Communal Savanna Woodlands from Synthetic Aperture Radar (SAR)

vised entropy/alpha (H/α) Wishart classification boundaries in an adaptive way to better fit the
algorithm. The commercial SAR software was natural distribution of the scattering mechanisms
used for normalising the backscatter intensity and also takes into account information related
and also to geocode the products to the Universal to the backscattered power. In Figure 1, the en-
Transverse Mercator projection. tropy/alpha angle plane is segmented into eight
zones representing the scattering mechanism of
H/A/alpha Decomposition each pixel of the PALSAR image after the H/α
decomposition. The boundaries of the segmented
The polarimetric data were decomposed into H/α plane are used to discriminate the surface,
entropy (H), anisotropy (A) and alpha angle (α) volume and double bounce scattering along the
elements using the Cloude and Pottier (eigenvec- alpha axis and low, medium and high degree of
tor-eigenvalue) decomposition approach (Cloude randomness along the entropy axis. The volume
& Pottier, 1997). The entropy describes the degree and dihedral scattering mechanisms are associated
of statistical disorder of the scattering phenom- with the presence of woody vegetation.
enon from the logarithmic sum of the eigenvalues
of the coherency matrix. High entropy values
indicate the presence of more than one dominant RESULTS
scattering mechanism. The anisotropy measures
the relative importance of the second and third Sampled Plot Biomass Density
eigenvalues of the H/A/alpha decomposition
(Harter, Engdahl, & Desnos, 2007). Anisotropy Individual tree biomass was obtained from exist-
expresses the relation between the secondary ing allometric equations (Equation 1) from the
scattering processes such as between surface and ground-truthing survey of test plots within the
double bounce if volume is dominant. The alpha savanna woodlands. The plot biomass density was
angle is continuous within a range of 0o ≤ α ≤ 90o calculated by dividing the total plot biomass by
and is used together with the entropy to differen- the respective areal extent of the sampled plot. A
tiate the scattering mechanisms present in a SAR summary of the field inventory results in selected
decomposition. α = 0o represents isotropic sur- plots around Welverdiend village are shown in
face scattering produced by a rough surface; (Table 1). A GIS layer (shapefile) of the sampled
α = 90o represents dihedral (double-bounce) plots were generated and used to extract the mean
scattering; α = 45o represents oriented dipole of the normalised HV backscatter intensity over
(volume) scattering typical of canopy scattering the sampled plots.
(Hajnsek, 2001; Lee et al., 1999, 2004).
Unsupervised Classification Results
Unsupervised H/α Wishart
Classification Scattering from buildings is associated with
the complex structures class. Strong reflections
The classification procedure combines the unsu- from residential buildings and tree colonies in
pervised classification approach based on polari- open areas result in the strong double bounce
metric target decompositions and the maximum scattering. Surface reflections are classified as
likelihood classifier based on the complex Wishart either Bragg or rough surfaces. The scattering
distribution (Harter et al., 2007; Lee et al., 1999). from natural vegetation and some built up areas
The classification segmentation modifies the is associated with volume and dihedral reflections
corresponding to the dense, mixed and sparse

883
Estimating Potential Woody Biomass in Communal Savanna Woodlands from Synthetic Aperture Radar (SAR)

Figure 1. H/α segmented plane

woody vegetation classes. Areas such as roads and double bounce, anisotropic, dihedral and dipole
some built-up areas that are dominated by both scattering mechanisms are closely related to the
surface and double bounce scattering are shown interaction between the microwave signal and tree
in shades of pink. The results of the unsupervised components. Hence, these classes can be used to
H/α Wishart classification is made up of eight infer the amount of above ground woody biomass.
classes representing complex structures, random Aerial photographs were used to validate the
anisotropic scatterers, Bragg surfaces, forestry polarimetric classification results. In Figure 2, the
double bounce scattering, anisotropic (vegetation) classified map was validated by overlying it on
scattering, rough surfaces, dihedral scatterers aerial photographs from 2007 obtained from the
and dipole scattering mechanisms. The forestry Bushbuckridge Municipality (Vincent, personal
communication, June 2009). A visual analysis
of the classified product shows a high level of
Table 1. Summary of training plot biomass den-
correlation between the classes and the expected
sities
scattering mechanisms depicted on the aerial
Training AGB Density Training AGB Density photographs. Relatively new built-up areas show
Plot (tons ha-1) Plot (tons ha-1) dominant surface and double bounce scattering as
N1 1.7 SE2 2.8 indicated by the pinkish colour. Established resi-
N2 1.5 SE3 7.0 dential areas show increased presence of dipoles,
N3 0.8 W1 24.9 dihedrals and vegetation scattering mechanisms
NE1 5.5 W2 5.8 from citrus trees and also some preserved natural
NE2 4.1 W3 14.5 trees and hedges used for providing shade. These
NE3 0.7 W4 7.6 are shown in shades of yellow and green. Bragg
NW1 6.0 W5 80.4 and rough surfaces are also notable in sky and
NW2 3.9 North 2 0.8 navy blue colours respectively.
S1 1.5 North 3 2.8 The south-western part has a good woody
S2 45.3 West 6 9.9
vegetation cover as evidenced by the dominant
S3 4.7 North 1 1.8
anisotropic (vegetation), dipoles and forestry
SE1 11.1 WLFC 17.0

884
Estimating Potential Woody Biomass in Communal Savanna Woodlands from Synthetic Aperture Radar (SAR)

Figure 2. Validation of classified map with 2007 aerial photograph

double bounce scattering mechanisms in green, backscatter intensities and the plot biomass densi-
yellow and red colours respectively. ties obtained during the field campaign. Figure 3
shows the results of the regression analysis using
Relating HV Backscatter Intensities logarithmic fit between the plot biomass densities
and Plot Biomass Densities and the respective plot HV backscatter intensities.
The mean backscatter intensities showed strong
Analysis of the relationship between AGB density correlation with increased woody vegetation
and SAR polarisation backscatter intensity showed densities for the sampled woodlands around
that HV and VV have better regression coefficients Welverdiend village, with a correlation coefficient
for biomass assessment in savanna woodlands of 0.84. The regression equation derived from the
compared to the backscatter intensity from the logarithmic fit between the field inventory above
HH polarisation (Paradzayi, 2010). A 25 m buffer ground biomass estimates and the mean HV
was introduced around each of the training plots backscatter intensities (σHV
o
) is given in Equation
to account for possible geolocation errors when 2.
extracting the mean backscatter intensity for the
respective training plot. Regression analysis was o
σHV = 19.084 + 1.019 * ln(AGB ) (2)
performed to test the correlation between HV

Figure 3. HV backscatter intensity and plot AGB density regression analysis

885
Estimating Potential Woody Biomass in Communal Savanna Woodlands from Synthetic Aperture Radar (SAR)

Table 2. Above ground estimates in Welverdiend communal woodlands

AGB Density (tons


Vegetation Class Mean HV / ha) Area (ha) AGB (tons)
4. Forest double bounces -14.68 75 126 9472
5. Vegetation -16.65 11 732 8013
7. Dihedral scatterers -12.44 678 21 14375
8. Dipoles -14.30 109 192 20924
1071 52783

The forest double bounce, anisotropic scatter- The results of the regression analysis between
ing, dihedral scattering and dipoles classes were the HV backscatter and above ground biomass
used to calculate the potential woody biomass densities enable the estimation of AGB densities
within the limits of Welverdiend communal wood- using the mean HV backscatter intensity values
lands because they are related to components of for the woody vegetation related polarimetric
above-ground biomass. The total above-ground classes. The logarithmic equation applies only
biomass within Welverdiend communal areas to acquisitions under wet conditions since the
was then calculated as the sum of the product of soil moisture has a significant influence on the
the areal extent of the vegetation classes and the interaction of the backscatter intensity with woody
mean AGB densities (Table 2). vegetation. The 24 training plots were confined to
The areal extent of each vegetation class was woodlands around Welverdiend due to logistical
derived from GIS analysis. The potential above and time constraints. The methods used for the
ground biomass is estimated to be over 52 000 field inventory surveys proved to be tedious and
tons for the communal woodlands within the time-consuming but allowed the detailed col-
boundaries of Welverdiend village. lection of tree parameters required to determine
the plot-level biomass densities. However, the
regression analysis revealed the need for more
DISCUSSION training plots with larger geographic coverage.
Such training plots must transcend the full spec-
The results presented in this paper have dem- trum of savanna vegetation (from sparse to dense
onstrated the potential of using full polarimet- vegetation). Larger training plots will account for
ric ALOS PALSAR data to retrieve potential the heterogeneous nature of savanna woodlands
woody biomass quantities in communal savanna and minimise the effect of residual geolocation
woodlands. The microwave imagery was decom- errors, leading to more rigorous logarithmic fitting
posed and classified using both open source and and robust regression equations.
commercial SAR applications. The commercial
software was used to project the POLSAR data
onto conventional coordinate systems such as the CONCLUSION
Universal Transverse Mercator. The results of the
unsupervised H/α Wishart classification showed The logarithmic equation can be used to estimate
relatively high qualitative correlation between the amount of available above ground woody
the derived vegetation classes and the aerial pho- biomass resources at the landscape scale after
tographs and other existing geospatial datasets. classifying woody vegetation classes from full

886
Estimating Potential Woody Biomass in Communal Savanna Woodlands from Synthetic Aperture Radar (SAR)

polarimetric SAR data. Improved estimates can Banks, D. I., Griffin, N. J., Shackleton, C.
be obtained by accounting for restrictions imposed M., Shackleton, S. E., & Mavrandonis, J. M.
by land tenure and nature conservation policies (1996). Wood supply and demand around two
governing the Welverdiend community. Biomass rural settlements in semi-arid savanna, South
resources in built-up areas, along river banks Africa. Biomass and Bioenergy, 11(4), 319–331.
and other restricted areas within the limits of the doi:10.1016/0961-9534(96)00031-1
communal area jurisdiction must be excluded
Brouwer, R., & Falcão, M. P. (2004). Wood
from the calculation of the available resources for
consumption in Maputo, Mozambique. Biomass
firewood and charcoal production. Results from
and Bioenergy, 27, 233–245. doi:10.1016/j.biom-
such analysis will provide decision makers with
bioe.2004.01.005
more accurate information on available woody
biomass resources for modelling the sustainable Chidumayo, E. N. (1997). Miombo ecology and
utilisation of such resources in rural communities, management: An introduction. Stockholm, Swe-
providing a quantitative tool for monitoring and den: Intermediate Technology Publications.
assessing savanna woodlands.
Chidumayo, E. N., Masialeti, I., Ntalasha, H., &
Kalumiana, O. S. (2002). Charcoal potential in
Southern Africa final report for Zambia. Stock-
ACKNOWLEDGMENT
holm, Sweden: Stockholm Environment Institute.
This paper was supported by VW Foundation Cloude, S. R., & Pottier, E. (1997). An entropy
Southern Africa Biomodelling Project and a Na- based classification scheme for land applica-
tional Research Foundation Focus Area Grant: tions of polarimetric SAR. IEEE Transactions
FA2005040600018 “Sustainability Studies Using on Geoscience and Remote Sensing, 35, 68–78.
GIS and Remote Sensing” to Harold J Annegarn. doi:10.1109/36.551935
The authors acknowledge European Space Agency
Ellegård, A., Chidumayo, E., Malimbwi, R.,
(ESA) and the Japan Aerospace Exploration
Pereira, C., & Voss, A. (2002). Charcoal poten-
Agency (JAXA) for providing ALOS PALSAR
tial in Southern Africa (CHAPOSA) final report.
full polarimetric data for research purposes under
Stockholm, Sweden: Stockholm Environment
project C1P. 6143. The authors also acknowledge
Institute.
the technical assistance from Andreas Vollrath and
Ralf Knuth (University of Jena, Germany) in pro- Gibbs, H. K., Brown, S., Niles, J. O., & Foley,
cessing and geocoding the ALOS PALSAR data. J. A. (2007). Monitoring and estimating tropical
forest carbon stocks: Making REDD a real-
ity. Environmental Research Letters, 2, 1–13.
REFERENCES doi:10.1088/1748-9326/2/4/045023

Avitabile, V., Marchesini, L. B., Balzter, H., Grundy, I. M. (1995). Wood biomass estimation
Bernoux, M., Bombelli, A., & Hall, R. (2008). in dry miombo woodland in Zimbabwe. For-
Biomass: Assessment of the status of the devel- est Ecology and Management, 72, 109–117.
opment of standards for the terrestrial essential doi:10.1016/0378-1127(94)03467-B
climate variables. Rome, Italy: Global Terrestrial Hajnsek, I. (2001). Inversion of surface parameters
Observing System. using polarimetric SAR. Unpublished doctoral
dissertation, University of Jena, Jena, Germany.

887
Estimating Potential Woody Biomass in Communal Savanna Woodlands from Synthetic Aperture Radar (SAR)

Harter, M., Engdahl, M., & Desnos, Y. L. (2007, Mabowe, B. R. (2006). Aboveground biomass
January 22-26). First assessment of polarimetric assessment in Serowe woodlands, Botswana.
images from ALOS - PALSAR: Check of polari- Unpublished doctoral dissertation, ITC, Enschede,
metric calibration and assessment of land-cover The Netherlands.
classification potential. Paper presented at the
Madubansi, M., & Shackleton, C. M. (2007).
3rd International Workshop on Science and Ap-
Changes in fuelwood use and selection fol-
plications of SAR Polarimetry and Polarimetric
lowing electrification in the Bushbuckridge
Interferometry, Frascati, Italy.
lowveld, South Africa. Journal of Environmental
Howells, M. I., Alfstad, T., David, G., Victor, D. Management, 83, 416–426. doi:10.1016/j.jenv-
G., Goldstein, G., & Remme, U. (2003). An energy man.2006.03.014
model for a low income rural African village.
Mette, T. (2007). Forest biomass estimation from
Stanford, CA: Stanford University.
polarimetric SAR interferometry. Unpublished
Imhoff, M. L. (1995). Radar backscatter and doctoral dissertation, Technischen Universität
biomass saturation: Ramifications for global München, Munich, Germany.
biomass inventory. IEEE Transactions on
Moss, D., Stuart, N., Viergever, K., & Wallington,
Geoscience and Remote Sensing, 33, 511–518.
E. (2006). SAR remote sensing for natural resource
doi:10.1109/36.377953
management - is there a role for ground survey?
JICA. (2005). Manual of biomass survey and Geomatics World, 24-28.
analysis. Tokyo, Japan: Japanese International
Narvaes, I. S., Silva, A. Q., & Santos, J. R. (2007,
Cooperation Agency (JICA).
July 23-27). Evaluation of the interaction between
Lee, J. S., Boerner, W. M., Schuler, D. L., Ain- SAR L-band signal and structural parameters of
sworth, T. L., Hajnsek, I., & Papathanassiou, K. P. forest cover. In Proceedings of the IEEE Inter-
(2004). A review of polarimetric SAR algorithms national Symposium on Geoscience and Remote
and their applications. Photogrammetry and Sensing, Barcelona, Spain (pp. 1607-1610).
Remote Sensing, 9(3), 31–80.
Netshiluvhi, T. R., & Scholes, R. J. (2001). Al-
Lee, J. S., Grunes, M. R., Ainsworth, T. L., Du, L. lometry of South Africa woodland trees. Pretoria,
J., Schuler, D. L., & Cloude, S. R. (1999). Unsu- South Africa: CSIR.
pervised classification using polarimetric decom-
Papathanassiou, K. P., Lee, S. K., Kugler, F., &
position and the complex Wishart classifier. IEEE
Hajnsek, I. (2008, July 6-11). Polarimetric SAR
Transactions on Geoscience and Remote Sensing,
interferometry for forest structure parameter esti-
37(5), 2249–2258. doi:10.1109/36.789621
mation: Potential and limitations. In Proceedings
Lucas, R. M., Cronin, N., Lee, A., Moghaddam, of the IEEE International Symposium on Geosci-
M., Witte, C., & Tickle, P. (2006). Empirical ence and Remote Sensing, Boston, MA.
relationships between AIRSAR backscatter and
Paradzayi, C. (2010). Polarimetric synthetic aper-
LiDAR-derived forest biomass, Queensland,
ture radar (POLSAR) biomass estimation in com-
Australia. Remote Sensing of Environment, 100,
munal African savanna woodlands. Unpublished
407–425. doi:10.1016/j.rse.2005.10.019
doctoral dissertation, University of Johannesburg,
Johannesburg, South Africa.

888
Estimating Potential Woody Biomass in Communal Savanna Woodlands from Synthetic Aperture Radar (SAR)

Paradzayi, C., Annegarn, H. J., Matsika, R., & Viergever, K. M., Woodhouse, I. H., & Stuart, N.
Erasmus, B. (2008, July 6-11). Field surveys for (2007, July 23-27). Backscatter and interferometry
biomass assessment in African savanna wood- for estimating above-ground biomass in tropical
lands. In Proceedings of the IEEE International savanna woodland. In Proceedings of the IEEE
Symposium on Geoscience and Remote Sensing, International Symposium on Geoscience and Re-
Boston, MA (pp. 632-635). mote Sensing, Barcelona, Spain (pp. 2346-2349).
Prakoso, K. U. (2006). Tropical forest mapping Wallington, E., Viergever, K. M., Stuart, N., Moss,
using polarimetric and interferometric SAR data D., & Woodhouse, I. (2006). SAR Remote sens-
- a case study of Indonesia. Unpublished doctoral ing for natural resources management - retrieving
dissertation, Wageningen Agricultural University, spatial extent and height of savanna vegetation.
Wageningen, The Netherlands. Geomatics World, 31-35.
Saatchi, S., Halligan, K., Despain, D. G., & Wamukonya, L., & Jerkins, B. (1995). Durabil-
Crabtree, R. L. (2007). Estimation of forest fuel ity and relaxation of sawdust and wheat-straw
load from radar remote sensing. IEEE Transac- briquettes as possible fuels for Kenya. Biomass
tions on Geoscience and Remote Sensing, 45(6), and Energy, 8(3), 175–179. doi:10.1016/0961-
1726–1740. doi:10.1109/TGRS.2006.887002 9534(95)00016-Z
Santoro, M. (2003). Estimation of biophysical Watanabe, W., Shimada, M., Rosenqvist, A.,
parameters in boreal forests from ERS and JERS Tadono, T., Matsuoka, M., & Romshoo, S. A.
SAR interferometry. Unpublished doctoral disser- (2006). Forest structure dependency of the relation
tation, University of Jena, Jena, Germany. between L-Band sigma nought and biophysical
parameters. IEEE Transactions on Geoscience and
Santos, J. R., Freitas, C. C., Araujo, L. S., Dutra,
Remote Sensing, 44(11), 3154–3165. doi:10.1109/
L. V., Mura, J. C., & Gama, F. F. (2003). Airborne
TGRS.2006.880632
P-band SAR applied to the aboveground bio-
mass studies in the Brazilian tropical rainforest. Whitlow, J. R. (1979). The household use of wood-
Remote Sensing of Environment, 87, 482–493. land resources in rural areas. Harare, Zimbabwe:
doi:10.1016/j.rse.2002.12.001 National Resources Board.
Scholes, R. J., & Archer, S. R. (1997). Tree-grass Woodhouse, I., Cassells, G., Mitchard, E., &
interaction in savannas. Annual Review of Ecol- Tembo, M. (2009, July 12-17). The use of ALOS
ogy and Systematics, 28, 517–544. doi:10.1146/ PALSAR for supporting sustainable forest use
annurev.ecolsys.28.1.517 in Southern Africa: A case study in Malawi. In
Proceedings of the IEEE International Symposium
Twine, W., Moshe, D., Netshiluvhi, T., & Siphugu,
on Geoscience and Remote Sensing, Cape Town,
V. (2003). Consumption and direct-use values of
South Africa (pp. 206-209).
savanna bio-resources used by rural households in
Mametja, a semi-arid area of Limpopo Province,
South Africa. South African Journal of Science,
99, 467–473.

This work was previously published in the International Journal of Applied Geospatial Research (IJAGR), Volume 3, Issue 1,
edited by Donald Patrick Albert, pp. 53-62, copyright 2012 by IGI Publishing (an imprint of IGI Global).

889
890

Chapter 55
OOSTethys/Oceans IE
Service Registry Based on
Catalog Service for Web
Manil Maskey Ken Keiser
The University of Alabama in Huntsville, USA The University of Alabama in Huntsville, USA

Helen Conover Luis Bermudez


The University of Alabama in Huntsville, USA Southeastern Universities Research Association,
USA
Sara Graves
The University of Alabama in Huntsville, USA

ABSTRACT
Service registries can play a big role in helping developers, collaborators and agencies find deployed
resources without difficulty. A service registry is especially useful if it follows a well-known, predefined
specification that allows for automatic machine interactions and interoperability, such as the Open
Geospatial Consortium (OGC) specification for Catalog Services for the Web (CSW). This chapter
discusses a CSW-compliant registry developed as part of an OGC-sponsored interoperability experi-
ment involving the ocean sciences community. The development approach for selecting, adapting and
enhancing an open source implementation of the CSW is described. Implementation goals for the registry
included support for OGC Sensor Observation Services (SOS) and additional functionality to minimize
requirements on service providers and maximize the robustness of the registry. The registry’s role in the
OGC Ocean Science Interoperability Experiment is also discussed.

INTRODUCTION amounts of data available, scientists must address


issues related to multiple data formats, discovery
Geo-scientists today have access to unprecedented and access methods in use. Solutions across these
volumes of data collected by the many in-situ and issues, such as data format interchange and in-
remote sensing environmental sensors currently teroperable discovery and access services, would
deployed. But in order to make use of the vast allow scientists to focus on science tasks rather
than data technology. Data services supporting
DOI: 10.4018/978-1-4666-2038-4.ch055 common standards specifications, such as com-

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
OOSTethys/Oceans IE Service Registry Based on Catalog Service for Web

munity-accepted XML definitions, can provide which engages the marine science community to
the required information in a form understandable advance interoperability using OGC standards for
by machines, thus promoting data interoperability data exchange. As part of Phase I of this experi-
and inter-use. Web service specifications have ment, the oceans community investigated the use
provided new levels of interoperability, allowing of OGC Web Feature Services (WFS) and Sensor
users and applications to more easily discover and Observation Services (SOS), utilizing the refer-
bind to available services. While interoperability is ence implementations of the services produced by
one of the main purposes behind the standardiza- a related community activity, called OOSTethys.
tion of web services, in practice, it is sometimes OOSTethys is a collaboration among software
difficult to obtain due to loose conformance to, developers and marine scientists who develop
or interpretation of, these specifications in actual open source tools in multiple languages, for use in
implementations. the evolving Integrated Ocean Observing System
Standards development is a slow and difficult (IOOS). Reference implementations and devel-
process that requires collaboration, iteration and oper guides, such as the OOSTethys cookbooks
consensus within the target community. Often the for SOS standards (http://www.oostethys.org/
result can be large, complex specifications that best-practices), can be very helpful by providing
attempt to satisfy everyone. Although the devel- implementation examples, usage tips and commu-
opment of standards and specifications generally nity specification profiles, for use by developers
involves rigorous revision and approval processes, and system integrators – as was demonstrated in
an accurate assessment of a standard’s usefulness the Oceans IE project.
cannot be made until diverse teams of developers As an integral part of the Ocean IE, the authors
implement the standard in real systems. Standards developed a service registry compliant with the
often suffer from ambiguities and scalability Catalog Service for Web (CSW) OGC standard
issues. There are, however, tangible benefits to for registries of data and services. The Oceans
developing and using a standard, that are realized IE registry was created to index the growing
when interchangeable tools are developed that number of data services (both SOS and WFS) that
researchers can use and plug into their workflow became difficult to track within the experiment.
as needed. This registry supports easy registration of SOS
Hence, it is important that experienced de- and WFS, as well as discovery of the services
velopers identify such ambiguities as well as using the CSW specifications. While CSW was a
test the reliability and scalability of new or pro- natural choice for an OGC service registry, it has
posed standards in their areas of expertise. The other advantages as well, described in the CSW
Open Geospatial Consortium (OGC) conducts subsection later in the chapter.
interoperability experiments to address the issue This chapter describes the concepts and capa-
of interoperability for newly approved geospa- bilities of an OGC service registry based on the
tial specifications (Percivall, 2009). OGC is an CSW standard that delivers the value of a service
international organization consisting of various registry in a heterogeneous environment. First, we
institutes and agencies with a common goal of discuss a variety of service metadata specifications
developing standards for geospatial services and and existing service registries. The next section
products. Thus, OGC is an ideal coordinator of describes the OGC CSW in some detail. Then we
the type of interoperability experiments needed. describe Oceans IE and its CSW service registry
One such initiative is the Ocean Science Interop- implementation, followed by a consideration of
erability Experiment (Oceans IE http://www. future research directions.
opengeospatial.org/projects/initiatives/oceansie),

891
OOSTethys/Oceans IE Service Registry Based on Catalog Service for Web

BACKGROUND: SERVICE- supporting tools. OGC advocates use of its suite


ORIENTED ARCHITECTURE of web services along with supporting tools for
AND OGC WEB SERVICES almost all types of geospatial applications.

Service Oriented Architecture (SOA) approaches OGC Sensor Data Access Services
offer increased interoperability, software reuse
and reduced maintenance, as compared to more The Oceans Interoperability Experiment demon-
traditional software architectures. SOAs can strated the SOA approach using OGC data services.
transparently tie together services that provide A primary focus of Oceans IE was the OGC Sen-
different functions, are often distributed, and that sor Observation Service (SOS), a service protocol
communicate via standards. An SOA includes pro- that provides a standard interface for requesting,
ducers that offer specific functionality as a service filtering and retrieving sensor system information
and consumers, who make use of the services in and observations. SOSs allow the user or calling
the SOA framework. These architectures are typi- program to select any number of the observation
cally built around web services, loosely coupled variables available from the data source, and to
and often distributed services that are critical to subset the data by spatial and temporal range,
creating workflows and problem solutions. Use thereby significantly reducing data volume and
of Extensible Markup Language (XML) has been transfer time. Sensor data is converted from
key to attaining this goal. XML with a predefined its native format to the OGC Observations &
schema (specification) has become the common Measurements encoding specification, and may
way of storing and passing service information, be embedded directly into the SOS response or
especially because XML is able to provide precise delivered as either ASCII or binary attachments,
machine-readable information about web services. in order to foster data/application interoperability
Being machine-readable allows better automa- and reuse of multiple sensor data products. The
tion, which eventually helps in development of mandatory operations for SOS are GetCapabili-
supporting tools. ties, GetObservations, and DescribeSensor. Each
Geoscience tools can also take advantage of of these operations is briefly described below.
the SOA approach to implement flexible and eas-
ily adaptable workflows and processes. In such a • GetCapabilities: This operation returns
scenario, consumers may be geospatial visualiza- the metadata about the SOS including de-
tion, environmental data analysis or other tools scriptions of offered observations and links
that interface with the SOA framework using a to description of the sensors being used.
standard protocol (usually HTTP) and messages • GetObservation: This operation re-
that follow a certain specification. Embracing turns observation data from sensors for
SOA for geoscience applications can enrich an the specified temporal and spatial range.
organization’s service offerings and provide value GetObservation uses the Observations &
in overall efficiency, accuracy, and accessibility to Measurements (O&M) schema in its re-
organizational marketability. A geoscience SOA sponse. O&M is an OGC standard that
is supported by the efforts of geospatial profes- provides models and XML specifications
sionals and environmental data scientists who use for encoding sensor observations and mea-
various industry proven software to configure, surements. Using O&M, the SOS can de-
serve, and maintain science data and models. fine the data fields, data types, and struc-
A complete geoscience SOA includes servers tural organization in the data records. This
that provide web-accessible services along with

892
OOSTethys/Oceans IE Service Registry Based on Catalog Service for Web

allows for machine-readable responses for issues and human interactions are almost certainly
virtually any type of observation. needed.
• DescribeSensor: This operation returns A solution is the use of standardized protocols
about a description of the sensor itself. for registering and discovering information about
The response is encoded in Sensor Model the services. The key questions when selecting
Language (SensorML), an OGC standard a service registry specification should be what
that provides models and XML specifica- are the functional requirements and whether the
tions for describing sensors and processes registry is easy to use. By understanding which
associated with measurements. functions are needed to support an SOA approach,
an appropriate registry solution can be selected.
Used in conjunction with other OGC specifi- A simple service registry lifecycle is shown in
cations such as CSW, the SOS provides a broad Figure 1. For a viable registry, all three opera-
range of interoperable capability for discovering, tions – publish, find, and invoke – follow defined
binding to and interrogating individual sensors, specifications. Service providers publish their
sensor platforms, or network of sensors in real- services into the service registry. Service consum-
time, archived or simulated environments. ers find services using search filter mechanisms.
Once the service is located, the service consum-
Survey of Web Service ers can invoke the service at the service provider
Metadata Specifications using the protocols and descriptions defined in
the service registry.
While web services and SOA are usually thought OGC web services serve geospatial content
to be synonymous from a technical perspec- and capabilities. OGC has defined standard pro-
tive, they are not. Web services are only one of tocols for these services, including the required
the important aspects of SOA. Another equally GetCapabilities operation for accessing service
pertinent component of SOA is the service reg- metadata. Hence, these services should be easily
istry. A service registry handles the management discovered and published using standard inter-
of service descriptions and serves as a record faces. Before we discuss CSW, we describe a few
keeper of information on how to interface with alternate approaches that have been used to
the services. Usually the service registry uses ex- tackle the service registry problem. We also dis-
tensive metadata to make services searchable and cuss the GEOSS interoperability registry that has
discoverable. Standards-based service registries the ability to interconnect the services using
expose service metadata for discovery by users various service registries.
or automated tools, fostering interoperability and
promoting service reuse. Additional information Universal Description, Discovery,
about the functional capabilities and operational and Integration (UDDI)
metadata can also be made searchable. Science
data and analysis services have often been regis- Universal Description, Discovery, and Integration
tered in ad hoc registries that are populated and (UDDI) is a specification for online directories
maintained by the organizations that developed to be used by both service providers and service
the services, and often contain only those services consumers (OASIS, 2010). The UDDI specifica-
developed by the organization. Obviously, custom tion complies with the Web Services Interoper-
tools and interfaces would need to be developed ability Organization (WS-I) standards of technical
to communicate with these proprietary registries. interoperability as a communication protocol for
Thus, compatibility and interoperability are big exchanging data. Service providers register their

893
OOSTethys/Oceans IE Service Registry Based on Catalog Service for Web

Figure 1. Simple standard based service registry lifecycle

web services in a UDDI directory and service WSDLs have been critical to design and execu-
consumers search the service descriptions in this tion of scientific workflows.
directory that will lead to the services that they
are looking for. The drawback of UDDI is the use ISO 19119
of high-level descriptions. The descriptions that
are supported are very basic and generally are The International Organization for Standardization
not machine-readable. Very few organizations are (ISO) sponsors geospatial community standards
using the UDDI for service registry as a result. activities under Technical Committee 211 Geo-
Even business registries provided by IBM and graphic Information / Geoinformatics (ISO/TC
Microsoft have discontinued the use of UDDI 211). While much of the work of this committee
(Microsoft, 2006). deals with metadata for geographic data and infor-
mation, ISO 19119 is a service standard developed
Web Service Description with an intent to make geospatial resources avail-
Language (WSDL) able over a distributed environment (Percivall,
2002). A computational aspect of this standard is
Web Service Description Language (WSDL) is an the ability to interlink data and services without
XML specification that describes a web service prior knowledge about the data or service descrip-
interface (W3C, 2001). WSDL is designed as an tions. ISO 19119 offers architectures exemplary
open content model so that it is extensible and of the different kinds of service chaining. Another
flexible to be used with various web service bind- feature is the description of a metadata model for
ings. Hence, the web service’s interaction within service objects based on service instance, metadata
an SOA framework is easily machine-readable. and type. Models are also provided for interoper-
Without such standard description, one must rely ability and service distribution.
on ad hoc documentation to communicate with
the interfaces for SOA components. WSDLs have Service Entry Resource Format (SERF)
become a widely accepted standard since several
industry-accepted WSDL tools have simplified National Aeronautics and Space Administration
the creation and usage of this standard. Lately, (NASA)’s Global Change Master Directory

894
OOSTethys/Oceans IE Service Registry Based on Catalog Service for Web

(GCMD) includes the Service Entry Resource information resource by the science community.
Format (SERF), a specification intended to define The ECHO service registry is primarily intended
information about geospatial data services relevant to provide reference to services that enhance the
to the datasets cataloged in the GCMD. The SERF usability of the Earth science data sets and the
consists of both required and optional metadata integration of these data and services into service
fields and is intended to help users locate software oriented architectures throughout the Earth science
tools and services for use with their data products, community. ECHO’s registry is based on web
and to decide whether a particular service would services standards, as defined by W3C (Burnett,
be useful to them (Olsen and Stevens, 2008). 2007), to include WSDL and UDDI. Registration
of services with ECHO is dependent on multiple
Survey of Web Service registration steps and multiple WSDL descrip-
Registry Implementations tions for each service. The registration process
requires users to be formally registered as not
Several organizations have implemented service only ECHO users, but also as ECHO service
registries using one or more of these protocols. providers. Both steps involve interactions with
Some of these efforts are discussed here. ECHO administrators. The formal ECHO registry
requires this level of accountability for registra-
Global Change Master tion submissions, but can be overly complex and
Directory (GCMD) intimidating for service providers seeking to share
their functionality (Bai, 2007a). Though the bar-
The primary purpose of National Aeronautics and rier to entry is high, ultimately the ECHO support
Space Administration (NASA)’s Global Change for web service standards should be instrumental
Master Directory (GCMD) is to catalog the wealth in enabling service orchestration, the chaining
of environmental datasets available from NASA of services together for programmatic solutions.
and other data sources. In recent years, GCMD
has also begun to catalog services relevant to its Geospatial One Stop (GOS)
metadata holdings, using its SERF service meta-
data specification. Bai et al. (Bai, 2009), describe The federal government mandated the manage-
how service interoperability requirements are not ment, interoperability and public access of all
fully met by well-known service registries like federally funded spatial datasets through a broad
GCMD. GCMD is intended as a human search e-government initiative (OMB Circular A-16).
interface, allowing users to browse descriptions For end-users, GOS provides a human-searchable
of services, often related to specific data products. catalog of metadata providing information about
GCMD allows consumers to search and access to the type and location of datasets. The GOS reg-
the geospatial services using keywords that are istry is comprised of a network of interoperable
too general to be referenced in a service instance catalogs and services that provide an interactive
level, preventing machine level interoperability. search capability across all registered resources.
All spatial data collected by federal agencies (or
Earth Observing System their agents) are expected to be available through
Clearinghouse (ECHO) this registry. GOS uses the National Spatial Data
Infrastructure (NSDI) as the underlying registry
NASA’s Earth Observing System Clearinghouse capability, following the Federal Geographic Data
(ECHO) also supports the registration of Earth Committee (FGDC) metadata standards. Inter-
science data and related services, for use as an nally, GOS maintains a UDDI registry to track

895
OOSTethys/Oceans IE Service Registry Based on Catalog Service for Web

a list of data producers or sources. Metadata is THE OGC CSW STANDARD


subsequently harvested for specific data across the
registered searchable registry services. To register Unlike UDDI and other high-level service regis-
a data source with GOS, the data producer must tries, domain specific catalogs generally contain
provide access to a registry that is compliant with more detailed information. Where these catalogs
GOS supported searches. GOS currently supports are based on standards, this allows for support-
harvesting services for Z39.50, OGC CSW, WAF ing solutions based on automatic discovery of
(web accessible folders), and OAI-PMH (Open the services registered in the catalog. OGC has
Archive Initiative Protocol for Metadata Harvest- developed a standards-based registry specification
ing). The GOS portal provides a map viewer that for data and services, namely Catalog Service for
can subsequently utilize a number of data access Web (CSW). A CSW-compliant catalog allows
services to provide data displays to users, to include stateless web/HTTP access with standard XML
OGC WMS, WFS, and ArcIMS. Data producers data being exchanged. The CSW specification
may register their data sources through web-based allows for all properties in the metadata to be
forms at the GOS website (http://geodata.gov). searchable, making service and data associations
clearly traversable. CSW supports the ability to
GEOSS publish and search collections of descriptive infor-
mation (metadata) about geospatial data, services
Global Earth Observation System of Systems and related resources (OGC, 2009). Providers of
(GEOSS; IEEE GEOSS, 2009) is an international resources use catalogs to register metadata that
infrastructure to connect users, producers and conform to the information model that CSW sup-
integrators of environmental information. One ports. The information model includes detailed
of the GEOSS goals is to make environmental descriptions of data and services that are provided.
information publicly available to a broad set of Client applications can then efficiently search for
users. The core components of GEOSS are the geospatial data and services (OGC, 2009). CSW
“Components Registry” and the “Standards and In- was recommended for implementation specifica-
teroperability Registry”. The Components Regis- tion status in November 2005.
try’s main purpose is to provide a centralized place OGC specifies the interfaces, bindings, and
to register and access GEOSS components (e.g., a framework for defining application profiles
web services, software, models). The Standards required to publish and access a CSW that pro-
and Interoperability Registry’s main purpose is to vides metadata for geospatial data, services,
provide a centralized place to register and access and related resource information. Metadata are
standards, protocols and “special agreements”. A represented as generalized properties that can be
special agreement could be a standard commu- queried and returned through catalog services for
nity profile or a non standard agreement among resource evaluation and in many cases, invoca-
communities. As a “system of systems,” GEOSS tion or retrieval of the referenced resource. The
does not support a single registry protocol; rather CSW standard and relevant application profiles
multiple data and service registries will exist in the are described in more detail in the next section.
GEOSS framework, and will communicate each
other via standardized protocols and interfaces. CSW Information Structure
and Semantics

OGC specifications allow for flexibility in the


model for information exchange. In case of CSW,

896
OOSTethys/Oceans IE Service Registry Based on Catalog Service for Web

application profiles are allowed to be specific about ISO 19119


what information content is to be communicated
through the service. This information follows a The Catalog Application Profile (Senkler, 2004)
particular syntax from the application profile. definition, based on the CSW specification, takes
Thus an application profile should address the advantage of both 19115 and 19119 for geospatial
following items: data and services, respectively. As ISO 19119
includes extensions that allow for the varying
1. Identify information resource types that can data/service binding described above, the pro-
be requested. In the case of a catalog service, file uses the 19115 description with additional
the information resources being described by objects designed to include the necessary exten-
the metadata may include geographic data, sions. For example, 19119 includes a definition
imagery, services, controlled vocabularies, of CSW_CoupledResource, a class that regulates
or schemas among a wide variety of possible the interoperation of services and datasets. The
types. This sub clause allows the community profile excerpt shown in Figure 2 includes the
to specify or generalize the resource types object SV_ServiceIdentification that would suf-
being described in the metadata for their fice for the metadata functionality of 19115 but
scope of application. does not satisfy the service requirements of 19119.
2. Identify a public reference for the informa- The profile described uses an HTTP binding,
tion being returned by the service (e.g., but is also capable of using SOAP with XML
ISO 19115:2003 “Geographic Information binding. ISO 19119:2005 identifies and defines
- Metadata”). Include any semantic resources the architecture patterns for service interfaces
such as data content model, dictionary, used for geographic information, defines its rela-
feature type catalog, code lists, authorities, tionship to the Open Systems Environment
taxonomies, etc. model, and presents a geographic services tax-
3. Identify named groups of properties (element onomy and a list of example geographic services
sets) that may be requested of the service placed in the services taxonomy. It also prescribes
(e.g. “brief,” “summary,” or “full”) and the how to create a platform-neutral service specifica-
valid format (syntax) for each element set. tion, how to derive conformant platform-specific
Identify valid schema(s) with respect to a service specifications, and provides guidelines
given format to assist in the validation of for the selection and specification of geographic
response messages. services from both platform-neutral and platform-
specific perspectives.
A service metadata schema must be defined
for the purpose of interchange of information ebRIM
within an information community. This schema
should provide a common vocabulary that supports The ebXML Registry Information Model (ebRIM)
search, retrieval, display, and association between (Nogueras-Iso, 2005; Chen 2009) has also been
the description and the object being described. used as a profile for OGC CSW. The Model
Currently, the XML exchanges for CSW are based provides a format for ordering both the content
on two draft application profiles: ebRIM Registry and their connections (Martell, 2009) within the
Information Model Profile and the 19115/19119 catalog, as shown below in Figure 3.
Profile developed by ISO. Extension packages can be used as needed to
support application-specific functionality. Also,
specific components, such as the RegistryObject,

897
OOSTethys/Oceans IE Service Registry Based on Catalog Service for Web

Figure 2. ISO 19119 profile excerpt

can be substituted where possible by similar ele- CSW Operations


ments. The slots can also be used if no conflicts
are present with either extension packages or The operations supported by CSW can be divided
higher-level objects. into three classes: service operations which are
George Mason University’s Center for Spatial operations a client may use to question the service
Information Science and Systems (CSISS) (Bai, to determine its capabilities; discovery operations
2007b) uses an ebRIM-style catalog service which a client may use to determine the informa-
designed to provide access to their Landsat im- tion model of the catalog and to query catalog
ages through a combination of geospatial data records; and management operations which are
and geospatial services. The data and services used to create or change records in the catalog.
are associated together, letting users tailor the
data via the service interface. This specification Service Class Operation
is based on an OGC-compliant model, and thus GetCapabilities: Generate Capabilities document
is restricted to the same functions as the CSW, Discovery Class Operations
described below. DescribeRecord: Generate Record Schema
GetDomain: Determine value domain and gener-
ate response document

898
OOSTethys/Oceans IE Service Registry Based on Catalog Service for Web

Figure 3. ebXML registry information model

GetRecordById: Query repository for record with GetCapabilities Operation


specified id and generate response
GetRecords: Execute query and generate re- Like all other OGC services, the CSW specifi-
sponse document containing result records cation mandates the GetCapabilties operation.
Management Class Operations This operation allows clients to retrieve service
Transaction: Execute transaction and generate metadata from a server. CSW servers allow the
response document HTTP request for GetCapabilities to contain
Harvest: Generate response document that ac- parameter and value pairs for Request, Service,
knowledges receipt of request and execute Sections, AcceptVersions and AcceptFormats.
request asynchronously. The value of the Service parameter should be
“CSW”. The response to a GetCapabilities request
In general, metadata consumers will invoke is an XML document containing service metadata
GetCapabilities, DescribeRecords, GetDomain, about the server. The document includes sec-
GetRecords, and GetRecordById operations tions on ServiceIdentification, ServiceProvider,
whereas metadata producers will invoke Transac- OperationsMetadata, and Filter_Capabilities. Of
tion and Harvest operations. these sections, OperationsMetadata is the most
important, as it lists all the operations supported
by the CSW service.

899
OOSTethys/Oceans IE Service Registry Based on Catalog Service for Web

DescribeRecord Operation allows records to be deleted from the catalog in


conjunction with the constraint element.
DescribeRecord is a mandatory operation that
allows a client to discover elements of the infor- Harvest Operation
mation model supported by the catalog service.
This method requires mandatory keyword-value Harvest is an optional operation that references
pairs for parameters Request, Service, and Ver- data to be inserted or updated in the catalog. The
sion. Other supported parameters are Namespace, catalog itself will fetch the data and process into
TypeName, outputFormat, and schemaLanguage. the catalog.
The response for the DescribeRecord request con-
tains SchemaComponent elements that describe CSW Use Scenarios
each requested schema language.
In the context of SOA, Figure 4 illustrates an
GetDomain Operation example sequence of operations performed by an
SOS consumer. In this figure, the SOS consumer
The GetDomain operation is optional. This op- discovers two instances of SOSs using the GetRe-
eration returns a range of values for a metadata cords operation provided by CSW service catalog.
record element or request parameter. Then, the consumer requests SOS capabilities
documents for each service, using URLs from
GetRecords Operation the CSW GetRecords response. The consumer
can then make a choice of service by inspecting
GetRecords is a mandatory operation that is used the observation offerings from both SOSs, and
to search for records in the catalog. It uses the invoke the appropriate GetObservation from the
Query element to filter and identify the record. selected SOS is invoked.
GetRecords also uses the outputSchema parameter Again in the context of SOA, Figure 5 illustrates
to indicate which schema shall be used to generate an example sequence of operations performed by
the response to the GetRecords operation. two SOS providers (producers). The first pro-
ducer discovers the second SOS from the CSW
GetRecordById Operation catalog. Whenever new sensors are discovered,
the producers can register these sensors with their
GetRecordById is a mandatory operation that SOS. They will subsequently publish sensor ob-
retrieves a specified record from the catalog based servations that they are able to provide.
on the provided identifier. This operation is gener-
ally used after executing another operation to get
back the identifier for the record. THE OGC OCEANS
INTEROPERABILITY EXPERIMENT
Transaction Operation
Overview of Oceans IE
Transaction is an optional operation that provides
mechanisms to create, modify, and delete catalog OOSTethys, the loose collaboration of software
records. In particular, the Insert action allows one developers and marine scientists who worked with
or more records to be inserted into the catalog. The OGC to define and implement Oceans IE, was
Update action uses constraint elements to find the itself the product of several earlier collaborations:
records to be updated. Similarly, the Delete action OpenIOOS, the SURA Coastal Ocean Observ-

900
OOSTethys/Oceans IE Service Registry Based on Catalog Service for Web

Figure 4. SOS consumer sequence diagram

Figure 5. SOS sensor data publisher sequence diagram

901
OOSTethys/Oceans IE Service Registry Based on Catalog Service for Web

ing and Prediction (SCOOP) program, and the started the OGC Ocean Science Interoperability
NSF-sponsored Marine Metadata Interoperability Experiment in 2007.
project. OpenIOOS has been demonstrating the Through Oceans IE, a portion of the Ocean
concept that “standards enable innovation” for Observing community has reached consensus on
several years, thanks to data and service contri- its understanding of various OGC specifications,
butions from Ocean Observing Systems (OOSs) solidified demonstrations for Ocean Science appli-
around the country. OpenIOOS.org, which is a cation areas, hardened software implementations,
proof of concept portal interface for the distributed and produced a candidate “OGC Best Practices”
participants, demonstrates a Service-Oriented Ar- document that can be used to inform the broader
chitecture (SOA) with heavy emphasis on Open ocean science community. To achieve these goals,
Geospatial Consortium (OGC) web-service speci- the Oceans IE engaged the OGC membership to
fications. With support from the NOAA Coastal assure that any community requirements com-
Services Center and the Office of Naval Research, ing from the Oceans group properly leveraged
the Southeastern Universities Research Associa- the OGC specifications. Change requests on the
tion (SURA) integrated OpenIOOS.org into the OGC specifications were provided to the OGC
SCOOP program (http://scoop.sura.org). Through Technical Committee to influence the underlying
SCOOP and other programs, SURA has supported specifications.
the vision of enabling a distributed laboratory for The Oceans IE Phase I, which ended in May
research and applications with broad participation 2008, investigated and compared the use of
from research institutions working in partnership OGC Web Feature Services (WFS) and Sensor
with federal agencies and the private sector. The Observation Services (SOS) for representing
Marine Metadata Interoperability project, started and exchanging point data records from fixed
in September 2004, now comprises more than 300 in-situ marine platforms. This phase of Oceans
members, with a web site consisting of thousands IE produced an engineering best practices report
of pages including guides and references about and reference implementations for SOS. Phase I
marine metadata issues (http://marinemetadata. included a registry to record all the SOS that were
org). One of the activities of MMI is an interop- developed. This ad hoc registry was a quick and
erability demonstration using an SOA, common temporary solution to the problem of cataloging
use content standards, and semantic mediation and discovering the SOSs. Tools and interfaces
via ontologies. that relied on the registry for SOS information
In 2005 MMI, OpenIOOS and SCOOP par- had to use custom Representational State Trans-
ticipants started working together on a single, fer (REST) style services to communicate with
combined interoperability test bed activity of a the registry. We encountered a growing need
distributed, service-oriented architecture to share for a standards-based service registry soon after
real time data. The combined team, which adopted completion of this initial service registry, as more
the name OOSTethys, set up an initial set of meta- participants began utilizing the registry from dif-
data requirements such as geo-spatial location and ferent applications. Research on available registry
platform type, and agreed on an interface to serve standards thus began during Phase I.
as a wrapper for each data system. OOSTethys The Oceans IE Phase II ran from March 2009
members decided that working with standards to December 2009. The following topics were
organizations to select, exercise and advance ap- advanced: Automated instrument metadata/soft-
propriate standards was the logical path to move ware installation via the PUCK protocol (MBARI,
forward toward an integrated ocean observing 2009), observation offerings of complex systems
system. With the OGC, OOSTethys members (e.g., observations systems containing other sys-

902
OOSTethys/Oceans IE Service Registry Based on Catalog Service for Web

tems) such as collection of stations, linking data GI-cat


from SOS to out-of-band offerings, Semantic
Registry and Services, CSW Registry, and IEEE- GI-cat is open source software developed by
1451/OGC-SWE harmonization. During Phase Earth and Space Science Informatics – Labora-
II, CSW was investigated in depth along with tory (ESSI-Lab) that provides implementations
other catalog standards. After CSW was selected, of the OGC CSW specification, along with a
various open source reference implementations suite of other OGC standards (Nativi, 2007). GI-
of the specification were also investigated. The cat uses the ISO application profile information
deegree project was chosen as the base CSW models for core operations and exposes standard
implementation, and the authors developed vari- query interfaces implemented by the profiler
ous components to support additional Oceans IE components. The software in developed using the
needs and make the implementation more robust. Java Standard Edition that is capable of running
under Apache Tomcat server. Although GI-cat is a
CSW Reference Implementations practical implementation of CSW, the installation
procedures and other compatibility issues were
Multiple organizations have implemented regis- deemed to be impractical for use in Oceans IE.
tries based on CSW. The Oceans IE community
evaluated three such implementations, described Deegree
below.
The deegree project is a Java implementation of
GeoNetwork an extensive suite of OGC service specifications.
It also encompasses test clients for the suite of
GeoNetwork is an open-source software project services. Deegree software is freely available
developed to provide the geospatial community as open-source software protected by the GNU
a standards-based suite of applications and ser- Lesser General Public License (GNU LGPL) and
vices (GeoNetwork, 2009). Geospatial standards is accessible at http://www.deegree.org/.
developed by OGC and ISO are well supported The deegree CSW implementation is designed
by GeoNetwork. One of the functions offered in such a way that it is able to serve different
is the cataloging of data and services. Although metadata formats in parallel even with a single
powerful in many aspects, the current devel- underlying relational database. This is possible be-
opment version of GeoNetwork at the time of cause the deegree CSW implementation uses Ex-
evaluation did not support the latest version of tensible Stylesheet Language (XSL) (W3Schools,
the CSW specification. A number of ambiguities 2010) processing to transform requests as well as
in the 2.0.1 specification have been clarified in responses into the desired format. The deegree
2.0.2 and should be reflected in the GeoNetwork CSW does not contain a data access module of its
implementation of CSW. In addition, at the time own, rather it uses an OGC Web Feature Service
of our evaluation, the Transaction and Harvest (WFS), also implemented by deegree as a data
operations were not supported by GeoNetwork source. It is possible to adjust the deegree CSW to
implementation of CSW. These operations were almost any existing metadata profile without the
critical to our experiment. need for replication. Because of its flexibility with
respect to configuration and output formats, the
deegree implementation was ideal for our purpose.
It merely required modifying XML configuration
files before installation.

903
OOSTethys/Oceans IE Service Registry Based on Catalog Service for Web

Table 1. Summary of features of various CSW


successes were leveraged by developing an open
implementations
source service registry and catalog based on the
Features Geonetwork GI- deegree OGC’s CSW which harvest metadata from the SOS
cat GetCapabilities and DescribeSensor responses.
Open source X X X A real-time use of the CSW-compliant service
Latest version of CSW X X registry is shown in Figure 6. Various SOSs de-
Supports all CSW X X ployed by the IOOS community can be discovered
operations from the OOSTethys/Oceans IE registry. The
Supports many OGC X observations from the registry are displayed in
and ISO standards
real-time at the website openioos.org. All of the
Can support multiple X
application profiles source code developed for the experiment is avail-
Can serve multiple X able for the community to reuse at http://code.
metadata formats in google.com/p/oostethys/. This architecture allows
parallel
for reference implementations to be adapted and
Easy installation and X
configuration
improved easily while keeping the overall usage
No compatibility issues X
as simple as possible.
The OOSTethys/Oceans IE SOS Service Reg-
istry was based on the deegree implementation
of the CSW specification. This implementation
The pros and cons of each of these CSW
allows communication using the ISO 19119 ap-
implementation are summarized in Table 1.
plication profile. Other supporting components
include the TEAM Engine Validator, a web-based
user interface, a GetCapabilities metadata har-
THE OCEANS IE CSW-COMPLIANT
vester, and SOS metadata to ISO metadata con-
SERVICE REGISTRY
verter. Figure 7 illustrates the architecture of the
entire system. Components of the system are
A primary goal of the Oceans Interoperability
described in the following subsections. This ar-
Experiment and the complementary OOSTethys
chitecture is a specific case of SOA that we dis-
project, is to dramatically reduce the time it takes
cussed earlier in the chapter. Service providers
to install, adopt and update standards-compliant
are responsible for storing and serving data col-
web services, focusing on the OGC Sensor
lected from instruments and resulting derived
Observation Service (SOS). Open source Java,
products. In addition, they make their services
PERL, Python, PHP and ASP toolkits serving
available by simplified registration process ex-
ocean observations from NetCDF-formatted files,
posed by CSW interface. End users and tools map
THREDDS, various relational databases and even
to the service consumers in the SOA. The tools
CSV (comma-separated value) text files have been
programmatically discover the services using the
developed. This matches the diverse data needs
CSW interface. Once the services are discovered,
and software expertise of the Integrated Ocean
the tools will invoke the services using the pro-
Observing System community. In many cases
tocol described in the service metadata. OpenIOOS
data providers with limited software experience
is an example of such tools used in OOSTethys/
are able to install an SOS in mere minutes and
Oceans IE.
configure it by editing a simple configuration file.
This easy deployment has led to broad adoption
of SOS by the marine science community. These

904
OOSTethys/Oceans IE Service Registry Based on Catalog Service for Web

Figure 6. OpenIOOS: real-time sos observations visualization

Figure 7. OOSTethys/Oceans IE CSW registry system for SOS

905
OOSTethys/Oceans IE Service Registry Based on Catalog Service for Web

Deegree CSW Registry Package These components transform incoming catalog


requests into WFS requests and transform WFS
Deegree’s CSW installation package is a web responses into CSW responses.
archive (WAR) file, which can be dropped easily
into a web server. In our case, the Apache Tom- TEAM Engine Validator
cat web server suffices as a container to use for
deployment of the deegree servlet module. It is Compliance & Interoperability Testing & Evalu-
also very convenient to use the bundled structured ation (CITE) is an ongoing OGC initiative that
query language (SQL) scripts to create tables for develops tests for OGC standards, and makes
the catalog. This implementation supports all of those tests available for online access. An integral
the operations defined for CSW and is compliant part of CITE is a test script interpreter, the Test,
with the CSW 2.0.2 specifications. During Oceans Evaluation, And Measurement (TEAM) Engine.
IE, degree provided the solutions to any problems It executes test scripts written in Compliance
encountered via well supported online forums and Test Language (CTL) to verify that a designated
user correspondence. implementation complies with the specification.
OGC and ISO geoinformatics standards Being an OGC member, we were able to obtain
played a vital role in development of deegree. the source code for the TEAM Engine for use
In particular, deegree’s CSW implementation in the OOSTethys/Oceans IE Service Registry
supports the ISO 19119 standard, and is built interface. The TEAM Engine test for SOS vali-
on the deegree implementation of the OGC Web dation was adapted to integrate into the service
Feature Service (WFS) – see Figure 8. The WFS registration process. The incorporation of most
programming interface accepts WFS requests and of the TEAM Engine open source code helped
replies with WFS responses for a particular data to create a complete and robust CSW registry.
source. Therefore, the discovery and manager OOSTethys is able to distribute the TEAM Engine
layers hide the WFS layer from the exposed API.

Figure 8. Dependency between deegree CSW and WFS

906
OOSTethys/Oceans IE Service Registry Based on Catalog Service for Web

as a component of the Service Registry under the URLs for operation requests. The basic
Mozilla 1.1 license. contents and organization of this section is
the same for all OGC web services, but in-
CSW Registry Web Interface dividual services can add elements and/or
change the use of optional elements.
For Oceans IE, the authors developed a web-based
user interface to the CSW registry for both pro- Service providers merely need to provide
ducers and consumers of SOS. This user interface a valid GetCapabilities URL to register their
communicates with the CSW registry by generat- service in the registry. Automatic harvesting of
ing the detailed XML request and response docu- service metadata is achieved by parsing the Get-
ments required by CSW operations, thus shielding Capabilities response from an SOS. The registry
users from these complex XML documents. The first validates the SOS using the TEAM Engine
web interface for the OOSTethys/Oceans IE SOS Validator. The provider is notified if the SOS is not
Service Registry is available at: http://scoredev. in compliance with the OGC SOS specification.
itsc.uah.edu/csw/. Two important operations that Once the SOS passes the TEAM Engine validation,
are offered through the web interface are service the XSL transformation layer translates the SOS
registration and service discovery. GetCapabilities service metadata to the ISO 19139
service metadata schema required by the deegree
Service Registration CSW registry package. For the service providers,
all of these complex activities, such as the con-
All of the web service standards used for Oceans IE struction of valid XML insert request to register
are developed and maintained by OGC. The CSW their services, are completely hidden. Service
service registry leverages the fact that all OGC web providers can enter an SOS GetCapabilities URL
services have commonalities in their design and and press the “Register” button for registration.
interface, in particular the GetCapabilities opera- Any error encountered during registration will be
tion. The response from a GetCapabilities request displayed to the registrant.
is an XML description of the service’s information
content and supported request parameters, and Service Discovery
is therefore both machine- and human-readable.
Since the XML-formatted response conforms to The users of the registry can also bypass the cre-
a predefined schema, clients can easily validate ation of XML for discovery or transaction opera-
the response. GetCapabilities responses contain tions as the easy-to-use web interface builds the
information on following sections: necessary XML from the specification of a few
necessary query components by the user. Pressing
• ServiceIdentification: Metadata about “Get Record By Id” button in the web interface
this specific server. The schema of this sec- returns a CSW response for a GetRecordById
tion is the same for all OGC web services. request based on the provided identifier, currently
• ServiceProvider: Metadata about the or- “Service Title”, and element set name. Pressing the
ganization operating the server. The sche- “Get All Records” button returns ISO records for
ma of this section is also the same for all all the registered services. The underlying CSW
OGC web services. web service can be accessed programmatically
• OperationsMetadata: Metadata about the with a valid CSW request.
operations specified by this service and
implemented by this server, including the

907
OOSTethys/Oceans IE Service Registry Based on Catalog Service for Web

SOS Service Metadata to are available at http://oceanobservatories.org/


ISO 19119 Translator spaces/display/CIDev/Metadata+Registry and
http://oceanobservatories.org/spaces/display/CI-
Since the deegree registry interface accepts CSW Dev/Harvester. The main difficulty with semantic
requests that conform to the ISO 19119 profile, registry is providing filtering by topics of large
the SOS metadata harvested from the GetCa- collections of metadata. If there is a thesaurus
pabilities response needs to be converted to an (terms with broader and narrower relations), it will
appropriate ISO record. The authors developed be easier to navigate and discover the metadata
such a translator using an XSL transformation. records of interest. Furthermore, initial imple-
This translator maps almost all available elements mentations of semantic-based service registries
from the GetCapabilities response schema into have suffered from performance issues. Much
the appropriate ISO 19119 schema. The transla- of the research is now focused on improving the
tor is executed whenever providers register their performance of such registries. Currently, another
service using the CSW interface. Figure 9 shows research focus area is into ways to describe rela-
a portion of the XSL mapping from SOS service tionships between different web services so that
metadata to an ISO 19119 profile. In this subset such semantic relationships can be exploited in
of the XSL mapping, the ObservationOffering’s id service discovery within the service registry.
and description are translated into DataIdentifica-
tion’s title and identifier respectively. The CSW
registry thus allows the services to be searched CONCLUSION
using these criteria.
As part of OGC Oceans Interoperability Experi-
ment (Oceans IE), the authors developed a web
FUTURE RESEARCH DIRECTIONS service registry compliant with the OGC Catalog
Service for Web (CSW) standard. The registry
Since the evolution of web 3.0 technologies, the evolved with the need for a standards-based catalog
future of service registries also seems to moving to support a growing number of Sensor Observa-
towards the semantic-based service registry. Ocean tion Services (SOS). The resulting registry allows
community efforts related to semantic registries easy registration of SOSs via harvesting the service

Figure 9. Portion of XSLT mapping from SOS GetCapabilities to ISO 19119

908
OOSTethys/Oceans IE Service Registry Based on Catalog Service for Web

metadata from GetCapabilities responses, as well Burnett, M., Weinstein, B., & Mitchell, A. (2007).
as easy discovery of these services using the CSW ECHO – enabling interoperability with NASA
specifications. A standards-compliant registry for earth science data and services. Proceedings from
this endeavor was required to eliminate the need for Geoscience and Remote Sensing Symposium and
discovering the services from ad-hoc registries. In IGARSS 2007.
addition, having a standard CSW registry opened
Chen, N., Di, L., Yu, G., & Wei, Y. (2009). Use
up avenues for the development of automated
of ebRIM-based CSW with sensor observation
tools. This open source registry uses Extensible
services for registry and discovery of remote-
Stylesheet Language (XSL) transformation to
sensing observations. Computers & Geosciences,
convert SOS metadata into the ISO metadata
35, 360–372. doi:10.1016/j.cageo.2008.08.003
that is required by the CSW. Two plug-ins, the
TEAM Engine Validator and CSW registry web GeoNetwork. (2009). GeoNetwork OpenSource
interface, were also developed or enhanced and community website. Retrieved from http://
integrated to make the CSW registry more robust geonetwork-opensource.org/
and easy to use. The authors believe that the
IEEE GEOSS. (2009). IEEE Standards Associa-
availability of the OOSTethys/Oceans IE SOS
tion – GEOSS standards registry. Retrieved from
CSW Registry will promote acceptance of SOS
http://seabass.ieee.org/groups/geoss/
by the oceanographic community and facilitate
service discovery and reuse. Furthermore, the Martell, R. (2009). CSW-ebRIM Registry
OOSTethys/Oceans IE Registry is on schedule Service - part 1: ebRIM profile of CSW.
to be registered with GEOSS. This registration Retrieved from http://portal.opengeospa-
will allow creation of community registry that tial.org/modules/admin/license_agreement.
includes services from other organizations that php?suppressHeaders=0&access_license_
are registered with GEOSS, and interconnected id=3&target=http://portal.opengeospatial.org/
with OOSTethys/Oceans IE registry. files/index.php?artifact_id=31137
MBARI. (2009). Marine plug-and-work consor-
tium. Retrieved from http://www.mbari.org/pw/
REFERENCES
puck.htm
Bai, Y., Di, L., Chen, A., & Wei, Y. (2007b). To- Microsoft. (2006). UBR Shutdown FAQ. Retrieved
wards a geospatial catalogue federation service. from http://uddi.microsoft.com/about/FAQshut-
Photogrammetric Engineering and Remote Sens- down.htm
ing, 73, 699–708.
Nativi, S., Bigagli, L., Mazzetti, P., Mattia, U.,
Bai, Y., Di, L., & Keiser, K. (2007a). Review of & Boldrini, E. (2007). Discovery, query and ac-
NASA ECHO (version 8) Web service registration cess services for imagery, gridded and coverage
process, Technical Whitepaper submitted to the data a clearinghouse solution. Proceedings from
Earth Science Data Systems’Technology Infusion Geoscience and Remote Sensing Symposium and
Working Group. IGARSS, 2007, 4021–4024.
Bai, Y., Di, L., & Wei, Y. (2009). A taxonomy of
geospatial services for global service discovery
and interoperability. Computers & Geosciences,
35, 783–790. doi:10.1016/j.cageo.2007.12.018

909
OOSTethys/Oceans IE Service Registry Based on Catalog Service for Web

Nogueras-Iso, J., Zarazaga-Soria, F., Bejar, R., Conover, H., Berthiau, G., Botts, M., Goodman,
Alvarez, P., & Muro-Medrano, P. (2005). OGC M., Li, X., & Lu, Y. (2010). Using Sensor Web
Catalog Services: a key element for the develop- Protocols for Environmental Data Acquisition and
ment of Spatial Data Infrastructures. Comput- Management. Ecological Informatics, 32–41.
ers & Geosciences, 31, 199–209. doi:10.1016/j. doi:10.1016/j.ecoinf.2009.08.009
cageo.2004.05.015
Conover, H., Drewry, M., Graves, S., Keiser, K.,
OASIS. (2010). Online community for the uni- Maskey, M., & Smith, M. (in press). SCOOP Data
versal description, discovery, and integration. Management: A Standards-based Distributed In-
Retrieved from http://uddi.xml.org/. formation System for Coastal Data Management.
In Di, L., & Ramapriyan, H. K. (Eds.), Standards-
OGC. (2009) Catalogue service. Retrieved from
Based Data and Information Systems for Earth
http://www.opengeospatial.org/standards/cat
Observation. Springer-Verlag.
Olsen, L., & Stevens, T. (2008). Service Entry
Graves, S., Ramachandran, R., Movva, S.,
Resource Format (SERF) standard. (ESDS-RFC-
Conover, H., Fox, P., & McGuinness, D. (2008).
013v0.1). Retrieved from http://www.esdswg.
A distributed knowledge extraction framework
org/spg/rfc/esds-rfc-013/ESDS-RFC-013v0.1.pdf
based on semantic Web services. AISR NASA
Percivall, G. (2002). ISO 19119 and OGC Service Investigator Workshop, NASA, College Park,
Architecture. Presented at FIG XXII International MD, 2008.
Congress, Washington, D.C.
ISO 19119. (2008). Geographic information ser-
Percivall, G. (2009). OGC Interoperability Pro- vices. Fuger S., Najmi F., & Stojanovic, N. (2005).
gram. Retrieved from http://www.opengeospatial. ebXML Registry Information Model (Version
org/ogc/programs/ip 3.0). OASIS Standard #OGC regrep-rim-3.0-os,
May 2005.
Senkler, K., Voges, U., & Remke, A. (2004). An
ISO 19115/19119 profile for OGC catalogue ser- Movva, S., Ramachandran, R., Maskey, M., &
vices CSW 2.0. Presented at the 10th EC GI & GIS Li, X. (2009). Building a semantic framework
Workshop, ESDI State of the Art, Warsaw, Poland. for e-science. Paper presented at the American
Geophysical Union Fall Meeting.
W3C. (2001). Web Services Description Lan-
guage (WSDL 1.1). Retrieved from http://www. Shanming, W., & Jianjing, S. (2008). Ontology-
w3schools.com/xsl/ Based Framework for Geospatial Web Services.
Information Science and Engineering., 2, 107–110.
W3Schools. (2010). XSLT Tutorial. Retrieved
from http://www.w3schools.com/xsl/
KEY TERMS AND DEFINITIONS
ADDITIONAL READING Application Profile: A metadata schema con-
sisting of elements from one or more namespaces
Chidambaram, C., Maskey, M., & Ramachandran,
merged and optimized for use in a specific ap-
R. (2009). Service Oriented Architecture for Data
plication area.
Mining at Data Centers and Computing Centers.
Catalog: A centralized location where informa-
Poster presented at the Cloud Computing and Col-
tion regarding the similar resources is stored, and
laborative Technologies in Geosciences.
where they can be searched for easily.

910
OOSTethys/Oceans IE Service Registry Based on Catalog Service for Web

Catalog Service for Web (CSW): Specifica- Reference Implementation: A model or skel-
tion that defines the schemas and interfaces to eton from which other similar implementations
search, discover, register, and request geospatial can be derived and customized easily.
data and services. Service Oriented Architecture (SOA): An
Interoperability: Capability of multiple soft- approach to distributed computing using loosely
ware engineering components to communicate coupled web services where communication and
and exchange information. message exchanges are standards based.
Standard: A formal document that specifies a
set of rules to be followed throughout a process.

This work was previously published in Geospatial Web Services: Advances in Information Interoperability, edited by Peisheng
Zhao and Liping Di, pp. 97-117, copyright 2011 by Information Science Reference (an imprint of IGI Global).

911
912

Chapter 56
A Multi-Criteria GIS Site
Selection for Sustainable Cocoa
Development in West Africa:
A Case Study of Nigeria

Tunrayo Alabi
International Institute of Tropical Agriculture, Nigeria

Kai Sonder
International Maize and Wheat Improvement Center, Mexico

Olusoji Oduwole
Cocoa Research Institute of Nigeria, Nigeria

Christopher Okafor
International Institute of Tropical Agriculture, Nigeria

ABSTRACT
Cocoa occupies 6 million hectares in humid coastal West Africa where 70% of the world supply is grown,
90% of which is produced on 2 million family farms of 2 hectares or less. Here, at least 16 million people
depend on cocoa but earn only $100/person/year from the crop. There is need to optimize the farming
system, minimize the environmental impact of technologies, and improve socio-economic dynamics.
This study identifies areas with potential for intensified cocoa farming and where maximum impact to
household income could be achieved without deforestation. The selection involves defining suitability
criteria, preparing an inventory of available data, determining suitability based on identified criteria,
and combining suitability into hierarchical preferences based on weights proposed by local experts.
GIS and Multi-Criteria land Evaluation technique using biophysical, socioeconomic, and demographic
variables were employed in selection. Nineteen administrative units were selected in Nigeria where the
intervention project could be implemented.

DOI: 10.4018/978-1-4666-2038-4.ch056

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
A Multi-Criteria GIS Site Selection for Sustainable Cocoa Development in West Africa

INTRODUCTION Nigeria [CRIN], 2008). Cocoa was the most im-


portant agricultural export crop during the 1960-
Cocoa (Theobroma cacao) is one of the most 1970 period, earning a significant percentage of
important perennial crops worldwide, with an the foreign exchange income (Mustapha, 1999).
estimated world production of 2.8 million tonnes The production increased gradually to a maximum
(t) in 2002 (Food and Agriculture Organization of 308,000 t in 1970-1971 but dropped drastically
[FAO], 2003). It is especially a most significant to 110,000 t in 1990-1991. This was because of
crop in West Africa where it occupies between 5 the diversion of government policy during the
and 6 million ha in the coastal humid zone. Nearly First, Second and Third Development plan periods
70% of the world cocoa supply is produced in (1970-1985) into petroleum and food production,
West Africa (International Cocoa Organization and the poor price policy of the Marketing Board
[ICCO], 2010), 90% of which is grown on nearly which resulted in the abandonment of cocoa farms
2 million small family farms, the majority with (Mustapha, 1999).
land holdings of 2 ha or less. It is estimated that In view of the undulating nature of the cocoa
at least 16 million people depend on cocoa for sector as a result of factors such as socio-econom-
the majority of their cash income (International ics, government policies, management practices,
Institute of Tropical Agriculture [IITA], 2005a). and declining natural resources (Bastide & Perret,
The average cocoa farming family in West 2007), it is necessary to seek a viable and sustain-
Africa earns US$100 per person per year from able cocoa production system in West Africa.
cocoa (IITA, 2002; ICCO, 2007). The long-term It is therefore important to carefully target
social and economic well-being of these families the program’s intervention in the cocoa sector of
and their communities depends upon the viability the West and Central African countries to ensure
and sustainability of the West African cocoa sec- maximum program impact and conformity with
tor. The sector is also vital to national economies national agricultural sector priorities. This will
with cocoa among the top three agricultural export include the identification and selection of com-
products in each target country (Bastide & Perret, munities through systematic spatial targeting that
2007). To ensure this, cocoa farming needs to meets multiple criteria. World Cocoa Foundation
become a profitable, income-generating profes- (WCF) has asked IITA Sustainable Tree Crop Pro-
sion that allows farmers, both women and men, gram (STCP) to lead the process of site selection,
to provide adequately for their families. building on its recent experience in conducting a
West African cocoa farmers face numerous similar multi-criteria and multi-stage systematic
challenges such as production and marketing selection of target districts/communities for a
inefficiencies as well as stagnant technological multi-sectoral project in Cote d’Ivoire and Ghana
development. For instance, the majority of farm- in collaboration with local expertise.
ers sell their cocoa individually, a few bags at a In view of the complexity involved in the
time, to itinerant buyers without access to market process of decision making, the method of
information and this reduces their earnings relative multi-criteria land evaluation (MCE) is adopted
to the market price. in this study (Khoi & Murayama, 2010). MCE is
In Nigeria, like the rest of West Africa, cocoa the process of determining the fitness of a given
cultivation has witnessed significant changes in parcel of land for a defined use (Stainer, 1991).
the last few decades due to factors such as varia- An MCE involves the selection of the biophysical
tions in international demand, prices, and policy or socio-economic factors, or both, of an area;
measures implemented by both the Federal and the combination of the selected factors with the
State governments (Cocoa Research Institute of decision-makers preferences allows one to create a

913
A Multi-Criteria GIS Site Selection for Sustainable Cocoa Development in West Africa

composite suitability index (Sui, 1993). The MCE factors influencing cocoa farming with respect to
is an effective tool for multiple criteria decision- enabling optimum cropping for maximizing pro-
making issues (Malcewski, 2006). The purpose ductivity while reducing environmental impacts.
of the MCE is to investigate a number of choice
possibilities in the light of multiple criteria and
multiple objectives (Cover, 1991). Integration of METHODOLOGY
the MCE and GIS (GIS-MCE) can help land-use
planners and managers to improve decision- The Study Area
making processes (Malcewski, 1999). GIS allows
the computation of assessment factors and MCE The study area comprises the cocoa growing
aggregates them into a land suitability index. region of Nigeria, consisting of the South West,
This paper aims to identify potential land suit- South East, and South-South geopolitical zones.
ability for cocoa, based not only on biophysical Specifically it consists of the 14 cocoa growing
but also socioeconomic and infrastructural vari- States (Abia, Adamawa, Akwa Ibom, Cross River,
ables in accordance with the framework for land Delta, Edo, Ekiti, Osun, Ondo, Oyo, Ogun, Kwara,
evaluation developed by the Food Agriculture Kogi, and Taraba).
Organization of the United Nations (FAO, 1976) The cocoa growing area in Nigeria lies mainly
using GIS, a decision support system involving below latitude 8 °N within the lowland rain forest
the integration of spatially referenced data in a and forest transition ecological zones (Figure 1).
problem solving environment (Cowen,1988), and Total annual rainfall of the cocoa growing areas
to simultaneously examine major socio-economic of Nigeria ranges from 1200 mm to 3000 mm.

Figure 1. The study area showing ecological zones and protected area of the Cocoa Growing Region
of Nigeria

914
A Multi-Criteria GIS Site Selection for Sustainable Cocoa Development in West Africa

The mean minimum temperature is between 20 Input Data and Data Sources
°C and 22 °C; the mean maximum during the dry
season varies from 31 °C to 32 °C and in the wet The input data used for this study were based
season from 27 °C to 29 °C (Wood & Lass, 1987). on the selected evaluation factors discussed in
The elevation of the study area ranges from the next part. They include a gridded rainfall,
50 to 2000 m above sea level. More than 50% of temperature, relative humidity map, vector soil
the area falls within elevation range between 200 map, road network map, and cocoa production
and 400 m while about 10% of the areas have and other relevant socioeconomic data (Table
high elevation of above 800 m. The regions of 1). The climate grid surface data were obtained
high elevation occur at the extreme eastern part from Worldclim - Global climate data, database
of the study area at the border between Cameroon documented in Hijmans, Cameron, Parra, Jones,
and Nigeria. and Jarvis (2005) at resolution of 30 arc seconds
Most of the study area has a high population (approximately 1 km) (http://www.worldclim.
density generally above 200 persons/km². Mean org/). Relative Humidity data were obtained from
population density of 185 person/km² in the Climatic Research Unit (CRU) of the University
study area is greater than the national average of of East Anglia, UK as described by New, Lister,
about 153 persons/km². About 46 million of the Hulme, and Makin (2000). Protected area data
national population of 140 million reside within and digital elevation model (DEM) were extracted
the 14 cocoa growing States (National Population for the study area from global databases (SRTM,
Commission [NPC], 2007) 2000). Soil depth, soil organic matter, and soil pH
factor maps were extracted from the digital soil
map of the world (FAO, 1974). Road density, an

Table 1. Databases used in this research

Factor Suitable Range Data Source


Rainfall (mm) 1500-3000 http://www.worldclim.org/
Minimum temperature (°C) 18-21 http://www.worldclim.org/
Maximum temperature (°C) 29-32 http://www.worldclim.org/
Relative humidity (%) 70-100 CRU http://www.cru.uea.ac.uk
Elevation (m) 0-300 SRTM, 2000
Organic carbon (%) >2 FAO, 1974
Soil depth (cm) >150 FAO, 1974
Soil pH 5.5-6.5 FAO, 1974
Cocoa production (t) 500 – 30,000 CRIN, 2008
Food crop farm size (ha) 5 - 10 CRIN, 2008
Cocoa farm gate price (Naira/bag) 7500 - 9000 CRIN, 2008
Population density (persons/km²) 50 - 500 NPC, 2007
Road density (km/km²) 0.10 - 0.5 Geospatial Laboratory, IITA
Farmer organization (%) 40-100 CRIN, 2008
Government support 10 - 40 IITA, 2008
(No. Farmer Field Schools in the State)
Protected area World Protected area http://www.wdpa.org/

915
A Multi-Criteria GIS Site Selection for Sustainable Cocoa Development in West Africa

indication of the infrastructural development of Cocoa Multi-Criteria Land Evaluation


the region, was extracted from the road network
data by computing the total length of road (km) The GIS Multi-criteria procedure for the cocoa
divided by area (km²) in each local government cultivation suitability assessment in the cocoa
area (LGA). The resolution of all raster factor growing region included several stages that are
maps was set at 1 km × 1 km. These data were framed in Figure 2. The determination of the
used for delineating areas suitable for sustainable relevant factors was the starting step in the as-
cocoa production. sessment and was followed by standardizing the
factors, weighting the factors, combining the fac-
tors with their weights, and finally reclassifying
the suitability into three levels of low, medium,
and high. The procedures and algorithms available

Figure 2. Flow chart of cocoa land suitability assessment

916
A Multi-Criteria GIS Site Selection for Sustainable Cocoa Development in West Africa

in ArcGIS 9.3 (Environmental System Research excluded from further analysis (Nath et al., 2000;
Institute [ESRI], 2008) were employed to imple- Son & Shrestha, 2008).
ment the assessment.
Initially, the factors were selected based on Cocoa Suitability Assessment
their relevance to suitability in sustainable cocoa Based on Biophysical Factors
production and the availability of databases. The
selection of factors is a technical process that is Climate plays a fundamental role generally in
based on expert knowledge or empirical research. agriculture and in cocoa production. The quantity
This was done at a meeting of stakeholders com- and quality of yields can be affected by water stress
prising cocoa industry partners, government and heat stress (Kassam, Velthuizen Van, Fischer,
representatives, cocoa scientists, and other experts. & Shah, 1991). The limitation condition method
They participated in selecting the factors, identi- of land evaluation described by Sys, Ranst, and
fying the suitable ranges of the factors, and Debaveye (1991a, 1991b) as described by various
evaluating the weights of the factors. Furthermore, researchers, such as Maes, Vereecken, and Darius
from this meeting, the criteria to be used were (1987) and Rodriguez (1995) was used for rating
grouped into four thematic areas based on the four the biophysical land suitability classes which
specific objectives of the project. The procedures implies that all rating below a given threshold
for cocoa land suitability assessment based on the was considered limiting the overall suitability
four objectives are described below. of such area.
Binary reclassification method was applied to
Suitability Assessment Based each climatic layer. All pixels that met the suit-
on National Cocoa Development able range of rainfall (1500-3000) mm, minimum
Prioritization and Industry Investments temperature (18-21°C), maximum temperature
(29-32°C), and relative humidity (70-100%) were
To identify areas that have been defined by national given codes of 1 and those outside those ranges
policy as priority for cocoa production and where were coded 0. The resulting intermediate layers
cocoa industry partners are investing or have in- were combined in a weighted algebraic operation
vested in production and marketing, the number with higher weights of 55% given to rainfall and
of Farmer Field schools (FFS) established in each the other climatic factors had weights of 15%
cocoa growing State of Nigeria was used as an assigned to them (Zuidema et al., 2005). This
indicator for government investment priorities. resulted in a climatic suitability map (Figure 2).
The FFS efforts of the States are a reflection of A similar procedure was applied to soil conditions
their response to the cocoa development agenda as specified in Table 1 and the map of soil suit-
of the National Cocoa Development Committee. ability was obtained. The combination of climatic
The program is a partnership program involving and soil suitability resulted in biophysical land
the State governments and the Federal Ministry suitability (Figure 2).
of Agriculture and the various cocoa associations. Bowen and Hollinger (2004) reported that
Data on the number of FFS for the cocoa grow- precipitation and temperature follow the “law of
ing region of Nigeria were obtained from IITA, the minimum”. This means if a variable is limiting,
2008 and a simple attribute query operation was the species cannot be grown, even if all the other
performed in GIS to select States that had at least variables are not limiting (Holzkämper, Calancaa,
ten FFS. According to the weight assigned to & Fuhrer, 2010). Hence all areas that did not fall
this factor, all States that met this condition were within the suitable zones of biophysical factors
included and those that failed to meet this were were excluded from the next step of evaluation.

917
A Multi-Criteria GIS Site Selection for Sustainable Cocoa Development in West Africa

Cocoa Suitability Assessment Based ranges had been ranked and resulted in the socio-
on Socio-Economic Environment economic suitability layer (Figure 2).

The third set objective is to identify an area that Cocoa Suitability Assessment
has the potential for impact on cocoa household Based on Demographic and
income. To evaluate land based on this, we needed Infrastructural Project Environment
the following socioeconomic parameters; cocoa
production, cocoa farm gate price, and food crop The population density is the number of persons/
farm size. Cocoa production data in cocoa grow- km² and this was obtained from the Nigerian
ing LGAs is the quantity (t) of cocoa purchased Population census of 2006 (NPC, 2007). At the
in the main cocoa season of the year 2005. This stakeholders’ meeting, it was agreed that an area
was chosen to reflect the current status of cocoa with a very low population density implied a
production and also to give an indication of the sparse population of farmers limiting the number
gap between the current and potential production of farmers that can be reached through the project.
levels. The production data were obtained from An area with a very high population density, on
CRIN (2008). Higher production areas are of high the other hand, becomes a constraint on land avail-
priority in the selection process, hence suitable ability for cocoa farm land expansion. Hence only
LGAs must have an annual production of at least cocoa farming areas where population densities
500 t (Table 1). are between 50 and 500 persons/km² fall within
Average food crop farm size is another item suitable ranges in the selection process.
of socio-economic data employed in the deci- Road development and access to infrastructure
sion- making process. It is an indication of high was another factor considered in the selection
potential for household food security in the cash process. Road density in each LGA was used as
crop economy and a measure of sustainable devel- a proxy for access to infrastructure and this was
opment in cocoa growing areas. LGAs with high derived from both paved and unpaved roads.
food crop farm size are considered suitable in the Road density is defined as the total length of road
selection process and so a cutoff point of at least network per unit area (km/km2). Digitized road
5 ha was set for inclusion in the suitability zones. data were obtained from the GIS database of the
In addition, data on farm gate prices measured Geospatial Laboratory, IITA. Areas with a road
in price per bag of cocoa were also included in density from 0.10 to 0.50 were considered useful
socio-economic environmental considerations for in the selection process because places with higher
sustainable cocoa production. Areas where prices values correspond with a very high population
are low suggests poor marketing infrastructure density and lower values correspond with water
while high prices are good indicators of profit- bodies or forest reserves and are classified as
ability of cocoa farming in the area. These data unsuitable for the project.
were obtained on LGA levels from CRIN, (2008). The percentage of cocoa farmers registered in
Based on cocoa experts’ opinion, areas with low farmers’ organizations was also used in the selec-
prices are the priority for project intervention in tion. Farmers who belong to registered farmers’
this study; hence the cutoff point for this was organizations have access to input, credit, and
set at a maximum of N9000 per bag in the LGA warehousing. One of the project’s objectives is to
(Table 1). identify areas with the potential to enhance pro-
All these socio-economic layers were com- gram efficiency. LGAs that have at least 40% of
bined in simple algebraic operations after suitable their farmers registered in a farmers’ organization
were considered suitable in the study.

918
A Multi-Criteria GIS Site Selection for Sustainable Cocoa Development in West Africa

Spatial overlay of the criteria layers was ers are trained in integrated crop and postharvest
performed in iterative multi-stage procedures management practices. The success of FFS has
(Figure 2). All biophysical criteria layers were been found to result in raising farmer incomes,
combined in additive raster overlay while similar increasing productivity, and reducing dependency
procedures were employed for socio-economic on costly agrochemicals as well as in improving
and demographic factors. All the intermediate cocoa quality (IITA, 2005b). It is therefore impera-
outputs were then combined to give the final tive that government investment in FFS plays a
priority area. A protected area layer was used to crucial role in selecting project implementation
exclude all of the suitable area that belongs to a communities. Among the five States that are suit-
national park, forest or game reserve. able according to government support criteria, the
three top cocoa production States of Ondo, Cross
River, and Osun were selected. The remaining two
DISCUSSION States of Edo and Abia, even though they are not
in the high producing States, are both enjoying
Figure 3a presents the result of cocoa suitabil- good government support (IITA, 2008).
ity assessment based on government policy and The result of combining all biophysical factors,
investment drives. Government policy has been soil, elevation, and climate is shown in Figure 3b.
one major factor that has influenced the cocoa It is clear from Table 2 that about 70% of the study
sector in Nigeria (Coleman, Akiyama, & Varangis, area is suitable with 19% in the category of mod-
1993) and so it is selected as one of the limiting erately suitable and 50% in the high suitability
criteria for the cocoa intervention project. From class. This result also suggests that there are more
the figure, only five States out of the 14 cocoa areas where cocoa could be grown than where it
producing States were qualified to participate in is currently being grown in Nigeria. All the areas
the project because of good government drives that fall within moderate-to-high land suitability
to educate the cocoa farmers in order to revamp are potential areas where cocoa production could
the sector. Nigeria as a developing country was be explored. The major proportion of Anambra,
rated the second largest world producer of cocoa Imo, Enugu, Rivers, and Benue States (Figure 3b)
in the 1960s (Adegbola & Abe, 1983), and, for a could serve as an expansion zone for cocoa pro-
long time, the crop has been generating substan- duction in Nigeria. Even though these States are
tial foreign exchange earnings for the country. not included in the cocoa producing States due to
However, the production of this important cash low production levels, they have good potentials
crop for export has suffered a reduction in recent for the expansion of cocoa cultivation in Nigeria
years in the country owing to a number of factors, (CRIN, 2008). Low adoption and production of
such as government policy, low yield, ageing of cocoa in these States are due not only to bio-
cocoa farms, and farm size (Folayan, Daramola, physical constraints but also to some socio-cul-
& Oguntade, 2006; Oduwole, 2004). Moreover tural issues, as has been observed by Manyong et
Oluyole and Sanusi (2009) reported that the al. (2004).
educational level of farmers is one other major The result of cocoa land evaluation based on
factor that has influenced cocoa production in socio-economic factors, such as cocoa production
Nigeria, implying that strong government sup- level, cocoa farm gate prices, and food crop farm
port is necessary if cocoa production is to thrive size, is presented in Figure 3c. Areas that fall
again. The FFS is a new approach pioneered by within the low suitability class are about 31% of
the Sustainable Tree Crops Program (STCP) for the study area while only 4.6% belong to the high
efficient technology dissemination whereby farm- suitability category (Table 2). This result contrasts

919
A Multi-Criteria GIS Site Selection for Sustainable Cocoa Development in West Africa

Figure 3. (a) Cocoa growing States showing government support as indicated by number of Farmer
Field Schools; (b) cocoa land suitability; (c) cocoa suitability based on socio-economic factors; (d)
cocoa suitability based on demographic and infrastructural factors

with the suitability obtained for biophysical cri- ducing States after Ondo and Cross River had low
teria where a larger percentage of the total area suitability and unsuitable categories. This obser-
was found to be highly suitable (Figure 3b). vation suggests socio-economic factors, such as
Moreover it is evident from Figure 3c that high farm gate price and food crop farm size, are lim-
suitability classes occurred only in Ondo and iting factors in selecting intervention sites for
Cross River States while moderately suitable cocoa sustainable projects. It therefore follows,
areas were found in Ogun, Ekiti, Edo, and Kogi as observed by Son and Shrestha (2008) that,
States. By socio-economic constraints, Osun when interventions are being formulated to im-
which is the third in the rank of high cocoa pro- prove farmers’ income and preserve land re-

Table 2. Percentage suitability in each category for all factors

Factor High Suitability Moderate Suitability Low Suitability Unsuitable


Land suitability 50.26 18.69 22.85 8.20
Socio-economic 4.62 15.09 30.47 49.82
suitability
Demography & infra- 10.01 31.08 21.09 37.82
structure
All factors 3.43 5.64 3.32 87.62

920
A Multi-Criteria GIS Site Selection for Sustainable Cocoa Development in West Africa

sources, an adequate understanding of the socio- The final selected area based on all factors
economic context of production is important to is shown in Figure 4 and reveals that five States
accommodate the real needs of land users. have suitable sites. This is due to the limiting
The result of cocoa land assessment based priority condition of government support (Figure
on demographic and infrastructural constraints 3a). Furthermore areas that were suitable based
is presented in Figure 3d and Table 2. Based on on other factors such as land and socio-economic
these factors, highly suitable areas (10%, Table 2) variables have also been excluded where they fall
occurred in six cocoa growing States, Oyo, Osun, within a protected area. Hence it is not surprising
Ogun, Kogi, Kwara, and Ekiti. The proportion that the highly suitable area was only 3.4% while
of moderately suitable land was much larger at the moderately suitable places were only about 6%
about 30% (Table 2) and was found in almost all (Table 2). More than 87% of the study area was
the 14 cocoa growing States except Taraba and not suitable for this project’s intervention sites.
Adamawa. The higher suitability observed in the Table 3 lists the selected LGAs and it shows
southwestern part of Nigeria is probably influ- that 19 LGAs were selected, three of which belong
enced by the higher level of farmers’ organization to the high suitability category while seven were
in this region where well organized cocoa farmers in the moderately suitable area. Nine LGAs were
have been reported (Oduwole, 2004; Oluyole & found in the low suitability category. A total area
Sanusi, 2009). Membership of farmers’ associa- of about 20,000 km² was selected. High suitabil-
tions is crucial as it has been found to improve the ity LGAs were distributed evenly among the three
adoption of new technologies and to raise farmers’ top high-producing States of Ondo, Cross River,
income (Agbeniyi, Ogunlade & Oluyole, 2010) and Osun. Moderately suitable areas can be found

Figure 4. The study area showing cocoa suitability based on all biophysical, socio-economic, and de-
mographic factors, including government support

921
A Multi-Criteria GIS Site Selection for Sustainable Cocoa Development in West Africa

in all the selected States except in Abia where the Moreover, the approach highlights the partici-
only selected area was in the low suitability cat- patory decision-making process (Eastman, Jin,
egory. Kyem, & Toledano, 1992). Therefore, it helps to
minimize and solve conflicts among competing
interests in the area of cocoa land-use management.
CONCLUSION AND The GIS-MCE approach has been widely
RECOMMENDATION applied in land suitability analysis (Malcewski,
2006), however, the application of the method in
Though the GIS-MCE approach provides an effec- cocoa sustainable land management is relatively
tive framework for land evaluation, the selection new in Nigeria. The MCE of soil, topography,
of assessment factors and the identification of a and socioeconomic factors was exemplified to
suitable range for each factor have a direct influ- be useful for delineating areas suitable for cocoa
ence on the results. In this study, the factors were production in the region. In particular, the in-
selected based on the experts’ opinion; therefore volvement of local experts was vital to obtaining
they represent a considerable share of the factors consistent results. The experts played key roles in
relevant to the suitability of cocoa growing areas the selection of the evaluation factors and in the
in the region. For instance, the inclusion of the determination of the factor weights. The applica-
government priority and investment variable has tion of this paper can be useful for the managers
a strong influence on the States selected for the and planners of cocoa sustainable projects.
project’s intervention sites. The study shows that GIS databases of different
It can also be concluded that the integration formats and sources can be efficiently integrated
of spatial databases and expert knowledge sig- to establish a land suitability assessment for co-
nificantly enhances the decision-making capacity coa production. The methodology is useful for
when land suitability evaluation is undertaken. identifying priority areas for cocoa farming, and

Table 3. Selected Local Government Area (LGA) for cocoa sustainable project

Area Area
State LGA Suitability Class (km²) State LGA Suitability Class (km²)
Cross
River Obudu Moderate 371 Abia Bende Low 591
Cross
Edo Owan East Moderate 959 River Ogoja Low 977
Cross
Ondo Odigbo Moderate 2193 River Obanliku Low 936
Cross
Osun Irewole Moderate 310 River Etung Low 869
Osun Aiyedaade Moderate 1017 Edo Owan West Low 672
Osun Atakumosa West Moderate 868 Edo Etsako West Low 1637
Osun Oriade Moderate 257 Ondo Akure North Low 669
Cross
River Boki High 4868 Osun Atakumosa East Low 385
Ondo Idanre High 1880 Osun Boluwaduro Low 145
Osun Obokun High 474
Total Area 20,079 (km²)

922
A Multi-Criteria GIS Site Selection for Sustainable Cocoa Development in West Africa

thus it contributes to improving the efficiency of Coleman, J. R., Akiyama, T., & Varangis, P. N.
conservation and of sustainable land management. (1993). How policy changes affected cocoa sec-
The land suitability information produced tors in sub-Saharan African countries: Vol. 1.
in the recent research is valuable. However, the (Research Report No. WPS 1129). Washington,
socio-economic information was based on a single DC: World Bank.
survey carried out by the Cocoa Research Insti-
Cover, S. J. (1991). Integrating multi-criteria
tute of Nigeria in 2005 which has its limitations.
evaluation with geographical information
More comprehensive socio-economic factors
systems. International Journal of Geo-
and other infrastructural variables could be used
graphical Information Systems, 5, 321–339.
to enhance the accuracy of the results obtained.
doi:10.1080/02693799108927858
Therefore, we recommend that future studies
should consider broader and more detailed socio- Cowen, D. J. (1988). GIS versus CAD versus
economic information to offer decision-makers a DBMS: What are the differences? Photogram-
comprehensive basis for a feasible strategy, and metric Engineering and Remote Sensing, 54,
sound decisions about more sustainable cocoa 1551–1554.
production in Nigeria.
Eastman, J. R., Jin, W., Kyem, P. A. K., & Tole-
dano, F. (1992). Participatory procedures for
multi-criteria evaluation in GIS. In Proceedings
REFERENCES
of the Chinese Professionals in GIS (pp. 281-288).
Adegbola, M. O. K., & Abe, J. O. (1983). Cocoa Environmental Systems Research Institute.
development programme, Nigeria. Cocoa Re- (2008). ArcGIS 9.3. Retrieved from http://www.
search Institute of Nigeria Printing Unit, 9, 3–5. esri.com/
Agbeniyi, S. O., Ogunlade, M. O., & Oluyole, K. Folayan, J. A., Daramola, G. A., & Oguntade, A.
O. (2010). Fertilizer use and cocoa production in E. (2006). Structure and performance evaluation
Cross River State, Nigeria. Journal of Agricultural of cocoa marketing institutions in South-Western
and Biological Science, 5(3), 10–13. Nigeria: An economic analysis. Journal of Food
Bastide, P., & Perret, C. (2007). Atlas on regional Agriculture and Environment, 4(2), 123–128.
integration in West Africa: Cocoa. Paris, France: Food and Agriculture Organization. (1974). Soil
Organization for Economic Co-operation and map of the world (Vol. 1-10). Paris, France: Food
Development. and Agriculture Organization of the United Na-
Bowen, C. R., & Hollinger, S. E. (2004). Geo- tions and UNESCO.
graphic screening of potential alternative crops. Food and Agriculture Organization. (1976). A
Renewable Agriculture and Food Systems, 19, framework for land evaluation. FAO Soils Bul-
141–151. letin 32.
Cocoa Research Institute of Nigeria. (2008). Ni- Food and Agriculture Organization. (2003). Agri-
geria Cocoa production survey. Nigeria, Africa: culture data. Retrieved from http://faostat.fao.org
Cocoa Research Institute of Nigeria (CRIN).

923
A Multi-Criteria GIS Site Selection for Sustainable Cocoa Development in West Africa

Hijmans, R. J., Cameron, S. E., Parra, J. L., Jones, Kassam, A. H., Velthuizen Van, H. T., Fischer, G.
P. G., & Jarvis, A. (2005). Very high resolution W., & Shah, M. M. (1991). Agro-egological land
interpolated climate surfaces for global land resources assessment for agricultural development
areas. International Journal of Climatology, 25, planning - A case study of Kenya - Resources data
1965–1978. doi:10.1002/joc.1276 base and land productivity - Technical Annex 3.
World Soil Resources Reports, 78.
Holzkämper, A., Calancaa, P., & Fuhrer, J. (2010).
Evaluating climate suitability for agriculture in Khoi, D. D., & Murayama, Y. (2010). Delinea-
Switzerland. Paper presented at the Fifth Biennial tion of suitable cropland areas using a GIS based
Meeting of the International Congress on Envi- multi-criteria evaluation approach in the Tam Dao
ronmental Modelling and Software Modelling National Park Region, Vietnam. Sustainability,
for Environment’s Sake, Ottawa, ON, Canada. 2, 2024–2043. doi:10.3390/su2072024
International Cocoa Organization. (2007, June 4). Maes, J., Vereecken, H., & Darius, P. (1987).
Sustainable cocoa economy: A comprehensive and Knowledge processing in land evaluation. In
participatory approach. Paper presented at the Beek, K. J., Burrough, P. A., & MacCormack, D.
Thirteenth Meeting ICCO Offices Consultative E. (Eds.), Quantified land evaluation procedures.
Board on the World Cocoa Economy London, UK. Enschede, The Netherlands: ITC.
International Cocoa Organization. (2010). Cocoa Malcewski, J. (1999). Spatial multicriteria
year 2009/2010. Quarterly Bulletin of Cocoa decision analysis. In Thill, J.-C. (Ed.), Spatial
Statistics, 36(2). multicriteria decision making and analysis: A
geographical information sciences approach (pp.
International Institute of Tropical Agriculture.
101–269). Aldershot, UK: Ashgate.
(2002). Sustainable tree crop project (IITA): A
report on the baseline survey on cocoa farming Malcewski, J. (2006). GIS-based multi-
livelihood of West Africa. Ibadan, Nigeria: Inter- criteria decision analysis: A survey of the
national Institute of Tropical Agriculture. literature. International Journal of Geo-
graphical Information Systems, 20, 703–726.
International Institute of Tropical Agriculture.
doi:10.1080/13658810600661508
(2005a). STCP impact brief series, issue no 1.
Retrieved from http://www.treecrops.org Manyong, V. M., Okike, I., Legg, C., & Makinde,
K. O. Haan, de N., &. Alabi, T. (2004). NDDC
International Institute of Tropical Agriculture.
master plan project: The agriculture and rural
(2005b). STCP impact brief series, issue no 2.
enterprises sector study report. Asokoro, Nigeria:
Retrieved from http://www.treecrops.org
NDDC.
International Institute of Tropical Agriculture.
Mustapha, A. R. (1999). Cocoa farming and
(2008). National cocoa development committee
income diversification in South-western Nigeria.
and sustainable tree crop FFS capacity building
Nigeria: De-Agrarianisation and Rural Employ-
project report. Ibadan, Nigeria: International
ment Network (DARE).
Institute of Tropical Agriculture.
National Population Commission. (2007). Census
2006 results -36 states and FCT. Retrieved from
http://www.population.gov.ng

924
A Multi-Criteria GIS Site Selection for Sustainable Cocoa Development in West Africa

New, M., Lister, D., Hulme, M., & Makin, I. SRTM. (2000). US geological survey. Sioux Falls,
(2000). A high-resolution data set of surface cli- SD: EROS Data Center.
mate over global land areas. Climate Research,
Stainer, F. (1991). Landscape planning: A method
21, 1–25. doi:10.3354/cr021001
applied to a growth management example.
Oduwole, O. O. (2004). Adoption of improved ag- Environmental Management, 15, 519–529.
ronomic practices by cocoa farmers in Nigeria: A doi:10.1007/BF02394742
multivariate Tobit analysis. Unpublished doctoral
Sui, D. Z. (1993). Integrating neural networks
dissertation, Federal University of Agriculture,
with GIS for spatial decision making. Operation
Akure, Nigeria.
Geography, 11, 13–20.
Oluyole, K. A., & Sanusi, R. A. (2009). Socio-
Sys, C., Ranst, E., & Debaveye, J. (1991a). Land
economic variables and cocoa production in Cross
evaluation part I: Principles in land evaluation
River State, Nigeria. Journal of Human Ecology
and crop production calculations. Brussels, Bel-
(Delhi, India), 25(1), 5–8.
gium: GADC.
Pereira, J. M. C., & Duckstein, L. A. (1993). Mul-
Sys, C., Ranst, E., & Debaveye, J. (1991b). Land
tiple criteria decision-making approach to GIS-
evaluation part II: Methods in land evaluation.
based land suitability evaluation. International
Brussels, Belgium: GADC.
Journal of Geographical Information Systems,
7, 407–424. doi:10.1080/02693799308901971 Wood, G. A. R., & Lass, R. A. (1987). Cocoa
tropical agriculture series (4th ed.). London,
Rodriguez, O. (1995). Land use conflicts and
UK and New York, NY: Longman Scientific and
planning strategies in urban fringes. A case study
Technical and John Wiley & Sons.
of Western Caracas, Venezuela. Unpublished
doctoral dissertation, University of Ghent, Gh- Zuidema, P. A., Leffelaar, P. A., Gerritsma, W.,
ent, Belgium. Mommer, L., Niels, P. R., & Anten, N. P. R.
(2005). A physiological production model for
Son, N. T., & Shrestha, R. P. (2008). GIS-Assisted
cocoa (Theobroma cacao): Model presentation,
land evaluation for agricultural development in
validation and application. Agricultural Systems,
Mekong Delta, Southern Vietnam. Journal of Sus-
84, 195–225. doi:10.1016/j.agsy.2004.06.015
tainable Development in Africa, 10(2), 875–895.

This work was previously published in the International Journal of Applied Geospatial Research (IJAGR), Volume 3, Issue 1,
edited by Donald Patrick Albert, pp. 73-87, copyright 2012 by IGI Publishing (an imprint of IGI Global).

925
926

Chapter 57
The Columbia Regional
Geospatial Service
Center System:
A Proven Model for the United States

P. R. Blackwell
Stephen F. Austin State University, USA

Darrel McDonald
Stephen F. Austin State University, USA

ABSTRACT
During the past 20 years, the role of geospatial technology in society has increased dramatically.
However, the impact of these technologies in rural areas remains minimal. In Texas, a federally funded
project called the Columbia Regional Geospatial Service Center System (the System) has emerged as a
model for bringing the benefits of geospatial technology to all portions of society. The model involves
distributed, academically based Centers, each with regional specializations, linked together into a uni-
fied system for addressing critical needs in emergency response, economic development, and natural
resource management. The Centers operate on three focus areas, i.e., data, applications, and training.
The Columbia Center has been in operation for five years and has demonstrated the practical strength
of the System through numerous local and statewide projects, responses to natural disasters, and other
geospatial activities.

DOI: 10.4018/978-1-4666-2038-4.ch057

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
The Columbia Regional Geospatial Service Center System

INTRODUCTION geospatial needs of Texas. In effect, the Columbia


Center System extends the reach of TNRIS to all
History areas of the State.

In 2005, the United States Congress established Mission


the Columbia Regional Geospatial Service Center
System through a federal appropriation. Initially, The Columbia Regional Geospatial Service Center
the System consisted of the Columbia Regional System provides regional geospatial support for
Geospatial Service Center at Stephen F. Austin emergency preparedness and response, economic
State University (SFASU) in Nacogdoches and development and resource management. The Sys-
the Regional Geospatial Service Center at the tem serves all interest groups, public and private,
University of Texas at El Paso. The appropriation, by providing cutting edge technology tempered
championed by Senator Kay Bailey Hutchison, with a pragmatic, bottom-up philosophy. This
came as recognition of the contributions of SFASU broad mission allows the System flexibility to
during the recovery effort after the loss of STS 107, discover new ways to apply geospatial technology
the Space Shuttle Columbia. But the origins of for the common good. At the same time, the three
the project go back to the 1990s when researchers mission areas give focus to the Systems efforts,
at SFASU first recognized the disadvantages that while leveraging the work in each mission area
rural communities suffered in terms of adopting to the benefit of the other areas. Put another way,
geospatial technology to address pressing prob- the same geospatial data, applications and training
lems. The work that went on quietly in East Texas used for emergency preparedness and response
over the next ten years resulted in development can be applied to economic development and
of capabilities that enabled SFA to contribute so natural resource management issues (McDonald
effectively to the recovery effort (Gehman, 2003). et al., 2007).

Relevance
THE MODEL
Senator Hutchison challenged the Columbia
Center to demonstrate a better way to bring the The Columbia Regional Geospatial Service Center
advantages of geospatial technology to all parts System Model is comprised of eight principles,
of the state, and in so doing create a model for each of which defines an important aspect of the
the nation. During the course of the ensuing successful program. These principles are a com-
five years, the system has done that. Today the bination of fundamental concepts upon which the
Columbia Regional Geospatial Service Center project is based and pragmatic additions that have
System is comprised of five Centers: the original developed over the years and refined by experi-
two in Nacogdoches and El Paso; the Center for ence. These principles are the foundation of the
Geospatial Technology at Texas Tech University Columbia Regional Geospatial Service Center
in Lubbock; a facility in the Department for Earth System (Kroll et al., 2006).
and Space Sciences at Lamar University; and the
Center for Earth and Environmental Studies at the Distributed Centers
Texas A&M International University in Laredo.
The System operates in close coordination with The idea for regionally based, distributed centers
The Texas Natural Resources Information System dates back to very early thinking about the chal-
(TNRIS), a state agency mandated to serve the lenge of bringing geospatial technology to rural

927
The Columbia Regional Geospatial Service Center System

areas. The problem persists that state and feder- at the local level, working with city and county
ally based programs often do not have sufficient governmental agencies, local police and sheriff
reach to impact local jurisdictions effectively. An offices, fire fighters, appraisal districts and others.
obvious solution is to place resources where they This local focus is one of the defining strengths of
are needed, i.e., locally. This idea became a core the Columbia Regional Geospatial Service Center
concept of the Columbia System model during the System. The origin of this concept and conviction
Space Shuttle Columbia recovery effort in 2003. in its value comes from hard-earned experience
It was observed that the farther the search ranged in dealing with real-world emergencies. Emer-
from geospatial resources the more difficult it be- gencies are local and effective response during
came to adequately support the effort. As a result, the first critical hours after an event depends on
the concept of “Forward Mapping Centers” was action by local authorities. Often, however, local
developed and successfully deployed during the jurisdictions do not have the technical resources
first critical weeks of the recovery (Blackwell et they need to respond effectively. The Columbia
al., 2003). Today this idea is reflected not only in System Model addresses that unmet need with
the regional distribution of the Columbia System focused geospatial technology and support at the
Centers, but also in the Geospatial Rapid Response local level.
teams being developed by the System.
Bridging the Divide
Regional Specialization
This focus on the local level allows the regional
Further, the needs of different areas vary greatly. Centers to bridge the gap between state and fed-
For example, forestry issues that are prevalent eral programs and local agencies. The Centers’
in East Texas hold little interest for the citizens roles are to “connect the last mile” between these
of El Paso. At the same time international issues programs and the individuals and agencies at the
that dominate life in El Paso are less meaningful local level that need them. This is essential, par-
in East Texas. Each region has its own concerns ticularly in rural areas where resources are thin
that are best addressed by regional centers. In and capacity lacking. Thus, the Columbia Center
addition, each Center is able to concentrate on System focuses on rural Texas. In most urban
local needs and develop a level of expertise that areas, geospatial technology is well entrenched
would be difficult to achieve by a centralized ef- (Croswell, 2009; DeMers, 2009). In rural parts
fort. Other examples of regional specialization are of the state, as in most rural areas in America,
the research on the Ogallala aquifer by the Center geospatial resources often do not exist. Regionally
for Spatial Technology at Texas Tech (Mulligan distributed Centers close the gap between urban
et al., 2008) and the coastal process work be- and rural access to geospatial technology, data
ing undertaken by the Department of Earth and and procedures.
Space Sciences at Lamar (Farmer et al., 2010).
These regional specialties are great assets to the Local Data
individual region and bring strength and depth
to the System. The System Centers focus on local data. Local
data are often more current and accurate than
Bottom-Up Philosophy data compiled at the state or federal level, and
the most efficient way to build compilations of
The regional distribution of Centers leads to a dis- large-scale, timely and accurate data across the
tinctly bottom-up philosophy. Centers concentrate states and the nation is with good local data. The

928
The Columbia Regional Geospatial Service Center System

Columbia Center System is dedicated to this System Linkage


concept and works with local jurisdictions to
find, vet, update, augment, or create good qual- While the regional Centers provide appropriate,
ity geospatial data. These data are archived and customized services to local and regional agencies,
distributed by the Centers and passed up the chain they do not operate in a vacuum. The Centers are
to TNRIS for inclusion in state-level archives connected to the System by common needs and
(Figure 1). Coordinating schemas and standards a shared mission as well as compatible technol-
across domains are challenges that the Columbia ogy. The Centers work together to enhance the
Center System is dedicated to meet. Reliable, ac- strengths of each Center to address the issues
curate and timely local data are essential to any of each region. Through weekly meetings via
state or federal level compilation. It is impossible video conferencing, and quarterly face-to-face
for state or federal agencies to bring the same coordination meetings, the Centers share their
resources to a mapping project as local agencies experiences and capabilities to the benefit of all.
can. In terms of local knowledge, timeliness, High-performance networking is key to this co-
detail, workforce, the local effort wins every ordination. In Texas, the Lonestar Education and
time. Often, however, the necessary resources Research Network (LEARN) provides high-speed
and expertise are not available at the local level. connectivity between the Centers and TNRIS,
This is where the Columbia Regional Geospatial enabling the Centers to operate as one when
Service Center System plays an important role in necessary. The System structure makes statewide
providing functional geospatial data at all levels. projects possible as well, allowing the resources
of all Centers to be focused on statewide issues.
In large-scale emergencies, all Centers contribute
to a unified response. Regular training and drills

Figure 1. Data flow model from local to state to federal compilations

929
The Columbia Regional Geospatial Service Center System

keep skills sharp and lead to a strong esprit de each Center’s ability to carry on its academic
corps across the organization. mission in addition to meeting the needs of a
regional Center. Secondly, academic institutions
Data Replication are often progressive in terms of adopting new
technology. This willingness to grow and change
The Columbia Center System realizes that single is essential in a field that is as dynamic as geo-
points of failure are inconsistent with any operation spatial technology. Thirdly, high performance
that is expected to provide critical services. The computing and networking infrastructures avail-
System maintains data and services replication to able in universities also are essential in fulfilling
avoid problems caused by failure of any one part the Columbia Center System mission. Finally,
of the System. The importance of this approach has all three parts of higher education’s mission:
been demonstrated many times. In June of 2001, research, education and outreach, are enhanced
for example, Tropical Storm Allison devastated by the Columbia Center System activities. This
southeast Texas dropping 40 inches of rain over makes the university environment the ideal place
some areas (LaNore, 2001). Severe flooding in for Columbia System Centers.
Houston damaged roads, sewers, hospitals and
many businesses. One of the casualties of this
storm was the equipment room where the Texas OPERATIONAL MODEL
Gigapop resides. The Gigpop connects Texas to
Internet2, a high-speed network for universities, The Columbia Center System focuses on inte-
research centers, and other collaborators (http:// grated, regional activity in three broad areas: Data,
www.internet2.edu). This facility was down for Applications, and Training. The three areas and
many days, effectively severing a large part of the technology that overlaps them, form a compre-
the state from the Internet, including SFASU. The hensive program designed to meet the goals and
System model prevents any such interruption of objectives established for the System (Figure 2).
service from a single point of failure by replicating
critical data and services at Centers distributed Figure 2. The Columbia Center System Opera-
across the state. The Columbia Center System’s tional Model comprises three areas of focus: Data,
data replication encompasses many different Applications and Training that overlap technical
technologies and strategies designed to address resources
specific needs. Data replication for emergency
response, for instance, is designed not only to
ensure data integrity but to provide robust access
to data through a variety of delivery mechanisms.

University Based

The Columbia Regional Geospatial Service Center


System Centers are based in academic institu-
tions. This was done for several reasons. Firstly,
participating universities already had established
geospatial expertise and were chosen largely on
the basis of existing capacity. The System built
capacity within the Centers as well, enhancing

930
The Columbia Regional Geospatial Service Center System

Data nized by state and federal agencies as a significant


advantage (National Research Council, 2007).
As previously mentioned, local data are critical to
successful responses, however, all local data are Applications
not equal. Centers work with local stakeholders
to ensure that the best possible data are identi- To be useful, geospatial data must be applied to
fied and made available to all who need them. address specific issues. The Columbia Center Sys-
At times, this means revising or creating data tem recognizes this fact and places considerable
where existing data are inadequate. The System emphasis on developing customized applications
has projects underway across Texas refining and for specific purposes. Applications range from
extending datasets, from building parcel maps in entire protocols such as the Wide Area Damage
the Panhandle to E9-1-1 road-centerline rectifica- Assessment (WADA) program developed for the
tion in East Texas. Texas Military Forces and the System’s vision
For many applications, geospatial data need of Common Operational Picture, the Common
to be integrated at the state level. To meet these Operational Map (COM) (Figure 3) to simple
requirements, the System engages in statewide web-based mapping applications such as Road
data projects ranging from electrical power grid Closure and Fuel Availability web applications
mapping for emergency response to public water developed for Nacogdoches County Emergency
supply critical infrastructure mapping to statewide Management. In all cases, however, applications
E9-1-1 data compilation. With Centers located are designed to bring the power of geospatial
across the state, the System is able to bring the same technology to those that need it in a way that is
bottom-up philosophy to statewide projects as it easy to understand, simple to use and affordable.
does to local activities. This capability is recog-

Figure 3. The Columbia Center Systems Common Operational Map provides compressive, map-based
information to all responders during an emergency

931
The Columbia Regional Geospatial Service Center System

Training Columbia Center System provides customized


training for other federal agencies such as the
The third leg of the System operational model USDA National Forest Services (NFS), National
is training. Even the most elegant geospatial Weather Service (NWS), and US Army Corp of
applications are useless unless people know and Engineers (USACOE). The regional Centers are
understand them. The Columbia Center System able to address the specific needs of agencies
conducts a comprehensive training program cov- within their areas of interest, providing targeted
ering a diverse range of geospatial technologies training to meet a broad range of needs.
and targeting all users from professional GIS
practitioners to the general public.
The System’s training program for the Texas PROOF OF CONCEPT
Military Forces has developed into a new mission
for the Texas State Guard: Wide Area Damage There were two important developmental pro-
Assessment (WADA) (McDonald et al., 2008; grams in the geospatial sciences program at
Spraggins, 2009). The three-stage program in- Stephen F. Austin State University, which predate
volves land navigation using GPS, field search the formation of the Columbia Center on campus.
protocols and advanced geospatial-based planning These programs were instrumental in building geo-
concepts. This program has spawned the concept of spatial competency that contributed to a successful
Geospatial Quick Response Teams (QRTs) within response during the Columbia Shuttle incident.
the Texas State Guard and is being expanded to Starting in the mid 1990s, two geospatial labora-
include Texas National Guard personnel. On-line tories were established to introduce GIS into the
training modules under development for this curriculum and into research initiatives involving
program will increase the audience and reduce local entities, agencies and industry. The Forest
training costs. In the future, all TXMF personnel Resources Institute (FRI) supported research and
may benefit from this training. teaching in the College of Forestry and the Hu-
The concept of WADA, as developed by the manities Urban Environmental Sciences (HUES)
Columbia Regional Service Center System and lab based in Geography concentrated on city/local
TXMF, has application beyond Texas (McDonald government side of GIS applications.
et al., 2009). These principles and techniques
complement other methodologies commonly used Implementing GIS
for damage assessment such as aerial photography
or satellite image analysis. The recent tragedy in FRI built strong relationships with the regional
Haiti illustrates the need for rapid, on-the-ground, forest industry while the HUES group worked
damage assessment. Those assisting with the closely with the City of Nacogdoches to build
relief effort are hampered by the lack of detailed an infrastructure geodatabase. As the programs
information about the geography and extent of matured, entities started to rely upon Stephen F.
the devastation. The WADA initiative provides Austin State University for leadership in imple-
such critical information quickly and efficiently. menting GIS solutions to address service delivery
WADA training is being rolled out for civil and public safety issues (Sementelli & McDonald,
agencies as well as TXMF. WADA training for 2003). Through extended collaborations trust was
police, firefighters, and volunteer first responders built between participating parties that continues
will be made available through each CRGSCS to the present. Recently, Nacogdoches County
Center. Distributed Centers allow this training expanded their investment in GIS/911 mapping
to be presented to jurisdictions across Texas. The resources after working with the Columbia Cen-

932
The Columbia Regional Geospatial Service Center System

ter to improve service delivery and emergency ing a replication solution for local editing by
response capacities. individual counties with database replication
Similar projects have emerged at partner in- back to DETCOG offices and offsite storage to
stitutions of the CRGSCS. For example, at UTEP the Columbia Center.
the Regional Center for Geospatial Services,
Director Raed Al Douri has become the focal Storm Support
point for the Mapas del Norte group that links
GIS operations in several city and county entities A third area that the Columbia Center model has
along the US/Mexican border extending into New been an important aspect in is supporting entities
Mexico. Currently, the Center is working with New responding to storms in the Gulf of Mexico region,
Horizon, a community near El Paso to establish particularly hurricanes. Columbia Center support
a GIS solution to managing city infrastructure. for local agencies was first initiated just before and
At the Center for Geospatial Technologies at during Hurricane Rita in 2005. As most remember,
Texas Tech, Dr. Kevin Mulligan, Director and in August Hurricane Katrina wreaked havoc on
staff geospatial scientists are collaborating with New Orleans displacing hundreds of thousands
city and county entities to augment parcel data of people. The Columbia Center provided map-
and build capacity to better serve the public and ping support for city and county entities dealing
create critical infrastructure datasets essential to with the influx of thousands of evacuees. The
successful responses to incidents. maps were primarily informational rather than
analytical, and were well received by the local
Enhancing Public Safety entities as a tool to help meet evacuee needs and
questions in a timely fashion. Then, a month later,
Another area that the System partners have been Hurricane Rita ripped through East Texas. This
successful in providing expertise in geospatial time the Columbia Center operated in conjunction
needs is in enhancing public safety. The Columbia with local and regional responders. For example,
Center was invited to build a base-mapping project just hours before Rita made landfall, Columbia
for the East Texas Council of Governments (ET- Center personnel delivered county base maps to
COG) to meet requirements of a hazard mitigation Texas Department of Public Safety responders
plan directive from FEMA. The Columbia Center, in the Beaumont area. Throughout the event, the
in conjunction with regional and local entities, Columbia Center provided mapping support for a
created geodatabases to produce the base maps broad spectrum of responding agencies, ranging
needed to meet project objectives. from law enforcement and emergency managers
In another example, at Texas A&M Interna- to electrical companies (McDonald et al., 2009).
tional University (TAMIU), the Center for Earth In 2008, Hurricane Ike made landfall near
and Environmental Studies, under the direction Galveston, Texas. The storm devastated Bolivar
of Dr. Ken Tobin, coordinated a 14 county hazard Peninsula and swamped Galveston Island before
mitigation plan project funded by FEMA. More moving inland, continuing to damage buildings
recently, the Columbia Center has been tasked and destroy infrastructure as it carved a trail north
to update the 911 maps for the Deep East Texas through East Texas. Shortly after the storm had
Council of Governments (DETCOG). This project passed Galveston County, officials requested
focused on using current high-resolution photog- geospatial assistance to support search and rescue
raphy to improve road network representation operations. The Columbia Center deployed a team
on maps used to assign addresses for emergency to help meet this critical need (Figure 4). Hundreds
response. Moreover, the project included build- of maps later, the Columbia Center was recognized

933
The Columbia Regional Geospatial Service Center System

Figure 4. Columbia Center South – a self-contained, fully capable GIS facility was imported, set up,
and operated in the Galveston County EOC in Dickerson, Texas within hours of Hurricane Ike’s passing

as an important resource that helped in managing Border Security


a successful response to a tragic incident.
As a result of these deployments, the System Recently, the System became involved with border
has moved forward with developing county level security through work with the Texas Military
base maps (3 counties deep) along the southern Forces. In particular, the Texas State Guard troops
perimeter of Texas from El Paso to Beaumont as are deployed to the Joint Operations Information
well as extending northward to the Red River. A Centers (JOIC) across the Texas borderland to aid
final objective of this project is to have base maps the border security mission. The JOICs combine
prepared for all 254 Texas counties. the resources of local law enforcement, Texas
Based on lessons learned during Hurricanes Department of Public Safety (TDPS), the Border
Katrina, Rita, Gustav and Ike, each Center is de- Patrol, and other state and federal agencies in a
veloping two-person, geospatial response teams coordinated effort to increase border security.
capable of responding to incidents anywhere in the The Columbia Center developed a geospatially
state on short notice. These Rapid Response teams based information system called GeoBorderStar
will be able to operate connected or disconnected to help JOICs share and use geospatially enabled
from other resources to meet mapping needs of information.
any response effort across the state. In addition, This work lead to an invitation for the Columbia
the Lamar Geospatial Lab Associate Director, Center System to participate in Project Sidewinder,
Dr. Joe Krueger, is focusing on aligning research a directed action conducted in Brooks County,
projects that coincide with the Height Moderniza- Texas in March of 2009. Columbia Center per-
tion Project based at Texas A&M University at sonnel embedded with the Sidewinder task force
Corpus Christi (http://tsrc.cbi.tamucc.edu/THM/ to provide mapping and geospatial analysis sup-
HomePage). port for the multi-agency task force. The project
demonstrated the importance of mapping in this
type of operation and has led to other initiatives

934
The Columbia Regional Geospatial Service Center System

designed to integrate geospatial technology into and record locations for wells, pumps, tanks and
the border security mission in Texas. other components, along with information about
Recently, the Texas Military Forces drafted a capacity, treatment types, storage capacity, and
requirement directing the Columbia Center to help generator/wiring harness availability. The infor-
integrate TexMap, the TDPS information system, mation will be used by TCEQ during emergency
with TXMF mapping assets. This work will lead conditions to coordinate relief efforts to restore
to better integration of information resources for clean water as quickly as possible after a natural
all stakeholders. or manmade outage.
The Columbia Center System has completed
Critical Infrastructure Mapping another project for TCEQ involving updating
boundary data for the various regulatory districts
The importance of critical infrastructure datasets under TCEQ jurisdictions. This work involved
has been recognized by research on the role of geo- reviewing plats and legal records, rectifying map
spatial sciences in emergency response (National data and constructing a comprehensive geodatabase
Research Council, 2007). The Columbia Regional of all district boundaries. In many cases, local ap-
Geospatial Service Center System has undertaken praisal districts and courthouses were contacted to
several statewide critical infrastructure mapping ensure the most accurate and up to date information
projects. Given the size of Texas, the distributed were used in the process.
Centers hold a substantial advantage over any cen- In addition to updating the boundary data, this
tralized agency for such projects. These projects project helped TCEQ to update their internal opera-
are large scale and require considerable resources. tions from ArcView 3.x/Shapefiles to a state-of-the-
Originating field activities at each of the Centers art ArcGIS/geodatabase operation. The Columbia
mitigates logistics and travel costs. Project coor- Center System continues to provide training and
dination is paramount with this type of endeavor, technical support to TCEQ for the project.
and a single project manager is selected to oversee
each project. Three major state wide projects have Data Warehousing and Replication
been completed or initiated to date.
The Texas Power Grid Mapping Project is Geospatial data replication is one of the funda-
producing a comprehensive geodatabase of the mental concepts on which the Columbia Regional
Texas Power Grid for emergency response. This Geospatial Service Center System is based. The
project involves compiling existing infrastructure System provides replication services for local
and spatial data from electrical power companies, and regional entities as well as for critical state
rectifying them against aerial ortho-rectified imag- resources. At present, the System is replicating
ery and integrating the results. Texas Department of data for the 12 counties of DETCOG, Panola
Emergency Management (TDEM) will use the end County and several cities. In some cases these
product for tracking and mitigating power outages replicas serve as live backups and are instantly
during storm events. available if the primary databases fail. In others,
The Texas Commission on Environmental Qual- the Columbia Center hosts the working database
ity (TCEQ) tasked the Columbia Center System for smaller entities that are not able to maintain
to create a Critical Facilities Infrastructure Map their own geospatial infrastructure. Panola County
(CFIM) for the State of Texas. The project involves is an example of this arrangement. The Columbia
mapping public water supply (PWS) infrastructure Center is working closely with Panola County,
for all systems serving fewer than 10,000 customers moving them towards implementing their own
in Texas. GPS data collection teams visit each PWS GIS. In the meantime, the Columbia Center hosts

935
The Columbia Regional Geospatial Service Center System

Panola County data and makes tools available to of each image’s location as well as usage metrics.
the County for editing and using these data from This system is in beta test at this writing and is
county offices in Carthage, Texas. scheduled to go live later this year.
The Columbia Center System partners with
TNRIS on several data replication initiatives. The
Texas Strategic Mapping Program (Stratmap) is CONCLUSION
the premier, high-resolution geospatial dataset
for Texas. Stratmap data are mirrored at each of The Columbia Regional Geospatial Service Center
the Columbia Center System Centers, providing System model grew from concepts developed
five-way redundancy and enhanced access to these many years ago and refined by practical experi-
important data. Raster data are replicated as well, ence with natural and man-made disasters. The
including the original 1995 Texas Orthoimagery principle components of the model are:
Program (TOP) 1-meter CIR data, and USDA
National Agricultural Imagery Program (NAIP) • Distributed Centers
data for 2004, 2006, 2008 and 2009. These larger • Regional Specialization
datasets are not fully replicated at each Center. • Bottom-up Philosophy
Instead, each Center maintains their region of • Dedicated to Bridging the Divide
Texas while TNRIS maintains data for all of the • Local Data Emphasis
State. Thus, all the data are available from at least • Linked Together as a System
two sources. • Data Warehousing and Replication
The importance of offsite backups of critical • University Based
data has been demonstrated repeatedly. Pertinent
case studies can be drawn from the September These components, combined with a three-
11th attack on the World Trade Center and Hur- phase operational philosophy based on Data,
ricane Katrina in New Orleans (National Research Applications and Training comprise a proven
Council, 2007). In both cases, both the primary approach to distributive geospatial technology.
and backup geospatial data for the area were Over the past five years, the Columbia Center
lost due to backup data being stored too close system has amassed a track record of successful
to the main data store. The Columbia Center projects and deployments; expanding the use of
System model places replicas of crucial data at geospatial technology at the local level, while
the regional centers, far from the main location. spanning Texas with System level activities.
Where warranted, additional replicas are housed Today, the System stands as a proven model for
at other regional centers ensuring uninterrupted delivering the promise of geospatial technology
access to critical data. to everyone.
An innovative project is currently underway
to distribute the L-2 rectified aerial photogra-
phy used to create the 2009 NAIP dataset for REFERENCES
Texas. This is 4 band, 0.7 meter imagery that is
unprocessed other than for L2 geometric correc- Blackwell, P., Garner, J., Henderson, S., & Wil-
tion (georectification). The 9-terabyte dataset is liams, J. (2003, April). Space Shuttle Columbia
split between the Centers, reducing the demand Recovery Efforts: How Internet2 and Geospatial
on any one Center. A web-based discovery tool Technology Made a Difference. Paper presented
provides access to the entire dataset regardless of at the Internet2 Spring Members Meeting, Ar-
its location. A synchronized database keeps track lington, VA.

936
The Columbia Regional Geospatial Service Center System

Croswell, P. (2009). The GIS Management Hand- McDonald, D. L., Kroll, J. C., & Blackwell, P.
book. Frankfort, KY: Kessey Dewitt Publications. R. (2007). Functionality Issues of Geospatial
Contributions to Emergency Response: The Re-
DeMers, M. (2009). Fundamentals of Geographic
gional Geospatial Service Center System Model.
Information Systems. New York: John Wiley &
In Proceedings of the GITA’s Annual Conference,
Sons, Inc.
San Antonio, TX.
Farmer, K., Kruger, J., & Murphy, L. (2010, Feb-
McDonald, D. L., Kroll, J. C., & Blackwell, P. R.
ruary). Hurricane Ike Storm Surge in Chambers
(2008). The Nac-4 Pack: A Proven Set of Essential
and Jefferson Counties. Poster session presented
Maps for a Successful Emergency Response. In
at the Annual East Texas Geospatial Conference,
Proceedings of the GITA Geospatial Infrastructure
Nacogdoches, TX.
Solutions Conference, Seattle, WA.
Garey, G. (n.d.). Texas Height Modernization
Mulligan, K., Barbato, L., Warren, A., Van Nice,
Project. Retrieved February 11, 2010, from http://
C., & Rainwater, L. (2008). A Regional Analysis
tsrc.cbi.tamucc.edu/THM/HomePage
of Groundwater depletion for the Ogallala Aquifer
Gehman, H. (2003). Columbia Accident Investi- in Texas. Paper presented at the Joint Meeting
gation Board Report Volume II (CAiB PA 42-43). of the Geological Society of America, Soil Sci-
Washington, DC: Government Printing Office. ence Society of America, American Society of
Agronomy, Crop Science society of America, Gulf
Kroll, J. C., McDonald, D. L., & Blackwell, P.
Coast Association of geological Societies with
R. (2006). The Columbia Regional Geospatial
the Gulf Coast Section of SEPM, Houston, TX.
Service Center System: A Model for Rural Geo-
spatial Support for the Nation. In Proceedings of National Research Council. (2007). Successful
the 5th Southern Forestry and Natural Resources Response Starts with a Map (pp. 78–85). Wash-
GIS Conference (p. 173). ington, DC: The National Academies Press.
LaNore, S. (n.d.). Tropical Storm Allison: Ex- Sementelli, A., & McDonald, D. L. (2003). A
treme Texas flood. Retrieved February 11, 2010, Geographic Information System-Guided Cost-
from http://www.examiner.com/x-5182-Dallas- Effective Analysis for Waterborne Asbestos Reme-
Weather-Examiner-y2009m6d4-Tropical-Storm- diation. Public Works Management & Policy, 7(3).
Allison-Extreme-Texas-Flood-June-2001
Spraggins, M. (2009). TXSG Public Affairs. “This
McDonald, D. L., Ashton, C., & Blackwell, P. R. Is An Exercise... This Is An Exercise” - Disaster
(2009). An Multi-Scalar Open Source Wide Area Assessment Exercise. Retrieved February 11, 2010,
Damage Assessment (WADA) Mash-up Applica- from http://www.txsg.state.tx.us/news/article.
tion Model. In Proceedings of the 2009 Geospatial aspx?id=20090302
Infrastructure Solutions Conference, Tampa, FL.

This work was previously published in the International Journal of Applied Geospatial Research (IJAGR), Volume 2, Issue 1,
edited by Donald Patrick Albert, pp. 63-74, copyright 2011 by IGI Publishing (an imprint of IGI Global).

937
938

Chapter 58
Mapping the Distribution of
Tsetse Flies in Eastern Uganda:
A Geoinformatics Approach

Teddy Nakato Ayanlade Ayansina


Ministry of Agriculture, Uganda Obafemi Awolowo University, Nigeria

O. O. Jegede V. F. Olaleye
Obafemi Awolowo University, Nigeria Obafemi Awolowo University, Nigeria
Bolarin Olufemi
University of Ilorin Nigeria

ABSTRACT
This study demonstrates the ability of GIS and Remote Sensing in capturing spatial-temporal data on
land use and land cover classes. The nine land cover classes captured were Built-up area, Secondary
forests, Savannah, Grasslands, and Shrublands containing herbaceous, Rain-fed shrub crops, Fresh water
swamps, Water bodies, and Farmlands. The remote sensed imageries also displayed how the land use
and land cover classes changed between 1986 and 2001, while helping to identify the suitability of the
land cover classes for tsetse fly habitation. In this paper, the authors demonstrate that GIS and remote
sensing coupled with statistical analyses could help immensely in mapping tsetse habitats. Results show
that the tsetse fly habitat area in Eastern Uganda has been decreasing with time due to the increase in
the Savannah and grassland land cover types and urbanization.

1. INTRODUCTION for increased understanding of disease dynamics


and factors driving transmission. The continued
Eastern Uganda is one of regions in Africa that have spread and shifting of the sleeping sickness dis-
the highest cases of sleeping sickness recorded. tribution within Uganda and the potential for sub-
The continued spread and shifting of the sleeping species overlap highlight the need for increased
sickness distribution within Uganda and the po- understanding of disease dynamics and factors
tential for sub-species overlap highlight the need driving transmission. Previous studies focused
on infection of the disease using parametric and
DOI: 10.4018/978-1-4666-2038-4.ch058

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Mapping the Distribution of Tsetse Flies in Eastern Uganda

non-parametric technique for their analysis. The White Nile has its source in Lake Victoria and as
prominent among these are the studies by Ford the Victoria Nile, it runs northward through Lake
(1969), Buyst (1977), Wellde (1989), Khonde Kyoga and then westward to Lake Albert, from
(1995), and Leak (1999). Recently, evidence of dis- which it emerges as the Albert Nile to resume
ease spread to districts considered to be previously its northward course to the Sudan. (Advameg,
uninfected (Fèvre, 2001) has highlighted the need 2007), Uganda has a typically tropical climate
for increased understanding and implementation with little variation in temperature throughout the
of disease prevention and control. This study aims, year. Distinctive wet and dry seasons characterize
therefore, at using remote sensing and GIS tech- the climate of most of the country, except in the
niques to examine and map the spatio-temporal semi-arid north east. The country’s natural envi-
landcover dynamics and their relationship with ronment provided good grazing for cattle, sheep,
tsetse fly occurrence in Uganda between 1986 and goats, with indigenous breeds dominating
and 2001. Satellite products were used for the most livestock. Smallholder farmers owned about
land cover analysis with the aim of assessing the 95 percent of all cattle, although several hundred
influence of land cover types on the distribution modern commercial ranches were established dur-
of tsetse flies. To achieve the above aim the study ing the 1960s and early 1970s in areas that had
was divided into three phases: (1) satellite data of been cleared of tsetse-fly infestation.
1986 and 2001 were used to provide information
on land cover types and thereby identification of
the tsetse habitat; (2) the spatial distribution of a 3. MATERIALS AND METHODS
vector-borne disease were related to the habitat
of the tsetse flies as the vectors; and (3) data on Raw data on livestock populations in Uganda was
land cover types, habitat and human population obtained from the statistical abstracts and back-
were analysis to provide information on the spatial ground to the Budget (several years), Ministry of
distribution of sleeping sickness and Nagana in Finance, Planning and Economic Development;
livestock (Curran et al., 2000). Small Ruminant Development Study Report,
Ministry of Agriculture, Animal Industry and
Fisheries in Uganda. The data cover the periods
2. STUDY AREA

Uganda is in latitudes 40.0’ North and 10.30’ South Figure 1. Map of Uganda showing eastern Uganda
of the equator, and longitudes 300.0’ East and 350.0’
East of Greenwich (Figure 1). The greater part
of Uganda consists of plateaus which are about
800 to 2,000 m (2,600–6,600 ft) in height. Along
the western border, is the Rwenzori Mountains,
Margherita Peak reaches a height of 5,109 m
(16,762 ft), while on the eastern frontier Mount
Elgon rises to 4,321 m (14,178 ft). By contrast,
the Western Rift Valley, which runs from north to
south through the western half of the country, is
below 910 m (3,000 ft). For example, the surface
of Lake Edward, Lake George and Lake Albert (L.
Mobutu Sese Seko) is about 621 m (2,036 ft). The

939
Mapping the Distribution of Tsetse Flies in Eastern Uganda

from 1986 to 2001. Satellite products used for this, the first task was to develop a table showing
this study include; Land-sat ETM 1986 and 2001. the area in hectares and the percentage change for
Image preparation involved the image registration. each year measured against each land use land
To ensure proper image overlay in the GIS envi- cover type. Percentage change to determine the
ronment. The land-sat ETM image of 2001 was trend of change can then be calculated by dividing
formally geo-referenced in projection, Universal observed change by sum of changes multiplied
Transverse Mercator (UTM), Zone 36. Arc 1960 by 100.
and Clarke 1880, were used to geo-reference im-
ages. The standard processes for the analyses of (Trend) Percentage Change = Observed change
satellite imagery such as extraction, restoration, X 100
classification, and enhancement were used for the Sum of change
study. The classification of land-use/Land-cover
were aggregated to 5 classes which include: Sec-
ondary Forests (Agro-forestry), farmlands (shrubs/ 4. RESULTS AND DISCUSSION
fallow), built-up (residential and commercial),
Bare rocks and bare soils (exposed rocks) and The land-sat products showed that Eastern Uganda
water bodies (ponds, lakes, rivers, streams). is covered by built up areas, secondary forests,
The comparison of the land use/land cover savannah, grassland, shrub lands with herbaceous
statistics assisted in identifying the percentage vegetation, rain-fed shrub crops, freshwater
change, trend and rate of change in vegetation swamps and water bodies (Table 1 and Table 2).
cover for the period of time (Figure 2). To achieve In 1986, the total area covered by land-sat product

Figure 2. Entity relationship model for linking remote sensing, land cover and disease

940
Mapping the Distribution of Tsetse Flies in Eastern Uganda

(the product used for the study) was 9,242,069.841 ha (which is about 20% of the total area) of the
ha compared to 9,242,069.841 ha in 2001. Irrespec- land cover in 2001. The area covered by water
tive of the year of data collection, shrub-land with bodies was also very extensive both in 1986 and
herbaceous vegetation cover the most extensive 2001 with 1,846,069.50 ha (which is about 19%
(about 31% in 1986 and 28% in 2001) in Eastern of the total area) and 2,048,790.63 ha (22% of the
Uganda. The rain-fed shrub croplands also cover total area) covered respectively (Table 1, Table
lager part of the country from the analysis. In 1986 2, and Table 3).
for instance, the area covered by rain-fed shrub The freshwater swamps which are dominant
cropland was 1,143,118.44 ha (which is about tsetse fly habitats covered 793,783.60 ha (9% of
13% of the total area) compared to 1,758,760.59 the total area) and 841,035.49 ha (9% of the total

Table 1. Areas covered by each land cover type - 1986

Eastern Uganda – 1986


Class Names Area (ha) Proportion (%) Suitability for Tsetse flies
Built up Area 675979.0281 7.314151914 *
Cloud cover 155541.8862 1.682976745 *
Secondary Forests 520476.0624 5.631596292 ***
Savannah 791362.7424 8.562613744 *
Grassland 469918.3329 5.084557258 *
Rain-fed Shrubs 2845820.255 30.79202283 **
Herbaceous Crops 1143118.437 12.36864097 **
Fresh Water Swamps 793783.5975 8.588807607 ***
Water Bodies 1846069.499 19.97463264 *
Total 9242069.841 100
*** - Very Suitable
** - Suitable
* - Not Suitable

Table 2. Areas covered by each land cover type - 2001

Eastern Uganda – 2001


Class Names Area (ha) Proportion (%) Suitability for Tsetse flies
Built up Area 383873.1044 4.153540397 *
Secondary Forests 31483.4237 0.340653384 ***
Savannah 561449.3293 6.074930605 *
Grassland 117129.2658 1.26734885 *
Rain-fed Shrubs 2587986.004 28.00223379 **
Herbaceous Crops 1758760.593 19.02994268 **
Fresh Water Swamps 841035.4902 9.100077198 ***
Water Bodies 2048790.628 22.1680929 *
Farm Lands 911562.0025 9.863180199 **
Total 9242069.841 100

941
Mapping the Distribution of Tsetse Flies in Eastern Uganda

Table 3. Relative change in land cover types


The tsetse suitability for each one of the ag-
Land Cover Types Relative Change gregated land cover classes identified for Eastern
Area (ha) Proportion
Uganda is described in the section. The land
(%) cover suitability for tsetse habitation of Eastern
Built up Area -292105.9237 2.362452841 Uganda is summarized in Table 4. It is worth
Secondary Forests 31483.4237 0.340653384 noting that the degree of suitability was assigned
Savannah 40973.2669 5.132039645 according to the inherent features of the land
Grassland -674233.4766 -5.604624361 cover class only.
Shrublands with Herba- Figure 3 shows the land cover type area distri-
ceous 2118067.671 27.81754874 bution for Eastern Uganda in 1986 and 2001. The
Rain-fed Shrub Crops -1087059.662 17.38416177 areas covered by built-up areas, farmland, savanna,
Fresh Water Swamps -302082.9468 7.717631167 shrub land with herbaceous cover, rain-fed shrub
Water Bodies 1255007.031 21.77402068 land and water bodies were more extensive in
Farm Lands -934507.4965 7.803341979 1986 than in 2001. Conversely, the grassland,
Total 155541.887 84.72722585 secondary forests, and freshwater swamps were
more extensive in 2001 than in 1986.
As shown in Figure 3, there was generally
area) in 1986 and 2001 imageries respectively. slight gradual decrease in the tsetse habitats as
The savannah areas during the period of study of the Shrub lands tend to decrease with an increase
Eastern Uganda covered 791362.74 ha (9% of the in the grasslands land cover type between 1986
total area) and 561449.32 ha (6% of the total area) and 2001. Also there was a decrease in the shrub
in 1986 and 2001 imageries respectively. The built lands with herbaceous cover and rain-fed shrub
up areas which adversely affects tsetse fly habitats crops which are good habitat to tsetse flies. Con-
covered 675,979.03 ha (7% of the total area) and
383,873.10 ha (4% of the total area) of Eastern Table 4. Tsetse suitability for land cover classes
Uganda respectively (Table 1 and Table 2). in eastern Uganda
Analysis (Table 3 and Table 4) showed that
Shrub land with herbaceous cover is moderately Land Cover Types Tsetse fly Species
suitable for the morsitans group and is less suitable fusca palpalis morsitans
for the palpalis group, covers the largest portion Built up Area 0 0 0
of between 28% and 30% of the study area. The Secondary Forests 3 3 2
principal habitat of these forest flies (FAO, 1982) Savannah 0 1 0
is clearly confirmed by the analysis. Similarly, for Grassland 0 0 0
the palpalis group, the single most relevant land Shrub lands with Herbaceous 0 1 2
cover class is the shrub lands with herbaceous Rain-fed Shrub Crops 1 2 1
cover, which accounts for almost 30% percent Fresh Water Swamps 3 3 2
of the distribution. Irrespective of the year of Water Bodies 0 0 0
imagery collection, from the results over 60% of Farm Lands 1 2 1
the area in Eastern Uganda is a potential habitat Tsetse suitability
for tsetse flies. Only 20% of Eastern Uganda
3 High
covered by water bodies is not suitable for tsetse
2 Moderate
fly inhabitation.
1 Low
0 Unsuitable

942
Mapping the Distribution of Tsetse Flies in Eastern Uganda

Figure 3. Land cover type area distribution for eastern Uganda, 1986, 2001

versely, there was an increase in the grassland of morsitans group. With one exception (G. lon-
which are unfavorable for tsetse habitants between gipennis), the species of the fusca group are for-
1986 and 2001. There was an increase in the fresh est flies inhabiting either rain forest or isolated
water swamps and a decrease in the water bodies patches of forest, along with riverine forest in the
because of the increased growth of fresh water savannah zones. Gallery forests are the typical
weeds and plants which are known to be moder- habitat for the flies of the palpalis group. Species
ately favorable for tsetse habitats. of the morsitans group can be found in forest
edges, forest islands and in riverine forests. (Veg-
4.2 Description of the etation areas not used by G. morsitans include
Land Cover Types very high rainfall areas such as rain forests.) (FAO,
1982). The main shrub land with herbaceous
The land cover map of Eastern Uganda in 1986 layer consists of shrubs with crown cover between
(Figure 4) and in 2001 (Figure 5) showed the dif- 15% and 95%. The height is in the ranges from
ferent land cover types the descriptions of which 0.3 to 5 m. In Eastern Uganda, most of the areas
are given below: of this class have an open cover (between 20%
These are continuous closed trees. The main and 60%), a second layer of closed to open her-
secondary forest layer consists of closed trees baceous vegetation and a third layer of emergent
with crown cover of more than 60 percent. The trees and it covers an area of around 35 000 km2
vegetation is spread over the area without intervals (16 percent of the total surface of the country).
or breaks. In Eastern Uganda, most of the areas Tsetse suitability: This habitat differs from the
in this class are covered by broad-leaved evergreen classic savannah only for the presence of the main
trees with a second layer of trees that form a dif- shrub layer. Thus, we can assume that it is mod-
ferent stratum due to a difference in height and a erately suitable for the morsitans group and less
third layer of emergent trees higher than the main so for the palpalis group. This class is deemed
stratum. Tsetse suitability: Forests provide favor- unsuitable for the fusca group (FAO, 1982).
able habitats to several tsetse fly species of the Rain-fed shrub croplands are permanently
fusca and palpalis groups and, to a lesser extent, cropped area with rain-fed shrub crop(s). The

943
Mapping the Distribution of Tsetse Flies in Eastern Uganda

Figure 4. Land cover map of eastern Uganda – Figure 5. Land cover map of eastern Uganda –
1986 2001

crop cover includes orchard(s) and shrub crops


such as plantains, coffee, cotton, bananas, tea, however shown that this land cover is unsuitable
cocoa and pineapple. In terms of harvested area, for tsetse flies (FAO, 1982).
the most widespread crops of this class in Uganda The main savannah layer consists of grassland
are plantains, coffee, cotton and bananas (FAO, (crown cover is between 15 and 100 percent) with
2005). The semi-artificial habitats consisting of the height in the range of 0.03 to 1 m. The savan-
various kinds of orchards or other type of planta- nah vegetation is spread over the areas covered
tions (such as bananas, cacao, and coffee) could without intervals or breaks. The second layer in
be typical habitats for some species of the palpalis the savannah covered areas consists of sparse
group (e.g., G. tachinoides). On the contrary, trees. The third layer is mainly sparse shrubs. In
some other crops in this class like cotton fields Uganda, this class in the 1986 and 2001 land-sat
are unsuitable for all tsetse species (FAO, 1982). imageries of vegetation covers more than 20
The main layer consists of grassland (crown 000 km2 (8.5 percent of the total surface of the
cover is more than 15–100 percent); the height is country). The savannah offers moderately suit-
in the range of 0.03–3 m, the grassland vegetation able habitats for species of the morsitans group
is spread over areas without intervals or breaks. In and for some of the palpalis group, and is less
Uganda this class covers around 6 000 km2 (2.5 suitable for the fusca group. The limited tree and
percent of the total surface of the country) in 1986 shrub cover in the savannah areas however, could
and 2001 land-sat imageries. This research has be sufficient for many tsetse species during the

944
Mapping the Distribution of Tsetse Flies in Eastern Uganda

wet season. Usually however, the area is unable Africover map of Uganda for 1986 and 2001, this
to support tsetse fly populations during the dry class occupies 36 000 km2 (more than 15 percent
season (FAO, 1982). of the total surface of the country) and includes
The main layer consists of herbaceous veg- the vast expanses of Lakes Victoria, Albert and
etation on permanently or temporarily flooded Kyoga (Figure 5, Figure 6, Figure 7, and Figure
land (cover is between 15 and 100 percent; the 9). The land cover here however is not suitable
height is in the range of 0.03–3 m). In Uganda, for tsetse flies (FAO, 2005).
this class covers approximately 12 000 km2 Farmlands are permanently cropped area with
(more than 5 percent of the total surface of the rain-fed tree crops (Figure 6 and Figure 9). The
country). In around half of this area, a second plantations include the fruit trees (e.g., citrus,
layer of sparse shrubs is present. By definition mango, palm, etc.), conifers (e.g., pinus spp.,
Freshwater swamps are associated with humid Cupressus spp.) and hedging and shade plants.
environments, and cannot be considered a major Among the less typical habitats of tsetse flies,
habitat for tsetse flies because of the very limited man-made ones are particularly important from
presence of woody vegetation. The sparse shrubs the point of view of disease transmission, in par-
occasionally present in these areas could however ticular for Human African Trypanosomiasis
provide atypical habitats to some tsetse species, (HAT). Tree plantations are arguably the most
particularly these of the palpalis group. suitable man-created habitat for some tsetse spe-
These are perennial natural water bodies. Salin- cies. Plantations of mango are breeding sites for
ity in these water bodies is less than, <1 000 parts some species of the palpalis group (e.g., G.
per million [ppm]. The perennial natural water tachinoides and G. palpalis); many mango plan-
bodies included flowing or standing water. In the tations are grown along riversides, which provide

Figure 6. Land cover map of eastern Uganda - 1986

945
Mapping the Distribution of Tsetse Flies in Eastern Uganda

Figure 7. Land cover map of eastern Uganda - 2001

Figure 8. Map of eastern Uganda showing Tsetse infested areas – 1986

946
Mapping the Distribution of Tsetse Flies in Eastern Uganda

Figure 9. Map of eastern Uganda showing Tsetse infested areas – 2001

tsetse flies with suitable shelter, particularly so in 4.3 Analysis of Areas of Different
the case of old trees with low branches. Examples Vegetative Vigor - NDVI
of other semi-artificial habitats of this class are
plantations of oil palms and cola nuts and tree From Figure 10 and Figure 11, we can clearly see
hedges. Untrimmed hedges and tree crops can the different colors for the different vegetative
also provide a suitable habitat for G. pallidipes vigor which are between 1 and -1. Areas of high
(morsitans group). vegetative vigor are represented in a pick shade
Analysis of land cover imageries of Eastern while areas of low vegetative vigor are indicated
Uganda (1986) is shown in Figure 6 while the Map by the dark bluish shade. Areas of low vegetative
of Eastern Uganda showing the tsetse infested vigor are the water bodies in this area showing
area is shown in Figure 7. The land cover map that there is no vegetation in these areas and Areas
of Eastern Uganda for 2001 is shown in Figure of high vegetative vigor are area covered with
8 while the Map of Eastern Uganda of the same secondary forests.
area is shown in Figure 9 for 1986. Figure 7 and From the levels of vegetative vigor, the high-
Figure 9 showed that there was a decrease in the est areas in 1986 had NDVI of 0.93 and those of
tsetse infested areas due to the expansion of the lowest vigor had NDVI of -0.58 while those of
savannah and grassland land cover types which are 2001 had NDVI of 0.61 as the highest vegetative
known not to be favorable tsetse habitants. Also, vigor and -0.91 as areas of the lowest vegetative
urbanization that is which increased in the built- vigor. This shows that there is a very high loss in
up areas are known not to favor tsetse habitants. vegetative vigor of the vegetation from 1986 to
2001 showing that this area is rapidly tending to
aridity. This difference is not due to the changes

947
Mapping the Distribution of Tsetse Flies in Eastern Uganda

Figure 10. Map of eastern Uganda showing areas Figure 11. Map of eastern Uganda showing areas
of different vegetative vigor – NDVI 1986 of different vegetative vigor – NDVI 2001

in climatic seasons because all the images were unsuitable habitat because of inconclusive
taken between December and March which is a environmental conditions.
dry season. 2. A second difficulty encountered was re-
lated to the challenge of defining classes of
4.4 Validation of Land Cover Classes suitability valid for the study area. Ideally
for Suitability of Tsetse Habitats validation should be based on a number of
sites capable of encompassing the enormous
1. Rigorous validation of the estimated suit- environmental heterogeneities in Uganda
ability for tsetse of the land cover classes and Africa at large.
defined in Table 4 and 5 and clearly shown 3. Another problem was related to the intra-
in Figure 6 and Figure 8 was hindered by a group differences in habitat requirements. If
range of practical and conceptual difficulties. the definitions of forest, riverine and savan-
Foremost among the conceptual problems nah flies for the three groups (fusca, palpalis
is the fact that land cover vegetation is only and morsitans) are commonly accepted, it is
one component of potential determinant of also true that the ecology and spatial distribu-
the tsetse habitat. Favorable environmental tion of the species within each group vary
conditions must include availability of considerably (e.g., G. longipennis, belonging
hosts on which to feed on and convenient to the fusca group, and G. tachinoides, of
climatic conditions. As a consequence, the palpalis group, are found in more arid
suitable vegetation can still represent an

948
Mapping the Distribution of Tsetse Flies in Eastern Uganda

environments than the other species in the tential for sub-species overlap highlight the need
same groups). for increased understanding of disease dynamics
4. Another complicating factor is the dispersal and factors driving transmission. The land cover
of flies, some of which can easily travel hun- maps produced can be used in several stages of
dreds of meters away from their resting and tsetse and Trypanosoma intervention: mapping
breeding sites for feeding. It was therefore vector habitat, planning baseline entomological
difficulty to link trap catches (i.e., apparent surveys, monitoring the efficacy of tsetse sup-
densities) to the vegetation in the immediate pression, land-use planning of reclaimed areas
surroundings of the trap; trap catches are and monitoring the environmental impacts of
in most cases influenced by the vegetation intervention strategies.
mosaic at landscape level. The results presented here are of a preliminary
nature, but clearly show the capacity of remote
sensing and geographic information systems to
In other words, it is almost impossible to define make significant contributions to country, regional
experimental conditions that comply with the and continental-level analyses of factors affect-
hypothesis of ‘indefinite expanse of one homoge- ing tsetse distribution, human population density
neous land cover’ on which the present evaluation and land-use intensity. It is considered that such
was based. Furthermore, the opportunity to move analyses will play an increasing role in the future
in less shaded and less protected environments in the process of resource allocation to improved
is heavily influenced by thermal and humidity food production in the continent through more
gradients linked to seasonality. The above consid- effective disease control.
erations explained why very broad and qualitative
suitability classes were employed in the project.
ACKNOWLEDGMENT

5. CONCLUSION The Authors thank the African Regional Centre


for Space Science and Technology Education
This study demonstrates the ability of Remote in English (ARCSSTE-E), Obafemi Awolowo
Sensing and GIS in capturing spatial-temporal University (OAU) Campus, Ile-Ife, Nigeria, for
data and attempt was made to capture as ac- the scientific contribution to training activities
curate as possible nine land use and land cover and fellowship grant support.
classes as they change through time and identi-
fying their suitability for tsetse fly habitation.
The study reveals GIS and remote sensing help REFERENCES
immensely in spatial and temporal analyses for
mapping tsetse fly habitats. Eastern Uganda is Buyst, H. (1977). The epidemiology of sleeping
one of such regions that have the highest cases sickness in the historical Luangwa valley. An-
of sleeping sickness recorded from 2000 to 2007. nales de la Société Belge de Medecine Tropicale,
The concerns over re-current foci and continued 57, 349–359.
disease spread highlight the absence of adequate FAO. (1982). In J. N. Pollock (Ed.).Training
assessment of the differences between current manual for Tsetse control personnel. Vol. 2 Ecol-
and historical distributions of sleeping sickness. ogy and behaviour of tsetse. Retrieved from ftp://
The continued spread and shifting of the sleeping ftp.fao.org/docrep/fao/009/p5444e/p5444e00.pdf
sickness distribution within Uganda and the po-

949
Mapping the Distribution of Tsetse Flies in Eastern Uganda

FAO. (2005). Land Cover Classification Sys- Katondo, K. M. (1984). Revision of second edi-
tem – Classification concepts and user manual tion of tsetse distribution maps. Insect Science
Software version 2. Rome: FAO. Retrieved from and Its Application, 5(5), 381–384.
www.glcn-lccs.org/
Khonde, N., Pépin, J., Niyonsenga, T., Milord,
Feldmann, U., Dyck, V. A., Mattioli, R. C., & F., & Wals, P. (1995). Epidemiological evidence
Jannin, J. (2006). Potential impact of tsetse fly for immunity following Trypanosoma brucei
control involving the Sterile Insect Technique gambiense sleeping sickness. Transactions
(Vol. 37, pp. 1-723). of the Royal Society of Tropical Medicine and
Hygiene, 89(6), 607–611. doi:10.1016/0035-
Fèvre, E. M., Coleman, P. G., Odiit, M., Magona,
9203(95)90408-5
J. W., Welburn, S. C., & Woolhouse, M. E. J.
(2001). The origins of a new Trypanosoma brucei Leak, S. G. A. (1999). Tsetse Biology and Ecology:
rhodesiense sleeping sickness outbreak in eastern Their Role in the Epidemiology and Control of
Uganda. Lancet, 358, 625–628. doi:10.1016/ Trypanosomosis. Wallingford, UK: CABI.
S0140-6736(01)05778-6
Ministry of Health. (2004). Uganda sleeping sick-
Ford, J. (1969). The control of the African try- ness report. Uganda: Ministry of Health.
panosomiases with special reference to land use.
Paul, J., Curran, P., Atkinson, M., Foody, G. M., &
Bulletin of the World Health Organization, 40,
Milton, E. J. (2000). Linking remote sensing, land
879–982.
cover and disease. Southampton, UK: University
Ford, J., & Katondo, K. M. (1975). Revision of of Southampton, Department of Geography.
the Glossina distribution map of Africa. In Pro-
Rodriguez, A. (2005). Trypanosomiasis - Ugan-
ceedings of the 14th of the International Scien-
da (Kaberamaido). ProMed. Retrieved from
tific Council for Trypanosomiasis Research and
http://www.pubmedcentral.nih.gov/redirect3.
Control, Dakar, Senegal (No. 109, pp. 23-25).
cgi?andandauth=001Z7ISWIFesRlR-eJDdjJ-
Ford, J., & Katondo, K. M. (1977a). The distri- hjEG-wI-DVZ_vnK8AJtandreftype=extlinkanda
bution of tsetse flies in Africa. Nairobi, Kenya: rtid=1832067andiid=143254andjid=378andFRO
Nairobi Organization of African Unity. M=Article%7CCitationRefandTO=External%7C
Link%7CURIandarticle-id=1832067andjournal-
Ford, J., & Katondo, K. M. (1977b). Maps of
id=378andrendering-type=normalandandwww.
tsetse flies (Glossina) distribution in Africa, 1973
promed.org
according to sub-generic groups on scale of 1:5
000 000. Bulletin of Animal Health and Produc- Rogers, D. J., & Randolph, S. E. (1993). Distribu-
tion in Africa, 25, 187–193. tion of tsetse and ticks in Africa: past, present and
future. Parasitology Today (Personal Ed.), 9(7),
Hutchinson, O. C., Fèvre, E. M., Carrington, M.,
266–271. doi:10.1016/0169-4758(93)90074-P
& Welburn, S. C. (2003). Lessons learned from
the emergence of a new Trypanosoma brucei rho- Rogers, D. J., & Williams, B. G. (1992). Monitor-
desiense sleeping sickness focus in Uganda. The ing trypanosomiasis in space and time. Parastio-
Lancet Infectious Diseases, 3, 42–45..doi:10.1016/ log, 106, S77–S92.
S1473-3099(03)00488-2

950
Mapping the Distribution of Tsetse Flies in Eastern Uganda

Welburn, S. C., & Odiit, M. (2002). Recent de- Wellde, B. T., Chumo, D. A., Waema, D., Reardon,
velopments in human African trypanosomiasis. M. J., & Smith, D. H. (1989). A history of sleeping
Current Opinion in Infectious Diseases, 15(5), sickness in Kenya. Annals of Tropical Medicine
477–484. and Parasitology, 83(1), 1–11.

This work was previously published in the International Journal of ICT Research and Development in Africa (IJICTRDA),
Volume 1, Issue 2, edited by Blessing Maumbe, pp. 19-34, copyright 2010 by IGI Publishing (an imprint of IGI Global).

951
952

Chapter 59
Open Source Based
Deployment of Environmental
Data into Geospatial
Information Infrastructures
José Gil
Institute of New Imaging Technologies, Universitat Jaume I, Spain

Laura Díaz
Institute of New Imaging Technologies, Universitat Jaume I, Spain

Carlos Granell
Institute for Environment and Sustainability, Italy

Joaquín Huerta
Institute of New Imaging Technologies, Universitat Jaume I, Spain

ABSTRACT
Today, scientists use local and closed geospatial solutions to run their models and store their results. This
may limit their ability to share their models, and results with other interested colleagues. This scenario is
changing with the advent of new factors such as the rapid growth and rise of open source projects, or new
paradigms promoted by government organizations to manage environmental data, such as Infrastructure
for Spatial Information in the European Community (INSPIRE) directive, or the massive use of Web
2.0 techniques where users are looking for applications with a high degree of collaboration, interac-
tiveness, and multimedia effects. Many authors address the versatility of Spatial Data Infrastructures
where resources are shared and accessed via standard service according to complex specifications. In
this context, the authors point out the need to merge the traditional building and maintenance of these
infrastructures, driven by official providers, with these more participative methodologies where users
can participate in creating and integrating information. It seems necessary to develop new geospatial
tools which integrate these new trends. This paper proposes a unified solution offering to the scientific
field an open development framework, based on standards and philosophies focused on new technolo-
gies and scientific needs.

DOI: 10.4018/978-1-4666-2038-4.ch059

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Open Source Based Deployment of Environmental Data into Geospatial Information Infrastructures

INTRODUCTION Better decisions and concrete actions at local


and global scale to underpin sustainable develop-
One of the challenges we are facing today is to ment can be taken only based on precise knowledge
better understand the processes that occur over of our environment (Lannotta, 2007). However,
or near-surface of our planet (Goodchild, 2008). geospatial datasets, and other resources are often
The Earth is a multi-dimensional system made of isolated, hidden to potential interesting stakehold-
complex interactions highly interconnected and ers that could benefit from them. Therefore, the
continuously evolving at many spatial and tem- ease of access and retrieval of geospatial and
poral scales (GEO Secretariat, 2007). This means environmental data is a key aspect for information
that to understand such interactions, scientists and infrastructures (Yang, Raskin, Goodchild, & Gahe-
environmental experts need to collect and integrate gan, 2010). There are consensus-based initiatives
different data sets referred to physical and biologi- which aim is to promote interoperability, standards
cal aspects of the Earth (Giuliani et al., 2011). In specifications, and seamless integration of services
addition, these data sets are often georeferenced, and information to improve this accessibility to
that is, they describe physical phenomena tied to a geospatial resources (see next sections). However,
concrete geographical location in terms of points, the participation of users in these infrastructures
lines and sets of polygons, and we refer to them has been delayed due to the complexity of cur-
as geospatial data. rent methodologies to participate and integrate
Scientists and environmental experts need to new resources; the necessity of skills about how
manage vast amounts of these geospatial data to to incorporate real time data with historical, the
turn them into meaningful information. The ap- use of standard services specification to deploy
plicability of complex environmental processes resources and promote interoperable resources,
and models has been the focus of intense research the integration of data from different domains, or
in this domain (Poch, Comas, Rodríguez-Roda, simply looking for these data sources have become
Sánchez-Marrè, & Cortés, 2004; Minsker et al., tedious tasks in current developments (Yang &
2006; Reichert & Schuwirth, 2010). For instance, Raskin, 2009). This makes scientists continue
models for reduction of energy consumption, com- using traditional methods like local processing.
putation of atmospheric emissions, and simulation The aim of this article is to present an open
of forest fire require multidisciplinary teams of source-based framework that enables both:
geospatial technologists and environmental ex- standard access and processing of data, and also
perts. Such models, however, have been tradition- the deployment of new content into geospatial
ally performed using multiple, isolated desktop information infrastructures. The processing and
geospatial tools: a specialist in Digital Elevation access of environmental data sets and the deploy-
Models might perform slope computations while ment and sharing of processing results will be
other colleague might apply remote sensing addressed in this paper. All this, in the simplest
techniques over satellite imaginary to produce way possible, through an immersive and user-
a snow cover product. In this simple example, friendly interface, based entirely on geospatial
various GIS software packages could be involved open source technologies.
in the activities of a single environmental model. The rest of this article is structured as fol-
However, this paradigm of closed solutions, which lows. The following three sections outline the
operates with local data sets, provides a way of basic concepts underlying the topics tackled in
hiding shared results and collaborative research, this article: architectural styles for information
and more often, hindering further analysis tasks. infrastructures, concrete realizations of geospatial
information infrastructures, and the concept of

953
Open Source Based Deployment of Environmental Data into Geospatial Information Infrastructures

open source applied to geospatial technologies. loosely-coupled unit composed of a service inter-
Given the pieces of the puzzle, we present a face and a service implementation. The service
toolbox that tries to connect all pieces to facili- interface describes the functional capabilities of a
tate the sharing, deployment and publication of service. The service implementation codifies what
new geospatial and environmental data sets into a service should execute. This principle provides
existing geospatial information infrastructures. a clean separation of concerns especially between
We discuss the applicability of our framework service interfaces (what services offer to the public
through a use case in the forest fire domain. The community) and internal implementations (how
paper concludes by looking to futures challenges services work). Essentially SOA introduces a new
in automatic deployment of user-generated content philosophy for building distributed applications,
and identifies a number of issues that are in need where services can be discovered, aggregated,
of further research. published, reused, and invoked at the interface
level, independently of the specific technology
used internally to implement each service.
DESIGNING INFORMATION At the time of implementation SOA-based
INFRASTRUCTURES services must make use of concrete languages
and protocols. Here is where web service technol-
Geospatial and environmental research is a field ogy gains importance because it increasingly is
that involves multidisciplinary teams of scientists becoming the choice to implement SOA-based
and technologists working together. The applica- applications. Web services (Papazoglou, 2008)
tion of environmental models may span from are, by definition, loosely coupled independent
local to global scope, what undeniably requires units and are well described (interface description
suitable information infrastructures to share and contains functional properties), thereby promot-
do research on the web (Hey & Trefethen, 2005). ing one of the goals of SOA: enabling interoper-
This complex scenario demands the use of new ability or the ability of services to interact with
architectural styles which oppose centralized, iso- minimal knowledge of the underlying structure
lated solutions, and instead support distributed pro- of other services (Goodchild, Egenhofer, Fegeas,
cessing capabilities and remote communications, & Kottman, 1999). Interoperability is achieved
necessary ingredients to successful collaborative (or optimized) by using standard interfaces. Web
and multidisciplinary research. Service-Oriented service technology includes various standards such
Architectures (SOA) and Resource-Oriented Ar- as Web Service Description Language (WSDL)
chitectures (ROA) are currently the architectural for the description of service interfaces, Universal
styles adopted in the development of collaborative, Description, Discovery and Integration registry
distributed Web systems and applications. (UDDI) for their advertisement and discovery, and
SOA is an architectural style to design ap- Simple Object Application Protocol (SOAP) that
plications based on a collection of best practices, enables communication among services (Curbera,
principles, interfaces, and patterns related to the Duftler, Khalaf, Nagy, Mukhi, & Weerawarana,
central concept of service (Bell, 2008). In SOA, 2002).
services play a key role and become the basic As opposed to SOA, focused on distributed
computing unit to support development and com- capabilities and services, ROA is an architectural
position of larger, more complex services, which style devoted to manage distributed, heteroge-
in turn can be used to create flexible, ad hoc and neous domain resources. In SOA client applica-
dynamic applications. The main design principle tions interacts with a distributed Web applications
behind SOA is that a service is a standards-based, through delegation, that is, by specifying the

954
Open Source Based Deployment of Environmental Data into Geospatial Information Infrastructures

desired capability to a service component instead different ways, such as RESTful web services
of directly acting on the resources themselves (Richardson & Ruby, 2007). Designing REST-
(Mazzetti, Nativi, & Caron, 2009). In ROA, ful web services following the REST constraints
through, client applications interact directly with described above has gained popularity within the
the exposed resources. Earth science, mobile and location based services
The main constraints behind ROA-based ap- community (Mazzetti, Nativi, & Caron, 2009).
plications are the set of architectural principles The selection of one or another architectural
known as Representational State Transfer (REST) style depends often on the context of the target
(Fielding, 2000), namely: application. Both approaches should not be dis-
joint but complementary. In this paper, we take
• Resources should be identified properly the complementary strategy for the proposed
using global identifiers (URI). However, framework implementation that makes use of
REST proposes to use URIs not only for both architectural patterns to communicate with
identification but also for physical access remote geospatial services. Accessing to common
to the actual resource representation. One geospatial services, as we describe in the follow-
important feature is that each resource ing, are built upon a SOA approach, while recent
must be addressable via a HTTP (Hypertext open source implementations already expose
Transfer Protocol) URI. RESTful interfaces to manipulate and deploy
• Uniform interface through the use of remote geospatial resources.
HTTP as the unique application-level pro-
tocol. HTTP has a small, fixes of opera-
tional methods with specific purpose and GEOSPATIAL INFORMATION
meaning. For instance, the GET method is INFRASTRUCTURES
for retrieving representations of target re-
sources, the POST method for creating new Several administrations are recognizing the role of
resources, PUT for updating resources, and geospatial data in e-commerce, sustainable devel-
finally, the DELETE method to eliminate a opment and government activities, and the ability
given resource. to be efficiently coordinated and managed for the
• Resources are manipulated through rep- interest of public in general (Rajabifard, Binns,
resentations since clients and servers ex- Masser, & Williamson, 2006). In consequence, the
change self-descriptive messages each notion of Spatial Data Infrastructures (SDI) refers
another. to the specialization of information infrastructures
• Interaction stateless since servers only re- for the geospatial sciences, allowing discovering,
cord and manage the state of their resourc- access, publishing, sharing, maintaining, and
es they exposes. Client sessions are not integrating geospatial data (Nebert, 2004).
maintained in the server. Many initiatives at different administrative
• Hypermedia as the engine of application levels have been recently flourishing to promote
state, that is the application state is built the creation of SDIs (Masser, 2005). These
following hyperlinks according to the nav- initiatives coordinate actions and policies that
igation paradigm so that clients can dis- encourage awareness of institutional agreements,
cover related resources from a given one. common standards and effective mechanisms for
data access, harmonization, standardization, and
As in the case of SOA-based services, ROA- the development of interoperable geospatial data
based applications can be implemented in many and technologies to support decision making for

955
Open Source Based Deployment of Environmental Data into Geospatial Information Infrastructures

multiple purposes. One of these initiatives to elimi- perform not only retrieval but also deployment
nate jurisdictional, cultural or domain boundaries of geospatial data sets.
is the Infrastructure for Spatial Information in
the European Community (INSPIRE) Directive
of the European Commission (European Parlia- GEOSPATIAL OPEN SOURCE
ment and Council, 2007). INSPIRE Directive TECHNOLOGIES
establishes a legal framework for the creation and
implementation of the European Community SDI, The use of geospatial applications for environ-
forcing organizations to publish their geospatial mental studies has experienced a significant
resources (focusing on the environmental domain) boom in the last years (Minsker et al., 2006;
under standard data services and by promoting the Pezzoli, Marciano, & Robertus, 2006; Morisette
adaptation of the SOA paradigm and geospatial et al., 2009). Its development has led to progress
web service standards. in making decisions in real time (Ramaprivan,
In essence, geospatial web services allow users 2008), it has created new relationships between
to access, manage, and process geospatial data different areas of knowledge (Ball et al., 2008)
in a service-oriented manner (Granell, Gould, & and has developed new processes to understand
Esbri, 2008). The demand for interoperability has the behavior of natural phenomena (Díaz, Granell,
boosted the development of standards and tools Gould, & Olaya, 2008).
to facilitate data transformation and integration, For all these efforts to continue to improve, they
mostly in terms of standard service interfaces need to be associated with technological advances
specified by Open Geospatial Consortium (OGC1). that gradually eliminate barriers that scientists
The Web Map Service (WMS) (OGC, 2006), the found in their research. This requires advanced
Web Feature Service (WFS) (OGC, 2005), the tools (processing, analysis and visualization) and
Web Coverage Service (WCS) (OGC, 2008), sophisticated information infrastructures that
the Sensor Observation Services (SOS) (OGC, provide access to large volumes of data. Given
2007a) and the Web Processing Services (WPS) the wide array of open source technologies for
(OGC, 2007b) are some prominent examples of geospatial data (Steiniger & Bocher, 2009), these
OGC interfaces for geospatial services with a can be a serious solution to address the lack of
clear applicability to environmental applications mechanisms for democratizing the capacity build-
(Granell, Díaz, & Gould, 2010). For instance, a ing of geospatial information infrastructures.
WMS could be used to provide a map of the pro- A key reference in geospatial technologies’
tected areas of a given region, a SOS may serve state of the art is the world conference for Free
detailed meteorological observations from near and Open Source Software for Geospatial (FOS-
weather stations, and a WCS may offer satellite S4G2). One can find the latest innovative tools
imagery of the area of interests and its surround- and developments in the geospatial landscape,
ings. The central building-blocks for data, as such as OpenGeo Suite or GeoNode, which are
well as service discovery, are provided by the examples of open source technology related to
Catalogue Services for the Web (CSW) (OGC, those proposed in this paper. OpenGeo Suite
2007c). The CSW provides one access point to (Pumphrey, 2009) is an open source project that
users that publish and search environmental data. brings together different technologies (PostGIS,
In the proposed framework implementation, GeoServer, GeoWebCache, GeoExt, OpenLay-
we will use a broad list of OGC services (WMS, ers) to develop an architecture based on OGC
WFS, WCS, WPS, and CSW) to let experts to standards. The main objective is that users can
easily build a platform that provides maps and data

956
Open Source Based Deployment of Environmental Data into Geospatial Information Infrastructures

via web applications, mobile and desktop clients. collaborative SDI, accessible, designed to facili-
GeoNode provides a new SDI solution by adding tate data sharing and standardization of specifica-
a new application discovery layer so that scientists tions (European Parliament and Council, 2007),
can easily discover the data services. The result reinforced by standardization organizations, such
is a platform that facilitates the creation, sharing as the Open Geospatial Consortium (OGC) and
and collaborative use of geospatial data (Benthall International Organization for Standardization
& Gill, 2010). (ISO). On the other hand, the advent of recent Web
Merging our work with other domains not 2.0 technologies to create interoperable geospatial
strictly in touch with geospatial domain, we must web portals (Yang, Li, Xiao, Raskin, & Rambacus,
review the take off of many web applications that 2007; Wilkins-Diehr, Gannon, Klimeck, Oster, &
following the Web 2.0 philosophy have created Pamidighantam, 2008) and that allows users to
an environment that brings users to participate publish and access data in an easy way. The rise
very actively and add new content to be accessed of open-source service-oriented geospatial envi-
and shared by many users. This way of content ronments and tools, provide a variety of solutions
sharing brings new forms of study. Not only with and knowledge that can be easily extrapolated or
the incorporation of the social factor but with a incorporated directly into new scientific applica-
new relationship with the user. Applications are tions (Pierce, Fox, Choi, Guo, Gao, & Ma, 2009).
required to have higher degree of interaction and Why then the diverse geospatial technologies,
attractive interfaces that facilitate complex tasks. open source tools, and geospatial information
Solutions such as RIA (Rich Internet Applications) infrastructures have not received the same level
can be a key element in future developments. In interests in environmental applications as in oth-
this paper we describe how we propose to merge ers domains? After all, SDI and open source
these distinct disciplines. Having an underlying geospatial technologies presents many benefits
geospatial information infrastructure based on for environmental scientists and experts, since the
standard services and components we add on top intrinsic multidisciplinary aspect in environmental
a framework composed by mechanisms and Web research can be viewed as a source of expertise
2.0 interfaces to assist users in not only accessing for collaborative applications between different
but also deploying and sharing new geospatial domains (Westlund, 2010).
resources in this SDI. From the user’s perspective of environmental
and geospatial sciences, the increasing complexity
of current methodologies, tools and technologies
FRAMEWORK FOR DEPLOYMENT may lead to underuse such information infrastruc-
OF ENVIRONMENTAL DATA tures for efficient analysis and decision-making
tasks. This means scientists who produce interest-
In the previous section we have seen numerous ing results through models and processes often
technological trends that can potentially become use information for access, not to publish their
the basis of a set of solutions in multiple scenarios. own data sets. This implies in low rates of user
On one hand, common policies and agreements in participation in the process of injecting environ-
terms of the notion of SDI, understood as geospa- mental datasets into information infrastructures
tial data stores collected by government agencies for collaborative research.
and private institutions, where data are published For this reason, we present an open source
via open, standards-based services, can be easily framework based on INSPIRE architecture that has
incorporated into scientific applications (Nerbert, been extended with some modules for not only ac-
2004). The INSPIRE directive, which creates a cessing but also deployment of environmental data

957
Open Source Based Deployment of Environmental Data into Geospatial Information Infrastructures

into geospatial information infrastructures (like for communication; and the Data layer (bottom).
SDI). The main characteristics are the following: Grey boxes represents the components contained
within the Service Framework. In the following we
• It allows the access and retrieval of envi- outline the main functionality of each layer. Next
ronmental data by means of interoperable section will describe in detail each component in
OGC-based services), grey in Figure 1
• It allows the integration and processing of
new information by means of deployment Application Layer
mechanisms,
• It allows the generation and publication of Our contribution on this layer is to add the Service
metadata descriptions in OGC-based cata- Deployer and Publisher that allows user not only
logs for further search. access resources but also deploy new resources in
the SDI and a user interface that following web 2.0
Figure 1 illustrates the conceptual architecture principles offers a entry point to the underlying
of the proposed framework. It basically follows SDI that allows users to participate in the building
the INSPIRE technical architecture, that is, three- and maintenance of the SDI.
layer SOA: the Application layer (top) contains Following the Model-View-Controller (MVC)
client side services, application logic and presen- pattern (Freeman, Sierra, & Bates, 2004) to build
tation modules; the Service layer (middle) with the client side, we find the Application layer
server side services that use a standard interface is divided into three levels according to their

Figure 1. Multi-layered architecture for the proposed framework

958
Open Source Based Deployment of Environmental Data into Geospatial Information Infrastructures

proximity to the user. At the level closest to the OPEN SOURCE BASED
user, we have the visual part of the application, COMPONENTS
multimedia effects and interaction with the user.
At the intermediate level, we find the application As we have seen throughout the paper, there
logic associated with the controller of the actions are many open source projects in the geospatial
produced by the user. At the lower level we have domain and in particular many projects that
client side’s services that allow communication implement open source components according to
with the geospatial services deployed at the Ser- OGC standard specifications. We describe next
vice layer, through OCG standards and RESTful the service framework components regarding
technology. the technology used to implement them. For the
Service Framework implementation we have reuse
Service Layer some of these existing open source components
and also we have self-develop some components
The Service layer is responsible for mediating to be offered through an open source license.
between the Application and the Data layer. Its The rest of this section describes our solu-
development focuses on the paradigm INSPIRE/ tion from two points of view: interfaces and
SDI/SOA to minimize problems of accessibility technologies used. Figure 2 shows the interfaces
and interoperability, so that wherever possible implemented by each component in the Service
requests for data on this layer are performed using Framework. Figure 3 shows the technology that
OGC standards. However, on certain occasions has been used to implement each of these compo-
such as the insertion of new data, we need to use nents. Some of these technologies are existing open
a RESTful approach. In this layer reside the in- source projects and self implemented components
stances of the OGC services in which we deploy to be delivered as open source projects as well.
the new content that users will provide using
the framework interface. Service Framework is Application Layer
preconfigured to work with existing instances of
geospatial web services. The Application layer as we can see in both Figures
2 and 3 is composed of several sub-layers and com-
Data Layer ponents. At the top of the Application Layer, we
find the components that interact directly with the
In this part we find the final elements for data user. Next we describe each of these components
storage, such as databases or simply memory disk regarding the interfaces and the technology used.
regions, where the Service layer can store their
data. It is out of the scope of this paper to describe User Interface, Map
further this layer since it will be something man- Viewer and Searcher
aged by the OGC standard service implementa-
tions that we use to deploy new data content and Interface. The main purpose of the User Interface
that reside in the service layer. is to facilitate the user interaction with the applica-
tion. The easiness and comfort of use and the level
of interaction are some factors that determine the
success or failure. The User Interface integrates the
other two components that interact directly with
the user: the Map Viewer and the Searcher. The
former component offers a common map-based

959
Open Source Based Deployment of Environmental Data into Geospatial Information Infrastructures

Figure 2. Interfaces used to expose the components functionality

interface for data visualization and exploration. application on the user’s local system. This reduces
The latter, the Searcher component, follows an the communication with the server and increases
OpenSearch3 interface. This simple specification the user’s autonomy (Weaver, Gao, Chin, &
proposes a standard search interface, used in most Iverson, 2009).
modern browsers to manage custom searches. In From among the wide range of valid options
addition, a geospatial extension (OpenSearch- like FLEX, SilverLight or JavaFX, we have se-
Geo) has been proposed that adds geographic lected JavaFX because of its fully integration with
filtering capabilities and support for geographic Java which allows us to easily integrate the com-
response formats, obtaining then an optimal ap- ponents reused and developed in our framework.
proach to search for geospatial data. JavaFX is structured according to the classical
Technology. As we shown in Figure 3, the concept of scene graph for the implementation
interface is implemented using RIA technology. of graphical applications, with a primary scene
It proposes a new evolution of the client server node on which iteratively hang the rest of nodes.
paradigm that allows executing a portion of the These nodes represent mainly behavior anima-

Figure 3. Technology used to implement the components

960
Open Source Based Deployment of Environmental Data into Geospatial Information Infrastructures

tion, transition effects, visual content and user user interface and the controller. Examples are the
interaction (Weaver, Gao, Chin, & Iverson, 2009). list of local resources loaded (localFileResources)
In the open source community there exist as a view update and the method loadLocalFile()
several map viewers components available. In as an input to the controller (and therefore execu-
our work OpenLayers has been the chosen one. tion of actions on the model).
OpenLayers is a JavaScript client that offers
interesting features like very good performance, Service Connector
the ability to consume many different geographic
formats, and client side support of standards such Interface. The Service Connector component in
as OGC Web Map Service (WMS) and Web Fea- the application layer contains components that
ture Service (WFS). The communication between implement the client side interfaces of the stan-
the interface and the map viewer component is dard services deployed in the Service layer. For
done through calls to specific methods (using instance it contains the component Web Process-
the JavaFX AppletStageExtension class) that are ing Service Application Programming Interface
interpreted by the JavaScript runtime engine in (WPSAPI) that implements the OGC WPS client
the browser. interface to be able to access and run distributed
Furthermore, OpenLayers has been the con- processing services. Another module developed
tainer extended with a new self developed control in the Service connector is the CSWAPI (Catalog
called OpenSearch-geo control which has been Service for Web API).
integrated in OpenLayers for searching purposes Technology. Currently, Service Connector
(Fonts, Huerta, Díaz, & Granell, 2010). This new modules like WPSAPI and CSWAPI are included
control in the client translates user queries to the in a self–developed Java library. For instance
standard OpenSearch protocol, using geospatial WPSAPI offers a programmatic way to send
parameters and supported formats, and offers and receive getCapabilities request using Java
user the retrieved documents. As in the previous objects. It implements objects to send and receive
case, the JavaFX AppletStageExtension class is Process description requests to get the description
the preferred interface. of processes in order to see the parameters and the
way a process within a WPS has to be invoked.
Controller Finally, it permits sending an execute request to
invoke one of the available processes and it is
Interface. Following the MVC pattern, the logic able to retrieve and parse the response in order
of the application is located in the Controller to extract the process results. The object model is
component. This responds to events produced scalable to support new versions. Currently ver-
by the user, and invokes requests to the model sions 0.4 and 1.0.0 are supported. This component
by delegation to the lower components of the is programmed also under open source license.
Application Layer (Service Connector, Service
Deployer and Service Publisher). Service Deployer
Technology. This component is developed
using the Singleton pattern (Freeman, Sierra, & Until now the described components offered ac-
Bates, 2004) so that any instance requested on it cessing capabilities to resources deployed in the
to access the same content and execution. The infrastructure. The aim of the Service Deployer
interactions between the view (user interface) and component is to assist the user to deploy resources
the controller are performed via a new feature in as services in the infrastructure. These resources
JavaFX that synchronizes variables between the can be either data resources or processes. In

961
Open Source Based Deployment of Environmental Data into Geospatial Information Infrastructures

this paper we focus on the deployment of data Service Publisher


resources.
Interface. The Data Wrapper component al- Service Publisher component is in charge of de-
lows users to upload geospatial data to be deployed ploying data and process descriptions to an open
and delivered as standard OGC data services, catalog, so they can be found in the infrastructure.
such as WMS, WFS and WCS. The Data Wrapper Interface. Like the Data Wrapper, the Service
interface contains two methods: WrapFile and Publisher needs to implement an interface to
WrapData to deploy data available in a File or deploy content in a specific Service type. In this
included as a parameter in the request. case the content is not data but metadata and the
To deploy data as an OGC Service, this com- service type is the Discovery Service Type, which
ponent should connect to OGC Service instances recommended interface according to INSPIRE is
available in the infrastructure. Some of these the OGC CSW specification. In this case CSW
OGC standards (such as WFS) include transac- offers a transactional operation in its interface to
tional operations in their interface specification. deploy new metadata in the service, thus opposite
In this case, the resources can be deployed by to the Data Wrapper the interface implemented
implementing the transactional interface in the by Service Publisher is a standard interface, the
client side. This is the ideal case, because the CSW-T. For these transactional requests, we
component implementation is independent of build and send a HTTP POST request where the
service implementations. In other cases like de- body is a standard metadata element included in
ploying data for visualization purposes as WMS, the standardized XML structure according to the
the specification does not consider transactional OGC service.
operations; therefore this component has to deal On the other hand once deployed the metadata,
with the concrete services implementation and it will be available for searching purposes through
their own protocol and interface. Currently, in our the CSW interface. Since the CS-W implementa-
prototype implementation this is the case of the tion of choice offers other interfaces, the deployed
Data Wrapper component when deploying data metadata will be available not only through the
in WMS implementations using the Geoserver CSW but also through the Open Search interface.
RESTful API4. Technology. The Service Publisher is again a
Technology. The Data Wrapper component self-developed Java library delivered with open
is a self developed Java library in form of API source license. The Service Publisher uses the
exposing the described interface. In order to Service Connector to perform the two main steps
deploy the data, we have first associated this of its workflow: first, uses the Service Connector
component with an instance of OGC Data Service to send GetCapabilities request to the services in
that implements WMS, WFS and WCS interfaces. which we have deployed the resources and get a
This OGC Services implementation is an open minimum set of metadata elements (available in
source project called Geoserver, which offers a the Capabilities document) and second, it uses
RESTful API to deploy content. Geoserver is then the Service Connector CSWAPI to connect to
the responsible for publishing data through OGC the catalog(OGC CSW) and deploy this metadata
interfaces, transforming the resources processed using the transactional interface, which provides
in WMS for viewing and WFS and WCS for methods to insert, delete, and update records in
download purposes. the catalog.

962
Open Source Based Deployment of Environmental Data into Geospatial Information Infrastructures

Service Layer our framework. Regarding WPS specification,


we choose 52North WPS framework because it
As we based our infrastructure on INSPIRE tech- is an open source Java framework that enables
nical guidelines, the service layer must provide easily design, implementation and deployment
services for discovery, viewing and downloading of WPS-based services.
of data available in the infrastructure.
Interface. The INSPIRE technical architecture
shows that the different service types defined USE CASE
by the Directive are placed in the service layer
resulting in the so-called Service Network. Each To evaluate our proposal and illustrate with a
type defines common capabilities offered by a practical example the behavior of the described
group of services. Regarding this functionality, components we present a particular use case
the INSPIRE Service Types we consider are the focused on the evaluation of burned regions on
Discovery (INSPIRE Network Services Drafting protected environmental areas in Valencia (Spain).
Team, 2009b), View (INSPIRE Network Services The analysis of fire impact requires a real time
Drafting Team, 2008) and Download (INSPIRE interaction between different datasets, processes
Network Services Drafting Team, 2009a) service and users. For these reasons it represents a perfect
types. In addition we also consider the Process- case study to look at the overall functionality of
ing type to add processing capabilities to the SDI the framework.
(Díaz, Granell, & Gould, 2008). A technician must assess the impact of the
Since OGC standards are proven to be mature, some summer fires in the protected areas of Va-
they have been chosen to implement the interfaces lencia, she should find and access the protected
of the services available in the Service layer as areas dataset, the burned areas dataset (for a cer-
the INSPIRE directive recommends in its imple- tain year) and perform the intersection process.
mentation rules (see parenthesis content: such Traditionally the result of the assessment can be
as OGC CSW (to implement Discovery Service used to write a report or make a decision; here
type), OGC WMS (View Service Types), OGC we only describe how this technician can deploy
WFS, OGC WCS (Download Service types) and it easily in the underlying SDI. Figure 4 shows a
OGC WPS (pointed out to be useful to implement sequence diagram with the operations and actors
distributed environmental applications (Granell, involved in each step of this scenario.
Díaz, & Gould, 2010). The first step is to find the initial data sets, in
Technology. There are many implementa- our case the protected areas of Valencia and the
tions of OGC specifications in the open source burned areas in 2005. For the first data entry
community. GeoServer is the project of choice (protected areas), we use a local data provider
to deploy data for visualization and download using a standard OGC data service. These data
purposes, that is, through WMS, WFS and WCS can be found using the OpenSearch-geo client
specifications. It offers these interfaces and in its component.
latest version GeoServer RESTful API allows the If the data is not accessible through an OGC
Data Wrapper component to deploy resources and service and it is only available locally like the input
configure the services. data related to the burned areas in 2005, Service
Geonetwork is another open source project Framework provides the functionality to upload
broadly extended thanks to its implementation of local datasets to be deployed as standard OGC
the OGC CSW specification. This is the project data service. Thus, the user can upload it in the
of choice to deploy the discovery service type in

963
Open Source Based Deployment of Environmental Data into Geospatial Information Infrastructures

Figure 4. Sequence diagram

Figure 5. Local data deployment

964
Open Source Based Deployment of Environmental Data into Geospatial Information Infrastructures

application. Figure 5 shows a screenshot of how have deployed the data), the execute request to
this step can be performed in the user interface. run the process.
The second step shown in the sequence diagram The last step consists of deploying the process-
(Figure 4) is the execution of a distributed process ing results to explore the results (Figure 6). Users
to perform the intersection and the generation of can thus view it and analyze it through the inte-
the assessment results (Figure 6). The Service grated map viewer component based in OpenLay-
Framework interface allows the user to connect ers. Again as we described with the upload of the
and execute to existing WPS services, rather than burned areas the most interesting feature of the
using heavy desktop applications. In this step we Service Framework is that the Service Framework
must select a WPS server and a specific process, assists the user to deploy the processing results
as in our case the Intersection (a process that will for visualization and download purposes into
return the protected areas that burned in 2005). standard services, OGC WMS and OGC WFS
All communication with the service is through respectively. The Service Framework uses the
the Service Connector component using the WPS GeoServer RESTful API and creates a new data
API. In the diagram of Figure 4, we can see how store accessible through standard services. After
internally several WPS interface calls are per- the data deployment, the Service Publisher com-
formed. First a describeProcess request to get the ponent creates a minimum set of metadata elements
process description regarding input data and af- to be published in an instance of CS/W so that
terwards (after the user identifies the correct input the dataset can be found by other users through
data, in this case references to the WFS where we common discovery mechanisms in SDI.

Figure 6. Process inputs view and results

965
Open Source Based Deployment of Environmental Data into Geospatial Information Infrastructures

CONCLUSION ACKNOWLEDGMENT

We have presented a framework that follows a This work has been partially supported by the
hybrid SDI building methodology. We provide an European project EuroGEOSS and by the CENIT
INSPIRE-based architecture, whose compo- “España Virtual” project through the Instituto
nents assist users not only in accessing but also Geográfico Nacional (IGN). We would like to
in deploying resources as INSPIRE-compliant thank Jorge Suarez from the Conselleria de Medi
services. In this way we provide an entry point Ambient, Aigua, Urbanisme i Habitatge (Environ-
for users to participate in SDI building and mental, Water, Urban planning and living regional
maintenance. As a result, users can massively government of Valencia), who kindly granted the
deploy resources improving their availability in forest fire data for experimentation purpose.
a standard manner.
To deploy resources automatically as standard
services, the Service Framework uses a concrete REFERENCES
implementation of the OGC WMS, WFS, WCS
specification. The Service Framework uses Ge- Ball, W. P., Brady, D. C., Brooks, M. T., Burns,
oserver’s own protocol to deploy resources. We R., Cuker, B. E., & Di Toro, D. M. (2008). Pro-
still could migrate the solution to other imple- totype system for multidisciplinary shared cyber-
mentations, but some programming should be infrastructure: Chesapeake Bay environmental
done. To avoid this, it would be useful to deploy observatory. Journal of Hydrologic Engineering,
resources using standard transactional operations. 13(10), 960–970. doi:10.1061/(ASCE)1084-
This would require to have the OGC WMS and 0699(2008)13:10(960)
WCS specifications including a transactional Bell, M. (2008). Service-Oriented Modeling
interface like WFS and WPS is considering in its (SOA): Service analysis, design, and architecture.
current and future versions. This would detach Hoboken, NJ: John Wiley & Sons.
the solution from the vendor implementation.
In this sense OGC standards have shown to be Benthall, S., & Gill, S. (2010). SDI Best Prac-
mature enough to provide specifications to create tices with GeoNode. Retrieved June 1, 2010,
interoperable web services to deploy and share from FOSS4G website: http://2010.foss4g.org/
resources and provide the functionality needed tutorial10.php
to create distributed applications on SDI.
Curbera, F., Duftler, M., Khalaf, R., Nagy, W.,
The use of open source components has the
Mukhi, N., & Weerawarana, S. (2002). Unravelling
advantage of giving us full control over the code
the Web Services Web: An Introduction to SOAP,
to modify everything. A disadvantage, however,
WSDL, and UDDI. IEEE Internet Computing,
is that they have caused some inconveniences
6(2), 86–93. doi:10.1109/4236.991449
because most of them use libraries that are not
fully developed in the context of this project. Docu- Díaz, L., Granell, C., & Gould, M. (2008). Case
mentation scarcity is always a big issue to face. study: Geospatial processing services for web
Therefore, there is a need to invest an important based hydrological applications. In Sample, J. T.,
amount of time and effort in the development to Shaw, K., Tu, S., & Abdelguerfi, M. (Eds.), Geo-
have satisfying results. spatial services and applications for the Internet
(pp. 31–47). New York, NY: Springer.

966
Open Source Based Deployment of Environmental Data into Geospatial Information Infrastructures

Díaz, L., Granell, C., Gould, M., & Olaya, V. Granell, C., Díaz, L., & Gould, M. (2010).
(2008). An open service network for geospatial Service-oriented applications for environmental
data processing. In Academic Proceedings of the models: Reusable geospatial services. Environ-
Free and Open Source Software for Geospatial mental Modelling & Software, 25(2), 182–198.
Conference, Cape Town, South Africa (pp. 410- doi:10.1016/j.envsoft.2009.08.005
419).
Granell, C., Gould, M., & Esbrí, M. A. (2008).
European Parliament and Council. (2007). IN- Geospatial web service chaining. In Karimi, H. A.
SPIRE EU Directive. Retrieved January 28, 2009, (Ed.), Handbook of research on geoinformatics
from http://eur-lex.europa.eu/LexUriServ/site/en/ (pp. 189–195). Hershey, PA: Information Science
oj/2007/1_108/1_10820070425en00010014.pdf Reference.
Fielding, R. (2000). Architectural styles and the Hey, T., & Trefethen, A. E. (2005). Cyberinfrae-
design of network-based software architectures structure for e-Science. Science, 308(5723),
(Unpublished doctoral dissertation). University 817–821. doi:10.1126/science.1110410
of California, Irvine, CA.
INSPIRE Network Services Drafting Team.
Fonts, O., Huerta, J., Díaz, L., & Granell, C. (2008). Draft technical guidance to implement IN-
(2010). OpenSearch-geo: The simple standard for SPIRE view services. Retrieved from http://inspire.
geographic web search engines. In Proceedings jrc.ec.europa.eu/reports/ImplementingRules/
of the IV Jornadas de SIG Libre, Girona, Spain. network/Draft_Technical_Guidance_View_Ser-
Retrieved September 1, 2010, from http://www. vices_v1.0.pdf
sigte.udg.edu/jornadassiglibre/uploads/Articles/
INSPIRE Network Services Drafting Team.
a29.pdf
(2009a). Draft technical guidance for INSPIRE
Freeman, E., Sierra, K., & Bates, B. (2004). Head download services. Retrieved from http://inspire.
first design patterns. Sebastopol, CA: O’Reilly jrc.ec.europa.eu/reports/ImplementingRules/
Media. network/Draft_Technical_Guidance_Down-
load_Services_v1.0.pdf
Giuliani, G., Ray, N., Schwarzer, S., De Bono,
A., Peduzzi, P., & Dao, H. (2011). Sharing envi- INSPIRE Network Services Drafting Team.
ronmental data through GEOSS. International (2009b). Technical guidance for INSPIRE dis-
Journal of Applied Geospatial Research, 2(1). covery services. Retrieved from http://inspire.
doi:10.4018/jagr.2011010101 jrc.ec.europa.eu/documents/Network_Services/
Technical%20Guidance%20Discovery%20Ser-
Goodchild, M. F. (2008). The use cases of digital
vices%20v2.0.pdf
earth. International Journal of Digital Earth,
1(1), 31–42. doi:10.1080/17538940701782528 Lannotta, B. (2007). GEOSS: A global view.
Aerospace America, 45(8), 36–40.
Goodchild, M. F., Egenhofer, M., Fegeas, R., &
Kottman, C. (1999). Interoperating geographic Masser, I. (2005). GIS worlds: Creating spatial
information systems. Boston, MA: Kluwer Aca- data infrastructures. Redlands, CA: ESRI Press.
demic.

967
Open Source Based Deployment of Environmental Data into Geospatial Information Infrastructures

Mazzetti, P., Nativi, S., & Caron, J. (2009). Open Geospatial Consortium Inc. (OGC). (2008).
RESTful implementation of geospatial services Web coverage service implementation specifica-
for Earth and Space science applications. Inter- tion 1.1.2. Redlands, CA: Author.
national Journal of Digital Earth, 2(1), 40–61.
Papazoglou, M. P. (2008). Web services: Principles
doi:10.1080/17538940902866153
& technology. Essex, UK: Person Education.
Minsker, B., Myers, J., Marikos, M., Wentling,
Pezzoli, K., Marciano, R., & Robertus, J. (2006).
T., Downey, S., Liu, Y., et al. (2006). NSCA en-
Regionalizing integrated watershed management:
vironmental cyberinfrastructure demonstration
A strategic vision. In Proceedings of the 7th Annual
project: Creating cyber environments for envi-
International Conference on Digital Government
ronmental engineering and hydrological science
Research, San Diego, CA (pp. 444-445). New
communities. In Proceedings of the ACM/IEEE
York, NY: ACM.
Conference on Supercomputing, Tampa, FL. New
York, NY: ACM. Pierce, M. E., Fox, G. C., Choi, J. Y., Guo, Z.,
Gao, X., & Ma, Y. (2009). Using Web 2.0 for
Morisette, J. T., Richardson, A. D., Knapp, A. K.,
scientific applications and scientific communities.
Fisher, J. I., & Abatzoglou, J. (2009). Tracking
Concurrency and Computation, 21(5), 583–603.
the rhythm of the seasons in the face of global
doi:10.1002/cpe.1365
change: Phenological research in the 21st century.
Frontiers in Ecology and the Environment, 7(5), Poch, M., Comas, J., Rodríguez-Roda, I., Sàn-
253–260. doi:10.1890/070217 chez-Marrè, M., & Cortés, U. (2004). Design-
ing and building real environmental decision
Nerbert, D. (2004). Developing spatial data
support systems. Environmental Modelling &
infrastructure: The SDI cook book, version 2.0.
Software, 19(9), 857–873. doi:10.1016/j.env-
Victoria, Australia: GSDI.
soft.2003.03.007
Open Geospatial Consortium Inc. (OGC). (2005).
Pumphrey, M. (2009). OpenGeo Suite. Retrieved
Web feature service implementation specification
June 1, 2010, from FOSS4G website: http://wiki.
1.1.0. Redlands, CA: Author.
osgeo.org/wiki/FOSS4G_2009_Demonstra-
Open Geospatial Consortium Inc. (OGC). (2006). tion_Theatre
Web map service implementation specification
Rajabifard, A., Binns, A., Masser, I., & William-
1.3.0. Redlands, CA: Author.
son, I. (2006). The role of sub-national govern-
Open Geospatial Consortium Inc. (OGC). (2007a). ment and the private sector in future spatial data
Sensor observation service (Version 1.0.0). Red- infrastructures. International Journal of Geo-
lands, CA: Author. graphical Information Science, 20(7), 727–741.
doi:10.1080/13658810500432224
Open Geospatial Consortium Inc. (OGC). (2007b).
Web processing service (Version 1.0.0). Redlands, Ramaprivan, H. K. (2008). NASA’s Earth science
CA: Author. data systems: A ‘bit of history’ and observations.
In Proceedings of the Cyberinfrastructure for
Open Geospatial Consortium Inc. (OGC). (2007c).
Environmental Observations, Analysis, and Fore-
OpenGIS® catalogue service implementation
casting: A Cyberinformatics Forum.
specification (Version 2.0.2). Redlands, CA:
Author.

968
Open Source Based Deployment of Environmental Data into Geospatial Information Infrastructures

Reichert, P., & Schuwirth, N. (2010). A generic Yang, C., & Raskin, R. (2009). Introduction
framework for deriving process stoichiometry in to distributed geographic information process-
environmental models. Environmental Modelling ing research. International Journal of Geo-
& Software, 25(10), 1241–1251. doi:10.1016/j. graphical Information Science, 23(5), 553–560.
envsoft.2010.03.002 doi:10.1080/13658810902733682
Richardson, L., & Ruby, S. (2007). RESTful Web Yang, C., Raskin, R., Goodchild, M., & Gahegan,
Services. Sebastopol, CA: O’Reilly Media. M. (2010). Geospatial Cyberinfrastructure: Past,
present and future. Computers, Environment
Secretariat, G. E. O. (2007). The full picture: GEO.
and Urban Systems, 34, 264–277. doi:10.1016/j.
Geneva, Switzerland: Tudor Rose.
compenvurbsys.2010.04.001
Steiniger, S., & Bocher, E. (2009). An overview
Yang, P., Li, W., Xiao, D., Raskin, R., & Rambacus,
on current free and open source desktop GIS de-
M. (2007). Earth information exchange - Sharing
velopments. International Journal of Geographi-
earth science information through interoperable
cal Information Science, 23(10), 1345–1370.
approach and cyberinfrastructure. In Proceed-
doi:10.1080/13658810802634956
ings of the SPIE Conference on Geoinformatics:
Weaver, J. L., Gao, W., Chin, S., & Iverson, D. Geospatial Information Science, Nanjing.
(2009). Pro JavaFX Platform: Script, Desktop
and Mobile RIA with Java Technology. New York,
NY: APress. ENDNOTES
Westlund, S. (2010). The use of geospatial tech- 1
All specifications for service interfaces
nologies in disaster management. International defined by OGC are publicly available at
Journal of Applied Geospatial Research, 1(3), http://www.opengeospatial.org
17–30. doi:10.4018/jagr.2010070102 2
Workshops, tutorials and presentations for
Wilkins-Diehr, N., Gannon, D., Klimeck, G., Oster, every conference edition are publicly avail-
S., & Pamidighantam, S. (2008). TeraGrid science able at http://foss4g.org
gateways and their impact on science. Computer,
3
The OpenSearch specification and its exten-
41(11), 32–42. doi:10.1109/MC.2008.470 sions can be found at http://www.opensearch.
org
4
http://docs.geoserver.org/stable/en/user/
extensions/rest/index.html

This work was previously published in the International Journal of Applied Geospatial Research (IJAGR), Volume 3, Issue 2,
edited by Donald Patrick Albert, pp. 6-23, copyright 2012 by IGI Publishing (an imprint of IGI Global).

969
970

Chapter 60
Application of Topographical
Capture Techniques for
Modelling Virtual Reality:
From the Static Object to
the Human Figure

Mercedes Farjas Abadía


Universidad Politécnica de Madrid, Spain

Manuel Sillero Quintana


Universidad Politécnica de Madrid, Spain

Pedro Ángel Merino Calvo


Universidad Politécnica de Madrid, Spain

ABSTRACT
Since the dawn of time man has attempted to represent the human figure with techniques ranging from
simple drawings to techniques that manage to reflect the movement of body segments. In parallel,
cartographic techniques have developed very advanced capture and 3D representation systems, but
even though they have been applied in recent years to other sciences, they have not been applied yet to
virtual reality. The appearance of the laser acquisition systems has enabled us to acquire data without
discrimination on points and to get quick 3D models. This situation allows us to work directly on the
concept of surface and to analyze it from the uniqueness of the detail, compared to traditional systems
which capture points for, later, imaging surfaces from them. Under this prism, a research group was
formed by graduates in Physical Activity and Sport and in Cartography, in order to bring together both
sciences and to improve techniques of capture and representation of the human body. The road is not
completely gone, but some results have been obtained and are presented in this work.

DOI: 10.4018/978-1-4666-2038-4.ch060

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Application of Topographical Capture Techniques for Modelling Virtual Reality

INTRODUCTION traditionally the development of the topography


has focused on modelling and representation of
Nowadays, the term “Virtual Reality” is associ- terrain in all its aspects. A few years ago, a group
ated to multiple disciplines, both in research and of people considered that topographic techniques
practical applications, but its ultimate goal in all and virtual reality had progressed in two parallel
of them is very similar: to produce an illusion of but not crossed ways, or, in other words, advanc-
reality, which can be subject to a practical, tech- ing without looking each other.
nical or conceptual application. Since the 50’s, Throughout time, topographic and cartographic
when they were started to nowadays, the capture, engineering technologies have been working on
processing and representation systems have been the acquisition of data using different equipment
varied, evolving in a parallel way to the comput- and topographic, photogrammetric and geodetic
ers, essential for the mathematical calculations. methods in order to represent 2D and 3D objects,
Direct applications of virtual reality range from buildings, statues and archaeological sites, focus-
medicine to engineering, from computer games to ing the research on obtaining and modelling those
advertising. In all those fields, simulations provide elements with sufficient metric precision.
an effective and efficient support to its technical It is said that any investigation begins when
or practical goals. we are faced with a question or uncertainty that
If we consider the processing hardware and makes us question the situation and initiate a
software, we see that has changed dramatically. In search. And we have the following question:
some cases, it is difficult to distinguish between Can our techniques for capturing and modelling
real and fictitious when you see some pieces of to contribute something to the virtual reality and
work of virtual reality. In a deeper analysis, we can biomechanics?
conclude that the problems are in the collection We believe that one of the fundamentals of
of data, since in many cases much of the work is virtual reality is three-dimensional interactive
done through techniques of image designing and computer simulation, so that the user is introduced
processing. in an artificial environment that is perceived as real,
In this report, we propose a new technique to allowing you to interact in a virtual world that has
capture the information that is going to by virtual- the ability to maintain a two-way correspondence
ized, basing it in real-world data collection using with the real world. The more closely approximates
a 3D scanner. This technique provides us with a the information captured to the reality, the better
virtual model of reality with metric data, which the interrelationships between the virtual and the
we can say that is a perfect representation of real- real. If we apply these concepts to biomechanics,
ity. The study develops model tests in laboratory as a multidisciplinary study of the measurements
and real environments, trials with static human of the human body in movement, we can be at
models, leaving the door open to the modelling the gates to get almost perfect representations of
of the human figure as a moving object. the reality with virtual models.
Before going further in the work, we would In short, we have to understand both the func-
like that you turn the light back with us, so that tional and the physical variability in subject’s
when you read, you will understand the steps taken movement, and how these components interact
since the beginning of the topography until the and how they could be improved. The study of
representation of the human figure by topographic the dialectical relationship structure-function, is
as a result of biomechanical studies. a basic tool with a strong genetic dependence for
At present we are working on virtual reality the identification of future athletes.
techniques, but it has not always been our goal, as

971
Application of Topographical Capture Techniques for Modelling Virtual Reality

Under this view, in recent decades, and in team, the viability was assured thanks to Leica
parallel with the evolution of virtual reality, Geosystems and the company 3D ICOM, which
sports science has tried to analyze and explain always provided us with their last generation
the different aspects of the human figure and its equipments.
movement through various sports disciplines, in We show you now a glimpse of these new
order to propose procedures for evaluating and technologies in the field of virtualization and its
improving the actions of the athletes by virtual application to sports; technologies that should
models. Thus we can observe as the conventional have assessed by the reader.
training methods in elite athletes have been com-
plemented by technologic advances for analyzing
and quantifying the parameters involved in the VIRTUAL REALITY DEVICES
performance. At present it is impossible to reach TO CAPTURE INFORMATION
the maximum goals in sports without the help of
high technology. In a model of virtual reality we have to fulfil
From this viewpoint, we considered a quan- some minimum requirements of simulation or
tum leap in the object of study. If we were able representative capacity, user-model interaction
to represent all types of surfaces: and sensory perception of the user. Even as the
Why not introduce real coordinates and the last is the most important, it directly depends on
variable time in these virtual representations? the quality of the simulation we obtain as a result
If we could represent, controlling the accuracy of the input images we use, because depending
in the final model and choosing the proper tech- on their proximity to reality we can convince the
nique for that purpose, a surface at a time “A” and users of the accuracy of the parallel reality that
“B” at a later time (e.g. on a glacier to analyze we offer them.
the displacement of moraines, or on a Cathedral Currently, systems for image capture require
deterioration over time), the new variable could huge processing times and in some cases some-
also be monitored and would be defined by Δt. what tedious, so the incorporation of topographic
Once we entered into concept of metric rep- techniques (3D scanner) will allow us to capture
resentation of “dynamic surfaces”, it was an the real shape quick and eliminating most of the
adventure to go beyond the measured object. It post-processing tasks. This situation coupled
was an experience to observe the surroundings and with the precision of the shots and the availabil-
people close to zero velocity, not considering the ity of space-time metrics, will enable the virtual
scenarios as empty, in an isolated way, but with reality model, once completed, to interact with
his players and considering the human movement the user in an image very close to reality, where
slowed until a “moment” becomes a “taking” in three-dimensional movements can be controlled
the data acquisition process. by the metric component that is provided, a way
The results that we obtained, along with the of solving some of the problems existing today.
emergence of new equipment and technologies This relationship between the use of coordi-
in topography, let us accept the hypothesis of the nates of space and time, allows us to overcome
project, and our ability to look forward again and the barriers of space and time, and it could allow
again, investing hours and hours (the time vari- us to set up an environment where information
able again) in the design and implementation of provided by the model and the perception that the
the first tests. user receives are interlaced in a direct and two-way;
After having the idea, accepting the challenge that is an unknown situation with a huge potential
and setting the scientific context and the human in the development of virtual reality interfaces.

972
Application of Topographical Capture Techniques for Modelling Virtual Reality

The application of these results allows us to graphic system; this geometry is interlaced form-
move between the topology of three-dimensional ing meshes, which in turn represent objects, and
virtual reality or immersive and the non immer- the objects virtual scenarios. If we increase the
sive Internet-based, being the last one the more geometric quality of the original polygons (best
accepted by users, and where the method can shots and more accuracy), we can offer more
produce more tangible benefits. We can sustain quality in the virtual model. But at present this
this assertion, since in applications where the user quality improvement requires us to have more
is a component of the virtual world, the nearest is processing capacity and consequently increases
the image recognition, the better the acceptance of the cost. The incorporation of capture systems
the system, but if we apply the system to maintain based on topographic techniques, allows us to
the heritage of a nation or, in archaeology, as a not only improving the quality of the images,
method to aid in the exploration and subsequent but also improving and reducing the subsequent
modelling, the benefits increase exponentially, al- processing times to the minimum, as we shall see
lowing us not only to identify the precise site and later on specific developments.
its environment, but also knowing their temporal At present, the scanner system operating per-
variations, or enable people located in different fectly in static 3D objects, but there is a problem
places to make a virtual visit, simultaneously, as that affects the time delay of the system, so that
if actually they were in that place, even being in the shots with dynamic objects appeared to move;
the other side of the earth. so that, the capacity to respond to this issue is
If we consider the user interface based in today insurmountable, but moving at the pace
video-mapping, where the person or environment that this technology does, this problem will be an
is filmed in advance and incorporated into the anecdote in a few years. We face a similar situa-
computer by graphics simulations, the result is, tion to the beginnings of photography when they
without any doubt, closer to reality but still not were required seconds for taking a good pictures,
perfect. The use of 3D scanning systems will al- and the modern digital cameras are able to take
low us that the person, image, or environment to hundred of high definition pictures per second. It
introduce into the model is a clone of the reality, a is only a matter of time, to improve the technique
young teacher who comes to surprise the viewer, to obtain the results described in this paper, so that,
allowing the illusion is created with realism such we can consider it as a beginning or a window
that people can doubt where is the line between open for further studies to a fully development of
the real and the virtual world. the system. (See Figure 1)
Today, we are coming into a world where
reality is pure coincidence, and the virtual envi-
ronment with be as normal as the real one, like TOPOGRAPHIC AND
a cyberspace where space-time relations are not CARTOGRAPHIC TECHNIQUES AS
just virtual, but will be part of our lives and we AN ALTERNATIVE
will assume it. In this situation, the better defined
are the models, the better will be the acceptance In parallel with advances in virtual reality, the map-
by the users. For example, the success of the ping techniques have evolved to very fast allowing
clothing shops will depend on the availability of us access to information almost in real time, but
an implemented system for virtually testing how its use has been limited to the representation of
fit us some clothing. the earth’s surface and in some cases, very recent
In these terms, the virtual world is made up ones, to the cataloguing of the artistic heritage.
of geometric shapes produced by a computerized

973
Application of Topographical Capture Techniques for Modelling Virtual Reality

Figure 1. Human face capture by scanner (2008) Equipment Konica Minolta VI-9i

The increasing use of the advances that oc- In the course of science, topographic and
curred in media and technology in recent years cartographic techniques have traditionally been
has led to the introduction of graphics systems linked to the study and representation of the ter-
as indispensable tools in teaching and learning rain, in any of its states or situations, resulting in
processes. The use of the computer as an audio- digital images, in two or three dimensions, which
visual tool at work, and its computational power, were implemented for having maps as a result.
allows the design of models and processes with But these monothematic techniques have been
a high potential for development, expression and gradually changing its design to become more
operability. generalist and multidisciplinary and, presently,
The graphical representation has always been they have became an essential support for different
an essential complement for any type of biome- studies or scientific projects. They are remarkable
chanical analysis, as the kinematic and kinetic the advances that have occurred in the study and
variables obtained from it allow statistical studies cataloguing of architectural works or in supporting
and simulations (digital models) of the movement the analysis of deformation of materials.
of the human body. Let’s see two different examples:
At present the results depends on method of
capture, where the variables space and time play 1. Gathering data through a continuous digital
a decisive role in obtaining good results, which picture sensor system. (See Figure 2)
are operational in a short space of time. 2. Data collection by laser system without
In a review of the state of the art, as regards the prism.
representation of human figures, static or dynamic,
we can see that the modern systems apply tech- Currently, the technological development al-
niques based on photographic or opto-electronic lows the acquisition of information with laser
systems. Both systems record the spatial position systems without prism, which allow us the data
of a number of spatial markers associated with acquisition without discrimination on points
the body surface, whose movement supposedly (without markers), and therefore they need not
reflects the movement of body segments. Thus, it be placed in advance any element facilitate the
is difficult take decisions regarding the relationship calculation process later. As a direct consequence,
of markers with the anatomical characteristics of we have that the collection of data can be execut-
the structure that lies below.

974
Application of Topographical Capture Techniques for Modelling Virtual Reality

Figure 2. Capture of the Cibeles fountain by Laser equipment UPM-Leica (2003) CYRAX 2500 Leica-
Geosystem

ed faster and most importantly minimizing the We tried to define a methodology to be able to
subsequent process of calculation. capture and generate virtual representation of the
Under these systems, the representation is static human figure, and establish some parameters
done on all of the selected area, obtaining a two- in order to apply directly the 3D laser scanner on
dimensional and/or three-dimensional continuous the sport disciplines in which biomechanics have
image of the represented figure, in which the a relevant role.
density of points will be accord to the desired To do so, we have settled the following targets:
precision. Allowing us to work on the “concept”
surface directly, analyzing it from the uniqueness 1. Analysis of the current methodologies and
of detail, compared to traditional systems which their capture parameters applied to static
record points to imagine surfaces from them. objects and their virtual representations.
In 2004 a group of people, including the authors 2. Analysis of the mechanisms and parameters
of this work, they posed the following question: that can influence the acquisition of the hu-
man figure, in order to apply them to sports.
Do topographers’capture systems and their analy- 3. Development of methodology of capture
sis methodology can be used to generate virtual based on a laser system without prism.
reality models, and in particular, the representa- 4. Representation of the three-dimensional
tion of the human body and its biomechanical human figure on a digital format, in real
analysis? time and using real coordinates, allowing
its metric analysis.
To find answers, we began research in this
field. We started to work on the doctoral thesis of Based on the above targets, they have been
Pedro Calvo Merino untitled “Three-dimensional defined the following specific objectives:
modelling of the human figure through mapping
techniques and their application the moving ob- 1. Image capture and formation of a 3D model.
ject”, which is currently in edition process, and 2. Analysis of its application to the moving
some final works to obtain the Engineering degree object.
in Topography. We have used some of their results
as part of the contents of this work. Since obtaining the first results, we note that
the use of this technology allowed us to work with

975
Application of Topographical Capture Techniques for Modelling Virtual Reality

both the object and the human figure in real time, of it, as a points cloud with geometry and metric.
compared to other systems that require a large This allows us to model not only a rapid model-
computer processing time. At the same time, it ling process, but also, thanks to the availability
was confirmed that its main advantage would be of metrics, modifying the position of the model
the high degree of similarity to the real model, without changing its real geometry.
with a minimum post-processing time, and sub- 3D scanners are classified according to the
sequently allowing modelling the human figure measurement system: contact and non-contact
very close to real time. with the object. Focusing on the non-contact
Compared to traditional capture systems type, are classified into two categories: active
through photogrammetric or optoelectronic sys- (emit some kind of radiation) and passive (do
tems, the methodology of the 3D laser systems not emit any kind of radiation). In this paper we
is reduced in the number of stages. will focus on the non-contact scanners with active
We believe we are on the way to obtain inertial pulsed laser beam, which are the best suited to
parameters suitable for biomechanical modelling the capture of data for virtual reality applications.
of the human body, involving on it the use of At the moment, they are widely used in many
anatomical points and segments under an intercon- applications: topographic representations of the
nected on a local or real reference system, where land, industry, maintenance of cultural heritage,
there is an interrelationship of those anatomical or film and video games edition.
points on the surface of the human body with If we focus on the models currently on the
the space. market, there are different systems but they all have
This paper will try to define and open a door to common characteristics in their basic technology
a new system for capturing and representing the (pulsed laser beam, no markers are required to
human figure, as a primary step for the study of objects to capture the environmental conditions
human motion in real time. It also try to describe of work, etc.), which we will try to show in the
our findings on trying to develop, validate and following paragraphs.
apply a work methodology with direct applica- In the pulsed laser beam equipments, it is mea-
tions in sports where the biomechanics and virtual sured flight time (going and back) for each point
representation has an important role. on the object’s surface. These devices have a timer
with a resolution of 1 picoseconds, allowing us to
produce beam of diameter smaller than 6 mm at
DESCRIPTION OF a distance of 50 m from the object. Taking a step
CAPTURE SYSTEM forward, they have been linked to the technique
of laser beam of modulated amplitude, in which
New technologies may allow describing, analyz- the system scans continuously an object through
ing, evaluating and modelling statically and cin- a variable beam, by using rotating mirrors. With
ematically the environment and the human figure one of these systems the team was able to capture
and its movement (cinematic analysis), and then the energy reflected from the object, by comparing
resolve and optimize the biomechanical analysis, the shape of the reflected wave with modulation
through representations of the model with details released and, later, calculating the distance of the
and temporal patterns which were unthinkable object and consequently their spatial coordinates.
some years ago. As a result, all points were captured from the be-
The capture system using 3D scanner consists ginning on three-dimensional coordinates into a
of devices that analyze a real object, inanimate local reference system, allowing its analysis and
or animated, to produce a near-real representation conversion to various formats. This is one of the

976
Application of Topographical Capture Techniques for Modelling Virtual Reality

added values of the 3D laser scanners compared spatial coordinates. As a result, all points are
to traditional capture systems (Barber, Mills, & captured from the beginning three-dimensional
Bryan, 2004). coordinates on a local reference system, allowing
When we make a comparison of traditional its analysis, or export to various formats, this is
methods based on photogrammetry and optoelec- one of the added value of the 3D laser scanners
tronics compared to laser scanning systems, we compared to traditional systems of capture.
obtain the following advantages of the latter over There are different capture systems based
the former (Demir et al., 2004): on laser systems in the market, the most impor-
tant are the ScanStation 2 and the HDS6100 of
• Recording of direct points of the object in Leica-Geosystem (Leica Geosystems, 2005), the
3D. Capture in real time. Trimble GX 3D Scanner, and the VI-9i or VI910
• Provides a lot of points from the surface, of Konica Minolta.
allowing a more realistic and complete All of them are based on a system of continuous
analysis of the object. laser beam, allowing high speed scanning from
• It is an excellent technique for the de- 100,000 points / sec to 500,000 points / second,
scription and representation of irregular while providing a complete horizontal visual field
surfaces. (±360º) and ±135º in vertical. They provide high
• The final results are available in a very accuracy (± 0.05 mm in the VI-92), even at great
short period of time. distances, thanks to the great intensity and the
beam generated by each pulse of short wavelength,
When “c” is the speed of the pulsed light which combined with the optical compensation
and “t” is flight time (going and back), we have schemes and the dynamic expansion of the laser
simply that space travel is (t • c) / 2. As speed of scanning, allows very accurate captures of surfaces
the laser beam is a constant, the calculation of the with different levels of reflection.
distance will depend only on the flight time, which Capture methods are often based on laser op-
is currently around 3 picoseconds. Considering tical triangulation, providing a points cloud that
samples of 1mm in the surface, the estimated time automatically generate a polygonal mesh with
for scanning an object sized 50 x 180 cm (body) all the information connected, thereby geometric
located 50 meters away, with a rate of 200,000 ambiguities are eliminated and the final result is
points per second, is approximately 5 seconds. improved, allowing to apply 24bit textures – real
The distance of a point and its direction (polar color and consequently obtaining the continuity
coordinates) is calculated by changing the direc- of the elements represented virtually.
tion of the pulse using a rotating mirror method. The most relevant characteristics of the equip-
The rotating mirror device is commonly used be- ment are:
cause they are much lighter than prism and it can
be rotated much faster and with superior accuracy. • Laser scan for high precision and speed cap-
To capture technique uses the continuous beam ture using the pulse laser system, with dou-
of laser amplitude modulation, where the scanner ble mirror optics and servo-mechanisms.
scans an object through the beam of a continuous • Luminosity: Operating with bright light
variable, using rotating mirrors. With this system and complete darkness.
the team is able to capture the energy reflected • Allows a scan of 20.000 m3 to 50 meters
from the object, comparing the shape of the re- and 160.000 m3 to 100 meters.
flected wave with modulation issued and calculate
the distance of the object and consequently their

977
Application of Topographical Capture Techniques for Modelling Virtual Reality

• The range varies from one computer to an- of “flying time” to take the measurement
other, for example the ScanStation 2 has a of each pulse of the laser. The system does
range of 300 m with 90% reflectivity. not require the use of reflectors (markers).
• Capture precision < 4 mm to 50 meters • For each laser pulse, they are calculated
and resolution < 1 mm at distances be- distances and tiny integrated optical de-
tween 5 and 50 meters and angles of ± 60 coders perform angular measurements, ob-
micro-radians. taining primary polar coordinates, which
• Density vertical scanning 1,000 points/col- are transformed into Cartesian coordinates
umn and horizontal scanning 1000 points/ (X, Y, Z).
raw. • The resulting positions are displayed
• Measurement precision P2P between ±0.05 graphically on the computer screen in real
mm and ±4 mm. time (as soon as they are calculated).
• Capture time less than 5 minutes, de-
pending on the volume: if we consider 1
column/sec for 1000 points/column, 2 CAPTURE AND PROCESSING
columns/sec for 200 points/column but, SOFTWARE
considering little volumes, values are 0.3
seconds for 80,000 points and 2.5 seconds As an example, we briefly describe the Cyclone
for 300,000 points. software of Leica Geosystems. Most of them have
• Visual field complete (±360º) in horizontal a similar nature.
and ±135º in vertical, both under solar light Cyclone software is a modular PC software to
and complete darkness. operate the scanner Cyrax. It processes 3D point
• Direct interface and on real-time connec- clouds in an efficient and manages projects from
tion with the computer. 3D laser scanner. Makes a thorough record of each
• Colour video camera integrated with a res- detail, allowing a quickly and safely integration
olution of 680 x 480 pixel. of different scans, taking only certain points in
common.
How Do They Work? The system has a wide range of tools for work-
ing with images and models in three dimensions.
• Focus the scanner onto the area of interest. Its most important features are:
• The video camera integrated in the team,
made a first quick capture, which is sent to • Total control of the scanner and process
a computer connected to it. from the PC through the communication
• Implementation of a rapid scan capture in of the scanner and the PC by a standard
order to get a 3D preview to provide us in- Ethernet connection.
formation about the depth of the capture • Fast capture of the digital video image in
(blind or dark areas). 4-bit true colour with 630 x 480 pixels
• The operator selects the desired area, ac- resolution.
curacy, resolution and other required • Adjust parameters in detail: accuracy, den-
parameters. sity of points, etc.
• Starts the capture of the object. To do this • Fast conversion of the point cloud in a tri-
within the system, two mirrors perform a angular mesh surface.
quick and systematic scan by laser pulses • Generation of digital models using mesh.
on the selected area, using the technique

978
Application of Topographical Capture Techniques for Modelling Virtual Reality

• Generation of surfaces and maps of recently, for cataloguing of heritage. On them,


intensity. three-dimensional models were obtained which
• Calculation of volumes on the mesh. allowed carrying out studies, analyzing propos-
• 3D modelling from point clouds, by intel- als and making decisions on geographic areas or
ligent algorithms. archaeological sites. In the following figures we
• Record multiple scans. With overlapping present some of the results. (See Figure 3 and
functions, geometric adjustment, adjust- Figure 4)
ment of spatial coordinates, and so on. The progress occurred in data acquisition
• Instantaneous calculation of distances be- systems with 3D scanning techniques during the
tween two points and from the scanner at last years it has been both in the collection of data
any point of the object. in continuous mode and in bypassing the selection
• Allows displacement, flight, zoom and of points in the field and avoiding the subjectiv-
free-rotation of the 3D models.
• Texturing of the point cloud.
Figure 3. Survey of virtual representation at 1 /
• Viewing selected items with parameterized
500 of the archaeological area of Mleiha in the
intensity.
Emirate of Sharjah with GPS receivers
• Filters to enter data and information based
on distance and intensity ranges.
• Geo-referencing and processing of the
points in space.
• Record multiple scanning processes.
• Measures on the model (ΛX, ΛY, ΛZ), us-
ing P2P techniques.
• Export formats: VRML, DXF, DGN, text
(X, Y, Z, color), PTS, Warefront OBJ, etc.
It is required a PC with the following char-
acteristics:
• Windows 2000 (SP2 or higher), Windows
XP (Professional or Home Edition, SP2).
• Monitor: minimum resolution of 800 x 600 Figure 4. Digital model of the Torreón Guzmanes
pixel. (Avila), produced by a laser three-dimensional
• Pentium 4 2 GHz or higher. 3D Trimble GS200
• 512 MB RAM (1 GB recommended).
• SVGA - OpenGL Graphic Card.
• 10/100 Ethernet Network Card.
• Disk: minimum 10 Gb.

THE CAPTURE OF THE


STATIC OBJECT RESULTS
OBTAINED IN FIELD WORKS

As mentioned above, the first applications of


laser systems were only topographical and,

979
Application of Topographical Capture Techniques for Modelling Virtual Reality

ity remaining in the results. Data capture is auto- Our experience with the station Leica TC 305
matically done in the field and, later, in the cabi- (direct measure without prism) had allowed us to
net, it is made the selection of information. work with beams of discrete points, interconnected
This technique is being used, quite successfully, by minimum squared adjustments. The equipment
in some models as “Tyrannosaurus Rex” by the Geosystem Leica Cyrax 2500 made it possible
Carregie Museum of Natural History (Pittsburgh, to continue this line of study using continuous
PA) or the “Statue of Liberty” in New York by sequence of points, applying 3D laser technology
The College of Architecture at Texas Tech Uni- to archaeological pieces and other heritage items.
versity in cooperation with the Historic American (See Figure 5 and Figure 6)
Buildings Survery and the National Park Service, The methodology consisted of placing the
as representative examples. scanner on a tripod and connected to a laptop.
On the other hand, the use of these systems The scanner was oriented to the selected area to
has proliferated in the so-called “Reverse engi- scan and it began the acquisition of data. The
neering” consisting of duplication of pieces or equipment needed only fifteen minutes to complete
components without the aid of drawings and/or the acquisition providing in real-time the point
documentation, by creating 3D virtual models cloud and triangle mesh.
from an existing object for obtaining a copy as The precision was ±4 mm in distances and
accurate as possible. ±60 microradians in angles. The ability to data
In terms of the metrics, the modelling process capture reached 1,000 points per second, achieving
is concluded with the generation of the mesh of a precision in the beam of 1.2 mm. The cloud of
triangles that represent the surface of the object points was managed by the specific software that
by a finite set of surface elements (triangles). But has modules for calculation and modelling in two
from the standpoint of representation, the results and three dimensions. As the fourth coordinate,
can be improved by applying textures to the surface it was recorded the light intensity.
that resemble those of the constituent material of In the project “La Lonja de Valencia (Spain)”
the object, simulating different lighting effects (See Figure 5 and Figure 6), they were carried out
and getting shaded representations of the object additional data captures with the laser equipment
from different viewpoints, including stereoscopic in remote areas, in order to represent the vaults
images and animations. of the main courtyard. It is surprising how easily
is the process of providing textures to the model

Figure 5. (A) Modelling La Lonja de Valencia (B) The Central vault of La Lonja de Valencia

980
Application of Topographical Capture Techniques for Modelling Virtual Reality

Figure 6. Three-dimensional model of the facade of the church of Santa Teresa (Ávila) by the 3D laser
scanner Trimble GS200

using the software provided with the laser scaning with the greatest realism possible, to apply their
equipment. This task can be performed in situ, findings to the various sports or in addition to
having the final image just after finishing the data advances in medicine (Carter, 1982).
capture of the point cloud. The word Kinanthropometry derives from the
Within this general framework for data acquisi- Greek “Kinein” which means movement, move-
tion, we search for new models of cartographic rep- ment, “Anthropos” which means human, and
resentation and experiment with new technologies, finally “Metrein”, which means measurements.
always maintaining the metric relations between Carter (1982), defined it as “the study of size,
the data, and creating models or prototypes. In this proportion, maturation, body composition and
way, through the digital models, we extended to form, to describe the physical characteristics,
three dimensions the baseline scenario, without assess and represent the growth, nutrition and
losing the possibility of obtaining traditional the effects of training”. We can consider it as
cartographic models. an interface between the quantitative morphol-
ogy and physiology, or structure and function to
help interpret the dynamics of growth, exercise,
HUMAN REPRESENTATION nutrition and the influence of the physical human
ACHIEVEMENTS IN THE movement.
FIELD WORKS In the Esparza’s work (1993), we find refer-
ences explaining who the body shape and perfor-
Since ancient times, men have tried to show their mance are intimately related and the typology of
immediate reality and the environment around the athlete plays a decisive role in sports perfor-
them, either as art or as for technical proposes, mance. This means that the ability to perform work
focusing on the representation of the human figure or exercise will be closely related to the amount
in motion and trying to exaggerate its traits. Until and proportion of different tissues and body seg-
today, the representation of the human figure and ments that make up the human body and to the
its dynamics has been a hotspot for artists, think- economy in the management of the movements.
ers and scientists, using variety of materials and This science was born as an interdisciplinary
technical supports for that. scientific field (anthropologists, doctors, nutri-
As a reality or necessity, it become a science tionists, physiologists, biomechanical, physical
study: kinanthropometry, which seeks to measure education teachers, among others), to unify
and represent the human figure, moving or static, criteria and methods in the assessment study of

981
Application of Topographical Capture Techniques for Modelling Virtual Reality

man in motion. It aims to analyze the correlation basic tool with a strong genetic dependence for
of morphological and physiological responses, the identification of future athletes.
considered as a bridge between the relationship We can consider that one of the basics of biome-
structure-function. chanics is the multidisciplinary study of the mea-
Advances in techniques of virtual represen- surements of the human body, its movement and
tation has evolved dramatically, but always as- derivations thereof. Under this multi-character,
sociated with photogrammetric, optoelectronic great advances have been developed through the
or video systems, to get sensations very close to interest and perseverance of many researchers and
reality, but having insufficient density of points their contributions, to incorporate new methods
and especially metrics, they have been unrealistic and refine the measurement instruments.
simulations until now. In recent decades, and in parallel with the evolu-
In parallel with these developments, topo- tion of other disciplines that work with graphical
graphic and cartographic techniques have been representation or use graphic systems as a support,
progressing in their capture systems, making the they have tried to analyze and explain several as-
presentation of information almost in real time, but pects of the human figure and its movement in the
its implementation has been traditionally limited environment that surround it, being a clear example
to the representation of the earth’s surface and, in of this many sporting disciplines, in which it is
some cases, very recently, to the documentation necessary to establish procedures for evaluating
of the artistic heritage, but always away from the and improving the athletes performance. Thus, we
systems for virtual representation in applications see how conventional training in elite athletes has
which are in the field of topography or cartography. been complemented by advances in technology
Today, the graphic representation of the envi- to analyze and quantify the parameters involved
ronment and humans, as part of it, is a necessary in the performance. Nowadays, it is impossible to
complement to a number of activities: documenta- reach the international level in any sport without
tion of heritage, traffic accidents, reconstruction the assistance of high technology.
of criminal scenarios, civil engineering, anima-
tions and films, etc. The kinematic and kinetic
variables obtained allow statistical studies and TECHNIQUES USED TODAY
simulations using virtual models (digital models) IN BIOMECHANICS
to describe the behaviour of human beings and
their environment. However, the techniques used The use of photography has been for over a century
are limited by the method of capture, in which the most relevant technique applied to biomechani-
the variables space and time play a decisive role cal analysis of human movement. Photogramme-
in obtaining operating and accurate results in a try is “a set of techniques to obtain information
short space of time. regarding the size, position and orientation of a
By linking topographic techniques, biome- physical object and its environment, through film,
chanics, and virtual modelling applications for measurement and mathematical calculations on
the capture and representation of human body these measurement”, where the spatial coordinates
and its movement, we can obtain results both of the points of interest (anatomical points) are
on the functional variability and on the physical calculated from the planar coordinates of images
variability of subject in motion, and understand from at least two cameras (Woltring, 1995).
how these components interact and how they In these techniques, data acquisition requires
could be improved. The study of the dialectical the matching of points to represent by the opera-
relationship between structure and function is a tor (passive or active markers in terms of bright-

982
Application of Topographical Capture Techniques for Modelling Virtual Reality

ness), and the individual capture of each one of picture of the human figure, formed by a limited
them. After selecting and measuring each point number of points, so the result is very biased and
and with a laborious calculation of the data, it poorly defined.
could be reconstructed the element under study These systems, and other similar with little
(in example a double somersault). This technique variations, always rely on the use of active or
of modelling followed a line of work that went passive markers, being the passive the ones with
“from the point to the surface” and with a huge a best performance (better space-time resolution
human intervention in the intermediate stages and non-simulated identification of markers), but
(Soto, 1995). it is always necessary to use interpolation proce-
In these measurement protocols, entries were dures to estimate the position of the markers in
limited to taking the spatial position of a small the same instant of time, resulting in a distortion
number of points (markers), all attached to the of the final results. All this leads us to have some
body surface, and whose motion approximates systematic errors caused by distortion of the image,
the motion of the segments that form the union of scale errors, accidental errors (incorrect place-
these points (Vera, 1989; Zatsiorski & Donskoi, ment of the markers, errors in scanning, error in
1988). This capture method represents a partial the mathematical processing, etc.), which cause
aspect of reality, because it transform the informa- problems in the reconstruction of coordinates in
tion obtained from sensors (limited in number and space-time and, as result, in the representation of
position) from an recorded image and their further the human figure and its movement.
processing, digitalization and modelling, into a At present the use of Automatic Motion Cap-
numerical representation that can be processed ture 3D, such as VICON, determines the position
by a computer, forcing all to a high consumption of reflective markers attached to the body of the
of resources and time. subject. These are taken up by a set of 6 infrared
Subsequently, they began to be used for opto- cameras that record the movement with frequen-
electronic systems, based on the use of TV cam- cies between 60 and 1000 Hz. This capture must
eras capable of detecting the position of passive be done indoors, under controlled and calibrated
or active markers in terms of brightness, in real conditions, as it is necessary to have geometric
time. These systems have been employed in the reference system. This condition affects the ath-
field of biomechanics of human movement under lete’s actions, because they do not perform on
laboratory conditions. In the 90s, even today, their natural work conditions and environment.
There are used systems as ELITE (Elaboratore di The main purpose of the system is to collect
Immagini Televisive, Center of Bioengineering of the movements of an athlete to apply them to a
Milan, 1983), VICON VX (VInes CONverter for later animation work, resulting in a film of a few
Biomechanics, University of Strathclycle, 1989), seconds of the performed movement. We must
PREMIUM (Precision Motion Analysis System), consider that each second of animation requires
or Expert HirRes Vision 3D (High Resolution 3D approximately one hour of post-processing work
Motion Analysis System). They are classified by computer systems and specialized software, as
according to the sensors they use: PSD (Position Mocap and Motion Builder, and after that tedious
Sensitive Devices), CCD (Charge Coupled De- process, you get a 3D animation of the athlete
vices, online or in area), among others. which is feasible for biomechanical analysis
All this systems are multi-camera equipments. because it include a set of mechanical variables.
The coordinates of the markers are computed in This system is widely used in the film industry
almost real time, which partly reduces the further (Titanic, Star Wars, The Mask, Harry Potter, etc.).
process of data. But we still have a geometric

983
Application of Topographical Capture Techniques for Modelling Virtual Reality

Within the work done in Spain at present, apart • It is necessary to use markers attached on
from the above, you can highlight the following: the human body, which limits the number
of metric points to be captured.
• The Institute of Biomechanics of Valencia • The data collection is done “in door” and
(IBV), where the system “Kinescan/IBV”, under predefined conditions.
similar to VICON, can perform a com- • They have a very laborious post-treatment
plete analysis of movements in 3D based process.
on digital video technology with reflective • In some cases, between the data collection
markers. This system has a post-process of and the delivery of the results, there is a
the captured images by manual digitizing, long time and the conditions of the athlete
and as a result, the application provides may have changed.
information about the three-dimensional
movement of people in space: velocity, ac-
celeration, mechanical impulses, moments A NEW PROPOSAL BY THE
angular or energy. USE OF SYSTEMS BASED
• The Faculty of Physical Activity and Sport ON LASER CAPTURE
at the University of Granada, particularly
the Biomechanics Research Group. It has Some years ago, our working group was looking
to be highlighted their work on three-di- for new capture and modelling systems. We tried to
mensional photogrammetry, performance link capture and mapping cartographic techniques
evaluation and quantitative analysis of and the systems used in virtual reality techniques,
sport. which did not allow metric relationships between
• The group of Biomechanics and the reality and the final model. In this way, we
Ergonomics of the Center for Innovation extended to three dimensions the stage. Without
in the Industrial Enterprise (CINEI), losing the reference to the traditional mapping
Department of Technology of the models, we conduct tests that would allow us to
Universitat Jaume I of Castellon. To mea- develop a simple working procedure, valid for
sure the movement they use high speed obtaining rapid and real representation of reality
B/W digital video cameras (Speedcam with above all, dimensional metrics.
+500), which allow an analysis of 2D mo- As a subject of study, we focused on the hu-
tion. Complementing, they have digital man body because, as we have seen in previous
cameras (used for postural measures) and examples, the static object was being handled at
specific gloves to measure the position of that time with very positive results.
the hands (18 sensors Cyberglove). The road was not easy because the laser sys-
tems without prism were not very common as
In conclusion, we note that the common de- there were only three companies that sold them
nominator of all these systems is that the meth- and they were very expensive, so that, the num-
odology for obtaining representative points of ber of equipment in Spain was reduced and its
the human figure: availability for our tests was very limited. But it
was a fact that its use would allow us to test data
• They use capture system based on video or capture without discrimination of markers, and
digital photography. to capture the whole human figure, not only a
limited number of points. If we add the rapidly in
the point capture, completely and unlimited, and

984
Application of Topographical Capture Techniques for Modelling Virtual Reality

the small post-processing required to achieve a • In the context of the final work of the
three-dimensional digital image with real coordi- degree “A comparative study of details
nates (on a grid and capable of performing actual from a survey of classical precision and a
measurements on the same), the methodology al- lift with 3D laser system: The Warrior of
lowed us working with special data on real-time, Xian” (School of Topography, Geodesy
where the space-time relationship, it was defined and Cartography), it was carried out a pi-
at the time that the capture was made. lot study with a static human figure, show-
Our first objective was to obtain inertial ing as result a mesh of points equidistant,
parameters suitable for modelling of the human which encouraged us to continue our in-
body, involving on it the use of anatomical points vestigations in this direction.
and segments interconnected under a local or real • Then, we planned the first study of the
reference system where there were an interrelation- male human figure, using a model (Figure
ship between them and the rest of points of the 7). They were made several shots under
human body under study. In order to do that, the different static positions of the body in or-
following practical experiences were carried out: der to obtain as much information as pos-
sible from the results. The results provided
• We studied, at the technical level, the work us with a cloud of point at intervals of 5
that had been carried on the heritage, fo- mm, so we got a continuous graphical rep-
cusing on statues and irregular patterns. In resentation of the human figure.
this review, we obtained the first informa- • We proceeded to repeat the previous test,
tion to plan further experiences, focusing but this time with a female human figure.
on the representation of the human figure. (See Figure 8)
• Initially, in order to get an idea of the good-
ness of the system and initial capture pa- In both cases the results were very acceptable
rameters, we planned the capture of a static and provided us with a very rich experience for
non-human element; we used a wood mod- future experiments.
el commonly used by painters. The results In a second test, the high-speed capture pro-
were very exciting because we will have tocols of the equipments (1 to 3 seconds) were
in a short period of a three-dimensional used to provide some results on the environment
model of high quality. and movement occurring in it.

Figure 7. (A) Picture of the laboratory test Figure Madrid -2005 (B) The digital model obtained Madrid
-2005

985
Application of Topographical Capture Techniques for Modelling Virtual Reality

Figure 8. (A) Picture of the test with a female model Madrid -2005 (B) Digital pictures obtained by
rotation of the model Madrid -2005 (C) Detail of the obtained digital female model Madrid -2005

The results were more striking than those (Ibáñez & Jiménez, 2006): “An approach
obtained previously, because in seconds we had to anthropology in sport by mapping tech-
available on the computer screen a raw 3D image niques based on laser-scanning systems:
of the surroundings without any post processing. Survey a 3D human arm.”
The results were
However, it appeared the problem of shadows and acceptable, showing metrics precisions
blind spots produced by the system itself. around 6.3 mm between the mesh points.
An unexpected result was the colour tex-
• After analyzing the results obtained for tures offered by the model after texturing,
whole-body models, we proceeded to once analyzed the colour textures seemed
perform a test of a body part in more de- to correspond with muscle relaxation and
tail.
This was based on the study of human contraction, opening this results another
arm. Implemented within the framework line of possible future research.
of a final work of the degree at School of • After carrying out successfully the last
Topography, Geodesy and Cartography experiences, we were ready to perform

986
Application of Topographical Capture Techniques for Modelling Virtual Reality

the first study of the movement, which conclusive results but very hopeful. In
had not been previously performed in the parallel, we are working on systems for
world. The results were not the most opti- real-time 3D representations by virtual
mal, but they served to correct the capture images (holograms), in order to perform
parameters and to design the following an accurate representation of the human
experiences. body. In this vein, we have opened a way
• Based on the analysis of previous results, for research by looking at metrics in video
we proceeded to design and conduct a test capturing systems and its real 3D represen-
in laboratory conditions. We design a ramp tation, which combined with 3D laser cap-
with a known slope for rolling down a tures, could allow a process of capture and
ball which was moving with a controlled analysis almost in real time.
speed. With this test we had two targets:
(See Figure 9)
◦◦ Study of the movement of the ball
rolling down, considering it as a 3D CONCLUSION
object.
◦◦ Study of the velocity and acceleration In the last five years, the 3D laser capture systems
of the ball in all its way at intervals have had a stunning technological improvement,
of 1 mm. and its tendency is to achieve a higher improve-
• With the above results, it was designed an ment in medium term. The situation is similar to
experiment in real environment conditions: the development of conventional photography:
a race in the running track. We studied the 100 years ago a man should pose for not less than
evolution of the velocities and accelera- 5 minutes in order to have a picture, and now in
tions alone a 100 meters race. As a result seconds we got a lot of high quality shots. Thus,
we can see in the picture below the accel- following the example, these systems are evolv-
eration or deceleration caused by steps of ing in the same way but in very short period of
the athlete, which form the “v shapes” in time, they are evolving from low resolution and
the image. (Figure 10) high capture time to high resolution and shorter
• At present we keep on working on differ- capture times in just one decade.
ent experiences of motion capture and its The members of this working group are in
possible applications to sports, without direct contact with the companies who sell these

Figure 9. (A) Picture of the setting of the study of velocity and acceleration of a ball Madrid -2007 (B)
Picture of the results of the study of velocity and acceleration of a ball Madrid -2007

987
Application of Topographical Capture Techniques for Modelling Virtual Reality

Figure 10. Results of the acceleration of a 100 m athleteMadrid - 2008

systems, and based on their experience, they modelling of the human figure using mapping and
understand that in 5 years time these equipments its application to the moving object”, which is in
will be affordable to all individuals who need phase of completion.
them (their price today is high) and they will take
shorter to perform a high quality real-time capture
of a moving object. However, the working prin- REFERENCES
ciples and calculations that have been described
in this work will not vary much and, therefore, we Adams, L. (1992). Programación gráfica. Técni-
believe that the results we are getting are useful cas avanzadas de modelado, acabado y animación
not only for the development of better equipments 3D. Madrid, España: ANAYA Multimedia SA.
by companies, but also for future researchers in Allard, P., Stokes, I. A. F., & Blanchi, J. P. (1995).
order to continue this exciting work. Three-dimensional analysis of human movement.
Is worth noting the support and interest aroused Chicago: Human Kinetics.
by this study in institutions such as EPOCH-
European Network-European Research Network Barber, D., Mills, J., & Bryan, P. (2004). Towards
on Excellence in Processing Open Cultural A Standard Specification For Terrestrial Laser
Heritage, IST-2002-507382, framed in the Sixth Scaning. In Cultural Heritage. Presented at
Framework Program of the European Union, and International Society for Photogrammetry and
in other sports clubs, research institutions (traffic Remote Sensing. Antalya: Istambul.
accidents, criminal, insurance, etc..) and Technical
Borg, C. E., & Margin, M. (2003). Escáneres 3D de
Faculties (engineering, aeronautics, seismology,
largo alcance: ¿Avanzando hacia una herramienta
mining, archaeology, etc.).
híbrida o hacia una metodología híbrida? Datum
As we have commented, the project is cur-
XXI, 1(5), 42–46.
rently underway. The 90% of the program has
been already concluded, so we can offer partial Bracci, S., Falletti, F., & Scopigno, M. M. R.
results of the work done. Some the initial results (2004). Explorando David: diagnóstico y estado
will be released in the doctoral thesis of Pedro de la conservación. Roma, Italia: Giunti Press.
Calvo Merino, under the title “Three-dimensional

988
Application of Topographical Capture Techniques for Modelling Virtual Reality

Carter, J. E. L. (1982). Physical structure of Ikemoto, L., Gelfand, N., & Levoy, M. (2003). A
Olympic athletes. Part I: The Montreal Olympic hierarchical method for aligning warped meshes
Games anthropological project. Basel, Switzer- [Un método jerárquico para alinear acoplamien-
land: Krager. tos combados]. Paper presented at the Fourth
International Conference on 3D Imaging and
Demir, N., Bayram, B., Alkış, Z., Helvaci, C.,
Modeling (3DIM), October 6-10, 2003, Banff,
Çetin, I., & Vögtle, T. (2004). Laser scanning
Alberta, Canada.
for terrestial photogrammetry, alternativa system
or combined with traditional system? Presented Leica Geosystems. (2005). Documentación Téc-
at International Society for Photogrammetry and nica y Manuales de Referencia sobre sistemas
Remote Sensing, Istambul. láser [Electronic Version]. Retrieved from http://
www.leica-geosystems.com/es/index.htm
Donskoi, D., & Zatsiorski, V. M. (1988). Bio-
mecánica de los ejercicios físicos. La Habana, Lerma, J. L. (1999). Reconocimiento de materiales
Cuba: Pueblo y Educación. y deterioros en fachadas arquitectónicas. Datum
XXI, 1(0), 25–27.
Esparza, F. (1993). Manual de cineantropometría.
Pamplona, España: FEMEDE. Leva, P. (1994). Adjustments to Zatsiorsky-Seluy-
anov’s segment inertia parameter. Kinesiology
García, P. (n. d.). Kinantropometria o cineantro-
Department. Bloomington, IN: Indiana University.
pometria: Definición y alcance social [Versión
electrónica]. Universidad de Caracas. Retrieved Levoy, M., Rusinkiewicz, S., Ginzton, M., &
February 8, 2005, from http://www.rendeportin. Ginsberg, J. (2000). The digital Michelangelo
com.ve/kinan.htm project: 3d scanning of large statues. Department
of Computer Science and Engineering, University
Ghosh, S. K. (1979). Analytical photogrammetry
of Washington. Retrieved October 6, 2004, from
[Forogrametría analítica]. New York: Pergamon
http://graphics.stanford.edu/papers/dmich-sig00/
Press.
dmich-sig00.html
Gonzalez-Aguilera, D. (2001). Consideraciones
Moeslund, T. B., Madsen, C. B., & Granum, E.
sobre el análisis de la fiabilidad en el patrimonio
(2003). Modelling the 3d pose of human arm and
edificado. Fiabilidad y fotogrametría arquitec-
shoulder complex utilisima only two parameters.
tónica [Versión electrónica]. REDCientifica.
Paper presented at the meeting International
Retrieved November 8, 2004, from http://www.
Conference of Model-based Imaging, Rendering,
redcientifica.com/imprimir/doc200111070001.
Image Analysis and Graphical Special Effects,
html
Rocquencourt, Francia (pp. 11-19).
Hogarth, B. (1996). Dynamic Anatomic. Köln,
Ojeda, J. C., Martinez, R., Gonzalez, F., &
Germany: Benedikt Taschen Verlag GmbH.
Sanchez, J. A. (2002). Generación de modelos
Ibáñez, S., & Jiménez, J. (2006). Aproximación a tridimensionales de curvas y túneles. Mapping-
la antropología en el deporte mediante técnicas Interactivo, Artículo 180. Retrieved October 4,
cartográficas basadas en sistemas láser-escáner: 2004, from http://www.mappinginteractivo.com/
Levantamiento 3D de un brazo humano. Unpub- plantilla-egeo.asp?id_articulo=180
lished bachelor dissertation, Universidad Politéc-
nica de Madrid (UPM).

989
Application of Topographical Capture Techniques for Modelling Virtual Reality

Rusinkiewicz, S., & Levoy, M. (2000). QsPlat: Vera, P. (1989). Técnicas de análisis en bio-
A multiresolution point rendering system for mecánica deportiva: estado actual y perspectivas.
large meshes. Paper presented at the meeting Presented at Jornadas Unisport sobre tecnología
SIGGRAPH 2000, Computer Graphics Proceed- del deporte, Málaga.
ings (pp 343-352). Retrieved September 7, 2004,
Vozikus, G. (2004). Lasser Scanning: New method
from http://graphics.stanford.edu/papers/qsplat/
for recording and documentation in Archaeology.
qsplat_paper.pdf
Paper presented at the meeting FIG Working
Salinas, F. S., & Velilla, C. (2003). Estudio mé- Week of Athens. Retrieved October 13, 2004,
trico por fotogrametría terrestre: Documentación from http://www.fig.net/pub/athens/papers/wsa1/
de pequeños monumentos [Versión electrónica]. WSA1_4_Vozikis_et_al.pdf
Mapping-Interactivo, Artículo 175. Retrieved
Warn, H., Marschner, S., Levoy, M., & Hanrohon,
October 4, 2004, from http://www.mappingin-
P. (2000). A practical model for subsurface light
teractivo.com/plantilla-egeo.asp?id_articulo=175
transport. In SIGGRAPH 2000 (Computer Graph-
Shulz, T., & Ingensand, H. (2004). Terrestrial ics Proceedings).
Lasser Scanning: Investigations and Applications
Woltring, H. J. (1995). Smoothong and differentia-
for High Precision Scanning. Paper presented at
tion techineques applied to 3D data. Champaign,
the meeting FIG Working Week of Athens.
IL: Human Kinetic.
Soto, V. M. (1995). Desarrollo de un sistema para
Zatsiorski, V. M., & Donskoi, D. (1988). Bio-
el análisis biomecánico tridimensional del deporte
mecánica de los ejercicios físicos. La Habana,
y la representación gráfica realista del cuerpo
Cuba: Pueblo y Educación.
humano. Tesis Doctoral, Facultad de Ciencias de
la Actividad Física y el Deporte, Universidad de
Granada, España.

This work was previously published in Virtual Technologies for Business and Industrial Applications: Innovative and Syner-
gistic Approaches, edited by N. Raghavendra Rao, pp. 181-200, copyright 2011 by Business Science Reference (an imprint of
IGI Global).

990
991

Chapter 61
GIS Implementation
in Malaysian Statutory
Development Plan System
Muhammad Faris Abdullah
International Islamic University Malaysia, Malaysia

Alias Abdullah
International Islamic University Malaysia, Malaysia

Rustam Khairi Zahari


International Islamic University Malaysia, Malaysia

ABSTRACT
The chapter presents the current state of GIS implementation in Malaysian development plan system.
It offers an overview of GIS implementation worldwide, touching briefly on the history of GIS, plan-
ners’ early acceptance of the system, factors that promote GIS implementation, level of usage among
planners, and factors that impede successful GIS implementation. At the end, the chapter provides a
comparison between the state of GIS implementation in Malaysian statutory development plan system
with the state of GIS implementation worldwide. The evidence was derived from three main sources:
literature, empirical observation of GIS implementation in Malaysia, and a survey conducted in 2008.

INTRODUCTION GIS, they have now become one of the most fre-
quent users of the systems (Budic, 2000; Ceccato
After over two decades since its introduction into & Snickars, 2000; Geertman, 2002; Gilfoyle &
the planning fields, geographical information Wong, 1998; Gocmen, 2009). In describing plan-
systems (GIS) has become one of the important ners’ early resentment towards GIS, Klosterman
tools-of-the-trade for planners (Ceccato & Snick- (1997) points out that this was largely due to past
ars, 2000; Drummond & French, 2008; Gocmen, failures of efforts to computerize planning, such
2009). Despite planners’ early resentment towards as the failure of large-scale urban modelling.
Early GIS implementation was also expensive,
DOI: 10.4018/978-1-4666-2038-4.ch061

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
GIS Implementation in Malaysian Statutory Development Plan System

and the software was highly complicated for plications, such as spatial analysis and modelling,
planners liking. remain underutilized (Gocmen, 2009).
Beginning in the middle of 1980s, the adoption While GIS application among planners con-
of GIS among local governments in the United tinues to be underutilized, the fate of GIS-based
States of America began to increase slowly, and information systems is more difficult to assess,
then sharply in the 1990s (Drummond & French, mainly because such assessment is highly sub-
2008). Researchers attributed this change in GIS jective and timing-dependent (Heeks, 2002).
adoption rate to better and cheaper hardware and What is considered a success to one person may
software availability, as well as better GIS data be a failure to another, and what is considered a
accessibility (Drummond & French, 2008; Geert- success today may be a failure tomorrow. The
man, 2002; Gilfoyle & Wong, 1998; Klosterman, difficulty in assessing the success and failure of
1999). GIS-based information systems is made worst
Key actions by governments also helped to ac- by the propensity of system developers to only
celerate GIS adoption in the 1990s. For instance, report success story. However, several authors
the publication of Chorley Report in 1987 has suggest that there are many cases of failed GIS-
helped to increase GIS awareness and provided based information systems (Abdullah et al, 2002;
fundamental directions for GIS development in Heeks, 2002; Lee & Ahmad, 2000). For instance,
Britain (Gilfoyle & Wong, 1998). In Wisconsin, Heeks (2002) says that,
U.S.A., the state-funded Wisconsin Land Informa-
tion Program and the enactment of the Compre- On the basis of…these surveys, one may estimate
hensive Planning Law of 1999 have significantly that something like one-fifth to one-quarter of
contributed to increase GIS adoption in the state industrialized-country IS (i.e. information system)
(Gocmen, 2009). projects fall into the category of total failure
Since the 1990s, the use of GIS among planners category; something like one-third to three-fifths
has been widespread. Planners began to adopt this fall into the partial failure category; and only a
‘new’method in the course of their work, especially minority fall into success category. (p. 102)
in terms of map-making and land data storing.
GIS-based information systems were developed Nevertheless, researchers tend to agree that
and deployed to allow planners and stakeholders organisational factors are more important than
better access to information and data (Craglia & technological ones in ensuring a successful
Signoretta, 2000; Gilfoyle & Wong, 1998; Heeks, implementation of an information system (Abdul-
2002). However, widespread implementation lah et al., 2002; Heeks, 2002; Innes & Simpson,
of GIS in the planning field does not translate 1993; Ramasubramanian, 1999). Organisational
into full utilization of GIS application. Research mission to implement and support information
indicates that planners’ regular use of GIS has system, retaining key and trained staff involved
been largely limited to the basic functions of the in the development and implementation of the
systems, such as mapping and accessing land system, data availability, and system designers’
information, for routine operational and manage- understanding of users’ needs are among the or-
ment tasks including permit processing, land data ganisational factors identified as key ingredients
storing and map presentation (Budic, 2000; Gill, for a successful information system.
Higgs, & Nevitt, 1999; Klosterman, 1997; Men-
necke & West, 2001). Even at present, prevalent
use of GIS among planners continues to centre
on rudimentary applications while advanced ap-

992
GIS Implementation in Malaysian Statutory Development Plan System

BACKGROUND aims at introducing a more integrated and holistic


approach to planning, and enhancing the control
Malaysian Statutory and regulation of planning in Malaysia (Ahmad
Development Plan System Sarkawi, 2006). Prior to the amendment, the Ma-
laysian statutory development plan system was a
Town and country planning system in Malaysia is two-tier system consisted of structure plans and
governed by the Town and Country Planning Act local plans. Following the amendment, the system
19761 (Johar, 2003; Abdullah & Ariffin, 2003). now consists of a hierarchy of development plans
Central to this Act are the requirements for plan- ranging from national to local levels. At the top of
ning authorities in Malaysia to formulate planning the hierarchy is the national physical plan. This
and development policies through preparation of is followed by the state structure plan, the local
statutory development plans and to regulate de- plan and the special area plan (Figure 1).
velopment through development control process. The national physical plan sits at the top of
The statutory development plans required under the hierarchy of statutory development plans in
the Act are national physical plan, state structure Malaysia. Prepared by the Federal Department of
plan, local plan and special area plan. Town and Country Planning (FDTCP), the plan
The administration of this Act is carried out provides strategic development framework for
at three different governmental levels which are the whole of West Malaysia. The plan was ap-
the Federal, the State and the Local Government proved by the National Physical Planning Coun-
(Ahmad Sarkawi, 2006). Planning authorities are cil in 2005 and shall be in effect until 2020. It is
established at each level of the government to un- subject to revision every five years.
dertake planning of different scale and functions. The state structure plan is prepared by the
The Town and Country Planning Act 1976 State Department of Town and Country Planning
was amended in 2001 to make provision for an (State DTCP). The plan translates the development
improved system of statutory development plans framework of national physical plan to state level,
in Malaysia. The 2001 amendment introduced and provides planning policies and proposals for
additional set of plans into the system, which the whole area within the state boundary. The

Figure 1. Malaysian statutory development plan hierarchy

993
GIS Implementation in Malaysian Statutory Development Plan System

form and functions of state structure plan remain In general, the contents of these statutory
similar to the ‘old’ structure plan prior to the 2001 development plans consist of written statements
amendment of the Town and Country Planning and maps. The written statements present the
Act 1976. The only difference is that the current planning policies and development proposals for
state structure plan is a state-wide plan, while the the planned area, while the maps act as supporting
previous structure plan was prepared for a part of documents that clarify the policies and proposals
an area within state boundary. At present, eight through graphical representations. The national
out of eleven states in West Malaysia already had physical plan and the state structure plan are subject
their state structure plans published in the official to revision every five years. On the other hand,
gazette. Once published, the plan becomes a legal the revision of a local plan can be performed by
document that guides the planning and develop- the local planning authority whenever necessary.
ment within the state. Preparations of all statutory development plans
The 2001 amendment also affects the local in Malaysia are outsourced to private planning
plan. Although the form and functions of local consultants. A team of planning consultants is
plan remain the same as before the amendment, appointed for each statutory development plan
the plan is now a district-wide plan. Previously, preparation project and is responsible for collect-
it covered only a part of an area under a local ing data, performing analyses and writing reports.
planning authority’s jurisdiction. Thus, the present Meanwhile, the planning authorities administer
local plan is commonly referred to as the ‘district the preparation process by providing supervision
local plan’ in order to distinguish it from the ‘old’ and funds, as well as handling the publicity and
local plan. the gazette process of the plan. The planning
Local plan is prepared by local planning consultants, based on analyses of the collected
authority to set out detail policies and proposals data, propose planning policies and development
pertaining to development and use of land in the proposals for the planned area for commentary
district. The plan also translates strategic policies by the planning authorities. Once finalised, the
and proposals of state structure plan into details to plan will be put on public display before being
be implemented at the district level. Usually this published in the government’s gazette. Figure 2
involves detailing the exact location, components presents the flow chart of a typical statutory de-
and the cost of development. At present, a total of velopment plan preparation process in Malaysia.
108 local plans have been published throughout Although there are other types of development
West Malaysia. plans prepared by planning authorities in Malay-
Similar to the local plan, the special area sia, such as regional plans and rural growth cen-
plan is also prepared by local planning authority. tre plans, their preparations are not bounded by
Nevertheless, special area plan covers a smaller the requirements in the Town and Country Plan-
area in comparison to the local plan and contains ning Act 1976. Thus, these plans are sometimes
proposals that are more detailed in nature. Special termed as non-statutory development plans. For
area plan is prepared for an area that requires the purpose of this chapter, any reference to de-
special planning because of its unique character. velopment plans in the following discussions will
At present, special area plans have been prepared refer to the statutory development plans.
for areas that require special conservation efforts
because of their historical or natural environmental
value, and for areas that are subjected to intense
development pressure.

994
GIS Implementation in Malaysian Statutory Development Plan System

Figure 2. Typical development plan preparation process and integration with GIS (Adapted from Jabatan
Perancangan Bandar dan Desa Semenanjung Malaysia, 2001 and Jabatan Perancangan Bandar dan
Desa Semenanjung Malaysia, 2007)

995
GIS Implementation in Malaysian Statutory Development Plan System

History of GIS Implementation GIS research and development are being pursued
in Malaysian Statutory actively with universities playing significant roles
Development Plan System in developing GIS model and exploring new areas
where GIS can add value in the planning process
GIS was introduced in Malaysia in the early (Abdullah, Abdullah, & Ibrahim, 2009; Samat,
1980s. Early record shows that GIS was first 2006; Tan, 2005).
used by Malaysian Department of Agriculture in
1981 (Idrus & Harman Shah, 2006). It was only Information System
over a decade later that GIS was officially used Planning Masterplan
in planning in Malaysia.
The year 1993 marked the first GIS use in In 1996, the Malaysian Government launched
Malaysian planning when Putrajaya, the new e-government initiatives as part of its efforts to
Malaysian Federal Government Centre, was being employ information and communication technol-
planned (Abu Bakar, 2004). In the process, GIS ogy (ICT) in order to reinvent the way it operates.
(MapInfo version 1) was used, but only for map E-government was one of the seven flagship ap-
presentation. In that same year, the Local Plan for plications initiated under the Multimedia Super
Parit Buntar and Bagan Serai area was also being Corridor programme (Lee & Ahmad, 2000). Under
prepared. This presented another opportunity for the programme, government departments were
FDTCP to try to use GIS in planning. Thus, the required to formulate and implement proposals
first executive information system for a develop- for integration of ICT in their work in order to
ment plan, known as SMaRT, short for Sistem achieve speedier and better service delivery. In line
Maklumat Rancangan Tempatan, or Local Plan with the Malaysian Government’s e-government
Information System, was developed for that local aspiration, in 1997, FDTCP introduced its Infor-
plan (Abu Bakar, 2004). mation System Planning Masterplan. This master-
Beginning with these two GIS initiatives, GIS plan outlines the department’s approach towards
use in Malaysian planning gradually increased. integration of ICT into planning in Malaysia.
Early widespread implementation of GIS centred Nevertheless, many of the ICT systems proposed
on its implementation in the preparation of devel- to be developed under the masterplan have some
opment plans. By 1997, GIS was already embed- GIS elements in them. According to Mohd. Nazri
ded into the preparation process of development Abdullah (2004), based on the detail explanation
plans in Malaysia (Ibrahim et al., 2004). From the in the masterplan, it can be concluded that 80%
initial focus on development plan preparation, GIS of the proposed systems would lead to some GIS
was then implemented in other activities of plan- applications in planning. Among the ICT systems
ning including development control and land use proposed by the masterplan are:
change monitoring, as well as being the platform
on which tailor-made e-planning systems were • Development and Planning System;
developed (Abdullah, 2004). • Planning Monitoring System;
Today, GIS is a common knowledge among • Planning Approval System;
planners in Malaysia. Formal GIS training is now • Counter Service System;
offered to all planning graduates at all planning • Appeal Board System;
schools in the country (Ibrahim et al., 2004). • State Planning Council Decision Support
On-demand GIS trainings are also offered by System; and
various parties, including universities and private • Data Warehousing and Information
GIS companies, to public and private planners. System.

996
GIS Implementation in Malaysian Statutory Development Plan System

The masterplan recognizes the benefits of area. The plan would determine the type of de-
GIS to planning, and thus outlines measures to velopment allowable in an area, or land parcel,
integrate GIS into planning in a more coordinated as well as any restrictions (such as development
approach. Among the key thrusts of the masterplan density, plot ratio and building height) associated
pertaining to GIS are to increase GIS awareness to that land. The use of GIS would development
and knowledge among FDTCP officers, and to of an information system and its usage in the
promote GIS use in planning, especially in the development control process. Such information
development plan preparation process. system would allow users, especially council
planners, to quickly search for the land parcel
GIS Implementation in affected by a planning application using com-
Statutory Development Plan puter and to ascertain the planning policies and
Preparation Process the development control requirements related to
that land. Such system would also help speed up
The Information System Planning Masterplan’s the development control process considerably as
proposal to promote GIS use in development compared to manual search and reference tasks,
plan preparation process is perfectly appropriate especially when a plan may consist of more than
for several reasons. Firstly, development plan one hundred policies (refer Table 1). The less time
preparation involves the production of a signifi- taken to process a planning application would fit
cant number of maps. For instance, Terengganu very well into the e-government initiatives, which
State Structure Plan contains 62 maps and Perak seek for speedier service delivery.
State Structure Plan contains 52 maps (Table 1).
Nevertheless, it has to be noted that these are just Problems Faced During Early
the number of maps included in the published GIS Implementation Period
plan. The number of maps prepared throughout
the entire plan preparation process is notably Following the introduction of the Information
higher since there are also other reports like the System Planning Masterplan, FDTCP began its
inception report and the report of survey (Figure promotion of GIS. Officers from the department
2), which have to be produced in the process and
these reports contain a sizeable number of maps Table 1. Number of maps in published state
as well. In fact, report of survey usually contains structure plans
more maps than the published plan itself. There-
fore, even though the number of maps included No. State Structure Plan No. of No. of
maps policies
in Pahang State Structure Plan is only 15 (Figure
1. Terengganu State Structure 69 158
2), the total number of maps produced during the
Plan
Plan’s entire preparation process is considerably
2. Perak State Structure Plan 52 49
higher. Due to the high number of maps produced
3. Melaka State Structure Plan 29 165
during development plan preparation process,
4. Pulau Pinang State Struc- 29 79
many opportunities to use GIS arise in that pro- ture Plan
cess. Maps, which were usually prepared using 5. Selangor State Structure 26 100
computer-aided design (CAD) software (and in Plan
some cases using Microsoft PowerPoint) previ- 6. Negeri Sembilan State 25 111
Structure Plan
ously, can now be prepared using GIS software.
7. Pahang State Structure Plan 15 29
Secondly, statutory development plan forms
the basis for development control for the planned 8. Johor State Structure Plan n/a n/a

997
GIS Implementation in Malaysian Statutory Development Plan System

were sent to GIS trainings and road shows were FDTCP did not provide any detail guidelines on
conducted at various town planning departments land use colour classification, database structure,
in the country. FDTCP also conducted several metadata structure and the like. Lack of detailed
workshops to discuss in detail the areas where guidelines led to non-standardised GIS outputs
GIS can be used in the development plan prepa- prepared by the few consultants who decided to
ration process and, later on, introduced GIS to use GIS in the plan preparation.
private planning consultants who were involved The absence of detailed guidelines from
in development plan preparation projects (Ibrahim FDTCP and lack of GIS knowledge among con-
et al., 2004). sultants also resulted in limited use of GIS in the
Early attempts to integrate GIS into Malaysian development plan preparation process. GIS was
statutory development plan preparation process viewed as a map drawing tool and thus, used as
were not without problems. When initially in- one. Maps were drawn using GIS software but
troduced, GIS received lukewarm reaction from without data attributes. Spatial analyses were
FDTCP officers, created confusion among private still conducted manually despite the availability
planning consultants and failed to achieve what it of GIS software and digital maps. However, the
was intended to do. Mohd Ali Abu Bakar (2004) results of these analyses were then drawn onto
reports that despite the promotion and the road the maps using GIS.
shows, GIS was not initially well received by
many officers from the professional category, Quality Control of GIS Outputs
such as planners. Many of them perceived GIS
merely as a mapping tool and thus only useful Having learnt lessons from the problems faced
to sub-professional officers, such as technical during the early period of GIS implementation
assistants and technicians. in the development plan preparation process, in
When GIS was introduced to private planning 1999, FDTCP introduced measures to control
consultants, the reception was not any better as the quality of GIS outputs from the process.
well. To many of them, GIS was completely a A guideline on GIS application for local plan
new thing and most were unclear about how GIS preparation process was produced (Abu Bakar,
was to be integrated into the development plan 2004). Although brief in nature, the guideline was
preparation process. When further clarification rather useful as foundation towards standardisa-
was sought, FDTCP’s response suggested that tion of GIS outputs. The guideline was tested on
what the department wanted was for digital maps several local plan preparation projects before it
to be used in the preparation of the plans. How- was finally improved. In 2001, it was officially
ever, as stated by Mansor, et al. (2004), initially included in the revised FDTCP’s Manual for
many consultants still decided not to use GIS. As Local Plan Preparation (Jabatan Perancangan
digital plans and maps could be produced easily Bandar dan Desa Semenanjung Malaysia, 2001;
using CAD software, not all consultants decided Abu Bakar, 2004).
to venture into using GIS software in their work. Similarly, the FDTCP’s Manual for State
Instead, many of them opted to continue using Structure Plan Preparation was also revised in
CAD software, which they were already using 2003 to include guidelines for GIS application in
prior to the GIS introduction. the plan preparation process. Other guidelines con-
Early implementation of GIS in the develop- cerning map preparation, metadata and database
ment plan preparation process also faced the structure were also produced. These guidelines
problem of standardisation, or the lack of it (Ba- are constantly revised and improved. Today, the
hagian Teknologi Maklumat, 2006). At that time, level of details provided in these guidelines is

998
GIS Implementation in Malaysian Statutory Development Plan System

quite remarkable. For instance, the latest version Level of GIS Application in Statutory
of FDTCP’s Manual Penyediaan Pelan GIS (or Development Plan Preparation
GIS Plan Preparation Manual) goes to the extent
of providing step-by-step instructions on how to Despite having GIS consultants’ expertise at dis-
prepare land use data layers (Bahagian Teknologi posal, present GIS use in most statutory develop-
Maklumat, 2007b). The guideline for Format ment plan preparation projects is still limited to
Metadata dan Struktur Pengkalan Data Sistem map preparation and presentation with minimum
Maklumat Geografi (GIS) Kajian Rancangan Tem- GIS analyses being performed (Ibrahim et al.,
patan (or Format for Metadata and GIS Database 2004; Nor Salehi Kassim, personal communica-
Systems for Local Plan) is similarly detailed in tion, December 11, 2008). In most instances, GIS
contents. The 78-page guideline includes samples analyses performed are restricted to basic analyses
of metadata format for each land use layer and like overlays and catchment (buffer) analyses (G.
pre-determines the name of each of the layers L. Chua, personal communication, December
(Bahagian Teknologi Maklumat, 2007a). 12, 2008). Nevertheless, although rare, there
Another measure taken by FDTCP to ensure were cases of advanced GIS analyses performed
standardisation of GIS outputs is the appointment in development plan preparation, such as in the
of GIS consultant into the team of consultants preparation of the national physical plan and the
that are involved in preparing a development Kulim District Local Plan (Abdullah, Abdullah, &
plan. When GIS was initially introduced into Shahbuddin et al., 2009). In both cases, GIS was
the development plan preparation process, the coupled with decision support systems to conduct
responsibility to prepare GIS maps was on each analysis and generate maps for land suitability
of the planning consultant in the team. Such ar- (Amandus Jr, 2006; Kassim & Islam, 2009).
rangement lacked the necessary coordination to In explaining the reason behind the limited use
ensure that all the maps produced were uniformly of GIS analyses in the development plan prepara-
formatted. With GIS consultant in the team, the tion process, Ibrahim, et al. (2004) states that:
responsibility to coordinate maps preparation is
on the GIS consultant. Because of their advanced …the current limitations in terms of GIS use in
knowledge in GIS technologies, GIS consultants development plan preparation arisen due to in-
are also tasked to conduct GIS analyses, develop adequate comprehension among private planning
GIS database and develop executive information consultants on the analyses capability of GIS and
system for the plan (Jabatan Perancangan Bandar also on how GIS can help to ease their work. In
dan Desa Semenanjung Malaysia, 2007). In terms addition, the preparation of development plan
of map preparation per se, the responsibility does not solely involve analyses of spatial data,
remains with individual consultants in the team. in many instances, analyses of aspatial data, or
However, increasingly, in many development plan a combination of both, are equally important.
preparation projects, GIS consultants are also Marrying the spatial analyses with aspatial ones
given the task to prepare maps on behalf of the might be a little on the complicated side to many
planning consultants. In such cases, the planning planning consultants. (p. 3)
consultants still provide the required information
for GIS maps preparation to GIS consultants.. Although planning consultants who are in-
volved in the development plan preparation are
aware of GIS functions, they are not aware of GIS
full capability for analysis. In particular, they do
not know whether any of the GIS analyses can be

999
GIS Implementation in Malaysian Statutory Development Plan System

used to replace what they have been doing manu- as well. As such, the system is frequently used by
ally. In the same time, GIS consultants also tend the planning authorities’ officers as a substitute
to overlook their responsibility to advice planning for the hardcopy version of the plan.
consultants on GIS analyses because they are The development of EIS for development plan
too preoccupied with developing GIS database is advancing. Earlier, EIS was developed mainly as
and the executive information system, which are a stand-alone system to store GIS maps and reports
deemed as the core of their work. Additionally, produced during the development plan preparation
they receive no request for GIS analyses to be process. The database of the system was fixed and
performed from planning consultants (G.L. Chua, only updated when revision was done to the plan.
personal communication, December 12, 2008). However, recently, EIS began to be developed as
The superficial knowledge of GIS among a multi-user system and deployed over local area
planning consultants, as described above, can be network. The system also now includes features
attributed to the lack of GIS training received. It of a charting2 system, where the database can be
should be noted that many of the planning consul- easily updated through the system’s interface.
tants involved in preparing development plans are With these features, planners are able to update the
seniors in their fields. Many never received any land information in the database without having
formal GIS training during their university days. to actually use GIS software.
Although commercial GIS training is available at EIS for development plan targets three types
present, senior consultants prefer to not attend the of users, which are:
training sessions. Instead, younger staff are sent
for the training. A common idea shared by many • Executives, including council planners,
senior planning consultants is that GIS (and ICT) who make decisions about planning
suits younger people better. applications;
• System administrators who are usually
Executive Information System for planners and technical staff of planning
Statutory Development Plan authorities;
• The general public, including developers
Executive information system (EIS) is another and land owners.
aspect of GIS application in the statutory develop-
ment plan preparation process. The requirement Access to information stored in EIS data-
to produce EIS for development plan is now base is restricted according to the type of users.
mandatory in all development plan preparation Executive users have access to all information
projects (Jabatan Perancangan Bandar dan Desa including the digital version of both the report
Semenanjung Malaysia, 2007). EIS for develop- of survey and the published development plan.
ment plan is basically a computer system that stores However, they do not have access to update the
all GIS maps prepared for a development plan. GIS database. Public users neither have access to
Users can then access these maps through the EIS the reports nor to update GIS database. Instead,
and perform some GIS functions using the toolbar they can only access information regarding the
on the system’s graphical user interface. Usually, proposed land use of a particular land parcel and
the toolbar includes only basic GIS functions such the development restrictions attached to that land.
as calculating distance, accessing information System administrators, obviously, have access to
of land parcels and displaying selected layers of all information and to update the GIS database.
information. There are also EIS that stores the Besides being used as a medium for quick ac-
digital version of the published development plan cess to planning information, EIS is also used in

1000
GIS Implementation in Malaysian Statutory Development Plan System

development control, counter services and public such failure are lack of personnel and financial
exhibition of development plan. FDTCP has also capacities.
begun to encourage the development of internet- In hindsight, the EIS for development plan is ac-
based EIS (Jabatan Perancangan Bandar dan Desa tually forced upon planning authorities by FDTCP.
Semenanjung Malaysia, 2007). It is hoped that by When EIS became a mandatory requirement in
being internet-based, future EIS can be accessed development plan preparation, planning authori-
more easily by users, especially the public. With ties had to accept the system when it is handed
the present system, public users can only access over to them although they may not have the
the system at planning authorities’ offices, either resources to maintain the system. Unfortunately,
at the information kiosk or service counter. many of the planning authorities do not have the
resources to maintain the system. The problem
EIS Implementation Problems of not having sufficient personnel, especially
trained personnel, and funds to maintain EIS is
While EIS development is mandatory for all quite significant among many planning authori-
statutory development plans in Malaysia, the ties, especially those operating in the rural area.
success of its implementation is uncertain due In terms of personnel, some planning authorities
to lack of research on this issue. Nonetheless, it in the rural area do not even have a planner on
seems that some planning authorities are more their payroll. In cases like this, it is usually the
successful in implementing EIS than others. EIS technical assistant who will act as the planner for
implementation at planning authorities such as at the area. There are also cases where a planning
Melaka State DTCP, Petaling Jaya City Council authority has only one staff, either a planner or
and Seberang Perai Municipal Council are often a technical assistant, to handle planning matters
referred to, by those in the planning fraternity, of the area. To worsen the matter, some of these
as examples of successful EIS (G. L. Chua, per- planning authorities are not even equipped with
sonal communication, December 12, 2008; Mohd a computer in which the system can be installed
Ramzi Mohd Hussain, personal communication, (Aishah Abdullah, personal communication,
December 10, 2008). November 18, 2008; Muhammad Hakimi Mohd.
As mentioned previously, EIS for development Hussain, personal communication, November 18,
plan is developed during the plan preparation 2008). Their existing computers are obsolete that
process. The cost to prepare the plan, including they do not have the right specifications to run
EIS, is jointly funded by FDTCP and the planning EIS applications.
authority whose area is being planned for. In most
cases, the former contributes towards a major GIS-Based Statutory Development
portion of the funding. Once EIS development Plan Monitoring System
is completed, the GIS consultant will create an
installation disc of the system and hand it over to The monitoring of statutory development plan
the planning authority. A manual on how to use implementation has not been given sufficient
the system is also prepared and printed out by the attention by planning authorities in Malaysia,
GIS consultant for the planning authority’s use. even at present. In fact, many planning authori-
It is then up to the planning authority to install, ties do not monitor the implementation of their
maintain and upgrade the system. Yet, it is com- development plans. In the event where monitor-
mon for planning authorities to fail to maintain ing of development plan is undertaken, the focus
the system. Often times, the reasons cited for is solely on land use change. Meanwhile, other
aspects of the plans, such as protection of envi-

1001
GIS Implementation in Malaysian Statutory Development Plan System

ronmental resources, management of traffic and GIS-Based Monitoring System


provision of public facilities, are not included in Implementation Problems
the monitoring exercise.
Although the monitoring of development plan Despite being potentially useful, the implementa-
implementation is not widely practiced, the need tion of GIS9 is rather unsuccessful. This is due to
for it is clearly identified, especially for the state the failure of local planning authorities in updating
structure plan. The Manual for Preparation of the data for their area and poor system’s infra-
Structure Plan proposes a procedure that can be structure. As aforementioned, the system is very
employed by States DTCP to monitor their state dependent on local planning authorities updating
structure plans (Jabatan Perancangan Bandar dan the land use data for their area whenever there is
Desa Semenanjung Malaysia, 2007). Additionally, a change of land use. However, the local planning
all published state structure plans include a section authorities failed to do this. As in the case of EIS
that outlines the monitoring programme that the previously, lack of personnel was often cited as
States DTCP intend to undertake. Interestingly, one of the reasons for their failure to update the
while the manual does not specifically propose the data (Anuar Maidin, personal communication,
use of any ICT application in state structure plan November 11, 2008). As a result, the land use data
monitoring, the plans describe the possibility of in the system database is outdated and does not
GIS being used for monitoring purposes. reflect the actual land use allocation in the state.
The only GIS-based monitoring system for Thus, the data cannot be used to provide accurate
state structure plan in Malaysia is the one devel- monitoring results.
oped by Negeri Sembilan State DTCP. Known as The system is also plagued by poor infra-
GIS9, the system was developed for the purpose structure set up. The server experiences too
of monitoring land use change in the state and many problems that lead to numerous down time,
evaluating the change in relation to the land use which at times extending well over two months
allocation proposed in Negeri Sembilan State (Haibernarisal Baijuri, personal communication,
Structure Plan. The system is placed on a server November 16, 2008). The long down time leads
located at the Negeri Sembilan State DTCP office. to data updating exercise being put on hold and
The system is also connected to all local planning discourages users from using the system.
authorities in the state in order to enable them
to update the system’s database when change of Impacts of GIS Implementation in
land use occurs in their area. The updated data Statutory Development Plan System
will then be used to determine the actual land
use allocation in the state. This information will The implementation of GIS in statutory devel-
then be compared with the land use allocation opment plan system has impacted Malaysian
proposed by the state structure plan. Thereafter, planning scene in a number of ways. Perhaps, the
the difference between actual land use allocation biggest impact is how it has contributed towards
and the proposed one will be calculated and the the increase of GIS critical mass in the country.
results will be presented in terms of percentage Although Rogers’ diffusion of innovation theory
(UGisP, 2006). (as cited in Ramasubramanian, 1999) suggests that
critical mass precedes adoption of technology, in
the case of GIS implementation in Malaysia, the
reverse may have happened. When FDTCP intro-
duced GIS in the development plan preparation
process, few planners were aware of what GIS

1002
GIS Implementation in Malaysian Statutory Development Plan System

was all about. Nevertheless, FDTCP had already commissioning a new batch of development plan
decided to make GIS implementation in the plan preparation projects and was looking for private
preparation process mandatory. This created the planners to be appointed as consultants in order
demand for planners with knowledge on GIS. to prepare these plans. Since planning jobs were
Warnecke, Beatie & Lyday (as cited in Drum- scarce, many private planners had no choice but to
mond & French, 2008) traced the beginning of get involved in those development plan prepara-
large-scale introduction of GIS into planning back tion projects. It was through their involvement in
to 1984. So, while it was already 13 years that those projects that they began to obtain exposure
planners elsewhere in the world have been using to GIS and GIS software.
GIS in their work, many Malaysian planners still Planners from the public sector also involved
found GIS a completely new invention when it in development plan preparation projects, al-
was officially introduced into Malaysian develop- though not as consultants, but usually as a project
ment plan preparation process in 1997 (Mansor manager, as a technical committee member or as
Ibrahim et al., 2004). However, the result of a owner of the plan. Thus, similar to private plan-
survey conducted among planners in Malaysia ners, their involvement in the projects helped to
revealed that by 2004, already more than 50% of expose them to GIS. Additionally, many of them
the planners have used GIS software. A similar had the benefit of attending formal GIS training.
survey in 2007 showed that the figure had risen FDTCP, for instance, had established a programme
to almost 70% (Abdullah, Abdullah, & Ibrahim et under which GIS trainings are offered to planners
al., 2005). Looking from this angle, one can see from the public sector. The programme, known as
the profound impact of GIS implementation in Geotechnical Spatial Analysis Research and De-
development plan preparation process in creating velopment, was established in 2000 as a platform
GIS critical mass in Malaysian planning. It took for FDTCP to undertake research and training
only 10 years since GIS was officially introduced on spatial planning and decision-making tools
into the development plan preparation process for it relevant to planning such as GIS, multi-criteria
to become widespread among Malaysian planners. evaluation and spatial decision support systems
Based on our observation, we would like to (Jabatan Perancangan Bandar dan Desa Semenan-
offer two main reasons for the popularity of GIS jung Malaysia, 2008).
among Malaysian planners after its use was made Secondly, the implementation of GIS in the
mandatory in the development plan preparation development plan preparation process has also
process. Firstly, in the late 1990s, Malaysia and led to planning schools in the country to begin
several other countries in the Asian region were to offer, or place greater emphasis on, GIS sub-
faced with economic recession due to the cur- jects with the aim of producing more marketable
rency crisis. Prior to the recession, Malaysia was graduates. Graduates are not only trained through
developing rapidly and many private planners GIS-specific subjects, but also through heavy GIS
were involved as consultants in the construc- integration, both theoretically and practically,
tion industry, especially in terms of formulating within other subjects such as design studios, urban
masterplans for new townships and submitting design, planning theory and practice, environmen-
planning applications for new development. tal planning and so on (Ibrahim et al., 2004). In fact,
However, during the recession, the construction during the late 1990s and early 2000s, having the
industry in Malaysia was badly hit that it almost knowledge to operate GIS software became one of
halted. Suddenly, many of the private planners the, if not the most, important criteria for planning
found that there was no job in the construction graduates to secure employment at planning firms
industry. Around the same time, FDTCP began and authorities. Today, although the importance

1003
GIS Implementation in Malaysian Statutory Development Plan System

of GIS knowledge in securing employment has Recently, the development plan monitoring
reduced slightly (but still important nonetheless), is getting increasing attention from planning au-
planning schools continue to offer GIS-specific thorities. FDTCP is currently working to develop
subjects and the integration of GIS in other related a GIS-based development plan monitoring model
subjects remains high. These ensure that planning (Nor Salehi Kassim, personal communication,
graduates in the country continue to get exposure November 4, 2008). There are also efforts to
to GIS, at least into the foreseeable future. In turn, develop GIS-based systems for development
it helps to retain the critical mass needed to sup- plan monitoring at other planning authorities. For
port continued use of GIS in Malaysian planning. instance, Selangor DTCP will begin developing
their own GIS-based systems soon and Kuala
Lumpur City Hall is also looking into the idea of
FUTURE RESEARCH DIRECTIONS having GIS-based systems to help them monitor
IN GIS APPLICATION IN their local plan.
MALAYSIAN STATUTORY
DEVELOPMENT PLAN SYSTEM
CONCLUSION
Recent developments indicate that the use of GIS
in Malaysian statutory development plan system Looking at GIS implementation in Malaysian
will continue to flourish. Two areas of great op- statutory development plan system, one can iden-
portunity lie in the integration of GIS with decision tify several common problems in relation to GIS
support systems in the process of development implementation in planning worldwide.
plan preparation and the development of GIS- These are:
based systems for monitoring development plan
implementation. • The use of GIS is largely limited to basic
Within the past few years, efforts to institu- functions.
tionalise decision support systems into Malaysian • The implementation failure is due to or-
planning have been on-going (Abdullah, Abdullah, ganisational factors rather than technical.
& Ibrahim, 2009). Experimental projects in which • Data availability limits the success of GIS-
GIS was coupled with decision support systems based information systems.
in order to provide better decision making during
the development plan preparation process were The implementation of GIS in Malaysian de-
undertaken. Whether those projects have added velopment plan preparation process faces several
value to the plan preparation process is, however, persistent problems. For instance, advanced GIS
unclear at the moment. Presently, it seems that applications in the preparation of development
there is lack of research on this issue. Nevertheless, plans are still underutilized. Despite increased GIS
those projects had contributed towards creating awareness among planners, GIS application in the
among Malaysian planners on the integration of development plan preparation process continues
GIS with decision support systems and had laid to be rudimentary in nature. This is most likely
the foundation for further application of GIS and caused by insufficient GIS knowledge among
decision support systems. FDTCP is also work- senior planning consultants who are involved in
ing on a guideline on decision support systems the plan preparation. To mitigate the problem, in
implementation in the development plan prepara- the short term, it may be necessary for FDTCP
tion process at the moment. How this guideline to produce a guideline that details out the types
will turn out remains to be seen. of GIS analyses required in the plan preparation

1004
GIS Implementation in Malaysian Statutory Development Plan System

process. At present, the same measure is already of inadequate financial and personnel resources.
undertaken by FDTCP in order to promote the Although FDTCP has been funding the develop-
use of planning analyses in development plan ment cost of EIS, it is one-off in nature and this
preparation process. In its Manual for Preparation proved to be insufficient. As Alias Abdullah,
of Structure Plan, FDTCP has identified a list of Muhammad Faris Abdullah & Fauzan Nordin
required planning analyses that need to be under- (2003) point out, information system like EIS is
taken during the state structure plan preparation a ‘living’ system and thus:
process. These analyses include, for example,
consistency analysis between plans’ objectives ...require maintenance and system capacity build-
and policies, policy impact analysis, land require- ing from time to time. Accordingly, sufficient
ment analysis and cost benefit analysis (Jabatan funding must be made available. Funding is re-
Perancangan Bandar dan Desa Semenanjung quired for manpower to operate and maintain the
Malaysia, 2007). Similar approach may be taken system, and for capacity building, which include
by FDTCP for GIS analyses. hardware and software upgrades as well as the
In the long run, the number of GIS analyses overall improvement of the system. (p. 79)
used in the development plan preparation may
increase as younger planners with GIS technical For planning authorities that face financial
know-how become planning consultants in the restriction in maintaining their EIS, regular fund-
future. Since GIS is relatively new in Malaysia, ing from FDTCP may be necessary to subsidise
so are GIS courses at planning schools in the part of the maintenance cost. Additionally, the
country. While the senior planners may not have planning authorities must be committed to EIS
the opportunity to learn GIS during their univer- and not use the subsidy for other purposes, except
sity days, younger planners have had formal GIS for maintaining the EIS.
training at university. The effort to utilize GIS in monitoring the
Many researchers point out that strong organi- implementation of development plan has been
sational support is important to ensure successful severely hampered by the problem of data un-
GIS implementation (Abdullah, A. et al., 2002; availability. This is hardly surprising because
Innes & Simpson, 1993; Ramasubramanian, researchers have found that data unavailability
1999). Thus, relevant parties must continue the continues to be one of the most prominent fac-
drive towards advanced application of GIS in tors that limit successful GIS implementation in
statutory development plan system. More GIS re- planning (Gilfoyle & Wong, 1998; Mennecke &
search should be encouraged to discover the areas West, 2001). Therefore, to overcome this problem,
in which advanced GIS application can add value solutions have to be focused towards ensuring
to the development plan preparation process, and data availability. Very often data available are
funds must be made available for such research. generally inaccurate and incomplete, but these
The need for strong organisational support is are also the best available data (Klosterman &
more visible in the case of GIS-based information Abdullah, A., 2008). Thus, it is important that any
systems implementation. EIS for development GIS-based systems for monitoring development
plan could prove useful for planning authori- plan implementation do not require extensive data
ties in implementing the policies and proposals sets that are difficult to obtain in order for it to
of their development plans, and as a medium work. The system must also be flexible enough
to convey planning information to the general to accommodate the best available data.
public. However, its implementation, especially Another interesting development pertaining
at local authority level, is plagued by the problem to GIS implementation in Malaysian statutory

1005
GIS Implementation in Malaysian Statutory Development Plan System

development plan preparation process is the ef- Abdullah, M. F., Abdullah, A., & Ibrahim, M.
fort to institutionalise decision support systems (2009). Institutionalisation of spatial planning
in the process. These efforts are welcomed but it and decision support systems for planning and
has to be implemented with care. For planners governance in Malaysia. In Davide, G., & Abdul-
who are unfamiliar with, or have little knowl- lah, A. (Eds.), Spatial decision support for urban
edge of, decision support systems, it could be and environmental planning: a collection of case
a daunting task to apply it in their work. If its studies. Shah Alam, Malaysia: Arah Publications.
implementation is rushed, it would only lead to
Abdullah, M. F., Abdullah, A., Ibrahim, M., &
planners finding it too difficult and consequently
Abdul Samad, D. (2005). A study on the demand
losing interest to continue to use decision support
of spatial planning and decision support system in
systems. Researchers have suggested that one of
Malaysia. Planning Malaysia, 3, 47–60.
the important characteristics of successful tech-
nological implementation is that it must allow for Abdullah, M. F., & Ariffin, I. (2003). Incorporat-
incremental and small trials (Innes & Simpson, ing sustainable development objectives into de-
1993; Ramasubramanian, 1999). Planners must be velopment plans through strategic environmental
exposed slowly to decision support systems. The assessment. Planning Malaysia, 1, 1–18.
decision support technology and models adopted
Abdullah, M. N. (2004). Pelaksanaan sistem
must be able to sustain planners’ interest in the
maklumat geografi di Jabatan Perancangan Bandar
system and relevant to the planning tasks. We
dan Desa Negeri: dilema dan potensi. Malaysian
must not adopt complex models which are only
Townplan, 2(1), 29–37.
of pedagogic value but not useful for plan making
purposes (Batty, 2004). Abu Bakar, M. A. (2004). Sistem maklumat geo-
grafi di Jabatan Perancangan Bandar dan Desa
Semenanjung Malaysia. Malaysian Townplan,
REFERENCES 2(1), 19–23.

Abdullah, A., Abdullah, M. F., & Nordin, M. F. Ahmad Sarkawi, A. (2006). The legal consid-
(2002). Spatial information system as a tool in erations of neighbouring lands in development
managing urban development process. Paper planning: the Malaysian context. Newcastle-
presented at the 18th EAROPH World Planning upon-Tyne, United Kingdom.
Congress. Amandus, P. Jr. (2006). MCDM and land suit-
Abdullah, A., Abdullah, M. F., & Nordin, M. F. ability analysis for spatial development: case
(2003). Managing urban development process study Kulim local plan. In Shamsuddin, K. (Ed.),
by using spatial information system. Planning Implementing Multi Criteria Decision Making in
Malaysia, 1, 71–92. Malaysian Town Planning. Kuala Lumpur: Fed-
eral Department of Town and Country Planning
Abdullah, A., Abdullah, M. F., Shahbuddin, M. Peninsular Malaysia.
N. A., & Klosterman, R. E. (2009). Modelling
the reality in planning: SPDSS experience in Bahagian Teknologi Maklumat. (2007a). Format
Malaysia. Malaysian Journal of Environmental Metadata dan Struktur Pengkalan Data Sistem
Management, 10(1), 119–132. Maklumat Geografi Kajian Rancangan Tempatan.
Kuala Lumpur: Jabatan Perancangan Bandar dan
Desa Semenanjung Malaysia.

1006
GIS Implementation in Malaysian Statutory Development Plan System

Bahagian Teknologi Maklumat. (2007b). Manual Gocmen, Z. A. (2009). GIS use for planning in
Penyediaan Pelan GIS. Kuala Lumpur: Jabatan Wisconsin’s public agencies. Paper presented at
Perancangan Bandar dan Desa Semenanjung the 11th International Conference on Computers
Malaysia. in Urban Planning and Urban Management.
Batty, M. (2004). Dissecting the streams of plan- Heeks, R. (2002). Information Systems and De-
ning history: technology versus policy through veloping Countries: Failure, Success, and Local
models. Environment and Planning. B, Planning Improvisations. The Information Society, 18(2),
& Design, 31, 326–330. doi:10.1068/b3103ed 101–112. doi:10.1080/01972240290075039
Budic, D. Z. (2000). Geographic information sci- Ibrahim, M., Abdullah, M. F., & Abdul Samad,
ence implications for urban and regional planning. D. (2004). Geographical information system
URISA, 12(2), 81–93. (GIS) in Malaysian planning education. Paper
presented at the 7th International Seminar on GIS
Ceccato, V. A., & Snickars, F. (2000). Adapting
for Developing Countries (GISDECO).
GIS technology to the needs of local planning.
Environment and Planning. B, Planning & Design, Idrus, S., & Harman Shah, A. H. (2006). ‘Me-
27, 923–937. doi:10.1068/b26103 tabolisme’ GIS di Malaysia. Buletin Geospatial
Sektor Awam, 1, 13–19.
Craglia, M., & Signoretta, P. (2000). From global
to local: the development of local geographic Innes, J. E., & Simpson, D. M. (1993). Implement-
information strategies in the United Kingdom. ing GIS for planning. Journal of the American Plan-
Environment and Planning. B, Planning & Design, ning Association. American Planning Association,
27, 777–788. doi:10.1068/b2651 59(2), 230. doi:10.1080/01944369308975872
Drummond, W. J., & French, S. P. (2008). Jabatan Perancangan Bandar dan Desa Semenan-
The Future of GIS in Planning: Converging jung Malaysia. (2001). Manual Penyediaan
Technologies and Diverging Interests. Journal Rancangan Tempatan (Pindaan 2001). Kuala
of the American Planning Association. Ameri- Lumpur: Jabatan Perancangan Bandar dan Desa
can Planning Association, 74(2), 161–174. Semenanjung Malaysia.
doi:10.1080/01944360801982146
Jabatan Perancangan Bandar dan Desa Semenan-
Geertman, S. (2002). Participatory planning and jung Malaysia. (2007). Manual Penyediaan
GIS: a PSS to bridge the gap. Environment and Rancangan Struktur. Kuala Lumpur: Jabatan
Planning. B, Planning & Design, 29, 21–35. Perancangan Bandar dan Desa Semenanjung
doi:10.1068/b2760 Malaysia.
Gilfoyle, I., & Wong, C. (1998). Computer ap- Jabatan Perancangan Bandar dan Desa Semenan-
plications in planning: Twenty years’. Plan- jung Malaysia. (2008). Geotechnical spatial analy-
ning Practice and Research, 13(2), 191–197. sis research and development: fokus dan agenda.
doi:10.1080/02697459816193 Retrieved December 12, 2008, from http://www.
townplan.gov.my/penyelidikan_gsard.php
Gill, S., Higgs, G., & Nevitt, P. (1999). GIS in
planning departments: Preliminary results from Johar, F. (2003). Environmental concern in local
a survey of local planning authorities. Plan- planning. Planning Malaysia, 1, 19–34.
ning Practice and Research, 14(3), 341–361.
doi:10.1080/02697459915643

1007
GIS Implementation in Malaysian Statutory Development Plan System

Kassim, N. S., & Islam, R. (2009). Spatial AHP Tan, T. S. (2005). Electronic local authority man-
for the National Physical Plan of Malaysia. In agement system. Planning Malaysia, 3, 27–46.
Davide, G., & Abdullah, A. (Eds.), Spatial decision
UGisP. (2006). GIS9: Pembangunan Pengkalan
support for urban and environmental planning: a
Data Sistem Maklumat Geografi Negeri Sembilan
collection of case studies. Shah Alam, Malaysia:
Darul Khusus - Laporan Akhir Fasa III.
Arah Publications.
Klosterman, R. E. (1997). Planning support sys-
tems: a new perspective on computer-aided plan- ADDITIONAL READING
ning. Journal of Planning Education and Research,
17(1), 45–54. doi:10.1177/0739456X9701700105 Abdullah, A. Kamalruddin Shamsuddin, & Abdul-
lah, M. F. (Eds.). (2004). Applications of planning
Klosterman, R. E. (1999). New perspectives on and decision support systems. Kuala Lumpur:
planning support systems. Environment and Bureau of Consultancy & Entrepreneurship, In-
Planning. B, Planning & Design, 26, 317–320. ternational Islamic University Malaysia.
doi:10.1068/b260393
Abdullah, M. F., Abdullah, A., & Rustam Khairi
Klosterman, R. E., & Abdullah, A. (2008). Com- Zahari. (2009). GIS application in monitoring
puter technologies in urban planning. Paper development plan implementation in Malaysia:
presented at the Medina Forum Seminar on Devel- weaknesses and potential. Paper presented at the
oping and Deploying Planning Support Systems. 11th International Conference on Computers in
Lee, L. M., & Ahmad, M. J. (2000). Local authority Urban Planning and Urban Management.
networked development approval system. Paper Brail, R. K., & Klosterman, R. E. (Eds.). (2001).
presented at the Planning Digital Conference. Planning support systems: integrating geographic
Maklumat, B. T. (2006). Sistem pengurusan data information systems, models, and visualization
JPBD Semenanjung Malaysia. Buletin Geospatial tools. Redlands, CA: ESRI Press.
Sektor Awam, 1/2006, 26–30. Bruton, M. J. (2007). Malaysia: the planning of
Mennecke, B. E., & West, L. A. J. (2001). Geo- a nation. Kuala Lumpur: PERSADA.
graphic Information Systems in Developing Coun- Budic, D. Z. (1993). GIS use among southeast-
tries: Issues in Data Collection, Implementation ern local governments. Journal of Urban and
and Management. (Technology Information). Regional Information Systems Association, 6(6),
Journal of Global Information Management, 529–546.
9(4), 44–54.
Geertman, S., & Stillwell, J. (Eds.). (2003). Plan-
Ramasubramanian, L. (1999). GIS Imple- ning support systems in practice. Heidelberg,
mentation in Developing Countries: Learning Germany: Springer.
from Organisational Theory and Reflective
Practice. Transactions in GIS, 3(4), 359–380. Goh, B. L. (1990). Urban planning in Malaysia.
doi:10.1111/1467-9671.00028 Petaling Jaya, Malaysia: Tempo Publishing.

Samat, N. (2006). Applications of geographic Huxhold, W. (1991). An introduction to urban


information systems in urban land use planning geographic information systems. New York:
in Malaysia. Paper presented at the 4th Taipei Oxford University Press.
International Conference on Digital Earth.

1008
GIS Implementation in Malaysian Statutory Development Plan System

Johar, F. (2006). Environmental sustainability in Munir, A. B., & Zulhuda, S. (2007). Becoming
selected local plans in Malaysia. In Kadouf, H. e-cities: legal issues and challenges. Paper pre-
A., & Al-Junid, S. Z. (Eds.), Land use planning sented at the 2nd International Symposium on
and environmental sustainability in Malaysia: Knowledge Cities, Shah Alam.
policies and trends (pp. 255–274). Kuala Lumpur:
Nedovic-Budic, Z. (1998). The impact of GIS tech-
Research Centre, International Islamic University
nology. Environment and Planning. B, Planning
Malaysia.
& Design, 25, 681–692. doi:10.1068/b250681
Kadouf, H. A., & Al-Junid, S. Z. (Eds.). (2006).
Rogers, E. (1983). Diffusion of Innovations (3rd
Land use planning and environmental sustain-
ed.). New York: Free Press.
ability in Malaysia: policies and trends. Kuala
Lumpur: Research Centre, International Islamic Shamsuddin, K. (Ed.). (2006). Implementing
University Malaysia. multi-criteria decision making in Malaysian town
planning. Kuala Lumpur: Federal Department of
Kadouf, H. A., & Maidin, A. J. (2006). Theory
Town and Country Planning Peninsular Malaysia.
and practice in land use planning in Malaysia:
an overview. In Kadouf, H. A., & Al-Junid, S. Stillwell, J., Geertman, S., & Openshaw, S. (Eds.).
Z. (Eds.), Land use planning and environmental (1999). Geographical information and planning.
sustainability in Malaysia: policies and trends Heidelberg, Germany: Springer.
(pp. 77–113). Kuala Lumpur: Research Centre,
Vonk, G., Geertman, S., & Schot, P. (2005).
International Islamic University Malaysia.
Bottlenecks blocking widespread usage of plan-
Kaniclides, A., & Kimble, C. (1995). A develop- ning support systems. Environment & Planning
ment framework for executive information system. A, 37, 909–924. doi:10.1068/a3712
Paper presented at the GRONICS ‘95, Groningen,
Wan Ismail, W. H. (2007). Pematuhan peran-
The Netherlands.
cangan (teori). Paper presented at the Mesyuarat
Kasmin, A. M. (1999). Kerangka pelaksanaan Pegawai Kanan Perancang Bandar dan Desa
dasar rancangan struktur. Shah Alam, Malaysia: Malaysia ke 21.
Pusat R&D, Pejabat Setiausaha Kerajaan Negeri
Wegener, M. (1998). GIS and spatial planning.
Selangor.
Environment and Planning. B, Planning & De-
Kassim, N. S. (2003). Konsep dan proses e- sign, 25, 48–52.
submission dalam permohonan perancangan.
Yaakup, A. (2004). GIS as a tool for development
Malaysian Townplan, 1(1), 19–25.
planning and monitoring. Paper presented at the
Matori, A. N. (2007). Development of a user NGIS 2004.
interface application for information delivery - a
Yaakup, A., Abu Bakar, S. Z., & Sulaiman, S.
case study of Ipoh City Council’s Department
(2009). Decision support systems for urban
of Town Planning. Buletin Geospatial Sektor
sustainability in Malaysia. Malaysian Journal
Awam, 1, 24–29.
of Environmental Management, 10(1), 101–118.
Muhammad, Z. (2000). Integrated rural planning
Yaakup, A., Johar, F., Maidin, M. A., & Ahmad,
and development approach - the Malaysian experi-
E. F. (2004). GIS and decision support systems
ence. Habitat Malaysia, 3, 1–34.
for Malaysian development plan studies. Paper
presented at the International Seminar on SPDSS
2004.

1009
GIS Implementation in Malaysian Statutory Development Plan System

KEY TERMS AND DEFINITIONS ing, controlling and determining the planning and
development direction of its area.
Decision Support Systems: Computerized State Department of Town and Country
systems that can be used to help users in making Planning: Each state in the Peninsular Malaysia
decision over complex and ill-structured problems. has its own Town and Country Planning Depart-
Development Plan: A plan which is usually ment which oversees the planning and develop-
prepared by planners to guide or promote de- ment matters within the State boundary and pro-
velopment in an area. It usually contains maps vides advisory services to the State government.
of the area, and planning policies and proposals State Town and Country Planning Department
for the area. also acts as local planning authority, handling
Executive Information Systems: Comput- development control affairs for areas within the
erized systems that are developed to provide state, which are not under the jurisdiction of any
important information in a concise manner to local authority.
senior executives. This information can be used Statutory Development Plan: Development
by the executives in making decision over a busi- plan which preparation is required by law.
ness problem.
Federal Department of Town and Country
Planning: Is the planning authority at the Federal ENDNOTES
level that is responsible in charting the general
direction of planning and development for the 1
States in East Malaysia (e.g., Sabah and
nation. Its functions also include promoting town Sarawak), and the three Federal Territories
and country planning in Malaysia and providing (e.g., Kuala Lumpur, Putrajaya and Labuan)
advisory services to the Federal Government and in Malaysia are not bounded by the Town
planning authorities at State and Local levels. It and Country Planning Act 1976. They, how-
also provides technical and monetary supports to ever, have their own planning acts which are
State and Local level planning authorities in the almost similar to the Act.
preparation of their development plans. 2
Charting is a process where information
Geographical Information Systems (GIS): about planning applications is ‘drawn’ onto
Computerized systems that can be used to store, standard cadastral map. Traditionally, this
analyze and display spatially-referenced data. was done by hand where draughtsman would
GIS-Based Systems: Computerized systems draw on the map the boundary of land parcel
which are developed using GIS software as the where planning applications were submitted,
platform. as well as other basic information about the
Local Planning Authority: Local planning applications. Many planning authorities have
authority of an area is usually the local authority now replaced their manual charting with
of the area, such as city hall, the municipal council GIS.
or the district council. It is responsible in regulat-

This work was previously published in Handbook of Research on E-Planning: ICTs for Urban Development and Monitoring,
edited by Carlos Nunes Silva, pp. 435-454, copyright 2010 by Information Science Reference (an imprint of IGI Global).

1010
1011

Chapter 62
3D City Modeling and
Visualization for Smart
Phone Applications
Juha Hyyppä Zhengjun Liu
Finnish Geodetic Institute, Finland Chinese Academy of Surveying and Mapping,
China
Lingli Zhu
Finnish Geodetic Institute, Finland Harri Kaartinen
Finnish Geodetic Institute, Finland
Anttoni Jaakkola
Finnish Geodetic Institute, Finland

ABSTRACT
Smartphones with larger screens, powerful processors, abundant memory, and an open operation system
provide many possibilities for 3D city or photorealistic model applications. 3D city or photorealistic
models can be used by the users to locate themselves in the 3D world, or they can be used as methods
for visualizing the surrounding environment once a smartphone has already located the phone by other
means, e.g. by using GNSS, and then to provide an interface in the form of a 3D model for the location-
based services. In principle, 3D models can be also used for positioning purposes. For example, match-
ing of images exported from the smartphone and then registering them in the existing 3D photorealistic
world provides the position of the image capture. In that process, the central computer can do a similar
image matching task when the users locate themselves interactively into the 3D world. As the benefits
of 3D city models are obvious, this chapter demonstrates the technology used to provide photorealistic
3D city models and focus on 3D data acquisition and the methods available in 3D city modeling, and
the development of 3D display technology for smartphone applications. Currently, global geoinformatic
data providers, such as Google, Nokia (NAVTEQ), and TomTom (Tele Atlas), are expanding their prod-
ucts from 2D to 3D. This chapter is a presentation of a case study of 3D data acquisition, modeling and
mapping, and visualization for a smartphone, including an example based on data collected by mobile
laser scanning data from the Tapiola (Espoo, Finland) test field.

DOI: 10.4018/978-1-4666-2038-4.ch062

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
3D City Modeling and Visualization for Smart Phone Applications

INTRODUCTION (Baltsavias, Pateraki, & Zhang, 2001; Baltsavias,


Zhang, & Eisenbeiss, 2005; Fraser, Baltsavias &
In the past, visualization of 3D information was Güen, 2002; Grün, Zhang & Eisenbeiss, 2005;
limited to personal computers. Today, the increase Jacobsen, 2004; Lee, Shan & Bethel, 2002; Poli,
in the capabilities of mobile devices (powerful 2005; Sörgel, Szhulz, Toennessen & Stilla, 2005;
processors, abundant memory, and open opera- Sörgel, Thoennessen, Brenner & Stilla 2006;
tion systems) provides possibilities for users to Sörgel, Michaelsen, Thiele, Cadario & Thoennes-
utilize 3D information for purposes such as sen, 2009; Toutin, 2004; Zhang & Grün, 2006).
navigation and location-based services (LBS). Content on 3D modeling using aerial images
“Future developments in navigation and other has been presented by the following (Ahmadi,
location-enabled solutions will rely heavily on 3D Zoej, Ebedi, Moghaddam & Mohammadzadeh,
mapping capabilities,” said Cliff Fox, executive 2010; Baillard, Schmid, Zisserman & Fitzgibbo,
vice president, NAVTEQ Maps, in November, 1999; Baltsavias et al., 2005; Bescoby, 2006; Le
2010 (NAVTEQ). Many location-based service Besnerais, Sanfourche & Champagnat, 2008; Col-
agencies (e.g. Google, Nokia (NAVTEQ), and lins et al., 1998; Cord, Jordan & Cocquerez, 2001;
TomTom (Tele Atlas)) are currently expanding Fischer et al, 1998, Fradkin, Maitre & Roux, 2001;
their products from 2D to 3D. In 2007, Google was Grün & Wang, 1998a; Jaynes, Riseman & Hanson,
the first to introduced panoramic images (360°) 2003; Jung, 2004; Kim, Park, Kim, Jung & Kim,
of street views (2D) in the company’s Google 2004; Mayer, 1999; Paparoditis, Cord, Jordan &
Earth (GE) service. In December of 2010, their Cocquerez, 1998; Peng & Liu, 2005; Suveg &
products were expanded to 3D street views and 3D Vosselman, 2004; Tournaire, Bredif, Boldo &
buildings. After several months, in April of 2011, Durupt, 2010). Literatures on 3D modeling using
collaborating with Microsoft, Nokia launched Ovi terrestrial images have been contributed by the
Maps 3D where currently twenty cities are avail- following (Fraser, 1996; Fraser & Cronk, 2009;
able in 3D views. Intense competition is usually Grün, Remondino & Zhang, 2004; Luhmann,
an indication of heavy user demand. This has 2010; Remondino, 2006). Papers related to 3D
created a practical need for rapid 3D geographi- modeling using image sequences can be obtained
cal data acquisition and modeling. The 3D laser from the following sources (Bethmann, Herd, Luh-
scanning market is also expected to double by mann & Ohm, 2009; Cornelis, Leibe, Cornelis &
2015 (Geospatial World, 2011). Van Gool, 2008; D’Apuzzo, 2003; Fitzgibbon &
Currently, there are two primary means for Zisserman, 1998; Forlani, Roncella & Remondino,
3D geographic data acquisition: photogrammetry 2005; Pollefeys et al., 2008; Remondino, 2002;
and laser scanning. Photogrammetry is the tech- Remondino, 2004; Tian, Gerke, Vosselman &
nology of deriving 3D data from 2D images by Zhu, 2010). However, despite the research efforts
mono-plotting (using the existing height model), focused on photogrammetry over many decades,
stereo-imagery interpretation or multi-imagery the level of automation is still relatively low.
block adjustment. According to different vehicle The current technology in laser scanning (LS)
platforms for optical sensors, 2D images can offers an alternative solution for the acquisition
be separated into several categories: satellite of 3D geographic data. LS is also referred to as
images, aerial images, and terrestrial images as Light Detection And Ranging (LiDAR) because
well as image video sequences. Methodologies it uses a laser to illuminate the earth’s surface and
for 3D modeling from satellite images can be a photodiode to register the backscatter radiation.
found in numerous publications, and the follow- Typically, the time it takes for the laser beam to
ing is a representative sample of such documents reach the target and come back to the source (de-

1012
3D City Modeling and Visualization for Smart Phone Applications

lay) is used to measure the distance to the target, are primarily used for the reconstruction of man-
i.e. with the speed of the light. After GPS, IMU, made targets, e.g. in the fields of architecture,
and scanning mechanism were attached to laser civil engineering, archaeology, cultural heritage,
ranging measurements, first for military purposes and robotics (Vosselman & Maas, 2010). TLS is
in 1980s, and later on for surveying purposes, a feasible alternative when carrying out detailed
the field of science currently known as Airborne surveys over small areas, with typically radii for
Laser Scanning (ALS) was born. The typical high-density scans being less than 20-30 meters.
operating wavelength range used in LiDAR is Approaches applied in 3D modeling using TLS
between 250 nm -1600 nm, and in LS the most data are presented in the following examples of
common wavelength range is close to 1000 nm sources (Arayici, 2007; Buckley, Howell, Enge
e.g. 905nm or1064nm. However, they are easy & Kurz, 2008; Pu, 2008; Pu & Vosselman, 2009).
to be absorbed by the eye. An alternative laser Mobile Laser Scanning (MLS), also called
with 1550nm wavelength is eye-safe at much Mobile Terrestrial Laser Scanning, is currently
higher power levels. Commercial ALS systems a rapidly developing area in LS with laser scan-
have been available since 1994. Currently, ALS ners, GNSS and IMU, being mounted onboard a
systems are extremely feasible solutions for map- moving vehicle.
ping large areas with a high degree of accuracy, MLS can be considered to fill the gap between
e.g. to determine elevations and forest models. ALS and TLS. In MLS, data collection can be
Elevation accuracies of 5-10 cm are common, performed either in the so-called stop-and-go mode
whereas planimetric accuracies range between or in a continuous mode. The stop-and-go mode
20-80 cm, depending on the flying height and corresponds to conventional TLS measurements,
IMU characteristics. Nation-wide LS data acqui- and therefore MLS, hereafter, is used to refer to
sition projects for multipurpose uses have been the continuous model, i.e. the use of continuous
conducted in or are envisaged in several countries. scanning measurements along the drive track. In
Elevation models, stand-wise forest inventories, addition to laser scanners, MLS data acquisition
and corridor mapping are good examples of sensors can include accessories such as digital
commercial activities. In the Nordic Countries, cameras or video cameras. Research results about
stand-wise forest inventory is being increasingly 3D modeling from MLS have been addressed
assisted by ALS, which has the effect of reduc- by researchers such as the following (Brenner,
ing the costs of inventory data. Literature on 3D 2009; Früh & Zakhor, 2004; Haala, Peter, Kremer
modeling using ALS data can be found in sources & Hunter, 2008; Jaakkola, Hyyppä, Hyyppä &
such as the following (Ackermann, 1999; Baltsav- Kukko, 2008; Jaakkola et al., 2010; Zhao & Shi-
ias, 2004; Dorninger & Pfeifer, 2008; Hyyppä & basaki, 2003; Zhao & Shibasaki, 2005; Zhu et al.,
Engdahl, 2000; Hyyppä, Matikainen, Kaartinen, 2009; Zhu, Hyyppä, Kukko, Kaartinen & Chen,
Yu, Hyyppä & Rönnholm, 2007; Hyyppä et al., 2011). However, the point density of laser data has
2008; Hyyppä et al., 2009; Kada & Mckinley, great impact on the methodological development
2009; Melzer, 2007; Rottensteiner, Trinder, Clode and quality of the produced 3D models.
& Kubik, 2005a; Sampath & Shan, 2010; Sohn, The integration of photogrammetry and laser
Huang & Tao, 2008; Verma, Kumar & Hsu, 2006; scanning provide new technology solutions for
Vosselman, Gorte, Sithole & Rabbani, 2004). 3D modeling. By combining the good height-
Terrestrial Laser Scanning (TLS) differs from assessment accuracy of a laser scanner and the
ALS in that the scanner is mounted in a fixed good planimetric accuracy of aerial images, both
position, i.e. on a tripod, and ranging takes place high accuracy and higher automation can, in
across a large spherical angle-of-view. The scans theory, be obtained (Kaartinen & Hyyppä, 2006).

1013
3D City Modeling and Visualization for Smart Phone Applications

The use of data integration greatly improves the plied without some adaptation measures. Although
level of automation (Brenner, 2005). Method- NAVTEQ and some other navigation companies
ologies for data fusion from photogrammetric have supported 3D landmark visualization in
images and laser scanning have been contributed their navigation devices, the development of a
by researchers such as the following (Awrangjeb, high performance, photo-realistic 3D visualiza-
Ravanbakhsh & Fraser, 2010; Becker & Haala, tion engine to support smartphone navigation and
2007; Brenner, 2005; Früh & Zakhor, 2004; Haala LBS is still a work-in-progress (Bodgan & Coors,
& Kada, 2010; Habib et al., 2010; Kaartinen & 2009; Coors, 2008; Goetz & Zipf, 2010).
Hyyppä, 2006; Ma, 2004; Pu & Vosselman, 2009; This chapter endeavors to contribute to the
Sohn & Dowman, 2007; Vosselman, 2002; Vu, procedure of 3D city model reconstruction, and
Yamazaki & Matsouka, 2009). also to the state-of-the-art of 3D modeling and the
The development of desktop mapping applica- visualization aspects, e.g. smartphone 3D visual-
tions with 3D geo-visualization capabilities has ization for navigation and LBS approaches and
been explored over its long history (Koller et al., techniques. The complete workflow of 3D model
1994). Over the past 10 years, various 3D visual- reconstruction and visualization from data acquisi-
ization systems for geographical data have been tion to photorealistic 3D model reconstruction and
developed. Many commercial GIS and remote- also to the implementation of 3D engine as a case
sensing software vendors such as ESRI (ArcGIS study is also presented. In the last section, future
ArcScene and ArcGlobe) and ERDAS (IMAGINE research aspects and challenges are discussed.
VirtualGIS) have integrated 3D mapping facilities
to facilitatew visualization and spatial analyses.
Other companies and agencies, e.g., Google ACQUISITION OF GEOMETRY
(Google Earth) and NASA (WorldWind), provide FOR 3D CITY MODELS
online 3D virtual globe visualization software for
free viewing of the 3D Earth environment and use The complete process of 3D city model recon-
of location-based services. Techniques supporting struction covers two steps: acquisition of geom-
very large scenes, very high-resolution satellite etry and texture mapping. The approaches for
images and aerial photos with high visualization acquisition of geometry heavily rely on the data
performance, include the use of multi-resolution sources. Currently two primary means are widely
terrain models and high level of detail (LOD). used for geo-data acquisition: laser scanning and
Examples of studies looking into these are (Bösch, photogrammetry. In the following sections, the
Goswamiz & Pajarolax, 2009; Garland, 1999; basics of laser scanning and photogrammetry
Hoppe, 1998; Lindstrom & Pascucci, 2001; Livny, and the methodologies for 3D building geometry
Sokolovsky, Grinshpoun & El-Sana, 2008; Sch- extraction and reconstruction will be introduced.
neider & Westermann, 2006; Ulrich, 2000). As In addition, some useful commercial software and
regards quad-tree-based or oc-tree-based scene systems and also some free open sources will be
management and tile based terrain rendering, addressed.
the following researchers have looked into them
(Amara & Marsault, 2009; Blow, 2000; Livny, Basics of Laser Scanning
Kogan & El-Sasa, 2009).
Since a mobile device has limited hardware In order to understand the basics of laser scanning,
resources and far less graphical computation in the following we will summarize the operation
power built into it, the 3D visualization techniques principle of laser scanners, highlight major laser
developed for desktop computers cannot be ap- scanning parameters and talk about basics of point

1014
3D City Modeling and Visualization for Smart Phone Applications

cloud processing (Hyyppä, Wagner, Hollaus, & (Wagner, Ullrich, Ducic, Melzer & Studnicka,
Hyyppä, 2009). 2006) compared to previously developed large-
footprint systems (Blair, Rabine & Hofton, 1999;
Basic Operation Principle Garvin, 1996; Rabine & Martin, 1996) mainly used
to test space-borne LiDAR techniques.
The laser scanning is a LiDAR-based method for
acquiring range measurements typically from a Laser Scanning Parameters
moving platform and the precise orientation of
these measurements. Short laser pulses (4-10 ns) Important technical parameters to characterize the
are emitted with a high frequency (e.g. 100-400 physical properties of laser scanner systems are
kHz) and are continuously deflected in across- the laser wavelength (μm), pulse duration (ns),
flight direction using various scanning mecha- pulse energy (mJ), pulse repetition frequency/
nisms. The position and rotation of the sensor are rate (PRF, PRR) (kHz), beamwidth (mrad), scan
continuously recorded along the flight path using angle (deg), scan rate (Hz), flying height (m), and
a Global Navigation Satellite System (GNSS) size (m) of laser footprint on the ground. Some of
and an Inertial Measurement Unit (IMU). The these parameters related to the data acquisition,
recorded measurements of sensor position and which can be affected by the user are as follows:
orientation, beam deflection and range can be The flight altitude and the scan angle - for
converted to a georeferenced three-dimensional a selected laser scanner, the main user-selected
(3D) point cloud representing the measured tar- parameters are flight altitude and maximum scan
gets. Since it is a-priori not known which targets angle (FOV/2, Field of View). Flight altitudes
generated the echoes, i.e., the points are dummy can vary between 10 to 6,000 m, depending on
in that sense, typically surface models or points the systems and flying platform (UAV, helicopter,
forming the models are required for converting aircraft). The flight altitude also determines the
the 3D point cloud into geospatial data products PRF of systems, which previously allowed one
such as Digital Terrain Models (DTM), Digital pulse in the air and presently increasingly allow-
Surface Models (DSM), Canopy Height Model ing multiple pulses in the air. The type of scanner
CHM or normalized Digital Surface Model used (scanning mechanism) and, for a fixed PRF
(nDSM) (CHM=nDSM=DSM-DTM). These and scanner rate, the maximum scan angle and
basic geospatial products are increasingly used in flying height determines the area coverage and the
a broad range of applications, e.g. in hydrology, distribution and density of points for a given pulse
construction engineering, infrastructure monitor- ALS system. The density of the pulses on the target
ing, and forestry. When using points, filtering is the main parameter affecting the quality of the
techniques for point classification has to be ap- surveyed data. Increase in flight height increases
plied. Overviews of laser scanning can be found planimetric errors (errors in plane direction (X-Y)
in (Baltsavias, 1999; Fowler, Samberg, Flood & are mainly due to IMU).
Greaves, 2006; Wehr & Lohr, 1999). As ALS is Pulse repetition frequency - In 1993, the pulse
a rapidly growing technology, several technical rate (pulse repetition frequency, PRF) was 2 kHz,
improvements have occurred since these reviews while the best systems in 2007 provided samples
have been published. Specifically, the sampling at a rate of 150-200 kHz. Today, some ALS sys-
frequency, positioning accuracy, and number tems allow multiple pulses in the air or are using
of recorded echoes have been improved during several laser guns to send multiple pulses. Those
the last years. Also, small-footprint waveform- systems having only one pulse in the air, they are
digitizing ALS systems have become available

1015
3D City Modeling and Visualization for Smart Phone Applications

limited by maximum PRF from higher altitudes expected DTM level and the corresponding laser
(the speed of light acts as a limitation). point. The method is implemented in SCOP++
Beam size – the effect of beam size on the (Kraus & Otepka, 2005). Axelsson (2000) de-
quality of the model obtained has not been well veloped a progressive TIN densification method
studied and some findings are contradictory. The which is implemented into TerraScan software.
ability to detect either the ground or tree crowns A comparison of the filtering techniques used
depends not only on the beam size but also on for DTM extraction can be found in a report on
the signal strength and receiver sensitivity, the ISPRS comparison of filters (Sithole & Vossel-
pulse detection algorithm and the proper sampling man, 2004). Selection of the filtering strategy is
(footprint size versus sampling density). not a simple process. In practice, the amount of
interactive work determines the final quality of
Basics of Point Cloud Processing the product.
Classification of building points – Previous
Laser points have coordinate, echo, reflectance, research has shown that automatic detection of
and time information to be used in the classifica- buildings from laser scanner data is possible with
tion. Statistical classification methods are com- relatively good accuracy (e.g. Forlani, Nardinoc-
monly used. Points can be divided into various chi, Scaioni & Zingaretti, 2006; Hug, 1997; Mati-
classes using intensity values, echoes (e.g. only, kainen, Hyyppä & Hyyppä, 2003; Rottensteiner
the first of many, intermediate or last pulse), et al., 2005a, Rottensteiner, Trinder, Clode &
height information, aerial/hyperspectral image Kubik, 2005b; Vögtle & Steinle, 2003; Zhang,
features of the same point or time. Height can be Yan & Chen, 2006). Different types of features
relative to the ground or absolute values. Various have been used to separate buildings and vegeta-
filtering techniques have been developed for low tion, including local co-planarity, height texture or
point detection, power line, building, and ground surface roughness, reflectance information from
classification purposes. the images or laser scanning, height differences
The classification of low points - Removal of between the first pulse and last pulse laser scanner
low points is an important pre-processing part data, and shape and size of objects.
of data analysis and is usually done before the
ground classification. Low points are below the Basics of Photogrammetry
ground surface and their origin may be manholes
on the street, multiple reflections from trees or The fundamental task of photogrammetry is that
buildings or spreading of the too-strong returns 3D information is recovered from 2D images of a
in the receiver. A single or group of points may scene. When an image is taken from a 3D scene,
cause an anomaly in the correct ground surface if depth information is lost. Reconstructing 3D ob-
not removed before ground point classification. jects from 2D images by the means of photogram-
The classification of ground points – photo- metry usually needs georeferencing image space
grammetrists have developed various methods points to object space points. When images are
for obtaining DTMs from laser scanning point georeferenced to object space coordinate system,
clouds. For example, Kraus and Pfeifer (1998) interior orientation (IO), and exterior orientation
developed a DTM algorithm for which laser points (EO) of images should be determined. Generally
between terrain points and non-terrain points were speaking, IO can be derived from camera calibra-
distinguished using an iterative prediction of the tion of laboratory. Therefore the determination of
DTM and weights attached to each laser point, EO is a crucial step for georeferencing in any type
depending on the vertical distance between the of imagery from terrestrial, airborne or satellite

1016
3D City Modeling and Visualization for Smart Phone Applications

platforms. In photogrammetry, the georeferencing a 3D transformation, for instance, in the process


methods have direct georeferencing and indirect of absolute orientation, 3D model coordinates are
georeferencing according to different ways in transformed to object space coordinates.
deriving EO of images. When information is at- The 2D Projective transformation using eight
tained directly by means of measurements from parameters is typically used for the transforma-
sensors on-board vehicle, this approach is called tion between images or between an image and a
direct georeferencing (Skaloud, 1999). As regard planar object.
to indirect georeferencing method, EO of the image
is determined by performing block adjustment or a1 X + b1 Y + c1
stereo model set-up. The technological develop- X* =
a 0 X + b0 Y + 1
ment in the navigation, positioning, and orientation
systems, results that the direct georeferencing is
widely used. However the fundamental 2D or 3D a 2 X + b2 Y + c 2
Y* = (2)
transformation methods and collinearity equations a 0 X + b0 Y + 1
are still frequently utilized, especially when differ-
ent data sources for consistent 3D modeling and
From a single image geometry, it shows the
mapping e.g. automated texture mapping. In the
fundamental characteristic of the frame image,
following part, some widely used transformation
which is that the perspective centre, the image
methods are introduced.
point and the corresponding object point are on a
Generally, coordinate transformation in 3D
straight line in 3D space. It is also the elementary
can be decomposed to a single effect, such as
condition of collinearity equations.
translation, uniform scale, rotation, reflection,
Collinearity equations (from Mikhail, 2001):
non-uniform scale etc. In photogrammetry, 3D
linear transformation usually is limited to seven
x –xo =
parameters transformation composed of a uniform
scale, three translations, and three rotations. a11 (X − Xl ) +a 21 (Y −Yl ) +a 31 (Z − Zl )
−f 
a13 (X − Xl ) +a 23 (Y −Yl ) + a 33 (Z − Zl )
X *  X  t 
     x
Y *  = kM Y  + t  (1)
     y
 *     y −yo =
Z 
   Z  tz 
a12 (X − Xl ) +a 22 (Y −Yl ) +a 32 (Z − Zl )
−f 
a13 (X − Xl ) +a 23 (Y −Yl ) + a 33 (Z − Zl )
Where X, Y, Z --- original coordinates; X * , Y * ,
Z * --- the transformed coordinates; k --- scale (3)
factor; tx , ty ,tz --- translation factors; M --- rota-
Where
tion matrix; aij --- coefficient of rotation matrix,
f --- calibrated focal length;
i =1, 2, 3;
aij --- coefficient of rotation matrix, i =1, 2, 3; j
j = 1, 2, 3. = 1, 2, 3;
Xl , Yl , Zl --- perspective centre coordinates in
Typical seven parameters transformation is also object space coordinate system;
called Helmert transformation. It is often used in

1017
3D City Modeling and Visualization for Smart Phone Applications

X, Y, Z --- point coordinates in object space; x, y consisting of using aerial images and those con-
--- point coordinates in image plane; sisting of laser scanning and hybrid methods.
x o , yo --- image principal point coordinates.
Building Extraction Using Aerial Images
The object space coordinates can be calculated
from collinearity equations when initial Z value Digital aerial photogrammetry supports accurate
is assumed known: measurement of points and structures, which are
usually defined by a human operator (Brenner,
X − Xl = 2005). Manual work is time consuming and
a11 (x − xo ) + a12 (y − yo ) + a13 (−f ) expensive even when using today’s state-of-the-
(Z − Z ) art digital workstations, so a lot of emphasis has
a 31 (x − xo ) + a 32 (y − yo ) + a 33 (−f )
l
been placed on the development of automation
in data acquisition. Despite significant research
Y −Yl = efforts in the past, the low degree of automation
achieved has remained the major problem. Thus,
a21 (x − xo ) + a22 (y − yo ) + a23 (−f )
(Z − Zl ) the majority of development work has focused on
a 31 (x − xo ) + a 32 (y − yo ) + a 33 (−f ) semi-automatic systems, in which, for example,
(4) recognition and interpretation tasks are performed
by the human operator, whereas modeling and pre-
Collinearity equations are very famous in pho- cise measurements are supported by automation.
togrammetry. It can be applied to various tasks, In the following some approaches of how to apply
such as space intersection, space resection, and automation in building extraction using aerial
the relative orientation and so on. images are briefly described (see also Table 1).
On automatic approaches the 2D line segments
Methodologies for 3D can be extracted automatically from two (stereo
Building Geometry Extraction pair) or several aerial images covering the same
and Reconstruction area. These 2D segments can be searched in the
whole image area or regions of interest (ROI),
We have divided the methods for 3D building which can be given by the operator or found by
geometry extraction and reconstruction to those using existing ground plans or by using image

Table 1. Features of presented approaches using aerial images

Approach No of Feature search area Extracted features Constructed 3D Additional image


images primitives information used
Haala, 1996 2 ROIs determined by 2D lines 3D rectangles Grey value variance be-
DSM tween images
Henricsson & N Given by operator 2D lines Planes Color attributes for differ-
Baltsavias, 1997 ent sides of 2D lines
Fischer et al., 1998 N Segmentation of whole Points, lines, and 3D corners
images areas
Baillard & Zisser- 6 Whole images 2D lines Half planes to both Radiometry used for line
man, 1999 sides of 3D lines grouping, half-plane com-
puting, and verification
Süveg & Vossel- 2 2D ground plans 2D corners 3D corners Gradients and texture used
man, 2004 in solution evaluation

1018
3D City Modeling and Visualization for Smart Phone Applications

matching to generate DSM, which is then used are used to select one model from the possible
to find ROIs. After the 2D features are extracted reconstruction results (Brenner, 2005).
from the images, they are constructed to 3D fea- Henricsson and Baltsavias (1997) approach
tures. The approaches described here use either extracts generic planar patches, which are then
3D rectangles, matched planes or 3D corners as assembled to complete roofs. A generic 3D patch
building primitives, which are then refined and is non-vertical, roughly planar and encloses a
assembled to create extracted buildings. compact 2D polygonal enclosure with similar
In Haala (1996), as shown in Figure 1, a stereo photometric and chromatic attributes along its
pair is used to create a DSM using image match- boundary. The approach is depicted in Figure 2.
ing. This DSM is used to find ROIs, areas where The operator gives a ROI in the master image
buildings may exist; and to create a disparity map where 2D line segments are extracted. Extracted
between the left and right image of a stereo pair. segments from this one image are then matched
2D line segments are extracted from image ROIs. to other images by maximizing an “edginess
The disparity map is used to group the 2D line measure” along the epipolar line. The edginess
segments to 3D segments taking into account the measure is a function of the gradient (magnitude
orientation of the 2D segments, e.g. by grouping and direction) in the other images. Geometric and
them to longer lines or to parallel lines. These photometric constraints are also used to reduce
3D segments are grouped in 3D rectangles. 3D the number of mismatches. This creates a set of
rectangles are then used to obtain approximate 3D segments, which are used to hypothetical
parameters for a saddleback building, and these planes. A similarity grouping based on the similar-
parameters are then improved by using an esti- ity in proximity, orientation, and photometric and
mation, which minimizes the distance between chromatic region attributes (Henricsson, 1996) is
the 3D line segments and the line segments of used to find the most evident and consistent planes.
a saddleback wire-frame model. The estimation Vertical walls are added afterwards by projecting
error and the grey value variance of the images the eaves of the roof to a DTM. Along the ex-

Figure 1. Building reconstruction according to Haala (1996) (Figure based on Brenner, 2005)

1019
3D City Modeling and Visualization for Smart Phone Applications

Figure 2. Building extraction according to Henricsson and Baltsavias (1997) (Figure based on Brenner,
2005)

tracted contours, color attributes are computed case, basic features are points, lines, and regions.
for the left and right flanking regions. This infor- They can be extracted by a polymorphous seg-
mation is used during the stereo matching, planar mentation from the images (Fuchs, 1998). Corners
grouping, and similarity grouping. are aggregated features (one level up in the hier-
Fischer et al. (1998) propose a model-based archy), consisting of at least three intersecting
hierarchical concept for modeling. A generic planes, a corresponding number of line segments
modeling approach is employed which relies on and a single point. Corners can be obtained on
the well-defined combination of building part the basis of a polymorphous segmentation (Lang,
models. Starting from low-level image features, 1999). Still one level up, an aggregation of corners
data-driven and model-driven processes are com- into object parts takes place, in this case into
bined within a multilevel aggregation hierarchy, simple building primitives called ‘connectors’ and
thereby using a tight coupling of the 2D image `terminals’. The aggregation of those object parts
and 3D object modeling and processing. This finally leads to objects (which themselves are part
formalizes the idea of an early transition to 3D, of the scene, which is the highest level (Brenner,
as realized in the previously presented approach- 2005). The building extraction strategy consists
es of Haala (1996) and Henricsson and Baltsavias of three main tasks: 1) reconstruction of 3D cor-
(1997). The coupling on each layer does not only ners (2D to 3D); 2) Generation of building hy-
involve a projection of the geometry, but also potheses (3D to 3D); and 3) Verification of build-
propagates constraints and quality measures. The ing hypotheses (3D to 2D). In step 1 the
following layers were given as hierarchy levels extracted 2D corner hypotheses of different im-
(Braun et al., 1995): scene, object, object part, ages are used to derive 3D corner reconstructions
aggregated feature, feature, voxel. Accordingly, by multi-image parameter estimation. In step 2
for image space: image, aspect, aspect part, ag- the 3D corners are used for indexing into a library
gregated feature, feature, and pixel. This hierarchy of parameterized building parts, which explain
was partly realized for building corners. In this these corners. The indexed building parts are ag-

1020
3D City Modeling and Visualization for Smart Phone Applications

gregated to complete building hypotheses by the images. Since the half planes are constrained
successive steps of merging and connecting build- by the 3D line segment through which they pass,
ing parts. In step 3 the 3D building hypotheses only their slope has to be determined. This is done
are projected back into the original aerial images by a search over all possible angles (the geometric
and compared to originally extracted features. transformation is given by the plane equation and
The degree of similarity is computed and hypoth- the exterior orientation of images), computing
eses either confirmed or rejected. Furthermore, a heuristic correspondence measure of image
geometric reasoning of the images may derive similarity; if the plane is correct the intensities at
previously undetected image features. These three corresponding pixels will be highly correlated. The
steps are repeated until no further hypotheses can correspondence measure is based on a weighted
be generated. Verified building hypotheses predict correlation between all of the images. The correla-
earlier unobserved 2D primitives giving addi- tion is computed in the neighborhood of points of
tional information for reconstructing previously interest, which are selected by applying an edge
undetected corners. This initiates a further itera- detector to one image using a very low threshold
tion of the three tasks. of the gradient. After half planes are detected, line
The key idea in the approach by Baillard segments and half planes are grouped on the basis
and Zisserman (1999), as shown in Figure 3, of collinearity and coplanarity, which reduce the
is to determine 3D planar facets by using both total number of planar patches. Additional lines
linear features and their image neighborhoods are introduced by intersecting existing planar
over multiple views. The advantage compared to patches and the extent of each patch is obtained
other approaches is that by using the dihedral line from a heuristic grouping. Each surface is then
and left/ right image patches in its vicinity, only verified using a similarity measure computed
relatively local information is exploited (Brenner, over all images.
2005). Six overlapping images are used to extract Süveg and Vosselman (2004) use existing 2D
2D line segments, which are generated to 3D line ground plans to constrain possible solutions. First,
segments by using a line grouping (Baillard et al., to be able to project the ground plan to the correct
1999) based on the geometry and radiometry of location in the images, an approximate height is

Figure 3. Building extraction according to Baillard and Zisserman (1999) (Figure based on Brenner, 2005)

1021
3D City Modeling and Visualization for Smart Phone Applications

obtained by projecting the given ground plan to by operator guidance and by image matching,
the images and shifting it along the epipolar line which is used to find corresponding points on the
and evaluating the score function based on the other image automatically.
length of the extracted image lines inside the The above methods of building extraction
ground plan area. In a second step, the ground serve for nadir aerial images. Currently one of
plans are partitioned into rectangles, which are the highlighted 3D modeling technologies is so
then considered separately. As in Fischer et al. called C3 technology. It utilizes oblique aerial /
(1998), 3D corners are used as primitives. They UAV images for automatic 3D model generation.
are obtained from 2D corners, extracted from the It makes a great progress in the level of automa-
images using Föstner’s operator. Extracted 2D tion. As it is said, ‘C3’s models are generated with
corners are labeled as wall or roof corners; if a little human intervention. First, a plane equipped
corner may correspond to a wall primitive it is with a custom-designed package of professional-
labeled as a wall corner, otherwise it is labeled as grade digital single-lens reflex cameras takes
a roof corner. This labeling together with geomet- aerial photos. Four cameras look out along the
ric constraints is used when matching the 2D main compass points, at oblique angles to the
corners from different images to obtain 3D corner ground, to image buildings from the side as well
primitives. Using 3D corners and the 2D ground as above. Additional cameras capture overlap-
plan, an initial estimation for 3D volumetric ping images from their own carefully determined
primitives is set up. This estimation is then refined angles, producing a final set that contains all
using a gradient-weighted least square fitting in the information needed for a full 3-D rendering
the images. The fit is evaluated using a metric of a city’s buildings. Machine-vision software
based on two measures: the contour measure, developed by C3 compares pairs of overlapping
which checks for large image gradients along the images to gauge depth, just as our brains use stereo
wire frame model projected to the image; and the vision, to produce a richly detailed 3-D model’
texture measure, which evaluates intensity differ- (Technology Review of C3 technologies, 2011).
ences over two images. Finally, all 3D primitives
of a building are estimated again simultaneously, Building Extraction Using Laser
taking into account the relations between the Scanner Data and Hybrid Methods
primitives, which are introduced as weighted
constraints into the estimation. (Brenner, 2005) Due to the development of scanning systems and
On semi-automatic systems the operator improvements in the accuracy of direct georef-
measures 3D points in a predefined way. In Cy- erencing, airborne laser scanning (ALS) became
berCity Modeler (Grün & Wang, 1998a; Grün a feasible technology to provide range data in
& Wang, 1998b; Grün & Wang, 1999a; Grün & the early 1990s. At that time, ALS was already
Wang, 1999b) manual stereo measurement is used considered as a mature technology (Baltsavias,
to collect the roof points in a certain order and 1999). ALS provides dense point clouds (more
point coding, and this information is used in an than 10 returns per m2 are possible today) with 3D
automatic roof topology generation. In inJECT coordinates. This makes range data segmentation
(Gülch & Müller, 2001) the operator selects from relatively easy (Brenner, 2005).
predefined building models and adapts this model The integration of laser point clouds and
to match the building being measured by placing photogrammetric processes with aerial photos
a few points to the roof of the building in the other also provides new technological solutions. By
image of a stereo pair. Automation is applied to combining the good height assessment accuracy
minimize the amount of manually measured points of a laser scanner and good planimetric accuracy

1022
3D City Modeling and Visualization for Smart Phone Applications

of aerial images, both high accuracy and higher some approaches of how to apply automation in
automation can in theory be obtained. However, building extraction using laser scanner data with
despite the progress that has been made with the or without aerial images are briefly described
integrating laser scanning systems and digital (see also Table 2).
images, automated processing of the resulting Brenner and Haala (1998) approach uses in-
datasets is at a very early research stage (Brenner, terpolated raster DSMs (usually from laser scan-
2005). ning) and 2D ground plans as data sources for an
Approaches for building extraction using laser automatic and/or semiautomatic reconstruction
scanner data either use laser scanner data as it process. First, ground plans are divided into rect-
is (point clouds) or regularized data in raster or angular primitives using a heuristic algorithm.
grid format. Whereas automatic building extrac- For each of the 2D primitives, a number of dif-
tion methods based on aerial imagery typically ferent 3D parametric primitives from a fixed set
concentrate in finding edges, methods based on of standard types are instantiated and their optimal
laser scanner data typically concentrate on finding parameters are estimated. The best instantiation
planes, and the edges are detected either as plane is selected based on area and slope thresholds and
intersections, height jump edges in DSM or given the final fit error. The 3D primitive selection and
as additional data (ground plans). Therefore it is parameters can be changed later on using a semi-
well grounded that hybrid systems integrating the automatic extension. A human operator can use
power of edge detection of aerial imagery and plane aerial images for the refinement of the automatic
determination accuracy of laser scanner data are reconstruction when semi-automatic post-pro-
developed. This is coming more evident as digital cessing is performed. Rectangles of ground plan
cameras are integrated with modern laser scan- decomposition can be modified or added interac-
ning systems. Aerial images can also be utilized tively, and these rectangles are again used for 3D
in creating a realistic surface texture to model primitive matching. This interactive mode can
buildings and giving additional information to also be used if no ground plan is available. The
laser scanner data interpretation. In the following

Table 2. Features of presented approaches using laser scanner data with or without aerial images

Approach Used data Ground plan Aerial images used? Principle


Brenner, 1998 Raster DSM Yes Interpretation, ground Ground plan division and 3D primitive
plan manipulation matching
Vosselman, 1999 Laser points No No Plane extraction using Hough transform
Vosselman & Dijkman, Laser points Yes No Hough transform, ground plan division,
2001 and 3D primitive matching
Brenner, 2000 Raster DSM Yes No Planes extracted using RANSAC and
rule based acceptance, roof structure by
searching correct plane intersections
Rottensteiner & Briese, Raster DSM No Suggested to be used for Plane extraction using seed regions and
2003 interpretation and wire region growing, plane intersections, and
frame refinement jump edges detected
Hofmann, 2003 Laser points Yes No Detection of planes using clustering of
=> TIN 3D triangle parameter space
TerraSolid, 2003 Laser points No Possible to use for inter- Point classification and plane matching
pretation and measure-
ment

1023
3D City Modeling and Visualization for Smart Phone Applications

final object representation is obtained by merging eralizing the building model. In order to preserve
all 3D primitives. more detail in the model, another reconstruction
Vosselman (1999) approach uses non-regular- strategy has been explored. If the point cloud
ized laser scanner data. Roof faces are extracted within a building polygon can be represented by a
using a Hough transform which is extended to simple roof shape (flat, shed, gable, hip, gambrel,
a 3D transformation, where points in planes are spherical, or cylindrical roof) the model of this roof
clustered in cluster space. The cluster space is is fitted to the points with a robust least squares
defined by the slopes in x- and y-direction and a estimation. Building models can be decomposed
scalar in the direction of the plane normal. Each interactively so that all parts correspond to the
point in cluster space defines a plane in object above mentioned shape primitives. If the point
space and vice versa. The bin of the cluster space cloud is such that all roof planes can be detected
with the largest number of intersecting planes, i.e. automatically, an automatic reconstruction is
object points, is selected and this plane estimation attempted based on the intersection of detected
is improved using least squares fitting of points neighboring roof faces and the detection of height
that are within a given threshold to the observed jump edge between the roof faces. If the two above
plane. All points in the data set are triangulated in situations do not apply, the building polygon is
2D, and if all three points of a triangle are within split into two or more parts until each part fulfils
a given distance to the plane, the triangle is clas- one of the two above conditions. Optionally, point
sified as belonging to this plane. If the size of the clouds are edited to remove outlier points that
connected triangles exceeds a given threshold, a would disable an automatic roof reconstruction.
planar face has been found. Another least squares Brenner (2000b) uses regularized DSM where
fitting is performed using only the points of the planar faces are extracted using a random sampling
determined planar face. After removing the points consensus (Fischler & Bolles 1981; RANSAC).
of the found component from the cluster space, The RANSAC procedure uses as small an initial
the bin with the next highest count is selected and data set as feasible and enlarges this set with
analyzed, and so forth. The edges are found by the consistent data when possible. For example, given
intersection of faces and analysis of height dis- the task of fitting an plane to a set of 3D points,
continuities. No ground plans are used. The main the RANSAC approach would be to select a set
building orientation is determined and used as a of three points, compute the parameters of the
constraint for the edge orientation. Roof topology implied plane, and count the number of points
is built by bridging the gaps in the detected edges. that are close enough to that plane to suggest
The use of geometric constraints is proposed to their compatibility with it (i.e., their deviations
enforce building regularities. are small enough to be measurement errors). If
In a later approach (Vosselman & Dijkman, there are enough compatible points, RANSAC
2001; Vosselman & Süveg, 2001) ground plans would employ a smoothing technique such as least
are used to prevent the false roof faces. If build- squares, to compute an improved estimate for the
ing outlines are not available, they are manually parameters of the plane now that a set of mutually
drawn in a display of the laser points with color- consistent points has been identified. The normal
coded heights. The concave ground plan corners vectors of the extracted faces and ground plan
are extended to cut the building area into smaller edges are computed, and faces are labeled based
regions and Hough-based plane extraction is con- on the orientation between these normal vectors. A
strained to these regions. Split-and merge is used set of rules expressing possible labeling sequences,
to obtain the final face. This model may lead to i.e. possible relationships between faces and the
under-segmentation of the ground plan thus gen- ground plan edges, is used to either accept or reject

1024
3D City Modeling and Visualization for Smart Phone Applications

the extracted face. A plane is generated for every parameters of every TIN-mesh, which define its
accepted face, and final roof topology is obtained position in space uniquely, are mapped into a 3D
by a global search procedure, which selects the triangle-mesh parameter space. As all triangles
correct plane intersections. The search procedure of a roof face have similar parameters, they form
uses a discrete relaxation to reduce the number clusters in the parameter space. Those clusters that
of possible plane intersections, junctions, and represent roof faces are detected with a cluster
constrained tree search to cut down search space. analysis technique. By analyzing the clusters,
Usually several solutions remain after relaxation significant roof planes are derived from the 3D
and a subsequent backtracking search with forward triangle parameter space while taking common
checking can be used to find them. Regularity is knowledge of roofs into account. However, no
enforced by using additional constraints and least prior knowledge of the roof as e.g. the number of
squares adjustment (Brenner, 2000a). roof faces is required. The obtained roof planes
Also Rottensteiner and Briese (2003) use regu- are intersected in accordance to their position in
larized DSM. Planar roof segments are detected space. By analyzing the intersected roof faces,
using a segmentation based on the analysis of the roof outlines are determined and the ground
the variations of the DSM normal vectors and/or plan is derived.
a segmentation of aerial image grey levels, and TerraSolid (2003) approach extracts buildings
expanded by region growing algorithms. Plane using the classification of the laser scanner de-
intersections and step edges are detected and a rived points. Laser scanner points are classified as
polyhedral model is derived. The parameters of buildings, ground, and high, medium or low veg-
the building models are improved by a consis- etation. Parameters controlling the classification
tent estimation procedure using all the available are given based on the terrain type and building
information, i.e. laser points are assigned to roof characteristics on the target area. Building roofs
planes and direct observations for the planimetric are automatically found based on laser hits on
coordinates of building vertices at step edges are planar surfaces, resulting in vectorized planes of
introduced. Hypotheses about geometric con- each roof. Roof boundaries can also be created or
straints between planes are introduced to improve modified manually, and aerial images with known
the models further, and parameter estimation is exterior orientation can be applied (hybrid system)
repeated. Aerial images are suggested to be used to measure building boundaries.
to detect small planar segments and to improve the Kaartinen and Hyyppä (2006) collected build-
geometric accuracy of laser scanner data derived ing extraction methods from eleven research agen-
wire frame models. cies with four testing areas. The data were provided
Hofmann, Maas and Streilein (2003) approach by the Finnish Geodetic Institute (FGI), namely in
uses point clouds obtained by a pre-segmentation Senaatti, Hermanni and Espoonlahti and one site,
of airborne laser scanner data. It is a plane-based Amiens, provided by the Institut Geographique
approach that presumes that buildings are charac- National (IGN). Data sources contain ALS data,
terized by planes. It utilizes a TIN-structure that is aerial images and the ground plan.
calculated into the point cloud. The method only Building extraction methods were analyzed and
uses point clouds of the laser scanner data that evaluated from the aspects of the time consumed,
contain one building. In order to get such point the level of automation, the level of detail, the
clouds, polygons coarsely framing the building geometric accuracy, the total relative building
can be used to extract the points (e.g. in ArcGIS). area and shape dissimilarity. Details of the results
The polygons can be created manually or map can be found from Kaartinen and Hyyppä (2006).
or ground plan information can be used. The

1025
3D City Modeling and Visualization for Smart Phone Applications

Commercial Systems, Software, Automatic Texture Mapping


and Open Sources
Currently, most systems like ALS and MLS contain
In the above we have addressed the means of both laser scanner and camera, integrated with
geo-data acquisition and methodologies of 3D GNSS and IMU. The corresponding relationship
model reconstruction. Current status shows that between laser point cloud and images can be
data processing is still far from fully automation. calculated by collinearity equations or projec-
Data are processed mainly by the use of existing tive transformation (see section 2.2) since the
systems and software. Table 3 gives a list of current interior orientation (IO) and exterior orientation
available systems, software, and open sources. (EO) information of each image are known (IO
parameters and image distortion parameters can
be obtained from camera calibration informa-
ACQUISITION OF PHOTOREALISTIC tion and EO can be acquired from documented
TEXTURES FOR 3D CITY MODELS GNSS and IMU information). The RGB values
from the images can be easily projected onto the
In the section 2, 3D geometry reconstruction of mesh surfaces of geometric models according to
buildings from Laser scanning and Photogram- the corresponding relations.
metry have been introduced. However, geometric The advantage of automatic texture mapping is
models of buildings exclude societal information, high efficiency. However, object occlusions on the
e.g., landmarks, related to the real scene. Models images would lead to an unpleasant visualization.
with textures not only offer rich content to guide When images are collected by mobile systems, the
people’s lives, e.g., personal navigation, but they positions of the cameras on the platform are fixed.
also provide good visualization. It is fairly often that the case of object occlusion
Photorealistic textures make use of images happens, for example, a car or a tree in front of
taken from the real scene, by perspective correc- a building facade. Some publications related to
tion and image mosaicing to project it onto the object occlusion issues have been addressed by
surfaces of geometric models. There are two ways e.g. (Curless & Levoy, 1996; Früh, Jain & Zakhor,
for texture mapping: automatic texture mapping 2005; Stamos & Allen, 2002; Stulp & Fisher,
and software-assisted texture mapping. 2001). The method from Früh et al. (2005) was

Table 3. Commercial systems and software

The use of commercial system or software Name of the system or software


Photogrammetry, Geoinformation and Remote sensing BAE (SOCET SET); ERDAS; EspaCity; ESRI; INPHO; Intergraph Z/I
system imaging.
3D modeling from images Apollo Photo3D; Photometrix; PhotoModeler; Renoir.
3D modeling from point cloud data (TLS software survey) Rhino 3D; 3D Reshaper; Cloudworx; Cyclone; Geomagic; Kubit; Innov-
mentric Polyworks Software; Multigen Creator; Multigen Vega Prime;
Paraform; Pointools; Rapidform; Realworks; Solidworks; Terrasolid.
Professional 3D software AutoCAD 2000; Alias Wavefront (Maya); 3D Studio Max
Free 3D modeling sources (3D modeling applications) Blender; K-3D; Art of Illusion; SOFTIMAGE|XSI Mod Tool; Zmod-
eler; TopMod3d; Google SketchUp; AutoQ3D Community – 3D Editor;
Anim8or; Seamless3d; BRL-CAD; 3DPlus; 3D Canvas; eDrawings;
Blink 3D; Minos; freeCAD; Bishop3D; K3DSurf; DesignWorkshop Lite;
GDesign; Sweet Home 3D; trueSpace; Alibre Design Xpress; 3DVIA
Shape; DAZ 3D; Wings3d.

1026
3D City Modeling and Visualization for Smart Phone Applications

based on the MLS point cloud. Four steps were building facade. Thus, several images needed to be
taken: i) Separate the trajectory of scanning into combined to create an image mosaic of the facade.
approximate straight-line pieces; ii) Transform The advantage of images taken manually is that
laser points of each piece into a depth image; the position and pose of the camera can be chosen.
iii) Split the scan points into a foreground layer In some cases, it enables to avoid object occlu-
(e.g. a car or a tree) and a background layer (e.g. sions. However, in the majority of cases, object
a building facade) according to the most frequent occlusions still exist. There is a large work load
depth value; iv) Remove the foreground layer and for manual removal of object occlusions. Usually
fill the holes on the background using the linear it yields a good and realistic visual appearance. In
interpolation between the closest right and left generally speaking, manual work load and good
structure boundary point horizontally. And then a realistic visual appearance are proportional.
similar interpolation is done vertically, using the After the preparation of textures, texture map-
already filled in points as valid boundary points. ping can be performed by 3D software. The map-
When a large area occlusion happened, additional ping process is to apply mathematical functions
geometric information was obtained from stereo- to assign each point on the texture to each point
imagery. Therefore, full automation to solve the on the object surface. Different software usually
object occlusions is a challenge work. employs different texture mapping methods such
For photorealistic 3D models, we always as UV mapping, UVW mapping and polynomial
make efforts towards two goals: automation and texture mapping.
good visual appearance. It is a challenge work to
develop a fully automatic method for 3D models
with good photorealistic visual appearances. SMARTPHONE 3D VISUALIZATION
FOR NAVIGATION AND LBS:
Software-Assisted Texture Mapping APPROACHES AND TECHNIQUES

The images are taken manually without needed The provision of 3D graphical navigation ap-
orientation information. In this case, it is common plications and location-based services for mo-
to utilize existing 3D software for texture map- bile devices has been envisaged for a long time
ping. The main work for software-assisted texture (Zlatanova & Verbree, 2003). One of the benefits
mapping is texture preparation. It includes image of 3D navigation, according to Coors, Kray, Laakso
perspective correction, image mosaicing, and and Elting (2005), is that people can recognize
object occlusion removal from building facades. landmarks and find routes in cities easier by using
Perspective correction is indispensability for 3D models than using a symbolic 2D map because
texture preparation. When objects obstruct the line of the highly visual correspondence between map
of sight in front of a building or if buildings are objects and real world objects.
high, the images have to be taken at an oblique One of the major challenges for a 3D navi-
angle. In addition, even if images were taken at gation system is the development of a highly
approximately right angles to the building facade, efficient 3D navigation engine. Although some
they still need some perspective correction. desktop 3D navigation applications have been
Usually a building facade is corresponding to introduced in the past decades (Sharkawi, Ujang
an individual texture. Due to the limited field of & Abdul-Rahman, 2007), the 3D capabilities of
view of the camera when covering a large build- mobile devices are restricted in several aspects:
ing facade, one image only covered part of the dedicated 3D hardware chip, floating point units,
hardware division circuits, memory bandwidth,

1027
3D City Modeling and Visualization for Smart Phone Applications

and CPU speed (Zlatanova & Verbree, 2003), and Approaches to 3D Personal
the lack of an accurate integrated positioning sys- Navigation Visualization
tem. Besides, the visibility of scene management
on the low-resolution screens of mobile devices A smartphone-based 3D visualization engine
needs adapting. generates 3D graphical representations of the
Following recent improvements implemented navigation context environment, organizes them
in smartphones in regard to increased computa- into scene graphs, and allows mobile users real-
tional and graphic capabilities, improved power time navigation and interaction within the 3D
management and the reduced price of the GPS scene. To date, however, all this is still a real chal-
module, the navigation application is now be- lenge when considering the limited computation
coming a standard feature in smartphones, tablet and graphical capabilities of mobile hardware
computers, and other portable devices. This makes systems and the low bandwidth of the wireless
feasible the development of a 3D personal navi- communication network when compared to the
gation visualization engine. Although there have vast volume of massive geometrical and texture
been already a few industrial Portable Navigation data of the surrounding 3D city models and terrain
Devices (PND), e.g. Garmin Nuvi 765T GPS, data that must be transferred.
support visualization of 3D models on 2D maps Fortunately, given the new generations of
from different view angles, the development of iPhones or Android Smartphone systems and
a real 3D navigation engine, which seamlessly iPad/Xoom tablet computers, the computation
supports indoor/outdoor navigation, is still a and graphical capabilities of this genre of hard-
challenge with smartphones. ware have increased dramatically due to the use
This section mainly introduces different ap- of multi-core systems and new graphical chips,
proaches and architectures of the 3D personal which make it affordable to run high-performance
navigation visualization engine to support 3D 3D graphical applications. Alongside this de-
navigation and LBS application. We firstly pro- velopment, the emerging 4G communication
ceed to introduce a prototype smartphone-based technology, e.g., TD-LTE, means that wireless
real-time 3D engine. This prototype engine has communication network speeds will ultimately
so far been able to achieve high performance in satisfy the need for a fast approach to 3D model
real-time 3D rendering of very complex urban data transfer.
models; this has been demonstrated in indoor/ The development of a smartphone-based 3D
outdoor navigation within the Shanghai EXPO navigation visualization engine with powerful
area and within the Tapiola/FGI study area in rendering capability is normally based on low-
Finland. We also discuss another 3D visualization level high-performance 3D graphics API, e.g.,
engine implementation approach based on client/ OpenGL ES API, which is now available on almost
server architectures, which use the server’s 3D- all modern smartphone OSs, e.g., iOS, Android,
rendering capabilities and provides a 3D- render- Symbian 3, and MeeGo.
ing service to the client portable devices through a Here we proceed to introduce a prototype
communication network. Finally, the advantages smartphone-based real-time 3D engine, which
and disadvantages of these 2D vs. 3D navigation was initially developed for the Symbian environ-
methods are investigated and some future trends ment using standard C++ and OpenGL ES, and
are discussed. it can be ported to other mobile complex urban
models as was demonstrated in the indoor/outdoor
navigation trial conducted within the Shanghai
EXPO area and within the Tapiola/FGI study area

1028
3D City Modeling and Visualization for Smart Phone Applications

Figure 4. The architecture of the 3D navigation visualization engine

in Finland. The engine is basically structured as The data management layer accesses the 3D
having three layers: an API interface layer, a core models and ground-texture images persistent on
engine layer, and a data management layer, and the flash disk of the mobile phone via an inde-
these are illustrated in Figure 4. The top layer is the pendent thread (or active objects for a Symbian-
interface layer, which is used for cross-component based system).
functional operation, request handling, and data As regards the general object relationships in
exchange (Chen et al., 2010). the 3D Visualization Engine (3DVE), there is a
The core engine layer takes care of the 3D key object called NaviSceneControl, which func-
scene visualization, e.g., scene management, scene tions as the hub of the 3DVE. It is implemented
rendering, mathematical computation, coordinate as a visual user interface (UI) and controls itself
transformation, and model object management. with a window for displaying a 3D scene in the
To enable the 3D scene visualization for a large application. It is also responsible for the following:
area, the objects in the scene are classified into
two categories within this layer. One is for the 3D • Connecting with Interface Objects in the
models (such as buildings, tree and poles), while API Interface Layer to exchange informa-
the other is for the texture of the land surface, tion with the main application and other
which consists of ortho-rectified digital aerial components(Chen et al, 2010; Liu et al.
photos. All of the objects are subdivided into 2010);
squared tiles with geographic references derived • Creating and displaying the scene’s
from the original data. Using this data processing Renderable Objects such as terrain surface,
technique, the models and images are tailored to 3D models, landmarks, and planned route;
satisfy the navigation application, balancing be- • Importing incoming Point of Interest
tween performance and visualization effects. Information (POI) from the Interface

1029
3D City Modeling and Visualization for Smart Phone Applications

Layer and re-organizing this information and high-resolution aerial photos. The ordinary
as Geometry Objects; pavilion mesh consists of thousands to tens of
• Importing keyboard/command information thousands of vertexes and dozens of texture images
from Control Info in Interface Layer and for each pavilion. The rendering performance has
processing operations according to the dif- been successfully improved to less than 1 second
ferent event types using Platform Layers’ of the update interval on a Nokia 6710 navigator
(S60) event-processing mechanism; mobile phone. Figure 5a, 5b show the screenshot
• Updating view information and exporting of the China Pavilion and the virtual navigation
to the ViewInfo and LmkInViewList in the in the EXPO area respectively.
API Interface Layer. Two test areas in Finland were selected for
indoor/outdoor tests, as shown in Figure 5c, 5d.
The Renderable Objects encapsulate the Figure 5c demonstrates the outdoor navigation
drawing operation of the Geometry Objects. The within the Tapiola area, while Figure 5d demon-
persistence of the geometry objects is managed strates the indoor navigation along the corridors
by the Cache Access Objects, which use the plat- of FGI’s 3rd floor premises. The positioning ap-
form’s (S60) multi-thread techniques, i.e., active proach for indoor environment was arranged
objects, to access local caches from flash memory. through a combination of the smartphone’s blue-
In the SDK Layer, Symbian C++ provides a tooth (BT) and WLAN connected to the access
common language tool for developing the engine. points (AP) installed within the building (Pei et
All objects in 3DVE are implemented using this al. 2010), and this was achieved with a position-
Symbian C++ and its open C++ extension. In ing accuracy of about 5 meters.
addition, the OpenGL ES library is also used to
provide APIs to realize the rendering facilities, Image-Based 3D Visualization Engine
specifically for Renderable Objects in the Core
Engine Layer. Compared to a smartphone-based 3D navigation
The engine has been tested within the Shanghai engine, the image-based 3D navigation engine
EXPO area for outdoor navigation demonstration retrieves 2D perspective images of the 3D geo-
as well as within the Tapiola/FGI study area for virtual environments from remote servers via a
outdoor/indoor navigation demonstration. The high-speed network.
Shanghai EXPO area has over 200 pavilion models

Figure 5. Demonstrations within the Shanghai EXPO area and in Finland using a Nokia 2610 smart-
phone: a. China Pavilion; b. Virtual navigation within the Shanghai EXPO area; c. A 3D view of the
city center of Tapiola, Finland; d. FGI’s building

1030
3D City Modeling and Visualization for Smart Phone Applications

A typical example is the Web Perspective View 3D geo-coordinates for a specific 2D pixel posi-
Service (WPVS) and its extension WPVS++ as tion in the generated image. A GetFeatureInfo
proposed by (Hildebrandt et al. 2010; Hagedorn operation should provide extra information about
et al. 2010). This approach generates a set of the geo-objects at a specific pixel position. Us-
2D images of the perspective views of 3D geo- ing these APIs, the client provides an extendable
virtual environments (e.g., virtual 3D city models) interactive method between the mobile device
using the server-based 3D rendering service. and the remote server.
Retrieval was continuous from the client side on A key advantage of this image-based approach
the light-weight devices. To provide users with a is that the complexity exposed to a client for
high degree of interactivity, the WPVS includes displaying a visual representation is reduced to a
additional thematic information encoded into constant factor, primarily depending on the image
non-color image layers for local reconstruction resolution, and thus it provides better scalability
of the visual representations of the remote 3D with the increasing complexity of 3D city models,
geo-virtual environments and corresponding concurrent users, and the computation required for
functionalities for the exploration and analysis of real-time photorealistic 3D visualization. Finally,
the presented 3DGeoVEs by simple clients, e.g., since the 2D representative images of the 3D geo-
by requesting 3D positions, retrieving object IDs, virtual environments in this approach depend on
and measuring distances. A procedure called View the server’s 3D rendering service, the limitation
Process (VP) transforms commands for updating of this method is that it can only be accessed via
the visualization from the controller, which pro- a high-speed network environment, and high-
cesses user-interaction events into the calling of performance 3D rendering servers are preferred to
service operations for requesting images or other provide multi-user service. An example is shown
functionalities. In order to reduce the latency in the Figure 6.
between user input and display updates, which
can be introduced using the narrow bandwidth of Techniques of 3D Personal
the network and increased frame rates, a client- Navigation Visualization
side 3D reconstruction and rendering techniques
based on image-based modeling and rendering Tilling is a technique that is used to subdivide
(IBMR), point-based modeling and rendering original full-resolution terrain texture images to
(PBMR), and the nested, partial visualization generate squared tiles of predefined sizes with
pipeline (NPVP) pattern, are employed. Since reduced resolution. It is widely applied in large-
the WPVS is provided as an open, interoperable area terrain visualization on desktop applications,
HTTP-based interface, it can be easily integrated e.g., Google Earth. The tile structure used in the
into web-based applications or mobile systems to 3DVE engine is efficient for 3D models and ter-
provide smartphone 3D navigation. rain textures management. An XY grid starts in
WPVS supports interaction with the generated the lower left hand corner of the Earth, defined as
image by proposing a series of APIs. This includes (-90, -180) in latitude and longitude (see Figure
techniques for maneuvering the virtual camera 7). The coordinates of each box are from the lower
through the 3D world and for retrieving informa- left hand corner of the box. The initial point is
tion about the visualized objects. For example, a considered to be X = 0, Y = 0. With a given tile
GetView with style specifications and parameters size, it is easy to find the box’s X-Y coordinates for
should be used to request specific content (e.g., a given latitude and/or longitude, and vice versa.
feature type or building usage) for the requested The tiling strategy is very important for a
image layers. A GetPosition is proposed to retrieve mobile system because we just need to load the

1031
3D City Modeling and Visualization for Smart Phone Applications

Figure 6. An example of WPVS service showing the feasibility of the image-based 3D visualization
engine on mobile devices (courtesy of Hagedorn et al., 2010)

Figure 7. Illustration of the tiling technique


graphics rendering pipeline. It is especially im-
implemented in 3DVE
portant for 3D model data streaming between the
memory and flash disk, vertex transformations,
and texture mapping (Wikipedia, 2011). To do so,
the 3D models should be classified into different
layers and importance according to the service
purpose of the LBS applications. Meanwhile, the
maximum viewing height and distance between
the scene camera and the visible 3D objects must
also be defined accordingly.
Since building models, e.g., within the
Shanghai EXPO area for our demonstration, are
normally very complex, usually with thousands
of vertexes, tens to hundreds of wall textures
for a single building model, it is not possible to
acquire acceptable performance by rendering a
3D scene using OpenGL’s instant mode given the
small images and 3D models according to the current mainstream smartphone’s graphical and
extent of the view from the current viewing angle, computational capabilities. Using the Vertex Buf-
rather than the entire images with full resolution fer Object (VBO), the rendering performance can
and all 3D models. be enhanced by a factor of 10-30 In order to use
Using a static level of detail technique could VBO, the vertex coordinates of the models need
also significantly improve the rendering per- to be linearly stored in the disk file. Meanwhile, to
formance by reducing the complexity of 3D support easy inclusion of 3rd party 3D models, the
object representation as it moves away from the local model-space coordinate system and WGS84
viewer according to some key factors such as coordinate system are used simultaneously in 3D
object importance, eye-space speed or position. models. To import a 3D model with a local model-
This significantly decreases the workload on the space coordinate system into the engine, the user

1032
3D City Modeling and Visualization for Smart Phone Applications

needs to know only the latitude, longitude, and representation of objects, more variable views
the azimuth of the local origin of the 3D model. that are directional and bound to a first-person
An automatic geometry transformation for the perspective, more degrees of freedom in move-
3D model into the universal WGS84 coordinate ment, and dynamically changing object details.
system is performed before visualization using the In order to facilitate better understanding of
orientation information. Thanks to this, the model the differences between 2D and 3D navigation,
author does not need to know which coordinate two experiments and surveys were conducted by
system will be used in the model’s visualization. the 3D-NAVI-EXPO project team. Thirty-two
To reduce the data size of the 3D models and volunteers as testers were brought to the Shanghai
to improve the visualization performance, the EXPO area (where they had not been before) and
original *.3ds file created with 3D Max Studio to a street unknown to the testers in Finland. They
software from the aerial photo and CAD draw- were then asked to use the developed 3D personal
ing is compressed and reformatted to fulfill the navigation prototype software on a GPS-enabled
requirements of the mobile device. In order to smartphone respectively to find and navigate to
support model simplification and tiling, a utility a destination. Each tester was required to fill in a
software is developed to create a tile of models feedback form at the end of the test. A summary
and put each model into its corresponding location of the feedback comparisons from the users is
grid, packing all small textures of a model into illustrated in Table 4.
one or two images of size 512x512 or 256x256 It is an indisputable fact that a 3D map has the
using the bin packing algorithm (Lodi, Martello & volumetric concept, which is an important dimen-
Monaci, 2002). This is done in conjunction with sion of the cityscape. Rich and realistic visual
file formatting from *.3ds file to support VBO details of 3D models help the user to identify an
rendering, which results in significant time sav- object easier. Further, 3DPN supports the virtual
ings in reducing graphical rendering and texture reality capability, which ensures that the user
loading times. obtains an impression of a place without being
present there in person. However, there are also
2D vs. 3D Navigation Application: some disadvantages of the 3DPN, and the follow-
A Comparison from the ing are some examples:
User’s Point of View
• The ambiguous textual or missing details
In theory, compared to traditional 2D maps, 3D of an object can mislead the user when try-
mobile maps involve volumetric instead of flat rep- ing to rapidly understand the physical en-
resentation of space, realistic instead of symbolic vironment based on the 3D models.

Table 4. Comparison of user experiences with 2D vs. 3D navigation applications

Navigation Activity 2D 3D
Identifying the position Known landmarks, street names, relative directions, Building shapes, facades in general, facade
street crossings, cardinal directions. details, and relative directions.
Finding the direction Known landmarks, street names, relative directions, Finds the direction while identifying the posi-
street crossings, cardinal directions. tion. Even one building is enough.
Navigating to the destination Consumes fewer resources to maintain, but takes Costs more for resources, but easy to identify
more time to identify the target while approaching the target while approaching the destination.
the destination. Easy to get a general picture. Difficult to get a general picture.

1033
3D City Modeling and Visualization for Smart Phone Applications

• Inaccurate position and heading without of commercial buildings, including shopping


any processing can introduce a disaster centers, banks, government agencies, bookstores,
into the 3D visualization. For example, if a and high-rise residential buildings, with the tall-
user gets an incorrect position or heading, est building being 45 m in height. The data were
the user will obtain a problematic 3D scene collected by the Finnish Geodetic Institute (FGI)
with the wrong orientation and position. ROAMER system, which was a laser scanning
• Environmental elements, such as light system mounted on a mobile platform. The 3D
and weather, also influence the user’s models were constructed from the collected data
understanding. in two steps: geometry modeling and texture
• The field of view in 3DPN cannot simulate mapping. The resulting models are applied to a
the field of view of a human being exactly smartphone for personal navigation.
due to issues such as the limitation of the
screen size of a smartphone. Data Acquisition

Some efforts have been made to tackle these The FGI ROAMER system is a mobile laser
disadvantages, e.g., Pei et al. (2010) developed a scanning system which can be mounted on vari-
method of using built-in sensors in a smartphone to ous vehicles. The hardware sub-systems of the
assist 3D personal navigation in a GPS-degraded ROAMER include the following: (1) a FARO laser
area. Recognition of motion is firstly applied scanner; (2) a GPS-INS navigation equipment; (3)
to detect the context by using the data from a a camera system; (4) synchronization electronics;
built-in tri-axis accelerometer. Afterwards, a map and (5) a mechanical support structure. The mirror
matching algorithm is implemented to snap the rotation frequency, or profile measuring frequency
GPS position onto the middle of the nearest road. of the FARO LS, is typically set to 24 Hz, 49 Hz
Finally, in order to avoid the heading drift during or 61 Hz in mobile applications, and the verti-
the map matching procedure, a Bayesian Network cal angular resolution can be set to 0.009–0.288
-based heading change detection method is applied degrees. For platform speeds of 50–60 km/h, the
by combining the orientation readings from both profile spacing is about 30 cm when using the
the GPS and the digital compass integrated in profile measuring frequency of 49 Hz. With a
the smartphone. With this improvement, the user frequency of 49 Hz, the profile interval is less than
experience has been greatly improved in the 3D- 20 cm when the speed of the mapping unit is kept
NAVI-EXPO project (Pei et al. 2010). We believe below 40 km/h, and the point resolution along the
that further study of the other disadvantages of the profile is still 2.5–5 cm, which is sufficient for
3DPN is needed and that such study will improve the practical ranges of 20–40 m, respectively, in
the user experience. the urban environment (Kukko, 2009).
The ROAMER configuration as used in Tapi-
ola can be seen from Table 5. The data included
A CASE STUDY: 3D MODELS FOR altogether about 160,000 profiles, each profile
TAPIOLA AREA (ESPOO, FINLAND) having 2,150 points with 3D coordinates and
return intensity. Data collection lasted about one
Main contents of the case study were originally hour, and covered an area 180 m by 280 m. The
from Zhu et al. (2011). It presents a full proce- laser data were transformed into a map coordinate
dure of the generalization of photorealistic 3D system (ETRS-TM35FIN with GRS80 ellipsoidal
models from MLS data. The test site is located height). Due to narrow streets and high buildings,
in the Tapiola area, Espoo, Finland. It is an area images from the ROAMER system did not meet the

1034
3D City Modeling and Visualization for Smart Phone Applications

Table 5. The ROAMER’s acquisition parameters


falling into each bin, and the positions
Date May 12th 2010 of each bin center were calculated. The
Laser scanner Faro Photon™ 120
threshold (T) for the number of the points
Navigation system NovAtel SPAN™
in each bin (e.g. 800) can be defined by the
Laser point measuring 244 kHz
user according to the different density of
frequency the point cloud and the size of a dataset.
IMU frequency 100 kHz • Object classification: After noise remov-
GPS frequency 1 Hz al, the number of points is greatly reduced.
Data synchronization Synchronizer by FGI, scanner These points contained ground points,
as master building points, and other object points,
Cameras Two AVT Pike which needed to separate.
Profile measuring fre- 49 Hz
quency
Ground Point Classification

The algorithm was developed for a relatively flat


requirement for high-quality textures. Therefore, area. In order to facilitate the description, Table
the images were taken separately using a Canon 6 depicts abbreviations used.
EOS 400D digital camera. This algorithm was developed by considering
the most frequently occurring height value in the
Geometry Reconstruction data. For the MLS data in the relatively flat area,
ground points usually hold the most frequently
Geometry reconstruction from raw MLS data occurring height value, especially in an urban area
includes the following steps: noise point filtering, without a large area of low vegetation. In this
object classification (ground, buildings, and the algorithm, we used 0.25m as a tolerance taking
other objects), detection of planar surfaces, key into account the influence of measure accuracy
point extraction, and surface meshing. Our goal in of a few decimeters in the ROAMER system and
geometry reconstruction was to utilize some key also the unevenness of the terrain. In addition, a
points in building model construction. Detailed 3m threshold for Zd was used mainly considering
algorithms for noise points filtering, ground point that it is possible that data in some bins contain
classification, building point extraction, detection only the points from the reflection area due to the
of planar surfaces, and derivation of the buildings’ incomplete removal of the noise points. After the
key points are presented in each section. algorithm was performed, ground points are ex-
tracted.
• Noise points filtering: The acquired data
were composed of 340 million points with Building Point Classification
georeferenced XYZ coordinates. A noise
point here refers to a point with markedly After ground point extraction, the remaining data
deviation from the other points. We used contain buildings and other objects. In 3D city
the data of 2D projections (x-y, x-z, y-z) model creation, building point classification is a
respectively to filter out these noise points. key step for building model reconstruction. Un-
The data in each 2D projection were dis- like ALS data scanned from the several hundreds
tributed into 10-by-1 bins. A three-dimen- or even thousands of meters above the top of the
sional histogram of bivariate data was ap- object, MLS data are collected close to the ground
plied to calculate the number of elements passing along a street. Detailed 3D information of

1035
3D City Modeling and Visualization for Smart Phone Applications

Table 6. A list of abbreviations

Abbreviation Description
Zf_data The most frequently occurring height value for the dataset;
Zf_grid The most frequently occurring height value for data in each grid;
Z_min The minimum height value for the data in one file;
Zmin_grid The minimum height value for the data in the grid;
Zd The difference between Zf_data and Zmin_grid
Algorithm 1. Ground point classification
1: Calculate Zf_data: mode (height value of data)
2: Compare the difference between Zf_data and Z_min
3: if the difference <=1m then
points with the height value <= Zf_data +0.25m accepted as ground points
4: else
grid points in XY plane into 10*10 bins
5: for each bin
calculate Zf_grid and absolute value of Zd: abs(Zd)
6: if abs(Zd) <= 3m then
compare the difference between Zf_grid and Zmin_grid
8: if the difference <= 0.25m then
points with the height value <= Zf_grid +0.25m as ground points
9: else
points with the height value <= Zmin_grid +0.25m as ground points
10: end if
11: end if
12: end for
13: merge all ground points from each bin
14: end if

the vertical planes of buildings can be obtained. We employ image processing technology for
The algorithms were developed based on the as- non-building point removal. The x-y coordinates
sumption that the building facades are vertical of the rough building points (e.g. B1, B2, and B5)
with respect to the ground. Figure 8 shows the are gridded into predefined pixel size and trans-
principle of the algorithm. We first utilize two formed them to a binary image. By image param-
cutoff boxes (cutoff box 1 and cutoff box 2) to eter control (e.g. image region properties: area
find the overlap between them. Overlapping areas size, eccentricity value, major axis length and
reveal high buildings such as B1, B2, and B5. After minor axis length, and morphological structure
the overlap parts are detected, the whole buildings element), non-building points can be removed.
are extracted. Then these building points were After image processing, the binary image is trans-
removed from the data. The lower buildings were formed back to 3D points. Similarly, this image
then extracted from the remaining data by setting processing method is also applied to the detected
two other cutoff boxes (cutoff box 3 and cutoff lower buildings (e.g. B3, B4, B6, and B7) for
box 4). Then the overlaps between them can be non-building point removal. The resulting build-
found, which are B3, B4, B6, and B7. However, it ing points are obtained by merging these two sets
is possible that the above detected buildings still of cleaned points. The benefit of this algorithm
contain other points (e.g. tree points). Therefore, is that trees can then be effectively isolated from
these non-building points need to be removed. buildings. Figure 9 shows the procedure of the

1036
3D City Modeling and Visualization for Smart Phone Applications

Figure 8. The basic principle of the algorithm of building point classification

transformation from 3D points to a 2D binary 1. Two neighbor points (P1, P2) were selected
image, then applying image processing technol- randomly to form a vector (V1).
ogy for non-building point removal, and trans- 2. A third point (P3) was added to form the
forming the cleaned image to 3D points. vector V2 with P1. If V2 was not parallel
Key Point Extraction and Surface Meshes - to V1, then it was considered to be the basic
One of our goals was to generate building models plane.
that require low amounts of memory and can thus 3. If V2 and V1 were collinear, then this point
be applied for applications such as mobile de- (P3) was taken as a coplanar point. If not,
vices with limited memory, and rapid rendering then the next step was taken.
for visualization. The buildings extracted by ap- 4. P4 is any point in the dataset. P4 and P2
plying the above steps still have a huge amount formed vector V3. If it satisfies the coplanar
of points. Therefore, key point extraction was condition V3· (V1×V2) = 0, it is regarded
essential. We took two steps for key point extrac- as a coplanar point.
tion: identify the planar building facade and extract 5. When a plane was identified, the points of
the key points. There are two popularly used this plane were removed from the dataset.
planar detection methods including surface normal 6. Steps (1) to (5) were repeated until all points
consistency and region growth. Here the algorithm were allocated to a certain plane.
for coplanar point detection is realized as follows:

Figure 9. An example of the procedure of transformation of 3D point clouds into 2D binary images,
non-building points filtering on images, then into 3D point clouds

1037
3D City Modeling and Visualization for Smart Phone Applications

Key points here refer to the corners of the steps. However building geometric models do
building facades. The corners could be deduced not contain landmark information in the scene,
from the minimum and maximum values of the therefore it is important to provide photorealistic
X or Y coordinates and the Z values. In order to visualization for the models.
achieve high quality (e.g., completeness and cor- Due to narrow streets and high buildings, im-
rectness) of building geometry, manual checking ages from the ROAMER system did not meet the
and correction are needed. Manual checking and requirement for high-quality textures. Therefore,
correction are performed by visually comparing the images were taken separately using a Canon
the identified building planes and corners with EOS 400D digital camera. Thus manual texture
terrestrial images and also making correction preparation, e.g., the image perspective correc-
by resetting parameters in the algorithm for bet- tion, noise point removal, and image mosaicing,
ter results. Usually completed models can be is needed. These images were used to provide the
achieved by using building roof information from textures of the building facades. For complete 3D
airborne-based data and the building facade from models, the textures of the building roofs were
MLS data. However in our data the advantage is also needed. We made use of aerial images from
that the buildings are scanned from all sides. This website to obtain the textures of the building roofs.
means that complete models can be created with Each roof plane was made to correspond to one
manual assistance. texture. Image processing software (Corel Photo
Following key point extraction, the meshes Paint) was utilized in texture preparation. The
were constructed for the surfaces. When dealing 3ds Max software was used in texture mapping.
with complex buildings, they were divided into Figure 10 shows the results of 3D models in the
cubes according to different heights. The meshing Tapiola area.
surfaces were imported to 3ds Max for further op-
erations, e.g., surface merging and small geometry Implementation into a 3D Engine
adjustment for good visualization.
Recently, the development of 3D game engines
Texture Mapping has been rapid and, currently, there are a number
of free and commercial options available, such as
A detailed procedure of geometry reconstruction Blender, realXtend, and CryEngine 3. As mobile
from MLS data has been addressed in the above devices have improved in processing power and

Figure 10. 3D models of Tapiola

1038
3D City Modeling and Visualization for Smart Phone Applications

graphics performance, also 3D game engines ca- FUTURE ASPECTS AND


pable of running on mobile devices have started CHALLENGES
to emerge and some examples that run on mobile
devices as well as desktop computers include The development of sensors for 3D data acquisition
ShiVa3D, SIO2, Unity, and Unreal Engine 3. This provides more and more solutions for 3D model
is a significant progress, as previously the devel- generation. Current laser scanning has been able
opment of a 3D engine for a mobile device may to offer a dense point cloud, e.g. ALS enables up
have taken multiple years and has been limited to 40 points /m2, MLS up to 1.3Mpoints/s prov-
to a single platform and application. ing tens of thousands of points /m2. Due to the
With the modern game engines and develop- increase in the density of point clouds, the mea-
ment tools, it is now extremely easy to deploy surement accuracy and automation level have also
the produced 3D model into a game engine with greatly improved. The poor capability to detect
a graphical user interface. The game engine also breaklines will be improved by systems proving
allows for further application development that higher density of point clouds. For the extraction
may include networked, interactive, real-time of building geometry for the models of fly-through
and multiplayer aspects. Also, many of the game views, some successful examples (more automa-
engines feature simultaneous multi-platform tion and better results) show that good results can
development, which means that the same model be achieved by the use of both laser and ground
and the program code can be utilized on many plan data. Thus, high resolution 3D models for
different platforms with minimal effort. walk-through views require both airborne-based
The Tapiola 3D model is implemented with and terrestrially-collected data. Airborne-based
Unity 3D in the Android environment. It is ac- data provide building roof information whereas
cessible from Android Market (http://market. terrestrial data offer detailed building facade
android.com/details?id=com.FGI.Tapiola3D). information. Their combination increases the
Figure 11 shows an example of the scene on an completeness and the level of detail of the 3D
Android device. models, and in the near future large areas can be
created in 3D for personal navigation.

Figure 11. A screenshot of the Tapiola 3D model running on an Android device

1039
3D City Modeling and Visualization for Smart Phone Applications

Despite much effort in the development of REFERENCES


automated methods, fully-automatic methods are
not yet available. As regards building geometry Ackermann, F. (1999). Airborne laser scanning
reconstruction, although the use of both laser -present status and future expectations. ISPRS
scanning and images improves the level of automa- Journal of Photogrammetry and Remote Sensing,
tion, difficulties still arise when complex models 54, 64–67. doi:10.1016/S0924-2716(99)00009-X
need to be reconstructed. Many of the developed Ahmadi, S., Zoej, M. J. V., Ebadi, H., Moghad-
methods are scene-based. As regards photorealistic dam, H. A., & Mohammadzadeh, A. (2010). Au-
textures, object occlusions have caused the main tomatic urban building boundary extraction from
problems. Manual work can result in good visual high resolution aerial images using an innovative
appearances, but it leads to the lower efficiency model of active contours. International Journal of
of the reconstruction process. Thus, a high degree Applied Earth Observation and Geoinformation,
of automation with good visual appearance and 12(3), 150–157. doi:10.1016/j.jag.2010.02.001
high extraction accuracy provides challenges also
for future work. Amara, Y., & Marsault, X. (2009). A GPU tile-
Due to the limited hardware resources and load-map architecture for terrain rendering:
graphical computation power of mobile devices, Theory and applications. The Visual Computer,
3D visualization techniques for the specified 25(8), 805–824. doi:10.1007/s00371-008-0305-1
mobile platform need to be developed. The chal-
Arayici, Y. (2007). An approach for real world
lenge related to real-time fast 3D rendering and
data modeling with the 3D terrestrial laser scan-
display of large datasets and of complex urban
ner for built environment. Journal of Automation
environments still remains. We introduced two
in Construction, 16(6), 816–829. doi:10.1016/j.
main approaches to 3D navigation’s visualization
autcon.2007.02.008
engines. With each engine, personal 3D naviga-
tion in an urban environment is feasible. The 3D Awrangjeb, M., Ravanbakhsh, M., & Fraser, C.
visualization engine developed in conjunction S. (2010). Automatic detection of residential
with the 3D-NAVI-EXPO project realized a buildings using LIDAR data and multispectral
high-performance real-time 3D rendering of very imagery. ISPRS Journal of Photogrammetry and
complex urban models as we have demonstrated Remote Sensing, 65(5), 457–467. doi:10.1016/j.
in the experimental areas. In order to support isprsjprs.2010.06.001
highly efficient visualization, techniques for 3D
Axelsson, P. (2000). DEM generation from laser
model simplification and texture processing play
scanner data using TIN models. In D. Fritsch, &
an important role alongside tiling, level of detail,
M. Molenaar (Eds.), The International Archives
and vertex buffer objects.
of the Photogrammetry and Remote Sensing, Vol.
Our example in the Android Market (http://
XXXIII, Part B4/1 (pp.110-117).
market.android.com/details?id=com.FGI.Tapi-
ola3D) shows the current state of the art in creating Baillard, C., Schmid, C., Zisserman, A., & Fitzgib-
visually good appearance textures, high-quality bon, A. (1999). Automatic line matching and 3D
geometry, and 3D visualization in a single demo reconstruction of buildings from multiple views.
for near-future personal geospatial infrastructures. The International Archives of the Photogramme-
try and Remote Sensing, Vol. XXXII, Part 3-2W5
(pp. 69-80).

1040
3D City Modeling and Visualization for Smart Phone Applications

Baillard, C., & Zisserman, A. (1999). Automatic Bethmann, F., Herd, B., Luhmann, T., & Ohm,
reconstruction of piecewise planar models from J. (2009). Free- Form surface measurement with
multiple views. Proceedings of the IEEE Confer- image sequences under consideration of disturbing
ence on Computer Vision and Pattern Recognition, objects. Optical 3D. Measurement Techniques,
(pp. 559-565). 2, 51–61.
Baltsavias, E., Zhang, L., & Eisenbeiss, H. (2005, Blair, J. B., Rabine, D. L., & Hofton, M. A.
May). DSM generation and interior orientation (1999). The laser vegetation imaging sensor:
determination of IKONOS images using a testfield A medium-altitude, digitization only, airborne
in Switzerland. Paper presented at the ISPRS Han- laser altimeter for mapping vegetation and to-
nover Workshop 2005 on “High-Resolution Earth pography. ISPRS Journal of Photogrammetry
Imaging for Geospatial Information,” Germany. and Remote Sensing, 54, 115–122. doi:10.1016/
S0924-2716(99)00002-7
Baltsavias, E. P. (1999). A comparison between
photogrammetry and laser scanning. ISPRS Blow, J. (2000). Terrain rendering at high levels
Journal of Photogrammetry and Remote Sensing, of detail. In J. Blow (Ed.), Game Developer’s
54, 83–94. doi:10.1016/S0924-2716(99)00014-3 Conference Proceedings. San Jose, CA, CMP
Media LLC. Retrieved from http://number-none.
Baltsavias, E. P. (2004). Object extraction and
com/blow/papers/terrain_rendering.pdf
revision by image analysis using existing geo-
data and knowledge: current status and steps BlueFAQs. (2011). 30 Free 3D modeling applica-
towards operational systems. ISPRS Journal of tions. Retrieved June 8, 2011, from http://bluefaqs.
Photogrammetry and Remote Sensing, 58(3-4), com/2009/05/30-free-3d-modeling-applications/
129–151. doi:10.1016/j.isprsjprs.2003.09.002
Bodgan, J., & Coors, V. (2009). Using 3D urban
Baltsavias, E. P., Pateraki, M., & Zhang, L. (2001, models for pedestrian navigation support. Pro-
September). Radiometric and geometric evalua- ceedings of the ISPRS working group III/4, IV/8,
tion of IKONOS geo images and their use for 3D IV/5: GeoWeb 2009 Academic Track – Cityscapes,
building modeling. Paper presented at the Joint (pp. 9-15).
ISPRS Workshop on “High Resolution Mapping
Bösch, J., Goswamiz, P., & Pajarolax, R. (2009).
from Space 2001”. Retrieved from http://e-
RASTeR: Simple and efficient terrain rendering
collection.library.ethz.ch/eserv/eth:25224/eth-
on the GPU. In Ebert, D., & Krüger, J. (Eds.),
25224-01.pdf
Eurographics 2009 (pp. 35–42).
Becker, S., & Haala, N. (2007). Combined feature
Braun, C., Kolbe, T. H., Lang, F., Schickler,
extraction for facade reconstruction. Proceedings
W., Steinhage, V., & Cremers, A. B. (1995).
of the ISPRS Workshop Laser Scanning 2007 and
Models for photogrammetric building recon-
SilviLaser 2007, (pp. 241-247).
struction. Computer Graphics, 19(1), 109–118.
Bescoby, D. J. (2006). Detecting Roman land doi:10.1016/0097-8493(94)00126-J
boundaries in aerial photographs using Radon
Brenner, C. (2000a). Dreidimensionale Gebäuder-
transforms. Journal of Archaeological Science,
ekunstruktion aus digitalen Oberflächenmodellen
33(5), 735–743. doi:10.1016/j.jas.2005.10.012
und Grundrissen. Doctoral thesis, Universität
Stuttgart, Stuttgart, Germany. Retrieved June
10, 2011, from http://elib.uni-stuttgart.de/opus/
volltexte/2001/806/

1041
3D City Modeling and Visualization for Smart Phone Applications

Brenner, C. (2000b). Towards fully automatic Coors, V., Kray, C., Laakso, K., & Elting, C.
generation of city models. In I. Dowman, & L. (2005). Presenting route instructions on mobile
Janssen (Eds.), The International Archives of devices. In Dykes, J., MacEachren, A. M., &
the Photogrammetry and Remote Sensing: Vol. Kraak, M.-J. (Eds.), Exploring geo-visualization
XXXIII, Part B3 (pp. 85-92). (pp. 529–550). Elsevier Ltd.
Brenner, C. (2005). Building reconstruction Cord, M., Jordan, M., & Cocquerez, J.-P. (2001).
from images and laser scanning. International Accurate building structure recovery from high
Journal of Applied Earth Observation and Geo- resolution aerial images. Computer Vision and Im-
information, 6(3-4), 187–198. doi:10.1016/j. age Understanding, 82(2), 138–173. doi:10.1006/
jag.2004.10.006 cviu.2001.0905
Brenner, C. (2009). Extraction of features from Cornelis, N., Leibe, B., Cornelis, K., & Van Gool,
mobile laser scanning data for future driver as- L. (2008). 3D urban scene modeling integrating
sistance systems. In Sester, Z., Bernard, L., & recognition and reconstruction. International
Paelke, V. (Eds.), Advances in GIScience, Lecture Journal of Computer Vision, 78(2-3), 121–141.
Notes in Geoinformation and Cartography (pp. doi:10.1007/s11263-007-0081-9
25–42). Springer Verlag. doi:10.1007/978-3-642-
Curless, B., & Levoy, M. (1996, August). A volu-
00318-9_2
metric method for building complex models from
Brenner, C., & Haala, N. (1998). Rapid acquisition range images. Paper presented at the 23rd Annual
of virtual reality city models from multiple data Conference on Computer Graphics and Interac-
sources. In Chikatsu, H., & Shimizu, E. (Eds.), tive Techniques. Retrieved from http://graphics.
The International Archives of the Photogrammetry stanford.edu/papers/volrange/volrange.pdf
and Remote Sensing (Vol. XXXII, pp. 323–330).
D’Apuzzo, N. (2003). Surface measurement and
Buckley, S. J., Howell, J. A., Enge, H. D., & Kurz, tracking of human body parts from multi station
T. (2008). Terrestrial laser scanning in geology: video sequences. PhD Thesis, Institute of Geodesy
data acquisition, processing and accuracy con- and Photogrammetry, ETH Zurich, Switzerland.
siderations. Journal of the Geological Society, Retrieved from http://webfarm102b.ethz.ch/
165(3), 625–638. doi:10.1144/0016-76492007- berichte/Blaue_Berichte_PDF/81.pdf
100
Dorninger, P., & Pfeifer, N. (2008). A compre-
Chen, R., Kuusniemi, H., Hyyppä, J., Zhang, J., hensive automated 3D approach for building
Takala, J., & Kuittinen, R. (2010). Going 3D: extraction, reconstruction, and regularization
Personal nav and LBS. GPSWorld, 21(2), 14–18. from airborne laser scanning point clouds. Sen-
sors (Basel, Switzerland), 8(11), 7323–7343.
Collins, R. T., Jaynes, C. O., Cheng, Y. Q., Wang,
doi:10.3390/s8117323
X., Stolle, F., Riseman, E. M., & Hanson, A. R.
(1998). The ascender system: Automated site Fischer, A., Kolbe, T. H., Lang, F., Cremers, A.
modeling from multiple aerial images. Computer B., Förstner, W., Pluemer, L., & Steinhage, V.
Vision and Image Understanding, 72(2), 143–162. (1998). Extracting buildings from aerial images
doi:10.1006/cviu.1998.0729 using hierarchical aggregation in 2D and 3D.
Computer Vision and Image Understanding, 72(2),
Coors, V. (2008). Synthetic textures for 3d urban
185–203. doi:10.1006/cviu.1998.0721
models in pedestrian navigation. Proceedings of
TAMoCo 2008: Techniques and Applications for
Mobile Commerce, (pp. 46–53).

1042
3D City Modeling and Visualization for Smart Phone Applications

Fischler, M. A., & Bolles, R. C. (1981). Random Fraser, C. S., & Cronk, S. (2009). A hybrid mea-
sample consensus: a paradigm for model fitting surement approach for close-range photogram-
with applications to image analysis and automatec metry. ISPRS Journal of Photogrammetry and
cartography. Communications of the ACM, 24(6), Remote Sensing, 64(3), 328–333. doi:10.1016/j.
381–395. doi:10.1145/358669.358692 isprsjprs.2008.09.009
Fitzgibbon, A., & Zisserman, A. (1998). Automatic Früh, C., Jain, S., & Zakhor, A. (2005). Data
3D model acquisition and generation of new im- processing algorithms for generating textured
ages from video sequence. Proceedings of Euro- 3D building facade meshes from laser scans
pean Signal Processing Conference (EUSIPCO and camera images. International Journal of
‘98), (pp. 1261-1269). Computer Vision, 61(2), 159–184. doi:10.1023/
B:VISI.0000043756.03810.dd
Forlani, G., Nardinocchi, C., Scaioni, M., & Zinga-
retti, P. (2006). Complete classification of raw Früh, C., & Zakhor, A. (2004). An automated
LIDAR data and 3D reconstruction of buildings. method for large-scale, ground-based city
Pattern Analysis & Applications, 8(4), 357–374. model acquisition. International Journal of
doi:10.1007/s10044-005-0018-2 Computer Vision, 60(1), 5–24. doi:10.1023/
B:VISI.0000027787.82851.b6
Forlani, G., Roncella, R., & Remondino, F.
(2005). Structure and motion reconstruction of Fuchs, C. (1998). Extraktion polymorpher Bild-
short mobile mapping image sequences. Optical strukturen und ihre topologische und geometrische
3-D Measurement Techniques VII: Vol. 1, (pp. Gruppierung. PhD thesis, Institut für Photogram-
265-274). metrie, Rheinische Friedrich-Wilhelms-Univer-
sität zu Bonn, Germany.
Fowler, R., Samberg, A., Flood, M., & Greaves,
T. (2006). Topographic and terrestrial lidar. In Garland, M. (1999). Multiresolution modeling:
Maune, D. (Ed.), Digital elevation model tech- Survey and future opportunities. Proccedings of
nologies & applications (2nd ed., pp. 199–252). Eurographics, 99, 111–131.
Bethesda, MD: American Society for Photogram-
Garvin, J. B. (1996). Geological Society of Lon-
metry and Remote Sensing.
don Special Publication: Vol. 110. Topographic
Fradkin, M., Maitre, H., & Roux, M. (2001). characterization and monitoring of volcanoes via
Building detection from multiple aerial images in aircraft laser altimetry (pp. 137–153).
dense urban areas. Computer Vision and Image
Geospatial World. (2011). 3D laser scanning
Understanding, 82(3), 181–207. doi:10.1006/
market to double by 2015. Retrieved September
cviu.2001.0917
3, 2011, from http://geospatialworld.net/index.
Fraser, C. S. (1996). Network design. In Atkin- php?option=com_content&view=article&id=2
son, K. B. (Ed.), Close-range photogrammetry 2771%3A3d-laser-scanning-market-to-double-
and machine vision (pp. 256–282). UK: Whittles by-2015&catid=72%3Abusiness-market-survey-
Publishing. research&Itemid=1
Fraser, C. S., Baltsavias, E., & Gruen, A. (2002).
Processing of IKONOS imagery for submetre
3D positioning and building extraction. Inter-
national Journal of Photogrammetry and Remote
Sensing, 56(3), 177–194. doi:10.1016/S0924-
2716(02)00045-X

1043
3D City Modeling and Visualization for Smart Phone Applications

Goetz, M., & Zipf, A. (2010). Open issues in Gülch, E., & Müller, H. (2001, June). Semi-
bringing 3D to location based services (LBS) - A automatic building acquisition. Paper presented
review focusing on 3D data streaming and 3D at the Third International Workshop on Automatic
indoor navigation. In T. H. Kolbe, G. König, & Extraction of Man-Made Objects from Aerial and
C. Nagel (Eds.), The International Archives of Space Images. Retrieved from http://www.ipb.
the Photogrammetry, Remote Sensing and Spatial uni-bonn.de/projects/semi-automatic.html
Information Sciences, Vol. XXXVIII-4/W15 (pp.
Haala, N. (1996). Gebäuderekonstruktion durch
121-124).
Kombination von Bild- und Höhendaten. Ph.D
Grün, A., Remondino, F., & Zhang, L. (2004). thesis, Universität Stuttgart, Stuttgart, Germany.
3D modeling and visualization of large cultural Retrieved from http://elib.uni-stuttgart.de/opus/
heritage sites at very high resolution: The Bamiyan volltexte/2006/2714/pdf/Haala_diss.pdf
valley and its standing Buddhas. International
Haala, N., & Kada, M. (2010). An update on auto-
Archives of Photogrammetry and Remote Sensing,
matic 3D building reconstruction. ISPRS Journal
XXXV(Part B5), 603–608.
of Photogrammetry and Remote Sensing, 65(6),
Grün, A., & Wang, X. (1998a). CC-Modeler: 570–580. doi:10.1016/j.isprsjprs.2010.09.006
A topology generator for 3-D city models. IS-
Haala, N., Peter, M., Kremer, J., & Hunter, G.
PRS Journal of Photogrammetry and Remote
(2008). Mobile lidar mapping for 3D point cloud
Sensing, 53(5), 286–295. doi:10.1016/S0924-
collection in urban areas – A performance test.
2716(98)00011-2
The International Archives of the Photogram-
Grün, A., & Wang, X. (1998b). Acquisition and metry and Remote Sensing, Vol. XXXVII, Part
management of urban 3D data using CyberCity B5, V, 1119–1124.
modeler. Retrieved June 10, 2011, from www.
Hagedorn, B., Hildebrandt, D., & Dollner, J.
cybercity.tv/pub/ccm_3D_data.pdf
(2010). Towards advanced and interactive web
Grün, A., & Wang, X. (1999a, September). Cyber- perspective view services. In Neutens, T., & De
City modeller, a tool for interactive 3-D city model Mauer, P. (Eds.), Developments in 3D Geo-Infor-
generation. Paper presented at the Photogram- mation Sciences, Lecture Notes on Geoinformation
metric Week 99, Wichmann Veriag, Heideiberg. and Cartography (pp. 33–51). Springer Verlag.
doi:10.1007/978-3-642-04791-6_3
Grün, A., & Wang, X. (1999b, November). Cyber-
City spatial information system (CC-SIS): A new Henricsson, O. (1996). Analysis of image
concept for the management of 3-D city models in structures using color attributes and similarity
a hybrid GIS. Paper presented in The 20th Asian relations. Ph.D thesis, Institut für Geodäsie und
Conference on Remote Sensing. Retrieved from Photogrammetrie, ETH Zürich, Zürich. Retrieved
http://www.geod.ethz.ch/p02/general/persons/ from http://e-collection.library.ethz.ch/eserv/
AG_pub/cc-sis_acrs.pdf eth:40304/eth-40304-01.pdf
Grün, A., Zhang, L., & Eisenbeiss, H. (2005, Henricsson, O., & Baltsavias, E. (1997). 3-D build-
March). 3D Precision processing of high resolution ing reconstruction with ARUBA: A qualitative and
satellite imagery. Paper presented at the ASPRS quantitative evaluation. In Grün, A., Baltsavias, E.,
2005 Annual Conference. Retrieved from ftp:// & Henricsson, O. (Eds.), Automatic extraction of
ftp.ecn.purdue.edu/jshan/proceedings/asprs2005/ man-made objects from aerial and space images
Files/0048.pdf (II) (pp. 65–76). Basel, Switzerland: Birkhäuser.
doi:10.1007/978-3-0348-8906-3_7

1044
3D City Modeling and Visualization for Smart Phone Applications

Hildebrandt, D., Hagedorn, B., & Dollner, J. Hyyppä, J., Hyyppä, H., Yu, X., Kaartinen, H.,
(2010). Image-based, interactive visualization Kukko, A., & Holopainen, M. (2009). Forest
of complex 3D geovirtual environments on light- inventory using small-footprint airborne lidar.
weight devices. Proceedings of the 7th Interna- In Shan, J., & Toth, C. (Eds.), Topographic laser
tional Symposium on LBS and TeleCartography, ranging and scanning: Principles and processing
(pp. 52-71). (pp. 335–370). London, UK: CRC Press, Taylor
& Fracis.
Hofmann, A., Maas, H.-G., & Streilein, A. (2003).
Derivation of roof types by cluster analysis in Hyyppä, J., Matikainen, L., & Kaartinen, H. Yu.
parameter spaces of airborne laser scanner point X., Hyyppä, H., & Ronnholm, P. (2007). Improv-
clouds. Proceedings of the ISPRS Working Group ing automation in map updating based on national
III/3 Workshop, 3-D Reconstruction from Airborne laser scanning, classification trees, object-based
Lasers Canner and InSAR Data. change detection and 3D object reconstruction.
Proceedings 2007 Urban Remote Sensing Confer-
Hoppe, H. (1998). Smooth view-dependent level-
ence, (pp. 1-10).
of-detail control and its application to terrain ren-
dering. Proceedings of Ninth IEEE Visualization Hyyppä, J., Wagner, W., Hollaus, M., & Hyyppä,
(VIS ‘98), (pp. 35-42). H. (2009). Airborne laser scanning. In Warner,
T. A., Nellis, M. D., & Foody, G. M. (Eds.), The
Hug, C. (1997). Extracting artificial surface ob-
SAGE handbook of remote sensing (pp. 199–212).
jects from airborne laser scanner data. In Grün,
London, UK: SAGE Publications.
A., Baltsavias, E. P., & Henricson, O. (Eds.),
Automatic extraction of man-made objects from Jaakkola, A., Hyyppä, J., Hyyppä, H., & Kukko,
aerial and space images (II) (pp. 203–212). Basel, A. (2008). Retrieval algorithms for road surface
Switzerland: Birkhauser Verlag. doi:10.1007/978- modeling using laser-based mobile mapping.
3-0348-8906-3_20 Sensors (Basel, Switzerland), 8(9), 5238–5249.
doi:10.3390/s8095238
Hyyppä, J., & Engdahl, M. (2000). Verification
of the capability of repeat-pass ERS-1/2 SAR in- Jaakkola, A., Hyyppä, J., Kukko, A., Yu, X.,
terferometry, to provide digital elevations models Kaartinen, H., Lehtomäki, M., & Lin, Y. (2010).
and the impact of tree height and canopy closure A low-cost multi-sensoral mobile mapping
on SAR-derived terrain height in boreal forests. system and its feasibility for tree measure-
The Photogrammetric Journal of Finland, 16(2), ments. ISPRS Journal of Photogrammetry and
16–26. Remote Sensing, 65(6), 514–522. doi:10.1016/j.
isprsjprs.2010.08.002
Hyyppä, J., Hyyppä, H., Leckie, D., Gougeon, F.,
Yu, X., & Maltamo, M. (2008). Review of meth- Jacobsen, K. (2004). DTM generation by SPOT5
ods of small-footprint airborne laser scanning for HRS. International Archives of Photogrammetry
extracting forest inventory data in boreal forests. and Remote Sensing, XXXV(Part B1), 439–444.
International Journal of Remote Sensing, 29(5),
Jaynes, C., Riseman, E., & Hanson, A. (2003).
1339–1366. doi:10.1080/01431160701736489
Recognition and reconstruction of buildings from
multiple aerial images. Computer Vision and Im-
age Understanding, 90(1), 68–98. doi:10.1016/
S1077-3142(03)00027-4

1045
3D City Modeling and Visualization for Smart Phone Applications

Jung, F. (2004). Detecting building changes from Kraus, K., & Pfeifer, N. (1998). Determination of
multitemporal aerial stereopairs. ISPRS Journal terrain models in wooded areas with airborne laser
of Photogrammetry and Remote Sensing, 58(3- scanner data. ISPRS Journal of Photogrammetry
4), 187–201. doi:10.1016/j.isprsjprs.2003.09.005 and Remote Sensing, 53, 193–203. doi:10.1016/
S0924-2716(98)00009-4
Kaartinen, H., & Hyyppä, J. (2006). EuroSDR-
project commission 3 “evaluation of building Kukko, A. (2009). Road environment mapper—3D
extraction”, final report. (EuroSDR, Official data capturing with mobile mapping. Licentiate’s
Publication No. 50). MAA: European Spatial Thesis, Helsinki University of Technology, Espoo
Data Research. Finland.
Kada, M., & McKinley, L. (2009). 3D build- Lang, F. (1999). Geometrische und semantische
ing reconstruction from LiDAR based on a cell Rekonstruktion von Geb¨auden durch Ableitung
decomposition approach. In The International von 3D-Geb¨audeecken. Ph.D thesis, Institut
Archives of the Photogrammetry, Remote Sensing für Photogrammetrie, Rheinische Friedrich-
and Spatial Information Sciences, Vol. XXXVIII, Wilhelms-Universität zu Bonn, Bonn, Germany.
Part 3/W4 (pp. 47-52). Retrieved from http://elib.uni-stuttgart.de/opus/
volltexte/2006/2714/pdf/Haala_diss.pdf
Kager, H. (2004). Discrepancies between overlap-
ping laser scanning strips- Simultaneous fitting Le Besnerais, G., Sanfourche, G. M., & Champag-
of aerial laser scanner strips. The International nat, F. (2008). Dense height map estimation from
Archives of the Photogrammetry, Remote Sensing oblique aerial image sequences. Computer Vision
and Spatial Information Science, Vol. XXXV, Part and Image Understanding, 109(2), 204–225.
B/1 (pp. 555 – 560). doi:10.1016/j.cviu.2007.07.003
Kim, T., Park, S.-R., Kim, M.-G., Jung, S., & Lee, D. S., Shan, J., & Bethel, J. S. (2002). Class-
Kim, K.-O. (2004). Tracking road centerlines guided building extraction from IKONOS imag-
from high resolution remote sensing images by ery. Photogrammetric Engineering and Remote
least squares correlation matching. Photogram- Sensing, 69(2), 143–150.
metric Engineering and Remote Sensing, 70(12),
Lindstrom, P., & Pascucci, V. (2001). Visualiza-
1417–1422.
tion of large terrains made easy. Proceedings of
Koller, D., Lindstrom, P., Ribarsky, W., Hodges, IEEE Visualization, 01, 363–370.
L., Faust, N., & Turner, G. (1994).Virtual GIS:
Liu, Z., Zhang, J., & Yan, Q. (2010, September).
A real-time 3D geographic information system.
Implementation of a 3D personal navigation
Proceedings of ARPA Image Understanding
visualization engine and its demonstration in
Workshop 94, i(pp. 94-100).
Shanghai World Exposition 2010. Paper presented
Kraus, K., & Otepka, J. (2005, September). DTM at the 7th International Symposium on LBS and
modeling and visualization-The SCOP approach. TeleCartography, Guangzhou, China.
Paper presented at the Photogrammetric Week 05,
Livny, Y., Kogan, Z., & El-Sana, J. (2009). Seam-
Stuttgart, Germany. Retrieved from http://www.
less patches for GPU-based terrain rendering. The
ipf.tuwien.ac.at/publications/kk_jo_photogram-
Visual Computer, 25(3), 197–208. doi:10.1007/
metric_week_2005.pdf
s00371-008-0214-3

1046
3D City Modeling and Visualization for Smart Phone Applications

Livny, Y., Sokolovsky, N., Grinshpoun, T., & El- NAVTEQ. (2010). NAVTEQ website. Retrieved
Sana, J. (2008). A GPU persistent grid mapping December 11, 2010, from http://corporate.navteq.
for terrain rendering. The Visual Computer, 24(2), com/webapps/NewsUserServlet?action=NewsDe
139–153. doi:10.1007/s00371-007-0180-1 tail&newsId=946&lang=en&englishonly=false
Lodi, A., Martello, S., & Monaci, M. (2002). Two- Paparoditis, N., Cord, M., Jordan, M., & Coc-
dimensional packing problems: A survey. Euro- querez, J.-P. (1998). Building detection and
pean Journal of Operational Research, 141(2), reconstruction from mid- and high-resolution
241–252. doi:10.1016/S0377-2217(02)00123-6 aerial imagery. Computer Vision and Image
Understanding, 72(2), 122–142. doi:10.1006/
Luhmann, T. (2010). Close range photogram-
cviu.1998.0722
metry for industrial applications. ISPRS Journal
of Photogrammetry and Remote Sensing, 64(3), Pei, L., Chen, R., Liu, J., Tenhunen, T., Kuus-
558–569. doi:10.1016/j.isprsjprs.2010.06.003 niemi, H., & Chen, Y. (2010). An inquiry-based
Bluetooth indoor positioning approach for the
Ma, R. (2004). Building model reconstruction
Finnish Pavilion at Shanghai World Expo 2010.
from LiDAR data and aerial photographs. Doc-
Proceedings of IEEE/ION PLANS 2010, (pp.
toral thesis, The Ohio State University, USA.
1002-1009).
Retrieved from http://www.asprs.org/a/publica-
tions/proceedings/reno2006/0088.pdf Peng, J., & Liu, Y. (2005). Model and context-
driven building extraction in dense urban aerial
Matikainen, L., Hyyppä, J., & Hyyppä, H. (2003).
images. International Journal of Remote Sensing,
Automatic detection of buildings from laser scan-
26(7), 1289–1307. doi:10.1080/0143116051233
ner data for map updating. In H.-G. Maas, G. Vos-
1326675
selman, & A. Streilein (Eds.), The International
Archives of Photogrammetry, Remote Sensing and Poli, D. (2005). Modeling of spaceborne linear ar-
Spatial Information Sciences, Vol. XXXIV, Part 3/ ray sensors. Doctoral thesis, Institute of Geodesy
W13 (pp. 218-224). and Photogrammetry, ETH Zurich, Switzerland.
Retrieved from http://webfarm102b.ethz.ch/
Mayer, H. (1999). Automatic object extraction
berichte/Blaue_Berichte_PDF/85.pdf
from aerial imagery - A survey focusing on build-
ings. Computer Vision and Image Understanding, Pollefeys, M., Nister, D., Frahm, J.-M., Akbar-
74(2), 138–149. doi:10.1006/cviu.1999.0750 zadeh, A., Mordohai, P., & Clipp, B. (2008).
Detailed real-time urban 3D reconstruction from
McGlone, J. C., Mikhail, E. M., Bethel, J., & Mul-
video. International Journal of Computer Vision,
len, R. (Eds.). (2004). Manual of photogrammetry
78, 143–167. doi:10.1007/s11263-007-0086-4
(5th ed.). USA: American Society of Photogram-
metry and Remote Sensing. Pu, S. (2008). Automatic building modeling from
terrestrial laser scanning. In Oosterom, P., Zlatano-
Melzer, T. (2007). Non-parametric segmentation
va, S., Penninga, F., & Fendel, E. (Eds.), Advances
of ALS point clouds using mean shift. Journal
in 3D geoinformation systems (pp. 147–160).
of Applied Geodesy, 1(3), 159–170. doi:10.1515/
Berlin, Germany: Springer. doi:10.1007/978-3-
jag.2007.018
540-72135-2_9
Mikhail, E. M., Bethel, J., & McGlone, J. C.
(2001). Introduction to modern photogrammetry.
New York, NY: John Wiley & Sons, Inc.

1047
3D City Modeling and Visualization for Smart Phone Applications

Pu, S., & Vosselman, G. (2009). Knowledge Sampath, A., & Shan, J. (2010). Segmentation
based reconstruction of building models from and reconstruction of polyhedral building roofs
terrestrial laser scanning data. ISPRS Journal from aerial LiDAR point clouds. IEEE Transac-
of Photogrammetry and Remote Sensing, 64(6), tions on Geoscience and Remote Sensing, 48(3),
575–584. doi:10.1016/j.isprsjprs.2009.04.001 1554–1567. doi:10.1109/TGRS.2009.2030180
Rabin, R. M., & Martin, D. W. (1996). Satel- Schneider, J., & Westermann, R. (2006). GPU-
lite observations of shallow cumulus coverage friendly high-quality terrain rendering. Journal
over the central United States: An exploration of WSCG 2006, 14(1213-6972), 49-56.
of land use impact on cloud cover. Journal of
Sharkawi, H., Ujang, M. U., & Abdul-Rahman, A.
Geophysical Research, 101(D3), 7149–7155.
(2007, November). Indoor and outdoor navigation
doi:10.1029/95JD02891
based on 3D game engine. Paper presented at the
Remondino, F. (2002). Human body reconstruction Joint International Symposium and Exhibition
from image sequences. In Van Gool, L. (Ed.), Pat- on Geoinformation and International Sympo-
tern Recognition (Vol. 2449, pp. 50–57). Lecture sium and Exhibition on GPS/GNSS 2007, Johor
Notes in Computer Science Zurich, Switzerland: Bahru, Johor.
Springer Verlag. doi:10.1007/3-540-45783-6_7
Simple 3D. (2011). 3D Scanners, digitizers, and
Remondino, F. (2004). 3-D reconstruction of static software for making 3D models and 3D measure-
human body shape from image sequence. Journal ments. Retrieved June 8, 2011, from http://www.
of Computer Vision and Image Understanding, simple3d.com/#software
93(1), 65–85. doi:10.1016/j.cviu.2003.08.006
Sithole, G., & Vosselman, G. (2004). Experimen-
Rottensteiner, F., & Briese, Ch. (2003). Automatic tal comparison of filter algorithms for bare Earth
generation of building models from LIDAR data extraction from airborne laser scanning point
and the integration of aerial images. In: H.-G. clouds. ISPRS Journal of Photogrammetry and
Maas, G. Vosselman, & A. Streilein (Eds.), Pro- Remote Sensing, 59(1-2), 85–101. doi:10.1016/j.
ceedings of the ISPRS WG III/3 Workshop on ´3-D isprsjprs.2004.05.004
Reconstruction from Airborne Laserscanner and
Skaloud, J. (1999). Optimizing georeferencing of
InSAR data´, vol. XXXIV, Part 3/W13 (pp. 174-
airborne survey systems by INS/DGPS. Doctoral
180). Dresden: IAPRS.
thesis, University of Calgary, Canada. Retrieved
Rottensteiner, F., Trinder, J., Clode, S., & Ku- from https://dspace.ucalgary.ca/bitstream/1880/
bik, K. (2005a). Automated delineation of roof 25307/2/38506Skaloud.pdf
planes from LiDAR data. In G. Vosselman, &
Sohn, G., & Dowman, I. (2007). Data fusion of
C. Brenner (Eds.), The International Archives of
high-resolution satellite imagery and LiDAR data
the Photogrammetry, Remote Sensing and Spatial
for automatic building extraction. ISPRS Journal
Information Sciences, Vol. XXXVI, Part 3/W4
of Photogrammetry and Remote Sensing, 62(1),
(pp. 221-226).
43–63. doi:10.1016/j.isprsjprs.2007.01.001
Rottensteiner, F., Trinder, J., Clode, S., & Kubik,
Sohn, G., Huang, X., & Tao, V. (2008). Using a
K. (2005b). Using the Dempster–Shafer method
binary space partitioning tree for reconstructing
for the fusion of LIDAR data and multispectral
polyhedral building models from airborne LiDAR
images for building detection. Information Fusion,
data. Photogrammetric Engineering and Remote
6(4), 283–300. doi:10.1016/j.inffus.2004.06.004
Sensing, 74(11), 1425–1438.

1048
3D City Modeling and Visualization for Smart Phone Applications

Sörgel, U., Michaelsen, E., Thiele, A., Cadario, Tian, Y., Gerke, M., Vosselman, G., & Zhu, Q.
E., & Thoennessen, U. (2009). Stereo analysis of (2010). Knowledge-based building reconstruction
high-resolution SAR images for building height from terrestrial video sequences. ISPRS Journal
estimation in cases of orthogonal aspect direc- of Photogrammetry and Remote Sensing, 65(4),
tions. ISPRS Journal of Photogrammetry and 395–408. doi:10.1016/j.isprsjprs.2010.05.001
Remote Sensing, 64(5), 490–500. doi:10.1016/j.
TLS software survey. (2011). ISPRS website.
isprsjprs.2008.10.007
Retrieved June 8, 2011 from http://www.cage.
Sörgel, U., Schulz, K., Thoennessen, U., & Stilla, curtin.edu.au/~gordonsj/isprs_wgv3/data/ispr-
U. (2005). Integration of 3D data in SAR mission swgv3_tls_software_survey.pdf
planning and image interpretation in urban areas.
Tournaire, O., Bredif, M., Boldo, D., & Durupt,
An International Journal on Information Fusion,
M. (2010). An efficient stochastic approach for
6(4), 301–310. doi:10.1016/j.inffus.2004.06.007
building footprint extraction from digital elevation
Sörgel, U., Thoennessen, U., Brenner, A., & models. ISPRS Journal of Photogrammetry and
Stilla, U. (2006). High-resolution SAR data. New Remote Sensing, 65(4), 317–327. doi:10.1016/j.
opportunities and challenges for the analysis of isprsjprs.2010.02.002
urban areas. IEE Proceedings. Radar, Sonar
Toutin, T. (2004). Comparison of stereo-extracted
and Navigation, 153(3), 294–300. doi:10.1049/
DTM from different high-resolution sensors:
ip-rsn:20045088
SPOT-5, EROS-A, IKONOS-II, and QuickBird.
Stamos, I., & Allen, P. K. (2002). Geometry and IEEE Transactions on Geoscience and Remote
texture recovery of scenes of large scale. Computer Sensing, 42(10), 2121–2129. doi:10.1109/
Vision and Image Understanding, 88(2), 94–118. TGRS.2004.834641
doi:10.1006/cviu.2002.0963
Ulrich, T. (2000, July). Rendering massive terrains
Stulp, F., & Fisher, R. (2001). Reconstruction of using chunked level of detail. Paper presented
surfaces behind occlusions in range images. Pro- at the SIGGRAPH 2000 Conference Abstracts
ceedings of the 3rd International Conference on and Applications, Scaling up to Massive Virtual
3-D Digital Images and Modeling, (pp. 232-239). Worlds. Retrieved from http://tulrich.com/geek-
stuff/sig-notes.pdf
Süveg, I., & Vosselman, G. (2004). Reconstruction
of 3D building models from aerial images and Verma, V., Kumar, R., & Hsu, S. (2006). 3D build-
maps. ISPRS Journal of Photogrammetry and ing detection and modeling from aerial LiDAR
Remote Sensing, 58(3-4), 202–224. doi:10.1016/j. data. Proceedings of the 2006 IEEE Computer
isprsjprs.2003.09.006 Society Conference on Computer Vision and
Pattern Recognition (CVPR’06, (pp. 2213-2220).
Technology Review Website. (2011). Technology
review of C3 technologies. Retrieved November Vögtle, T., & Steinle, E. (2003). On the quality
20, 2011, from http://www.technologyreview. of object classification and automated building
com/computing/37466/ modeling based on laserscanning data. In H.-G.
Maas, G. Vosselman, & A. Streilein (Eds.), The
TerraSolid. (2003). TerraScan user’s guide.
International Archives of Photogrammetry, Re-
mote Sensing and Spatial Information Sciences,
Vol. XXXIV, Part 3/W13 (7p.).

1049
3D City Modeling and Visualization for Smart Phone Applications

Vosselman, G. (1999). Building reconstruction Wagner, W., Ullrich, A., Ducic, V., Melzer, T.,
using planar faces in very high density height & Studnicka, N. (2006). Gaussian decomposi-
data. In The International Archives of the Photo- tion and calibration of a novelsmall-footprint
grammetry and Remote Sensing, Vol. XXXII, Part full-waveform digitising airborne laser scan-
3-2W5 (pp. 87-92). ner. ISPRS Journal of Photogrammetry and
Remote Sensing, 60(2), 100–112. doi:10.1016/j.
Vosselman, G. (2002). Fusion of laser scanning
isprsjprs.2005.12.001
data, maps, and aerial photographs for building
reconstruction. In IEEE International Geoscience Wehr, A., & Lohr, U. (1999). Airborne laser
and Remote Sensing Symposium (IGARSS’02), scanning – An introduction and overview. IS-
Vol. 1 (pp.85-88). PRS Journal of Photogrammetry and Remote
Sensing, 54(2-3), 68–82. doi:10.1016/S0924-
Vosselman, G., & Dijkman, S. (2001). 3D build-
2716(99)00011-8
ing model reconstruction from point clouds and
ground plans. In The International Archives of Wikipedia. (2011). Level of detail. Retrieved
the Photogrammetry and Remote Sensing, Vol. June 8, 2011, from http://en.wikipedia.org/wiki/
XXXIV, Part 3W4 (pp. 37-43). Level_of_detail
Vosselman, G., Gorte, B. G. H., Sithole, G., & Zhang, K., Yan, J., & Chen, S. (2006). Automatic
Rabbani, T. (2004). Recognising structure in la- construction of building footprints from airborne
ser scanner point clouds. In M. Thies, B. Koch, LIDAR data. IEEE Transactions on Geosci-
H. Spiecker, & H. Weinacker (Eds.), The Inter- ence and Remote Sensing, 44(9), 2523–2533.
national Archives of Photogrammetry, Remote doi:10.1109/TGRS.2006.874137
Sensing and Spatial Information Sciences: Vol.
Zhang, L., & Gruen, A. (2006). Multi-image
XXXVI, Part 8/W2 (pp.33-38).
matching for DSM generation from IKONOS
Vosselman, G., & Maas, H.-G. (Eds.). (2010). imagery. ISPRS Journal of Photogrammetry and
Airborne and terrestrial laser scanning. Scotland: Remote Sensing, 60(3), 195–211. doi:10.1016/j.
Whittles Publishing. isprsjprs.2006.01.001
Vosselman, G., & Süveg, I. (2001). Map based Zhao, H., & Shibasaki, R. (2003). Reconstructing
building reconstruction from laser data and images. a textured CAD model of an urban environment
In Automatic Extraction of Man-Made Objects using vehicle-borne laser range scanners and line
from Aerial and Space Images (III) (pp. 231–239). cameras. Machine Vision and Applications, 14(1),
Ascona, Switzerland: Balkema Publishers. 35–41. doi:10.1007/s00138-002-0099-5
Vu, T., Yamazaki, F., & Matsuoka, M. (2009). Zhao, H., & Shibasaki, R. (2005). Updating digital
Multi-scale solution for building extraction from geographic database using vehicle-borne laser
LiDAR and image data. International Journal of scanners and line cameras. Photogrammetric
Applied Earth Observation and Geoinformation, Engineering and Remote Sensing, 71(4), 415–424.
11(4), 281–289. doi:10.1016/j.jag.2009.03.005

1050
3D City Modeling and Visualization for Smart Phone Applications

Zhu, L., Hyyppä, J., Kukko, A., Jaakkola, A., Hyyppä, J. (2011). State of the art in laser scanning.
Lehtomäki, M., & Kaartinen, H. … Chen, Y. (2009, In Fritsch, D. (Ed.), Photogrammetric Week’11 (pp.
May). 3D city model for mobile phone by using 203–216). Berlin, Germany: Wichmann Verlag.
MMS data. Paper presented at the IEEE 2009
Hyyppä, J., Hyyppä, H., Yu, X., Kaartinen, H.,
Urban Remote Sensing Joint Events. Retrieved
Kukko, A., & Holopainen, M. (2009). Forest
from http://ieeexplore.ieee.org/xpl/freeabs_all.
inventory using small-footprint airborne lidar.
jsp?arnumber=5137691
In Shan, J., & Toth, C. (Eds.), Topographic laser
Zhu, L., Hyyppä, J., Kukko, A., Kaartinen, H., & ranging and scanning: Principles and processing
Chen, R. (2011). Photorealistic 3D city modeling (pp. 335–370). London, UK: CRC Press, Taylor
from mobile laser scanning data. Remote Sensing, & Francis.
3(7), 1406–1426. doi:10.3390/rs3071406
Hyyppä, J., Jaakkola, A., Hyyppä, H., Kaartinen,
Zlatanova, S., & Verbree, E. (2003). Technological H., Kukko, A., Holopainen, M., et al. (2009). Map
developments within 3D location-based services. updating and change detection using vehicle-based
Proceedings of the International Symposium laser scanning. Proceedings of the 2009 Urban
and Exhibition on Geoinformation 2003, (pp. Remote Sensing Joint Event, (pp. 1-6).
153-160).
Jaakkola, A., Hyyppä, J., Kukko, A., Yu, X.,
Kaartinen, H., Lehtomäki, M., & Lin, Y. (2010).
A low-cost multi-sensoral mobile mapping
ADDITIONAL READING
system and its feasibility for tree measure-
Alho, P., Kukko, A., Hyyppä, H., Kaartinen, H., ments. ISPRS Journal of Photogrammetry and
& Hyyppä, J. (2009). Application of boat-based Remote Sensing, 65(6), 514–522. doi:10.1016/j.
laser scanning for river survey. [Letters to ESEX]. isprsjprs.2010.08.002
Earth Surface Processes and Landforms, 34(13), Kukko, A. (2009). Road environment mapper—3D
1831–1838. doi:10.1002/esp.1879 data capturing with mobile mapping. Licentiate’s
Brenner, C. (2005). Building reconstruction Thesis, Helsinki University of Technology, Espoo
from images and laser scanning. International Finland.
Journal of Applied Earth Observation and Geo- Kukko, A., & Hyyppä, J. (2009). Small-footprint
information, 6(3-4), 187–198. doi:10.1016/j. laser scanning simulator for system validation,
jag.2004.10.006 error assessment, and algorithm development.
Chen, R., Kuusniemi, H., Hyyppä, J., Zhang, J., Photogrammetric Engineering and Remote Sens-
Takala, J., & Kuittinen, R. (2010). Going 3D: ing, 75(10), 1177–1189.
Personal nav and LBS. GPSWorld, 21(2), 14–18. Lehtomäki, M., Jaakkola, A., Hyyppä, J., Kukko,
Gülch, E., Kaartinen, H., & Hyyppä, J. (2009). A., & Kaartinen, H. (2010). Detection of vertical
Quality of buildings extracted from airborne laser pole-like objects in a road environment using
scanning data: Results of an empirical investiga- vehicle-based laser scanning data. Remote Sens-
tion on 3D building reconstruction, 535-574. In J. ing, 2(3), 641–664. doi:10.3390/rs2030641
Shan, & C. Toth (Eds.), Topographic laser ranging Lin, Y., & Hyyppä, J. (2011). K-segments-based
and scanning: Principles and processing. London, geometric modeling of VLS scan lines. IEEE
UK: CRC Press, Taylor & Francis. Geoscience and Remote Sensing Letters, 8(1),
93–97. doi:10.1109/LGRS.2010.2051940

1051
3D City Modeling and Visualization for Smart Phone Applications

Lin, Y., Jaakkola, A., Hyyppä, J., & Kaartinen, H. Zhu, L., Hyyppä, J., Kukko, A., Kaartinen, H., &
(2010). From TLS to VLS: Biomass estimation Chen, R. (2011). Photorealistic 3D city modeling
at individual tree level. Remote Sensing, 2(8), from mobile laser scanning data. Remote Sensing,
1864–1879. doi:10.3390/rs2081864 3(7), 1406–1426. doi:10.3390/rs3071406
Matikainen, L., Hyyppä, J., Ahokas, E., Markelin,
L., & Kaartinen, H. (2010). Automatic detection
KEY TERMS AND DEFINITIONS
of buildings and changes in buildings for updat-
ing of maps. Remote Sensing, 2(5), 1217–1248. 3D city Models: Scenes are represented by
doi:10.3390/rs2051217 geometric elements such as points, lines and
Mikhail, E. M., Bethel, J., & McGlone, J. C. surfaces in 3D space, and rendering the textures
(2001). Introduction to modern photogrammetry. onto the surfaces of geometry.
New York, NY: John Wiley & Sons, Inc. 3D Modeling: It is the process to develop the
mathematics methods to represent the objects in
Puttonen, E., Jaakkola, A., Litkey, P., & Hyyppä, J. 3D space.
(2011). Tree classification with fused mobile laser Airborne Laser Scanning: a method based
scanning and hyperspectral data. Sensors (Basel, on laser (LiDAR) range measurements from an
Switzerland), 11(5), 5158–5182. doi:10.3390/ aircraft consisting of a laser scanner, GPS/IMU
s110505158 and a computer for collecting point clouds typi-
Rönnholm, P., Hyyppä, H., Hyyppä, J., & Hag- cally for topographic surveys.
grén, H. (2009). Orientation of airborne laser Laser Scanning: a method based on laser
scanning point clouds with multi-view, multi-scale (LiDAR) range measurements providing a point
image blocks. Sensors (Basel, Switzerland), 9(8), cloud of objects scanned.
6008–6027. doi:10.3390/s90806008 Mobile Laser Scanning: a method based
on laser (LiDAR) range measurements from a
Vaaja, M., Hyyppä, J., Kukko, A., Kaartinen, H., system which consists of a laser scanner, GPS/
Hyyppä, H., & Alho, P. (2011). Mapping topog- IMU and also a computer, mounted on a moving
raphy changes and elevation accuracies using platform e.g. a van, a car or a boat for collecting
a mobile laser scanner. Remote Sensing, 3(3), point clouds from the surrounding environment.
587–600. doi:10.3390/rs3030587 Photogrammetry: It is the process of acquir-
Vosselman, G., & Maas, H.-G. (Eds.). (2010). ing information from images by establishing the
Airborne and terrestrial laser scanning. Scotland: geometric relationship between images and scene
Whittles Publishing. and interpreting the images.
Terrestrial Laser Scanning: a method based
Wagner, W., Hyyppä, J., Ullrich, A., Lehner, H., on laser (LiDAR) range measurements from a
Briese, C., & Kaasalainen, S. (2008). Radiometric laser scanner mounted on a tripod for collecting
calibration of full-waveform small-footprint air- point clouds from the surrounding environment.
borne laser scanners. The International Archives of
the Photogrammetry. Remote Sensing and Spatial
Information Sciences, XXXVII(Part B1), 163–168.

This work was previously published in Ubiquitous Positioning and Mobile Location-Based Services in Smart Phones, edited
by Ruizhi Chen, pp. 254-296, copyright 2012 by Information Science Reference (an imprint of IGI Global).

1052
1053

Chapter 63
Assessing the Success of
an Industrial Cluster
Neil Reid
The University of Toledo, USA

Bruce W. Smith
Bowling Green State University, USA

ABSTRACT
Industrial clusters have received considerable attention as a regional development strategy. While their
efficacy has been debated by academics, clusters have become popular among practitioners. Despite
clusters’ acceptance, there have been few attempts to measure their success or their impact on constituent
firms. This paper outlines and discusses the metrics developed to evaluate the success of the northwest
Ohio greenhouse cluster. The cluster was launched in 2004 to help the industry become more competi-
tive though collaborative problem solving. In identifying success metrics, the authors were cognizant
of the fact that they had to reflect the cluster’s objectives and goals. Thus metrics that measured the
impact of branding and marketing efforts, reducing energy costs, and increasing collaboration among
cluster stakeholders were developed. The work reported in this paper is only the beginning phases of a
longer-term, on-going effort to track the progress and success of the northwest Ohio greenhouse cluster.

INTRODUCTION have widely adopted the cluster strategy. In 2002,


van der Linde (2002) identified over 800 cluster
Cluster-based economic development strategies projects globally. More recently, Sölvell (2008)
have generated substantial attention in recent reported that there were over one thousand cluster
years. Even though the benefits of clusters have initiatives in Europe alone. Clusters have also been
been questioned in the academic literature (Mar- embraced in the United States. For example, the
tin & Sunley, 2003; Taylor, 2006), practitioners National Governors Association (NGA, 2007)
published Cluster-Based Strategies for Growing
DOI: 10.4018/978-1-4666-2038-4.ch063 State Economies and subsequently announced

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Assessing the Success of an Industrial Cluster

that it was launching a policy academy for states is necessary in order to attract funding and partici-
using cluster analysis and innovation-based pants (Sölvell, 2008). Those were the motivations
economic development strategies. Also, Muro behind the greenhouse cluster management team
and Fikri (2011, p. 1) argued that cluster-based developing measures to assess the success of the
strategies are particularly attractive for governors northwest Ohio greenhouse cluster.
concerned with revitalizing state economies since
they “provide a direct route to economic renewal
because they build on existing assets to promote PURPOSE
growth in regions by enhancing the interactions
by which firms complete transactions, share ideas, The purpose of this paper is to discuss the initial
start new enterprises, and create jobs.” measures that have been developed to evaluate the
Despite the acceptance of clusters by many success of a greenhouse cluster which has been
academics and policy makers, there have been operating in northwest Ohio since 2004. Success
few attempts to objectively measure the success of for this paper is defined as the benefits that the
clusters or the impact of a cluster on the constitu- cluster has generated for the participating firms and
ent firms (Feser et al., 2008; Fromhold-Eisebith the local industry. An assessment of the impact of
& Eisebith, 2005; Schmiedeberg, 2010). Sölvell the cluster on the regional economy is outside the
(2008) reported that only five cluster organiza- domain of this paper. The work reported in this
tions out of 50 he surveyed had completed cluster paper is only the beginning phases of a longer-
evaluations. A few academic researchers have term, on-going effort to track the progress and
investigated the impact of clusters on specific success of the cluster.
economic conditions. For example, Feser et al. In the remainder of this paper, we begin by
(2008) analyzed the affect of clusters on job growth providing background information on the clus-
and new business formation in Appalachia. Also ter, which is necessary to understand the metrics
Delgado et al. (2010) investigated the relationship that have been developed. Also data collection
between clusters and entrepreneurship. As a final procedures are outlined. This is followed by a
example, Fowler and Kleit (2010) assessed the discussion of the results.
impact of cluster activity on areas’ poverty rates.
While such studies contribute to our general un-
derstanding of regional development, they do not THE NORTHWEST OHIO
necessarily inform about measuring the success GREENHOUSE CLUSTER
of a specific cluster project.
Fromhold-Eisebith and Eisebith (2005) sug- While northwest Ohio’s greenhouse industry can
gested that the heterogeneity of clusters leads trace its existence back to the late 19th century, the
to intricate methodological issues that inhibit organization of the industry as a formal industrial
evaluation. Schmiedeberg (2010) and Sölvell cluster began in 2004. The cluster was initiated by
(2008) discuss a variety of such problems in their university researchers at the University of Toledo
overviews of cluster policy evaluation. Moreover, and Bowling Green State University, with strong
Fromhold-Eisebith and Eisebith (2005) noted that support from U.S. Congresswoman Marcy Kaptur.
there may be a vested interest on the part of cluster Funding for the cluster project has been provided
management and public officials not to have an by the United States Department of Agriculture.
evaluation because it may yield undesirable re- The members of the cluster, including green-
sults. Nonetheless, for those persons managing a house owners, university researchers, industry
cluster, an evaluation of the progress of a cluster suppliers, and other community stakeholders, have

1054
Assessing the Success of an Industrial Cluster

worked collaboratively to identify and implement When, in 2004, Congresswoman Kaptur ap-
solutions to common problems that the local in- proached the University of Toledo to develop and
dustry has been experiencing. Those challenges implement a strategy to assist the industry, uni-
include a weak market presence, international versity researchers, in partnership with colleagues
competition, dated infrastructure and production from Bowling Green State University (and later
technology, and high and rising energy costs Ohio State University and Indiana State Univer-
(LaFary et al., 2006). As a result of these issues, sity) decided to adopt a cluster-based economic
many greenhouse operators have been pessimistic development approach (Reid & Carroll, 2006). A
about their industry’s future. In a 2004 survey, cluster approach seemed appropriate for the
only 27% of the growers in northwest Ohio were greenhouse industry because initial analyses found
optimistic about the future of the industry (Gatrell a fragmented local industry composed of small
et al., 2009). and medium-sized enterprises that were family-
The geographic footprint of the cluster com- owned and multi-generational in nature. There is
prises six counties in northwest Ohio (Figure 1). a critical mass of greenhouses (80+) in the region
The region covers an area of 2,189 square miles that has led to the creation of some external
and has a population of 789,452. The region’s main economies of scale. For example, a small number
urban area is Toledo (located in Lucas County), of suppliers to the local industry exists in the
with a population of 287,208 (U.S. Bureau of the region. From the perspective of the university
Census, 2010). The six counties are home to 81 researchers, however, the key to helping the in-
greenhouse businesses that have over 5,125,717 dustry become more competitive was in helping
square feet under glass or other protection (U.S. the growers to see the benefits of and engaging
Census of Agriculture, 2007). The industry com- in collaborative projects and initiatives.
prises a mix of retail and wholesale greenhouses We define cluster-based economic develop-
whose main product line comprises floriculture ment as a network driven economic strategy
products (bedding plants, hanging baskets, etc.) built on collaboration and joint action among the

Figure 1. Northwest Ohio greenhouse cluster region

1055
Assessing the Success of an Industrial Cluster

participants which, when combined with external products. The implementation of the branding
economies of scale, generate what Schmitz (1995) project and the subsequent marketing initiative was
refers to as collective efficiencies. The goal is to slow because the growers had never previously
create a culture of collaboration that can occur engaged in joint marketing efforts. For example,
when growers, suppliers, researchers, economic a 2004 survey found that nearly two-thirds of
development agencies, and other community the northwest Ohio growers reported a lack of
stakeholders strategically join forces to confront marketing expertise and market information. Thus
the competitive challenges facing the industry they were unfamiliar, and initially uncomfortable,
– competitive challenges that due to the lack of with larger scale marketing projects. However, the
resources individual growers cannot successfully very process of creating the brand and the then
address by themselves. designing and implementing a marketing initiative
Meaningful collaboration can be problematic utilizing the brand brought the growers together,
since firms within a particular industry often cemented existing relationships, and created new
view each other as competitors. In the case of the relationships (Reid & Carroll, 2008).
northwest Ohio greenhouse industry, enhancing Following the launch of the brand and associ-
collaboration has been significantly impacted ated marketing campaign, the next challenge that
by the grower culture. Most of the greenhouses the growers tackled was high natural gas costs.
have been owned by the same family for multiple To manage natural gas costs, a number of the
generations, resulting in a complex network of growers came together in 2007 and combined
embedded relationships. While one might expect their purchases of natural gas into a single pur-
this to be a positive characteristic from the per- chase, thereby generating significant economies
spective of the relationships within the industry, of scale. In making these purchases the growers
this is not the case. Not all of these embedded worked with an energy consultant who made the
relationships are positive. Moreover, many of purchase on their behalf.
the growers are independent and working col- Natural gas is purchased on the spot and
laboratively does not come easy, even in the face futures markets and reflects current and future
of common threats. Consequently, an ongoing demand conditions (Augustine et al., 2006). As
challenge for the cluster management team has a result many growers had been spending inordi-
been the fostering of collaboration. nate amounts of time researching gas prices and
Unlike many cluster projects which are top- agonizing over buying decisions, often second-
down and expert-driven, this particular one has guessing themselves. The introduction of an
attempted to be grass roots and grower-driven The expert consultant and the economies of scale that
grower-led nature of the cluster makes grower resulted from bulk purchases removed many of
collaboration and the existence of a strong social the uncertainties of purchasing and resulted in
network among the participants essential. lower natural gas costs.
The cluster has undertaken a variety of projects Despite the demonstrated benefits of the pro-
to enhance the industry’s competitiveness. The first gram, however, many growers were slow to agree
cluster project was the creation of a brand identity, to participate. As with the branding and market-
Maumee Valley Growers (MVG), in 2005. Starting ing initiative, there was an inherent resistance to
in 2006, the MVG brand was used in a number collaboration. While the process of going-it-alone
of collaborative marketing initiatives that were in purchasing natural gas was stressful and chal-
designed to enhance awareness of the industry lenging, it still meant that individual growers
among residents of northwest Ohio and to con- were in charge of their own decision-making.
vince people to purchase more locally-produced Participating in the joint purchasing program

1056
Assessing the Success of an Industrial Cluster

required relinquishing critical decision-making for a response rate of 95%. This survey provided
to the group. Once the group got large, it meant insights into growers’ perceptions of the value
individual growers relinquishing decision-making of the cluster to them. Supporting these data are
to a decision-making body of which they were personal observations by the researchers obtained
not necessarily a participant. By 2009, buying while working with the cluster since its inception.
decisions were made by a nine-grower advisory A second source of data was a telephone survey
panel. In addition to the expertise and knowledge of northwest Ohio residents that was conducted
of the energy consultant, the growers on the panel during 2008 (Reid et al., 2009). One objective of
also brought their particular wisdom and expertise this telephone survey was to provide greenhouse
to the process. Having an energy consultant also owners with information on the characteristics
reduced the influence of gas marketers, whose and motivations of persons buying greenhouse
advice to the growers often varied dramatically products. Another objective was to determine
in its quality and was rarely in the best interests consumers’ awareness of the MVG brand that was
of the growers. being used by growers in collaborative market-
ing initiatives. A total of 2,388 adults completed
the survey, resulting in a sampling error of 2.7%.
DATA Respondents were selected by randomly selecting
telephone numbers from an electronic directory
As Schmiedeberg (2010) discussed, data avail- of active residential telephone lines. The response
ability has a major impact on the type of cluster rate was 56%.
evaluation one utilizes. Ideally one would ac- Sölvell (2008) suggests that measures such as
quire economic data, such as sales, employment, membership and network meetings are indicators
and productivity growth, for the businesses in of strengthening cluster dynamics. In the case
the greenhouse cluster in order to evaluate the of greenhouse cluster, one of the authors was
cluster’s success (Sölvell, 2008). However, the instrumental in the establishment of the cluster.
greenhouse owners are very reticent to share such He also served as a member of the Board of Direc-
information. In addition, we had little success in tors of the Maumee Valley Grower Association
acquiring information about those owners’ social and attends monthly stakeholder meetings. This
characteristics that studies ((Madill et al., 2004; active participation provides rich insights into the
Watts et al., 2006) have reported impact business growers’ culture and attitudes toward all aspects
owners participation in clusters. Moreover, the of the cluster. Due to the key role of the cluster
greenhouses are small firms and therefore little manager in a cluster project (Coletti, 2010) he/
data about them exist in published databases. she is a good source of insights into the operation
Consequently it was necessary to create indirect of all phases of the cluster.
indicators by which we could begin to triangulate To measure the development and growth of
the success of the cluster. Sölvell (2008) provides social relations within the cluster, social network
an excellent discussion of the role of perceptual analysis was used. Social network analysis de-
and analytical data, as well as hard data and soft scribes and analyzes the relationships among a
indicators in assessing clusters. In this project, we group of people and/or organizations (de Nooy et
employ a variety of types of information, includ- al., 2005). In social network analysis, the focus is
ing hard and soft data and perceptual indicators. on the relationships among individuals, with the
A 2009 survey of greenhouse operators was one underlying premise that the behavior of people and
important source of information. The survey was organizations is impacted by, and in turn, shapes
sent to 60 greenhouses, of which 57 responded, their social networks. We focused specifically on

1057
Assessing the Success of an Industrial Cluster

the collaboration and advice/support networks Networks


within the cluster. Moreover, we examine the
changes over time in those networks. This ap- Various networking opportunities have been in-
proach was chosen since standardized values have corporated into cluster activities to build the social
not been set for social network metrics making it network. These include monthly stakeholder meet-
difficult to state definitively that any given metric ings, an annual dinner, and other periodic events.
value is either high or low. However, one can One positive indicator of relationship building is
monitor trends in the metrics to gauge evolution that average attendance at the monthly stakeholder
of the network. meetings increased from 15.0 in 2004 to 23.3 in
In 2007 and 2009, social network surveys of 2010. Moreover, as a result of their participation
the participants in the greenhouse cluster were in the cluster, some growers have noted that they
undertaken. In 2007, there were 111 people were no longer feel as isolated as they did previously.
identified as being involved in the cluster, includ- In the 2009 grower survey, 80% of the respon-
ing growers and representatives from suppliers, dents said that they have more interaction with
academia, government agencies, and the economic their peers as a result of the cluster project. In
development field, whereas that number was 154 addition, nearly 60% of the respondents reported
in 2009. The data were collected using the roster- greater access to university researchers as a result
recall method (Steiner & Ploder, 2008; Ter Wal of their participation in the cluster. The fact that
& Boschma, 2009). In this approach, each of the growers continue to attend stakeholder meetings
survey participants was given a list of all members on a regular basis indicates that the meetings and
of the cluster project. They were asked to indicate the project more broadly, provide value-added for
those individuals with whom, during the past 12 the growers. As small business owners surviving
months, they had collaborated on a project and on razor-thin margins, they do not have the time
who had given them advice or support over the to attend meetings that they perceive to have
previous 12 months. The return rate was 66.7% little value.
in 2007 and 43.0% in 2009. More specifically, the An analysis of the social network data is another
analysis is based on the 46 people who returned avenue for examining collaboration and advice/
the questionnaire in both 2007 and 2009. These 46 support networks. Cluster-based economic devel-
people have been involved in the cluster since its opment is an active and participatory collaborative
inception. The group is composed of 33 growers strategy that relies on a well-connected network.
and 13 non-growers, including academics, com- One indicator of the connectedness within a
munity stakeholders, and economic development network is the number of ties. The number of
practitioners. ties in both the collaboration and advice/support
networks decreased slightly over the two-year
period. In the case of the collaboration network,
RESULTS the number of ties declined by 3.3% from 512 to
495. Similarly, the number of ties in the advice/
The results for collaboration and advice/support support network declined by 4.4% from 454 ties
networks will be reviewed. This is followed by to 434 ties. Explaining the decline in the number
a discussion of the findings for the branding/ of ties is challenging because most indicators
marketing and natural gas projects respectively. of cluster activity, such as meeting attendance,
participation in cluster projects, and the like,
increased over the study period. One hypothesis
is that people over time reduce the number of

1058
Assessing the Success of an Industrial Cluster

people with whom they interact by focusing to be between 0 and 0.4.” It is important to note
on those individuals with whom they discover that there is no agreement on the level of density
interaction to be the most beneficial. The desire that is beneficial as opposed to detrimental to a
to reduce the number of ties may be driven by given network (Provan et al., 2007).
the high cost of maintaining ties in any network The generally low density in the greenhouse
(Kastelle & Steen, 2010). It is also possible that cluster in part reflects the history of grower rela-
people’s sets of links with other cluster participants tions mentioned previously. These results are
change over time as their need for different types consistent with other research that has shown
of information changes. that actor characteristics and motivations impact
Another network measure is density. Density their networking activities (Madill et al., 2004).
is the ratio of actual node connections to the to- As Bathelt (2005) suggested, formation of a
tal potential node connections within a network cluster does not cause firms to immediately start
(de Nooy et al., 2005). The greater the number working for the common good. Historical rela-
of links (ties) between nodes (growers and non- tions and power asymmetries among the actors
grower members of the cluster), the higher will be initially are major barriers that must be overcome
the density of the network. Density is important (Bathelt et al., 2004).
because ideas and information can travel more If we could have acquired more greenhouse
efficiently through a higher density network. High production data, as well as data describing owners’
density networks also tend to me more robust and social characteristics, then we could have used
can more easily withstand the loss of members. GIS analyses. The spatial distribution of cluster
Also, Krätke (2002, p. 38) argued that network greenhouses and the total industry could have been
density impacts “the productive capacity of the examined. Also the differences between cluster
cluster as a regional production system.” Ucinet member and non-members in terms of production
6 (Borgatti et al., 2002.) was used to compute attributes and owner characteristics could have
densities and the other social network metrics. been analyzed. Such analyses would not only
The density of the collaboration network provide another perspective on the success of
was 0.248 in 2007 and 0.242 in 2009, whereas the cluster, but also they might produce insights
the advice and support network had a density as to the type of greenhouses and owners that are
of 0.219 in 2007 and 0.209 in 2009. Decreasing attracted to the cluster, which could be useful in
densities are to be expected since the number of recruiting non-member greenhouses to the cluster.
nodes remained the same, but the number of ties In addition to density, we examined the in-
decreased over the study period. degree of centralization in the network. Degree of
Both networks had relatively low densities centralization measures the degree of inequality in
in both years. As noted previously, standardized a network as a percentage of that of a perfect star
thresholds have not been established for social network of the same size, where a star network is
network metrics so it is difficult to make definitive the most centralized network possible (Hanneman,
statements that any given value is either high or 2005). In a star network, all the links in the network
low. Moreover, finding directly comparable stud- center on one node (person) so that that one node
ies is problematic due to variations in questions (person) has the most influence or power in the
asked, data collection methodologies, etc. Also network, as exemplified by node A in Figure 2.
density values will vary depending on the size Cross et al. (2006, p. 39) argued that many new
of the network. Nonetheless Krätke (2002, pp. networks are characterized by high degrees of
38-39) stated that: “the ‘realistic’ value range of centralization, with perhaps 15% of the members
the network density in production clusters tends accounting for 50% of the ties.

1059
Assessing the Success of an Industrial Cluster

Figure 2. Hypothetical star network


examines the network graphs. Figure 3 shows the
participants’ degree-in centralities in the networks;
size of the node is proportional to the node’s cen-
trality. Centrality is the number of connections (ties
or links) which a person has with other people in the
network. According to Steiner and Plodder (2008),
centrality provides insights into the interaction
among people in the network. Moreover, a high
centrality is indicative of more opportunities to
receive information. Also, centrality is a measure
of a person’s influence in the network (Lee & Su,
2010). Centrality can be measured as degree-out
which is a respondent reporting their ties to other
people. Conversely, degree-in is the number of
times a person is mentioned by other people in
the network. Degree-in is preferred because it
eliminates chances of self-reporting bias.
In both the collaboration and the advice/sup- An examination of the two graphs reveals the
port networks, the degree of centralization dimin- centrality of a number of growers had increased
ished somewhat between the two years. Central- between 2007 and 2009. Those growers enhanced
ization decreased from 21.4% in 2007 to 13.8% their interaction with other people in the network
in 2009 in the collaboration network and the re- and improved their opportunities for receiving
spective percentages for the advice/support net- advice and support from network members. In
work were 51.7% and 15.4%. In the case of the addition, their influence expanded over the study
collaboration network in 2007, the 15% of the period. In general, some growers were playing a
participants with the most ties in the network ac- greater role in the advice and support network in
counted for less than 34.7% of all ties and by 2009 2007 than they had in 2009. This result is rein-
that groups’ percentage of all ties had decreased forced in the 2009 grower survey which found
to 31.6%. In the advice/support network, the re- that nearly 80% of growers indicated that they
spective percentages were 32.9% in 2007 and had more interactions with other growers as a
15.8% in 2009. It is notable that the advice/sup- result of their participation in the cluster.
port network showed more decrease than the We also examined growth in the network. The
collaboration network. The decrease in centraliza- collaboration network grew by 183 ties whereas
tion is important to the goals of the cluster. As the advice/support network produced 181 new
noted earlier, the cluster is a “grass roots” initia- ties. Preferential attachment is one explanation
tive in which the strategy has been to build col- of how a network expands. According to Ter
laboration and support networks among the Wahl and Boshma (2009, p. 743), “the process of
cluster participants. The decline in centralization preferential attachment describes the growth of a
suggests that there is a broadening of influence network in which the probability that a new node
among the various people in the network, which will link to a certain other node is proportional
is in keeping with the goals of the project. to the number of links that node already has. An
The change in centralization within the advice outcome of this probabilistic process is that central
and support networks is also evident when one firms tend to become more central, whereas pe-
ripherally positioned firms tend to stay peripheral.”

1060
Assessing the Success of an Industrial Cluster

Figure 3. Network graphs showing in-degree centrality

1061
Assessing the Success of an Industrial Cluster

According to the preferential attachment notion, the geography of the social relationships will be
we would expect a positive correlation between revealed. One can assess the impact of distance
the number of ties which a node had in 2007 and on the collaboration and advice/support networks.
the number of ties added between 2007 and 2009. Also spatial biases in the social network can be
The data do not support that expectation. In the identified. As one example, one might find that
case of the collaboration network, the correlation most linkages are highly localized causing growers
was 0.042 and it was 0.054 for the advice/support to miss potentially productive linkages with more
network, and neither correlation coefficient was distant people. The management team can then
significant at the 0.05 level. This result suggests work to bridge such relationship gaps (structural
that the development of new ties is not simply holes). In general, GIS can be applied in various
benefiting central people in the networks, which ways to assess critical social relationships within
is a positive finding given that one of the goals a cluster. Other examples of the use of GIS with
of the cluster project is to grow the network by SNA can be found in Reid et al. (2009).
getting individuals who previously had little or no
participation to become more active participants. Branding and Marketing
One phenomenon impacting the development
of new ties in the cluster is the presence of a The thrust of the marketing campaigns between
management team working at fostering mutu- 2005 and 2010 have been fairly traditional in
ally beneficial ties for the long term benefit of nature. For example, in the spring/early summer
the cluster. In fact, one member of the cluster of 2010 a number of northwest Ohio greenhouses
functioned as a network weaver whose job was participated in a collaborative marketing cam-
to build ties among participants which would paign. The campaign comprised commercials on
benefit the cluster. local cable television, advertisements in three local
In general, the social network metrics reveal newspapers, a direct mailing initiative, and adver-
that the network among the 46 people had not tising on the grower website (maumeevalleygrow-
substantially changed over the two year period. ers.com). The marketing also had a number of less
While the number of ties had declined somewhat, traditional elements, including “wrapping” a local
the density of the networks had remained approxi- transit authority bus in the Maumee Valley Grower
mately the same. These results may have occurred logo. Coffee cups, refrigerator magnets, t-shirts,
because two years is too short a time-frame for and an annual calendar are also used to promote
substantial change to occur. Also they reflect the the local greenhouse industry. The growers have
fact that the greenhouse industry is generally a also participated in and contributed to a number
conservative, slow changing industry. In the case of local community efforts including Habitat for
of the advice/support network, there is evidence Humanity, Extreme Makeover: Home Edition,
of a broadening of influence and increasing role America in Bloom, and a variety of beautification
of growers within the network. plantings throughout Lucas County.
GIS can be applied to investigate the spatial To assess the outcomes of the branding and
structure of social relationships within a cluster. marketing campaigns, data obtained from both
Even though the SNA nodes and their associated the consumer and producer surveys and par-
metrics are aspatial, one can create a spatial com- ticipating growers were used. In the consumer
ponent by geocoding the node (peoples’) locations. survey, residents of northwest Ohio were asked
We were unable to acquire locations for all nodes if they recognized the MVG brand and 41% of
and therefore could not pursue this line of analysis. the respondents indicated that they did. However,
By mapping the links among cluster members, there was considerable inter-county variation in

1062
Assessing the Success of an Industrial Cluster

brand recognition. The highest level of brand individual business advertising space that was
recognition occurred in Lucas County where 47% more prominent. For example, one full page
or those surveyed recognized the brand. This is advertisement that ran for ten days in the Toledo
perhaps not surprising given that Lucas County Blade in the spring of 2010 cost a total of $15,423.
is the most heavily populated county in northwest With 14 growers participating in this initiative,
Ohio and has been the focus of MVG marketing the per-grower cost was $1,108.81. Only by col-
dollars. For example, there have been some mar- laborating was such a prominent advertisement
keting initiatives, e.g., the TARTA bus wrap, that possible. The advertisement included coupons
have been almost exclusively focused on Lucas for $5 off purchases of $25 or more. A total of
County. The lowest levels of brand recognition 3,353 coupons were redeemed. An indicator of the
(18.7%) are found in the most geographically growers’ perception of the success of this initiative
distant county, Erie County. Unfortunately, we is the fact that the growers have taken increasing
do not have data for years other than 2008 so we responsibility for the financing, creation, and
cannot assess the extent to which recognition of execution of collaborative marketing initiatives.
the brand has changed over time. However, given Between 2005 and 2007 all of the marketing was
that the brand was only introduced to the public as paid for by a USDA grant. Beginning in 2008, the
recently as 2005 we feel that a recognition level of growers started to contribute their own dollars to
47% within the region’s core county is reasonable. defray a portion of the collaborative marketing
We also learned that television was the most effec- costs so the growers paid 67.7% of marketing
tive media for promoting the brand with 34.9% of costs in 2009. By 2010, their contribution had
survey respondents having heard of the brand via increased to 81% and they will pay 100% of col-
that medium, followed by newspaper (23%), and laborative marketing costs in 2011. Growers have
word-of-mouth (22.7%). Through the survey, we also taken greater ownership of the actual creation
also learned about the motivations of consumer of marketing initiatives. Up until 2009 the grow-
purchasing preferences. The survey showed that ers utilized the services of a professional brand-
consumers who shopped at greenhouses valued ing and marketing firm. The growers have since
product quality and the opportunity to purchase created a grower committee that is responsible
from locally-owned businesses versus “Big-Box” for creating and executing the growers’ annual
store customers who valued price and convenience marketing campaign.
(Reid et al., 2009). Another approach to assessing the impact of the
Growers participated in a number of collabora- brand and marketing initiative is asking growers
tive marketing initiatives utilizing the Maumee about the impact that it has had on their business.
Valley Grower brand. The major benefit of this As part of the 2009 grower survey, growers were
collaborative marketing was that growers were asked if participation in the cluster marketing
able to get better exposure for their businesses initiative had helped them attract new customers.
and their products than they did when marketing Given the nature of the industry, we expected that
individually. In the case of print advertising, for the branding/marketing program would provide
example, growers would historically place a small greater benefit to the retail rather than the whole-
advertisement in one of the local newspapers. sale growers. This is because the retail growers
This advertisement would often appear along with sell directly to the consumer while the wholesale
other advertisements for unrelated businesses and growers sell to another retailer (mostly “Box
was not particularly prominent. By collaborating, Box” stores like Walmart and Home Depot) who
growers were able to place full-page advertise- in turn sells their product to the customer. A chi-
ments in local newspapers, thereby giving their square analysis of a contingency table (Table 1)

1063
Assessing the Success of an Industrial Cluster

Table 1. Growers attracting new customers*


year of the program (2007), participating growers
Attract New Customers? Retailers Wholesalers collectively saved $150,000, with individual firms
Yes 26 5
reducing natural gas costs by between 8 and 15%.
No 11 12
The gas program also has been successful in
terms of attracting new firms. Growers elsewhere
*Significantly different at the 0.01 level of significance
in Ohio and in neighboring southeastern Michi-
gan began participating in the purchasing pool in
indicated a significant difference between retailers 2009. By March 2011, the program had a total of
and wholesalers responses. Whereas 29.4% of 102 participating growers (90 in Ohio and 12 in
the wholesalers indicated that participation in the Michigan) whose collective savings for the year
project had helped them attract new customers, are forecast to be between $400,000 and $500,000.
70.3% of retailers indicated that participation in The program has become particularly attractive to
the project had helped them attract new customers. small growers who benefit from the economies of
Given that wholesalers have no direct relationship scale resulting from the purchasing pool.
with the final consumer, it is interesting that five Another benefit of the gas program is that it
of them indicated that participation in the project has provided money for the cluster organization
had helped them attract new customers. Perhaps (the Maumee Valley Growers Association). The
this is the result of higher retail sales causing organization receives a commission on natural gas
higher wholesale sales. purchased through the program, which has gener-
GIS has long been applied in retail location ated over $60,000 for the organization since 2007.
studies; see for example Clarke, 1998. A future Because of its success, the gas program has
endeavor, which will be useful from an assessment led to other collaborative endeavors to manage
viewpoint, is to use appropriate GIS procedures growers’ costs. For example, the model is being
to analyze the retail greenhouses. As one example, extended to electric power purchasing. While still
a map of retail greenhouse market potential in the in the early phases, this project will generate a
region will be generated using census block group 5.0% savings for firms over a two-year period.
data. Building of an index of sales potential that Also this approach has been extended to purchas-
can be mapped with conventional data sources ing other greenhouse inputs. In the case of one
will be a challenge since the demographics of supplier, the growers were able to negotiate a 20%
greenhouse consumers are only reported for large discount with no minimum order, instead of the
regions of the country. If the potential can be customary 5% discount that also had a minimum
reasonably estimated, then market penetration by order requirement. Moreover, 5% of the orders’
cluster firms can be assessed and opportunities dollar value is returned to the cluster organization
for expansion of sales can be gauged. If indi- to reinvest into the program.
vidual retailers will provide their sales data, then From the perspective of evaluation, the gas
more detailed analysis will be possible. These program is the easiest to assess. For one thing,
efforts will provide a different assessment of the the nature of the project results in detailed records
market initiatives of the cluster. being kept, and those records are kept by the
program manager so they are available in con-
Natural Gas Purchasing trast to growers’ records which are inaccessible.
Moreover, it focuses directly on the firms’ bot-
The natural gas purchasing program has been suc- tom lines, not indirect actions such as the amount
cessful in various respects. From the perspective of of collaboration among participants. As Sölvell
managing costs, it has been beneficial. In the first (2008, p. 60) argued evaluations should partially

1064
Assessing the Success of an Industrial Cluster

focus on the performance of firms in the cluster, we do not know if brand recognition has led to
not measures, such as “... number of meetings, greater sales. We need to collect point-of-sale data
extension of personal networks, and reports that to validate such a relationship. The perception of
have been carried out and presented.” Because one many growers is that cluster activities have enabled
can easily define and measure performance in the them to attract more customers. However, we can-
gas project (dollar savings to growers), growers not measure the increase in sales. Moreover we
more readily embrace it than other initiatives. do not know that the increased sales, if they exist,
have been caused by the cluster. In fact, a general
problem in cluster evaluation is to determine if
DISCUSSION AND CONCLUSION observed effects have been caused by the cluster
or if there are other “explanatory drivers” (Sölvell,
Evaluation of cluster outcomes is important, but the 2008, p. 62).
topic has rarely been addressed in the literature. No Measurement of collaboration is problematic.
doubt people involved in building and/or managing While some indirect indicators, such as grower
a cluster may lack the time or skill sets to execute interviews and participation in cluster activities,
sophisticated evaluation schemes. Such is the case are positive, we cannot gauge the extent to which
with the northwest Ohio greenhouse cluster. For collaboration has generated positive economic im-
these reasons, it is important to develop metrics pacts. Social network analysis provides a promising
and procedures that can be compiled easily, while venue for assessing collaboration, but that technique
still providing meaningful information. Metrics of has inherent problems in both data collection and
success must also be appropriate for the cluster in interpretation (Ter Wal & Boschma, 2009; Brandt
question and must reflect the goals of the cluster. et al., 2009). Future research plans include making
For example, one of the advantages of clusters that greater efforts to create more useful indicators of
are often cited in the literature is that it enhances the collaboration and the impact of collaboration and
innovative capacity of the industry. In the case of joint actions.
the northwest Ohio greenhouse industry, enhancing A major challenge will be the acquisition of
innovative capacity is not a primary goal at this time, economic data on the greenhouses in the cluster
and therefore it is not included in our evaluation. region. At various places in this paper, we have
There is evidence that the greenhouse cluster identified the need for sales, employment, and
has had a positive impact on its participants. The size data, as well as other information. To date,
most notable success is the natural gas program. It greenhouse owners have been most resistant to
has saved participants substantial money and gained providing specifics on their operations. With the
acceptance by growers located outside the local acquisition of more data, it will be possible to
region. It is easier to quantify the impact of the gas improve our existing measures of success and use
program because the collection of monetary data GIS to provide valuable insights into social rela-
is a necessary part of operating the program. Other tions in the industry, market potential, and the like.
hard data, such as sales of greenhouse products, are Ideally the measures that are developed should
not available to the research team. not only help gauge the success of a cluster, but also
The evidence regarding the impact of the brand- they should help guide future decisions (Schmie-
ing and marketing program is less solid than in the deberg, 2010). An example of this is using social
case of the gas project. On one hand, a notable network analysis to build desired connections within
percentage of surveyed northwest Ohio residents the cluster. As suggested by Brandt et al. (2009),
had heard of the MVG brand, but that percentage one can identify isolated people and fragments of
was very uneven across the cluster region. Also, the network that perhaps can be better integrated

1065
Assessing the Success of an Industrial Cluster

with the main cluster core. Another example is the Bathelt, H. (2005). Geographies of produc-
finding from the consumer survey that individuals tion: Growth regimes in spatial perspective (ii)
who shop at retail greenhouses value product qual- – knowledge creation and growth in clusters.
ity and purchasing from locally-owned businesses. Progress in Human Geography, 29(2), 204–216.
This information has helped shape advertising doi:10.1191/0309132505ph539pr
campaigns.
Bathelt, H., Malmberg, A., & Maskell, P. (2004).
The metrics developed to date provide a baseline
Clusters and knowledge: Local buzz, global
for future comparison. One piece of perceptual data
pipelines and the process of knowledge creation.
we believe is significant is the fact that 60% of the
Progress in Human Geography, 28(1), 31–56.
growers stated in 2009 that they were optimistic
doi:10.1191/0309132504ph469oa
about the future of their business as a result of the
cluster. This statistic is in marked contrast to the Borgatti, S., Everett, M., & Freeman, L. (2002).
27% who were optimistic about the future of the Ucinet for Windows: Software for social network
industry in 2004. Due to the intergenerational nature analysis. Harvard, MA: Analytic Technologies.
of this industry, such optimism is critical. If adult Retrieved March 12, 2011, from http://www.
children, many of who are college educated and have analytictech.com
viable career opportunities outside of the industry,
Brandt, A., Hahn, C., Krätke, S., & Kiese, M.
are going to take over the family business when
(2009). Metropolitan regions in the knowledge
their parents retire then a high degree of optimism
economy: Network analysis as a strategic informa-
is highly desirable. At the same time, this growth
tion tool. Tijdschrift voor Economische en Sociale
in optimism occurred while northwest Ohio expe-
Geografie, 100(2), 236–249. doi:10.1111/j.1467-
rienced extreme economic distress. Therefore, this
9663.2009.00532.x
cluster may very well have had some real impact
on the economic health of this industry. Clarke, C. (1998). Changing methods of location
planning for retail companies. GeoJournal, 45(4),
289–298. doi:10.1023/A:1006995106736
ACKNOWLEDGMENT
Coletti, M. (2010). Technology and industrial
clusters: How different are they to manage? Sci-
This paper was invited for publication in the Inter-
ence & Public Policy, 37(9), 679–688. doi:10.31
national Journal of Applied Geospatial Research
52/030234210X12778118264413
with expansion and revision from its earlier form
published in the Papers of the Applied Geography Cross, R., Laseter, T., Parker, A., & Velasquez, G.
Conferences, Volume 33, 2010, and is adapted (2006). Using social network analysis to improve
herein with permission. communities of practice. California Management
Review, 49(1), 32–60.
de Nooy, W., Mrvar, A., & Batagelj, V. (2005).
REFERENCES
Exploratory network analysis with pajek. Cam-
Augustine, C., Broxson, B., & Peterson, S. (2006). bridge, UK: Cambridge University Press.
Understanding natural gas markets: A policy Delgado, M., Porter, M., & Stern, S. (2010).
analysis study by Lexecon, an FTI Company. Clusters and entrepreneurship. Journal of Eco-
Retrieved March 28, 2011, from http://www. nomic Geography, 10(4), 495–518. doi:10.1093/
api.org/aboutoilgas/upload/UNDERSTAND- jeg/lbq010
ING_NATURAL_GAS_MARKETS.pdf

1066
Assessing the Success of an Industrial Cluster

Feser, E., Renski, R., & Goldstein, H. (2008). Lee, P., & Su, H. (2010). Investigating the structure
Clusters and economic development outcomes: of regional innovation system research through
An analysis of the link between clustering and in- keyword co-occurrence and social network analy-
dustry growth. Economic Development Quarterly, sis. Innovation: Management. Policy & Practice,
22(4), 324–344. doi:10.1177/0891242408325419 12(1), 26–40. doi:10.5172/impp.12.1.26
Fowler, C., & Kleit, R. (2010). Linking economic Madill, J., Haines, G. Jr, & Riding, A. (2004).
development and equity: An analysis of the effects Networks and linkages among firms and organi-
of industrial clusters on the poverty rate. Seattle, zations in the Ottawa-region technology cluster.
WA: Community Vitality Project, Evans School Entrepreneurship & Regional Development, 16(5),
of Public Affairs, University of Washington. Re- 351–368. doi:10.1080/0898562042000188414
trieved March 11, 2011, from http://cvp.evans.
Martin, R., & Sunley, P. (2003). Deconstructing
washington.edu/wp-content/uploads/2010/07/
clusters: Chaotic concept or policy panacea?
fowler_and_kleit_working_paper_2010.pdf
Journal of Economic Geography, 3(1), 5–35.
Fromhold-Eisebith,M.,&Eisebith,G.(2005).Look- doi:10.1093/jeg/3.1.5
ing behind facades: Evaluating effects of (automo-
Muro, M., & Fikri, K. (2011). Job creation on a
tive) cluster promotion. Regional Studies, 42(10),
budget: How regional industry clusters can add
1343–1356. doi:10.1080/00343400701654202
jobs, bolster entrepreneurship, and spark inno-
Gatrell, J., Reid, N., Steiger, T., Carroll, M., vation. Washington, DC: Brookings-Rockefeller
& Smith, B. (2009). “Value”-Chains: Identity, Project on State and Metropolitan Innovation,
Tradition, and Ohio’s Flori (culture) Industry. Brookings Institution. Retrieved March 10, 2011,
Applied Geography (Sevenoaks, England), 29(3), from http://www.brookings.edu/~/media/Files/
346–357. doi:10.1016/j.apgeog.2008.12.001 rc/papers/2011/0119_clusters_muro/0119_clus-
ters_muro.pdf
Hanneman, R. (2005). Introduction to social net-
work networks. Retrieved March 14, 2011, from National Governors Association (NGA).
http://analytictech.com/networks.pdf (2007). Cluster-based strategies for grow-
ing state economies. Retrieved March
Kastelle, T., & Steen, J. (2010). Are small world
11, 2011, from http://www.nga.org/Files/
networks always best for innovation? Innovation:
pdf/0702INNOVATIONCLUSTERS.PDF
Management. Policy & Practice, 12(1), 75–87.
doi:10.5172/impp.12.1.75 Provan, K., Fish, A., & Sydow, J. (2007). Inter-
organizational networks at the network level: A
Krätke, S. (2002). Network analysis of production
review of the empirical literature on whole net-
clusters: The Potsdam/Babelsberg film industry as
works. Journal of Management, 33(3), 479–516.
an example. European Planning Studies, 10(1),
doi:10.1177/0149206307302554
27–54. doi:10.1080/09654310120099254
Reid, N., & Carroll, M. (2006). Collaborating to
LaFary, E., Gatrell, J., Reid, N., & Lindquist, P.
compete: The case of the northwest Ohio green-
(2006). Specialized agriculture: Local markets &
house cluster. In Gatrell, J., & Reid, N. (Eds.),
global competitors in Ohio’s greenhouse industry.
Enterprising worlds: A geographic perspective on
In Gatrell, J., & Reid, N. (Eds.), Enterprising
economics, environments, and ethics (pp. 41–56).
worlds: A geographic perspective on economics,
Dordrecht, The Netherlands: Springer-Verlag.
environments, and ethics (pp. 57–70). Dordrecht,
The Netherlands: Springer-Verlag.

1067
Assessing the Success of an Industrial Cluster

Reid, N., & Carroll, M. (2008). Creating trust Taylor, M. (2006). Clusters and local economic
through branding: The case of northwest Ohio’s growth: Unpacking the cluster model. In Gatrell,
greenhouse cluster. In Stringer, C., & LeHeron, J., & Reid, N. (Eds.), Enterprising worlds: A
R. (Eds.), Agri-food commodity chains and glo- geographic perspective on economics, environ-
balizing networks (pp. 174–188). London, UK: ments & ethics (pp. 99–117). Dordrecht, The
Ashgate. Netherlands: Springer-Verlag. doi:10.1007/1-
4020-5226-X_9
Reid, N., Carroll, M., Smith, B., & Frizado,
J. (2009). GIS and economic development. In Ter Wal, A., & Boschma, R. (2009). Applying
Gatrell, J., & Jensen, R. (Eds.), Geotechnolo- social network analysis in economic geography:
gies and the environment: Socioeconomic and Framing some key analytic issues. The Annals of
planning applications (pp. 5–28). Heidelberg, Regional Science, 43(3), 739–756. doi:10.1007/
Germany: Springer-Verlag. doi:10.1007/978-1- s00168-008-0258-3
4020-9642-6_2
U.S. Census Bureau. (2010). Census of population
Reid, N., Smith, B., Haase, D., Ross, P., Mirzoy- 2010. Retrieved November 5, 2010, from http://
ants, A., & Gatrell, J. (2009). Marketing & growing factfinder2.census.gov
placed-based clusters: The case of the northwest
U.S. Department of Agriculture. (2007). U.S.
Ohio greenhouse industry. Papers of the Applied
Census of agriculture 2007. Retrieved March 28,
Geography Conferences, 32, 40–46.
2011, from http://www.agcensus.usda.gov/Pub-
Schmiedeberg, C. (2010). Evaluation of cluster lications/2007/Full_Report/Volume_1,_Chap-
policy: A methodological overview. Evaluation, ter_2_County_Level/Ohio/st39_2_035_035.pdf
16(4), 389–412. doi:10.1177/1356389010381184
van der Linde, C. (2002). The cluster meta-study;
Schmitz, H. (1995). Collective efficiency: list of clusters and bibliography. Boston, MA: In-
Growth path for small scale industry. The Jour- stitute for Strategy and Competitiveness, Harvard
nal of Development Studies, 31(4), 529–566. Business School. Retrieved March 12, 2011, from
doi:10.1080/00220389508422377 http://www.isc.hbs.edu/MetaStudy2002Bib.pdf
Sölvell, Ö. (2008). Clusters: Balancing evolution- Watts, D., Wood, A., & Wardle, P. (2006).
ary and constructive forces. Stockholm, Sweden: Owner-managers, clusters and local embed-
Ivory Tower. dedness: Small firms in the Sheffield (UK)
metal-working cluster. Entrepreneurship
Steiner, M., & Ploder, M. (2008). Structure and
and Regional Development, 18(3), 185–205.
strategy within heterogeneity: Multiple dimensions
doi:10.1080/08985620600680141
of regional networking. Regional Studies, 42(6),
793–815. doi:10.1080/00343400701861310

This work was previously published in the International Journal of Applied Geospatial Research (IJAGR), Volume 3, Issue 3,
edited by Donald Patrick Albert, pp. 21-36, copyright 2012 by IGI Publishing (an imprint of IGI Global).

1068
1069

Chapter 64
A Geospatial Analysis
of Convective Rainfall
Regions within Tropical
Cyclones after Landfall
Corene J. Matyas
University of Florida, USA

ABSTRACT
In this article, the author utilizes a GIS to spatially analyze radar reflectivity returns during the 24 hours
following 43 tropical cyclone (TC) landfalls. The positions of convective rainfall regions and their areal
extent are then examined according to storm intensity, motion, vertical wind shear, time until extra-
tropical transition, time after landfall, and distance from the coastline. As forward velocity increases in
conjunction with an extratropical transition, these regions move outward, shift from the right side to the
front of the TC, and grow in size. A similar radial shift, but with a decrease in areal extent, occurs as
TCs weaken. Further quantification of the shapes of these regions could yield a more spatially accurate
assessment of where TCs may produce high rainfall totals.

INTRODUCTION example, more than 150 lives were lost from


fresh water flooding as Hurricane Camille (1969)
Tropical cyclones (TCs) can produce high rainfall produced record rainfall totals nearing 700 mm
totals that lead to flooding both near the coastline in Virginia. Several days earlier, most deaths that
where fast winds and storm surges are also of occurred during the landfall of Hurricane Camille
concern, and hundreds of kilometers away from resulted from the high storm surge (Schwarz,
the location of landfall (Rappaport, 2000). For 1970). Recently, the original rainfall climatology
and persistence (R-CLIPER) statistical model
developed to verify rainfall predictions for TCs
DOI: 10.4018/978-1-4666-2038-4.ch064

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
A Geospatial Analysis of Convective Rainfall Regions within Tropical Cyclones after Landfall

(Marchok, Rogers, & Tuleya, 2007) has been coastline. In addition to achieving these research
upgraded to include the effects of vertical wind goals, this study analyzes a large number of TCs
shear and topography as these characteristics in a GIS framework.
have been shown to cause asymmetries in TC
rain fields (Lonfat, Rogers, Marchok, & Marks,
2007). However, as recently as 2007, the Ameri- BACKGROUND
can Meteorological Society stated that rainfall
predictions for landfalling TCs are still in need Willoughby, Marks, and Feinberg (1984) describe
of improvement (AMS, 2007). the regions of convection that develop in both
Although stratiform clouds can cover ten the core of a hurricane (within 100 km of the
times more area within a TC than convective circulation center), and in the outer rainbands
clouds (Jorgensen, 1984), the higher rainfall that can extend more than 360 km from the
rates produced by convective clouds can produce circulation center of the storm. Convection is
flooding rainfall in a relatively short amount of enhanced by increased surface friction along the
time (Geerts et al., 2000; Elsberry, 2002). Even coastline during landfall (Tuleya, 1994; Frank
though Hurricane Floyd (1999) was moving at a & Ritchie, 1999), and fast tangential winds can
relatively fast forward velocity of 9 m s-1 during advect moisture counterclockwise into the left
landfall, the high rain rates produced by convec- front quadrant in the core of a hurricane (Parrish,
tive processes caused more than 500 mm of rain Burpee, Marks, & Grebe, 1982). Konrad, Meaux,
to accumulate (Lawrence et al., 2001; Atallah & and Meaux (2002) found that intense TCs have
Bosart, 2003). Thus, identifying the conditions increased rainfall near the core of the storm. On
under which convective clouds cover a large area the other hand, tropical depressions have very
within a TC’s rain fields, and where these areas little organized convection in their core (Frank &
of convection will exist relative to the circulation Ritchie, 1999). These findings suggest that storm
center of a TC is an important task. intensity could exhibit a strong relationship with
This study employs geospatial research tech- both the location of convection relative to the
niques to examine convective rainfall regions that circulation center of the storm and the size of the
exist during the 24-hour period after TCs make area covered by convective rainfall.
landfall. Specifically, three research goals are A recent modeling study by Kimball (2008)
pursued: 1) determine where regions of heavy found that rainfall spreads outwards from the
rainfall exist in relation to the circulation center circulation center as a TC tracks inland, and shifts
of the TC under different conditions, 2) determine from a maximum on the right side of the storm
the areal extent, or size, of these rainfall regions, to the front of the storm in a counterclockwise
and 3) determine the conditions under which the direction. Observational studies of individual TCs
largest areas of heavy rainfall tend to form. To have demonstrated that convection within TCs
achieve these research goals, radar reflectivity may increase or decrease in spatial extent after
returns are analyzed within a GIS to locate the landfall (e.g., Bluestein & Hazen, 1989; Atallah
heavy rainfall regions and to calculate the spatial & Bosart, 2003; Medlin, Kimball, & Blackwell,
properties of these regions as they relate to six 2007; Matyas, 2008). Time elapsed after landfall
factors. These factors are a) storm intensity, b) may demonstrate an association with these trends
vertical wind shear, c) storm motion, d) whether in the location and areal extent of TC convec-
or not a TC becomes extratropical, e) the time tion. However, the influence of the atmosphere,
elapsed after landfall, and f) the distance that the internal storm dynamics, and the earth’s surface
storm and its convection are located relative to the may exhibit an even stronger association with the

1070
A Geospatial Analysis of Convective Rainfall Regions within Tropical Cyclones after Landfall

position and size of convective regions during Bosart, 2003; Matyas, 2008) as rainfall expands
the post-landfall period. TCs that remain near to ahead of the circulation center. For TCs that do not
the coastline after landfall may be able to advect experience the rainfall-enhancing dynamics that
moisture into their circulation so that rainfall does occur during an extratropical transition, convective
not diminish as quickly as it would if the storm rainfall may decrease as the storm moves inland,
had moved directly inland (Medlin et al., 2007). and is removed from its source of latent energy
Thus, the location of the TC or its convection (Tuleya, 1994; Kimball, 2008), and is surrounded
relative to the coastline could be related to the by drier continental air masses (Cubukcu, Pfeffer,
position and/or the areal extent of convection. & Dietrich, 2000; Hill & Lackmann, 2009).
Several observational studies have shown
that vertical wind shear causes an asymmetrical
distribution of rainfall about the center of a TC so DATA
that the majority of rainfall occurs in the downs-
hear direction (Molinari, Moore, & Idone, 1999; Radar reflectivity returns from multiple Weather
Corbosiero & Molinari, 2002; Cecil, 2007). Other Surveillance Radar-1988 Doppler (WSR-88D)
studies have focused on the asymmetries in the sites are utilized to determine where heavy rainfall
rain fields of TCs produced by storm motion (e.g., is located relative to the TC’s center of circulation.
Burpee & Black, 1989; Rodgers, Baik, & Pierce, These data are obtained from the National Climatic
1994; Lonfat, Marks, & Chen, 2004), finding that Data Center’s NEXRAD data archive (http://www.
rainfall shifts to a location ahead of the storm as ncdc.noaa.gov/nexradinv/). Base reflectivity data
forward velocity increases. When Corbosiero from the Level III WSR-88D products are utilized
and Molinari (2003) and Chen, Knaff, and Marks whenever available. In cases where only the raw
(2006) examined the contributions of both storm Level II data are available, data from the lowest
motion and vertical wind shear to the distribu- scan elevation are utilized for the spatial analy-
tion of TC rainfall, they determined that vertical sis. Additional information pertaining to Level
wind shear exerts the dominant influence, while II and III radar data are available in the Federal
storm motion controls rainfall distribution only Meteorological Handbook 11-Part C (Office of
when vertical wind shear is weak. However, these the Federal Coordinator for Meteorology, 2006).
authors only examine lightning flashes and areas Archives for many of the 159 WSR-88D sites
of rainfall within 300 and 330 km of the storm extend from 1995 through the present, allowing
centers, respectively, whereas research by Lonfat landfalling TCs during 1995-2008 to be examined
et al. (2004) show that convective rainfall can oc- in the current study. Due to problems with the
cur 500 km from the circulation center of a TC. archiving system at the National Climatic Data
When TCs become extratropical, they ex- Center, data for several TCs are not available at
perience both strong vertical wind shear and an the time of this study, including all TCs from the
increase in forward velocity (Atallah & Bosart, 1998 season. For a TC to be analyzed in the current
2003; Jones et al., 2003), so it stands to reason study, its rain fields must be within range of the
that the time until a TC becomes extratropical WSR-88D network for a 24-hour period following
should exhibit a relationship to the distribution of the time of landfall. Thus, TCs that either dissipate
rainfall about the circulation center of the storm within 24 hours of landfall (e.g., Matthew (2004)),
in a pattern similar to that imposed by shear and or that did not remain over land for 24 consecutive
storm motion. TCs that transition into extratropical hours after landfall (e.g., Charley (2004), Wilma
systems tend to have well-developed rain fields (2005)) are not included in the study. As a second
on the left side of their storm track (Atallah & U.S. landfall occurs for some TCs more than 24

1071
A Geospatial Analysis of Convective Rainfall Regions within Tropical Cyclones after Landfall

hours after the first landfall, these subsequent ANALYSIS


landfalls are also analyzed. Overall, this study
examines 43 U.S. landfalls from 38 TCs. To identify regions of heavy rainfall within the
Data pertaining to the latitude and longitude rain fields of a TC and to calculate their spatial
of the circulation centers of the TCs are obtained attributes, the radar reflectivity data are converted
from two sources. The position of the circulation into a georeferenced format and imported into a
center at the time of landfall is taken from the GIS. Working with each three-hourly observation
National Hurricane Center (NHC)’s Hurricane separately, the data from adjoining WSR-88D sta-
Season Tropical Cyclone Report for each storm tions are first combined into a single raster layer
(http://www.nhc.noaa.gov/pastall.shtml). These with a grid size of one square kilometer. A small
reports also provide the date and time of landfall. amount of error is introduced into the analysis due
For this study, the time of landfall is rounded to to differences in calibration among the different
the nearest half-hour. The position of the storm radars and the range-dependent beam geometry
at all other analysis times, as well as its intensity and elevation (Anagnostou, 2004). At locations
as measured by the maximum sustained wind where reflectivity values are available from mul-
speed, forward velocity and heading and stage tiple WSR-88D sites, the highest reflectivity value
of the storm, are obtained from the Hurricane is retained. Figure 1 demonstrates the combining
Database (HURDAT) maintained by the NHC of the radar reflectivity from multiple WSR-88D
(2006). This database contains observations for sites within the GIS. Next, an interpolation is per-
all TCs in the North Atlantic Ocean basin every formed using inverse distance weighting to create
six hours at 0000, 0600, 1200, and 1800 UTC. contour lines that connect areas having the same
A linear interpolation is performed to obtain reflectivity value. As the level III reflectivity data
data in three-hourly time steps beginning at the are available in 5 dBZ increments, the contour
time of landfall and ending 24 hours later. This lines are also placed in 5 dBZ increments. The
process yields nine observation times for each polylines created by the interpolation are then
TC landfall, or 387 total observation times for converted into polygons, and spatial attributes
the 43 landfalls examined. The number of hours are calculated for each polygon, including area
until a TC becomes extratropical is also calculated and the latitude and longitude of the centroid, or
at each observation time for TCs that became center of mass.
extratropical either during the study period or up As this study is concerned with identifying
to 72 hours after landfall. regions of heavy rainfall that may have the po-
The velocity and direction of the vertical wind tential to produce flooding, polygons larger than
shear are acquired from the Statistical Hurricane 500 km2 in area that enclose reflectivity values
Intensity Scheme (SHIPS) database (DeMaria & of 40 dBZ or greater are retained for further
Kaplan, 1994; DeMaria, Mainelli, Shay, Knaff, & analysis. As shown in Figure 2, multiple convec-
Kaplan, 2005). SHIPS data are available for TCs in tive regions of 40 dBZ reflectivity values over
both the Atlantic and eastern Pacific Ocean basins 500 km2 in area may exist for a TC at a given
at 0000, 0600, 1200, and 1800. The vertical wind observation time, and all such areas are analyzed
shear is the difference betweeen the 200 and 850 in the current study. TC researchers such as Jor-
hPa winds calculated for an annular region located gensen (1984) have used 40 dBZ reflectivity
200-800 km from the circulation center of each values to denote heavy rainfall within convective
TC. The SHIPS data are also interpolated linearly rainbands. Reflectivity values of 40 dBZ may
to obtain data in three-hourly time steps beginning produce rainfall rates of approximately 21.6 mm
at the time of landfall and ending 24 hours later. hr-1 (0.85 in hr-1) according to the reflectivity to

1072
A Geospatial Analysis of Convective Rainfall Regions within Tropical Cyclones after Landfall

Figure 1. Radar reflectivity values for Hurricane Ivan (2004) at the time of landfall; The highlighted
polygons represent regions having reflectivity values greater than or equal to 40 dBZ that are greater
than 500 km2 in a real extent

Figure 2. Position and intensity of Ivan (2004) and the 40 dBZ polygons that are analyzed at each
3-hourly observation time

1073
A Geospatial Analysis of Convective Rainfall Regions within Tropical Cyclones after Landfall

rain-rate conversion developed by Rosenfeld, right, upshear left, and downshear left respectively.
Wolff, and Atlas (1993) for tropical systems. The GIS is also utilized to determine the distance
This study seeks to determine which of six from each polygon centroid to the nearest point
factors are most closely associated with the posi- on the U.S. coastline. In all, 21 columns of data
tion of the convective regions relative to the exist for each convective region.
circulation center of the storm, and the areal extent Two strategies are employed to determine
of these regions. These factors are the intensity which of the six factors has the strongest asso-
of the storm as measured by the maximum sus- ciation with the radial position of the convective
tained wind speed, the speed of the vertical wind regions relative to the circulation center of the
shear and storm forward motion, whether the TC TC. First, correlation coefficients are calculated
becomes extratropical, time that has elapsed since between the distance of the convective region
landfall, and the position of the TC and its rainfall relative to the circulation center of the TC and the
regions relative to the coastline. Therefore, the variables representing the six factors. As the data
attribute table of each convective polygon is are not normally distributed, they are ranked from
expanded to include additional information for highest to lowest values and Spearman rank cor-
its observation time. Columns added to the at- relation coefficients (Wilks, 1995) are calculated.
tribute table include the storm name, year, month, The second strategy to explore patterns in
day, observation time, circulation center latitude the location of convection under the six factors
and longitude, maximum sustained wind speed, involves grouping the convective regions ac-
direction and speed of vertical wind shear and cording to sub-classifications of each condition.
storm motion, the number of hours since landfall, Observations are grouped by the intensity of the
and the number of hours until the storm becomes storm as determined by the velocity of the maxi-
extratropical, if applicable. The distance and bear- mum sustained winds: 1) greater than 33 m s-1
ing of each polygon centroid relative to the cir- for hurricanes, 2) 17-33 m s-1 for tropical storms,
culation center of the TC at each observation time or 3) less than 17 m s-1 for tropical depressions.
is then calculated using spherical trigonometry. The convective regions are placed into one of
The next step in the analysis is to recalculate three groups based on the speed of the vertical
the positions of the polygon centroids so that wind shear: 1) less than 5 m s-1, 2) 5-10 m s-1,
their bearings are in coordinates that are relative or 3) greater than 10 m s-1. This breakdown of
to the heading of the storm. First, the heading of slow, medium, and fast vertical wind shear is
the storm is subtracted from the bearing of the utilized to render the results comparable to those
centroid. Next, 360° is added to negative values. of Corbosiero and Molinari (2002). These same
Thus, polygon centroids located at bearings 0-90° divisions are used to group the convective regions
are in the right-front quadrant, and centroids with according to the forward velocity of the TC. Four
bearings of 90-180°, 180-270°, and 270-360° are categories are developed to examine convection
located in the right rear, left rear, and left front relative to the time that a TC becomes extratropi-
quadrants respectively. A similar process is per- cal: 1) less than 24 hours, 2) 24-48 hours, 3) 48-72
formed to determine the shear-relative quadrants. hours, or 4) observations from TCs that did not
For this calculation, the heading of the vertical become extratropical within 72 hours of landfall.
wind shear is subtracted from the bearing of the The number of hours after landfall (0, 3, 6, 9,
centroid, and 360° is added to negative values. 12, 15, 18, 21, and 24) is also used to group the
Polygon centroids with bearings of 0-90° are observations. Distance relative to the coastline is
located in the downshear right quadrant, while 90- the final factor by which the convective regions
180°, 180-270°, and 270-360° are located upshear are divided into three categories: 1) more than

1074
A Geospatial Analysis of Convective Rainfall Regions within Tropical Cyclones after Landfall

25 km offshore, 2) within 25 km of the coastline RESULTS AND DISCUSSION


either on or offshore, or 3) more than 25 km in-
land. These distances represent natural breaks in The 43 TC landfalls examined in this study have
the data. To examine the radial positions of the 1545 regions of convection larger than 500 km2
convective regions, the percent of observations during the 387 observation times (Figure 3). On
in 100 km-wide annular rings extending outward average, four convective polygons are present at
from the circulation center is calculated for each each observation time, but 36 observation times
condition-group. The distances of convective re- contain no areas of convection larger than 500 km2.
gions that are greater than 500 km are combined. Of the 43 landfalls examined, 25 occurred when
To characterize the azimuthal distribution of the TC was a tropical storm rather than a hurri-
convection for each of the condition-groups, the cane. By twelve hours post-landfall, only Danny
percent of observations occurring in each quadrant (1997) and Frances (2004) remained hurricanes,
of the storm is calculated. All quadrants are placed and most TCs were at tropical storm intensity.
relative to storm motion save for the examination Therefore, the majority of convective polygons
of data according to vertical wind shear; for this examined in the study occur when the TC is at
condition, quadrants are placed according to the tropical storm intensity. Also, in nearly half of
direction of the shear vector. the landfalls examined, TCs became extratropical
To determine which of the six factors is most within 72 hours of landfall.
strongly associated with large areas of convec- The discussion of each factor is presented in
tion, a second dataset is constructed so that each order according to the strength of its association
observation time has only one entry. For each with the location and areal extent of convection.
observation time, the areas of all 40 dBZ polygons As storm motion exhibits the strongest relation-
over 500 km2 in size are summed. Variables repre- ship to both the location and areal extent of con-
senting each factor are then added for each entry. vection, this factor is discussed first in both
For this dataset, the distance from the coastline subsections. For the location of convection rela-
variable represents the distance of the circulation tive to storm center subsection, the factors are
center of each TC from the nearest point on the ordered as follows: forward velocity, time until
coastline as calculated within the GIS. Spearman extratropical transition, vertical wind shear, in-
rank correlation coefficients are then calculated to tensity, time after landfall, and distance to coast-
determine if the largest areal extent of convection line. In the areal extent of convection subsection,
corresponds to the largest or smallest values of the the factors are presented as follows: forward
six factors. The observations are then placed into velocity, time until extratropical transition, inten-
the same sub-groups for each condition as those sity, distance from the coastline, vertical wind
described for the centroid distance comparisons shear, and time after landfall.
save for the distance to the coastline. For this
condition, observations are placed into one of Location of Convection
four groups: 1) within 50 km, 2) 50-100 km, 3) Relative to Storm Center
100-200 km, or 4) greater than 200 km from the
coastline. For each condition-group, the average Results suggest that the forward velocity of the TC
area covered by convection is calculated to de- has the largest effect on the location of convection
termine which variables are associated with the both in the azimuthal and radial directions during
largest area covered by convection, which might the 24 hours after landfall. Overall, more convec-
indicate the largest potential for flooding to occur. tion occurs on the right side of the storm rather
than the left (Figure 4a), which agrees with the

1075
A Geospatial Analysis of Convective Rainfall Regions within Tropical Cyclones after Landfall

Figure 3. Positions of the circulation centers of the 43 TC landfalls examined in the current study cat-
egorized by intensity, and the location of the centroid of areas of convection larger than 500 km2

findings of previous research (Frank & Ritchie, Forward velocity is also important when con-
1999; Corbosiero & Molinari, 2003; Lonfat et al., sidering the distance from the circulation center
2004; Chen et al., 2006). Shapiro (1983) explains at which convection tends to occur. The correla-
that storm motion induces asymmetries in the tion between centroid distance and storm forward
boundary layer frictional drag due to the stronger velocity demonstrates that storm motion has the
winds on the right side of the vortex that result strongest association with the radial distribution
when translation speed is added to the rotational of convection out of all six factors tested (Table
speed of the tangential winds. The current study 1). TCs moving at a fast forward velocity have
finds that in TCs moving at velocities under 5 m 17% of convective observations located more
s-1, convection is evenly split between the right than 500 km from the circulation center, which
front and rear quadrants of the storm. When TCs is the highest percentage of all other sub-groups
are moving at 5-10 m s-1, 60% of convective ob- of factors examined (Table 2). The distribution
servations occur in the right front quadrant and of convection is fairly similar among TCs with
only 21% are located in the rear of the storm. forward velocities below 5 m s-1 and 5-10 m s-1.
Almost 55% of convective observations for TCs For these storms, approximately 70-75% of ob-
with forward velocities exceeding 10 m s-1 occur servations occur within 300 km of the circulation
in the left front quadrant, and 95% occur in the center. However, 52% of observations in TCs
forward half of the storm, indicating a counter- moving at speeds greater than 10 m s-1 occur
clockwise shift in the location of convection as beyond 300 km. This has important implications
forward velocity increases. This is similar to the for the findings of previous research as studies
pattern described by Kimball (2008). examining convection only within 400 km of the
circulation center (e.g., Chen et al., 2006; Cecil,

1076
A Geospatial Analysis of Convective Rainfall Regions within Tropical Cyclones after Landfall

Figure 4. Locations of polygon centroids relative to the circulation center of the storm based on a) storm
motion, b) time until becoming extratropical, c) vertical wind shear, d) intensity, e) hours since landfall,
and f) location relative to the coastline. All figures save c are oriented towards the direction of storm
motion with the right front quadrant at the top right of the figure. Panel c is oriented in the direction
of the vertical wind shear so that downshear right is at the top right of the figure. Only centroids for
convective regions in TCs that became extratropical are depicted in panel b. Annular rings are shown
every 100 km out to 800 km.

2007) may have missed as much as one third of missed more than half of the convection that ex-
the convection associated with fast-moving TCs. ists in fast-moving TCs.
Even more importantly, studies examining data The number of hours until a TC becomes
within only 300 km of the circulation center (e.g., extratropical also exhibits an association with the
Molinari et al., 1999; Corbosiero & Molinari, location of convection in TCs after landfall (Fig-
2002; Corbosiero & Molinari, 2003) may have ure 4b). When a TC is within 48 hours of becom-

1077
A Geospatial Analysis of Convective Rainfall Regions within Tropical Cyclones after Landfall

Table 1. Spearman Rank correlation coefficients


within left front quadrant increases only slightly.
for each convective region’s centroid distance,
However, in the next 24 hours, the amount of con-
and for the total areal extent of all convective
vection in the right (left) front quadrant decreases
regions at each observation time; Bold values
(increases) to 49 (42) %, completing the counter-
are statistically significant at α = 0.01
clockwise shift. Ritchie and Elsberry (2001) and
Centroid Distance Total Areal Extent Atallah and Bosart (2003) explain that this shift
(n=1545)a,b (n=387)c in the azimuthal location of convection occurs
Motion 0.235 0.524 when relatively cool and dry air that diminishes
Intensity -0.143 0.461 convection advects counterclockwise around the
Time until -0.189 a
0.274 circulation center beginning southwest of the TC.
Extratropical Meanwhile, convective precipitation is enhanced
Hour since 0.180 -0.252 to the north of the circulation center as the warm
Landfall
and moist air of tropical origin on the eastern side
Shear -0.057b -0.018c
of the TC is uplifted.
Distance to 0.034 0.013
Coastline The speed of the vertical wind shear appears
a
n = 908
to exert a stronger influence on the azimuthal
b
n = 1152 rather than the radial distribution of convection
c
n = 273 after landfall (Figure 4c). When all observations
are considered, most of the convection is located
in the downshear left and right quadrants, which
ing extratropical, convection decreases markedly
is in agreement with the findings of Corbosiero
in the inner 100 km, while 24 hours earlier, this
and Molinari (2002), Chen et al. (2006), and Cecil
innermost region contains the highest percentage
(2007). However, a clockwise shift in the position
of observations (Table 2). In the outer region of
of the convective regions is observed as vertical
the storm beyond 300 km, the number of convec-
wind shear increases. For TCs experiencing verti-
tive polygons increases from 14% to 43% of the
cal wind shear speeds less than 5 m s-1, 39 (28)% of
total number of observations during the same time
the observations occur in the downshear (upshear)
period (Table 2). These results suggest that a
left quadrant. When the velocity of the vertical
spreading out of convection occurs as a storm
wind shear increases to 5-10 m s-1, the number of
becomes extratropical. The negative correlation
observations increases (decreases) to 53 (17) %
between centroid distance and hours until the TC
in the downshear (upshear) left quadrant. When
becomes extratropical (Table 1) supports this
strong vertical wind shear is present, only 6% of
finding.
observations are located in both upshear quadrants,
Although the majority of the convective poly-
while 53 (36) % are located in the downshear left
gons remain in the right front quadrant regardless
(right) quadrants. This clockwise shift in convec-
of when or if an extratropical transition happens,
tion with increasing vertical wind shear has not
a shift in the counterclockwise direction towards
been discussed by previous researchers. Figures
the left front quadrant occurs as TCs approach the
from Corbosiero and Molinari’s (2002) study only
time that they become extratropical (Figure 4b).
indicate that the downshear quadrants experience
At 48-72 hours before becoming extratropical,
the most lightning flashes in both the inner 100
nearly 50 (25) % of convection is in the right front
km and outer 100-300 km regions of the storm
(rear) quadrant. Twenty-four hours later, the right
regardless of the strength of the vertical wind shear.
front (rear) quadrant contains 61 (10) % of the
While the speed of the vertical wind shear
convection, while the percentage of observations
appears to influence the azimuthal distribution

1078
A Geospatial Analysis of Convective Rainfall Regions within Tropical Cyclones after Landfall

Table 2. Percent of polygon centroids representing convective regions contained within each 100 km
annulus outward from the storm center, and the total number of centroids per condition-group. Bold
values indicate the distances with the highest percentage of observations.

Analysis n 0-100km 100-200km 200-300km 300-400km 400-500km 500+ km


Motion
< 5 ms-1 566 21 33 24 14 5 3
5-10 ms-1 736 17 29 23 16 8 8
> 10 ms -1
234 12 16 21 20 15 17
Time until Extratropical
0 - 24 hrs 466 12 23 23 23 12 9
24 - 48 hrs 297 13 26 27 16 7 11
48 - 72 hrs 145 34 28 23 10 1 3
None 527 22 34 21 12 6 6
Shear
< 5 ms-1 132 18 33 23 12 8 6
5-10 ms -1
306 16 31 29 12 7 6
> 10 ms -1
714 21 31 21 14 6 7
Intensity
Hurricane 288 28 26 20 8 8 9
Trop. Storm 870 17 31 23 16 6 7
Trop. Dep. 387 10 24 25 21 12 8
Hours since Landfall
0 200 24 32 26 9 4 7
3 203 27 32 16 12 5 8
6 193 21 34 20 12 5 8
9 200 17 24 26 20 8 5
12 180 17 26 27 17 8 5
15 161 11 26 22 21 14 6
18 157 12 29 20 12 15 11
21 134 12 26 28 19 4 10
24 117 11 21 24 26 9 9
Coastline Proximity
Offshore 299 7 28 26 17 10 12
Near Coast 291 24 27 22 15 7 4
Inland 933 19 29 23 16 7 6

of convection, it has a much weaker associa- are located in either the inner core, or beyond 400
tion with the radial distribution of convection as km regardless of the velocity of the vertical wind
compared to storm motion. Approximately 30% shear. This suggests that stronger wind shear values
of convective observations are located 100-200 do not result in the development or displacement
km from the circulation center, and less than 20% of convection farther from the circulation center.

1079
A Geospatial Analysis of Convective Rainfall Regions within Tropical Cyclones after Landfall

The lack of a statistically significant correlation 55% are located within 200 km of the circulation
between the velocity of the vertical wind shear center and only 12% are located beyond 400 km
and the distance of the convective observations (Table 2). From 9 to 12 hours post-landfall, con-
relative to the storm center (Table 1) supports vection diminishes in storm core, but remains the
this finding. same in the region beyond 400 km. Over the next
Convection is present in both the core and outer 6 hours, the percentage of observations located
rainbands of hurricanes, while weaker TCs have over 400 km away from the storm center doubles.
little convection present within their core (Table Towards the end of the 24-hour period, the amount
2). Hurricanes have nearly the same number of of convection in the farthest distances diminishes,
convective regions within 100 km of the circula- but the distance of most convective observations
tion center, which includes convection found in the shifts outwards from 200-300 km to 300-400 km.
eyewall, as they do in the region 100-200 km from A counterclockwise shift in the location
the circulation center (Table 2), where the edge of of convection is not observed when time after
the principal rainband is often located (Willoughby landfall is the only variable considered (Figure
et al., 1984; Molinari et al., 1999). Tropical storms 4e). The number of convective regions in each
have a clear maximum in convection 100-200 km motion-relative quadrant remains fairly consistent
from the circulation center. Tropical storms have for the first 21 hours post-landfall. The percent
more convection in the storm core than do tropi- of observations occurring in the right front, right
cal depressions, but much less convection near rear, left rear, and left front quadrants are approxi-
the core of the storm than do hurricanes (Table mately 50%, 20%, 10%, and 20% respectively.
2). Twice as many convective observations are However, at 24 hours post-landfall, the maximum
located 300-400 km from the circulation centers shifts to the right rear quadrant, with 32% of the
of tropical depressions than within the innermost observations. Most of these observations belong
100 km, indicating a lack of convection near the to slow-moving tropical depressions, and slow-
core of tropical depressions (Table 2). The negative moving TCs tend to have more convection present
correlation coefficient between intensity and the in the right rear quadrant of the storm than in the
distance of the observations from the circulation left front quadrant, possibly due to the advection
center indicates that higher intensity correlates to of moisture from the ocean (Bluestein & Hazen,
a closer centroid distance (Table 1). This result 1989). Analysis of additional hours post-landfall
suggests that convection spreads outwards as the is required to determine if a maximum in the left
TCs weaken. As the majority of the observations front quadrant emerges at a later time.
occur on the right side of the storm (Figure 4d) The distance at which convection forms relative
in all three sub-groups, intensity does not appear to the coastline appears to have some association
have as strong of an association with the quadrant with the azimuthal distribution of convection rela-
in which convection is located as does storm mo- tive to the circulation center of the storm. Figure
tion and vertical wind shear. 4a shows that convection exists in the left rear
When the locations of convection are examined quadrants of some TCs despite this area being
based upon the time that has elapsed since landfall, the most unfavorable location for the formation
the spreading out of convection with time that is of convection given that downward vertical mo-
discussed by Kimball (2008) is evident. However, tion is enhanced on the left side of a TC (Powell,
the outward spread of convection does not appear 1990). While many TCs track inland after landfall,
to happen steadily over time. The radial distribu- some travel across the peninsula of Florida (e.g.,
tion of convective polygons is similar during the Jeanne (2004) and Erin (1995)) or track parallel
first six hours post-landfall, when approximately to the U.S. coastline (Floyd (1999) and Hanna

1080
A Geospatial Analysis of Convective Rainfall Regions within Tropical Cyclones after Landfall

Table 3. The average areal extent of convection


(2008)). Thus, convection located more than 25
for each condition-group and the number of ob-
km offshore may occur in any motion-relative
servations contained within each condition-group;
quadrant of the storm. In this study, over 50%
Bold values indicate group with the largest areal
of the offshore convection is located in the rear
coverage of convection for each condition
quadrants of the TCs (Figure 4f). The 45% of
offshore observations located in the right front Analysis n Average Area (km2)
quadrant are associated with TCs making land- Motion
fall along the east coast of the U.S. and moving < 5 ms-1 175 5842
towards the north, for example Floyd (1999) and 5-10 ms-1 149 10340
Hanna (2008). Offshore observations located the
> 10 ms -1
32 29975
farthest from the circulation center in the right
Time until Extratropical
front quadrant exist in TCs that are nearing the
0 - 24 hrs 87 18994
time at which they become extratropical.
24 - 48 hrs 51 13407
Results from the analysis of the near-shore
48 - 72 hrs 33 6307
observations suggest that the coastline does exert
none 176 5253
an influence on the azimuthal location of con-
Intensity
vection as TCs make landfall and move inland.
Hurricane 56 13039
Previous studies have discussed how frictional
convergence produced along the coastline along Trop. Storm 178 11676

the right side of the storm enhances the develop- Trop. Dep. 121 5849

ment of convection as a TC makes landfall (e.g., Coastline Proximity

Jones, 1987; Frank & Ritchie, 1999). However, < 50 km 149 14773

more than 35% of the observations occur in the 50-100 km 67 6989


two rear quadrants of the storm, suggesting that 100-200 km 69 5787
the coastline influences convective precipitation > 200 km 65 6281
even after the circulation center has moved in- Shear
land. Additionally, since convection occurs near < 5 ms-1 45 4568
the coastline more than 300 km away from the 5-10 ms -1
110 4781
circulation center, coastal areas not affected by > 10 ms-1 118 12273
the strongest winds in a landfalling TC may still Hours since Landfall
experience heavy precipitation. 0 43 9056
3 43 9140
Areal Extent of Convection 6 43 10248
9 43 9973
The largest regions of convection are associated 12 40 11611
with fast-moving TCs that are nearing the time
15 38 10816
of transition to extratropical and located within
18 37 11119
50 km of the coastline. Of these three factors,
21 33 10676
storm motion has the strongest relationship with
24 34 6356
the areal coverage of convection (Table 1). TCs
that are moving at speeds exceeding 10 m s-1 have
larger areas covered by convection on average
than any of the other sub-groups examined (Table
3). The nearly 30,000 km2 average areal coverage

1081
A Geospatial Analysis of Convective Rainfall Regions within Tropical Cyclones after Landfall

Figure 5. The total area covered by convection at each observation time compared to: (a) storm motion,
(b) time until becoming extratropical, (c) intensity, (d) location of storm circulation center relative to
the coastline, (e) vertical wind shear, and f) time after landfall.

of convection for fast-moving TCs is 20,000 km2 occur when forward velocity is below 10 m s-1.
more than for TCs moving at 5-10 m s-1, and ap- As a TC becomes extratropical, the surrounding
proximately 10,000 km2 more than for TCs that environment becomes baroclinic and the isentro-
are within 24 hours of becoming extratropical. pic ascent of the moist air mass associated with
The top ten largest convective areas occur in TCs the TC enhances precipitation and causes the
with forward velocities greater than 8 m s-1, and region within which heavy rainfall occurs to in-
very few TCs that are moving at this speed have crease in area (Atallah & Bosart, 2003; Jones et
small areas of convection (Figure 5a). al., 2003). As a group, TCs that are within 24
Interaction with a middle latitude trough is hours of being extratropical have the second larg-
what typically causes storm motion to increase, est areas of convection on average (Table 3). The
and is also what causes the TC to become extra- ten largest areas of convection are associated with
tropical (Jones et al., 2003). All of the observation Floyd (1999), Hanna (2008), and Bertha (1996),
times that feature TCs moving at speeds greater which moved towards the north parallel to the
than 10 m s-1 also occur when TCs are within 24 east coast of the U.S. (Figure 3) at speeds greater
hours of becoming extratropical. However, all than 9 m s-1 as they transitioned into extratropical
TCs within 24 hours of becoming extratropical storms. The average area of convection increases
do not necessarily move at fast forward velocities. for TCs as the time until their extratropical tran-
Nearly 60% of the observations from this group sition (Table 3). The statistically significant cor-

1082
A Geospatial Analysis of Convective Rainfall Regions within Tropical Cyclones after Landfall

relation coefficient between total areal extent of and continental air masses surrounding the TC
convection and time until becoming extratropical have less available moisture to enhance rainfall
(Table 1) also supports the finding that extra- (Bluestein & Hazen, 1989; Cubukcu et al., 2000).
tropical transition results in larger regions of As previously discussed, many of the TCs that
convection. For TCs that do not experience an remain within 100 km of the coastline several
extratropical transition, convection develops over hours post-landfall experience an extratropical
an area of approximately 22,000 km2 in the larg- transition that helps to increase the area over
est case (Figure 5b), but convection is more com- which convective rainfall occurs. Despite the clear
monly confined to an area measuring approxi- tendency for inland TCs to have smaller convec-
mately 5000 km2 (Table 3). tive rainfall regions, a statistically significant
The second largest correlation with convective correlation does not exist between a TC’s distance
area is the maximum sustained wind speed (Table from the coastline and the area of its convection
1), indicating that the area covered by convection (Table 1). When TCs are close to the coastline,
is largest for hurricanes and decreases as TCs areas of strong convection present in the eyewalls
weaken. Although the five largest regions of of hurricanes as well as convection enhanced by
convection belong to tropical storms (Figure 5c), friction along the coastline in the outer rainbands
numerous regions smaller than 1000 km2 exist for have a limited spatial extent, which accounts for
both tropical storms and tropical depressions. In many of the regions of convection that occupy
hurricanes, however, the smallest region is more less than 10,000 km2 visible in Figure 5d.
than 2100 km2 in size. Hurricanes have convec- TCs experiencing vertical wind shear velocities
tion in both their core and in their outer rainbands above 10 m s-1 have a larger area of convection
so that greater total area is covered by convec- on average than do TCs experiencing less shear
tive rainfall than for weaker TCs. This finding (Table 3, Figure 5e). Nine of the ten largest con-
is consistent with that of Shepherd et al. (2007). vective areas for highly-sheared TCs are associ-
Also contributing to the large spatial coverage of ated with Floyd (1999) and Bertha (1996) when
convection for hurricanes is their location close these TCs were approximately 20-35 hours from
to the coastline. When the TC is located inland becoming extratropical and were located within
and the tangential winds are weaker, it is more 50 km of the coastline. Vertical wind shear typi-
difficult to advect moisture from the ocean into cally increases during an extratropical transition
the circulation to sustain heavy rainfall (Bluestein (Jones et al., 2003), and moisture advection from
& Hazen, 1989). The exception to the trend of the nearby ocean also contributes to the large area
diminishing convection as maximum sustained of rainfall associated with these highly-sheared
winds weaken after landfall occurs when TCs TCs. However, strong vertical wind shear is not
become extratropical. Many of these TCs are at solely associated with an extratropical transition,
tropical storm intensity as the size of their con- as nearly 40% of the observations for highly-
vective area grows. sheared storms occur in TCs that do not become
As a group, TCs located within 50 km of the extratropical within 72 hours of landfall. Many
coastline have the third largest areal extent of of these highly-sheared TCs that do not become
convection out of the 26 sub-groups examined extratropical have relatively small areas of convec-
(Table 3). Once a TC is more than 100 km inland, tion surrounding their circulation centers as they
no areas of convection larger than 25,000 km2 are track inland. Thus, a significant relationship does
present (Figure 4d). This is likely because less not exist between the speed of vertical wind shear
moisture is available over the land surface for and the area covered by convection (Table 1).
latent heat flux (Tuleya, 1994; Kimball, 2008),

1083
A Geospatial Analysis of Convective Rainfall Regions within Tropical Cyclones after Landfall

A negative correlation between the areal extent outwards from the circulation center of TCs after
of convection and the time elapsed since landfall landfall. Three factors are associated with a coun-
indicates that convection decreases in area over terclockwise shift in convection from the right side
time (Table 1). The two highest averaged areas of to the front of the storm: a) increasing forward
convection occur at 12 and 18 hours post-landfall velocity, b) decreasing time until extratropical
(Figure 5f), rather than at the time of landfall, which transition, and c) convection forming inland rather
has the second smallest average area of convec- than offshore. However, increasing vertical wind
tion (Table 3). At the time of landfall, convection shear shifts convection in the clockwise direction.
tends to be organized into smaller regions in the The spreading outwards of convection is associated
eyewalls of hurricanes and/or the outer rainbands with a) decreasing storm intensity, b) increasing
of hurricanes and weaker TCs. It is the dynamics storm forward velocity, and c) decreasing time
associated with an extratropical transition that until a TC becomes extratropical.
organizes these small regions of convection into Among the six factors examined in the current
larger regions and expands the overall size of the study, the forward velocity of the TC in conjunction
rain fields of the TCs. However, the overall size of with an extratropical transition had the strongest
the convective areas tends to decrease beginning association with the location of convection both in
approximately 12 hours prior to the TC becoming the radial and azimuthal directions relative to the
fully extratropical (Figure 5b), and this marked circulation center of the storm, and also the larg-
decrease in convection likely accounts for the est influence on the area over which convection
negative correlation between the areal extent of extends. As the forward velocity of TCs increases
convection and time elapsed since landfall. due to interaction with a middle latitude trough,
areas of convection grow in size, are located farther
from the circulation center of the storm, and shift
CONCLUSION counterclockwise from the right side of the storm
when motion is slow to the front of the storm when
A GIS was utilized to perform a spatial analysis motion is fast. The dynamical changes in the struc-
of radar reflectivity returns to determine where ture of a TC experiencing an extratropical transition
regions of convective rainfall are located relative act to decrease convection behind and in the core
to the circulation centers of TCs, and how large an of the storm while increasing convection ahead of
area was covered by these regions. The analysis the storm so that the same pattern of shifting and
was performed for 3-hourly periods during the first growing convection is observed whether examin-
24 hours after 43 TC landfalls. The radial and azi- ing TCs according to their forward velocity or the
muthal positions of each convective region greater time until they become extratropical.
than 500 km2 in area, and the total areal extent of Previous studies suggested that vertical wind
convection at each observation time, were then shear is the dominant influence on the location of
examined relative to six factors. These factors were convection within TCs, but the current study found
storm intensity, vertical wind shear, storm motion, that shear did not have as strong of an association
whether the TC became extratropical, the number with the location of and area covered by convection
of hours elapsed since landfall, and the distance as did storm motion and extratropical transition. As
of the convective areas as well as the circulation found by previous researchers, a clear maximum of
center of the TC from the coastline. convection in the downshear left quadrant convec-
This study supports the results of previous tion was observed in the current study. However,
modeling and observational studies that suggest a clockwise shift in the region where convection
convection shifts counterclockwise and spreads formed from upshear left to downshear right was

1084
A Geospatial Analysis of Convective Rainfall Regions within Tropical Cyclones after Landfall

observed as the velocity of the shear increased, convection diminishes, stratiform rainfall should
which is a pattern that has not been discussed by comprise a larger portion of the rain fields as the
previous researchers. Although the areas of convec- TC moves inland. The techniques utilized in the
tion were larger on average when TCs experienced current study should be applied to radar reflectiv-
high shear rather than shear with a velocity less ity regions having reflectivity values less than 40
than 10 m s-1, the speed of the vertical wind shear dBZ to quantify the position and areal coverage
did not significantly alter the radial position of the of stratiform rainfall within the TC rain fields.
convection. These results suggest that the tendency The relative contribution of both stratiform and
to base the forecasted location of convective rainfall convective rainfall to the overall areal coverage of
primarily on the direction and speed of the vertical TC rain fields could then be calculated.
wind shear may need to be reassessed.
This study analyzes a large number of TCs in
a GIS framework. To more accurately determine ACKNOWLEDGMENT
specific locations on the ground that may experi-
ence convective rainfall from a TC, and to more Three anonymous reviewers are thanked for their
effectively link the location of convective rainfall helpful comments that improved the quality of
to environmental variables, future work should also this manuscript.
utilize the GIS to quantify the shape and orienta-
tion of the convective regions. Adding these spatial
attributes to the size and location data presented REFERENCES
in the current study will allow a complete set of
spatial attributes to be cataloged for each convec- AMS. (2007). Hurricane forecasting in the United
tive region. Analyzing these shapes at hourly or States. Bulletin of the American Meteorological
half-hourly intervals would facilitate the tracking Society, 88, 87–92.
of each convective region as it develops so that Anagnostou, E. N. (2004). A convective/stratiform
changes in its spatial attributes can be calculated precipitation classification algorithm for volume
as the TC tracks inland. Additional factors that may scanning weather radar observations. Meteoro-
affect the development and displacement of convec- logical Applications, 11, 291–300. doi:10.1017/
tion such as the angle at which a TC crosses the S1350482704001409
coastline, topography, and moisture present within
the atmosphere and soil, will also be considered so Atallah, E. H., & Bosart, L. R. (2003). The extra-
that these observational results can be compared to tropical transition and precipitation distribution
modeling studies that examine similar variables. of Hurricane Floyd (1999). Monthly Weather
The results of these future analyses could then be Review, 131, 1063–1081. doi:10.1175/1520-
utilized to validate the spatial representation of 0493(2003)131<1063:TETAPD>2.0.CO;2
convective rainfall regions produced by dynami-
Bluestein, H. B., & Hazen, D. S. (1989). Dop-
cal models such as the HWRF (Davis et al., 2008)
pler-radar analysis of a tropical cyclone over
and GFDL (Marchok et al., 2007), and statistical
land - Hurricane Alicia (1983) in Oklahoma.
predictions of TC rainfall produced by models such
Monthly Weather Review, 117, 2594–2611.
as R-CLIPER (Marchok et al., 2007), TRaP (Kid-
doi:10.1175/1520-0493(1989)117<2594:DRAO
der et al., 2005), and PHRaM (Lonfat et al., 2007).
AT>2.0.CO;2
Future research should also consider the con-
tribution of stratiform rainfall to the total rainfall
produced by TCs as they move over land. As

1085
A Geospatial Analysis of Convective Rainfall Regions within Tropical Cyclones after Landfall

Burpee, R. W., & Black, M. L. (1989). Temporal DeMaria, M., Mainelli, M., Shay, L. K., Knaff, J.
and spatial variations of rainfall near the centers A., & Kaplan, J. (2005). Further improvements
of two tropical cyclones. Monthly Weather to the statistical hurricane intensity prediction
Review, 117, 2204–2218. doi:10.1175/1520- scheme (SHIPS). Weather and Forecasting, 20,
0493(1989)117<2204:TASVOR>2.0.CO;2 531–543. doi:10.1175/WAF862.1
Cecil, D. J. (2007). Satellite-derived rain rates in Elsberry, R. L. (2002). Predicting hurricane land-
vertically sheared tropical cyclones. Geophysical fall precipitation: Optimistic and pessimistic views
Research Letters, 342), L02811-L02821. from the symposium on precipitation extremes.
Bulletin of the American Meteorological Society,
Chen, S. Y. S., Knaff, J. A., & Marks, F. D. (2006).
83, 1333–1339.
Effects of vertical wind shear and storm motion
on tropical cyclone rainfall asymmetries deduced Frank, W. M., & Ritchie, E. A. (1999). Effects of
from TRMM. Monthly Weather Review, 134, environmental flow upon tropical cyclone struc-
3190–3208. doi:10.1175/MWR3245.1 ture. Monthly Weather Review, 127, 2044–2061.
doi:10.1175/1520-0493(1999)127<2044:EOEFU
Corbosiero, K. L., & Molinari, J. (2002). The ef-
T>2.0.CO;2
fects of vertical wind shear on the distribution of
convection in tropical cyclones. Monthly Weather Geerts, B., Heymsfield, G. M., Tian, L., Halv-
Review, 130, 2110–2123. doi:10.1175/1520- erson, J. B., Guillory, A., & Mejia, M. I. (2000).
0493(2002)130<2110:TEOVWS>2.0.CO;2 Hurricane Georges’s landfall in the Dominican
Republic: Detailed airborne Doppler radar
Corbosiero, K. L., & Molinari, J. (2003). The rela-
imagery. Bulletin of the American Meteoro-
tionship between storm motion, vertical wind shear,
logical Society, 81, 999–1018. doi:10.1175/1520-
and convective asymmetries in tropical cyclones.
0477(2000)081<0999:HGLITD>2.3.CO;2
Journal of the Atmospheric Sciences, 60, 366–376.
doi:10.1175/1520-0469(2003)060<0366:TRBS Hill, K., & Lackmann, G. M. (2009). Influence
MV>2.0.CO;2 of environmental humidity on tropical cyclone
size. Monthly Weather Review, 137, 3294–3315.
Cubukcu, N., Pfeffer, R. L., & Dietrich, D. E.
doi:10.1175/2009MWR2679.1
(2000). Simulation of the effects of bathymetry
and land-sea contrasts on hurricane development Jones, R. W. (1987). A simulation of hurricane
using a coupled ocean-atmosphere model. Jour- landfall with a numerical-model featuring latent
nal of the Atmospheric Sciences, 57, 481–492. heating by the resolvable scales. Monthly Weather
doi:10.1175/1520-0469(2000)057<0481:SOTE Review, 115, 2279–2297. doi:10.1175/1520-
OB>2.0.CO;2 0493(1987)115<2279:ASOHLW>2.0.CO;2
Davis, C., Wang, W., Chen, S. S., Chen, Y. S., Jones, S. C., Harr, P. A., Abraham, J., Bosart, L. F.,
Corbosiero, K., & DeMaria, M. (2008). Predic- Bowyer, P. J., & Evans, J. L. (2003). The extratropi-
tion of landfalling hurricanes with the Advanced cal transition of tropical cyclones: Forecast chal-
Hurricane WRF model. Monthly Weather Review, lenges, current understanding, and future direc-
136, 1990–2005. doi:10.1175/2007MWR2085.1 tions. Weather and Forecasting, 18, 1052–1092.
doi:10.1175/1520-0434(2003)018<1052:TETOT
DeMaria, M., & Kaplan, J. (1994). A statistical hur-
C>2.0.CO;2
ricane intensity prediction scheme (SHIPS) for the
Atlantic basin. Weather and Forecasting, 9, 209–
220. doi:10.1175/1520-0434(1994)009<0209:AS
HIPS>2.0.CO;2

1086
A Geospatial Analysis of Convective Rainfall Regions within Tropical Cyclones after Landfall

Jorgensen, D. P. (1984). Mesoscale and convec- Marchok, T., Rogers, R., & Tuleya, R. (2007).
tive-scale characteristics of mature hurricanes. Part Validation schemes for tropical cyclone quantita-
I: General observations by research aircraft. Jour- tive precipitation forecasts: evaluation of opera-
nal of the Atmospheric Sciences, 41, 1268–1285. tional models for US Landfalling cases. Weather
doi:10.1175/1520-0469(1984)041<1268:MACS and Forecasting, 22, 726–746. doi:10.1175/
CO>2.0.CO;2 WAF1024.1
Kidder, S. Q., Kusselson, S. J., Knaff, J. A., Ferraro, Matyas, C. J. (2008). Shape measures of rain
R. R., Kuligowski, R. J., & Turk, M. (2005). The shields as indicators of changing environmental
tropical rainfall potential (TRaP) technique. Part conditions in a landfalling tropical storm. Meteo-
I: Description and examples. Weather and Fore- rological Applications, 15, 259–271. doi:10.1002/
casting, 20, 456–464. doi:10.1175/WAF860.1 met.70
Kimball, S. K. (2008). Structure and evolution of Medlin, J. M., Kimball, S. K., & Blackwell,
rainfall in numerically simulated landfalling hurri- K. G. (2007). Radar and rain gauge analysis of
canes. Monthly Weather Review, 136, 3822–3847. the extreme rainfall during Hurricane Danny’s
doi:10.1175/2008MWR2304.1 (1997) landfall. Monthly Weather Review, 135,
1869–1888. doi:10.1175/MWR3368.1
Konrad, C. E., Meaux, M. F., & Meaux, D. A.
(2002). Relationships between tropical cyclone Molinari, J., Moore, P., & Idone, V. (1999). Convec-
attributes and precipitation totals: Considerations tive structure of hurricanes as revealed by lightning
of scale. International Journal of Climatology, locations. Monthly Weather Review, 127, 520–
22, 237–247. doi:10.1002/joc.721 534. doi:10.1175/1520-0493(1999)127<0520:CS
OHAR>2.0.CO;2
Lawrence, M. B., Avila, L. A., Beven, J. L., Frank-
lin, J. L., Guiney, J. L., & Pasch, R. J. (2001). At- Office of the Federal Coordinator of Meteorology.
lantic hurricane season of 1999. Monthly Weather (2006). Federal Meteorological Handbook, No. 11:
Review, 129, 3057–3084. doi:10.1175/1520- Doppler Radar Meteorological Observations- Part
0493(2001)129<3057:AHSO>2.0.CO;2 C: WSR-88D Products and Algorithms. Washing-
ton, DC: U.S. Department of Commerce/ National
Lonfat, M., Marks, F. D., & Chen, S. Y. S. (2004).
Oceanic and Atmospheric Administration.
Precipitation distribution in tropical cyclones
using the Tropical Rainfall Measuring Mission Parrish, J. R., Burpee, R. W., Marks, F. D., & Grebe,
(TRMM) Microwave Imager: A global perspec- R. (1982). Rainfall patterns observed by digitized
tive. Monthly Weather Review, 132, 1645–1660. radar during the landfall of Hurricane Frederic
doi:10.1175/1520-0493(2004)132<1645:PDITC (1979). Monthly Weather Review, 110, 1933–1944.
U>2.0.CO;2 doi:10.1175/1520-0493(1982)110<1933:RPOBD
R>2.0.CO;2
Lonfat, M., Rogers, R., Marchok, T., & Marks,
F. D. (2007). A parametric model for predicting Powell, M. D. (1990). Boundary-layer structure
hurricane rainfall. Monthly Weather Review, 135, and dynamics in outer hurricane rainbands. 1. Me-
3086–3097. doi:10.1175/MWR3433.1 soscale rainfall and kinematic structure. Monthly
Weather Review, 118, 891–917. doi:10.1175/1520-
0493(1990)118<0891:BLSADI>2.0.CO;2

1087
A Geospatial Analysis of Convective Rainfall Regions within Tropical Cyclones after Landfall

Rappaport, E. N. (2000). Loss of life in the United Shapiro, L. J. (1983). The asymmetric boundary-
States associated with recent Atlantic tropical layer flow under a translating hurricane. Journal
cyclones. Bulletin of the American Meteorologi- of the Atmospheric Sciences, 40, 1984–1998.
cal Society, 81, 2065–2073. doi:10.1175/1520- doi:10.1175/1520-0469(1983)040<1984:TABL
0477(2000)081<2065:LOLITU>2.3.CO;2 FU>2.0.CO;2
Rodgers, E. B., Baik, J. J., & Pierce, H. F. Shepherd, J. M., Grundstein, A., & Mote, T. L.
(1994). The environmental influence on tropi- (2007). Quantifying the contribution of tropical
cal cyclone precipitation. Journal of Applied cyclones to extreme rainfall along the coastal
Meteorology, 33, 573–593. doi:10.1175/1520- southeastern United States. Geophysical Research
0450(1994)033<0573:TEIOTC>2.0.CO;2 Letters, 34, L23810. doi:10.1029/2007GL031694
Rosenfeld, D., Wolff, D. B., & Atlas, D. (1993). Tuleya, R. E. (1994). Tropical storm development
General probability-matched relations between and decay: Sensitivity to surface boundary-con-
radar reflectivity and rain rate. Journal of Ap- ditions. Monthly Weather Review, 122, 291–304.
plied Meteorology, 32, 50–72. doi:10.1175/1520- doi:10.1175/1520-0493(1994)122<0291:TSDA
0450(1993)032<0050:GPMRBR>2.0.CO;2 DS>2.0.CO;2
Schwarz, F. K. (1970). Unprecedented rains in Wilks, D. S. (1995). Statistical methods in the
Virginia associated with remnants of Hurricane atmospheric sciences. San Diego, CA: Academic
Camille. Monthly Weather Review, 98, 851–859. Press.
doi:10.1175/1520-0493(1970)098<0851:TURIV
Willoughby, H. E., Marks, F. D., & Feinberg,
A>2.3.CO;2
R. J. (1984). Stationary and moving convective
bands in hurricanes. Journal of the Atmospheric
Sciences, 41, 3189–3211. doi:10.1175/1520-
0469(1984)041<3189:SAMCBI>2.0.CO;2

This work was previously published in the International Journal of Applied Geospatial Research (IJAGR), Volume 1, Issue 2,
edited by Donald Patrick Albert, pp. 71-91, copyright 2010 by IGI Publishing (an imprint of IGI Global).

1088
Section 5
Organizational and Social
Implications

This section includes a wide range of research pertaining to the social and behavioral impact of Geographic In-
formation Systems around the world. Chapters introducing this section critically analyze and discuss trends in
Geographic Information Systems, such as historical GIS, volunteered GIS, and healthcare applications of GIS.
Additional chapters included in this section look at collaborative mapping, agriculture, and conservation. Also
investigating a concern within the field of Geographic Information Systems is research which discusses the effect
of trust and security on Geographic Information Systems. With 19 chapters, the discussions presented in this sec-
tion offer research into the integration of global Geographic Information Systems as well as implementation of
ethical and workflow considerations for all organizations.
1090

Chapter 65
The Digital Geography Lab
at Salem State University:
The Evolution of One of the Oldest
Educational Digital Geospatial Labs

Kym Pappathanasi
Salem State University, USA

Stephen Young
Salem State University, USA

ABSTRACT
Established in 1983, the Digital Geography Lab (DGL) at Salem State University (SSU) is one of the
oldest higher-education digital spatial labs. This paper details the evolution of the DGL as well as its
current status as one of the best computer labs at Salem State University. This paper describes the chang-
ing technology of the DGL over the past 28 years. The DGL not only provides SSU students with cutting
edge geospatial technology, but it is a research lab for faculty and is extensively used in community out-
reach programs. The DGL is being used more and more by other departments such as Geology, Criminal
Justice, Computer Science and Inter-disciplinary Studies which reflects the expansion of the geo-spatial
sciences beyond geography. The success of the DGL lies in part with the University’s management of
the facility where major decisions are made in consultation with the Department and DGL staff. The
configuration and technology of the DGL has changed considerably through the years, reflecting the
transformations in technology and educational philosophies, but the core mission of the DGL has not,
which is to serve the students, faculty and the local community with cutting edge geospatial technology.

INTRODUCTION Laboratory (DGL) (Figure 1). The DGL is one of


the oldest geo-spatial university labs in the United
Located on the third floor of Meier Hall is one States. It was officially established in 1983 and a
of Salem State University’s most outstanding few years later in 1986 it was proclaimed a Center
technology resources – the Digital Geography of Educational Excellence by then Massachusetts
Governor Dukakis (RSSG, 2003). As a Center of
DOI: 10.4018/978-1-4666-2038-4.ch065

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
The Digital Geography Lab at Salem State University

Figure 1. Frederick A. Meier Arts and Sciences


In addition to undergraduate students in the
Hall, Salem State University, Salem, Massachu-
College of Arts and Sciences, the DGL services
setts
students in the Master of Geo-Information Sci-
ences (GIS) program. Salem State University’s
Master of Geo-Information Sciences is also one
of the oldest in the country with the first gradu-
ate students arriving in 1993. Some DGL related
projects undertaken by masters students include:
archeological mapping, GPS accuracy assessment
(global positioning systems), GPS for nature re-
serve mapping, remote sensing for bird migration,
remote sensing for wind power development, and
environmental justice analysis of power generation
in Massachusetts.
Our graduates use their geo-spatial technical
expertise to secure positions in a variety of govern-
Educational Excellence it received special govern- ment, higher-education, not-for-profit and private
ment funding for its development of geo-spatial sector jobs throughout Massachusetts and beyond.
computing technology for the education of Salem Federal government agencies where our alumni
State students. Salem State University has also work include the FBI, Secret Service, National
continued to support the lab financially and has Geospatial-Intelligence Agency, NASA and the
allowed the policy and configuration decisions to Census Bureau. Private companies include ESRI,
be made unilaterally by the geography department. Tactician, Applied Geographics, Trimble, and Pic-
This hands-off policy has allowed the lab to grow tometry. For a more extensive list of our alumni’s
and service its students with the primary goals of employment please visit our DGL Alumni (2011)
geo-computing and geographic education. web page at dgl.salemstate.edu/alumni.shtml.
The DGL is an exceptional resource for educa- In addition to class-related work, the DGL
tion and research at Salem State University (SSU). provides state-of-the-art facilities for researchers
All geography majors, minors, and students who at Salem State University. Faculty, graduate stu-
are enrolled in geography courses, are provided dents and undergraduates all use the DGL for their
accounts in the DGL. In addition, most geography research needs. Research by faculty encompasses
classes require students to perform certain labs a wide breadth of areas from estimating floral
and exercises which are only available through intensity using high resolution aerial imagery, to
the software provided by the DGL. For example, demographic analysis of voter participation. Other
Introduction to Geography students map the dis- research includes emergency response mapping,
tribution of different ethnic groups across the sea-level rise mapping, transportation model-
United States, analyze population growth in India ing and land cover change. (Luna, 2010, 2008a,
and run development scenarios for urban growth 2008b; Ratner, 2009a, 2009b, 2008, 2007a, 2007b;
in Colorado. Remote sensing students download Young, 2011a, 2011b, 2004; Young & Harris,
satellite data from NASA to analyze deforestation, 2005). Each year geography faculty and students
global vegetation change and brush fires in Aus- attend regional and national conferences where
tralia as well as the melting of glaciers on Mount they present their research. In the past both under-
Kilimanjaro. graduate and master’s students have won national
research competitions held by the Association of

1091
The Digital Geography Lab at Salem State University

American Geographers (Remote Sensing, GIS, ing research support as necessary to the ongoing
and Cartography Specialty Groups Illustrated projects of the center.
Poster Competition: Graduate First Place: 2010: While department enrollments in recent years
Moumita Duttagupta; Undergraduate First Place have recently declined, the need for access to
2005: Michelle Roy; 2000: Kevin Coughlin, specialized geo-spatial software applications
Honorable Mentions: 2004, 2006, 2009). has remained steady due to more classes using
specialized software in their coursework (Fig-
ure 2). Many of the geography-based software
OUTREACH WITHIN AND OUTSIDE programs are beyond the means of most college
THE DEPARTMENT OF GEOGRAPHY students, and so through the DGL, Salem State
students can learn to use the most advanced and
The DGL is an academic departmental lab which cutting-edge technology in their field. Starting in
operates specifically to support the geography the fall of 2005 the university started requiring
department’s curriculum. The lab has always had all incoming students to purchase a laptop. This
a university-wide open door policy, and in recent has not reduced the overall use of the lab, as we
years we have seen increased usage by the geo- have found that many of our students would rather
logical sciences department due to the need for not carry their laptop around campus every day.
GIS in their field. In addition, other departments Therefore in addition to using the specialized
are starting to use GIS applications for research, geo-spatial software for classwork, the lab is used
such as biology and nursing. More academic frequently for e-mail, research, printing, scanning
departments are utilizing the DGL which reflects and for checking the campus online portal.
the expansion of the geo-spatial sciences beyond In addition to providing students with state-
geography. This semester (spring 2011) the DGL of-the-art technology, the DGL provides an envi-
will support 58 course sections (873 course reg- ronment where students can work on projects
istrations). These courses are offered through the together and build a sense of community, which
Departments of Geography, Geological Sciences, is often missing in many parts of our commuter-
Inter-disciplinary Studies, and Computer Science dominated university (Hickey, 2010). Alumni
at undergraduate and graduate course levels, and often speak of how those long nights together in
includes courses ranging from Introduction to the DGL forged deep friendships and a fondness
Geography, Weather and Climate, and Hydrology towards their years at Salem State.
to Advanced GIS, Digital Image Processing and
Spatial Database Design/Analysis.
Through the outreach of the department’s FROM THE PAST TO THE PRESENT
faculty, the DGL is also involved in community
programs. With the recent expansion of the lab, From the early 1980’s to the late 1990’s, primarily
it has been possible to invite regional middle and smaller upper division classes used the lab and
high school students, as well as the Massachusetts the need for physical space was not paramount.
branch of Women in Science and Engineering In 1984, the lab consisted of two rooms; the open
(WISE) program, to use the facility with the lab (a small room of 10 seats, created by splitting
intent of introducing pre-college students to the a traditional classroom in half) and a second class-
geo-sciences. The DGL is also part of Salem State room which was modified to house the computer
University’s Center for Economic Development (DEC VAX 11/780), its peripherals, two digitizers
and Sustainability, running workshops and provid- and the staff office. Many of the peripherals were
donated by the leading high-technology companies

1092
The Digital Geography Lab at Salem State University

Figure 2. Number of students utilizing the DGL by year

of that time (Tektronix, Synercom, UNIRAS and were coming by to write and print papers more
Digital Equipment Corporation). The lab also frequently, as typewriters were now becoming part
quickly developed alliances with many federal of the past, and as a result the DGL was being used
agencies including the National Park Service and by all levels of students in the program. In 1998
the University Center for Atmospheric Research the DGL transitioned to a Windows NT Server,
(UCAR). As the computer with the peripherals and by 2000 the DGL had completely phased
were the size of a small car (6’H x 4’W x10’L) out the DEC OpenVMS OS and had transitioned
and produced excessive heat, it required its own over to the Windows operating system for all its
full-size air conditioning unit built in the room with servers and PC clients.
it. In 1991 the lab upgraded to the VAX 4300. As By 2002, with the addition of a large format
hardware became more efficient and smaller (the plotter, oversize flatbed scanner and laser print-
new VAX was the size of an oven), this allowed the ers, the DGL was bursting at the seams, sign-up
building of a smaller room to house the computer sheets for computer usage time was necessary and
and the still very large air-conditioner it needed. had to be enforced. The need for a room where
The vacant room left over from the absence of the professors could hold lab classes had also become
larger VAX allowed for more lab space for students a necessity as well as a dedicated area for input
and the addition of more computer terminals. In and output devices. It was also important to have
1994 the lab purchased a DEC AlphaServer 2100 an environment for the students that encouraged
for its geo-spatial applications as well as to be group collaboration, which had always been a
used as a Gopher and Email server. At this time, large part of the success of the facility. Discus-
PCs were also becoming more affordable and sions started as to how to go about expanding
the DGL acquired 3 desktop computers to run the facility to meet the needs of the students. The
the newly introduced ArcView for Windows, to DGL worked with the university’s administration
supplement the AlphaServer’s OpenVMS version. and eventually came up with a plan to move and
With the introduction of desktop computers into the expand the DGL while also addressing other
DGL, the lab started to see more underclassmen facility issues in the process. As a result, new
using the facility for more than geo-spatial based electrical and networking closets were built and
applications and Web-based research. Students the DGL was able to move into a completely new

1093
The Digital Geography Lab at Salem State University

space specifically designed as a computing lab. Hardware resources include high resolution
In the fall of 2007 the DGL moved into the space color scanning on an 11x17 flatbed scanner. Out-
traditionally held by the manual cartography lab, put devices include high speed HP color laser
and darkrooms (see DGL’s Public Photo Gallery printers, and large format HP plotters. Discipline
at picasaweb.google.com/dgl.salemstate). specific software holdings of the DGL include
The DGL is now comprised of two rooms ArcGIS (ESRI), ERDAS Imagine, IDRISI (Clark
joined by a sliding pocket door, allowing for a Labs), Surfer (Golden Software), RockWorks
spacious feeling while it’s open. The primary (Rockware), SAS, and MapInfo (Pitney Bowes).
room, the Open Lab, is used on a daily basis by The DGL promotes equal access for students
those with accounts in the DGL. The Open Lab of all academic levels. All geography majors,
also has dedicated space to house the printers, minors and students enrolled in geography
plotters and scanner. The Teaching Lab is a non- classes have equal access and rights to the lab’s
scheduled classroom that is available for faculty facilities. This has been achieved by having full
to use during their class time. Faculty sign for the time staff that are responsible for all hardware,
Teaching Lab as needed and bring their classes in software, maintenance, and facility operations. It
to provide general instruction on coursework as is important to note that the lab has always been
it pertains to computer use in the DGL. staffed by degree holders in geography, therefore
The Digital Geography Lab has two Dell having an understanding of cartography and spatial
PowerEdge Servers serving thirty-seven clients analysis as well as having a lot in common with
(Dell Optiplex 745). Twenty clients are available the students and faculty. Due to the university’s
for use in the Open Lab (Figure 3) with multiple fiscal constraints beginning in late 2008, a real-
desk layouts to accommodate different research locating of staff resulted in the loss of one of the
group sizes. In the Teaching Lab (Figure 4), there two full time positions. To supplement this loss,
are 16 student machines and one machine con- between 5 and 7 student lab assistants are hired to
nected to an overhead digital projector for the assist the DGL systems manager during the year.
instructor. All seventeen clients in the Teaching The lab assistants monitor the DGL, aid fellow
Lab are available for student use when the room students with basic computer and coursework
is not reserved for classes. questions, assist the staff with hardware and soft-

Figure 3. Open lab, Salem State University’s Figure 4. Teaching lab, Salem State University’s
digital geography laboratory digital geography laboratory

1094
The Digital Geography Lab at Salem State University

ware troubleshooting, and keep the lab open at REFERENCES


night and on weekends. All lab assistants must be
majors in the Geography or Geological Sciences Hickey, R. (2010). Beyond class hours: The GIS
Departments and must have had some previous lab as a center for geographic education. Inter-
DGL experience through their course work. The national Journal of Applied Geospatial Research,
success of the day to day operations of the lab 1(3), 1–16. doi:10.4018/jagr.2010070101
strongly depends on the quality and competence Luna, M. (2008a). Out of sight, out of mind: Dis-
of the lab assistants. Not only does this give the tancing and the geographic relationship between
students an opportunity to work on-campus, it electricity consumption and production in Mas-
provides them with good work experience in a sachusetts. Social Science Quarterly, 89(5), 578.
geo-computing environment and exposes them doi:10.1111/j.1540-6237.2008.00578.x
to a wide range of software that may be outside
their concentrations. Luna, M. (2008b). Estimate of limited English
The amount of hours that the lab is open each speaking eligible voters in select communities of
semester varies according to the student employee the Boston Metropolitan Area. A Report to Greater
budget and eligible students awarded work-study Boston Legal Services. Salem, MA: Salem State
funding. On average the DGL is open 75 hours per College.
week during the fall and spring semesters. During
Luna, M. (2010). Voter participation and demo-
the summer semesters and academic breaks, the
graphics in Lowell, MA: A GIS study. A report to
lab is usually open Monday through Friday during
ONE Lowell and the City of Lowell. Retrieved
the day. In order for the lab to be open at night or
from http://dgl.salemstate.edu/geography/Profs/
on weekends the systems manager, a lab assistant,
luna/projects/2010_01Spring/GPH903/Lowell-
or faculty member must be present. Although this
VoterReport2010reduced.pdf
does impact the amount of time students have
access to the lab, 24-hour unsupervised access Ratner, K. (2007a). Use of orthophoto imagery
is not an option due to the extensive amount of with integrated elevation values to determine
hardware and software in the lab. roadway lengths. Salem, MA: FTA Transportation
Research Program.
Ratner, K. (2007b). Why we need to measure our
CONCLUSION
roadways. Paper presented at the North East St.
Lawrence Valley Division, Association of Ameri-
While the hardware, software and location of the
can Geographers Meeting, New Haven, CT.
lab have changed over the years, its mission has
not. With technology education becoming more Ratner, K. (2008). The changing world of roadway
and more important at Salem State University, mapping and measurement. Paper presented at
the DGL will continue to service the faculty and the Annual Meeting of Association of American
student body in the College of Arts and Sciences Geographers, Boston, MA.
as well as in the Graduate School with cutting-
Ratner, K. (2009a). The reshaping of land use and
edge geo-spatial computing technologies. As the
urban form in Denver, Colorado through transit-
geo-spatial sciences become more mainstream
oriented development. Denver, CO: National Cen-
in society (i.e., GPS and GIS technologies), labs
ter for Intermodal Transportation (NCIT) Grant.
such as the DGL play a crucial role in providing a
well educated and essential geo-spatial workforce.

1095
The Digital Geography Lab at Salem State University

Ratner, K. (2009b). Was it going to happen any- Young, S. S. (2011a). Geoinformatics and the
way? Examining the link between transit and land mapping of lands vulnerable to sea level rise. In
use development in Denver. Paper presented at P. K. Joshi & T. P. Singh (Eds.), Geoinformatics
the Annual Meeting of Association of American for climate change studies. New Delhi, India: The
Geographers, Las Vegas, NV. Energy and Resources Institute (TERI).
Remote Sensing Specialty Group of Association Young, S. S. (2011b). The Earth exposed: How
of American Geographers (RSSG-AAG). (2003). geographers use art and science in their exploration
Digital Geography Laboratory. The RSSG News- of the Earth from space. In Dear, M., Ketchum, J.,
letter, 3(1), 5. Luria, S., & Richardson, D. (Eds.), GeoHuman-
ities: Art, history, text at the edge of place (pp.
Young, S. S. (2004). Satellite detected broad-scale
183–186). New York, NY: Routledge.
vegetation change in China, 1982 – 1999. Asian
Geographer, 22(1-2), 123–142. doi:10.1080/102 Young, S. S., & Harris, R. (2005). Changing
25706.2003.9684103 patterns of global-scale vegetation photo-
synthesis, 1982-1999. International Jour-
nal of Remote Sensing, 26(20), 4537–4563.
doi:10.1080/01431160500239198

1096
The Digital Geography Lab at Salem State University

APPENDIX: FACULTY AND STAFF CREDENTIALS, EXPERTISE

Richard T. Anderson, Professor Emeritus


B.S., Salem State College; M.Ed., Boston State College; M.A., Ed.D., Boston University.
Areas of specialty: World Regions, Europe, Geographic Education
Profile: His major area of scholarship is in Geographic Education. The National Geographic Society
selected him as the founding coordinator of the Massachusetts Geographic Alliance. He is also
the co-director of the Northeast Global Education Center and coordinates numerous geographic
workshops for K-12 teachers. He served as the National Council for Geographic Education rep-
resentative on the Geography Education National Implementation Project. In 1996 he received an
Outstanding Geography Educator Award and then in 1999 a Life-Time Achievement, both from
the National Council for Geographic Education (NCGE) with a Mentor Fund in his name. In 2001
he received an Achievement Award from the NESTVAL Geographic Society. Professor Anderson
has brought in more than $6,000,000 in grant money to Salem State University.

William L. Hamilton, Professor


B.S., Worcester State College; M.S., Southern Illinois University; Ed.M., Worcester State College;
Ph.D., Oregon State University.
Areas of specialty: Analytic Cartography, GIS, Quantitative Application to Geo Spatial Intelligence
(GEOINT), Photogrammetric and Digital Image Processing, Applied Meteorology and Climatol-
ogy, Applied Geosciences Curriculum Analysis and Development
Profile: Over 35 years teaching experience spanning elementary- primary, middle school, special educa-
tion, secondary school to post doctoral education. Held high level positions in and currently consults
with Fortune 100 companies in GEOINT applications. While at SSC developed seventeen new
courses, acquired funding and created the Digital Geography Laboratory, created the Masters in
Geo-Information Science, and the PhD in Geo-Information Science. Widely published in books,
proceedings, journals and software applications and has earned several awards for GEOINT curri-
cula development and integration. Current research endeavors are time-space modeling and system
development for anti-terrorism bio and chemical warfare initiatives. Currently Dr. Hamilton is the
co-director of the SSU Center for Economic Development and Sustainability.

John T. Hayes, Associate Professor


B.A., Bowling Green State University; M.A., Ph.D., University of California-Los Angeles
Areas of specialty: Physical Geography, Climatology, Meteorology, Natural Resource Conservation
and Management, Soils, GIS, Water Resources Management, Environmental Impact Assessment,
Global Warming, Climate Change
Profile: He has authored or co-authored over 20 scientific papers based on mathematical modeling of crop
yield, crop water use, and climate impacts on crop productivity. He has taught in Xian, China for 5
½ weeks (Xibei University) and participated on a 2-week research expedition to the Loess Plateau
region of China. He was co-editor for two years of the new peer-reviewed journal in geography,
The Northeastern Geographer. He has co-chaired the campus’ Earth Day Planning Committee
for the past 6 years which is responsible for putting on a week-long program of speakers, films,
juried art exhibit, and judged student poster competition each April to help the campus celebrate
environmental problems, issues, and successes. Prof. Hayes has supervised student interns over

1097
The Digital Geography Lab at Salem State University

the past decade for the city of Salem doing energy use audits and climate protection work for the
city and serves on Salem’s Renewable Energy Task Force. John is currently Vice-President of the
New England-St. Lawrence Valley Geographical Society, a regional division of the national As-
sociation of American Geographers (AAG); the position is a President-elect position. John’s recent
research (which he has presented at AAG’s annual meetings) has reviewed the potential carbon
sequestration potential of soils that are used for farming and/or ranching as a component of efforts
to mitigate against rising greenhouse gas concentrations.

Lorri K. Krebs, Associate Professor


M.A., Wilfrid Laurier University; Ph.D., University of Waterloo-Canada.
Areas of specialty: Tourism, Hospitality, Economic Geography, Canada, Caribbean
Profile: Dr. Krebs’ Ph.D. was awarded from the University of Waterloo, Canada in Geography with an
emphasis on marketing tools, tourism and decision-making. She also holds a Masters Degree in
Geography with a focus on ecosystem approaches, parks and protected areas. Her research in these
areas span from trans-boundary issues in Yukon and Alaska, to sustainability and tourism develop-
ment in National Parks. She has worked in various aspects of tourism in Switzerland, Mexico and
Canada. Current research interests include special events, Indigenous people and tourism, market-
ing, and economic development. Dr. Krebs is also Executive Director of Salem State University’s
Center for Economic Development and Sustainability.

Marcos J. Luna, Associate Professor


M.A. California State University, Los Angeles; Ph.D., University of Delaware.
Area of specialty: Environmental Justice, Political Ecology, Energy Policy and Planning, Environmental
Policy, Environmental History, Social Justice and Inequality, GIS Analysis
Profile: Dr. Luna works with a variety of non-profit organizations in the greater Boston metropolitan
area on issues ranging from voter participation to environmental change to urban history. Regularly
works with both undergraduate and graduate students on service learning GIS projects to engage
with the community and to provide students with the opportunity for real-world experience. Before
coming to Salem, Dr. Luna worked as an Environmental Analyst for the Massachusetts Department
of Environmental Protection and as a GIS consultant for Native American tribes in the Southwest
and Northwest. He has held research assistantships in grant-funded remote sensing and spatial
analysis projects and has participated in health and environmental public service projects.

Stephen Matchak, Professor and Department Chairperson


B.A., University of Hartford; M.A., University of Wales; M.A., Ph.D., University of North Carolina.
Areas of specialty: Tourism, Cultural Geography, Folklore
Profile: Dr. Matchak is a cultural geographer especially interested in landscapes, tourism, New England,
Europe, and folklore. His degrees include a Ph.D. from the University of North Carolina as well as
masters degrees from the University of North Carolina and the University of Wales-Aberystwyth.
For several years, Dr. Matchak has been interested in international education and has been a faculty
member on over two dozen international educational trips.

Kym Pappathanasi, Staff Assistant


B.A., University of Vermont; Graduate Certificate in the Cartographic Sciences, Salem State College

1098
The Digital Geography Lab at Salem State University

Areas of specialty: Cartography, GIS, Geo-Computing Systems Management


Profile: Ms. Pappathanasi received her degree in geography at the University of Vermont. After complet-
ing a Graduate Certificate in the Cartographic Sciences at Salem State, she became a Cartography
Instructor at Salem State College. The following year she accepted the Systems Manager position,
and for the last 19 years she has been supervising and managing the Digital Geography Lab at
Salem State University.

Keith Adam Ratner, Professor


B.S., Michigan State University; M.S., Pennsylvania State University; Ph.D., University of Denver.
Area of specialty: Urban and Transportation Planning, and GIS Implementation and Development
Profile: Prior to coming to SSU Dr. Ratner worked for eleven years as a professional planner in New
England at the local, regional and state levels of government. Research projects at SSU have in-
cluded an investigation into the regionalization of transportation planning north of Boston, the use
of three dimensional data for the determination of road lengths for federal reporting requirements,
and the analysis of regional land use changes related to transit oriented development in Denver,
Colorado. Student projects at SSU have included participation in the development of the Boston
Enterprise GIS Website, the development of a Service Area Management GIS for the Boston Centers
for Youth and Families, the development a GIS implementation plan for North Shore Community
Development Corporation.

Steven E. Silvern, Associate Professor


B.A., Clark University; M.A., University of Illinois; Ph.D. University of Wisconsin.
Area of specialty: Cultural Geography, Political Geography, Historical Geography, Indigenous
Profile: Dr. Steven Silvern is a cultural and political geographer who teaches a wide variety of human
geography courses including: World Regions, Sustainable Development, Geographic Research, Politi-
cal Geography, Population Geography and Native American Environments. He has been extensively
involved in using the internet and information technology to enhance geographical education and is
working with the Association of American Geographers to develop on-line instructional materials
on global climate change. Dr. Silvern’s research has been on the political and cultural geography
of Native Americans with current research on right-wing anti-Indian sovereignty movement and
how history museums represent Native Americans. He is also researching the reasons for success
or failure of community supported agriculture in Massachusetts.

Stephen S. Young, Professor


B.A., University of Vermont; M.E.S., Yale University; Ph.D., Clark University
Area of specialty: Biogeography, Global Change, Remote Sensing, China, Asia.
Profile: Through field work Dr. Young has worked in: Southwestern China, England, Taiwan, Japan,
Hong Kong, Australia, Montana and Alaska. He has published over a dozen scientific papers
based on his field work. Currently he is working with the Chinese Academy of Sciences analyzing
land-cover change in and around China’s nature reserves. He also has put together a traveling art
exhibition which demonstrates how Geographers study the Earth from Space. The exhibition has
been shown in over a dozen locations including the Headquarters of the National Science Founda-
tion, Philadelphia, Australia and North Africa.
This work was previously published in the International Journal of Applied Geospatial Research (IJAGR), Volume 3, Issue 3,
edited by Donald Patrick Albert, pp. 86-96, copyright 2012 by IGI Publishing (an imprint of IGI Global).

1099
1100

Chapter 66
Analysis of Tourist Behavior
Based on Tracking Data
Collected by GPS
Oriol Bernadó
Science and Technology Park for Tourism and Leisure, Spain

Albert Bigorra
Science and Technology Park for Tourism and Leisure, Spain

Yolanda Pérez
University Rovira i Virgili, Spain

Antonio P. Russo
University Rovira i Virgili, Spain

Salvador Anton Clave


University Rovira i Virgili, Spain

ABSTRACT
The research field of tourism management has been witnessing in the last ten years an important de-
velopment in methods based on the analysis of the spatial behavior of visitors, mainly driven by the
progressive enhancement and economic accessibility of satellite-based tracking techniques. However,
the practical operations of destination management systems based on tracking technologies still suffer
from the relative complexity of obtaining and mastering this information. This chapter illustrates the
development of Track-Tour, a user-friendly system that integrates the tracking of visitor spatial activity,
the elaboration of such data according to structured formats, and their visualization through a web-
based viewer. It is believed that this system has the potential to allow an easier uptake by destination
management and planning organizations in their quest for more sustainable tourism impacts.

The chapter presents the design and validation of this comprehensive methodology, and the early results
from a test application in the historical centre of the World Heritage city of Tarragona, a semi-enclosed
area with unpredictable patterns of tourism mobility that typically suffer from congestion and flow
management problems.

DOI: 10.4018/978-1-4666-2038-4.ch066

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Analysis of Tourist Behavior Based on Tracking Data Collected by GPS

INTRODUCTION ing in the ensemble of the regions and/or sights


at the destination. These potential actions should
It is clear that mobility is a core aspect of contem- be complemented by better marketing strategies
porary life (Drummond, et al., 2006), and tourism (Thornton, et al., 1997) and several other aspects,
is by no means indifferent to this. The increased based on a better segmentation of the target and the
possibilities and means of transport (reducing visitable space. Knowing exactly what routes have
functional distances), as well as the supposed been taken, by how many people or vehicles, for
“normality” of the individual’s occupational, how long and with what effort, what facilities or
social and leisure mobility (Urry, 2008) have shows they went to, who visited and photographed
engendered a cultural and economic revolution the monuments or when, as well as what types of
and has led to the progressive standardization of shops and places interested visitors more or less,
tourism as a social commodity. This evolution could lead to a very significant improvement in
modifies the traditional approach to tourism and the performance of destinations. This reality will
the topics being researched or investigated in this allow us to establish new solutions designed with
field addressing research into aspects such as precision on the elements, such as signs, opening
the choice of destinations and forms of tourism times, the setting of license costs, public transport
(where to travel), and aspects of performativity engineering, etc. In summary, it leads us to a new
and spatial behavior at the destination (what to do tourist site management scenario on any spatio-
there, what to consume, and where). This article temporal scale.
focuses particularly on the second aspect, defining
a systematic method for the study of tourist mobil-
ity at destinations or tourist areas that will help STATE OF THE ART
destinations to plan and manage actions geared
towards improving the competitiveness and the This potential improved management of tourist
sustainability of tourist activity. Hitherto, this area destinations was unthinkable until only recently;
of research has aroused scant interest, resulting however, in the last decade, technological devel-
in a limited practical usefulness of the methods opment (particularly in the new Information and
employed and the results obtained for policies and Communication Technologies (ICT) and satel-
the management of visitable spaces (Dietvorst, lite communication technology) has permitted a
1995; Thornton, et al., 1997; Shaw, et al., 2000). change of paradigm in the consideration of the
Even in places where visitor surveys are carried aforementioned mobility. Until barely a few years
out regularly, visitor mobility has continued to ago, research in tourism had generated only gen-
merit little attention; the spatial or geographic eral information on the activity of visitors through
aspect of visits in general is neglected, since it is traditional or “low-tech” methods. While it is true
assumed and estimated to be impossible to amass that general information on tourist activity can be
such complex information through direct empirical obtained through methods such as direct surveys
research (Meng, et al., 2005). or travelogues, these methods present different
Thornton et al. (1997), among others, point out shortcomings in terms of availability of detailed
that understanding the spatio-temporal behavior of and real or exact data. This is due to the complex-
tourists at destinations could generate different im- ity of the environment: a visitor in an unknown
provements such as transport management, avoid setting will find it difficult, if not impossible, to
the saturation of places of interest, optimize routes remember the route, and then there is the need to
or, and beyond the actual destination, extend the secure the tourist’s close collaboration and dedi-
geographic distribution of visitors and their spend- cation, which means biasing the sample to those

1101
Analysis of Tourist Behavior Based on Tracking Data Collected by GPS

who are very interested in the study and have a the analysis part, as they have a webmap and the
methodical nature which leads them to keep a potentiality of developing ad-hoc tools for this
record of all their activities. The development purpose. In open natural areas; we refer to Loch
of other methods to ascertain mobility, such as Lomond and Trossachs National Park, Scotland
tracking and recording visitor activity time, either (Connel, et al., 2008) and Twelve Apostles Na-
personally or by analysis of closed circuit televi- tional Park, Victoria, Australia (O’Connor, et
sion cameras (Hartmann, 1988; Gali & Donaire, al., 2005). These projects are slightly different
2006), encounters different relevant problems such as regards space. The former analyzes car-based
as violation of privacy, and is barely profitable tourism through a survey and latter registers tour-
due to the limited sampling size it permits, or due ist movement through pressure sensors and maps
to the impossibility of segmenting tracking data their route with the aid of an agent-based model
according to the population’s socio-demographic for recreation behavior simulation. This project
characteristics. shows us another way to get data and a multitude
The need to know visitor mobility in the dif- of analysis possibilities. Another kind of space is
ferent tourist environments calls for the devel- in enclosed areas. For this, we participated in a
opment of a specific, effective, and sustainable project at the Port Aventura theme park (Russo,
methodology (in terms of time and economics). et al., 2010). What we learned in this project is
This is linked to different and new methods that what has helped improve the present stage, as is
are provided by different structural improve- described in this chapter.
ments in technology, and which are detailed in This ensemble of technologies coincides in
several documents (Shoval & Isaacson, 2006; the possibility of data collection on a very finely
Shoval & Isaacson, 2007; Shoval, 2008; Spek, honed scale, in real time and at low cost, but it
2008) and in recently published books (Schaick, should be complemented by management and
2008; Shoval & Isaacson, 2010) which permit exploitation tools adapted to needs and to all
a revision of the methodology. This has made a types of users. It is in this dimension where the
decisive contribution to reviving these studies for parallel development of Geographic Information
the development of a more or less homogeneous Systems (GIS), technologies related to territorial
series of methods for the analysis of visitors’ analysis and the study methods of human spatial
spatial behavior in tourist areas. Several studies behavior render it possible to rise to this challenge.
have been implemented in different cultural and Like the different tracking technologies, the use
physical environments, as well as on different of these GIS tools has become more widespread
spatial scales, using different methodologies and and prices have fallen considerably.
technologies. Particular mention may be made of Our contribution fits into this new line of re-
projects conducted in urban environments such as search, proposing some improvements to the data
in Akko, Israel (Shoval, 2008) or in Bilbao, San capture, management, and analysis procedures that
Sebastian, and Vitoria (Euskadi, Spain) (Alzua, substantially increase their efficacy as destination
et al., 2010): two of them use GPS devices but management instruments.
the second uses smart phones. In the first case, The fundamental aspects of the projects listed
the use of the devices implied that there must be here are:
a contact and an active collaboration of visitors,
which may be interfering with the experiment;; • Detailed knowledge of tourists’ spatial
in the second, the methodological bias in con- activity (use of space, routes, timing), so
stituted by the relative exclusivity of the device. as not to interfere in their spatio-temporal
The experience in Euskadi has a strong point in decisions through the collaboration of visi-

1102
Analysis of Tourist Behavior Based on Tracking Data Collected by GPS

tors who voluntarily carry a GPS device the data collected, with pedestrians the tracking
that stores data on their routes. system has to be discreet and passive, so as not to
• The preparation of these data through a distort or affect the study participants, and these
Geographic Information System that in- requirements are often difficult to fulfill.
cludes, moreover, an analysis module, Knowledge of pedestrian mobility will pivot
both of them developed specifically for around a combination of technologies for the de-
this project to obtain information on fun- velopment of two vectors: one based on tracking
damental profiles in terms of tourist spatial tools, and one for the monitoring and manage-
mobility, “risk” and “opportunity” space in ment of this tracking. This global system, since
flow management, and the main determin- it includes the whole process, should interfere as
ing elements of these patterns, such as the little as possible with visitors, so that their route
attraction capacity of certain elements at is not affected through their participation in the
the destination, the effect of natural events research. Thus, the generalization and cheapening
and conditions (e.g. the weather, time- of both technologies makes it possible to develop
tables), or systems signaling routes and a comprehensive methodological corpus to study
accesses. visitor mobility in any setting. This methodology
is based on the use of Tracking Technologies (TT)
The implementation of this project will provide for collecting visitors’ spatial data. These data will
different work phases that include the design of be integrated in management systems that will
fieldwork campaigns from the technological and facilitate data treatment and cleansing, as well as
socio-demographic standpoint and the develop- the automated cartographic representation of the
ment of the interface used to input and analyze and visitors’ behavior in tourist settings. The current
view the data, named Track-tour in this project. technological reality provides different alterna-
tives for these two tracking and display vectors.
In TT we can choose between different op-
TECHNOLOGICAL BACKGROUND tions. To focus these different possibilities, TT
can be classified, for our purposes, into two major
The current technological scenario is ideal for the groups: short- and long-range or in combination.
comprehensive development of methodologies Both work according to a specific radiofrequency
and tools that allow us to learn more about visitors’ protocol for wireless data transmission and follow
spatial behavior. The potential and the track record a synthesis of these. In short-range and potentially
of these tools in research into tourism, for example useful TT in this research, we can include differ-
Global Navigation Satellite Systems (GNSS), are ent communication protocols such as: Radiofre-
very broad, although no really major effort has quency Identification (RFID), sensor networks
been made. Until only recently, most research such as the Wireless Sensor Network (WSN), or
has focused solely on transport analysis, motor Wireless Wide Area Networks (WWAN), such
vehicle tracking studies, while the monitoring as Global System for Mobile Communications
of pedestrians has remained virtually untapped. (GSM), General Packet Radio Service (GPRS),
This may be due to different factors; one possible and Universal Mobile Telecommunications Sys-
explanation is that compiling data on pedestrians tem (UMTS). The different protocols they use
using GPS is far more complicated than doing so permit communication between points (in linear
on motor vehicles. If we consider that the tracking or grid form), providing information to the whole
system of a car is merely a device which, once network from nodes to other nodes, generating
installed, neither affects nor alters the nature of new information.

1103
Analysis of Tourist Behavior Based on Tracking Data Collected by GPS

Long-range TT can include different GNSS, solidation of web cartography and Spatial Data
either active or under development, including Infrastructure (SDI) viewers, due to the maturity
the current standard, GPS, although the cur- of different standards, technologies and data man-
rent GLONASS, the future GALILEO and the agement software. This technological environment
potential developments of BEIDOU and IRNSS is complemented by improving data storage op-
should also be considered. These systems permit tions in spatial databases (both proprietary, such
exact positioning on a terrestrial surface on the as IBM DB2, Oracle, Microsoft SQL Server, and
basis of the calculation of distances from at least Open Source, such as PostgreSQL, MySql,…),
three satellites (of known position) and a fourth making it possible to display them through dif-
satellite that provides the altitude; in practice it ferent standards (WMS, Web Map Service; GML,
is based on the synchronization of the satellites Geography Markup Language; KML, Keyhole
and the exact time the signal was transmitted. Markup Language,…) and through map servers
Receiver errors are minimized through filters and (mapserver, GeoServer,…).
augmentation systems that may be Satellite-Based
(SBAS) or Land-Based (GBAS). At conceptual
level, it is estimated that all these communica- TRACK-TOUR: A TOURIST
tion protocols will render it possible to ascertain MOBILITY ANALYSIS SYSTEM
visitor mobility at an affordable cost for use in
tourist management and be considered beneficial Two sequential work phases for the tourist mobility
or profitable. The precision of GNSS systems is analysis may be established: (1) data collection
currently undoubted, but positioning depends and (2) data analysis. Each one establishes a clear
exclusively on the visibility of the transmitting relationship with a specific technology, the first
satellites for the receivers; thus, this system is does so with the TT, while the second is based on
not effective indoors or in narrow streets, a key WebGIS technology. In this project, both have been
factor in this type of project. In this chapter, we integrated in a tool that permits the comprehensive
focus our interest on this type of TT for reasons management of the whole process, and whose
explained later on in this paper. internal architecture is described in this section.
The other technological cornerstone of the
project corresponds to data management tools, Architecture
and it is here that GIS and web technology take
on special importance. The characteristics of both The comprehensive tourist mobility data man-
render it possible to develop this functionality as a agement tool envisages two capital phases: data
comprehensive and multi-user management tool. capture and data management. The main system
The objective is to design a web interface, Track- architecture is a database complemented by a
Tour, that makes it possible to display data in a specific module: it permits spatial data display,
simple and user-friendly way, while also adding management, and web publishing. An Object-Re-
a toolset for the treatment and analysis of data lational Database (ORD) was chosen to implement
according to the needs of tourism managers and this project; i.e. a hybrid database that contains
the characteristics of each tourist area. both storage technologies: relational and object.
Data display and management in a web envi- This ORD delivers responses to users through the
ronment was regarded as the most suitable option web interface, i.e. which receives data and subse-
due to universality of use and access; moreover, quently serves them for analysis. This structure
in the future, real time data may be loaded into makes it possible to add data to the system after
the system. This decision was based on the con- sampling, and analyze them following cleansing.

1104
Analysis of Tourist Behavior Based on Tracking Data Collected by GPS

Both tasks will be implemented within a web participants, and the PostGis extension as a
environment (multi-user and multi-platform). complement to store the spatial objects. The
Data display and analysis are organized around display of data stored in PostgreSQL/PostGis is
the database but are complemented by a map web implemented by means of PHP and Open Lay-
server (which makes it possible to load data from ers through a GeoServer map server. The use of
the ORD as an image in a web environment) and PHP (Hypertext Pre-Processor) is due to several
an API (Application Programming Interface) for reasons, although the main one is its standardiza-
using base maps from external sources (such as tion and the knowledge of it. It is an interpreted
Google, Yahoo, Bing Maps, or Openstreetmap). programming language that is currently almost
The ORD chosen to support the core of the standard in dynamic web development; moreover,
architecture of this system is a PostgreSQL data- it permits highly flexible development (and can
base with the PostGis extension for spatial data. be consequently modulated to suit specific needs)
This ORD was selected for the following reasons: while also delivering reliable connectivity with
database servers.
• Open Source, this project was started up For spatial data, Open Layers was chosen as
for research and integration purposes; the a solution to display cartographic data due to the
development freedom and flexibility it of- stability of the project and the proven results ob-
fers is appreciated. Moreover, for potential tained in other studies. This Javascript library is
commercial operation, cost needs to be an OGC-endorsed display standard, which permits
minimal. connectivity with different data and lean develop-
• Scalability, it can adapt optimally to large ment. The data were delivered to the web interface
data structures and the amount of system through different Web Map Service (WMS) for
memory, enabling it to support a greater the base cartography and GeoServer as WMS
number of simultaneous requests. server for own data. The latter was chosen for its
• In technical terms it implements the use interoperability and scalability.
of rollbacks, subqueries, and transactions, The system works sequentially. First of all, the
making for more efficient operation. spatial data collected by the devices are download
• It can verify referential integrity and store by means of a PHP form that includes raw data and
procedures in the actual database, just as which, after cleansing, saves the information and
high-level databases managers, such as the waypoints to the PostGis database. Secondly,
Oracle, do. the data is analyzed by means of a specific tool
• Compatibility; the Open Geospatial developed in a web environment, using functions
Consortium (OGC) awarded a certificate written in PHP, which present the different analysis
of compatibility that permits connectivity options available. Functions developed in PHP
with its different standards. Moreover, at and SQL are used in combination with existing
GIS desktop software level (QuantumGis, PostGis functions to perform the calculations of
Udig, Kosmo, gvSig, Openjump, or the the chosen analysis. The results of these analysis
like) and map server level (GeoServer, calculations are displayed on a web interface by
mapserver, …) it can function as a direct- Open Layers, which, in turn, queries the database
access data source, thus facilitating view- to represent them on a map through Goeserver.
ing and edition. The functions developed for both processes are
addressed in detail later (see Figure 1).
This PostgreSQL-based architecture has
been used to store the information on the study

1105
Analysis of Tourist Behavior Based on Tracking Data Collected by GPS

Figure 1. System architecture

Data Collection • The cost, which must be kept as low as


possible so that high price does not prevent
The database is the core of the application, but it it from being added to tourist management
is the data that gives it meaning, hence the reli- processes effectively.
ability of the results obtained depends on the data.
From the outset, we sought to collect the data by The TT selected was the GPS signal, for dif-
means of a TT, with visitors being given a device ferent reasons. GPS devices have become popular
to log their route and then upload the information in recent years as an almost de facto standard in
to the system. Thus, a series of requirements were location and positioning; they are integrated in
established to be fulfilled by this technology in cars, peripherals, telephones, and are proliferating
order to optimize the process: in everyday life continually, so they are devices
that tourists are familiar with and are not distrust-
• Non-interference: the device to be deliv- ful of. This technological consolidation makes it
ered had to detect the position of the visi- possible to estimate the timeliness of the choice.
tors at all times, without affecting their With regard to the initial assumptions, it is clear
route or impinging on their privacy. Above that it fulfills the non-interference requirement, as
all, this constrains the size and the weight there are several small-sized, light models. Regard-
of the device. ing the signal, generally speaking the reliability
• Reliability of the signal: the signal must and integrity requirements are met, albeit with
be continuous, without interruptions or al- nuances, which will be addressed in the spatial
terations by any external or visitor-derived adjustment section. As for price, they are low-cost,
factor. around 100 Euros per unit, and moreover, market
• Signal integrity: the resulting data must be evolution causes prices to drop quite quickly or
treated as generically and as interoperably remain the same, while functions are enhanced.
as possible for integration in any informa- In fact, one important issue in this project
tion system. was to choose the right device to use. Based on

1106
Analysis of Tourist Behavior Based on Tracking Data Collected by GPS

Table 1. Optimal models

Model Producer Waypoints Battery power WASS/EGNOS


BT-Q1000XT Qstarz 200,000 42h. Yes
i-gotU GT-200e Mobile action 262,000 80 h. Yes
V-900 Columbus MicroSD Storage 24h. Yes
RBT-2300 Royatek 650,000 9 h. Yes
GPS-4043 Blumax 150,000 25h. Yes
AGL3080 Amod 1,040,000 20 h. Yes

the devices used in a previous study in the Port problem with GPS devices is direct visibility in
Aventura theme park (Russo, et al., 2010) a study dense urban environments, as is the case of most
was carried out of the different choices in the GPS European cities. Positioning depends on at least
market to record position continuously. The key three satellites in the direct line of sight, which, in
factors that determined the choice were GPS chip the case of high buildings and/or narrow streets,
reliability, battery life and storage capacity. Table is not always feasible, generating positioning
1 shows the different models considered optimal. problems. This is known as the multipath error.
The models chosen were the BT-Q1000XT The multipath error appears when the receiver
and i-gotU and GT-200e, as their specifications detects parts of the same signal at two different
seemed the most suitable. The next step was to times. The first part received comes directly from
conduct a field trial to assess the signal quality a satellite, which is the real signal, the other one
and usability of the devices. In the field test phase, is a reflection of the same signal generated when
we noted that signal quality, storage capacity and it bounces off a surface (for example high build-
battery life were quite similar. The devices have ings), and is the distorted signal. This second part,
their strengths and weaknesses but overall gener- the distorted signal, is received later due to the
ated similar heat output. We decided on the first delay caused when travel distance increases. The
device due to its greater usability. The shape of magnitude of the multipath error depends on the
the device was a positive element, differing in following factors:
this respect to others.
• Position and types of surface near the re-
GPS Signal and Multipath Error ceiver where the signal can bounce
• The height of the antenna with regard to
Further to the above, strictly speaking the GPS the Earth’s surface
signal does not fulfill the precepts of integrity • GPS wave distance signals. Dimension of
and reliability, although this is a relative problem. the signal.
Structurally, the capture and reliability of the signal
generated by the GNSS depends on the visibility Analyzing the factors and the scope of the
of the satellite for the receivers. This conditioning different studies, we can see the effect of this
factor renders the devices vulnerable, for example, structural condition of the GPS signal. If we study
to atmospheric alteration, which are corrected by data from the routes through the historic centre of
augmentation systems such as WASS, EGNOS, Tarragona, a semi-enclosed space, we can see that
etc. (satellite systems based on other satellite or- the multipath error is magnified. This is due, on
bits that make the relevant corrections). Another the one hand, to the fact that in this part of Tarra-

1107
Analysis of Tourist Behavior Based on Tracking Data Collected by GPS

gona, the receiver is surrounded by high buildings ordinate every 10 seconds, the maximum
where the signal is reflected continually, and on distance between two recorded points can-
the other, to the fact that Tarragona is at very low not surpass 11.27 m.
altitude. These two factors lead the multipath error • Direction: In an urban tourist environ-
to be augmented and therefore degrade the quality ment, it is evident that movement may
of the signal in this spatial context. In summary, only be made in the street and in or over
urban morphology generates, on the one hand, elements adapted for this purpose, such as
signal discontinuity, derived from the loss of walls, promenades, beaches … Thus, visi-
satellite visibility, and on the other, the linking tor flows can only move through an exist-
of correct data to other incorrect data, generated ing, and therefore known, urban fabric.
by the signal bouncing off buildings. • Connectivity: An urban route must fol-
The worst issue is that this error is random and low the connectivity relationship, hence to
from the reading, it is impossible to know, with move from one point to another you have
certainty, the time or place of impact or intensity to follow streets that are connected to each
of deviation. The error arises from satellite posi- other.
tion and it is not feasible to establish a system
to fix it previously. These anomalies prompted The premises described above were used as the
the search for a solution: the development of an bases for the development of a spatial adjustment
automated cleansing tool. tool of the points collected by the GPS. These
conditions limit visitor movement possibilities in
Snap Tool time and in space, but also permit correction; even
if the devices generate erroneous signals they may
As was described in the preceding section, and be corrected in most cases. The generalization of
due to the multipath error, the signal received by studies and tools like this one will render it pos-
the devices, particularly in the case of Tarragona, sible to improve the ruggedness and reliability of
was produced at source. To attempt to solve this the spatial adjustment.
difficulty, an automatic and realistic tool was At a technical level, a series of steps are fol-
designed and implemented to adjust waypoints. lowed; the first is the elimination of incorrect
The urban fabric is used to adjust these points positions. The positioning of the points on different
and the different objective premises are taken into GPS receivers will be performed according to the
account with regard to the coordinates collected intensity of the signal received, for example, if
by the devices and the visitors’ real movement direct satellite view is not received they estimate
possibilities. These are the premises addressed: the position. Thus, this first step eliminates these
points (in most cases they are at the beginning
• Speed: It is assumed that people only of the route or the exit of an indoor space). The
move through the old town of Tarragona second premise refers to the adjustment of the
on foot. This conditions speed of move- points to the speed of pedestrian movement. In
ment and, in this case, the maximum speed this phase, the tool applies a first algorithm to the
estimated was 1.12 m/s. This speed may be remaining points to discard points that do not fulfill
modified, but in dense urban settings with the first premise. Since each point has an associ-
nearby tourist sites, no means of transport ated moment in time and a position in space, the
is likely to reach higher speeds. speed between each one of them and the previous
• Distance: Since the average speed estimat- and subsequent point in time is calculated. If the
ed is 1.12 m/s and the GPS records a co- speed surpasses 1.12 m/s, the point in question

1108
Analysis of Tourist Behavior Based on Tracking Data Collected by GPS

is discarded. The following picture shows an ex- the positions captured by the GPS, as squares,
ample with five points ordered in time and each and the repositioned points following the meth-
one of them separated from the previous one by odology depicted as bullets. As can be observed,
10 seconds (GPS coordinate recording interval). the first four points are repositioned correctly, as
The distance between points (p1, p2) and (p3, p4) are the last two. This is not the case of the rest of
is about 10 meters, on the other hand the distance the intermediate coordinates, which are reposi-
between (p2, p3) and (p3, p4) is more than 30 tioned in the adjacent streets; hence, the third
meters, therefore this algorithm eliminates point premise is not fulfilled (see Figure 3).
p3, maintaining p1, p2, p4, and p5, as can be To solve this problem and fulfill the fourth
observed in Figure 2. premise, a condition has been added to the repo-
To fulfill the third premise, the points resulting sitioning of the points. This condition refers to
from the first filter are adjusted to the closest checking that the street where the point is relo-
street. This step is necessary in the case of the cated is connected to the street of the previous
historic centre of Tarragona, because the impreci- point. If this condition is not fulfilled, it would
sion of the GPS leads some points to fall over mean that the tourist had gone through a building,
buildings. The following image shows an ex- and as this, in principle, is impossible, another
ample of this step. The original route taken by the street was sought. This new street had to be one
tourist is shown as a line, the points that simulate closest to the current point and, at the same time,

Figure 2. Snapping tool: Operating model

Figure 3. Snapping tool: Wrong result

1109
Analysis of Tourist Behavior Based on Tracking Data Collected by GPS

Figure 4. Snapping tool: Right result

to the street of the previous point. Figure 4 shows area with a limited number of entry points (9) and
the result of applying this algorithm and how the a high density of streets in the form of a practically
problem of the previous step is solved. regular grid. In any case, this location is no different
Figure 5 shows a real example of this reposi- from many other historic cities in Europe, such as
tioning in the streets of the historic centre of Venice, Bruges, Dubrovnik, Toledo, and many oth-
Tarragona. ers, where spatial behavior has been identified as
one of the main aspects determining inefficiencies
Fieldwork in the use of public space and consumption (see
for example Russo [2002] for the case of Venice).
The methodology of the fieldwork prepared for The working methodology basically consisted of
this project was modified according to the study distributing GPS devices to visitors, who took the
scenario. Sampling was performed in Tarragona. route they wanted and returned the device at the
The historic centre of Tarragona, from the stand- end of the visit or period. In Tarragona, control
point of visitor mobility, is a very peculiar space points were set up at the different entry points
since, while access is free, it is a historic walled to the old city so as not to condition the routes.

Figure 5. Snapping tool: Example

1110
Analysis of Tourist Behavior Based on Tracking Data Collected by GPS

These points were used to deliver and collect the • The daily “life cycle” of the tourist space
devices at the beginning and end of the route, re- analyzed, as well as of the different spe-
spectively. This process was complemented by a cific tourist attractions.
qualitative survey (nationality, age, gender, place • The identification of the existence of mo-
of accommodation, companions, etc.) to comple- bility clusters between the different socio-
ment the spatial data of each route with a wide demographic groups, both generally, in
socio-demographic profile generating patterns of terms of space, and specifically, in terms of
mobility, visits and specific activities. The device attractions or sites of interest.
delivery process was prepared on the basis of a
scheduled plan that aimed to cover most of the The information from the GPS, together with
possibilities depending on space and time, so that that of the socio-demographic survey, makes up
the routes had different beginnings and endings. an aggregate of georeferenced data. As has already
This first sampling process was carried out over been mentioned, the spatial data are added to the
a 14-day period for 11 hours, with a total of 436 PostgreSQL/PostGis database and cleansed with
surveys, and each device was issued, on average, the space adjustment tool. Parallel to this process,
1.74 times a day in July, 2010. the alphanumeric information is also inputted into
the database, so that each route is linked to its socio-
Data Analysis demographic data and the activities the tourist had
engaged in. These data, in both scenarios, are the
Data collection is one of the project’s key points, cornerstone of the current analysis. This process
the other cornerstone being data analysis, a pro- may be carried out from different perspectives
cess which is detailed in this section. The main and with different methodologies. One option,
objective of the study is to learn about the dis- the most classic, proposes the implementation of
tribution of visitors in each one of the reference a results report comprised of two documents that
scenarios; in spatial and temporal terms. A series renders it possible a) to learn about all the socio-
of sub-objectives was established with regard to demographic characteristics of the sample, and b)
this general characterization of tourist behavior: the different trends in visitor mobility, the latter
on cartography. A series of conclusions linked
• The identification of the general patterns to this proposal may be obtained that enhance
of visitors’ spatial behavior, segmented knowledge of visitor routes as well as of the most
by their different socio-demographic commonly visited points and spaces (generically
characteristics. or segmented by socio-demographic data). Modern
• The identification of spaces with more and technology permits a fully integrated and flexible
less appeal, segmented in terms of socio- analysis that adapts to managers’ needs.
demographic characteristics. This project aims to generate an application,
• The analysis of the influence of visit dura- with the help of web technologies, in which this
tion (time allocation) on the spatial behav- analysis is dynamic and adapted to the specific
ior of the visitors. needs of each tourist area. Intrinsically, mobility
• The sequence and selection of the visits means immediacy, thus it would seem obvious
to places of interest or different landscape that it would be better to conduct an analysis that
elements, segmented by time and budgets. offers quick results in order to be able to acceler-
• The analysis of space congestion, either ate the decision-making process. On the basis of
generically or segmented (by space, time this idea, analysis architecture was developed and
or socio-demographic characteristics). it was deemed more convenient to design a web

1111
Analysis of Tourist Behavior Based on Tracking Data Collected by GPS

viewer, Track-tour, that provides the generic and flexible environment and permits flexible adap-
consensus analysis tools according to the tourist tation to any corporate web environment. On a
area studied. The objective of this viewer is to more specific level, the viewer has different parts
permit greater flexibility of configuration and or modules: a menu panel, a cartographic display
adaptation to different environments or to the panel and a tool panel. The first allows the user to
specific needs of each tourist space; at the same toggle between the different tasks performed by
time, it should offer suitable management and the viewer, i.e. browse through different submenus
generate results integrally and reliably. such as data loading, survey edition, and return
On the basis of these premises, the analysis to the matrix web... The second, the cartographic
viewer requires a series of specific and concrete display panel, takes up most of the space on the
characteristics to fulfill its requirements properly. browser window with a view to displaying the
One initial premise is that it should be interoper- data in response to an action on the different
able, i.e., development can be implemented and tools provided by this panel. The base map, on
replicated in different environments regardless of which the tourist mobility data are deployed, is
the web browser or the operating system. Another served through external sources, such as Bing
requirement is that development should be flexible Maps, Google Maps, or Open Street Map through
and dynamic so that querying will be agile and GeoServer.
useful. The last and probably the most important This tool panel is the viewer’s key element
requirement is that the system, the viewer, must and features two groups of different actions. The
be reliable. The solution to these requirements is first groups are the actions related to browsing
provided by the actual genesis of the system: using and data contextualization, while the second
a web environment is what actually affords the groups, detailed below, are those developed spe-
viewer the leanness required. Reliability depends cifically for the project. The former include the
on the application’s internal source code and on its basic tools of any web geoportal such as zooming
interaction with the database, and it also delivers tools and layer management, while the latter in-
dynamism, avoiding long response times. Apart clude several developed analysis instruments
from the code, calls to the database have been linked to a printing module and another data filter,
optimized by minimizing the unnecessary data which make it possible to display the result of the
loading and by pre-calculating core functions. analysis processes according to a socio-demo-
Interoperability is achieved by implementing graphic or temporal parameter.
technologies with the highest possible degree of The analysis tools developed use the accumu-
standardization, thus avoiding the use of additional lation of points from the GPS devices, with the
plug-ins or specific formats. following being developed for this project and
The core of this viewer is the PostGis data- tourist environment: routes, stop points, sights,
base, which is the data source; the main axis of functional polygons (time and space) and concen-
display will be PHP language. This will enable tration. The first generates and enables the display
us to generate the responses required by users in of visitors’ itineraries in a color range varying
the viewer. In specific terms, the data displayed from the maximum value (red) to the minimum
will depend on other technologies, such as the value (green) with dynamically variable intervals
Open Layers library for display and GeoServer according to the filter established. The two specific
as cartography server. analysis tools, stop and interest points, relate to
In structural terms, the viewer will feature a the most important points for the visitors in the
series of clearly different parts (Figure 6). The sampling space using the same color range. The
external graphic structure is comprised within a points of interest count all the points and establish

1112
Analysis of Tourist Behavior Based on Tracking Data Collected by GPS

Figure 6. Web interface analysis

absolute maximums linked to the time dedicated each one is associated; the greater the value the
to each one (symbol size is proportional to the greater the color hue.
length of the visit). The stop points relate the The tools developed as the central part of the
spaces with the concentration of visitors so that project allow us to identify mobility values and
visitor frequency can be assessed. The polygonal patterns for the whole data set, although filtering
tools show the use and occupation of the space ac- is also possible. The complement to this tool panel
cording to a predefined spatial segmentation with is another specific filtering tool that can segment
regard to functional polygons that correspond to data according to the qualitative variables gener-
a combined parameter of urban morphology and ated by the surveys and also time patterns.
uses of space. This tool displays data according
to concentration by number of visitors or by time
of visit. The last tool, the concentration function, PRELIMINARY RESULTS AND
is used to display the evolution of the use of the PENDING RESEARCH CHALLENGES
urban space. This provides for a grid of small
regular polygons to which the visitor value for Preliminary results of Track-tour are being ana-
lyzed. As a sample of possibilities that we get with

1113
Analysis of Tourist Behavior Based on Tracking Data Collected by GPS

Figure 7. Main focus attractions by time and space in Tarragona

this tool we can show a map with the difference that should be studied. In closed or semi-enclosed
in intensity of visits in Tarragona according to spaces, a possible alternative would be to estab-
time concentration, left side image, and space lish a system of digital monitoring of the space
concentration, right side image (Figure 7). and link it to specific qualitative data produced
In any case, data analysis ability of Track-tour indirectly through surveys. There are a great deal
has room for improvement both at specific level, of technologies and standards to develop and
modifying the PHP language, which generates test, which could fit the terms described in our
the output, and even by changing the tools. The technological background section. If GPS devices
consolidation of this comprehensive tool as a will continue to be used, and in the line of future
tourist management standard may make it pos- TT-linked developments, data cleansing is another
sible to target development according to users’ key area for improvement, especially as far as
specific needs; in this case, fewer tools may mean error-measuring devices and spatial adjustment
greater functionality. In this regard, data display are concerned, including a more finely honed
can be both formally and technically improved. definition of the method used for this adjustment.
In fact, there is the issue of improving the The capture of spatial data depends on the
methodology designed as a result of the combined TT, but what data should be linked to these posi-
development of the TT and the display technolo- tions? The qualitative information linked to the
gies associated with the data, which is still open routes is another element to be validated. Socio-
to testing, and the inclusion of new technologies demographic surveys tend to be very extensive,
that appear on the market. Likewise, one task that although only a few variables are used for spatial
remains to be addressed is to explore the different analysis. It would be advisable to determine, in
options for the analysis of the information further collaboration with the stakeholders involved,
with a view to achieving greater adaptation to the which are the key aspects for spatio-temporal
needs of managers of tourist destinations. analysis and socio-demographic segmentation.
As for the future involving TT in this devel- A new sampling campaign would be needed
opment, there are several potential lines of de- to consolidate the tools developed and to validate
velopment, although they converge in two main the functionalities defined and applied. In this
possibilities. This study chose a development way, it would be possible to certify the automa-
linked to GPS devices, but there are other options tion of the cleansing tool. Another interesting

1114
Analysis of Tourist Behavior Based on Tracking Data Collected by GPS

possibility would be to replicate this experiment tourist destinations can be addressed in many ways,
in a different geographic environment to evaluate one of which is the proposal described herein.
the system’s adaptability and its flexibility with In this economic activity, support to ICT may
regard to territorial characteristics. Another field be expected to be visible, real and useful on differ-
of future development stems from the possibility ent scales, in a short period of time and in all its
of automating the data collection environment yet facets, such as promotion, destination management
further. The idea is that this process should enable and the planning of tourist activities. The use of
two tasks: (a) that the operators that prepare the tools and methods for automating, facilitating, and
surveys can do so via a user-friendly editor and that harmonizing processes is almost an imperative.
the data from these studies can be copied directly The virtually infinite development possibilities
to the database, and (b) that the spatial data can in tourism linked to information and communi-
be validated by relating them to the thematic data cation technologies are being demonstrated with
from the surveys. each passing day, but they call for comprehensive
Finally, in another sense we can assess how development, implementation, and management
new mobile information technologies and devices between the different authors and the different
such as smart phones among others, with position- technicians.
ing and localization services can be adapted for This chapter demonstrates how the techno-
tourism research. Another specific example of this logical tracking of visitors in tourist spaces can
could be runkeeper (http://runkeeper.com/). This be organized and implemented as a fundamental
product works as a sports community; to share, tool to improve destination management. At a
track and measure fitness. This approach could conceptual level, it has enabled us to move from
satisfy both consumers’ needs and expectations an unknown element, namely visitors’ routes, to
and provide information for research. In fact, from one that is known, i.e. the spatio-temporal behav-
them we can glean much important knowledge ior of tourists filtered by means of meaningful
about tourist and personal space mobility. socio-economic variables. In this regard, the tool
presented may become an important application
that helps to take decisions in tourist areas since
CONCLUSION it makes for easy adaptation of the fieldwork to
the specific characteristics of each destination and
Tourism is a structural economic activity in many for a flexible adjustment of the different system
places and it is expected to continue to be so. Some processes.
tourist destinations, as economic products, have The methodology defined, and the develop-
reached a period of maturity, which calls for a ment and implementation of the system, are rated
rethink in the way that they work, are maintained as positive, surpassing the expected results. The
and are sold. The major challenge faced by these assessment of the technologies and the devices
destinations is to hold their position in the market was suitable, as was the process related to the
and, if possible, expand it in what is an increasingly fieldwork and the development of the computing
competitive environment. In this regard, this topic application.
heralds an incomparable opportunity for work, In general, the conclusion of this chapter is
research, and development, where information that ICT generate many and good opportunities
and communication technologies should play a for all of the economic activities linked to tour-
key role. The development of ICT and associated ism, and that they should continue to do so in
methodologies to improve the management of order to consolidate research in the development
framework.

1115
Analysis of Tourist Behavior Based on Tracking Data Collected by GPS

ACKNOWLEDGMENT Russo, A. P. (2002). The vicious circle of tourism


development in heritage cities. Annals of Tourism
This project was funded by the FEDER European Research, 29(1), 165–182. doi:10.1016/S0160-
Regional Funds and Tarragona Provincial Council. 7383(01)00029-9
Russo, A. P., Anton Clavé, S., & Shoval, N. (2010).
Advanced visitor tracking analysis in practice:
REFERENCES
Explorations in the Port Aventura theme park and
Alzua, A., Aramburo, I., Gerrikagoitia, J. K., insights for a future research agenda. In Gretzel,
Peralta, M., & Espinosa, N. (2010). eGIStour: U., Law, R., & Fuchs, M. (Eds.), Information and
Sistema de medición de flujo de visitantes. TU- Communication Technologies in Tourism 2010
RITEC, 289-300. (pp. 159–170). Vienna, Austria: Springer-Verlag.
doi:10.1007/978-3-211-99407-8_14
Dietvorst, A. G. J. (1995). Tourist behaviour and
the importance of time-space analysis. In Ash- Shaw, G., Agarwal, S., & Bull, P. (2000). Tour-
worth, G. J., & Dietvorst, A. G. J. (Eds.), Tourism ism consumption and tourist behaviour: A British
and Spatial Transformations. Wallingford, CT: perspective. Tourism Geographies, 2(3), 264–289.
CAB International. doi:10.1080/14616680050082526

Drummond, J., João, E., & Billen, R. (2006). Shoval, N. (2008). Tracking technologies and
Current and future trends in dynamic and mobile urban analysis. Cities (London, England), 25(1),
GIS. In Drummond, J., Billen, R., João, E., & 21–28. doi:10.1016/j.cities.2007.07.005
Forrest, D. (Eds.), Dynamic and Mobile GIS: Shoval, N., & Isaacson, M. (2006). The appli-
Investigating Changes in Space and Time (pp. cation of tracking technologies to the study of
289–300). London, UK: Taylor and Francis. pedestrian spatial behavior. The Professional
doi:10.1201/9781420008609.pt1 Geographer, 58(2), 172–183. doi:10.1111/j.1467-
Galí Espelt, N., & Donaire Benito, J. A. (2006). 9272.2006.00524.x
Visitors’ behavior in heritage cities: The case Shoval, N., & Isaacson, M. (2007). Tracking
of Girona. Journal of Travel Research, 44(4), tourist in the digital age. Annals of Tourism
442–448. doi:10.1177/0047287505282956 Research, 34(2), 141–159. doi:10.1016/j.an-
Hartmann, R. (1988). Combining field methods nals.2006.07.007
in tourism research. Annals of Tourism Research, Shoval, N., & Isaacson, M. (2010). Tourist mobil-
15, 88–105. doi:10.1016/0160-7383(88)90073-4 ity and advanced tracking technologies. London,
Meng, L., Zipf, A., & Reichenbacher, T. (Eds.). UK: Routledge.
(2004). Map-based mobile services: Theories, Thornton, P. R., Williams, A. M., & Shaw, G.
methods and implementations. Vienna, Austria: (1997). Revisiting time-space diaries: An explor-
Springer-Verlag. atory case study of tourist behavior in Cornwall,
O’Connor, A., Zerger, A., & Itami, B. (2005). England. Environment & Planning A, 29(10),
Geo-temporal tracking and analysis of tourist 1847–1867. doi:10.1068/a291847
movement. Mathematics and Computers in Urry, J. (2008). Preface. In Burns, P., & Novelli,
Simulation, 69(1-2), 135–150. doi:10.1016/j. M. (Eds.), Tourism and Mobilities: Local-Global
matcom.2005.02.036 Connections (pp. xiv–xv). Wallingford, CT: CABI.

1116
Analysis of Tourist Behavior Based on Tracking Data Collected by GPS

Van der Spek, S. (2008). Spatial metro: Track- Connell, J., & Page, S. J. (2008). Exploring the
ing pedestrians in historic city centres. In van spatial patterns of car-based tourist travel in Loch
Schaick, J. (Ed.), Urbanism on Track: Applica- Lomond and Trossachs national park, Scotland.
tion of Tracking Technologies in Urbanism (pp. Tourism Management, 29, 561–580. doi:10.1016/j.
79–102). Amsterdam, The Netherlands: IOS Press. tourman.2007.03.019
Van Schaick, J. (Ed.). (2008). Urbanism on track: Donaire Benito, J. A., & Galí Espelt, N. (2008).
Application of tracking technologies in urbanism. Modeling tourist itineraries in heritages cities:
Amsterdam, The Netherlands: IOS Press. Routes around the old district of Girona. Pasos.
Revista de Turismo y Patrimonio Cultural, 5(3),
435–449.
ADDITIONAL READING
Donggen, W., & Tao, C. (2001). A spatio-temporal
Ahas, R., Aasa, A., Mark, Ü., Pae, T., & Kull, data model for activity-based transport demand
A. (2007). Seasonal tourism spaces in Estonia: modelling. International Journal of Geo-
Case study with mobile positioning data. Tour- graphical Information Science, 15(6), 561–585.
ism Management, 28(3), 898–910. doi:10.1016/j. doi:10.1080/13658810110046934
tourman.2006.05.010 Fennell, D. A. (1996). A tourist space-time bud-
Ahas, R., Aasa, A., Roose, A., Mark, Ü., & Silm, get in the Shetland Islands. Annals of Tourism
S. (2008). Evaluating passive mobile position- Research, 23(4), 811–829. doi:10.1016/0160-
ing data for tourism surveys: An Estonian case 7383(96)00008-4
study. Tourism Management, 29(3), 469–486. Frihida, A., Marceau, D. J., & Thériault, M.
doi:10.1016/j.tourman.2007.05.014 (2004). Development of a temporal extension
Asakura, Y., & Iryo, T. (2007). Analysis of tourist to query travel behavior time paths using an
behavior based on the tracking data collected using object-oriented GIS. GeoInformatica, 8(3), 211.
a mobile communication instrument. Transporta- doi:10.1023/B:GEIN.0000034819.57376.92
tion Research Part A, Policy and Practice, 41(7), Itami, R. M. (2003). RBSim3: Agent-based simu-
684–690. doi:10.1016/j.tra.2006.07.003 lations of human behavior in GIS environments
Chardonnel, S., & Van Der Knaap, W. (2004). using hierarchical spatial reasoning. In Proceed-
Managing tourist time-space movements in recre- ings of the ModSim 2003 Conference. ModSim.
ational areas: A comparative study of a protected Keul, A., & Kühberger, A. (1997). Tracking
natural park in the French Alps and the “De Hoge the Salzburg tourist. Annals of Tourism Re-
Veluwe” Dutch national park using the same search, 24(4), 1008–1012. doi:10.1016/S0160-
methodology. Revue de Geographie Alpine, 90(1), 7383(97)00038-8
37–48. doi:10.3406/rga.2002.3071
Kwan, M., & Casas, I. (2006). Gabriel: GIS
Chhetri, P., Arrowsmith, C., & Jackson, M. (2004). activity-based travel simulator: Activity sched-
Determining hiking experiences in nature-based uling in the presence of real-time information.
tourist destinations. Tourism Management, 2(1), GeoInformatica, 10(4), 469. doi:10.1007/s10707-
31–43. doi:10.1016/S0261-5177(03)00057-8 006-0343-7

1117
Analysis of Tourist Behavior Based on Tracking Data Collected by GPS

Lam, T., & Hsu, C. H. C. (2006). Predicting Mukai, N., Honma, H., & Kosugi, M. (2007). Gen-
behavioral intention of choosing a travel desti- eration of dynamic moving path for autonomous
nation. Tourism Management, 27(4), 589–599. human model. In Proceedings of MODSIM07:
doi:10.1016/j.tourman.2005.02.003 Land, Water and Environmental Management:
Integrated Systems for Sustainability. MODSIM.
Lau, G., & Mckercher, B. (2007). Understanding
tourist movement patterns in a destination: A GIS Oppermann, M. (1997). First-time and repeat visi-
approach. Tourism and Hospitality Research, tors to New Zealand. Tourism Management, 18(3),
7(1), 39–49. doi:10.1057/palgrave.thr.6050027 177–181. doi:10.1016/S0261-5177(96)00119-7
Lew, A., & Mckercher, B. (2002). Trip destina- Van Der Knaap, W. G. M. (1999). Research re-
tions, gateways and itineraries: The example port: GIS-oriented analysis of tourist time-space
of Hong Kong. Tourism Management, 23(6), patterns to support sustainable tourism devel-
609–621. doi:10.1016/S0261-5177(02)00026-2 opment. Tourism Geographies, 1(1), 56–69.
doi:10.1080/14616689908721294
Lew, A., & Mckercher, B. (2006). Modeling
tourist movements: A local destination analysis. Xia, J., & Arrowsmith, C. (2005). Managing scale
Annals of Tourism Research, 33(2), 403–423. issues in tourist spatio-temporal movement model-
doi:10.1016/j.annals.2005.12.002 ling. In A. Zerger & R. M. Argent (Eds.), Proceed-
ings of the Modsim 2005 International Congress
Loiterton, D., & Bishop, I. (2005). Virtual en-
on Modelling and Simulation. MODSIM.
vironments and location-based questioning for
understanding visitor movement in urban parks Xia, J., Arrowsmith, C., Jackson, M., & Cart-
and gardens. In Proceedings of Trends in Real- wright, W. (2008). The wayfinding process rela-
Time Landscape Visualization and Participation tionships between decision-making and landmark
(Conference on Information and Technologies in utility. Tourism Management, 29(3), 445–457.
Landscape Architecture). Springer. doi:10.1016/j.tourman.2007.05.010
Mcwilliam, S, E. G., & Crompton, J. L. (1997).
An expanded framework for measuring the ef-
KEY TERMS AND DEFINITIONS
fectiveness of destination advertising. Tourism
Management, 18(3), 127. doi:10.1016/S0261- AVITA: Is the methodological body based on
5177(96)00114-8 the joint use of this new generation of tracking
Modsching, M., Kramer, R., Gretzel, U., & Ten technologies (TT) for the collection of spatial data
Hagen, K. (2006). Capturing the beaten paths: on visitors, possibly to be integrated by surveying
A novel method for analysing tourists’ spatial activities, and the use of GIS for the elaboration
behavior at an urban destination. In Proceedings and representation of visitor spatial behaviour.
of the International Conference on Information Budget-Time Analysis: Is the systematic re-
and Communication Technologies in Tourism cord of a person’s use of time over a given period.
2006. Springer. It describes the sequence, timing and duration of
the person’s activity.
Montanari, A., & Muscarà, C. (1995). Evaluating Global Navigation Satellite Systems:
tourist flows in historic cities: The case of Venice. (GNSS): Refers to different positioning techniques
Tijdschrift voor Economische en Sociale Geografie, using satellites, which may operate in isolation
86(1), 80–87. doi:10.1111/j.1467-9663.1995. or in combination.
tb01831.x

1118
Analysis of Tourist Behavior Based on Tracking Data Collected by GPS

GPS: Is a planet-wide navigation system depends on the receiver, antennas, the reflecting
that can determine position and time. Accuracy material.
depends on satellite view and the use of augmenta- Spatial Analysis: Is the process of extracting or
tion systems that vary from meters to centimeters. creating new information about a set of geographic
Multipath Error: Is a general error in GPS features; it is useful for evaluating suitability and
signal receivers and appears when the receiver capability, for estimating and predicting, and for
detects parts of the same signal at two different interpreting and understanding spatial behavior
times. The first is received directly from a satel- of any feature.
lite and the other part is a reflection of this signal Tracking Technologies: Group of wireless
because it has bounced off a surface (for example technologies that allow collecting any kind of
high buildings or the same surface). The error spatial data, to deliver location-based service, such
as GPS, GIS, WLAN, RFID and others.

This work was previously published in Discovery of Geospatial Resources: Methodologies, Technologies, and Emergent Ap-
plications, edited by Laura Díaz, Carlos Granell and Joaquín Huerta, pp. 241-260, copyright 2012 by Information Science
Reference (an imprint of IGI Global).

1119
1120

Chapter 67
The RiskCity Training
Package on Multi-Hazard
Risk Assessment
Cees van Westen
University of Twente, The Netherlands

Tsehaie Woldai
University of Twente, The Netherlands

ABSTRACT
As part of the capacity-building activities of the United Nations University – ITC School on Disaster Geo-
Information Management (UNU-ITC DGIM) the International Institute for Geo-information Science and
Earth Observation (ITC) has developed a training package on the application of GIS for multi-hazard risk
assessment. The package, called RiskCity comprises a complete suite of exercise descriptions, together
with GIS data and presentation materials on the various steps required to collect and analyze relevant
spatial data for hazard, vulnerability and risk assessment in an urban environment. The package has
been developed in collaboration with several partner organizations on different continents, and is used
as the basis for a series of courses. Currently it is available as a distance education course.

INTRODUCTION 2009). The possible impacts of hazardous events


are large, especially in developing countries and
The world is confronted with a rapidly growing governments have to incorporate risk reduction
impact of disasters, due to many factors that cause strategies in development planning at different
an increase in the vulnerability of society combined levels (UN-ISDR, 2004). The evaluation of the
with an increase in (hydrometeorological) hazard expected losses due to hazardous events requires
events related to climate change (Blaikie et al., a spatial analysis, as all components of a risk as-
1996; IPCC, 2007; EM-DAT, 2009; Munich Re, sessment differ in space and time (Van Westen,
2010). Therefore risk assessment can only be car-
DOI: 10.4018/978-1-4666-2038-4.ch067 ried out effectively when it is based on extensive,

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
The RiskCity Training Package on Multi-Hazard Risk Assessment

multidisciplinary studies on the basis of spatial terial related to multi-hazard risk assessment is
information, derived from Remote Sensing and concerned, the HAZUS methodology developed
other sources. There is an urgent need to include the in the US can be considered the standard. This
concepts of disaster geo-information management comprehensive loss estimation software which
into emergency preparedness planning, spatial runs under ArcGIS is a very good tool for carrying
planning and environmental impact assessment. out loss estimations for earthquakes, flooding and
This requires capacity building and training of di- windstorms (FEMA, 2010b), but is restricted to
saster management experts and professionals, such use in the USA, due to constraints in the data and
as planners, engineers, architects, geographers, the classifications used for elements at risk and
environmental specialists, university teachers fragility curves. The manuals of Hazus, however,
etc. (Van Westen, 2010). The Hyogo framework provide a very good overview of the entire process
of action 2005-2015 of the UN-ISDR indicates of multi-hazard risk assessment. Courses on the
risk assessment and education as two of the key use of HAZUS can be followed online from the
areas for the development of action in the coming ESRI Virtual Campus (ESRI, 2010). Another very
years (UN-ISDR, 2005). promising initiative is the CAPRA project (World
Worldwide, a number of organizations are Bank, 2010). The Central American Probabilistic
specialized in providing short training courses on Risk Assessment (CAPRA) initiative aims to
disaster risk management-related issues (ADPC, strengthen the regional capacity for assessing,
2005). Some organizations have also prepared understanding and communicating disaster risk in
training materials that are accessible through the Central America. The project has developed tools
internet; for example the Disaster Management for multi-hazard probabilistic risk assessment,
Training Programme (DMTP), or the International using Open Source software tools (World Bank,
Federation of Red Cross and Red Crescent Societ- 2010). However, complete GIS based training
ies (IFRC). Most of these however concentrate on packages on spatial hazard and risk assessment
community-based methods (Peters Guarin, 2008). using low-cost or free GIS software are still very
Disaster risk management courses at BSc or MSc scarce, to the knowledge of the authors. One ex-
level are now available in many Universities on ample is a training package in English and Spanish
all continents developed for Central America in the framework
Relatively few training materials are avail- of the UNESCO RAPCA project (ITC, 2004).
able on multi-hazard risk assessment. A good This paper describes the main aspects of a
textbook on the subject is made by Smith and GIS-based training package on multi-hazard risk
Petley (2009). Online training materials can be assessment, which has been developed by the
obtained for example from the websites of the US United Nation University (UNU)– ITC School
Federal Emergency Management Agency (FEMA, for Disaster Geoinformation Management. The
2010a) and Australian Emergency Management International Institute of Geo-Information Science
Agency (EMA, 2002). There is a very useful guide and Earth Observation (ITC), has recently been
prepared by FEMA called “Understanding your integrated in the University of Twente after a period
risks” that guides readers through the various of 60 years as an institute for postgraduate train-
phases of a risk assessment (FEMA, 2010c). The ing and research in the field of geo-information
development of innovative forms of learning and directed to capacity building and institutional
teaching oriented towards building new curricula development of professional and academic or-
in the field of natural risk has attracted attention ganizations from developing countries. In 2005,
in European initiatives such as DEBRIS (2006) ITC and the United Nations University established
and NAHRIS (2006). As far as GIS-related ma- a collaborative programme on the use of spatial

1121
The RiskCity Training Package on Multi-Hazard Risk Assessment

information for disaster management, which re- urban area, because the elements at risk have a
sulted in the formation of the UNU-ITC School for much higher density, the study areas are generally
Disaster Geo-Information Management. The main smaller and the scale of analysis larger as compared
activities of the DGIM School focus on training, to a rural setting. This allows us to demonstrate
education, curriculum development, knowledge which tools can be used for generating hazard as
development and research collaboration. This well as elements at risk databases, even in data
is done through the establishment of Univer- poor environments, and to show how qualitative
sity networks in Asia, Africa and Latin America, and quantitative techniques for risk assessment can
where the member Universities exchange spatial be used, and in which situation. Also the combined
information, course materials and jointly carry out effect of different hazard on the overall risk can
training and research projects. The DGIM School be better demonstrated, as well as the effect of
develops training packages and courses that are risk reduction measures.
given jointly with the partners of the networks One important consideration in designing the
in various countries. The materials are uniform, exercises is that people from developing countries
and have been developed in different languages, should not be restricted in using it due to financial
and the support is given by local University staff burdens for software acquisition. Therefore the
who have followed earlier training and by staff aim was to use Open Source software as a basis.
from the UNU-ITC DGIM School. One of these The exercises are written for the Integrated Land
courses is on Multi-hazard risk assessment, which and Water Information System (ILWIS, 2010). It
is centered around a case study on the use of is a Geographic Information System (GIS) with
Geographic Information Systems, and Remote integrated image processing capabilities. It also
Sensing for the assessment of hazard, vulnerability has its own attribute data analysis, spatial data
and risk in a typical urban area representative of entry and conversion modules. New modules have
situations in many developing countries. Rapid been recently added for Spatial Multi Criteria
urbanization, combined with a lack of planning, Evaluation, analysis of Digital Elevation Models
often leads to the spreading of squatter areas and for digital stereo image interpretation (ILWIS,
located in hazardous areas, such as steep slopes, 2010). The strongest point of the software is the
flood prone areas etc. map calculation module that allows extensive
modelling with raster maps, also using scripts.
ILWIS, is very user-friendly software, and allows
RISKCITY TRAINING PACKAGE the participants to concentrate on the risk assess-
ment application rather than on the specifics of
The package, called RiskCity, comprises a com- the software.
plete suite of exercise descriptions, together with Each of the exercises in RiskCity has its own
GIS data and presentation materials on the vari- dataset. The data are all provided in separate
ous steps required to collect and analyze relevant directories, including the results of the previous
spatial data for hazard, vulnerability and risk exercises that are needed to make a subsequent
assessment in an urban environment. Figure 1 one. So it is possible to carry out each of the
gives a schematic overview of the various steps exercises separately. Result files and PowerPoint
involved in the process. The RiskCity training presentations with instructions are also included.
package focuses on demonstrating the procedures The overall structure of the RiskCity training
of risk assessment for natural and human-induced package is given in Figure 2, and an overview
hazardous phenomena in an urban environment of the various components is presented in Table
within a developing country. We have selected an 1. Four different types of hazards are evaluated:

1122
The RiskCity Training Package on Multi-Hazard Risk Assessment

Figure 1. Schematic representation of multi-hazard risk assessment

landslides, floods, earthquakes and technological unit and land use type, and to characterize the
hazards. The training package starts with introduc- buildings, for instance by calculating their height
tory exercises dealing with the software and with and floorspace using Lidar data. The floorspace
the study area, where students learn the various is used then to distribute the census population
hazard problems by evaluating high resolution over the mapping units, and population estimates
images. An important component of RiskCity is are made for day- and night-time scenarios. The
the generation of a database of elements at risk in elements at risk database contains information on
order to evaluate the vulnerability of buildings and the buildings, with important attributes such as
population. Here two options are considered. The urban land use type, construction type, floorspace
first is that there is no spatial data available, except and height, as well as on population for a daytime
for a high resolution image, and the students have and night-time scenario.
to generate mapping units with homogeneous types There are a wide range of hazard assessment
of buildings by stereo interpretation and digitiz- exercises, not only those dealing with the four
ing on the image. Sampling is then carried out to types of hazard mentioned before, which are based
define the number of buildings and population on the data of RiskCity, but also those which use
per mapping unit. The second option for generat- data from other areas for tsunami, cyclone, vol-
ing the elements at risk database assumes there canic, forest fires and land degradation hazard
are digital data available in the form of building assessment. They use a variety of approaches,
footprint maps, census information and detailed such as inventory-based, heuristic, statistical and
elevation data from a Lidar survey. These are used deterministic. The vulnerability assessment in-
to calculate the number of buildings per mapping cludes exercises on the use of vulnerability curves

1123
The RiskCity Training Package on Multi-Hazard Risk Assessment

Figure 2. General structure of the RiskCity training package

for assessing physical vulnerability, as well as the and the more conceptual equation:
use of expert-based Spatial Multi Criteria Evalu-
ation for the evaluation of social vulnerability and Risk = Hazard * Vulnerability / Capacity
capacity. (2)

In the RiskCity training package both equations


RISK CONCEPTS are used. Equation [2] is only conceptual, but al-
lows incorporating the multi-dimensional aspects
The Risk Assessment approach adopted in of vulnerability, and capacity, which are often
RiskCity is based on the definition of risk from integrated with hazard indicators using Spatial
UN-ISDR (2009) “The potential disaster losses, Multi-Criteria Evaluation. Equation [1], given
in lives, health status, livelihoods, assets and above, is not only a conceptual one, but can also
services, which could occur to a particular com- be actually calculated with spatial data in a GIS
munity or a society over some specified future time to quantify risk, with a focus on (direct) physical,
period”. In other words risk can be described as population and economic losses. As illustrated
“the probability of losses”. Risk can be presented in Figure 3 there are two important components,
conceptually with the following basic equation which also should be spatially represented: hazards
(see also Table 1 and Figure 1): and elements at risk. They are characterized by
both spatial and non-spatial attributes. Session 2 of
Risk = Hazard * Vulnerability * Amount of ele- the RiskCity training package explains the spatial
ments-at-risk (1) data requirements for hazard and elements-at-risk

1124
The RiskCity Training Package on Multi-Hazard Risk Assessment

Table 1. Overview of the exercise structure of RiskCity

Session Theory RiskCity Exercise


1. Introduction to the course, objectives, Introduction to ILWIS and the RiskCity dataset
Introduction to risk structure, and set-up; Explanation of the
assessment available materials
2. Presentation of data requirements for Obtaining free data from the Internet;
Spatial data the various types of hazards; Sources of Generating a DEM using SRTM data;
for risk assessment spatial data Download and georeferencing Google Earth images;
Creating and interpreting multi-temporal images
3. Hazard types; Main concepts of hazard Frequency assessment;
Hazard assessment assessment; Frequency magnitude – Flood modeling 2D flood propagation model, Philippines;
relationships; Methods for use of GIS in Flood hazard monitoring with SPOT imagery, Bangladesh;
hazard assessment Statistical landslide hazard assessment, Colombia;
Deterministic landslide hazard assessment, India;
Modeling pyroclastic flow deposits, Philippines;
Earthquake hazard assessment: Kathmandu, Nepal;
Hazard analysis of cyclone flooding in Bangladesh;
Monitoring coastal erosion and sedimentation, China;
Modeling land subsidence and sea level rise, Indonesia
4. Types of elements at risk; Generating a database of elements at risk from scratch;
Elements at risk Classification of buildings, infrastruc- Generating a database of elements at risk using existing data; Partici-
ture, lifelines, critical facilities; Popula- patory GIS for risk assessment
tion information;
Collection of elements at risk informa-
tion
5. Types; social vulnerability; physical Generating vulnerability curves;
Vulnerability as- vulnerability; Methods for vulnerability Spatial Multi-Criteria Evaluation for vulnerability and qualitative risk
sessment assessment; Participatory GIS; Spatial assessment
Multi Criteria Evaluation
6. Loss estimation models; HAZUS; quali- Flood risk assessment;
Risk tative risk assessment; QRA; Basics of Landslide risk assessment;
assessment flood risk, seismic risk, landslide and Seismic risk assessment;
technological risk assessment Technological risk assessment;
Multi-hazard risk assessment
7. Risk evaluation; risk governance; risk Risk information for emergency preparedness & Response; Simula-
Risk management communication; cost benefit analysis; tion exercise; Analysis of costs & benefits of risk reduction scenarios;
Using risk information for emergency Using risk information in environmental assessments
planning; spatial planning, and environ-
mental impact assessment
8. Each participant makes it own final Select a topic from a list, analyze a problem and generate a report and
Final project project and does a small research presentation

data, and how available data from the internet For earthquakes the terms magnitude and inten-
can be used. sity do have a different meaning, with magnitude
Hazards are characterized by their temporal expressing the energy level of the earthquake (on
probability and magnitude or intensity derived the Richter scale) and intensity expressing the
from frequency magnitude analysis (this is local effects of the earthquake, that vary over a
treated in Session 3 of the training package). In distance, becoming less further from the epicen-
this respect magnitude and intensity can be con- ter (and expressed in qualitative classes such as
sidered as synonymous terms that express the the Modified Mercalli Intensity). The hazard
severity of the hazard. For instance flood depth, component in equation [1] actually refers to the
flow velocity, and duration in the case of flooding. probability of occurrence of a hazardous phenom-

1125
The RiskCity Training Package on Multi-Hazard Risk Assessment

Figure 3. Basic function of risk, which can be divided into the components of hazard, the vulnerability,
and the amount of elements at risk that are exposed to the hazard

enon with a given intensity within a specified ent types of elements at risk (which are treated in
period of time (e.g., annual probability). Hazards session 4 of the training package) and they can
also have an important spatial component, both be classified in various ways. The way in which
related to the initiation of the hazard (e.g., a vol- the amount of elements-at-risk are characterized
cano) and the spreading of the hazardous phe- (e.g., as number of buildings, number of people,
nomena (e.g., the areas affected by volcanic economic value or the area of qualitative classes
products such as lava flows). Session 3 gives an of importance) also defines the way in which the
overview of the approaches that can be used for risk is presented. The interaction of elements at
the analysis of the temporal and spatial components risk and hazard defines the exposure and the
of hazards. Table 2 gives a summary of definitions vulnerability of the elements-at-risk. Exposure
used in the GIS-based risk assessment. refers to people, property, systems, or other ele-
Elements at risk are the population, properties, ments present in hazard zones that are thereby
economic activities, including public services, or subject to potential losses. The spatial interaction
any other defined values exposed to hazards in a between elements-at-risk and hazard footprints,
given area. They are also referred to as “assets”. which is often referred to as “exposure” in other
Elements at risk also have spatial and non-spatial risk formulas, is an integral component of GIS-
characteristics. First of all there are many differ- based risk assessment, and therefore the term

1126
The RiskCity Training Package on Multi-Hazard Risk Assessment

Table 2. Summary of definitions used in the GIS-based risk assessment

Term Definition
Natural hazard A potentially damaging physical event, phenomenon or human activity that may cause the loss of life or injury,
property damage, social and economic disruption or environmental degradation. This event has a probability of
occurrence within a specified period of time and within a given area, and has a given intensity.
Elements-at-risk Population, properties, economic activities, including public services, or any other defined values exposed to
hazards in a given area”. Also referred to as “assets”. The amount of elements at risk can be quantified either in
numbers (of buildings, people etc), in monetary value (replacement costs, market costs etc), area or perception
(importance of elements-at-risk).
Exposure Exposure indicates the degree to which the elements at risk are exposed to a particular hazard. The spatial inter-
action between the elements at risk and the hazard footprints are depicted in a GIS by simple map overlaying of
the hazard map with the elements at risk map.
Vulnerability The conditions determined by physical, social, economic and environmental factors or processes, which increase
the susceptibility of a community to the impact of hazards. Can be subdivided in physical, social, economical,
and environmental vulnerability.
Capacity The positive managerial capabilities of individuals, households and communities to confront the threat of disas-
ters (e.g., through awareness raising, early warning and preparedness planning).
Consequence The expected losses in a given area as a result of a given hazard scenario.
Risk The probability of harmful consequences, or expected losses (deaths, injuries, property, livelihoods, economic
activity disrupted or environment damaged) resulting from interactions between (natural, human-induced or man-
made) hazards and vulnerable conditions in a given area and time period.

exposure is not used as such in the risk equation. The risk curves form the basis of subsequent
When we calculate the risk equation using a cost-benefit analysis, in which for each hazard
Geographic Information System (GIS) the ele- type, a number of risk reduction measures is
ments at risk that are exposed to the hazards are evaluated. The investments to implement certain
automatically obtained using map overlaying measures (e.g., relocation of houses, flood con-
techniques trol) are estimated and compared to the reduc-
Vulnerability refers to the conditions deter- tion in annual losses that would result if they are
mined by physical, social, economic and envi- implemented. Based on this, the most appropriate
ronmental factors or processes, which increase methods for risk reduction are selected. The last
the susceptibility of a community to the impact part of the RiskCity exercises deal with a final
of hazards. Vulnerability can be subdivided in project in which the participants are given a par-
physical, social, economical, and environmental ticular problem they have to solve with the risk
vulnerability. The vulnerability of communities information obtained earlier.
and households can be based on a number of cri-
teria, such as age, gender, source of income etc.
which are analyzed using equation [2]. However, ABOUT THE STUDY AREA
according to equation [1] vulnerability is evalu-
ated as the interaction between the intensity of the The exercises deal with a hypothetical case study,
hazard and the type of element-at-risk, making use which is originally based on an actual study area:
of so-called vulnerability curves. The concept of Tegucigalpa in Honduras. Tegucigalpa suffered
vulnerability and the generation of vulnerability severe damage from landslides and flooding dur-
curves are treated in session 5 of the training pack- ing Hurricane Mitch in October 1998 when the
age, including the use of participatory methods city received 281 mm of rain in 3 days (Mastin &
for community-based risk assessment. Olsen, 2002). Due to river flooding, an old land-

1127
The RiskCity Training Package on Multi-Hazard Risk Assessment

slide was reactivated and an entire neighborhood to reach the learning objectives, modifications
on top of it was destroyed. The landslide caused and additions were made to the original data. It
the damming of the river and resulted in severe is very difficult to have a dataset for a particular
flooding in large parts of the city centre for several area where all aspects of multi-hazard risk assess-
weeks (Harp et al., 2002). These events are easily ment can be properly demonstrated, either because
identifiable on the high resolution image which particular hazard types do not happen in the city
serves as the basis for the exercises (Figure 4). or because particular data sets are incomplete,
After Hurricane Mitch, USGS and JICA carried restricted or erroneous. The time required to
out extensive work in Honduras and produced prepare a suitable dataset and related exercises is
extensive datasets. very large. Therefore we have decided to make a
Only part of the exercises is based on the ac- generic case study that is representative for many
tual situation in Tegucigalpa. In order to be able cities in developing countries. When the training

Figure 4. High-resolution image of the center of RiskCity with some of the hazard and vulnerability
features indicated

1128
The RiskCity Training Package on Multi-Hazard Risk Assessment

package is used in local course, often also local actually doing GIS analysis. They will not use
data sets are used to illustrate part of the procedure. ILWIS, but will use the WebGIS version. The
In the case of RiskCity several types of remote WebGIS exercises will take much less time than
sensing data were used. Aerial photographs for the GIS version, and have separate exercise de-
several periods, including the period of the major scriptions. The WebGIS version can be accessed
disaster event in 1998, and two sets of satellite at: http://geoserver.itc.nl:8181/cartoweb3/htdocs/
data from 2001 and 2006 were the basis for land- RISKCITY.php
slide mapping. High resolution satellite data was The RiskCity training package is constantly
used for mapping elements-at-risk, and medium being updated and further improved. The plan is
resolution Aster data for generating a land use to incorporate more Participatory GIS approaches
map of the area. Elements-at-risk data can be ob- in the training package, as well as to include
tained at different levels of detail. In the RiskCity more examples of the use of risk information in
case study this is done at the urban level, where environmental assessment and spatial planning.
information needs to be as detailed as possible, There is also a plan to make a separate version
preferably at the individual building level, or at a focusing on risk occurring in rural areas, taking
slightly more aggregated level of mapping units into account flooding, forest fires, drought, and
or building blocks with homogenous land use and land degradation as the main types of hazards.
building type. Figure 4 gives an example of the The training package is used regularly in courses.
high resolution image used with some information The duration of this course can vary between 2
on the areas affected in 1998. days and 3 months. The standard duration is 6
weeks (distance education version) or 3-4 weeks
(fulltime course), depending on the option for
CONCLUSION doing the final project in session 8. The course
materials are available in English, with major parts
The course is designed in such a way that even also translated in Spanish, Vietnamese, Chinese,
non-GIS specialists can follow the course, since and Georgian languages. The RiskCity training
the instructions are describing the steps in a cook- package is given annually as a distance education
book manner, at least in the initial phases. In the course on Multi-hazard risk assessment. Courses
later part of the course, when participants are on multi-hazard risk assessment are offered an-
more used to the ILWIS software the exercises nually in the Netherlands, Mexico, Bolivia, and
becoming more demanding, and participants have Thailand, and frequently also in India and China.
to design the steps to solve problems themselves. We are currently exploring the option to organize
The course is also designed in such a way that an annual course on multi-hazard risk assessment
participants can also follow it without hands-on with partner organizations from Africa. Informa-
exercises with the ILWIS GIS for the exercises. tion can be found on http://www.itc.nl/Course-
This may be the case for courses at decision Finder (Select the option disaster management
makers’ level, when the participants are not that for an overview of all courses).
much interested in the particular steps to follow
in a risk assessment, but want to know more about
the overall procedure and what can be done with ACKNOWLEDGMENT
the (intermediate) results. Therefore we have
made the course also in a WebGIS version. The We would like to thank Gonzalo Funes from
WebGIS version allows participants to evaluate Honduras for providing the initial data sets. The
the individual steps of the methodology without Digital Surface Model and flood information

1129
The RiskCity Training Package on Multi-Hazard Risk Assessment

was obtained from a study by the United States EMA. (2002). Disaster loss assessment guidelines.
Geological Survey. The high resolution image Retrieved from http://oneresponse.info/resources/
was obtained from a project funded by JICA. NeedsAssessment/publicdocuments/EMA%20
Ruben Vargas Franco, Dinand Alkema, Lorena -DisasterLossAssessmentGuidelines.pdf
Montoya, Michiel Damen, Nanette Kingma,
ESRI. (2010). Virtual campus courses: HAZUS-
Antonio Naverette, Jean Pascal Iannacone,
MH (multi-hazards) for decision makers. Re-
Manzul Hazarika and Norman Kerle are thanked
trieved from http://training.esri.com/Courses/
for their contributions on various aspects of this
case study. Colleagues from ADPC, AIT, CDUT, FEMA. (2010a). Mitigation planning: “how to”
ICIMOD, UNAM, CLAS, IIRS and UGM are guides. Retrieved from http://www.fema.gov/
thanked for the friendly collaboration and the plan/mitplanning/risk.shtm
testing of the training package in various train-
FEMA. (2010b). Hazus, FEMA’s software for es-
ing courses. The following persons are thanked
timating potential losses from disasters. Retrieved
for the translation into Spanish: Carlos Saavedra,
from http://www.fema.gov/plan/prevent/hazus/
Jose Antonio Navarrete, Ruben Vargas, Edward
Gonzalez, Estuardo Lira, and Manolo Barillas. FEMA. (2010c). Understanding your risks. Iden-
Fan Xuanmei is thanked for translating part of it tifying hazards and estimating losses. Retrieved
in Chinese. Kakha Bhattak This work is part of from http://www.fema.gov/library/viewRecord.
the United Nations University – ITC School for do?id=1880
Disaster Geo-Information Management (http://
Harp, E. L., Castaneda, M., & Held, M. D. (2002).
www.itc.nl/unu/dgim).
Landslides triggered by Hurricane Mitch in Te-
gucigalpa, Honduras (Tech. Rep. No. 02-0033).
Retrieved from http://pubs.usgs.gov/of/2002/
REFERENCES
ofr-02-0033/
ADPC. (2005). Knowledge development, educa- ILWIS. (2010). The integrated land and water
tion, public awareness training and information information system. Retrieved from http://52north.
sharing: A primer of disaster risk management in org/
Asia. Retrieved from http://www.adpc.net
International Institute for Geoinformation Science
Blaikie, P., Cannon, T., Davis, I., & Wisner, B. and Earth Observation (ITC). (2004). Módulo de
(1996). At risk: Natural hazards, people’s vulner- capacitación: Aplicación de Sistemas de Infor-
ability, and disasters. London, UK: Routledge. mación Geográfica y Sensores Remotos para el
DEBRIS. (2006). Innovative education for risk Análisis de Amenazas, Vulnerabilidad y Riesgo.
management: Development of innovative forms of Retrieved from http://www.itc.nl/external/unesco-
learning and teaching oriented towards building rapca/start.html
a family of new curricula in the field of natural IPCC. (2007). Fourth assessment report: Climate
risk. Retrieved from http://www.e-debris.net change 2007: Impacts, adaptation and vulner-
EM-DAT. (2009). The OFDA/CRED international ability. Retrieved from http://www.ipcc.ch/ipc-
disaster database. Brussels, Belgium: Université creports/ar4-wg2.htm
Catholique de Louvain.

1130
The RiskCity Training Package on Multi-Hazard Risk Assessment

Mastin, M. C., & Olsen, T. D. (2002). Fifty-year UN-ISDR. (2005). Hyogo framework for action
flood-inundation maps for Tegucigalpa, Honduras 2005-2015: Building the resilience of nations and
(Tech. Rep. No. 02-261). Retrieved from http:// communities to disasters. Retrieved from http://
pubs.usgs.gov/of/2002/ofr02261/ www.unisdr.org/wcdr/intergover/official-doc/L-
docs/Hyogo-framework-for-action-english.pdf
Munich Re. (2009). Topics GEO: Natural catastro-
phes. Retrieved from http://www.munichre.com UN-ISDR. (2009). Terminology on disaster risk
reduction. Retrieved from http://www.unisdr.
NAHRIS. (2006). Dealing with natural hazards
org/eng/terminology/terminology-2009-eng.html
and risks. Retrieved from http://www.nahris.ch/
van Westen, C. J. (2010). GIS for the assessment of
Peters Guarin, G. (2008). Integrating local knowl-
risk from geomorphological hazards. In Alcantara-
edge into GIS based flood risk assessment, Naga
Ayala, I., & Goudie, A. (Eds.), Geomorphological
city, The Philippines (Doctoral dissertation, Wa-
hazards and disaster prevention (pp. 205–219).
geningen University). ITC Dissertations, 157, 352.
Cambridge, UK: Cambridge University Press.
Smith, K., & Petley, D. N. (2009). Environmental doi:10.1017/CBO9780511807527.017
hazards: Assessing risk and reducing disaster.
World Bank. (2010). Central American probabi-
London, UK: Routledge.
listic risk assessment (CAPRA). Retrieved from
UN-ISDR. (2004). Living with risk: A global http://www.ecapra.org
review of disaster reduction initiatives. Retrieved
from http://www.unisdr.org/eng/about_isdr/bd-
lwr-2004-eng.htm

This work was previously published in the International Journal of Applied Geospatial Research (IJAGR), Volume 3, Issue 1,
edited by Donald Patrick Albert, pp. 41-52, copyright 2012 by IGI Publishing (an imprint of IGI Global).

1131
1132

Chapter 68
Automatic Classification of
Decorative Patterns in the
Minoan Pottery of Kamares Style
Filippo Stanco
University of Catania, Italy

Davide Tanasi
Arcadia University, USA

Giuseppe Claudio Guarnera


University of Catania, Italy

Giovanni Gallo
University of Catania, Italy

ABSTRACT
An important feature of the Minoan culture is the pottery of Kamares style, that documents the Cretan
cultural production between the first half of the 2nd millennium BC. This high level painted production,
characterized by the combination of several diverse motifs, presents an enormous decorative repertoire.
The extraordinary variety of combinations between elementary motifs according to a complex visual
syntax makes interesting the automatic identification of the motifs, particularly upon potsherds. A com-
plete pipeline to accomplish this task is still a challenge to Computer Vision and Pattern Recognition.
Starting from a digital image ROI identification, motif extraction, robust contour detection should be
performed to obtain a bag of digital shapes. In a second phase each of the extracted shapes has to be
classified according to prototypes in a database produced by an expert. The co-occurrence of the dif-
ferent shapes in a specimen will, in turn, be used to help the archaeologists in the cultural and even
chronological setting.

DOI: 10.4018/978-1-4666-2038-4.ch068

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Automatic Classification of Decorative Patterns in the Minoan Pottery of Kamares Style

INTRODUCTION chronological layer’ is far from being understood,


determined a stop in the cognitive progress of the
In the past decades, the application of the com- Minoan civilization.
puter science in the archaeological research, and In this perspective Computer Vision and Pat-
especially in the field of prehistory, turned out tern Recognition could provide a great support in
from a simple auxiliary technology into a cog- automatically assisting the archaeologists in clas-
nitive strategy influencing the approach to the sification of Kamares pottery fragments (Figure2),
ancient artifacts. Although the introduction of especially considering that in many cases, the
the database enhanced the possibility of dealing visual informations available are of several kinds,
with large amount of text data, the problem of like watercolors, black and white and color pho-
taking into account huge groups of visual data tographs of unequal resolution and quality, pencil
still remains unsolved. One common case that sketches and high quality digital photographs.
makes hard the initial steps of an archaeological Within the research program Archeomatica Proj-
study is for example the analysis of decorative ect (IPLab, 2010) of the Catania University, de-
repertoires of some prehistoric pottery classes voted to the application of Computer Graphics
that are characterized by the exuberant use of and Image Processing techniques in the field of
a multiplicity of motifs. The most complicated prehistoric archaeology (Stanco, Battiato, Gallo,
artistic production of the Mediterranean prehis- 2011), a specific study for the developing of an
tory is certainly the Kamares style pottery (Figure automatic classification system of the Kamares
1), flourished in Crete in the first half of the 2nd simple decoration elements (Farinella, Stanco,
millennium, which main feature is to present a Tanasi, 2008), (Guarnera, Stanco, Tanasi, Gallo,
complex system of polychrome painted decoration 2010) (Gallo, Stanco, Tanasi, in press) started
with a rich decorative alphabet aimed to produce under the inspiration of the Center of Cretan
an endless variation of visual results. The highest Archaeology (2010) of the Catania University,
number of vessels and potsherds available of this dealing with the Kamares pottery from Phaistos
class all over Cretan territory and the fact that a (Levi, Carinci, 1988), (Carinci, 1997).
complete framework for the Kamares decorative The application of this research strategy on
grammar and the associations ‘motif/vessel/site/ the available visual corpus of image data is the

Figure 1. Examples of Kamares style vessels (this image has been obtained as a collage from several
public sources)

1133
Automatic Classification of Decorative Patterns in the Minoan Pottery of Kamares Style

Figure 2. Selection of Kamares pottery sherds from Phaistos (courtesy of Prof. V. La Rosa)

object of the research that we present in this (Knappett, 1999), (Zois, 1998). This class, with
chapter. The chapter is structured as follow: in a large lifespan that covers the Protopalatial and
Sections 1.2 we introduce the Kamares pottery. Neopalatial periods, can be divided in four main
Section 1.3 shows the proposed technique. In phases related to the local chronology arranged
Section 1.4 a review of existing shape similarity by A. Evans (1921): Pre-Kamares or 1 (Middle
techniques Minoan IA), Early Kamares or 2 (Middle Minoan
IB/IIA), Classical Kamares or 3 (Middle Minoan
Background: The Kamares Pottery IIA/IIB/IIIA), Post-Kamares or 4 (Middle Minoan
IIIA/IIIB). Kamares ware is named for finds first
The Kamares style pottery represents the main identified at the end of 19th century in the Kamares
artistic feature of the Minoan civilization between cave sanctuary in Mount Ida, and its diffusion on
the 20th and 17th century BC and its technical the Cretan territory can be distinguished between
and stylistic level achieved was never equaled in the elegant production of the palatial centers of
the Aegean Bronze Age (Walberg, 1978), (Wal- Knossos and Phaistos (Palatial Kamares) (Day,
berg, 1983), (Walberg, 1987), (Walberg, 2001), Wilson, 1998) and the less impressive creations

Figure 3. Map of Crete showing principal sites with Kamares style pottery (Walberg, 1983)

1134
Automatic Classification of Decorative Patterns in the Minoan Pottery of Kamares Style

of the provincial workshops of Eastern, Western, it was found appropriate for vases which had to
Central and Eastern-Central Crete (Provincial be strong and practical rather than attractive.
Kamares areas 1-4) as in Figure 3. In a preliminary survey, seventy-four different
Fine fabric, thrown on the wheel, sometimes shapes were identified including both the deco-
with very thin bodies (2-3 mm like in the egg shell rated and fine ware and domestic and specialized
varieties), painted with polychrome pigments on forms. During the four chronological phases, open
dark background, the Kamares style has a vast and closed shapes are equally preferred and even
assemblage of shapes and its distinctive feature if there is a general conservativeness in the use
is a huge repertoire of exuberant decorative mo- of the same formal typologies some specific ten-
tifs often resembling naturalistic atmospheres dencies for each phase can be defined (Figure 4).
(Figure 1). About the colors, the black slip which For what concerns the decoration, the Kamares
covers most of Kamares vessels has been identi- style is basically characterized by the application
fied as a clayish paint layer containing iron oxides of thirty-one different core motifs (abstract, rec-
of spinel type with a high content of potassium. tilinear, pictorial, pictorialized motifs) and patterns
It is produced by exposure to high temperatures (stone, rock, sponge and metal surface patterns),
in a reducing atmosphere, thereby turning in into each of them with several levels of varieties,
a black sintered layer. The red color consists of composed together in order to obtain complex
red ochre, which in contrast with the black slip decorative syntaxes joined to create much more
does not contain potassium at all. As a result of complicated composite designs.
the lower content of potassium, the red paint In general the repertoire of core motifs main-
layer does not melt and turn into a sintered layer tains itself the same in both the main production
in a reducing atmosphere but it remains porous districts, the Palatial and Provincial, and basically
and turns bright red when exposed to oxidation. remains unaltered during all the chronological
The white pigment is composed by calcium sili- development, just with few elements peculiar for
cate or talc depending on the chronological pe- some phases. But it must be also considered that a
riod of the production and of the production core motif, even simple, can deeply change shape
centre itself. In the decoration of large vessels of when transformed from a 2D sketch to a 3D design
domestic type also a less elaborated and much on the vessel body, when combined with copies
lasting dark on light decoration system is used as of the same motif or joined with other motifs,

Figure 4. Schematic shape taxonomy of the Kamares pottery repertoire (Walberg, 1978)

1135
Automatic Classification of Decorative Patterns in the Minoan Pottery of Kamares Style

Figure 5. Schematic taxonomy and possible elaborations of some Kamares decorative motifs (Walberg,
1978)

when elaborated or pictorialized and in relation to or opposed to the movement inherent in the ves-
different chronological phases. As a result all this sel shape. The dynamic effects of the vase and
makes very hard, in some cases, the classification the decoration together create the overall effect,
of each element (Figure 5). that can give, for example, the illusion of the
About the composition of decorative elements contraction or expansion of some vessel parts. In
and its relation with the tectonic of the vessel, the selection of the composition strategies it’s
two different method can be identified. First is possible to find difference between the Palatial
the unity decoration that depends on the shape of and Provincial production and to define tendencies
the vase and that can be circumcurrent, that means from the Pre-Kamares to the Post-Kamares
planned with regard to the whole vessel body and phases (Macgillivray, 1998).
composed by encircling zones, facial, that means The overall effect of a Kamares vessel is
planned with regard only to one part of the vessel sometimes completed by the application of an
body as seen from a special angle, and zonal, that accessory decoration represented by incision,
means composed by a system of decorated zones relief and applied plastic details or by the use of
related to each other in order to create coherent complementary embellishment techniques like
designs. Second is the structural decoration that the ripple, barbotine and the trickle decoration.
consists in separation of the vase into different Finally, it must be noticed that differences can
parts and accentuates the horizontal and vertical be find also within the same centre and in the
axes. In the both type of decoration a field division same period because, besides the guidelines of
for the location of the motifs is applied, selecting this production, the essence of the style itself was
different parts of the body vessels for the defini- strongly influenced by the creativity and aesthetic
tion of the limits of the decorated area. In the field concept of the single artisan, that constantly elabo-
division the choice of the decoration strategy for rated the core motifs and experimented new ways
the accessory parts of the vase, like rim, lip, of expression. As a result, the Kamares style is
handle, spout, is very peculiar of each period and more than the simple combination of syntaxes of
production centre. It must be also considered that elementary motifs with different vessel type. It is
both the shapes and the decoration contain dy- a complicated interplay, carried out to the small-
namic effects and the decoration cannot merely est detail, where every elements has an important
be described as a movement across a surface, as part, especially in the Palatial production. The
in the case of a static background. It must be fact that each Kamares style product depends by
described as a movement in the same direction many quantifiable variables (time, place, vessel

1136
Automatic Classification of Decorative Patterns in the Minoan Pottery of Kamares Style

shape, high number of probable composition of of visual references arranged by the scholars, has
core decorative elements, colors and accessory become the hardest part of the research.
decoration techniques) and just by one variable Although the goal of an exhaustive classifica-
not quantifiable at all that is the unpredictable tion of Kamares pottery’s production is likely to
will of the artisan, it made quite impossible for be unobtainable with the present state of the art,
the archaeologists specialized in the Middle Computer Vision and Pattern Recognition could
Minoan pottery to carry out a exhaustive study provide a great support in automatically assisting
of the main features of this style, also because of the archaeologists in the classification task. Our
the huge amount of data represented by millions contribution illustrates a complete pipeline to
of decorated potsherds. automatically process these data.
The first study of classification of the Kamares The processing starts with the extractions of
pottery was carried out by G. Walberg in 1978 and it a clean representation of the decorative designs.
was dedicated to the Palatial production of Phaistos Our goal is to create a standardized database of
and Knossos (Walberg, 1978), (Walberg, 1987) “shapes” that could be successively automatically
(Figure 3). In this work, the role of Phaistos, as the investigated with Pattern Recognition methods.
most important production centre and also the only
site with a clear chronological progression in the Iconographical
development of the production itself, was pointed Documentation Available
out and a corpus of core motifs and patterns found
there was published. Furthermore, the features Besides the large amount of specimens available,
characterizing the climax of the Kamares style another significant problem in an exhaustive study
were find between Middle Minoan IIA and Middle of the Kamares style pottery is dealing with an
Minoan IIIA, within the phase named Classical heterogeneous group of iconographical data.
Kamares. In 1983 a second reassessment of the For the two most important Palatial production
Kamares materials from Provincial districts was centers, Phaistos and Knossos, the documenta-
carried out by the same author (Walberg, 1983), tion is basically composed by watercolors, black
emphasizing analogies and differences between and white pictures, technical drawings in scale
Palatial and Provincial production and trying to with front view and section and reconstructive
rebuild the decorative repertoire, as whole as unscaled perspective drawings. The best preserved
possible, in order to enrich the main assemblage vessels representing the highest stylistic level of
and obtain a complete study and classification of the Kamares style, in the editions of the two sites,
the Kamares pottery. were illustrated by unscaled watercolors carried
After the Walberg’s corpora, a large quantity out by E. Stafani, R. Oliva and Th. Fanourakis
of new Kamares vessels were found and published for Phaistos and D. Mackenzie for Knossos. In
together with groups of pottery coming from the reproduction of the complex Kamares painted
old excavations, in both cases often fragmented decoration, each modern artist or technician at
potsherds. This re-opened the problem for the work was influenced by his own personal taste
archaeologists of dealing with a class of materials and this caused, in many cases, alteration and
so variable present in huge amount of specimens. distortion of the original shape of the motifs and
In particular the interpretation of the decorative of the original scheme of the syntaxes.
motifs and syntaxes partially preserved on the It must be also considered that the draftsmen
sherds, in order to ascribe them to the repertoire working in Crete in the first half of the XX century,
of a specific workshop, trying to match the frag- had different formation and artistic education.
mentary information with a standard collection For the reproduction of the Phaestian Kamares,

1137
Automatic Classification of Decorative Patterns in the Minoan Pottery of Kamares Style

E. Stefani, chosen by the director L. Pernier for berg’s corpora (Walberg, 1983), (Walberg, 1987)
the documentation of the excavation, was an ar- will be discussed.
chitect. Decades later, D. Levi, new head of the
expedition, chose R. Oliva and Th. Fanourakis, two
painters strongly influenced by his archaeological KAMARES SHAPE ANALYSIS:
point of view. Otherwise at Knossos, a large part SYSTEM OVERVIEW
of the graphical documentation, including the
watercolors, was carried out by D. Mackenzie The automated system that we are building to
(Momigliano, 1999), that was an archaeologist, assist the archaeologist in assessing similarities
second in charge after A. J. Evans, head of the among the Kamares findings follows a general
mission. In this case he demonstrated a more pipeline that could be easily adapted to other
straight and scientific method of depicting artistic similar case studies. As in most Computer Vision
features (Mackenzie, 1906). applications the idea is to go from the raw data
Furthermore, the fact that the documentation (pixels) to a symbolic representation of the image
of Kamares vessels from Knossos dated back to content. These information may, in turn, be fed
1921-1935 (Evans, 1921) while that of Phaistos to an intelligent system that assists the experts in
was performed in 1935-1951 and 1976 (Pernier, formulating and checking working hypotheses
1935), (Pernier, Banti, 1935), (Levi, 1976), it about the scenes captured in the original images.
determined different choices in the reproduction More specifically for the present case study the
strategies, like different line thickness or colors, system starts with digitalized format of a pictorial
due to the changing taste of the time (Figure6). representation of a vase or of a sherd. Figure 7
Finally, another misleading problem in the inter- shows the successive steps performed. The digital
pretation of the Kamares pottery from Knossos data are the input of a contour extraction algorithm.
come also from the reconstructive drawings, where Please, notice that in this way we are disregarding
in many cases the hypothesis of the missing part any 3D information: perspective deformation, and
of a motif or of a decorative outline was denied occlusions together with noise and artifacts due
by the subsequent findings. to low resolution are simply ignored at this stage.
In this paper the results achieved working Contour extraction produces a collection of
basically on the visual archive provided by Wal- contours: some of them are not relevant for our

Figure 6. Watercolors of Kamares style pottery and potsherds: (a) Phaistos 1935 drawings by E. Stefani
(Pernier, 1935); (b) Phaistos 1976 drawings by R. Oliva (Levi, 1976); (c) Knossos 1935 drawings by
D. Mackenzie (Mackenzie, 1906)

1138
Automatic Classification of Decorative Patterns in the Minoan Pottery of Kamares Style

Figure 7. The complete pipeline of the proposed technique

application and should be filtered out (Gonzalez, manually produced by an expert draftsman, are
Woods, 2006), (Gonzalez, Woods, Eddins, 2009), available. These lucky cases reduce the problems
(Moore, 1968). We choose to ask the interventa- due to 3D distortion and occlusion but introduce
tion of the expert for this Region Of Interest (ROI) in the pipeline the bias of the draftsman.
identification task because of the occurrence of The proposed system deals with simpler shapes
cases like those reported in Figure 8. In the Figure one at each time. They are analyzed by the shape
several occurrences of the “raquet” decorative analyzer and reduced to a set of numerical and
pattern are shown. It is clear that the isolation of geometrical invariants. These invariants are the
this complex motif is the product of an informed final features that the matching module of the
semantic choice and that this expert choice is not system considers for search and similarity re-
easily mimicked with a fully automatic approach. trieval into a reference shape database.
ROI identification most commonly reduces the The reason to deal with the elementary simpler
set of contour shapes to examine. Even so Kamares shapes instead that with the set of contours in a
decorative motifs are reduced by this step into a ROI has the following motivations:
smaller but yet complex set of elementary shapes.
Figure 9 shows several instances of the pattern • Algorithms that deal with simple contours
that are obtained after ROI selection. Observe that and extract suitable invariants from them
in some case it is more convenient to refer to the are much better understood and robust at
“flattened” version of the motifs whenever these, the actual state of art;

Figure 8. Examples of occurrences of the “raquet” motif in different vases

1139
Automatic Classification of Decorative Patterns in the Minoan Pottery of Kamares Style

Figure 9. Kamares potteries and relative decorations

• A ROI represents a complex figurative different methods of contour extraction and pres-
“proposition” composed of simpler el- ent different algorithmic challenges. Once the
ementary shapes. Unfortunately, the “order pictorial data have been translated into a more
of reading” of such complex visual struc- abstract “contour shape” representation it is pos-
tures is not linear. Indeed it is not easy to sible to apply to them the shape similarity measures
“read” those drawing in a canonical un- presented below. The idea is to present the shape
ambiguous way. database with a “query” which represents a
decorative element that the archaeologist wishes
Shape Similarity Measures to recognize. The answer to the query is an estimate
of the similarity distance between the query shape
To apply the technique presented in this paper and the shapes in the database. In this chapter we
is necessary a contour line representation of the present different shape similarity algorithms:
motifs. Hence, a decorative pattern is previously contour flexibility technique (Xu, Liu, Tang,
translated into a digital sequence of consecutive 2008), Shape Context (Belongie, Malik, Puzicha,
points on a raster plane. 2002), logo recognition into document images
Several issues about resolution and standard- (Shen, Jin,Chang, Wu, 2005), Procrustes methods
ization of these rasterized contours arise. Going (McNeill, Vijayakumar, 2006), and Circular
from digital images to digital shapes is far from Blurred Shape Model descriptor (Escalera, Fornes,
being an easy task: different media (water colors, Pujol, Escudero, Radeva, 2010).
photos and hand drawn schematic lines) require

Figure 10. (a) Figure interpretation by the archaeologist. (b) Figure interpretation by the computer
scientist. The same color indicates the same symbol.

1140
Automatic Classification of Decorative Patterns in the Minoan Pottery of Kamares Style

Notice that even if the published ideas in shape The shape context considers the set of n-1 vec-
recognition are overabundant in the present scien- tors originating from a point to all other sample
tific literature most of them are relative to complex points on a shape, which expresses the configura-
vision tasks where one wishes to recognize real tion of the whole shape with respect to the reference
objects that move and deform within a complex point. Since shapes may vary from an instance to
real environment. This is not the case for our ap- another in the same category, the full set of vectors
plication where the most common deformations is too detailed and does not guarantee robustness.
observed are affine distortions, noise, irregulari- The distribution over relative position is a more
ties, lacunae, and non linear scaling. robust and compact description, hence for each
point pi on the shape, a coarse histogram hi of the
Shape Matching using relative n-1 coordinates is computed:
Shape Context
hi (k ) = #{q ≠ pi : (q − pi ) ∈ bin(k )} (1.1)
Belongie et al. in 2002 propose a correspondence-
based shape matching method using shape contexts
The length r and orientation θ of the vectors
(Belongie et al., 2002). Shape matching using
(q- pi) are quantized to create a histogram map
shape contexts is an improvement to traditional
which is the shape context used to represent the
Hausdorff distance based methods. It extracts
point pi. To make the histogram more sensitive to
a global feature, called shape context, for each
the location of nearby points than to the location
corresponding point. The matching between cor-
of points farther away, these vectors are mapped
responding points is then the matching between
into a log-polar space representation (Figure11).
the context features. In this approach an object is
Shape contexts are distribution represented as
treated as point set and it is assumed that the shape
histograms: it is hence possible to use the χ2 test
of the object can be captured by a finite subset of
statistic to define the cost of matching two points.
its points, for example sampled uniformly from
Consider two shapes P and Q. A point pi on the
the external contour. These points do not need
first shape and a point qj on the second shape, Ci j
to be key-points such as inflection points, cusps,
= C(pi, qj) denotes the cost of matching the two
extrema of curvature, etc., and can be derived
points in exam, where
from edge pixels found by a simple edge detec-
tor, whose output is a set P of n points P = {p1,
p2, …, pn}, pi ∈ ℜ2.

Figure 11. The log polar maps of the contest of two different points in the contour of a shape

1141
Automatic Classification of Decorative Patterns in the Minoan Pottery of Kamares Style

2
K h (k ) − h (k ) his guests to make them fit to his bed. Correspond-
C (pi , q j ) = 1 / 2∑ 
 i j  ing regions of shapes often appear at slightly differ-
h (k ) + h (k )
k =1 i j ent scales and positions and matching algorithms
must be flexible to adjust these variations. Simple
and hi(k), hj(k) denote the K - bin normalized global point methods (McNeill, Vijayakumar,
histogram respectively at pi and qj. Given the set 2005), (Zhang, Zhang, Krim, Walter, 2003) and
of costs Ci j between all pairs of points pi of P and non linear approaches (Bookstein,1997), (Sebas-
q j . of Q, the total cost of matching tian, Klein, Kimia, 2003) suffer from presence
H (π) = ∑ i C (pi , q π(i ) ) has to be minimized, of smooth deformation and may produce a large
discrepancy between a perceived difference and
with the constraint of one-to-one matching. The
its numerical estimation. HPM avoids this by
matching of two shapes then is done by matching
matching in a global to local direction. Longer
two context maps of the shapes, which is a matrix-
segments that have been matched provide initial
based matching. It minimizes the total cost of
matches for the shorter segments, which can then
matching between one context matrix and all the
slide and stretch/contract in order to find the best
permutations of another context matrix. To reduce
matches at this smaller scale.
the matching overhead, the shortest augmenting
Given two objects represented by n 2D points:
path algorithm for the matrix matching is used.
When the number of sample points is not equal U = (U1, …, UN)N and V = (V1, …, VN)N ∈ ℜN ×2
on the two shapes, the matching is done adding this approach normalizes the boundary length of
dummy nodes to the smaller point set, with a the polygon associated with V and then transforms
constant matching cost of εi. Since all measures U to match V. Each point is represented as a com-
are taken with respect to points on the object, plex number Vn = (xn,yn) → xn + i yn = wn ∈  .
invariance to translation is intrinsic to the defini- Then, V→ w and U → z ∈ N . After centering w
tion of the descriptor. The use of histograms makes and z so that ∑ n =1 wn = ∑ n =1 z n = 0 , the Pro-
N N

shape context insensitive to small deformations, crustes Distances (PD) between U and V is given
non linear transformation occlusions and presence by
of outliers. Additional robustness to outlier can
be added not allowing points labeled as outlier to
d p (U ,V ) = (w * w − (w * zz * w ) / z * z ) / N
contribute to any histogram. The above definition
of shape context is not invariant to rotation, which (1.2)
can be obtained using a relative frame instead of
the absolute frame described, e.g., treating the In order to compute HPM is necessary to
tangent vector at each point as the positive x- compute the Global Procrustes Matching (GPM)
axis. that find the best correspondence between U
and V with the smallest PD among the N valid
Hierarchical Procrustes Matching correspondences that respect the cyclic order
of the indices. This initial correspondence is in
Hierarchical Procrustes Matching (HPM) (Mc- turn refined using a vector that is one half of V
Neill, Vijayakumar, 2006) is a segment-based and finding the smallest PD with each “half” of
shape matching algorithm that approximates U. The matched segments at this length provide
the perceptual similarity between two shapes by initial estimates for matching at the successive
matching progressively smaller boundary seg- scale that can be performed recursively. The shape
ments. The denomination makes reference to a similarity of U and V is the weighted sum of scores
mythological character that used to stretch or cut at each length, where the weight is related to the

1142
Automatic Classification of Decorative Patterns in the Minoan Pottery of Kamares Style

segments length. Let Sl denote the sum of PDs cases articulated high flexible parts correspond
over the matched segments at a fixed length l. to high curvature points. Let ℓ be a closed simple
The asymmetric similarity of U and V is given by contour, surrounding a bounded domain D. For
a point p on a contour ℓ and a given radius r, the
dF (U ,V ) ≡ w100d p (U ,V ) + ∑ wl sl contour flexibility is defined as ω(p, r) = min(ω+(p,
l =50,25,12.5 r), ω-(p, r)), where ω+ is the interior flexibility
(1.3) and ω- is the exterior flexibility:

where the wl are constant weights, 100 is the


boundary length, and 50, 25 and 12.5 are length ∫C p+,r
k+ (x )dx
ω+ (p, r ) = (1.5)
of segmentswhen we divide the boundary. Since
dF(U, V) ≠ dF(V, U), the symmetric similarity is
∫ C p+,r
dx

given by

DF (U ,V ) ≡ dF (U ,V ) + dF (V ,U ) (1.4)
∫C p−,r
k− (x )dx
ω− (p, r ) = (1.6)
∫ C p−,r
dx
In order to consider the average confidence
with which each segment match is selected, the
C p+,r and C p−,r respectively are the connected
PDs are normalized by making the average PD
equal to 1, and then apply HPM as normal. This is components containing p in the sets {x ∈ D:║ x- p
also known in literature as Normalized Procrustes ║≤ r} and {x ∈ ℜ2 -D:║ x- p ║≤ r}. k+ and k- are
Distances. two function of distance transform on ℜ2 ,

Shape Matching by k+ (x ) = d (x , ℜ2 \ D ) and


Contour Flexibility k− (x ) = d (x , D ), x ∈ ℜ2 (1.7)

It this Section we briefly report the main ideas d(⋅,⋅) is the minimum Euclidean distance between
about contour flexibility. The reader is urged two sets. The radius r is called bendable size and
to find the details of this algorithm in (Xu, Liu, should be tuned taking into account the width of
Tang, 2008). Vision science has developed as the limb-like parts of an object. The bendable size
an interdisciplinary research field, frequently r and the contour flexibility are proportional to the
involving concepts and tools from computer sci- scale of the contour but invariant to translation,
ence, image processing, biology, psychology and rotation and the choice of starting point for the
cognitive science, for example the importance parameterization of the contour. Let
of high curvature points in shape perception has
been already described in (Atteave, 1954) and z(t) = (x(t), y(t)); 0 ≤ t < 1 (1.8)
(Biderman,1985). From this theories the important
features for shape recognition come from parts of be the arc-length parameterization of a contour
the object where changes occur (e.g., corners). A ℓ. Since it is difficult to match flexible parts of
recent descriptor, named contour flexibility (Xu, two contours (e.g., using uniform sampling of the
Liu, Tang, 2008), depicts the deformable potential contour), a better strategy is to give large weights
at each point along a closed curve, extracting both to inflexible landmarks and smaller weights to
global and local features, with proper trade-off more flexible landmarks, using more samples
between them. It can be observed that in most

1143
Automatic Classification of Decorative Patterns in the Minoan Pottery of Kamares Style

on the segments of a contour which are more given a number of circles C, number of sections
inflexible. Let S, and an image region I, a correlogram B = {b{1,1},
…, b{C, S}} is a radial distribution of sub-regions
t
of the image. In each region b of the correlogram
∑ ω(z (u), r )du the centroid b* can be located by its coordinates
γ(t ) = 0
1
(1.9) and used as reference point. The regions around b
∑ ω(z (u), r )du define the neighborhood of b, and the number of
0
neighbors depends on the spatial location b (e.g.
inner circle, middle circle or extern circle).
the optimal sampling can be obtained with sam- The descriptor makes use of the information
pling speed dγ/dt. Considering each landmark a obtained from the contour of the object, which
complex number zk =xk+ jyk, the contour can be can be extracted for example by means of an edge
treated as a complex vector Φ=(z1, z2,…, zn)T and detector. Every point x in the contour map is taken
the Procrustean Distance between two sequences into account, calculating first the distance to the
can be used for global matching of Φ1 and Φ2 : corresponding centroid of the region in which x
lies, then the distances to the centroids of neighbor
(
dˆ(Φ1, Φ2 ) = cos−1 max S∈N Φ1, σ s (Φ2 ) ) regions. The inverse of these distances are com-
puted and normalized by the sum of total distances
(1.10)
and the obtained values are then added to the cor-
responding positions of the descriptor vector n,
where σ s (Φ2 ) is a cyclic permutation of Φ2 , to
which as dimension C×S. As in the case of shape
achieve independence from the starting point and context, the use of histograms makes description
“ ” denotes the inner product of two complex tolerant to irregular deformations. As for the com-
vectors. The matching score between the two plexity, for a map of k relevant contour points, the
shapes represented by Φ1 and Φ2 is determined computation of the descriptor requires O(k) simple
by operations. Parameters C and S defines the degree
of spatial information taken into account in the
description process (i.e., as the number of regions
αˆ
Md =
π
( )
d (Φ1, Φ2 ) + D Ω1, Ω2 (1.11) increase the description becomes more local) and
should be tuned for each particular application.
See also Figure 12. To obtain a rotationally invari-
where D(⋅,⋅)is the warping distance between the ant descriptor a second step needs to be included.
two sequences of the contour flexibility values The main diagonal Gi of correlogram B, with the
extracted from the contours; Ω1, Ω2 are the se- highest density, is searched. This diagonal is then
quences of the values of the contour flexibility at used as a reference to rotate the descriptor. The
each landmark of uniformly sampled sequences, orientation of the rotational process, so that Gi is
starting form the leading landmark ŝ found by aligned with the x-axis, is that corresponding to
(1.10), α weighting factor, π normalization factor. the highest description density at both sides of Gi.
Circular Blurred Shape Model is a recent rota- Once obtained the rotationally invariant CBSM de-
tionally invariant descriptor, which makes use of scriptor, it can be used to design a symbol spotting
a correlogram structure to capture spatial arrange- methodology (e.g., using Adaboost to distinguish
ment of object parts, shared among regions defined between foreground and background) or a multi-
by circles and sections (Escalera et al., 2010). The class classifier by embedding binary classifiers,
correlogram structure can be defined as follows: for instance using Adaboost to define a classifier

1144
Automatic Classification of Decorative Patterns in the Minoan Pottery of Kamares Style

Figure 12. Circular Blurred Shape Models of the same shape with different choices of the parameters
C and S.

based on the features that best discriminate one ognize. To use as a reference only the clean and
class against another, combining then binary clas- canonical drawing prepared by an expert is too
sifiers in a Error Correcting Code Framework. demanding for any shape recognition algorithm.
Following a general praxis in the shape recogni-
tion community we hence enriched the database
EXPERIMENTAL RESULTS with three variations for each decorative motif.
These variations have been artificially obtained
To test our system we have built an experimental applying the following transformations: rotation in
image database. The database has been created as clockwise sense of about 33 degrees, perspective
follows. First of all, a selection of images from camera distortion where camera orientation has
(Walberg, 1978) have been inserted in the col- been assigned at random and random warping.
lection. A total of 36 motifs have been chosen. Examples of the shape included in the database
The selection of these motifs among the much are reported in Figure 13, an example of varia-
larger catalogue in (Walberg, 1978) has been done tions over a given shape is reported in Figure 14.
randomly, although in this initial stage of our Eventually the database is made of 144 shapes.
research we choose to sort out the most complex Those shapes have been stored in binary images
motifs. Instances of these chosen patterns show a of 500×500 pixels. Those images have been pro-
high-class variability in terms of scale, rotation, cessed for contour extraction in Matlab (Lorensen,
rigid and elastic deformations when observed in Cline, 1987), (Maple, 2003). The resulting con-
the pictorial reference corpus. Some of the shapes tours have been resampled and for each shape a
exhibits a low inter-class variability especially vector of 100 points have been obtained. Using
for simpler shapes. This explains for example a higher sampling rate does not improve the per-
some resulting ambiguities of the classification. formance of the system, perhaps because a
The images selected are in a way the “canonical” denser sampling of a contour tends to preserve
reference for each of the decorative motifs to rec- some of the noise of the original image making

Figure 13.

1145
Automatic Classification of Decorative Patterns in the Minoan Pottery of Kamares Style

Figure 14.

recognition and matching a harder task. The We choose to check the performance only of
down-sampling has been performed in two dif- the CSBM algorithm and of the Contour Flexibility
ferent ways, and the results of both selections approach. Indeed CSBM is a refinement of the
have been stored in the reference database. The initial Belongie’s proposal and it is safe to assume
first way to sample 100 points from a closed that it will perform better. A similar reasoning
contour is to compute 100 equally spaced pints justifies the choice of Contour Flexibility over
along the contour. Notice that the sample points Hierarchical Procrustes Distance for our tests.
are obtained with cubic interpolations of close by The experiments have been carried out as
points on the contour. This uniform sampling follows. Each shape in the database has been, in
procedure is applied to be able to compute CBSM turn, considered as a query and the best matching
distance and to estimate local contour flexibility shapes (other than the query shapes itself) have
values. A second non-uniform sampling has been been considered as the results of the retrieval opera-
also performed. This second sampling takes 100 tion. In Table 1.1 we reported the results obtained
points according to contour flexibility values: with this set of experiments. In particular the first
more points are allocated in the most flexible column of the Table indicates the percentage of
segments of the shape. This non-uniform sampling total queries when the algorithms found, within
procedure is applied to be able to compute contour the four most similar shapes, the query shape itself
flexibility distance. Contour flexibility value in and the three variations of it in the database. The
each of the sampled point is also stored. In sum- second column in the table indicates the percent-
mary our system refers to this collection of 144×2 age of total queries when the algorithms found,
point vectors and 144 contour flexibility values within the three most similar shapes (other than
to compute similarity distances of queries from the query shape itself) the majority of shapes of
the database. Observe that this mathematical the correct class.
representation is quite compact and easily allows A closer observation of the results reveals that
the scaling up of the system up to thousands of most of the errors are “near misses”. An example
shapes. of these is given in Figure15.

Table 1. Performance of the shape matching procedures

% of majority of correct
% of perfect matches within
matches within the four closest
the four closest shapes
shapes
Circular Blurred Shape Model (CBSM) 48% 72%
Contour Flexibility (CF) 75% 99%

1146
Automatic Classification of Decorative Patterns in the Minoan Pottery of Kamares Style

Figure 15. Instance of false positives produced by the system, (a) CF; (b) CBSM

The results are very encouraging, although the in the available database it suggests three hypoth-
system is far from being error free it provides a esis of interpretation, the simple j-spiral (Walberg
valuable help to the expert. When trying to match 2.i.1), the running j-spiral (Walberg 2.i.2), even
a novel query an archaeologist, in almost all of possibly deformed, and the j-spiral with filled
the cases, has to look only at a very small (4 to angle (Walberg 6.2), again with a certain degree of
6) reference shapes provided as an answer from alteration. The immediate restriction of the range
the system. This greatly reduces the time that he of possible motifs gives to the archaeologists the
should otherwise spend to look among a larger chance to easily check the available data on the
set of candidate reference patterns. On the other geographical and chronological distribution of
hand at this stage of development of the system the motifs themselves and obtain confirmation for
the human expert intervention is still necessary. their initial hypothesis. In fact, while the simple
J- spirals has a wide diffusion in both Provincial
Real World Examples and Palatial areas and in the four chronological
phases, the j-spiral with filled angle is present
To further support our claim about the robustness in the Palatial area just in the phase 3 and in the
and the usefulness of our approach we present two Provincial district exclusively in the phases 2 and
real world examples. The queries and the results 3 and just in the East and East-Central Crete (in
are shown in Figure16. The application of the (Walberg, 1983) pp. 38-39, 41, pll. 28, 31; (Wal-
algorithm for the automatic classification of the berg, 1987), pp. 48-49, 51, 180, 183). Again in the
Kamares style pottery revealed itself particularly second example in Figure16(b), in the potsherd
efficient for the immediate discrimination of the is visible a series of teardrops motifs. After the
motifs on the chronological and geographical extraction of the contour, the algorithm suggests
scales. Basing upon the Walberg’s corpora it’s two different matches, the spiral derivatives (Wal-
possible to offer two examples of the procedure berg 8.33), both the original motif and its altered
working. In the first case in Figure16(a), we have versions, and the petaloid loops (Walberg 12.i.1).
a potsherd with a spiral type motif partially pre- Also in this case, the two motifs have exclusive
served. From the image the algorithm automati- distribution features. The spiral derivatives are
cally extracts the contour and checking for matches present in the Palatial area in phases 2-4 and are

1147
Automatic Classification of Decorative Patterns in the Minoan Pottery of Kamares Style

Figure 16. Examples of real queries to the database

absent in phase 1, while in the Provincial district specimens or altered by the chronological and
they occur in all the chronological phases but geographical variability of the production and by
restricted just to East, East-Central and Central the creativity of the Minoan artisans, let expect the
Crete. The petaloid loops have same distribution extension of this technique also to the decorative
in the Palaces as the above mentioned derivatives syntax of the Kamares style, in order to allow the
but in the Provincial areas they are exclusive of archaeologist to look forward for the long waited
phase 1 and 3 also having the same geographical exhaustive study of this so significant feature of
diffusion ((Walberg, 1983), pp. 43-44, 48-49, the Mediterranean prehistory.
pll. 35, 41; (Walberg, 1987), pp. 52-53, 57-58,
183, 188).
ACKNOWLEDGMENT

CONCLUSION AND FUTURE WORK The authors wish to thank Professors V. La Rosa
and F. Carinci for their precious support and the
In this chapter a pipeline to automatically classify many useful advices that they have provided dur-
simple Kamares decoration has been presented. In ing this research.
particular, we focused on the shape matching of the
pipeline showing how contour flexibility together
with Procustean distance, circular blurred shape REFERENCES
model or shape context may solve this problem.
The proposed system will be of great support in Attneave, F. (1954). Some informational aspects
automatically assisting the archaeologists in clas- of visual perception. Psychological Review, 61,
sification operations according time and place of 183–193. doi:10.1037/h0054663
production of Kamares pottery fragments. Belongie, S., Malik, J., & Puzicha, J. (2002). Shape
This first example of application of Computer Matching and Object Recognition Using Shape
Vision and Pattern Recognition techniques to a Contexts. IEEE Transactions on Pattern Analysis
specific topic of the prehistoric archaeology as and Machine Intelligence, 24, 509-522. doi:http://
the Minoan Kamares pottery opened a completely doi.ieeecomputersociety.org/10.1109/34.993558
new field of investigation that in the future can
deeply change the approaching of the scholars Biderman, I. (1985). Human image understanding:
to iconographical problems. The satisfactory Recent research and a theory. Computer Vision
results obtained with the classification of single Graphics and Image Processing, 32, 29–73.
motifs, even deformed by the preservation of the doi:10.1016/0734-189X(85)90002-7

1148
Automatic Classification of Decorative Patterns in the Minoan Pottery of Kamares Style

Bookstein, F. (1997). Landmark methods for Gonzalez, R. C., Woods, R. E., & Eddins, S. L.
forms without landmarks: morphometrics of (2009). Digital Image Processing Using MATLAB.
group differences in outline shape. Medical Im- Upper Saddle River, NJ: Gatesmark Publishing.
age Analysis, 1(3), 225–243. doi:10.1016/S1361-
Guarnera, G. C., Stanco, F., Tanasi, D., & Gallo,
8415(97)85012-8
G. (2010). Classification of decorative patterns
Carinci, F. (1997). Pottery workshops at Phaestos in Kamares pottery. In In proceedings of SCCG
and Haghia Triada in the protopalatial period, in 26yh Spring Conference on Computer Graphics
Aegaeum (Vol. 16). Center of Cretan Archaeology. (pp. 20-23). Comenius University, Bratislava.
(S.d.). Center of Cretan Archaeology. Retrieved
IPLab. (S.d.). Archeomatica Project. Retrived July
July 31, 2010 from http://www.cac.unit.it
31, 2010 from http://www.archeomatica.unict.it
Dawkins, R. M., & Laistner, M. (1912)... The
Knappett, C. (1999). Tradition and innovation
Excavation of the Kamares Cave in Crete. in
in pottery forming technology. Wheel-throwing
Annual of the British School at Athens, 19, 1–34.
at Middle Minoan Knossos. BSA, 94, 101–129.
Day, P. M., & Wilson, D. E. (1998). Consuming
Levi, D. (1976). Festós e la Civiltá Minoica I,
power. Kamares ware in protopalatial Knossos.
Roma.
Antiquity, 72, 350–358.
Levi, D., & Carinci, F. (1988). Festós e la Civiltá
Escalera, S., Fornes, A., Pujol, O., Escudero, A.,
Minoica.
& Radeva, P. I. (2009). Circular Blurred Shape
Model for symbol spotting in documents. In Lorensen, W. E., & Cline, H. E. (1987). Marching
ICIP09 (pp. 2005-2008). cubes: A high resolution 3D surface construc-
tion algorithm. In SIGGRAPH ‘87: Proceed-
Evans, A. (1921). The Palace of Minos at Knos-
ings of the 14th annual conference on Com-
sos. London.
puter graphics and interactive techniques (pp.
Farinella, G. M., Stanco, F., & Tanasi, D. (2008). 163–169). New York: ACM. doi:http://doi.acm.
Digital-Kamares: un sistema automatizzato per org/10.1145/37401.37422
riconoscere e classificare la ceramica minoica di
Macgillivray, J. A. (1998). Knossos. Pottery
stile Kamares. In AIAr 2008, V Congresso Nazio-
groups of the Old Palace period. London.
nale di Archeometria, Scienza e Beni Culturali
(pp. 547-556). Mackenzie, D. (1906). The Middle Minoan pottery
of Knossos. Journal of Hellenic Studies.
Gallo, G., Stanco, F., & Tanasi, D. (in press).
Riconoscimento automatico di motivi decorativi Maple, C. (2003). Geometric design and space
nelle ceramiche minoiche di stile Kamares da planning using the marching squares and march-
Festós. In 13a Giornata di Archeometria della ing cube algorithms. In Geometric Modeling
Ceramica: Nuove metodologie analitiche e di and Graphics, 2003. Proceedings. 2003 Interna-
trattazione dei dati nello studio della ceramica tional Conference on (pp. 90 - 95). doi:10.1109/
in archeologia ed in architettura. GMAG.2003.1219671
Gonzalez, R. C., & Woods, R. E. (2006). Digital McNeill, G., & Vijayakumar, S. (2005). 2D shape
Image Processing (3rd ed.). Upper Saddle River, classification and retrieval. In IJCAI’05: Proceed-
NJ: Prentice-Hall, Inc. ings of the 19th international joint conference
on Artificial intelligence (pp. 1483–1488). San
Francisco: Morgan Kaufmann Publishers Inc.

1149
Automatic Classification of Decorative Patterns in the Minoan Pottery of Kamares Style

McNeill, G., & Vijayakumar, S. (2006). Hierarchi- Stanco, F., Battiato, S., & Gallo, G. (2011). Digi-
cal Procrustes Matching for Shape Retrieval. In tal Imaging for Cultural Heritage Preservation.
CVPR ‘06: Proceedings of the 2006 IEEE Com- Analysis, Restoration ad Reconstruction of Ancient
puter Society Conference on Computer Vision and Artworks. CRC Press.
Pattern Recognition (pp. 885–894). Washington,
Walberg, G. (1978). The Kamares style. Overall
DC, USA: IEEE Computer Society. doi:http://
effects (Vol. 10). Uppsala: Acta Universitatis
dx.doi.org/10.1109/CVPR.2006.133
Upsaliensis. Boreas.
Momigliano, N. (1999). Duncan Mackenzie:
Walberg, G. (1983). Provincial Middle Minoan
a Cautious Canny Highlander and the Palace
pottery. Mainz.
of Minos at Knossos. (Vol. 72). Bulletin of the
Institute of Classical Studies, London. Walberg, G. (1987). Kamares. A study of the
character of Palatial Middle Minoan pottery.
Moore, G. A. (1968). Automatic Scanning and
Göteborg.
Computer Processes for the Quantitative Analy-
sis of Micrographs and Equivalent Subjects. In Walberg, G. (2001). The role and individuality of
PPR68 (pp. 275-326). Kamares ware. in Aegean Archeaology, 5, 9-18.
Pernier, L. (1935). Il palazzo minoico di Festós. Xu, C., Liu, J., & Tang, X. (2008). 2D Shape
Matching by Contour Flexibility. IEEE Trans-
Pernier, L., & Banti, L. (1951). Il palazzo minoico
actions on Pattern Analysis and Machine Intel-
di Festós.
ligence, 31, 180-186. doi:http://doi.ieeecomput-
Sebastian, T. B., Klein, P. N., & Kimia, B. B. ersociety.org/10.1109/TPAMI.2008.199
(2003). On Aligning Curves. IEEE Trans. Pattern
Zhang, J., Zhang, X., Krim, H., & Walter, G. G.
Anal. Mach. Intell., 25(1), 116–125. doi:http://
(2003). Object representation and recognition
dx.doi.org/10.1109/TPAMI.2003.1159951
in shape spaces. Pattern Recognition, 36(5),
Shen, D. F., Jin, L., Chang, H. T., & Wu, H. H. 1143–1154. doi:10.1016/S0031-3203(02)00226-1
(2005). Trademark Retrieval Based On Block
Zois, A. A. (1968). Der Kamares-Stil. Werden
Feature Index Code. In ICIP05 (pp. III: 177-180).
und Wesen (Diss.), Tübingen.

This work was previously published in Pattern Recognition and Signal Processing in Archaeometry: Mathematical and Com-
putational Solutions for Archaeology, edited by Constantin Papaodysseus, pp. 186-211, copyright 2012 by Information Science
Reference (an imprint of IGI Global).

1150
1151

Chapter 69
Cultural Dasymetric Population
Mapping with Historical GIS:
A Case Study from the
Southern Appalachians

George Towers
Concord University, USA

ABSTRACT
There has been a recent flurry of interest in dasymetric population mapping. However, the ancillary
coverages that underlie current dasymetric methods are unconnected to cultural context. The resulting
regions may indicate density patterns, but not necessarily the boundaries known to inhabitants. Dasymetric
population mapping is capable of capturing the cultural commonality and community interaction that
define social spaces. Dasymetric mapping may be improved with methodologies that reflect the ways in
which social spaces are established. This research applies a historical GIS methodology for identifying
early 20th Century agricultural neighborhoods in southern Appalachia. The case study is intended to
encourage discovery of additional methods for mapping population on the scale of lived experience.

INTRODUCTION landscape. The resulting dasymetric spaces ground


spatial analysis of social issues like environmental
Human geographers’ use of dasymetric mapping justice. Most dasymetric analysis, however, is
has increased with advances in geospatial data. predicated on the problematic assumption that
By intersecting land use imagery or property discontinuities in population density align with the
parcel data with census geography, gradations boundaries delimiting the micro-regional cultural
in population density are revealed that refine the landscape of neighborhoods and communities.
cartographic basis for investigating the cultural Cultural dasymetric mapping, cartography that
conforms dasymetric space to cultural contours,
DOI: 10.4018/978-1-4666-2038-4.ch069 is a rarity. The following review of the dasymetric

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Cultural Dasymetric Population Mapping with Historical GIS

literature chronicles this inconsistency, calling 2006). The foundational US Census unit, the cen-
attention to the need for culturally informed dasy- sus block, often endorses the ecological fallacy.
metric mapping. The methodology presented in the Outside high density city neighborhoods, census
case study section offers an example of cultural blocks expand to capture sufficient population and
dasymetric mapping and invites exploration of the their internal heterogeneity increases accordingly
diversity of techniques for infusing dasymetric (Crandall & Weber, 2005; Goodchild, Anselin, &
mapping with cultural significance. Deichmann, 1993; Mennis, 2003). The Census
Bureau’s use of roads to bound census blocks
contributes not only to the ecological fallacy
DASYMETRIC MAPPING but shows how arbitrary boundaries create the
MAUP. Rural roads focus residential settlement,
Dasymetric mapping is designed to improve upon community interaction, and identity (Daniels,
choropleth maps that employ arbitrarily bounded 1999; Theobald, 2005). Serving as census block
enumeration areas. Choropleth mapping is inher- boundaries, roads split rural neighborhoods. With
ently problematic in two fundamental ways. First, rural population concentrated along roadways,
choropleth maps suffer from the modifiable areal rural census blocks are doughnut-like, consisting
unit problem (MAUP) (Langford & Unwin, 1994; of peopled peripheries and empty centers. Road
Openshaw, 1984): “Any observed pattern in the boundaries heighten heterogeneity by lumping
mapped data may be to a large extent due to the together fragments of disparate neighborhoods
particular configuration of zonal boundaries used, linked by census block circumferences.
and all relationships observed between variables Dasymetric mapping promises to resolve the
will only hold for this particular aggregation” MAUP and the ecological fallacy by incorporat-
(Martin, 1996, p. 974). ing appropriate ancillary information. In keeping
Second, areal boundaries drawn without with the connotation of “dasymetric” in its original
reference to the choropleth map subject invite Russian and in Wright’s 1936 translational title,
the “ecological fallacy,” that is, areal units im- “A method of mapping densities of population,”
ply regional homogeneity and mask internal the most common method of dasymetric map-
heterogeneity. Regional homogeneity is defined ping forms boundaries with population density
by the human geography textbook trichotomy discontinuities. By adjusting thresholds separating
of functional, formal, and perceptual regions. density categories, dasymetric boundaries enclose
Functional regions are determined by spatially areas of homogenous density (Mennis, 2003).
conscribed interaction. Fellow residents of func- The application of density based dasymetric
tional regions - school districts, for example – are mapping to socio-spatial analysis is based on the
compelled to cooperate. Formal and perceptual longstanding assumption that density disconti-
regions are based on identity. In the case of formal nuities demarcate socially meaningful regions.
regions, inhabitants may be objectively ascribed a For example, Goodchild et al. argue that “… the
shared identity through common socioeconomic problem of basis change for socioeconomic data
or cultural characteristics. Perceptual regions are can be seen as one of estimating one or more
self-designated. Residents share an attachment underlying density surfaces” (1993, p. 388),
to place and regionally define their community. Martin supports “modelling the same informa-
As the basis for much spatial analysis, census tion [population characteristics] in the form of
geography offers an important example of these density surfaces” (1996, p. 974), and Moon and
two ills of choropleth mapping (Openshaw, 1996; Farmer assert that “Communities of place could
Schuurman, Leszczynski, Fielder, Grund, & Bell,

1152
Cultural Dasymetric Population Mapping with Historical GIS

be operationalized by measures of population analysis reflects the utility of this method in rep-
density …” (2001, p. 46). resenting settlement patterns. Density surfaces are
Land use / land cover (LULC) maps are the essential for subjects like emergency evacuation
most widely used ancillary data in the construc- planning that focus on where, not who, people
tion of population density surfaces. The simplest are (Chen, McAneney, Blong, Leigh, Hunter,
methodology is binary, coding LULC classes & Magill, 2004; Garb, Cromley, & Wait, 2007;
for the presence or absence of population. The Khomarudin, Strunz, Post, Zobeder, & Ludwig,
resulting presence / absence layer is combined 2008). Density discontinuities, however, are not
with census geography to reassign population necessarily relevant to distinguishing between
from the uninhabited to the inhabited parts of neighborhoods. For example, coherent communi-
census enumeration units. For example, the binary ties may be centered on open spaces like parks
method has recently been employed to analyze or, conversely, may bind a low density periphery
environmental justice and homelessness risk to a high density core.
(Boone, 2008; Fielder, Schuurman, & Hyndman, To demarcate culturally meaningful regions,
2004). Expanding upon this approach is the reclas- dasymetric methodologies must model identity
sification of LULC codes into several categories and interaction. For example, Trodd and Gearey
weighted for their estimated relative population (2006) incorporate participatory GIS in dasymetric
densities. Density surfaces are calculated by mul- mapping to delineate perceptual regions for the
tiplying the weighted coverage with population provision of early childhood services. Finding a
counts for census districts (Holt, Lo, & Hodler, close match between census geography and local
2004; Mennis, 2003, 2009). Mennis (2002) has leaders’ perceptions of neighborhood boundar-
used the weighted dasymetric method to measure ies, they suggest that infusing local geographic
environmental justice. identity with census information will help target
Residential property and structures are mapped social service delivery.
in a variety of forms, each of which is conducive Grannis (2005) uses dasymetric methods to
to dasymetric population mapping. For example, define functional and formal regions within cit-
Maantay, Maroko, and their colleagues have de- ies. Drawing upon the consensus that “walkable
veloped their Cadastral-Based Expert Dasymetric neighborhoods” evoke “socio-physical homoge-
System to assess a variety of environmental justice neity, a shared sense of place, connection, and
issues in New York City including flood hazards access” (Moudon et al., 2006, p. 102) and upon
and disease (Maantay & Maroko, 2009; Maan- his own survey research, Grannis postulates that
tay, Maroko, & Porter-Morgan, 2008; Maantay, networks of pedestrian streets bounded by heav-
Maroko, & Culp 2009). Parcel based population ily trafficked thoroughfares comprise meaningful
density surfaces serve to identify and measure urban neighborhoods. He automates neighborhood
suburban sprawl (Hasse & Lathrop, 2003; Irwin boundary identification with GIS and finds that
& Bockstael, 2002). Similarly, researchers have the resulting spaces generally coincide with urban
used house locations in conjunction with popula- census block boundaries. Grafting census data into
tion maps to generate density surfaces for socio- his walkable neighborhoods reveals residential
cultural analysis (Kirk, 2009; Moon & Farmer, racial segregation at a scale significant to resi-
2001; Theobald, 2001). Finally, postal coded dents. The functionally derived neighborhoods’
address points may be integrated into density internal homogeneity suggests that they are also
surface generation (Mesev & McKenzie, 2008). formal regions.
The volume of research devoted to develop- To accommodate standardized LULC and
ing and implementing density surfaces for social parcel data offering detailed information over

1153
Cultural Dasymetric Population Mapping with Historical GIS

vast expanses, dasymetric mapping has distanced to how topography circumscribed community.
itself from localized data collection. Contextually According to James S. Brown, a pioneering so-
sensitive research methods are, however, more ciologist of rural Appalachia, “Neighborhoods
appropriate for dasymetric mapping because develop among people who have frequent and
neighborhood and community boundaries are regular contacts, and in this region topography
grounded in local culture. The examples provided has helped to determine social relationships of
by Trodd and Geary and Grannis show that par- the inhabitants and to form neighborhood group-
ticipant observation and in-depth interviewing ings” (Brown, 1988, p. 52). Approximating the
merge with GIS analysis to dasymetrically reveal boundaries of these functional neighborhoods
the boundaries of regional interaction and identity. is a three part process: creating a terrain-based
The following case study presents another qualita- travel cost surface that modeled the mountains’
tive methodology useful for cultural dasymetric socially limiting effects, identifying nodal points
mapping, historical ethnography. centering each neighborhood, and, assigning land
to each neighborhood based on the cost of travel
to its node.
CASE STUDY: CULTURAL I chose Summers County, West Virginia, a
DASYMETRIC MAPPING OF county that was exceptionally statistically repre-
HISTORIC SOUTHERN APPALACHIA sentative of southern Appalachian farming from
1880 through 1940, as the pilot area for my initial
Methodology study. To make a travel cost surface for Summers
County, I used the 30 meter US Geological Sur-
In an earlier article, I incorporated the collective vey (USGS) National Elevation Dataset (NED).
observations of Appalachian ethnographers into I began by calculating a slope surface from the
GIS analysis to systematically map the boundar- NED. I then modified an equation from applied
ies of the rural southern Appalachia communi- physiology to convert the slope surface into a
ties of a century ago (Towers, 2010). Beginning caloric cost surface for walking at slope (Minetti,
in the 1930s, anthropologists, sociologists, and Moia, Roi, Susta, & Ferretti, 2002).
geographers produced case studies of the shared
culture underlying the region’s farm family neigh- k = (((280.5 * s5) – (58.7 * s4) – (76.8 * s3) +
borhoods between the Civil War and World War (51.9 * s2) + (19.6 * s) + 2.5) * w * d) / 4184
Two. On the typical semi-subsistence farms of the
era, families diversified production from grains k represents kilocalories expended
to fruits to vegetables and raised a menagerie s represents percent slope
of livestock including cattle, sheep, and pigs. w represents body weight in kilograms
Neighboring farmers depended on social and d represents meters traveled
economic reciprocity. Farm neighborhoods were
close-knit, kin-based homogenous communities In other words, the cost surface makes foot
that shaped residents’ identity. In other words, travel between any pair of NED cells measurable
they were simultaneously functional, formal, and in terms of its caloric expenditure.
perceptual regions. Central neighborhood locations in early 20th
Like Grannis, I used walking-based social century rural Appalachia were post offices,
interaction as a starting point for regional identi- country schools, and churches. Post offices were
fication. This pedestrian premise is supported by established in the tiny hamlets that dotted the
Appalachian ethnography which called attention countryside and were named on the 1912 USGS

1154
Cultural Dasymetric Population Mapping with Historical GIS

maps available for Summers County. A typical patterns to those documented by ethnographic
hamlet was a low order central place whose ser- and geographic fieldwork combine to confirm
vices typically included a post office, a general that my methodology offers a replicable and re-
store, a mill, and a blacksmith shop (Hart, 1975). liable means of bounding the social spaces that
Interspersed between hamlets were farm neighbor- structured the regional cultural landscape in the
hoods supporting one-room country schools. The decades surrounding the turn of the last century.
local schools were not only community centers
hosting educational, social, and political functions Study Areas
(Gulliford, 1996), they were also deliberately
evenly spaced to ensure that children’s walk to To show the utility of this social dasymetric
school was minimized (Holy, 1929). technique to map historical population, I examine
With neighborhood nodes placed on the walk- two West Virginia study areas: Wyoming County
ing cost surface, territory was assigned to each in the state’s southwestern coalfields, and, the
node based on ease of access. The NED cells farmlands of the lower Greenbrier River Valley
sharing the same lowest cost destination formed in Greenbrier and Monroe Counties (Figure 1).
my hypothetical agricultural neighborhoods. Structure locations from 1920s USGS 15 minute
Hypothesis confirmation consisted of con- topographic maps were digitized and are the pri-
firming that settlement patterns within the pro- mary data for the GIS analysis. The topographic
spective neighborhoods matched ethnographic maps for Wyoming County date from 1924 to 1929
descriptions. This was predicated on verifying except for the 1914 Eccles quadrangle covering
that the houses shown on the 1912 maps were a the far northeastern corner of the county. All of the
comprehensive inventory of the residential land- Lower Greenbrier Valley study area quadrangles
scape. Adjusting the number of mapped houses for were surveyed in 1921.
estimated vacancy rates and the household counts
from the 1910 manuscript census for undercount Wyoming County
produced nearly identical figures for magisterial
districts and supported using the 1912 USGS maps Settled in the early years of the 19th century,
as a primary data source. Wyoming County supported a pre-industrial agri-
I then reviewed ethnographic descriptions cultural economy based on subsistence and barter.
of neighborhood size and settlement patterns to The agricultural neighborhoods described above
establish expectations for the spatial patterns of organized community life in Wyoming County
house locations contained in the hypothetical through the 1800s (Cook, 2000). As elsewhere in
neighborhoods. According to secondary sources, the region, these farm neighborhoods found focal
Appalachian agricultural neighborhoods con- points in country schools. As of 1895, there were
tained between one dozen and four dozen farm- at least 95 small school buildings in the county
houses regularly dispersed along streams flowing (Bowman, 1965).
through hollows or along narrow ridgetop plateaus Wyoming’s timber and coal attracted land
(Wilhelm, 1978). Spatial analysis of house loca- speculators in the late 1800s. In advance of the
tions within the hypothetical agricultural neigh- railroad, the timber industry came to Wyoming
borhoods resulted in house counts, distributions, County in the 1890s. William Ritter, southern West
and densities that closely fit these expectations. Virginia’s leading timber baron, owned 30,000
The representativeness of the study area, the acres by 1906 and had built a company town at
legitimacy of the map data, and the faithfulness Estell, later renamed Maben. Complete with an
of the GIS-generated neighborhood settlement immigrant workforce and charges of peonage,

1155
Cultural Dasymetric Population Mapping with Historical GIS

Figure 1. Study areas

Maben resembled the coal camps that were to narrow and rough. Wagons and horses could
follow (Feller, 2001; Lewis, 1998). hardly get over them. There were no bridges across
Delayed by the county’s rugged topography the streams” (Feller, 2001, p. 22). For example,
(Table 1), physical isolation, and fractious politics, Keystone in neighboring McDowell County is 20
the railroad finally arrived in Wyoming County kilometers across the mountains from Mullens.
in 1906 (Cook, 2000). The Virginian Railway Prior to the railroad, shipping between the two
ran through eastern Wyoming County on its way locales took three days (Feller, 2001). These
south from Deepwater on the Kanawha River to primitive conditions assert the topographical
Princeton in Mercer County. Western Wyoming tyranny reflected in the pedestrian cost surface
County was not linked to the railroad until the modeling employed in this research.
late 1920s (Bowman, 1965). With the railroad came the coal industry. The
The railroad “meant the end of our way of life” county’s first mine was dug near Mullens in
remembered resident Howard Mullins. Kinsman 1912. As World War One grew demand, another
Foley Mullins reported that prior to the railroad, 14 mines opened in eastern Wyoming County.
“Transportation was very difficult as roads were County coal production, an incipient 1,848 tons
in 1913, surpassed one hundred thousand tons by
Table 1. Slope 1915, one million tons by 1918, and three million
tons by 1925 (Bowman, 1965).
Slope categories Wyoming Lower Greenbrier Population grew with the coal industry, much
County Valley of it imported from distant labor reservoirs. By
Flat (0-15% slope) 10% 49% 1925, there were 2,513 coal miners in the county,
Steep (15-35%) 31% 39% 36% of whom were African Americans from the
Very steep (35-100%) 59% 11% South and 7% were European immigrants (Trotter,

1156
Cultural Dasymetric Population Mapping with Historical GIS

1990).The county’s eastern coal districts, Barker’s farmland declined slightly through the 1920s,
Ridge and Slab Fork, more than doubled from a farming occupied a dramatically smaller propor-
combined population of 3,353 in 1910 to 7,343 tion of Wyoming Countians as mining expanded.
in 1920. Mullens, straddling the border of the two The 1920s topographic maps display the as-
districts, incorporated to become a town of 1,425 cendance of coal in eastern Wyoming County in
in 1920 (Figure 2). contrast with the county’s agrarian western por-
The cultural landscape of the World War One tion. Settlement patterns on the two sides of the
coal boom in Wyoming County was of a piece county diverge accordingly. As it winds through
with the regional experience. Colorfully named narrow valleys, the railroad connects dense clus-
coal camps like Bud, Caloric, and Montecarlo ters of housing in eastern Wyoming’s company
dotted the Virginian’s route. For example, Itmann, towns. The agricultural neighborhoods to the west
named for mine operator I.T. Mann, was con- share a dispersed pattern of farmsteads.
structed of 100 pre-fabricated houses over a three
month period in 1918 (Feller, 2001). Lower Greenbrier River Valley
Despite rural industrialization’s territorial
encroachments, county farmers found a market The Lower Greenbrier River Valley study area
in the timber and coal camps. Indeed, Lewis finds represents the long-established farm country of
that instead of an “antithetical” relationship, farms southern Appalachia’s ridge and valley physio-
coexisted “harmoniously” with lumber camps graphic province. Agricultural productivity is
and mines in Wyoming County (Lewis, 1998, p. insured by broad stream valleys underlain by
257). Farms and farmland increased from 1880 limestone formations (Table 1). As the mainstream
and peaked in 1910 with the new commercial of settlers flowed down the Shenandoah Valley
opportunities (Cook, 2000). Table 2 shows that in the mid 1700s, some branched off into south-
while the number of farms and the amount of eastern West Virginia. Disrupted by the French

Figure 2. Wyoming County

1157
Cultural Dasymetric Population Mapping with Historical GIS

Table 2. Wyoming County agriculture, 1880-1930

Date County Farms Estimated pct. of county Average farm size Pct. of County
Population population on farms (hectares) Territory in Farmland
1880 4,322 608 77 54 36
1890 6,247 862 76 56 35
1900 8,380 1,202 79 45 37
1910 10,392 1,462 77 44 41
1920 15,180 1,181 43 40 36
1930 20,926 1,154 34 36 32
Source: University of Virginia, 2004.

and Indian War, permanent settlement followed center of the valley’s lumber trade. Its huge state
independence. By the 1820s, the lower Greenbrier of the art mill stayed in operation until the timber
was well-integrated into the national economy by industry slowed in the 1920s. By 1910, some 25
the James River and Kanawha turnpike. Livestock other large sawmills had been built in the lower
were the leading regional export and local farms Greenbrier region at towns including Alderson,
also sold corn and small grains. The region also Fort Spring, and Union (Lewis, 1998; Rice, 1986).
boasted an early tourism attraction in mineral Early 20th Century rural population growth
springs resorts. Of these, White Sulphur Springs followed the route of the Greenbrier River and
remains the most prominent (Rice, 1986). C&O through the Greenbrier County section of
A contested Civil War theater, the Greenbrier the study area and bypassed the Monroe County
Valley quickly recovered to become one of the section lying to the south (Figure 3). The rural
new State of West Virginia’s leading stock-raising Greenbrier County portion grew from a population
centers. Facilitated by the construction of the of 4,912 in 1900 to 6,380 in 1920 while the rural
Chesapeake and Ohio Railroad (C&O) along the Monroe County portion fell from 4,820 in 1900 to
Greenbrier River in 1869, cattle, sheep, horses, and 4,434 in 1920. The economic development contrast
hogs were exported from valley farms to eastern between Monroe and Greenbrier counties is clearly
cities. In the late 19th and early 20th centuries, shown in Tables 3 and 4. Investment in timber and
local farmers continued to support themselves railroading grew Greenbrier’s population steadily.
with mixed farming that included corn, wheat, The 1920s growth spike came with the opening
oats, vegetables and fruit. The region’s agrarian of the far western portion of the county to coal
prominence is celebrated by the state fairgrounds and timber. While agricultural figures hold fairly
in Fairlea (Rice, 1986). constant, farming occupies a smaller proportion
The timber industry came to the region’s rich of the population. Monroe County, however,
hardwood forests in the 1870s. As in Wyoming remained exclusively agricultural throughout the
County, investors acquired large tracts of tim- period and Table 4 shows that farming expanded
berlands and initiated a natural resource boom to the county’s capacity by 1900 and remained
cycle. Conveyed at first by the river, then by the consistent thereafter.
rails, logs were processed in sawmills that sprang
up along the transportation corridor formed by Study Area Comparison
the Greenbrier River and the C&O. Ronceverte,
built in the 1870s, more than doubled in size from Density analysis reflects the dominance of agri-
1900 to 1910 to over 2,000 residents and was the culture in the Lower Greenbrier Valley compared

1158
Cultural Dasymetric Population Mapping with Historical GIS

Figure 3. Lower Greenbrier Valley

to Wyoming County. In the flatter, more fertile, 2010). Vacant land is far enough from settlement
and longer-established farmland of the Lower that it is unlikely to have been part of a farm. I
Greenbrier Valley, population density was half set the vacancy threshold at 800 meters from
again as high as in rural Wyoming County (Table the nearest structure on the basis of regional
5). To assess the distribution of density, I apply farm size. One hundred years ago, few southern
four settlement categories derived from historic Appalachian farms exceeded 200 hectares. For
Appalachian farm sizes: vacant, archetypal agri- example, in Summers County, 90 percent of farms
cultural, agricultural, and commercial (Towers, were smaller than 200 hectares. In a landscape

Table 3. Greenbrier County agriculture, 1880-1930

Date County Farms Estimated pct. of county Average farm size Pct. of County
Population population on farms (hectares) Territory in Farmland
1880 15,060 1,506 55 131 74
1890 18,034 1,872 57 82 58
1900 20,683 2,434 65 68 62
1910 24,833 2,551 56 53 50
1920 26,242 2,439 51 52 48
1930 35,878 2,242 38 53 44
Source: University of Virginia, 2004.

1159
Cultural Dasymetric Population Mapping with Historical GIS

Table 4. Monroe County agriculture, 1880-1930

Date County Farms Estimated pct. of county Average farm size Pct. of County
Population population on farms (hectares) Territory in Farmland
1880 11,501 1,248 60 78 79
1890 12,429 1,521 67 66 81
1900 13,130 1,794 75 56 81
1910 13,055 1,918 81 51 79
1920 13,141 1,834 77 52 78
1930 11,949 1,826 76 49 73
Source: University of Virginia, 2004

occupied by evenly spaced 200 hectare farms Agricultural land extends beyond archetypal
centered on farmhouses, the houses will be 800 agricultural densities to settlement patterns too
meters from each other. crowded to be consistent with Appalachian farm-
Archetypal agricultural land would have been ing. Eight hectares was the low end of the smallest
covered by average sized farms. Late 19th and census farm size category, implying that properties
early 20th southern Appalachian farms averaged of less than 8 hectares were not used as farms.
40 hectares in size. Allowing for these farms to Using the 40 hectares surrounding each cell for
be slightly unevenly spaced suggests that zero, density calculations means that five very small
one or two farmhouses should be found within a farms could coexist within this space and suggests
40 hectare area. Therefore, I chose 40 hectares as that six houses within the 40 hectare search area
the search area surrounding each cell on the den- serve as the threshold between agricultural and
sity surface. As ethnographic reports concur that commercial land. In terms of population density,
southern Appalachian farm households averaged this translates into an upper limit of 75 people per
about five members (Brown, 1988; Martin, 1984), square kilometer on agricultural land. Figures 4
archetypal agricultural population density is rep- and 5 and Table 5 confirm that population densi-
resented by non-vacant land containing fewer ties associated with agricultural settlement were
than 25 people per square kilometer. much more prevalent in the Lower Greenbrier
Valley than in Wyoming County.

Table 5. Population density


DISCUSSION
Population Density Indicators Wyoming Lower
County Greenbrier Ethnography documents that early 20th Century
Valley
southern Appalachian agricultural neighborhoods
1920 Study Area Population 15,270 16,174 provided a spatial structure for community interac-
Study Area (sq. km) 1,299 908 tion and identity. In previous work, I demonstrated
Population per sq. km 11.8 17.8 a GIS methodology that reliably reconstructs the
Vacant 29% 6% boundaries of these miniature culture regions
Archetypal agri- 61% 80% and is a basis for cultural dasymetric population
Density cultural
categories mapping (Towers, 2010).
Agricultural 8% 12%
Commercial 3% 2%

1160
Cultural Dasymetric Population Mapping with Historical GIS

Figure 4. Population density map of Wyoming County

Figure 5. Population Density map of the Lower Greenbrier Valley

1161
Cultural Dasymetric Population Mapping with Historical GIS

Applying the GIS methodology to the two Figures 6 through 9 are dasymetric population
study areas generated boundaries around rural maps based on the GIS-generated communities.
communities and towns. Rural communities were In Wyoming County, the rural portions of the
organized around a single node – a post office, seven magisterial districts averaged 5.8 people
school, or church - by the cost allocation process. per mapped structure with only two of the districts
Towns were represented by several nodes on the more than one person from the mean: Barkers
town’s periphery. The territories allocated to the Ridge at 4.5 and Clear Fork at 7.4 people per
nodes were combined to represent the town’s ef- house. In the Lower Greenbrier Valley’s seven
fective vicinity, not its jurisdictional boundaries. rural magisterial district areas, the average was
Calculating populations for GIS-generated 4.6 people per house with all of the districts be-
neighborhoods is a straightforward process. 1920 tween 4.0 and 5.1. Within city limits, population
Census populations for enumeration units - incor- per house averaged 5.7 in Wyoming County and
porated towns and the unincorporated portions of 6.1 in the Lower Greenbrier Valley.
magisterial districts - were divided by the number The communicative value of the dasymetric
of houses mapped within each enumeration area population maps is readily compared with cho-
to provide a population estimate for each house. ropleth maps and density surfaces. Choropleth
The estimated population of the houses in each population mapping is shown in Figures 10 and 11.
GIS neighborhood is then summed to produce Drawn from 1920 data for magisterial districts and
neighborhood population estimates. Tobler’s pyc- incorporated towns, the choropleth maps hint at the
nophylactic requirement that dasymetric divisions population shift to eastern Wyoming’s coalfields
combine to reconstitute original population totals and at the even population distribution across the
is easily met with this method (Tobler, 1979). Lower Greenbrier Valley. As enumeration units
Because the original census population is entirely imposed for administrative purposes, the magiste-
distributed across the mapped houses, it is recover- rial districts carried limited functional significance
able from any regional recombination of houses. but were not an explicit basis for social identity.

Figure 6. Dasymetric map of Wyoming County showing population

1162
Cultural Dasymetric Population Mapping with Historical GIS

Figure 7. Dasymetric map of Wyoming County showing population density

Figure 8. Dasymetric map of the Lower Greenbrier Valley showing population

1163
Cultural Dasymetric Population Mapping with Historical GIS

Figure 9. Dasymetric map of the Lower Greenbrier Valley showing population density

Figure 10. Choropleth Population map of Wyoming County

1164
Cultural Dasymetric Population Mapping with Historical GIS

Furthermore, at the county scale, the large size of In contrast, the even population densities of the
the magisterial districts makes them vulnerable to Lower Greenbrier Valley reflect the agrarian
the ecological fallacy. The obscuration of variation landscape of a premier farming region. Density
within the relatively large districts is demonstrated discontinuities, however, cannot distinguish be-
by the density surfaces and dasymetric maps. tween distinct coal camps nor draw out the neigh-
The density surfaces (Figures 4 and 5) are borhoods that comprised the Lower Greenbrier
constructed from house locations assigned esti- farm country.
mated populations and are calculated at the fine Informed by ethnographic ancillary data, the
grain of 30 meter raster cells using a 40 hectare dasymetric maps (Figures 6 through 9) display
search area. Discontinuities between the four population at the intermediary scale of lived
density categories defined above are informative. communities. For example, descriptive statisti-
For example, the knotted string of coal camps cal analysis of the dasymetric neighborhoods
lining the Virginian Railway stands out on the identifies three groups of rural communities:
Wyoming County map. The uneven rural settle- Wyoming coal camps, Wyoming agricultural
ment of agriculturally marginal, yet unindustrial- neighborhoods, and Lower Greenbrier Valley
ized western Wyoming County is also manifest. agricultural neighborhoods. Tables 6 and 7 show

Figure 11. Choropleth Population map of the Lower Greenbrier Valley

1165
Cultural Dasymetric Population Mapping with Historical GIS

Table 6. Wyoming County communities

Community statistics Agricultural neighborhoods Coal camps Mullens Pineville


Number 105 20 1 1
Average size (sq. km) 10.6 8.1 8.0 16.6
Standard deviation of size 7.6 5.6 NA NA
Average population 74 281 1,446 383
Standard deviation of pop. 49 209 NA NA
Population per sq. km 7.0 34.7 173.7 23.1

Table 7. Lower Greenbrier Valley communities

Community statistics Agricultural neighborhoods Towns1


Number 153 4
Average size (sq. km) 5.6 11.4
Standard deviation of size 3.7 NA
Average population 65 1,550
Standard deviation of pop. 42 NA
Population per sq. km 11.6 134.7
1. Towns are Alderson, Lewisburg, Ronceverte, and Union.

that average density in Wyoming’s coal camps is mapping, in contrast, does not model the identity
almost five times that in neighboring agricultural and interaction that socialize space. Like Trodd
neighborhoods. A further contrast may be drawn and Gearey’s generation of perceptual regions and
between agricultural neighborhoods in the two Grannis’ analysis of social networks, my use of
study areas. Lower Greenbrier Valley agricultural historical ethnography suggests a diffuse yet shared
neighborhoods are smaller and denser, reflecting direction for further cultural dasymetric mapping.
the region’s superior agricultural setting.

ACKNOWLEDGMENT
CONCLUSION
The author thanks the editor of this special edition
The study presented above is intended to advance of the IJAGR, Rebecca Dobbs, for her interest in his
the case for cultural dasymetric mapping. As in the work. He also thanks the anonymous reviewers for
southern Appalachian example, cultural dasymetric their very insightful and productive suggestions.
mapping involves merging contextually sensitive
qualitative research with census geography or other
generic data. Cultural dasymetric methodologies REFERENCES
are not necessarily transferable across space and
time but demonstrate the productivity of deepening Boone, C. G. (2008). Improving resolution of cen-
contextual knowledge to choose the combination sus data in metropolitan areas using a dasymetric
of research resources that best maps the scale of approach: Applications for the Baltimore ecosys-
lived experience. Contemporary density dasymetric tem study. Cities and the Environment, 1(1), 3.

1166
Cultural Dasymetric Population Mapping with Historical GIS

Bowman, M. K. (1965). Reference book of Grannis, R. (2005). T-communities: Pedestrian


Wyoming County history. Parsons, WV: McClain street networks and residential segregation in
Printing Company. Chicago, Los Angeles, and New York. City &
Community, 4(3), 295–322. doi:10.1111/j.1540-
Brown, J. S. (1988). Beech Creek: A study of a
6040.2005.00118.x
Kentucky mountain neighborhood. Berea, KY:
Berea College Press. Gulliford, A. (1996). America’s country schools
(3rd ed.). Niwot, CO: University Press of Colo-
Chen, K., McAneney, J., Blong, R., Leigh, R.,
rado.
Hunter, L., & Magill, C. (2004). Defining area
at risk and its effect in catastrophe loss estima- Hart, J. F. (1975). The look of the land. Upper
tion: A dasymetric mapping approach. Applied Saddle River, NJ: Prentice Hall.
Geography (Sevenoaks, England), 24(2), 97–117.
Hasse, J., & Lathrop, R. G. (2003). A housing-
doi:10.1016/j.apgeog.2004.03.005
unit-level approach to characterizing residential
Cook, S. R. (2000). Monacans and miners: Na- sprawl. Photogrammetric Engineering and Re-
tive American and coal mining communities in mote Sensing, 69(9), 1021–1030.
Appalachia. Lincoln, NB: University of Nebraska.
Holt, J. B., Lo, C. P., & Hodler, T. W. (2004).
Crandall, M., & Weber, B. (2005). Defining rural Dasymetric estimation of population density and
Oregon: An exploration. Corvallis, OR: Oregon areal interpolation of census data. Cartography
State University. and Geographic Information Science, 31(2),
103–121. doi:10.1559/1523040041649407
Daniels, T. (1999). What to do about rural sprawl?
Paper presented at the American Planning As- Holy, T. C. (1929). Survey of education in West
sociation Conference, Seattle, WA. Virginia: Vol. III. School buildings. Charleston,
WV: West Virginia State Department of Education.
Feller, J. W. (2001). Memories and photos of
“Mullins,” West Virginia: Vol. 1. 1894 to 1925. Irwin, E. G., & Bockstael, N. E. (2002). Interacting
St. Albans, WV: Harless Printing Company. agents, spatial externalities, and the evolution of
residential land use patterns. Journal of Economic
Fielder, R., Schuurman, N., & Hyndman, J. (2004).
Geography, 2, 31–54. doi:10.1093/jeg/2.1.31
Improving census-based socioeconomic GIS for
public policy: Recent immigrants, spatially con- Khomarudin, M. R., Strunz, G., Post, J., Zobeder,
centrated poverty and housing need in Vancouver. K., & Ludwig, R. (2008, November). Spatial im-
ACME: An International E-Journal for Critical provement of information on population distribu-
Geographies, 4(1), 145–171. tion using GIS approaches: An input for tsunami
people exposure assessment. Paper presented at
Garb, J. L., Cromley, R. G., & Wait, R. B. (2007).
the International Conference on Tsunami Warning
Estimating populations at risk for disaster pre-
(ICTW), Bali, Indonesia.
paredness and response. Journal of Homeland
Security and Emergency Management, 4(1), 3. Kirk, R. W. (2009). Land use and terrestrial
doi:10.2202/1547-7355.1280 carbon storage in Western North Carolina from
1850-2030: A historical reconstruction and simu-
Goodchild, M. F., Anselin, L., & Deichmann, U.
lation study. Unpublished doctoral dissertation,
(1993). A framework for the areal interpolation
University of Minnesota, Duluth, MN.
of socioeconomic data. Environment & Planning
A, 25, 383–397. doi:10.1068/a250383

1167
Cultural Dasymetric Population Mapping with Historical GIS

Langford, M., & Unwin, D. J. (1994). Generating Mennis, J. (2003). Generating surface models
and mapping population density surfaces within of population using dasymetric mapping. The
a geographical information system. The Carto- Professional Geographer, 55(1), 31–42.
graphic Journal, 31(1), 21–26.
Mennis, J. (2009). Dasymetric mapping for es-
Lewis, R. L. (1998). Transforming the Appa- timating population in small areas. Geography
lachian countryside: Railroads, deforestation, Compass, 3(2), 727–745. doi:10.1111/j.1749-
and social change, 1880-1920. Chapel Hill, NC: 8198.2009.00220.x
University of North Carolina.
Mesev, V., & McKenzie, P. (2008). Urban neigh-
Maantay, J., & Maroko, A. (2009). Mapping ur- borhood pattern recognition using high spatial
ban risk: Flood hazards, race, and environmental resolution remotely sensed data and point-based
justice in New York. Applied Geography (Sev- GIS data sources. In Wise, S., & Craglia, M.
enoaks, England), 29(1), 111–124. doi:10.1016/j. (Eds.), GIS and evidence-based policy making
apgeog.2008.08.002 (pp. 19–42). Boca Raton, FL: CRC Press.
Maantay, J., Maroko, A., & Culp, G. (2009). Us- Minetti, A. E., Moia, C., Roi, G. S., Susta, D., &
ing geographic information science to estimate Ferretti, G. (2002). Energy cost of walking and
vulnerable urban populations for flood hazard and running at extreme uphill and downhill slopes.
risk assessment in New York City. In Showalter, Journal of Applied Physiology, 93, 1039–1046.
P. S., & Lu, Y. (Eds.), Geospatial techniques in
Moon, Z. K., & Farmer, F. L. (2001). Population
urban hazard and disaster analysis (pp. 71–97).
density surface: A new approach to an old problem.
New York, NY: Springer. doi:10.1007/978-90-
Society & Natural Resources, 14, 39–49.
481-2238-7_5
Moudon, A. V., Lee, C., Cheadle, A. D., Garvin, C.,
Maantay, J. A., Maroko, A. R., & Porter-Morgan,
Johnson, D., & Schmid, T. L. (2006). Operational
H. (2008). A new method for mapping popula-
definitions of walkable neighborhood: Theoreti-
tion and understanding the spatial dynamics of
cal and empirical insights. Journal of Physical
disease in urban areas: Asthma in the Bronx,
Activity & Health, 3(1), 99–117.
New York. Urban Geography, 29(7), 724–738.
doi:10.2747/0272-3638.29.7.724 Openshaw, S. (1984). The modifiable areal unit
problem: Concepts and techniques in modern
Martin, C. E. (1984). Hollybush: Folk building
geography. Norwich, UK: Geo Books.
and social change in an Appalachian community.
Knoxville, TN: University of Tennessee Press. Openshaw, S. (1996). Developing GIS-relevant
zone-based spatial analysis methods. In Longley,
Martin, D. (1996). An assessment of surface
P., & Batty, M. (Eds.), Spatial analysis: Modelling
and zonal models of population. International
in a GIS environment (pp. 55–74). New York, NY:
Journal of Geographical Information Systems,
John Wiley & Sons.
10(8), 973–989. doi:10.1080/026937996137684
Rice, O. K. (1986). A history of Greenbrier County.
Mennis, J. (2002). Using geographic information
Lewisburg, WV: Greenbrier Historical Society.
systems to create and analyze statistical surfaces
of population and risk for environmental jus-
tice analysis. Social Science Quarterly, 83(1),
281–297. doi:10.1111/1540-6237.00083

1168
Cultural Dasymetric Population Mapping with Historical GIS

Schuurman, N., Leszczynski, A., Fielder, R., Gr- Trodd, N., & Gearey, W. M., Jr. (2006). A participa-
und, D., & Bell, N. (2006). Building an integrated tory GIS-based method to construct output areas
cadastral fabric for higher resolution socioeco- for neighbourhood analysis and social planning.
nomic spatial data analysis. In Proceedings of Retrieved from http://www.urisa.org/gearey
the 12th International Symposium on Progress
Trotter, J. W. Jr. (1990). Coal, class, and color:
in Spatial Data Handling (pp. 897-920).
Blacks in Southern West Virginia, 1915-32. Ur-
Theobald, D. M. (2005). Landscape patterns of bana, IL: University of Illinois.
exurban growth in the USA from 1980 to 2020.
University of Virginia, Geospatial and Statistical
Ecology and Society, 10(1), 32.
Data Center. (2004). Historical census browser.
Tobler, W. (1979). Smooth pycnophylactic inter- Retrieved from http://fisher.lib.virginia.edu/col-
polation for geographical regions. Journal of the lections/stats/histcensus/index.html
American Statistical Association, 74, 519–530.
Wilhelm, G. Jr. (1978). Folk settlements in the
doi:10.2307/2286968
Blue Ridge Mountains. Appalachian Journal,
Towers, G. (2010). Rediscovering rural Appala- 5(2), 204–245.
chian communities with historical GIS. South-
Wright, J. K. (1936). A method of mapping
eastern Geographer, 50(1), 58–82. doi:10.1353/
densities of population with Cape Cod as an
sgo.0.0065
example. Geographical Review, 26, 103–110.
doi:10.2307/209467

This work was previously published in the International Journal of Applied Geospatial Research (IJAGR), Volume 2, Issue 4,
edited by Donald Patrick Albert, pp. 38-56, copyright 2011 by IGI Publishing (an imprint of IGI Global).

1169
1170

Chapter 70
Using Volunteered Geographic
Information to Assess the
Spatial Distribution of West
Nile Virus in Detroit, Michigan
Kevin P. McKnight Ashton M. Shortridge
Michigan Department of Transportation & Michigan State University, USA
Michigan State University, USA
Meghan D. Burns
Joseph P. Messina Montana Natural Heritage Program, USA
Michigan State University, USA
Bruce W. Pigozzi
Michigan State University, USA

ABSTRACT
West Nile Virus is a vector-borne flavivirus that affects mainly birds, horses, and humans. The disease
emerged in the United States in 1999 and by 2001 had reached Michigan. In clinical human cases, the
most common symptoms are fever, weakness, nausea, headache, and changes in mental state. The crow is
the most common wildlife host in the life cycle of the virus. The state of Michigan, through the Michigan
Department of Community Health, collected the spatial locations of over 8,000 dead birds (Corvidae),
statewide, during 2002. The large number of samples made spatial and temporal hotspot detection pos-
sible. However, the volunteer reporting method produced a dataset with a direct correlation between
the numbers and locations of the dead birds and human population density and accurately identifying
hotspots remains a challenge. Geographic variation in dead bird intensity was modeled using both
global and local spatial clustering algorithms. Statistical models identified overall spatial structure and
local clustering. Identification of hotspots was confounded by limited information about the collection
procedures, data availability and quality, and the limitations of each method.

DOI: 10.4018/978-1-4666-2038-4.ch070

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Using Volunteered Geographic Information to Assess the Spatial Distribution of West Nile Virus

INTRODUCTION example, during a wildfire, a cell phone equipped


person on the spot may be in a much better posi-
West Nile Virus (WNV) is a vector-borne RNA tion to inform the public about local conditions.
flavivirus that affects mainly birds, horses, and Important advantages of VGI in the context of
humans (Anderson, Vossbrinck, Andreadis, Iton, this paper include cost-efficiency, timeliness,
Beckwith, 2001). This paper considers the spatial and citizen engagement with public health is-
distribution of West Nile Virus across a major sues. However, its utility for subsequent spatial
metropolitan region from a medical geography analysis may be questionable due to biases and
perspective, which is concerned with the relation- data quality concerns.
ship between diseases and their spatial contexts. Focusing on data collected during the 2002
Examination of the timing and geographic locations WNV epidemic in southeastern Michigan, USA,
of disease cases, as well as their environmental this paper explores two related research questions.
and social settings, can lead to identification of
patterns and associations among the individual 1. Are volunteered geographic health data
events together with key covariates. These as- suitable for use in statistical spatial hot spot
sociations may reflect underlying environmental detection applications?
influences and lead to greater understanding about 2. Assuming that the previous question is true,
the factors and drivers affecting the transmission then during the 2002 WNV epidemic in
and geographic diffusion of a disease, greatly Michigan, did spatial or temporal clusters
facilitating the response of public health officials. emerge that might be indicators of future
An important innovation in medical geography is risks?
the development of statistically-driven hot spot
detection methods (Anselin, 1995; Fotheringham The remainder of this introduction provides
& Zhan, 1996; Openshaw, 1996). Such methods important background on WNV, its spread in
are typically employed when extensive and often southeastern Michigan, and the surveillance meth-
exhaustive, high-accuracy spatial data on disease ods employed there. Then it considers the nature
are available (Ozdenerol, Bialkowska-Jelinska, of available VGI for the epidemic in this region.
Taff, 2008). Frequently, secondary demographic It identifies critical uncertainties in the available
datasets are employed to model the population at information that limit the kinds of questions that
risk, thereby accounting for spatial variation in that can be asked and conclusions that can be drawn.
background population and helping identify anoma- The paper continues by discussing data processing,
lous hot spots (Cooke, Katarzyna, Wallis, 2006). including geocoding, which is a critical operation
In contrast to the contexts of previous stud- for using VGI, and describes the spatial hotspot
ies with highly accurate and extensive data, this detection methods used in the research. We then
paper is concerned with the employment of vol- report our findings, and discuss what those findings
unteered geographic information in the analysis actually indicate about the spatial and temporal
and detection of hot spots. The term, “volunteered spread of WNV across the Detroit metropolitan
geographic information,” (VGI) is used to indicate area in 2002.
data that are measured and reported by the public.
The accuracy and completeness of such data may
be quite variable; in some cases, it may be more WEST NILE VIRUS
reliable than authoritative sources (Goodchild,
2008), especially in periods of rapid and critical In 1999, WNV became an endemic disease in the
change such as natural or human disasters. For Western Hemisphere. The disease was first iden-

1171
Using Volunteered Geographic Information to Assess the Spatial Distribution of West Nile Virus

tified in New York City in 1999, resulting in 67 tally of 614 confirmed human cases. Of the nine
human cases and seven deaths (Crave & Roehrig, counties surveyed in the Lower Peninsula, seven
2001). Over the next few years, the disease spread produced WNV-positive mosquito pools resulting
across the United States through a bird-mosquito- in a total of 58 positive pools. The majority of
bird cycle in which the birds serve as the viral the WNV-positive mosquitoes were Culex, but
reservoir and mosquitoes spread the virus through- a Coquillettidia perturbans pool tested positive
out susceptible populations. When environmental as well (State of Michigan, 2006). At the time of
factors such as weather or host immunity boost writing the program continues, with the State of
viral amplification, an increase in the number of Michigan providing web-based services to report
infected “bridge vector” mosquitoes (mosquitoes dead birds and mammals while the USGS main-
that bite both humans and birds) results (Petersen tains disease maps for reported corvid, human,
& Marfin, 2002). Currently 43 species of mos- mosquito, sentinel, and veterinary cases.
quito have tested positive for WNV (Marra et al., This paper focuses on the 2002 dead bird
2004), though the Culex species of mosquito is data set for the Greater Detroit metropolitan area
the most common bridge vector transferring the (Wayne, Oakland, and Macomb counties), Figure
virus from birds to humans. Among birds, the 1, for three reasons. First, Metro Detroit, with
American crow (Corvus brachyrhynchos) and about 4,043,000 people and covering about 2,150
the blue jay (Cyanocitta cristata) in the family square miles, is the largest population center in
Corvidae have seen the highest mortality rates. Michigan and one of the major urban regions in
In the northeastern United States increases in the United States. The potential hazard offered by
crow deaths have been quickly followed by the WNV is greatest in this region. Second, this area
emergence of WNV cases in humans (Petersen & contains a majority of the 2002 dead bird records.
Marfin, 2002). In its first five years in the United This abundance makes it particularly tractable for
States, the disease had infected at least 400,000 conducting high spatio-temporal resolution hot
humans and killed countless birds, mammals, and spot analysis. Third, geographic, demographic,
reptiles (Dodd, 2003; Marra et al., 2004). land cover and road framework, data are available
Beginning in 2001, disease surveillance in and relatively accurate, making both geocoding
Michigan was conducted in part via a state man- and multivariate analysis particularly feasible.
aged toll-free hotline which citizens could call The combination of widespread human popula-
to report dead bird sightings. The location and tion density, a large number of dead bird records,
condition of each bird was recorded and the bird mostly developed landcover, and accurate spatial
was tested for the virus. By August 2001, WNV referencing information makes Greater Detroit
was confirmed in dead crows in Michigan. Of the the overall best site for this research in the state
142 dead crows and 79 blue jays tested in 2001, of Michigan.
63 crows and 2 blue jays tested WNV positive. In
that first year of monitoring, WNV was detected
in corvids in ten counties (Wayne, Oakland, Ma- VOLUNTEERED GEOGRAPHIC
comb, Washtenaw, Jackson, Calhoun, Ingham, INFORMATION
Barry, Ottawa, and Muskegon) as well as in three
mosquito pools (two in Oakland County and one in What kinds of questions could we ask of this
Macomb County). The 2002 transmission cycle in dataset, and what assumptions must be made?
Michigan was the most significant. With continu- One approach is to assume that the likelihood of
ous surveillance, the first human WNV cases were reporting a dead bird varies by spatial location,
reported on August 16, 2002 with a final epidemic in part, as a function of the human population

1172
Using Volunteered Geographic Information to Assess the Spatial Distribution of West Nile Virus

Figure 1. Detroit Metropolitan Area study site with dead bird (crows) GAM clusters by month

density around that location. The logic behind These could be due to errors of omission rather
this assumption is that the more human eyes there than an actual lack of bird mortality:
are in an area, the more likely a dead bird is to be
reported. By controlling for human population, we • People did not see dead birds
can identify locations with anomalous concentra- • People did not report dead birds
tions of dead birds. Anomalous concentrations in a
particular area may be due to one or a combination These errors of omission could be due to
of several factors: geographic variability in human propensity to
identify dead birds because of environmental or
• Elevated rate of WNV among corvids social barriers to interaction (dense vegetation,
• Elevated mortality rate due to non-WNV neighborhood design or demographics leading to
causes fewer people spending time out of doors). Under-
• Elevated population of corvids reporting might also be due to spatial variation
• Higher propensity of volunteers to call in in the public’s awareness or interest in the WNV
birds surveillance program. Finally, low numbers of
dead birds at a particular location might simply
A second and potentially much more serious be due to smaller concentrations of corvids in that
problem is the interpretation of areas with anoma- area. The overall infection or fatality rate might
lously low concentrations of reported dead birds, be just as high as in other places, but the location
after controlling for human population density. would not be flagged.

1173
Using Volunteered Geographic Information to Assess the Spatial Distribution of West Nile Virus

Consequently, the utility of volunteered geo- less clear. Given the combined costs associated
graphic information for identifying anomalous with large numbers of cases and patient privacy
concentrations of WNV-infected corvids appears regulations, the collection and use of disease
very limited, despite an intensive data processing surrogates as the dead birds, in this case, is very
effort that resulted in exceptionally clean geocoded cost effective (Bonner, Daikwon, Nie, Rogerson,
information and the use of modern spatial statistics. Vena, & Freudenheim, 2003).
These data seem to be more useful for what they From the original statewide database only the
suggest about the likelihood for human interac- crows and blue jays were selected for geocoding.
tion with WNV. Each record is in fact a report Since the MDCH recorded the address only once
about a close encounter between a human and a and then recorded the number of birds of each
dead bird, and may therefore serve as an indicator species, a separate record had to be created for
of exposure risk. In this case we simply have to each bird. Misspellings or capitalization variants of
assume that the bird was infected, and, in turn, crow and blue jay were also queried and replaced.
infected more mosquitoes in close proximity to The geocoding process was completed in ArcGIS
its place of death, and that the chance of a human (ESRI, 2007). The reference data layer utilized
being bitten by an infected mosquito is therefore was the Michigan Framework All Roads data, last
elevated in that area. This approach avoids prob- updated May 1, 2009, and provided by the state of
lems associated with several, though not all, of Michigan; available for download from the Michi-
the factors listed above. In the discussion later in gan Geographic Data Library (Michigan Center
the paper, we describe what steps are possible to for Geographic Information). Since this database
assess the impact of some of those factors. contains address range information for both the
left and right side of each street, the dead bird
addresses could be geocoded to their approximate
DATA PREPROCESSING positions along the length and the correct side of
the street segment. For the matching options the
After the research team obtained the dead bird spelling sensitivity was set at 80, the minimum
data from the state, substantial preprocessing was candidate score at 10, and the minimum match
required prior to analysis. The street addresses of score at 60. These parameter settings improved
dead birds had to first be geocoded. Geocoding, or the completeness of the geocoding operation by
address matching, is a process by which addresses relaxing the constraints on spelling accuracy and
are given geographic coordinates (e.g., latitude the numeric address inputs. Resulting geographic
and longitude) based on a reference layer or a coordinates for each dead bird record were added to
street database (Bonner, Daikwon, Nie, Rogerson, the dataset. Each address was checked individually
Vena, & Freudenheim, 2003). Where no explicit to make sure it was located in the proper county
coordinates at the land parcel level exist within or city as the other attributes collected. In the case
the reference layer, address range data for street of potentially misspelled street names the street
segments are employed to interpolate coordi- name was changed if it matched street address
nates for the address. The process of geocoding records in the data collection. In some cases the
is indispensable in spatial epidemiology given standardized street address had to be changed to
that it is a relatively inexpensive and efficient match the roads database.
mechanism for locating cases. For volunteered An extensive manual procedure was conducted
geographic information, where citizens gener- for records with street addresses that could not
ally identify location with addresses rather than be matched. First, the address was searched us-
spatial coordinates, establishing a cost basis is ing the Internet mapping engine MapQuest™.

1174
Using Volunteered Geographic Information to Assess the Spatial Distribution of West Nile Virus

If the street address could not be located using several models can account for associated popu-
MapQuest™, the nearest street intersection was lation density around dead bird locations. Popula-
inspected to determine whether the street address tion data were obtained from the 2000 US Census
numbers had changed, more houses had been built for this purpose and census block group data, the
since the roads file was last updated, or the street finest spatial resolution dataset available for the
name was misspelled. If the address still could not entire study area, were employed.
be located then a rematch was attempted using
the nearest street intersection. If this final effort
failed, or if more than one possible location was SPATIAL STATISTICS
identified, then the address remained unmatched
and was discarded. After preprocessing was completed, three broadly
Once the addresses were geocoded, an analysis distinctive detection models were used: Moran’s
of the matched addresses was conducted. Only I, Local Indicators of Spatial Association (LISA),
those corvid entries that contained a bird type
and sighting data were used. For computational Table 1. Crows by county
efficiency, the sighting dates were converted to
the day of the year (ordinal date). For example, County Dead Bird Population Ratio
May 1, 2002 would be converted to the 121st day Records

of the year. Additionally, the week number was Oakland 887 1,194,156 .00074278

assigned to each dead corvid sighting. Macomb 502 787,625 .00063736

The dataset was clipped to the spatial extent Wayne 1,108 2,061,162 .00053756

of the study area (Oakland, Macomb, and Wayne TOTAL 2,497 4,042,943 .00061762
counties) (Figure 1). This reduced the original
8,249 statewide records to a study area total of Table 2. Crows by month
2,533 bird records, which were subjected to further
quality assessment. With the dataset cleaned of Month Records
unreliable records, 2,497 dead bird locations, all April 2
of which were crows, remained. Crow locations May 32
were attributed with spatial boundary informa- April and May 34
tion. Using a tagging function, township, city, and June 59
block group information were appended to each June 1-15 34
data point. This final data preparation process was June 16-30 25
completed using TransCAD™ (Caliper, 2006). At July 396
this point, the dataset was considered finished. As July 1-10 20
Table 1 illustrates, most of the dead birds were July 11-20 108
found in Wayne County, with Oakland and Ma-
July 21-31 268
comb Counties having comparable ratios of dead
August 1,826
birds to people. The data were then partitioned by
August 1-10 875
month and within the month over the course of
August 11-20 461
the transmission season (Table 2).
August 21-31 490
Given the method of dead bird data collection,
September 166
it appears likely that the underlying human
October 16
population influenced the bird data simply due to
TOTAL 2,497
opportunity. By incorporating human population,

1175
Using Volunteered Geographic Information to Assess the Spatial Distribution of West Nile Virus

and the Geographic Analysis Machine. The first Where N equals the number of regions, wij is
two tests were completed with the ClusterSeer™ a weight representing the connectivity between
software package and the third was coded using areas i and j, zi is the rate in zone i centered around
the statistical programming language R. These the mean rate (using zi = xi – x ; xi is the rate in
approaches were chosen because they allowed a zone i); and S0 is the sum of the weights.
progression from global cluster detection using
solely the dead bird data, through neighborhood- N N

scale metrics for hot-spot identification, to an


S0 = ∑ ∑w ij
,i ≠ j (2)
i= 1 j= 1
adaptive model assessing local dead bird con-
centrations while accounting for the underlying
human population. Global metrics only consider The expectation of I for a spatially random
whether the pattern is non-spatially random and process is:
not where anomalous concentrations are located.
Local models assess the spatial dependence of −1
E (I ) = (3)
deviations in attribute values from their mean: that (N − 1)
is, the second order spatial properties (Anselin,
1995; Bailey & Gatrell, 1995). Locally elevated
The expectation of I gets closer to 0 as N
concentrations can then be identified and mapped
increases, and can range from -1 to 1. The vari-
as clusters. By investigating patterns for different
ance of I for a spatially random process depends
time periods during 2002, the pattern and intensity
on the assumptions of the research question. In
of a spreading virus epidemic can be estimated.
this work, we employ a random permutation
The Moran Contiguity Coefficient, called
hypothesis, in which the observed I is presumed
Moran’s I, is a weighted correlation coefficient
to be drawn at random from the distribution of
used to detect departures from spatial random-
I metrics calculated on the attribute set. Monte
ness (Moran, 1950). It is typically calculated for
Carlo simulations are used to construct a ranked
aggregated variables (here, dead bird counts)
distribution of I values; the empirical I is compared
stored in geographical zones; in this research
with this distribution to determine significance.
these zones were generated by dividing the study
Moran’s I is a global statistic, a single number
area into a 10 km by 10 km grid. Departures from
representing the overall degree of spatial autocor-
randomness may suggest the presence of patterns
relation in a variable measured in spatial zones.
such as clusters, but may also identify geographic
It does not provide any information about where
trends. Positive spatial autocorrelation means that
elevated values might be grouped within the study
nearby areas tend to have similar values; in this
area. Such areas may be identified through the use
case, the numeric value for Moran’s I will be large
of local indicators of spatial association, or LISA
and positive. A value close to 1 is indicative of
statistics. Local Moran’s I is one such metric;
high positive spatial autocorrelation or a clustered
it is literally a disaggregated global Moran’s I,
spatial structure. Moran’s I is defined as:
defined as:
N N
N ∑ ∑ wij z i z j N

i= 1 j= 1
N ∑ wij z i z j
I= (1) j= 1
N Ii = (4)
S0 ∑ z 2 N

i −1
i
S0 ∑ z 2
i
i −1

1176
Using Volunteered Geographic Information to Assess the Spatial Distribution of West Nile Virus

Where the terms are as defined for Equation The result of the filtering process is a set of
3. The Moran metric I is calculated for each loca- circles containing significantly elevated numbers
tion i in the region, and can be mapped. A high of cases. These circles are then smoothed with a
positive value for location i indicates the presence kernel density function to generate maps of hot
of clusters of similar attribute values around that spots (Openshaw, Charlton, Wymer, & Craft,
location. A value near zero indicates that the local 1987). As GAM employs a variety of circle radii, it
area under consideration or the average of the im- does not require the specification of a single spatial
mediate neighbors of the cell, or both, is near zero. scale for cluster presence. Although it incorporates
The Geographic Analysis Machine (GAM) is statistical testing to identify cluster significance,
a multi-scale cluster location tool used to analyze it is considered an exploratory method.
spatial point distributions to find evidence of lo-
calized geographical clustering while accounting
for background population at risk. It was initially RESULTS
designed to identify disease clusters such as child-
hood leukemia (Openshaw, Craft, Charlton, & The data were aggregated to a 10 by 10 grid over
Birch, 1988; Openshaw, 1996). The process is a the study area and partitioned by month due to the
computationally intensive approach for automated density of the data points. This enabled the calcu-
cluster identification. It incorporates a technique lation of global and local Moran statistics on the
that compares the intensity of events within spatial pattern of aggregated bird counts for time
circles of varying radii; the circles are centered periods throughout the transmission season. The
on a fine grid imposed over the area of interest. Moran’s I result for all records was 0.63, showing
The observed number of events (e.g., dead birds) strong, significant positive spatial autocorrelation
within each circle is compared to an expected with substantial temporal variation. Early in the
number of events based on a derived background season the Moran’s I value showed a small peak
rate, which controls for population, via a Poisson in May rising to 0.27. There is a brief drop-off
test. Circles with significantly elevated numbers during June, with a sharp increase toward the latter
of observed cases are then flagged and retained. half of the month. The Moran’s I value increases
An important user decision in the GAM model through July showing increased autocorrelation
is the rejection criterion for the significance test. with the seasonal peak in the first part of August
The level of significance serves as a filter to reject with a value of 0.64. The model for August showed
circles with lower rates. We chose to set this level the greatest positive autocorrelation.
high, (Openshaw, Craft, Charlton, & Birch, 1988), Maps of Local Moran’s I suggest some tempo-
to focus on areas of greatest cluster intensity. The ral and spatial pattern consistency in the aggregated
models were all initially completed with a 0.01 dead bird data. For all data, clusters appeared to
rejection criterion; however this caused large be concentrated near the center of the study area
areas to be identified as significant. Therefore, (Birmingham/ Royal Oak areas) and along the
smaller thresholds were employed, as low as 1 x eastern edge (Grosse Pointes area). Monthly totals
10-9, to filter out large numbers of circles. Due to provide a perspective on changes through time.
the number and locational variability of the data Insufficient numbers of records for April forced
records in each time period and the underlying their inclusion with the records from May for
population in these locations, different confidence modeling. Results for April/May showed cluster-
intervals were required to achieve the greatest ing in areas with greater human population. For
focus of the cluster detection. June, higher concentrations shifted to the Livonia/
Westland area. The July results demonstrated a

1177
Using Volunteered Geographic Information to Assess the Spatial Distribution of West Nile Virus

cluster pattern in the inner suburbs surrounding DISCUSSION


the city of Detroit, with moderate intensity in the
cities of Royal Oak and Warren. The August data, The preparation process and subsequent analyses
including the majority of the records, suggested a of the geocoded dead crow data resulted in some
shift in cluster intensity from areas immediately interesting findings. We consider data processing
around the city of Detroit to smaller cities in the results and their implications for similar stud-
metro area. September showed very moderate ies first, and then evaluate the results from the
clustering throughout the middle of the study area. clustering study. Through the combined aid of
Results for October presented some clustering an internet mapping website and ancillary data,
along the eastern side of Oakland County. an address matching rate of 83.5% was achieved,
The results of GAM for all records (Figure 1) with 72.1% of the records scoring a quality rating
in this model showed a very similar pattern to the of 75 or above. This indicates a relatively com-
local Moran map. It identified a large central clus- plete and accurate conversion of the volunteered
ter over the Birmingham/ Royal Oak/ Southfield data. Intensive manual validation of records was
areas, with a smaller and separate cluster over critical in achieving these high rates. With the
Grosse Pointe. GAM results for the month-by- address matching rate of commercial geocoding
month results were variable, however. The April/ companies found to be 60 – 80% (Kreiger, Water-
May results showed two small clusters. The larger man, Lemieux, Zierler, & Hogan, 2001), it may
of the two clusters was centered over Pontiac be advantageous for researchers to conduct their
Township and the smaller cluster was located over own geocoding depending on time and resources.
the Grosse Pointe communities. Results for June The positional accuracy of address matching
initially showed one large cluster over the southern is an important consideration, as errors increase
part of the study area. However after adjusting the result uncertainty. Sources of error include the
significance level most of the intensity was over positional accuracy of the road network, the cor-
the Westland area, with a thin band to the north rectness of the recorded address; attribute accuracy
reaching into Livonia. For July, GAM identified of the address ranges stored in the roads data, and
a large cluster stretching from Birmingham to St. the fidelity of the interpolation along the segments
Clair Shores, and another large cluster reaching to the actual address locations (Karimi & Durcik,
from Livonia to Westland. Results for August 2004). One study found that the majority of ad-
show similar patterns to the GAM results for all dress derived geocoded points fall within 100 m
records. Two distinct clusters for the month of of GPS points with an overall median distance of
August were identified. The larger more intense 38 m (Bonner, Daikwon, Nie, Rogerson, Vena, &
cluster centralized over the Birmingham area, Freudenheim, 2003). At the metropolitan scale of
and the second cluster covered Grosse Pointe. this application, errors of this magnitude appear
For September, clustering was identified over the inconsequential. For neighborhood-scale research
northern half of the area. At the initial significance projects, this degree of positional error might be
level clustering was widespread; however after unacceptable.
changing the level clusters appeared to be focused A more fundamental source of uncertainty in
to two locations near the village of Clarkston. this analysis is due to the volunteered nature of
With few data points in October, GAM clusters the data; this is asserted geographic information,
moved even further into the rural areas of the not the result of a systematic census or a designed
study area. The main cluster was located in west sample (Goodchild, 2008). Use of such data raises
Oakland County; this is consistent with the local interesting questions that impact the results re-
Moran’s I results. ported here, and have important implications for

1178
Using Volunteered Geographic Information to Assess the Spatial Distribution of West Nile Virus

studies employing volunteered data with similar tions for clusters in these areas: the environmental
characteristics to those employed in the present hypothesis and the reporting bias hypothesis.
study. Often surveillance of emerging infectious The environmental hypothesis is that these areas
diseases demands an opportunistic approach to possess qualities that are amenable to crows and/
data collection. or WNV-positive mosquitoes. If one accepts this
It is useful to contrast the data in hand with hypothesis, the hot spots observed in these areas
the information we would have liked. In a hot are important indicators of WNV infection, and
spot analysis, we would like to know the spatial their variation through the warm months of 2002
distribution of the population at risk: here, the can provide insight into the diffusion of the dis-
corvid population in Metro Detroit during the ease across metropolitan Detroit. In contrast, the
weeks of the study. We would also like to have an reporting bias hypothesis is that higher reporting
exhaustive, or mapped, set of all WNV-positive levels in certain communities are due to greater
date-referenced dead corvid locations. A second- information about the surveillance campaign and
best corvid dataset would be a random sample greater willingness to participate. Geographic
of all dead corvid locations. Instead we have no variation in both the effectiveness of advertis-
information about the geographic distribution of ing methods used to inform the public about the
the underlying bird population, and the sample reporting program and social and cultural char-
of dead birds is biased: only birds identified and acteristics related to willingness to participate in
reported by volunteers are in the database. At best, the program have a direct effect on the patterns
this is a dataset of the geographic intersection of seen in these data. In support of this hypothesis we
informed human volunteers and dead birds. note that the communities associated with the most
The effects of this are clear from both the spatial pronounced clusters are some of the most affluent
and attribute information obtained. The geographi- in the study area and may enjoy improved access
cal distribution of dead birds is positively and to media information about WNV and the citizen
strongly associated with human populations. This surveillance program, increased willingness to
association is not uniform, however, as certain report sightings, and greater interaction with the
urban areas consistently report dead birds, and outdoors. All of these factors could explain higher
other urban areas consistently do not. Patterns reporting levels. In contrast, no clusters were
in the attribute data also suggest reporting bias. observed within the Detroit City limits and lower
Particularly notable is the striking difference in income communities. People in these areas may
quantities of American crow and blue jay reports. not have been informed of the program or might
There were nearly five times as many American have been less interested in participating. Under
crow sightings than blue jay sightings in the this hypothesis, observed dead bird clusters are
statewide database and none in our subset. This no less “real”, but they may not reflect the spatio-
may be due to the larger physical size of the crow, temporal patterns of WNV impact so much as
its identifiability, or its natural behavior to roost the socio-economic geography of Metro Detroit.
and to scavenge for food in urban environments.
Alternatively, the impact of WNV on the American
crow may have been reported more in the media CONCLUSION
and thus the public may have been more interested
or aware of American crow deaths. This research was concerned with the utility of
Both Local I and GAM methods consistently volunteered geographic information reported by
identified persistent clusters in inner-ring suburban private citizens to track the progress of an infec-
communities. We raise two competing explana- tious, vector-borne disease across a metropolitan

1179
Using Volunteered Geographic Information to Assess the Spatial Distribution of West Nile Virus

region. By employing this volunteered, asserted case-location level. Aspects of the environment
information, many well-known problems in de- such as land cover and land use should be consid-
signed, authoritative data collection efforts are ered. Mosquito breeding micro-environments need
exacerbated. Of particular concern are errors in to be identified to determine points where the vec-
data collection and entry, post-processing accuracy tor might be initially located. Also, the proximity
assessment and the general credibility of the as- of birds and humans to these locations should be
serted data. The problem of false negatives – of known, and information regarding the flying range
dead birds that weren’t reported – is acute. Data of the vector would identify areas where humans
records are biased to areas where informed, will- and birds would be at risk. Examining the breed-
ing people are located. We analyzed the impact ing cycle of the vector would help in identifying
of these problems on an exploratory investigation possible periodic blooms in mosquito activity. This
into geographic variability in a database of dead research did not examine the roosting patterns
bird sightings in the warm months of 2002 in of the bird population; this information would
metropolitan Detroit. Standard global and local identify the range of the host, and combined with
autocorrelation tests as well as a multiscale clus- the mosquito information, would identify the host
tering algorithm were implemented. Despite the population at risk. Research into the case fatality
challenges, similar results were found using each rate of WNV on the bird population, as well as
of the models. All local models identified clusters the effects of the virus on the movement, and the
in similar areas. One large cluster was present in length of time before mortality would also assist
all three of the local models for the entire metro- in the temporal study. One of the large drawbacks
politan Detroit dataset. Modest temporal variation to the integrity of the dead bird data was that
in local clustering is apparent. However, areas there was not 100% testing of the reported birds;
with clusters of dead bird records also supported complete testing of the birds would remove any
large human populations. We posit two competing false positive errors in the sample. Also obtain-
hypotheses for the nature of these clusters: that ing a GPS (Global Positioning System) location
they are due to actual variation in corvid mortal- of the dead bird would remove the location error
ity, or that they are due to variation in knowledge associated with address matching and the error
and willingness of the local human population to in the base GIS layers. Finally an examination of
report dead bird sightings. Further information the WNV public information campaign would be
is needed to determine which hypothesis is more valuable for correcting reporting biases, and for
valid. Further research on this topic could follow improving future efforts to obtain volunteered
two different tracks. First, from the clustering per- geographic information.
spective, additional years of dead bird data, along
with alternative models and methods to identify
spatial and temporal clustering in the dead bird REFERENCES
data would be useful. The second would examine
the collection methods, environmental, and bio- Anderson, J. F., Vossbrinck, C. R., Andreadis,
logical aspects related to the spread of WNV. By T. G., Iton, A., & Beckwith, W. H. (2001). A
obtaining more complete information about the phylogenetic approach to following West Nile
local etiology of WNV, a more compelling model virus in Connecticut. Proceedings of the Na-
for the underlying geographic distributions of bird tional Academy of Sciences of the United States
and mosquito populations could be constructed. of America, 98(23), 12885–12889. doi:10.1073/
There are many facets to disease cluster analy- pnas.241472398
sis that go beyond the information found at the

1180
Using Volunteered Geographic Information to Assess the Spatial Distribution of West Nile Virus

Anselin, L. (1995). Local indicators of spatial Jelinski, D., & Wu, J. (1996). The modifiable
association-LISA. Geographical Analysis, 27, areal unit problem and implications for landscape
93–115. doi:10.1111/j.1538-4632.1995.tb00338.x ecology. Landscape Ecology, 11, 129–140.
doi:10.1007/BF02447512
Bailey, T. C., & Gatrell, A. C. (1995). Interactive
spatial data analysis. Essex, UK: Longman. Karimi, H., & Durcik, M. (2004). Evaluation of
uncertainties associated with geocoding tech-
Bonner, M., Daikwon, H., Nie, J., Rogerson,
niques. Computer-Aided Civil and Infrastructure
P., Vena, J., & Freudenheim, J. (2003). Po-
Engineering, 19, 170–185. doi:10.1111/j.1467-
sitional accuracy of geocoded addresses in
8667.2004.00346.x
epidemiologic research. Epidemiology (Cam-
bridge, Mass.), 14, 408–412. doi:10.1097/01. Kreiger, N., Waterman, P., Lemieux, K., Zierler,
EDE.0000073121.63254.c5 S., & Hogan, J. (2001). On the wrong side of
the tracts? Evaluating the accuracy of geocod-
Caliper (2006). Mapping software, GIS, and
ing in public health research. American Journal
transportation planning software. Newton, MA:
of Public Health, 91, 1114–1116. doi:10.2105/
TransCAD.
AJPH.91.7.1114
Cooke, W., Katarzyna, G., & Wallis, R. C. (2006).
Lawson, A. B., & Kleinman, K. (2005). Spa-
Avian GIS models signal human risk for West
tial and syndromic surveillance for public
Nile virus in Mississippi. International Journal
health. Chichester, UK: John Wiley & Sons.
of Health Geographics, 5(36).
doi:10.1002/0470092505
Craven, R. B., & Roehrig, J. T. (2001). West
Marra, P., Griffing, S., Caffrey, C., Kilpatrick,
Nile Virus. Journal of the American Medical
A., McLean, R., & Brand, C. (2004). West Nile
Association, 286(6), 651–653. doi:10.1001/
virus and wildlife. Bioscience, 54, 393–402.
jama.286.6.651
doi:10.1641/0006-3568(2004)054[0393:WNVA
Dodd, R. Y. (2003). Emerging infections, transfu- W]2.0.CO;2
sion safety, and epidemiology. The New England
Moran, P. A. P. (1950). Notes on continuous
Journal of Medicine, 349(13), 1205. doi:10.1056/
stochastic phenomena. Biometrika, 37, 17–23.
NEJMp038138
Openshaw, S. (1996). Using a geographical analy-
ESRI. (2007). ArcGIS 9.2 desktop help. Retrieved
sis machine to detect the presence of spatial clusters
from http://webhelp.esri.com/arcgisdesktop/9.2/
and the location of clusters in synthetic data. In
index.cfm?TopicName=welcome
Alexander, F. E., & Boyle, P. (Eds.), Methods for
Fotheringham, A. S., & Zhan, F. B. (1996). investigating localised clustering of disease (pp.
A comparison of three exploratory methods 68–87). Lyon, France: IARC Scientific.
for cluster detection in spatial point pat-
Openshaw, S., Charlton, M., Craft, A., & Birch,
terns. Geographical Analysis, 28(3), 200–218.
J. (1988). Investigation of leukaemia clusters by
doi:10.1111/j.1538-4632.1996.tb00931.x
use of a geographical analysis machine. Lan-
Goodchild, M. F. (2008). Assertion and authority: cet, 331(8580), 272–273. doi:10.1016/S0140-
The science of user-generated geographic content. 6736(88)90352-2
In Proceedings of the GeoInfo Colloquium for
Andrew U. Frank’s 60th Birthday.

1181
Using Volunteered Geographic Information to Assess the Spatial Distribution of West Nile Virus

Openshaw, S., Charlton, M., Wymer, C., & Petersen, L. R., & Marfin, A. A. (2002). West
Craft, A. W. (1987). A mark I geographical Nile virus: A primer for the clinician. Annals of
analysis machine for the automated analysis of Internal Medicine, 137, 173–179.
point data sets. International Journal of Geo-
State of Michigan. (2006). Emerging dis-
graphical Information Science, 1(4), 335–358.
ease issues: West Nile virus. Retrieved from
doi:10.1080/02693798708927821
http://www.michigan.gov/emergingdiseas-
Ozdenerol, E., Bialkowska-Jelinska, E., & Taff, es/0,1607,7-186-25805---,00.html
G. N. (2008). Locating suitable habitats for West
Nile Virus-infected mosquitoes through associa-
tion of environmental characteristics with infected
mosquito locations: A case study in Shelby County,
Tennessee. International Journal of Health Geo-
graphics, 7(12), 200.

This work was previously published in the International Journal of Applied Geospatial Research (IJAGR), Volume 2, Issue 3,
edited by Donald Patrick Albert, pp. 72-85, copyright 2011 by IGI Publishing (an imprint of IGI Global).

1182
1183

Chapter 71
GIS Application of Healthcare
Data for Advancing
Epidemiological Studies
Joseph M. Woodside
Cleveland State University, USA

Iftikhar U. Sikder
Cleveland State University, USA

ABSTRACT
Healthcare practices increasingly rely on advanced technologies to improve analysis capabilities for
decision making. In particular, spatial epidemiological approach to healthcare studies provides signifi-
cant insight in evaluating health intervention and decisions through Geographic Information Systems
(GIS) applications. This chapter illustrates a space-time cluster analysis using Kulldorff’s Scan Statistics
(1999), local indicators of spatial autocorrelation, and local G-statistics involving routine clinical
service data as part of a limited data set collected by a Northeast Ohio healthcare organization over a
period 1994 – 2006. The objective is to find excess space and space-time variations of lung cancer and
to identify potential monitoring and healthcare management capabilities. The results were compared
with earlier research (Tyczynski & Berkel, 2005); similarities were noted in patient demographics for
the targeted study area. The findings also provide evidence that diagnosis data collected as a result of
rendered health services can be used in detecting potential disease patterns and/or utilization patterns,
with the overall objective of improving health outcomes.

INTRODUCTION systems. In healthcare settings application of such


new technology are proving useful in the analysis
The increasing demand for health data analysis in of health data and planning of healthcare services
spatial and temporal scale has made emerging tech- (Pfeiffer, Robinson, Stevenson, Stevens, Rogers,
nologies such as Geographic Information Systems & Clements, 2008). The ability of GIS to manage
(GIS) an essential tool for healthcare information and retrieve georeference data has demonstrated
its value in the integration of complex epidemio-
DOI: 10.4018/978-1-4666-2038-4.ch071

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
GIS Application of Healthcare Data for Advancing Epidemiological Studies

logical models through visualization of spatial and care, i.e. serving a particular population with tar-
temporal relationships. This has been recognized geted specialists, and preventing spread of disease
by the World Health Organization (WHO): amongst populations. This research aims to study
different clustering methods of the spatial and
Geographical information systems (GIS) provide spatio-temporal patterns of lung cancer particu-
ideal platforms for the convergence of disease- larly for routine clinical service data collected by
specific information and their analyses in relation a Northeast Ohio healthcare organization over a
to population settlements, surrounding social and period from 1994 – 2006.
health services and the natural environment. They
are highly suitable for analyzing epidemiological
data, revealing trends and interrelationships that GIS
would be difficult to discover in tabular format.
Moreover GIS allows policy makers to easily Geographic information systems (GIS) integrate
visualize problems in relation to existing health computer applications and data for capturing, stor-
and social services and the natural environment ing, querying, analyzing, viewing and modeling
and so more effectively target resources. (World geographic and spatial information for improved
Health Organization, 2008) decision making. GIS are distinguished from
other information systems, based on their ability
Geographical analysis is not only important to utilize geographic data (ESRI, 2010; Chang,
for the identification of patterns of healthcare 2006). GIS originated in the 1960s and 1970s,
outcomes it also offers insight into understanding and were introduced to mainstream use during
the association or linkage to political processes the 1990s with the addition of a graphical user
and policy makers (Cromley, 2002; Gatrell, 2002). interface, reduced software and hardware costs,
Health data from managed health care organiza- and available data. GIS have been used in a
tions offers the opportunity to analyze unusual wide spectrum of applications including: natural
geographical patterns of disease. Routine, aggre- resources planning, hazard management, crime
gated healthcare data stored in health systems can mapping, transportation, navigation, farming,
be utilized to identify disease clusters or utilization environmental monitoring, and epidemiology,
patterns. Recently methods have been sought to among others (Chang, 2006; Li, 2008).
further improve identification within case and The application architecture below displays
disease management programs. common components within a healthcare infor-
The real world clinical service data stored mation system for epidemiological studies: a web
in healthcare information systems provides op- server, GIS, and database. The web component
portunity to analyze spatio-temporal patterns at includes data entry, exchange, and transfer, along
finer granularity. The investigation of space and with a centralized user interface for healthcare
space-time epidemiological patterns often gives information. The web server also permits viewing
rise to the explanation of factors that might cre- of displayed data either on-premise or remotely,
ate an adverse health condition. This study uses enabling greater accessibility of information by
routine, aggregated service data to find excess the end-user. The GIS maintains geographic and
space and time variations in rendered services spatial information, and also incorporates statisti-
where the primary diagnosis was lung cancer. cal and data analysis tools, such as a path analysis,
From the health care management point of view, data querying, map data exploration, frequency
if clusters are detected and explanatory factors distribution, regression, chi-square analysis, and
linked, this understanding allows for better patient multivariate analysis to analyze healthcare data

1184
GIS Application of Healthcare Data for Advancing Epidemiological Studies

Figure 1. Application architecture


GIS spatial and space-time clustering to detect
childhood cancer rates in Florida, discovering po-
tential environmental or risk factors in geographic
areas. In Yu et al. (2004), the authors utilize GIS
to identify quality improvement and cost factors
associated with common veteran population
conditions such as chronic heart failure (CHF),
colorectal cancer, diabetes, HIV/AIDS, ischemic
heart disease, psychiatric disorders, spinal-cord
injury (SCI), stroke, and substance abuse. The
GIS output assisted with policy development
with regard to resource allocation, quality con-
siderations, and cost identification. The CDC and
Huang et al. (2010) utilize GIS to identify obesity
prevalence in conjunction with other conditions
such as diabetes. The findings have identified
significant geographic variances at the state level
in both spatial and space-time components (Amin,
2010; CDC, 2010; Huang, 2009; Yu, 2004). In
pharmacoepidemiology, Brownstein et al. 2010
for decision making. The database contains in- applied GIS to develop risk maps for opoid abuse,
tegrated forms of healthcare data, and maintains in order to assist with health intervention planning.
quality, timeliness, aggregation and other data Significant clusters were found, and could be
management items. Common inputs include raw utilized to determine availability of prescriptions,
data, metadata or description of the data, and map for public health agencies to use in planning, and
data (Chang, 2006; Li, 2008). treatment surveillance (Brownstein et al., 2010).
Environmental disease monitoring is a key
GIS Healthcare Applications component of healthcare monitoring. Conditions
such as cancer, birth defects, asthma, bird flu,
GIS along with geocoded data is a key objective and respiratory problems have been linked to
within healthcare, in order to improve costs, dis- environmental factors. Environmental data has
ease surveillance, and interventions. In Healthy been collected for many decades by government
People 2010, the GIS goal for use in healthcare agencies, though is often contained in disparate
systems is targeted at 90% from the current 45% data locations. In Li et al. (2008) the authors pro-
baseline (CDC, 2010). GIS applications have pose a GIS integrated environmental monitoring
been utilized in a variety of healthcare service system, and utilize asthma condition as a GIS
categories including epidemiology, pharmacode- mapping against air emissions. In Stelling et al.
pidemiology, public health, and managed care, (2010), outbreak detection is employed through
with specific examples presented below. GIS surveillance. Statistical events were identified
In epidemiology GIS applications have in- and referred to health departments, for appropri-
cluded disease state detection through clustering, ate response. These systems are able to monitor
quality improvement, and cost containment. For in real-time to improve public health (Li et al.,
example in Amin et al. (2010), the authors utilize 2008; Stelling et al., 2010).

1185
GIS Application of Healthcare Data for Advancing Epidemiological Studies

Table 1. GIS healthcare applications

Healthcare Services Category Healthcare GIS Application Supporting Works


Epidemiology Health Care Cost Containment Yu et al. (2004)
Quality Improvement Amin et al. (2010)
Disease Detection Huang et al. (2010)
Pharmacoepidemiology Substance Abuse Brownstein et al. (2010)
Cluster Detection
Pharmaceutical Risk Management
Public Health Outbreak Detection Ling et al. (2008)
Environmental Monitoring Stelling et al. (2010)

GIS Application Example: Space- value which exceeds the normal value for the
Time Cluster Analysis space and/or time. The closer a cluster popula-
tion is defined, the excess value will be greater
The question whether diseases such lung cancer for the cluster population, and the significance
or breast cancer are spatially clustered is an ac- will be greater. The closer a cluster population is
tive research area (Lawson, Biggeri, Böhning, defined, the greater the excess value will be for
Lesaffre, Viel, & Bertollini, 1999; Lawson, A., & the cluster population, and the significance will
Denison, D., 2002; Marshall, R. J., 1991; Tango, be greater. Initial assessments of clusters include
T., & Takahashi, K., 2005). Since the detection of reviews of cases, boundaries of space and time,
spatial and temporal patterns of clusters of lung estimated number of cases, estimates of standard-
cancer is sensitive to the clustering algorithm, it is ized mortality ratios, statistical significance, and
difficult to evaluate results from a single method public communication. Cluster analysis has been
(Jacquez, G. M., & Greiling, D. A., 2003). Cur- frequently used to identify occurrence of morbid-
rently, most of the comparative analysis of disease ity or unusual localized trends in disease patterns
clusters depends on simulated data (Ozonoff, A., (Alexander, F. F., 1992).
Bonetti, M., Forsberg, L., & Pagano, M., 2005). A considerable amount of research in tempo-
Tycznski developed a broad atlas of cancer in ral and spatial context in ‘scan’ statistics has been
Ohio which involved a “smoothing” method where invested in identifying disease clusters. The
weighted averages of cancer per county were theory has been successfully applied in a wide
calculated versus geographic location of patients variety of epidemiological studies for cluster
with cancer at the time of diagnosis. However, the detection (Viel, J. F., Arveux, P., Baverel, J., &
clusters are generated by considering only spatial Cahn, J. Y., 2000; Perez, A. M., Ward, M. P., Tor-
aspect. The temporal characteristics of the cluster res, P., & Ritacco, V., 2002; Sankoh, O. A., Ye,
are not reported (Tyczynski, J. E., et al., 2005). Y., Sauerborn, R., Muller, O., & Becher, H., 2001).
In recognition of the usual epidemiological It has been identified as the most powerful
definition of cluster, this study adopts the formal method for detecting local clusters (Kulldorff,
definition of cluster which refers to the patterns M., Tango, T., & Park, P. J., 2003; Song, C., &
of location of disease cases, relative to the pat- Kulldorff, M., 2003). Ideally, this method is suit-
tern of non-cases (Wakefield, J. C., Kelsall, J. able where one needs to scan for clusters in space
E., & Morris, S. E., 2000). In principle, since the that vary over time. Since the method for detect-
cases are more clumped than non-cases, the dif- ing clusters is entirely unsupervised, there is no
ference between the two patterns is statistically need for a priori knowledge for the population
recognizable. Intuitively, a cluster is an excess size. The method is based on the concept of ‘win-

1186
GIS Application of Healthcare Data for Advancing Epidemiological Studies

dows’ which are defined to contain a fixed popu- We applied the SaTScan method for a spatial
lation (N*), and are centered on each area centroid. and space-time analysis for detecting local clusters
The algorithm identifies a significant excess of (Kulldorff, M., 1997). Due to temporal trends,
cases within a predefined moving window that clusters may be generated for a ramp up or down
exhaustively searches all existing space-time in data trends. For this reason, the space-time
locations and keeps increasing size in space time permutation model automatically adjusts for these
until it reaches a maximum limit. The maximum temporal data trends. In the study we used case
number of cases M = maxYj (N * ) across the data, with the spatial location represented by zip
j
code centroid latitude and longitude, and with
windows is used as a test statistic. In this case j
time represented by service month. The actual
represents the indexes of the areas defended in
number of cases in a cluster is compared with the
N*. It is also possible to use a fixed number for
expected count if the spatial and temporal loca-
the population by introducing predefined con-
tions of all records were independent. A cluster
straints (e.g., the number population with a circle
is determined to be present in a spatial location,
should be less than a specified fraction of the
if, during the time period, there are excess cases
total population of the study area). Hjalmars et
or recess within the surrounding areas. Using a
al. (1996) and Kulldorff et al. (1997) applied a
cylindrical window with a spatial base and time
similar method to detect childhood leukemia and
as height, the space-time statistic is defined. The
breast cancer incidence. The test statistic is based
window is moved in space and time, and a cylin-
on maximum likelihood ratio statistic across all
der is created for each possibility. The algorithm
circles.
accounts for multiple testing by calculating the
Yj Y −Yj
maximum likelihood of occurrence for all pos-
Y  Y −Y  + sible cluster locations and sizes (Kulldorff, M.,
  +
L = max  j  
j 
 I (Yj > E j )
j  E j  Y+ − E j  1997, 2001). In this study, retrospective analysis
(1) was performed in terms of months, with periods
representing January 1994 through May 2006. In
Here, Yj and Ej represent the observed and each window, the alternative hypothesis concludes
expected number of cases within the window j. that there is heightened risk.
The indicator function I(.) becomes 1 when the The Poisson model is used for the space-time
observed number exceeds the expected number permutation probability model as this allows for
of cases within the window; otherwise the value covariate adjustments, in this case age and gender.
is 0. When the window with greatest exceedance This likelihood function is maximized over all
is encountered, the sampling distribution of like- windows, and the maximum likelihood window
lihood ratio is determined using a Monte Carlo describes the prevalent cluster. The test statistic
test of cases across windows under a random is calculated by generating a large random sample
distribution assumption. Thus under the repeated from the data generated under the null hypothesis.
permutation, the distribution of likelihood statis- Monte Carlo testing is used to obtain the predicted
tic, the null hypothesis is developed. The result value. In this study, Monte Carlo replications were
is significant at 0.05 levels if the likelihood ratio generated to produce a P-value to 0.001. Covari-
is among the top 5% of all the values. It is also ates were used since clustering can occur due to
possible to determine secondary cluster with a covariates. Covariates are adjusted to prevent this
lower significance level. false clustering. The time precision was monthly
and ranged from January 1994 through May 2006
based on available data at time of collection. The

1187
GIS Application of Healthcare Data for Advancing Epidemiological Studies

maximum spatial and temporal cluster size was Moran’s I for rates, the underlying assumption
the default 50% of cases (Kulldorff, M., 2005). of stationarity may be violated due to intrinsic
The scant statistic generated both primary and variance instability of the rates. Since the popula-
secondary clusters and the output was reported tion at risk in the study area varies significantly
in ASCII format, which contains a log likelihood across patient zip codes, variance instability may
ratio and the significance level for the study area. lead to spurious inference for Moran’s I. To ac-
The output file was finally imported in standard count for this effect, the Empirical Bayes (EB)
GIS environment of ArcGIS to visualize cluster standardization was performed (Assuncao, R.,
location for further spatial analysis. The Pois- & Reis, E. A.) using zip code population as the
son model was also used for the purely spatial base variable. The standardized rate was used to
probability model, and follows closely with the calculate the univariate LISA. The spatial autocor-
space-time permutation model. However this relation analysis utilized local tests for Moran’s I
model utilizes a population file which includes statistic, with significance maps generated to the
information regarding the at risk population, and P-value of 0.05 (Anselin, L., Syabri, I., & Kho,
was taken from the 1999 US Census Bureau Zip Y., 2004). Sensitivity analysis was done by chang-
Code file, with regard to total 1999 population for ing the number of permutations (9999 times) for
each zip code. As this file did not include additional different significance cutoff values.
population attributes such as age and gender, the While Moran’s I is effective in identifying
case file along with the population file excluded presences of clustering of similar values cluster-
these for this model (Kulldorff, M., 1997; U.S. ing, it cannot differentiate between high and low
Census Bureau, 1999). values. Another spatial autocorrelation statistic,
Spatial autocorrelation analysis provides both the general G-statistic (Getis, A., Ord., J.K.,
global and local clusters which can be detected 1992), is able to detect hot and cold spots. The
by Moran’s I statistic. The global pattern can be G-Statistic also uses cross-product statistics to
detected from Moran’s Scatterplot, where the measure spatial association, similar to Moran’s
slope of the regression line represents Moran’s I. The local G-statistic is the local version of the
I. We applied the local indicators of spatial au- general G-statistic, and it indicates how the value
tocorrelation (LISA) method using sample cases of each unit is associated with surrounding units
in each zip code. The local Moran statistic for within distance d (Wong, D. W. S., et al., 1999).
location i is given as follows (Wong, D. W. S., &
Lee, J., 1999):  ∑ w (d )x 
 ij j 

Gi (d ) =  j  j ≠i (3)
I i = z i ∑ wij z j (2)  ∑ x j 
j  j


Where zi, zj are the deviations from the mean Here, wij(d) is the weight, with distance d,
for associated x, and where zi is the z-score of and the weight is 1 if j is within d of i; otherwise
xi. A high Moran’s I indicates associated values, weight is 0. The cross product of the points i and
whereas a low value indicates non-associated j are represented by xixj. A high Gi(d) indicates a
values. The row-standardized matrix, used to spatial association of similar high values; a low
estimate weights for each of the unit’s neighbors, Gi(d) indicates low and below-average values.
is defined as wij (Wong, D. W. S., et al., 1999). A z-score near 0 indicates no spatial pattern; a
When the study involves the measurement of highly negative z-score indicates low values; and

1188
GIS Application of Healthcare Data for Advancing Epidemiological Studies

a highly positive z-score indicates high values (ICD-9-CM) Code 162.XX was used to represent
(Wong, D. W. S., et al., 1999). We applied the the diagnosis for lung cancer. The aggregation
local G-statistic using samples cases in each zip unit of geographic location used is the five-digit
code. The standardized Empirical Bayes rate was US zip code. The coordinates were then mapped
used to estimate the G-statistics. in ESRI’s ArcGIS 9.0. Geocoding was performed
through assignment of coordinates (latitude and
Data Compilation and Analysis longitude) to each patient zip code. The coordinate
file contains the geographic coordinates for each
The study included the area of Northeast Ohio. location id specified in the case file. Coordinates
The service data was collected from northeast were specified using latitude and longitude of
ohio health plan with appropriate approval from each 5-digit patient zip code entered in decimal
the Internal Review Board. It includes the routine degrees, and, where identical, one or more coor-
clinical service data of over the service period of dinates were combined for a single location. Each
January 1994 through May 2006. The case file patient zip code was geocoded to the centroid
contains information about the cases where each using a US census 1999 zip code file containing
record represented an individual service (Kull- the latitude and longitude. For graphical analysis
dorff, M., 2005). The dataset was anonymised and to reflect the majority of sample records, a
to avoid disclosure of individual information. fourteen-county region with contiguous zip codes
In addition, a limited number of attributes was was used to represent the Northeast Ohio region.
authorized for addressing the specific research The dataset used for the analysis included a total
objective in finding excess space-time variations of 2,364 records or unique initial service claims
of lung cancer. Among the attributes available in for patients having included the ICD-9-CM Code
the case file are primary diagnosis, patient zip 162.XX (lung cancer) as the primary diagnosis,
code, service month, age, and gender. Service and 152 unique patient zip codes.
month was the month of service/diagnosis where To account for the inherent variance instability
data was collected and available for use beginning of rates (Bailey, T. C., & Gatrell, A. C., 1995) of
in January 1994 through May 2006; the attribute lung cancer incidence, empirical Bayes smoothing
age represents the age at date of service. In cases was performed (Clayton, D., & Kaldor, J., 1987),
where multiple services/diagnoses existed, only whereby the raw rates were adjusted towards the
the first occurrence for each individual was used overall average of the study area. The technique
to avoid cluster creation through repeat services. consists of computing the weighted average
The dataset was generated during 07/2006 and between the raw rates for each zip code and the
included all services rendered or received to date. study area average with weights proportional to
For services rendered at external locations, several the underlying population at risk. In other words,
months may have elapsed before service infor- small zip codes (i.e., with a small population at
mation was received, and may be excluded from risk) will tend to have their rates adjusted consid-
this study for this reason. It would be possible to erably, whereas for larger zip codes, the rates will
periodically re-run past results to ensure the latest barely change (Clayton, D., & Kaldor, J., 2005).
dataset available. Due to system memory require- The empirical Bayes (EB) smoothed box map in
ments (32-bit Windows memory allocation size), Figure 2 shows that 5 zip codes are in the upper
and software limitations, individual diagnoses outlier and as many as 34 zip codes are within
were required to be selected and scanned (Kull- 75 percentiles.
dorff, M., 2005). International Classification of
Diseases, (Ninth Revision) Clinical Modification

1189
GIS Application of Healthcare Data for Advancing Epidemiological Studies

Figure 2. Empirical Bayes (EB) smoothed box map of lung cancer in Northeast Ohio during 1994-2006

RESEARCH RESULT <= 0.01) exists. The primary cluster detected is a


geographically contiguous area in Northeastern
The spatial scan statistic result shows that there Ohio, with a relative risk (RR) of 1.784 and sig-
exists a cluster of lung cancer in the Northeast nificant P-value of 0.001. The cluster contained 44
Ohio region. Areas of excess were detected with zip codes within the contiguous sample area and
statistical significance using a spatial Poisson spanned a time period of 7/1/1999 – 8/31/2002.
probability model for services with a diagnosis of The location of the cluster changes when includ-
lung cancer. The results show that a statistically ing time, as compared with the purely spatial
significant cluster exists with a relative risk (RR) model. The primary cluster shows no geographic
of 4.164 at P-value of 0.001 that includes 72 zip change when adjusted for attributes. It was found
codes. The results also listed another statistically that adjustment of attributes is not required as the
significant cluster with RR of 0.186 and P-value of cluster location does not change when attributes
0.001, and contained 28 zip codes within the con- are introduced Secondary clusters were identified,
tiguous sample area. As expected, the significance but were not statistically significant.
levels match closely with the geographic sample The LISA analysis indicates a cluster of high
distribution. The space-time scan statistic result incidence of lung in the region with statistically
also shows that there exists a cluster of lung cancer significant clusters (P-value <= 0.05), containing
in the region for the space-time scan statistic. With 64 zip codes within the study area. The local G-
spatial-temporal data and covariates considered, statistic result shows that there exists a hot spot of
statistically significant areas of excess were also lung cancer in the region. The results also show
detected using the space-time scan statistic for that statistically significant clusters (P-value <=
services with a diagnosis of lung cancer. Clusters 0.05) consist of 89 zip codes. Figure 3 shows the
were scanned first only for location of patient zip spatial statistic significance graphs for the SatScan,
code, and then scanned with the addition of an LISA, and local G-statistic. The figure also shows
attribute (age, gender). Furthermore, clusters were the space-time scan statistic significance graphs.
scanned using five-year age brackets. The results Table 2 shows the significance level measured
show that a statistically significant cluster (P-value by the P-value at the 0.001, 0.01, and 0.05 levels.

1190
GIS Application of Healthcare Data for Advancing Epidemiological Studies

The corresponding percentage area, (where the DISCUSSION


total area of significance was divided by the total
area for which sample cases existed), the number This study detected several high and low asso-
of sample cases, number of zip codes contained, ciation clusters using the spatial and space-time
average age, and county by gender are included, methods. When comparing the various methods
with each method listed as a column. of the scan statistic, LISA, and local G-statistic,
The scan statistic result shows that there exists spatial association areas are somewhat similar
a cluster of lung cancer in the region for the geographically. The local G-statistic returned the
purely spatial statistic. Areas of excess were de- highest number of records (2002) with regard to
tected with statistical significance using a purely significance at the 0.05 level. The scan statistic
spatial Poisson probability model for services method also returned a similarly high number of
with a diagnosis of lung cancer. The results show associated records (2000) as the local G-statistic.
that a statistically significant high value cluster The LISA method returned a comparatively
(P-value <= 0.05) exists and contains 72 zip codes. smaller number of significant records (1242);
The results also listed another statistically sig- however when comparing LISA geographically
nificant low value cluster containing 28 zip codes to the scan statistic and local G-statistic, LISA
within the contiguous sample area. The results appears to exclude bordering zip codes included
list 54 zip codes as not significant. The scan sta- in the other methods. Conversely, LISA had the
tistic result also shows that there exists a cluster highest number of records (1122) that were not
of lung cancer in the region for the space-time significant, followed by the scan statistic (364)
scan statistic. The results show that a statistically and local G-statistic (362). The scan statistic
significant high value cluster (P-value <= 0.01) contained the highest number of high-high cluster
exists. The cluster contained 52 zip codes and records (1896), followed by the local G-statistic
spanned a time period of 7/1/1999 – 8/31/2002. (1377) and LISA (1069). The local G-statistic had
Two of the zip codes contained similar coordinates, the highest number of high-low spatial outlier
which left 50 unique zip code locations. No low records (419), followed by LISA (31) and scan
value clusters were identified. The LISA result statistic (N/A). Only LISA contained low-high
shows statistically significant cluster of lung spatial outlier records (31). The local G-statistic
cancer in the region for the purely spatial statistic. contained the highest number of low-low records
The results show that statistically significant (206), followed by scan statistic (104), which the
clusters (P-value <= 0.05) exist, and contained output indicated low only, and LISA (100).
87 not significant, 31 low-low, 2 low-high, 4 When accounting for temporal trends using the
high-low, and 28 high-high value zip codes. scan statistic, this study detected only one area
The local G-statistic result shows that there of excess lung cancer spanning the three-year pe-
exists a cluster of lung cancer in the region for riod of the study using a space-time method. The
the purely spatial statistic. The results show that SatScan space-time statistic returned the fewest
statistically significant clusters (P-value <= 0.05) number of records (103) and geographic location
exist, and contain 63 not significant, 31 low-low, in terms of significance and clustering, and it was
0 low-high, 16 high-low, and 42 high-high value different geographically from the purely spatial
zip codes. Figure 3 shows the purely spatial sta- methods in that the majority of high value asso-
tistic cluster graphs for the scan statistic, LISA, ciation cases and area was not contained within
and local G-statistic. It also shows the space-time Cuyahoga County, but rather in Stark and Summit
statistic significance graphs for the scan statistic. Counties. In addition, no low value association
clusters were detected, as with the purely spatial

1191
GIS Application of Healthcare Data for Advancing Epidemiological Studies

Figure 3. The area of statistically significant (p<=0.05) lung cancer using a spatial autocorrelation and
scan statistic for space and space-time model

1192
GIS Application of Healthcare Data for Advancing Epidemiological Studies

Table 2. P-value distribution of area, sample cases, zip codes, average age and gender attributes of the
cluster(s) for each method

P-Value SpaceTime Scan Spatial Scan G-statistic LISA


Statistic Statistic
0.05 – 1
% Area 74.30% 43.00% 63.20% 63.00%
Cases 2261 364 362 1122
Zip # 108 54 63 88
Avg Age 67.7 66.4 65.7 67.1
Male 1276 218 207 637
Female 985 146 155 485
0.01 - 0.05
% Area - - 14.90% 25.20%
Cases - - 252 640
Zip # - - 22 37
Avg Age - - 69.6 67
Male - - 138 346
Female - - 114 294
0.001 – 0.01
% Area - - 8.40% 8.70%
Cases - - 195 483
Zip # - - 17 21
Avg Age - - 66.7 69.2
Male - - 100 281
Female - 95 202
0 - 0.001
% Area 25.70% 57.00% 13.50% 3.10%
Cases 103 2000 1555 119
Zip # 44 98 50 6
Avg Age 65.6 67.8 67.9 69.2
Male 55 1113 886 67
Female 48 887 669 52

methods. The introduction of time allows for a of 1999-2002. The narrow temporal concentra-
more focused area and localized study region. tion and high relative risk suggests that further
Since the health data from managed health care investigation is required to understand the causal
organizations are aggregated at the zip code level, factor within the time period.
the smallest mapping unit discernible in a GIS The application of scan statistic is specifically
map is the zip code polygon of the study area. useful when there is a plausibility of a single
Compared to long temporal range (1994-2006) hotspot. Since the likelihood ratio is derived
of the study, the temporal occurrence of cluster from a single hot/cold spot, the possibility of
was found significant only within the three years detecting multiple hot/cold is ruled out in the

1193
GIS Application of Healthcare Data for Advancing Epidemiological Studies

scan statistic algorithm. Moreover, scan statistic further topics of interest. Tycznski notes that
detects space-time clusters by using cylindrical lung cancer was the leading cause of death across
windows; hence the result needs to be interpreted categories; this fact coincides with the focused
with respect to the choice a specific shape of the interest of this study on the leading cause of cancer
window. While the cluster is detected by rejecting death. Tycznski also notes higher mortality rates in
the null hypothesis, the circular spatial window Blacks as compared with Whites, and recommends
tends to detect a larger cluster than the true cluster focused efforts on this demographic. In this study
by absorbing surrounding regions where there a spatial cluster is found surrounding Cuyahoga
is no elevated risk (Tango, T., & Takahashi, K., County, which, when compared with surrounding
2005). The relatively large cluster area detected Northeast Ohio counties, has a greater percent-
in this study could be interpreted as the result of age of the Black population. When incorporating
the boundary effect of the circular window. The temporal trends into the study, an additional area
effect could be confirmed by comparing the result of interest is identified within Northeast Ohio.
with a flexibly shaped scanning window which
allows for irregular shapes (Tango, T., et al., 2005).
Moreover, the results need to be interpreted by PRACTITIONER IMPLICATIONS
comparison with other spatial cluster algorithms
(e.g., spatial filtering [34], generalized additive Practitioner-based usage of this study falls into
models (GAM) (Ozdenerol, E., Williams, B., & two general categories: identification and preven-
Kang, S. Y., & Magsumbol, M., 2005), and Bayes- tion. Detection of lung cancer, or other chronic
ian disease mapping (BYM)) (Besag, J. E., York, diseases identified through spatial and temporal
J. C., & Mollie, A., 1991). When utilizing spatial clusters can discover population areas of interest,
autocorrelation (LISA), the number of significant can gauge the effectiveness of disease reduction
records is reduced due to the generation of larger methods within a given population area, and is
P-values. The spatial autocorrelation of simulated considered an important public health measure.
rates creates large variances in LISA values, This study compares the results of three different
leading to insignificant P-values, as neighboring methods to gain greater insight and better reflect
values are more likely to be jointly low or high spatial and temporal variations, as related to
(Goovaerts, P., & Jacquez G. M., 2004). The lo- identification and/or monitoring current programs.
cal G-statistic produces the highest number of Increasing disease rates may trigger increased
significant records, but also produces the highest focus or re-direction of current efforts. Decreas-
number of spatial outlier records, which, when ing rates may be utilized as validation for current
accounted for, produce a more similar, but still efforts, improvement value, or as a component of
higher records count than LISA. The local G- a shared knowledge base for other efforts. Early
statistic uses a z-score scale around 0 to indicate detection programs along with case and disease
clustering situations, which leads to a classification management programs can also utilize geographic
for all non-zero significant records. modeling information to determine the overall pro-
In comparison with Tycznski’s study (2005), gram design and success factor achievement. The
this study reviewed spatio-temporal clusters of program design may vary by geographic location,
lung cancer cases to identify areas of interest, or may be used to identify the optimal program
whereas the former study considered overall methodology for the particular locality (2005).
mortality rates of all cancer types within Ohio to As an important step in enhanced treatment,
determine time trends. Several key areas can be efforts have been made to materialize best practices
synthesized between the two studies to provide of disease therapies. Identification is made through

1194
GIS Application of Healthcare Data for Advancing Epidemiological Studies

geographical variances, and can trigger alerts to the ability to improve access and utilization of
dedicated specialists for discovery of previously preventative care. These managed care plans gen-
unknown opportunities. Activities following the erally provide more preventative services which
discovery of identified areas may include patient/ result in improved outcomes over fee or service
provider education, therapy identification, local- based plans (Mobley, L. R., Root, E., Anselin,
ity factors, and identification for early treatment, L., Lozano-Gracia, N., & Koschinsky, J., 2006).
among others. This type of geographical evalua- Whether a singular method or rather a com-
tion has occurred with early-stage breast cancer. bination of factors prove effective when adapted
Geographic information systems and associated to the patient set, it is imperative to provide tools
analysis can instruct physicians, patients, and other to successfully identify patients and program out-
medical personnel in resource distribution, costs, comes through monitoring, and enact additional
health outcomes, and patient satisfaction. The programs to prevent future diseases and complica-
location, service provider, and services performed tions. This contributes to the overall management
can be analyzed by geographic area to identify goal of improving quality of care, while reducing
effective methods. Once identified, these methods overall costs, thus ensuring high-quality, afford-
can be standardized to improve consistency across able healthcare.
many locations, in order to gain effectiveness.
Further granularity can be provided to allow for
reduced area identification, in which specific CONCLUSION AND
disease control strategies can be implemented to FUTURE DIRECTIONS
supplement broader programs (Gregorio, D. I.,
Kulldorff, M., Barry, L., Samocuik, H., & Zarfos, The spatial scan statistic, LISA, and local G-sta-
K., 2001). tistic were able to detect several areas of high and
Other studies have sought to determine vari- low value clusters. The space-time scan statistic
ances between local and global populations. Find- identified one area of excess lung cancer spanning
ings have suggested that differences in survival the three-year period of the study and produced a
rates are not a result of biological factors, but different result than a purely spatial scan statistic.
rather treatment and prevention factors. These The cluster remained unchanged and statistically
include disease stage at diagnosis, effects of significant even after covariate adjustment. The
compound diseases, and general treatment fac- findings also provide evidence that diagnosis data
tors. Temporal trends are required in addition to collected as a result of rendered health services can
spatial trends to locate areas whose intervention be used in detecting potential disease patterns and/
would prove most beneficial. Prior studies have or utilization patterns throughout space and time.
examined geographic locations to determine The methods shown are also useful for identify-
physician and medical shortages. The findings ing which areas have the highest occurrence of a
suggest that while the existing supply is in fact particular disease for the placement of facilities or
adequate, the distribution is not optimal. This cre- specialists. Time can also be a factor for identify-
ates access restrictions across previously unseen ing more specific areas of interest, given a large
urban, suburban, and rural areas. Literature sug- geographic region. The possible boundary effect
gests that the physician specialty combination as of circular window and space time-interaction
well as distribution of non-physician clinicians is resulting from geographical population shifts
vital for health outcomes. Accessibility measures in the study area needs to be investigated. The
require accuracy to appropriately capture the flows popularly used Knox method (1964) and other
within service areas. Managed care plans have tests for space-time interaction are required to

1195
GIS Application of Healthcare Data for Advancing Epidemiological Studies

confirm if the space-time clustering of lung REFERENCES


cancer in the study area is a result of underlying
geographical population distribution (Kulldorff, Alexander, F. F. (1992). Methods for the assess-
M., 1999). Unbiased space-time interaction tests ment of disease clusters. In Geographical & envi-
and prospective space-time permutation analysis ronmental epidemiology: Methods for small-area
may also be performed as additional data is made studies (pp. 238-247).
available (Kulldorff, M., 2005; Hjalmars, U., & Amin, R. (2010). Epidemiologic mapping of
Kulldorff, M., 1999). Florida childhood cancer clusters. Pediatric Blood
As managed care programs are initiated in ar- & Cancer, 54, 511–518.
eas of interest, additional localized studies can be
utilized to track improvements in overall mortality Anselin, L. (2005). Exploring spatial data with
rates as a result of the initiatives. By providing GeoDa: A workbook. Center for Spatially Inte-
granularity to the zip code level in this study, grated Social Science.
focused geographic programs can be developed
Anselin, L., Syabri, I., & Kho, Y. (2004). GeoDA:
at strategic facility sites and can be modified
An introduction to spatial data analysis.
based on ongoing spatio-temporal surveillance
of the population. As health care entities seek Assuncao, R., & Reis, E. A. (n.d.). A new pro-
to improve healthcare outcomes, these methods posal to adjust Moran’s I for population density.
provide a useful tool in identifying geographic Statistics in Medicine, 18, 2147-2161.
areas of interest.
Bailey, T. C., & Gatrell, A. C. (1995). Interactive
While the research objective is not to identify
spatial data analysis. New York, NY: John Wiley
a specific cause-and-effect relationship between
and Sons.
lung cancer and environmental conditions such as
air pollution or individual smoking habit, further Besag, J. E., York, J. C., & Mollie, A. (1991).
research is needed to establish a space-time spe- Bayesian image restoration with two applications
cific causal relationship and a latency period of in spatial statistics. Annals of the Institute of
cancer. Given the scope and objective of this study, Statistical Mathematics, 43, 1–59. doi:10.1007/
it was not deemed necessary to collect additional BF00116466
detailed case data. For future study, additional
Brownstein, J., Green, T., Cassidy, T., & Butler,
attributes which are not available for the current
S. (2010). Geographic Information Systems and
dataset should be generated from a localized study
pharmacoepidemiology: Using spatial cluster
that would include things such as ethnicity, socio-
detection to monitor local patterns of prescription
economic status of individual cases, exposure to
opioid abuse. Pharmacoepidemiology and Drug
tobacco smoke, and population migration. These
Safety, 19, 627–637. doi:10.1002/pds.1939
attributes can be adjusted for in the model in order
to determine effects. Localized studies can also CDC. (2010). U.S. obesity trends.
determine utilization of services in which lung
CDC, HRSA. (2010). Healthy people 2010.
cancer diagnosis appears. This identification can
be useful in managing the care for those with an Chang, K. (2006). Introduction to Geogrpahic In-
identified diagnosis and determining frequency formation Systems. New York, NY: McGraw-Hill.
of services for those with an identified diagnosis.

1196
GIS Application of Healthcare Data for Advancing Epidemiological Studies

Clayton, D., & Kaldor, J. (1987). Empirical Bayes Huang, L., Tiwari, R., Pickle, L., & Zou, Z. (2009).
estimates of age standardized relative risks for use Covariate adjusted weighted normal spatial scan
in disease mapping. Biometrics, 43, 671–681. statistics with applications to study geographic
doi:10.2307/2532003 clustering of obesity and lung cancer mortality
in the United States. Statistics in Medicine, 29,
Cromley, E. K. (2003). GIS and disease. Annual
2410–2422.
Review of Public Health, 24, 7–24. doi:10.1146/
annurev.publhealth.24.012902.141019 Jacquez, G. M., & Greiling, D. A. (2003). Local
clustering in breast, lung and colorectal cancer in
ESRI. (2010). ArcGIS.
Long Island, New York. International Journal of
Gatrell, A. (2002). Geographies of health. Malden, Health Geographics, 2.
MA: Blackwell.
Knox, G. (1964). The detection of space-time
Getis, A., & Ord, J. K. (1992). The analysis interactions. Applied Statistics, 13, 25–29.
of spatial association by use of distance sta- doi:10.2307/2985220
tistics. Geographical Analysis, 24, 189–207.
Kulldorff, M. (1997). A special scan statistic. Com-
doi:10.1111/j.1538-4632.1992.tb00261.x
munications in Statistics Theory and Methods, 26,
Goovaerts, P., & Jacquez, G. M. (2004). Account- 1481–1496. doi:10.1080/03610929708831995
ing for regional background and population size in
Kulldorff, M. (1999, June). The Knox method
the detection of spatial clusters and outliers using
and other tests for space-time interaction.
geostatistical filtering and spatial neutral models:
Biometrics, 55, 544–552. doi:10.1111/j.0006-
The case of lung cancer in Long Island, New York.
341X.1999.00544.x
International Journal of Health Geographics,
3(1), 14. doi:10.1186/1476-072X-3-14 Kulldorff, M. (2001). Prospective time periodic
geographical disease surveillance using a scan
Gregorio, D. I., Kulldorff, M., Barry, L., Samocuik,
statistic. Journal of the Royal Statistical Society.
H., & Zarfos, K. (2001). Geographical differ-
Series A, (Statistics in Society), 164(1), 61–72.
ences in primary therapy for early-stage breast
doi:10.1111/1467-985X.00186
cancer. Annals of Surgical Oncology, 8, 844–849.
doi:10.1007/s10434-001-0844-4 Kulldorff, M. (2005). SaTScanTM v6.0: Software
for the spatial and space-time scan statistics.
Hastite, T. J., & Tibshirani, R. J. (1991). General-
Information Management Services, Inc.
ized additive models. New York, NY: Chapman
and Hall. Kulldorff, M., Tango, T., & Park, P. J. (2003).
Power comparisons for disease clustering tests.
Hjalmars, U., Kulldorff, M., Gustafsson, G.,
Computational Statistics & Data Analysis, 42.
& Nagarwalla, N. (1996). Childhood leu-
kaemia in Sweden: Using GIS and a spatial Lawson, A., Biggeri, A., Böhning, D., Lesaffre,
scan statistic for cluster detection. Statis- E., Viel, J. F., & Bertollini, R. (1999). Disease
tics in Medicine, 15, 707–715. doi:10.1002/ mapping and risk assessment for public health. In
(SICI)1097-0258(19960415)15:7/9<707::AID- Lawson, A., Biggeri, A., Böhning, D., Lesaffre,
SIM242>3.0.CO;2-4 E., Viel, J. F., & Bertollini, R. (Eds.), Disease
mapping and risk assessment for public health.
London, UK: John Wiley & Sons.

1197
GIS Application of Healthcare Data for Advancing Epidemiological Studies

Lawson, A., & Denison, D. (2002). Spatial Sankoh, O. A., Ye, Y., Sauerborn, R., Muller, O., &
cluster modelling. Boca Raton, FL: CRC. Becher, H. (2001). Clustering of childhood mortal-
doi:10.1201/9781420035414 ity in rural Burkina Faso. International Journal
of Epidemiology, 30, 485–492. doi:10.1093/
Li, L., Xu, L., Jeng, H., Naik, D., Allen, T., &
ije/30.3.485
Frontini, M. (2008). Creation of environmental
health Information System for public health ser- Song, C., & Kulldorff, M. (2003). Power
vice: A pilot study. Information Systems Frontiers, evaluation of disease clustering tests. Interna-
10, 531–542. doi:10.1007/s10796-008-9108-1 tional Journal of Health Geographics, 2, 1–8.
doi:10.1186/1476-072X-2-9
Marshall, R. J. (1991). A review of the statistical
analysis of spatial patterns of disease. Journal of Stelling, J. (2010). Automated use of WHONET
the Royal Statistical Society. Series A, (Statistics and SaTScan to detect outbreaks of Shigella spp.
in Society), 154, 421–441. doi:10.2307/2983152 using antimicrobial resistance phenotypes. Epide-
miology and Infection, 138, 873–883. doi:10.1017/
Mobley, L. R., Root, E., Anselin, L., Lozano-
S0950268809990884
Gracia, N., & Koschinsky, J. (2006). Spatial
analysis of elderly access to primary care services. Tango, T., & Takahashi, K. (2005). A flexibly
International Journal of Health Geographics, 5. shaped spatial scan statistic for detecting clusters.
International Journal of Health Geographics, 4,
Ozdenerol, E., Williams, B., Kang, S. Y., &
1–15. doi:10.1186/1476-072X-4-11
Magsumbol, M. (2005). Comparison of spatial
scan statistic and spatial filtering in estimating Tyczynski, J. E., & Berkel, H. J. (2005). Long-
low birth weight clusters. International Journal term all-sites cancer mortality time trends in Ohio,
of Health Geographics, 4(19). USA, 1970–2001: Differences by race, gender
and age. BMC Cancer, 5.
Ozonoff, A., Bonetti, M., Forsberg, L., & Pagano,
M. (2005). Power comparisons for an improved U.S. Census Bureau. (1999). U.S. Postal Service
disease clustering test. Computational Statistics ZIP codes.
& Data Analysis, 48, 679–684. doi:10.1016/j.
Viel, J. F., Arveux, P., Baverel, J., & Cahn, J. Y.
csda.2004.03.012
(2000). Soft-tissue sarcoma and non-Hodgkin’s
Perez, A. M., Ward, M. P., Torres, P., & Ritacco, lymphoma clusters and a municipal solid waste
V. (2002). Use of spatial statistics and monitoring incinerator with high dioxin emission levels.
data to identify clustering of bovine tuberculosis American Journal of Epidemiology, 152, 13–19.
in Argentina. Preventive Veterinary Medicine, doi:10.1093/aje/152.1.13
56, 63–74. doi:10.1016/S0167-5877(02)00124-1
Wakefield, J. C., Kelsall, J. E., & Morris, S. E.
Pfeiffer, D. U., Robinson, T. P., Stevenson, M., (2000). Clustering, cluster detection, and spatial
Stevens, K. B., Rogers, D. J., & Clements, A. C. A. variation in risk. In Elliot, P., Wakefield, J. C.,
(2008). Spatial analysis in epidemiology. NewYork, Best, N. G., & Briggs, D. J. (Eds.), Spatial epi-
NY: Oxford University Press Inc. doi:10.1093/ac demiology: Methods and applications. Oxford,
prof:oso/9780198509882.001.0001 UK: Oxford University Press.
WHO. (2008). Public health mapping. WHO.

1198
GIS Application of Healthcare Data for Advancing Epidemiological Studies

Wong, D. W. S., & Lee, J. (1999). Statistical Yu, W. (2004). Using GIS to profile healthcare
analysis of geographic information. Hoboken, costs of VA quality-enhancement research initia-
NJ: Wiley. tive diseases. Journal of Medical Systems, 28,
3. doi:10.1023/B:JOMS.0000032844.26308.95

This work was previously published in New Technologies for Advancing Healthcare and Clinical Practices, edited by Joseph
Tan, pp. 362-377, copyright 2011 by Medical Information Science Reference (an imprint of IGI Global).

1199
1200

Chapter 72
Colorado 14ers, Pixel by Pixel
Brandon J. Vogt
University of Colorado at Colorado Springs, USA

ABSTRACT
This document describes a capstone learning exercise designed for undergraduates enrolled in an intro-
ductory geospatial tools course. The overarching theme of the exercise, Colorado 14ers, Pixel by Pixel,
is mountain geography. While immersed in a digital mountainous landscape, students explore topics
of geomorphology and geomorphometry and discover how it is possible for a summit that rises well
above 14,000 feet not to qualify as a true Colorado ‘14er’. To address these topics, students examine
freely-available digital elevation models (DEMs) using common geospatial analysis tools. The exercise
was developed with five purposes in mind: (1) students should be introduced gently to geospatial soft-
ware; (2) students should enter upper-level geography courses armed with basic computer skills and
a diverse geospatial toolkit; (3) students should internalize the fact that invariably digital geographic
explorations are influenced by scale; (4) students of geography should be able to identify natural and
anthropogenic impacts to a landscape, and; (5) to help students conceptualize what geographers do,
learning exercises should include local and interesting case studies. As an exercise intended to synthesize
geographic concepts covered throughout a course, the learning objectives range from grasping broad
theoretical concepts related to scale and measurement to learning specific computer skills related to
directory structures and file naming conventions. Deliverables of the exercise include posting four maps
and answers to nine questions to a website. The outcome of the exercise is a confident, geospatial tool-
savvy student who is eager to further investigate the field of geography.

INTRODUCTION1 ronmental Studies (GES) geospatial tools course.


Digital Earth introduces geographic information
Colorado 14ers, Pixel by Pixel is a stand-alone, systems (GIS), global positioning systems (GPS),
computer-based exercise that concludes Digital geovisualization, remote sensing, geomorphom-
Earth, an undergraduate University of Colorado at etry, virtual globes, and web design. The course
Colorado Springs (UCCS) Geography and Envi- seeks to prepare students for the digital worlds they
will encounter in upper-level geography courses
DOI: 10.4018/978-1-4666-2038-4.ch072

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Colorado 14ers, Pixel by Pixel

and exposes students, in exciting and interactive • Minimize intimidation: Students of ge-
ways, to the breadth and depth of geography. The ography should learn to apply powerful
course, which is required for all GES majors, and complex tools for geospatial analysis
focuses on eight computer-based exercises (labs). in non-intimidating and exciting ways.
As examples, in one lab, students use the virtual Negative or frustrating first encounters
globe ‘SkylineGlobe’ to compare and contrast with geospatial tools such as ArcMap
fashion by ‘surfing’ crosswalk cameras in large GIS can intimidate and dissuade students.
cities across the globe. In another lab, students Similarly, the datasets explored are often
manipulate 3D models in a GIS to investigate difficult to view, edit, and manage in these
controls of geology on settlement patterns in software environments. Avoiding intimida-
and around Durango, Colorado. Asking students tion is addressed in the exercise by provid-
to solve imaginative problems using interactive ing students with data and a step-by-step,
and exploratory approaches inspires interest in cookbook style set of instructions, many
geospatial tools and their practical applications. with screen-shot examples.
Digital Earth has been well received by students • Colorado 14ers, Pixel by Pixel: is a cap-
at UCCS as measured by higher than expected stone exercise for a course that serves is a
enrollment numbers and strong course evaluations. prerequisite for all upper-level geospatial
Colorado 14ers, pixel by pixel is the capstone tools courses including GIS, remote sens-
exercise of Digital Earth and as such represents ing, image processing, and internet GIS. A
a summary of many of the concepts and tools student familiar with geospatial terminolo-
encountered during the semester. Set in a virtual gy (e.g, raster, vector, interpolation, DEM,
mountainous Colorado landscape, the exercise TIGER) who possesses some general com-
requires students to explore issues related to physi- puter skills (e.g., copying folders, trans-
cal geography, map scale, spatial resolution, and ferring files to a web server) and is armed
digital measurement. The exercise asks students to with a diverse geospatial toolkit (e.g., GIS,
demonstrate through various mappings how it is geovisualization, geomorphometry) is po-
possible that one peak in a group of three 14,000 sitioned to enter upper-level courses with
foot + peaks does not qualify as a true 14er. A heightened confidence.
‘14er’ is one of 54 iconic peaks in Colorado (e.g., • Scale is king: With most geographers rely-
Pikes Peak, Longs Peak, Mount of the Holy Cross) ing at some point on GIS, remote sensing,
that reaches or exceeds an altitude of 14,000 feet digital cartography, digital measurements,
(Blake, 2002). and raster and/or vector data analysis, it
is important at early stages in a student’s
academic trajectory to recognize that all-
PURPOSE STATEMENT things-digital are influenced by scale.
• Human impacts: Geologic and atmo-
The intent of the exercise is to focus student spheric hazards, sustainability, climate
interest on an interesting geographic problem change, and political stability are topics
that requires a suite of geospatial tools to solve. currently being addressed by geographers.
To address this intent and the specific learning To elucidate the connectedness between
expectations of the exercise (described in the humans and environment in these contexts,
‘Learning Objectives’ section below), five general students should interact with geographic
purposes were developed: data face-to-face to visualize tangible ex-

1201
Colorado 14ers, Pixel by Pixel

amples (forms) of natural and anthropo- • Visualize topographic relief on DEMs:


genic processes. students create contour lines, index con-
• What geographers do: Geographic inves- tour lines, hillshade perspectives, and
tigations should be delivered to students slope maps
using local, applied, and interesting case • Gain experience posting, presenting,
studies. This approach helps students rec- and sharing maps and information shar-
ognize what geography is and what geog- ing: students export maps from GIS soft-
raphers do. ware, edit images, design web pages, and
securely transfer all files to a publicly ac-
cessible UCCS website
LEARNING OBJECTIVES

The learning objectives of the exercise dictate that THE EXERCISE


students gain experience using multiple geospatial
tools, different data models, and different file types Colorado 14ers, Pixel by Pixel is summarized in
delivered at different resolutions. Concurrent with a short presentation that provides an overview
tool and scale-related objectives, the exercise seeks of the exercise and outlines its purpose, learning
to introduce students to the depth and breadth of objectives, gradable items, and data and software
the discipline of geography and seeks to inspire requirements (Appendix A). The full-length class
students to take additional upper-level geospatial exercise exists in an electronic downloadable
tools courses. The specific learning objectives are format (Appendix B). The exercise requires an
for students to: internet connection to view helper images. An
answer key provides guidelines for instructors
• Visualize on a map a socially-derived (Appendix C). The URL below points to the two
topographic constraint: following a set DEMs required to complete the exercise: http://
of 14er ‘rules,’ students use GIS-based ras- www.uccs.edu/bvogt/co_14ers (http://www.uccs.
ter measurement techniques to determine edu/bvogt/co_14ers).
which of three nearby 14,000 foot plus
peaks is not a true 14er
• Observe on a DEM patterns of anthro- REAL-WORLD CLIENTS
pogenic processes, fluvial and paleogla-
cial erosion, and frost-shattering: these This exercise was developed with undergraduate
resulting forms are made visible by cre- geography students in mind as the primary client.
ating a pixel diversity map with 32 color However, in addition to serving as a learning
classes (elevation diversity serves as a experience for college students, outcomes of the
proxy for surface roughness) exercise could serve real-world customers. Those
• Understand issues of map resolution: who could gain insight from this exercise include
students compare elevation value accu- outdoor recreation managers, peak ‘baggers’ (i.e.,
racies and contour line shapes of a 10 m those attempting to summit a group of peaks in a
DEM to a 30 m DEM checklist fashion), and physical / natural scientists.
• Gain experience using local operators: According to the Colorado Fourteeners Initia-
students us a map calculator to convert tive (CFI), a non-profit group whose focus is the
measurement units and to show surfaces protection and preservation of Colorado 14ers,
above 14,000 feet as many as 500,000 visitors attempt to summit a

1202
Colorado 14ers, Pixel by Pixel

Colorado 14er each year (CFI, 2010). This non- instructor to work with students on geographic
profit group, as well as recreation managers with concepts or on software-related trouble-shooting.
the United States Forest Service and the National The exercise can be finished outside of the class-
Park Service, seeks to minimize erosion and trail room provided students are equipped with the
degradation while maintaining a challenging necessary software and the ability to transfer files
recreation experience for peak baggers. Colorado to a campus-based web directory.
14ers, Pixel by Pixel provides recreation managers
with a tool that could help, for example, place a
new or reroute an existing hiking trail that ascends ACKNOWLEDGMENT
a peak within a specific slope range.
Other potential clients served by this exercise The exercise described in this paper, Colorado
include the 500,000 peak baggers who attempt 14ers, Pixel by Pixel, won the Applied Geogra-
to summit a Colorado 14er each year. Colorado phy Specialty Group (AGSG) of the Association
14er peak baggers who understand how socially- of American Geographers’ (AAG) 2010 class
defined criteria (e.g., the ‘rules’ for 14er selection) exercise award competition. The AGSG/AAG
are extended to a physical landscape may be competition announcement invited graduate stu-
enlightened by the peak identification process or dents and faculty to submit an innovative applied
may find peace of mind when bypassing a >14,000 geography exercise designed for undergraduate
ft. peak that is not classified as a true 14er. students. Submission guidelines requested a 250
Finally, geomorphologists, environmental word abstract; an exercise handout to students; a
geologists, ecologists, paleoclimatologists, and narrative describing the intent of the exercise, the
other physical and natural scientists could directly fictional or real-world client, student engagement
benefit from this exercise: Ten and 30 m DEMs time; and five to ten PowerPoint slides summariz-
are commonly used in scientific investigations ing the exercise.
to identify, map, and measure various configura-
tions of landforms and landscapes (Pike, 2002).
Knowledge extracted from these measures and REFERENCES
mappings are then used to connect form to physi-
cal and/or human process. The exercise could Blake, K. (2002). Colorado fourteeners and the
serve to train researchers in how to use common nature of place identity. Geographical Review,
geospatial tools to extract relevant information 92(2), 155–179. doi:10.2307/4140968
from freely-available DEMs. Colorado Fourteeners Initiative (CFI). (2010).
A partnership for preservation. Retrieved from
http://www.14ers.org/
STUDENT ENGAGEMENT TIME
Pike, R. (2002). A bibliography of terrain modeling
The exercise is generally completed in four to six (geomorphometry), the quantitative representa-
hours. Ideally, one to two hours working on the tion of topography (Tech. Rep. No. 02-465).
exercise should be under the direct supervision Washington, DC: United States Geological Survey.
of an instructor. This face-to-face time allows the

1203
Colorado 14ers, Pixel by Pixel

APPENDIX A:COLORADO 14ERS, PIXEL BY PIXEL:

An Exercise in Mountain Geography (Slide Presentation)

Figure 1.

1204
Colorado 14ers, Pixel by Pixel

Figure 2.

1205
Colorado 14ers, Pixel by Pixel

Figure 3.

1206
Colorado 14ers, Pixel by Pixel

Figure 4.

1207
Colorado 14ers, Pixel by Pixel

APPENDIX B: COLORADO 14ERS, PIXEL BY PIXEL (STUDENT HANDOUT)

Figure 5.

1208
Colorado 14ers, Pixel by Pixel

Figure 6.

1209
Colorado 14ers, Pixel by Pixel

Figure 7.

1210
Colorado 14ers, Pixel by Pixel

Figure 8.

Figure 9.

1211
Colorado 14ers, Pixel by Pixel

APPENDIX C: ANSWER KEY TO CLASS EXERCISE ‘COLORADO 14ERS, PIXEL


BY PIXEL’

Students are required to place four images (each with caption) and the answers to the nine questions
below on their class exercise web page. Specifically, the following are graded:

1. One image with caption showing a close-up of the saddle (with contour lines) connecting Mt.
Cameron and Mt. Lincoln

Figure 10.

2. One image with caption showing all elevations above14,000 ft. on the Alma, CO DEM

Figure 11.

1212
Colorado 14ers, Pixel by Pixel

3. One image with caption showing the results of the slope analysis of the Alma, CO DEM

Figure 12.

4. One image with caption showing the results of the variety analysis of the Alma, CO DEM

Figure 13.

1213
Colorado 14ers, Pixel by Pixel

5. Answer the following set of questions:


5 a. In terms of square meters, exactly how much area are we talking about IN ONE PIXEL on a
10 m DEM and on a 30 m DEM?
Answer: 100 square meters and 900 square meters respectively
5 b. What is the highest elevation value on the 10 m Alma, CO DEM? What is the highest eleva-
tion value on the 30 m Alma, CO DEM and why do you suppose the 10 m and the 30 m show
different maximum elevations?
Answer: The highest elevation on the 10 m DEM is 14,285.4 ft. and the highest elevation on the
30 m DEM is 14,245.4 ft. US Geological Survey 10 m DEMs were made from 30 m DEMs.
Thus, the difference in maximum elevations can be explained by the cartographic interpola-
tion process: To create finer resolution data (10 m) from coarser resolution data (30 m), the
interpolation process used by the USGS created a more smooth surface that in some cases
was forced above and below the known (more coarse) elevation data.
5 c. According to 14er ‘rules,’ one must descend 300 feet between 14ers for an adjacent 14,000 ft. +
peak to be an official 14er (Roach 1999, pg. 287). What is the vertical elevation loss between
Mt. Cameron and Mt. Lincoln? According to 14er ‘rules’ is Mt. Cameron an official 14er?
Answer: About 220 ft. No - a descent of less than 300 ft. means that Mt. Cameron is not an of-
ficial 14er.
5 d. Which areas (landforms) on your DEM show the lowest slope values (the most gentle slopes)?
Answer: The water surface in the reservoir, valley floors, and some high locations that sit well
above the level of former glaciation.
5 e. Which areas (landforms) on your DEM show the highest slope values (the steepest slopes)?
Answer: Slopes adjacent to ridges, cirque walls, and incised stream channel walls.
5 f. Based from the slope analysis, what natural/physical processes do you suppose cause STEEP
slopes and what processes do you supposed generate GENTLE slopes?
Answer: Steep slopes are likely caused by glacial erosion during last two or so glaciation periods
(e.g., Bull Lake and Pinedale), talus (scree) accumulations that reside at or near their angle
of repose, and valley incision by running water. Gentle slopes are likely caused by sediment
fill in valleys, rapid rock weathering at very high elevations (>13,000 ft.), and water surfaces
on reservoirs.
5 g. Based from a slope analysis, hillshade analysis, or variety analysis, provide three examples
of human (anthropogenic) forms visible on the DEM. Describe likely processes responsible
for shaping these features.
Answer: 1) a reservoir; containment of water, 2) narrow flat sections running along ridge tops;
foot traffic / compaction of unstable loose rock, 3) road grades; road construction, 4) mine
tailings; oversteepening of slopes from mining
5 h. From your variety analysis, what generalizations can you make that connect surface ‘variety’
and ‘landforms’?
Answer: High variety shows heterogeneity while low variety shows homogeneity. Generalizations
between variety and landforms include landform age (young = high elevation variety; old
= low elevation variety); glacial processes (glacial erosion on valley walls = high elevation
variety; glacial erosion on valley floor = low elevation variety); fluvial processes (fluvial
erosion = high elevation variety; fluvial deposition = low elevation variety).

1214
Colorado 14ers, Pixel by Pixel

5 i. Why are the contour lines different (e.g., why do the lines follow slightly different paths) for
the two different resolutions?
Answer: Contour lines on the 10 m DEM show higher sinuosity in plain view than do the contour
lines on the 30 m DEM. 10 m contour lines conform more closely to finer scale landsurface
features than do the 30 m contour lines.

This work was previously published in the International Journal of Applied Geospatial Research (IJAGR), Volume 2, Issue 2,
edited by Donald Patrick Albert pp. 17-32, copyright 2011 by IGI Publishing (an imprint of IGI Global).

1215
1216

Chapter 73
A Component-Based 3D
Geographic Simulation
Framework and its Integration
with a Legacy GIS
Zhen-Sheng Guo
Hokkaido University, Japan

Yuzuru Tanaka
Hokkaido University, Japan

ABSTRACT
There is an increasing demand for 3D geographic simulation systems. Most systems currently available are
closed and based on fixed architectures. Some systems allow us to develop and customize a 3D geographic
simulation system, but this usually requires the writing of extensive program code. Especially in 3D geo-
disaster simulations, for example, we need to dynamically integrate 2D legacy GIS with 3D geographic
simulation systems in order to investigate the details about the damaged areas and the consequences of the
disasters. The authors propose a component-based application framework for 3D geographic simulation
that can integrate a legacy 2D GIS with geographic simulation systems in a 3D visual environment. Their
framework provides a set of 3D visual components required for the development of a new interactive 3D
visual geographic simulation. In their framework, component integrators can construct 3D geographic
simulation systems by composing the 3D visual components. Moreover, the authors’ integration framework
provides two fundamental integration mechanisms, view integration and query integration mechanisms, to
integrate it with legacy 2D systems. The view integration function maps the 2D rendering of a legacy 2D
GIS onto the surface of the 3D geography used in a 3D visual geographical simulator, and then dispatches
every event on the geographic surface to the original 2D GIS. The query integration automatically converts
each 3D simulation result that is shown as a set of highlighted regions on the surface of the geography
to the corresponding regional query to the 2D GIS. The proposed framework is based on their 3D meme
media architecture in which components are represented as meme media objects, and their interoperation
is defined by slot connections between them. As a result, their framework enables users to compose 3D
geographic simulation systems and to integrate a legacy 2D GIS with a 3D geographic simulation system
simply by composing display objects in a 3D visual environment.

DOI: 10.4018/978-1-4666-2038-4.ch073

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
A Component-Based 3D Geographic Simulation Framework and its Integration with a Legacy GIS

INTRODUCTION Open Geospatial Consortium (OGC) standards


(OpenGIS) can be used to integrate different Web
Various geographic simulation systems have services through an OGC-standard Web service in-
been developed to meet the growing need for terface. This integration demands OGC-compliant
3D geographic simulation of disasters, such as Web services. However, most of the currently
avalanches, floods, and landslides. In general, available Web services are not OGC-compliant
the development of each has been primarily inde- (Alves, 2006).
pendent with closed and fixed architectures, it is This chapter proposes a component-based 3D
therefore difficult to introduce existing functions geographic simulation framework. Our frame-
of one existing geographic simulation system work provides basic composite components for
to others. Moreover, although the geographic 3D geographic simulation, and two fundamental
simulation systems generally have strong disaster legacy GIS integration mechanisms for component
simulation capability, they often lack the informa- integrators. Our framework enables component
tion necessary for analysis and evaluation of the integrators to easily build 3D geographic simula-
disaster, whereas many legacy geographic infor- tion systems by visually combining the required
mation systems (GIS) can provide visual map and components and by defining data flows among
region query functions for disaster analysis and the components. The component integrators need
evaluation. For this reason, GIS and geographic to have knowledge about the legacy systems, for
simulation systems often need to be integrated. example, knowledge about the display and query
However, in general, the development of the 3D APIs and their parameters for the legacy GIS,
geographic simulation systems and legacy GIS the input and output data formats of the legacy
have been primarily independent and integrative simulators, and the data flows among compo-
uses are still at an early stage. nents. The component integrators also need to
There are many conventional GIS, and simula- know how to combine the required components
tors have become more advanced. Because these together. The end users of this framework only
types of software are usually independent, closed need to replace some component with another of
systems, they are difficult to customize and to the same functionality, by mouse manipulation,
integrate with others. Although some systems, to carry out 3D geographic simulations integrated
such as GRASS (GRASS) and ArcGIS (ArcGIS), with a legacy GIS.
can be customized with library packages, they The component-based 3D geographic simula-
still require users to write lengthy programs. tion framework presented here incorporates the
Other systems, such as GeoVista (Takatsuka, basic components of 3D geographic simulation
2002) and AVS (Upson, 1989), provide a visual systems and two legacy GIS integration mecha-
programming environment for the development nisms into an integrated environment. This frame-
or customization of the 3D simulation systems. work provides a set of 3D visual basic components
These systems allow users to interact only with required for the development of a new interactive
their visual programs; users cannot interact with system that can integrate legacy 2D GIS and legacy
their visual output to develop a new 3D system or geographical simulators to make 3D geographic
modify their input parameter values. Component simulation systems. In the integration framework,
object model (COM) is a model for linking soft- construction of a new 3D geographic simulation
ware modules. When such a link is established, system only requires combining the necessary 3D
the modules can communicate with each other visual components. To support this framework
through standard communication interfaces. These of integrating a legacy GIS, we also propose two
links require users to be skilled in programming. fundamental integration mechanisms: view inte-

1217
A Component-Based 3D Geographic Simulation Framework and its Integration with a Legacy GIS

gration and query integration. The view integration component-based framework, the detail of the
maps the 2D rendering of a legacy 2D GIS onto the framework, and its integration mechanism with
surface of the 3D geography used in a 3D visual a legacy GIS. In section 3, we describe an appli-
geographical simulator, and then dispatches every cation of our framework. Finally, we make some
event on the geography surface to the original 2D concluding remarks.
GIS. The query integration automatically converts
each 3D simulation result that is shown as a set of
highlighted regions on the surface of the geogra- COMPONENT-BASED FRAMEWORK
phy to a corresponding regional query on the 2D
GIS. We used the 3D meme media system called This section describes the proposed component-
IntelligentBox (Tanaka, 2003; Okada, 1995) as the based 3D geographic simulation framework.
basic platform of our framework. IntelligentBox Figure 1 shows the framework, which consists
is a component-based 3D visual software devel- of three types of 3D visual components used to
opment system. In IntelligentBox, each primitive develop a 3D geographic simulation system. Our
component called a box has a special function and framework allows component integrators to use
some state holders called slots, as well as a special a mouse to combine these visual 3D components
3D shape. End users can define a functional and together to construct geographic simulation sys-
visual composition using primitive boxes and slot tems. Moreover, in our framework, the display and
connections among them. Users can construct 3D query functions of a legacy 2D GIS are integrated
applications just through direct manipulation of with the 3D geographic simulation systems. Our
visual components. Moreover, each component, framework involves three basic technologies: 3D
once created, provides a simple API for its reuse visual components, component composition, and
with other components in different applications. integration of a legacy 2D GIS with 3D geographic
The remainder of this chapter is structured simulation systems technology.
as follows: section 2 shows an overview of the

Figure 1. The component-based framework

1218
A Component-Based 3D Geographic Simulation Framework and its Integration with a Legacy GIS

Details of the 3D Visual Components graphic simulator component integrates the


functions of a legacy geography simulator into
This subsection describes the three basic com- the 3D geography simulation environment. The
ponents. The 3D terrain component visualizes legacy geographic simulator component holds the
terrain data stored in the elevation database as data that is acquired from other components in
a 3D terrain model. It also works as a provider slot #geo_data and generates an input file for the
of the terrain data to other components in the legacy geographical simulator. The data stored in
simulation environment. The geographic simu- slot #geo_data includes the terrain data and the
lation component is used for integrating legacy parameter values for the legacy simulator. We can
simulators into our simulation framework. It also specify the path of the input and output files for
calculates the necessary parameters for the geog- the legacy geographical simulator with slots #in-
raphy simulator and for visualizing the results of put and #output. The legacy geographical simula-
simulation. The query component integrates the tor obtains the input file from the path, which is
query function of the legacy 2D GIS into the 3D specified by slot #input, and sends simulation
simulation environment. In this research, we used results to the path, which is specified by slot
a type of 3D meme media called IntelligentBox #output. The legacy geographical simulator com-
as a basic system. IntelligentBox offers a series of ponent observes the update of the output file of
interactive 3D components. Each 3D component the legacy geographical simulator (specified by
has its own function and a standardized data ac- slot #output) at regular intervals. When the output
cess interface called slots. Component integrators file is updated, the legacy geographical simulator
can interactively define the parent-child relation- component reads the output file and holds the
ships between the 3D components to transfer result of the simulation in slot #result. The simu-
data between them by connecting slots. We will lation result is generally an array, each element
explain the slots and the data that are kept in each of which corresponds to a grid cell, and takes as
component. its value either 0 or 1. Finally, this component
Figure 2 shows the 3D terrain component. visualizes the result in the same 3D environment.
It has slot #terrain_data and slot #displayAPI, The parameter calculation component calculates
which hold the terrain data and the map range the necessary parameters for the geographical
parameters. The terrain data include the list of simulator and visualizes the calculation result.
the elevation data and the coordinate location of The parameter calculation component holds the
the mesh point. The 3D terrain component can data needed to calculate the parameter in slot
display a 3D terrain model in the 3D environment
using the terrain data that are saved in slot #ter-
Figure 2. 3D terrain component
rain_data. This component obtains a map image
from a legacy 2D GIS and maps the map onto
the 3D terrain surface using the necessary data
(position, size, scale) saved in slot #displayAPI.
We call the integration of the display function of
the legacy GIS with the 3D terrain component
view integration. It is explained below in detail.
Figures 3 show the geographic simulation
component. It includes a legacy geographic
simulator (geo-simulator) component and a pa-
rameter calculation component. The legacy geo-

1219
A Component-Based 3D Geographic Simulation Framework and its Integration with a Legacy GIS

Figure 3. Geographic simulation component

#geo_data and saves this calculation result in slot obtains a query result from the legacy 2D GIS
#result. Moreover, if it is necessary, this compo- and holds it in slot #result. Finally, the query
nent can visualize the result data in a 3D environ- component displays the query result (i.e., the
ment. damaged road IDs) in the 3D environment. We
Figure 4 shows the query component. It has call the integration of the query functions of the
slot #geo_data, slot #queryAPI, and slot #result, legacy GIS with the 3D geographic simulations
which hold the simulation results, query param- query integration, described in detail below.
eters, and query results respectively. The query Our framework allows component integrators
component receives the simulation result and to combine these components to construct geo-
keeps it in slot #geo_data. The simulation result graphic simulation systems. The interoperation
includes the terrain data and the simulation result of visual components is defined by slot connec-
values (for example, the damage zone data). Next, tions among them. As a result, they can be com-
the query component uses the data in slot #geo_ bined with other components by connecting their
data to generate a range parameter and calls the slots. They interoperate by exchanging messages
query function of the legacy 2D GIS with this through slot connections. Our framework also
parameter through slot #queryAPI. For example, enables end users to integrate a legacy 2D GIS
the range parameter of a grid cell in the damaged with a 3D geographic simulation system simply
area takes the value 1. Then, the query component through the composition of their display objects.

Figure 4. Query component

1220
A Component-Based 3D Geographic Simulation Framework and its Integration with a Legacy GIS

Component Composition the parameter data from the slot #result of the
parameter calculation component and holds the
In our framework, component integrators can com- parameter values in its slot #geo_data. The legacy
bine the 3D components arbitrarily as desired. To geo-simulator component uses the parameter data
combine more than one component, the processed to create simulation and holds the simulation result
data and parameters must be exchanged among in its slot #result. Finally, the query component
the components. In our approach, the data and obtains the simulation result from the slot #result
parameters can be exchanged through the slots. of the legacy geo-simulator component and holds
In addition, we use the view integration and query the simulation result in its slot #geo_data. The
integration mechanisms to integrate the display query component uses the simulation result to
and query functions of the 2D legacy GIS with generate a query for integrating the simulation
the 3D simulation environment. result with a legacy GIS and holds the query in its
Figure 5 shows an example of a composition. slot #queyAPI. The query component integrates
In this example, we combine four components: the simulation result with the legacy GIS through
a 3D terrain component, a parameter calculation the slot #queyAPI and holds the query result in
component, a legacy geo-simulator component, slot #result.
and a query component. The processed data and pa- Figure 5 shows the basic composition structure
rameters are exchanged among these components of our framework. In addition, our framework
through the slots. First, the parameter calculation allows vertical and horizontal extension. Vertical
component obtains the terrain data from the 3D extension means that we can combine other dif-
terrain component through slot #terrain_data. ferent legacy geo-simulation components or
The parameter calculation component calculates legacy GIS components with the topmost query
the parameters using the terrain data and holds component in Figure 5 to simulate a secondary
the parameter values in its slot #result. Next, the or tertiary disaster caused by the preceding dam-
legacy geographical simulator component obtains age. Horizontal extension means that we can

Figure 5. An example composition

1221
A Component-Based 3D Geographic Simulation Framework and its Integration with a Legacy GIS

combine two or more different legacy simulation 3D terrain component to the original 2D legacy
components and legacy GIS components with the GIS with an inverse transformation of the texture
same 3D terrain component to comparatively mapping process. Therefore, we can easily extend
study different disaster simulations for the same the GUI of a 2D legacy GIS to a 3D geographic
area. Moreover, we can simply change the input information system. We use the 3D shadow-copy
data of the 3D terrain component to immediately mechanism to map the 2D GUI of a legacy GIS
apply the same simulation system to another area. onto the surface of the 3D terrain model.
Such flexibility and extensibility characterizes Figure 6 shows a view integration of a legacy
our framework. 2D GIS with 3D terrain model. We call this the
3D map component. First, the 3D map component
Integrating a Legacy 2D GIS with 3D calculates the position, size, and scale information
Geographic Simulation Systems from the original terrain data that is kept in slot
#terrain_data, which is used for generating ter-
We integrated the display and query functions of rain polygons, as shown in Figure 6 (i). Next, this
a legacy 2D GIS with a 3D simulation system. component displays the appropriate part of the 2D
We call the two integrations view integration and map from the whole map by calling the display API
query integration. of the legacy 2D GIS as shown in Figure 6 (ii).
The view integration function maps the 2D Then, this component maps the clipped map onto
rendering of a legacy 2D GIS onto the surface a 3D terrain using a 3D shadow-copy mechanism
of the 3D terrain model used in a 3D visual geo- as shown in Figure 6 (iii). Finally, we obtain a 3D
graphical simulation, and it dispatches every event map as shown in Figure 6 (iv).
on the terrain surface to the original 2D GIS. We The query integration function integrates
use the 3D shadow-copy mechanism (Ohigashi, query functions of a legacy 2D GIS with 3D geo-
2005; Ohigashi, 2006) to map the 2D graphical graphic simulation systems. We propose a query
user interface (GUI) of a legacy GIS onto the integration function that uses the query API of the
surface of the 3D terrain model. Moreover, the 3D legacy 2D GIS. For example, query integration
shadow-copy mechanism dispatches every mouse automatically converts each 3D simulation result
event that occurs on the mapped surfaces of the that is shown as a set of highlighted regions on

Figure 6. Integrating the GUI of a legacy 2D GIS with a 3D terrain model

1222
A Component-Based 3D Geographic Simulation Framework and its Integration with a Legacy GIS

the surface of the geography to a corresponding APIs to develop new systems. We can call only
regional query to 2D GIS. In general, to search one query API function for retrieving the infor-
for objects in a given area, we usually integrate a mation we want. A query integration mechanism
database and use SQL to retrieve the objects from for integrating the 2D GIS query function with
the database. For instance, for a query searching 3D geographic simulation results by calling its
for a damaged cell to obtain its population saved query API is proposed.
in GisDataList (GD), the GD schema would be Figure 7 shows the integration of a legacy
2D GIS query with a geographic simulation.
GisDataList (Population: integer, CellID:integer) The right is a query component, and the left
is a legacy 2D GIS (a road map). The legacy
Moreover, the damaged cell can be referred to 2D GIS offers a query API of searching for
by its ID. Therefore, this query can be expressed objects within a specified range from some
in SQL as layer. For instance, the query function could be
SearchForObjects(Within(range), From(layer)).
SELECT GD.Population If we want to find damaged roads in damaged
FROM GisDataList GD zones, which can be obtained from a simulation
WHERE GD.CellID=ID result, we can call the query function
SearchForObjects(Within(damaged zones),
Alternatively, if the query is searching for rail- From(road layer). The query component receives
ways across damaged cells, we have to query an the simulation result and keeps it in slot #geo_data
object-oriented database with an object-oriented as shown in Figure 7 (i). Next, the query compo-
query. The schema of railway (RAILWAY) and nent uses the data in slot #geo_data to generate a
damaged cell object (DCELL) is shown as range parameter and calls the query function of
the legacy 2D GIS with this parameter through
RAILWAY (RouteID: integer, Line: slot #queryAPI as shown in Figure 7 (ii). Then,
line) the query component obtains a query result from
DCELL (CellID:integer, Cell:area) the legacy 2D GIS and holds it in the parameter
slot #q_result of slot #queryAPI as shown in
Therefore, the query can be expressed in an Figure 7 (iii). Figure 7 (iv) shows that slot #result
object-oriented query as holds the query result and the terrain data. Fi-
nally, the query component displays the query
SELECT x.Line inside (y.Cell) result (i.e., the damaged road IDs) in the 3D en-
FROM x in RAILWAY vironment as shown in Figure 7 (v). The query
y in DCELL result and terrain data are stored in slot #result to
WHERE y.Cell=DamagedCell and be used by the next simulation or query.
x.Line inter (y.Cell),

where x.Line is a line object that belongs to APPLICATIONS


RAILWAY and y.Cell is a cell object that belongs
to DCELL. We present two applications of our framework to
However, these query expressions are long demonstrate its effectiveness. The two application
and complex. Furthermore, it is possible that the systems were constructed from several compo-
database we want may not exist. On the other nents. They integrate legacy GIS functions into
hand, in many cases, the legacy GIS provide 3D simulation systems.

1223
A Component-Based 3D Geographic Simulation Framework and its Integration with a Legacy GIS

Figure 7. Integrating the query function of a legacy 2D GIS with 3D geographic simulations

An Avalanche Simulation the specified area and visualizes those slopes on


and its Disaster Query the 3D terrain surface. The avalanche simulation
component simulates the avalanche and visual-
In this section, we describe an example applica- izes the simulation result (avalanche area) on the
tion of our framework. This application shows same 3D terrain. The query component searches
an avalanche simulation and its disaster query. for damaged roads within the avalanche area by
The scenario is an avalanche hazard evaluation. integrating the query function of the legacy GIS
A risk management expert is analyzing the road and visualizing the query result in the same 3D
damage caused by an avalanche. The analysis simulation environment.
process consists of 4 phases: (1) visualizing a 3D Figures 9 and 10 show an avalanche simula-
map as the simulation base, (2) investigating the tion and its disaster query. Figure 8 (A) shows the
highly dangerous slopes where avalanches could application framework, while Figure 8 (B) shows
easily happen, (3) calculating the avalanche dam- a screenshot. The system consists of a 3D terrain
aged zones from a dangerous slope to the point component, a dangerous-slope search component,
where sliding snow would stop, and (4) searching an avalanche simulation component, and a query
for damaged roads within the damaged zones. component. Using these components, the simula-
The avalanche simulation and its disaster query tion and query are carried out in four steps.
process require four components: a 3D terrain In the first step, we use the 3D terrain compo-
component, a dangerous-slope search component, nent to visualize the terrain data held in slot #ter-
an avalanche simulation component, and a query rain_data as a 3D terrain model and use view
component. The 3D terrain component visualizes integration to visualize a legacy GIS map on the
the 3D terrain data as a 3D terrain model and dis- 3D terrain surface. The map range parameter is
plays a map of the 2D legacy GIS on its surface. It specified through slot #displayAPI as shown in
serves as the basic component for the simulation Figure 8 (A) (i). This generates a 3D map as shown
and visualization. The dangerous-slope search in Figure 8 (B) (a).
component investigates the dangerous slopes in

1224
A Component-Based 3D Geographic Simulation Framework and its Integration with a Legacy GIS

Figure 8. An application for an avalanche-simulation system

Figure 9. the avalanche simulation with CONEFALL

1225
A Component-Based 3D Geographic Simulation Framework and its Integration with a Legacy GIS

Figure 10. An application for a flood simulation system

In the second step, we use the dangerous-slope (ii). It is generally known that slopes with angles
search component to look for the high-danger between 35 and 40 degrees are prone to avalanches
slopes where avalanches could easily happen. (K. Maeno, 2000). Therefore, the dangerous-slope
In this process, the dangerous-slope search com- search component calculates each slope angle. As
ponent needs 3D terrain data and a 3D map to shown in Figure 8 (A) (iii), the grid cells of slope
visualize the dangerous slopes. Then, we combine angles between 35 and 40 degrees were set as 1
the dangerous-slope search component with the and others were set at 0. The dangerous-slope
3D terrain component (shown in Figure 8 (B) (1)). search component holds the calculation result data
The dangerous-slope search component obtains and terrain data in slot #result. This component
the terrain data from the 3D terrain component visualizes the dangerous slopes as red cells on the
through slot #terrain_data shown in Figure 8 (A) 3D map shown in Figure 8 (B)(b).

1226
A Component-Based 3D Geographic Simulation Framework and its Integration with a Legacy GIS

In the third step, we use the avalanche simu- visualized on the 3D map as shown in Figures
lation component to simulate and visualize the 11 (iv) and (v). Moreover, the terrain data is also
avalanche damaged zones that would occur if the held in slot #result and can be used for the next
snow slid from a dangerous slope to a predicted simulation or query.
stop point. Here, we used a legacy simulator In the fourth step, we use the query component
called CONEFALL (Jaboyedoff) for simulating to search for and visualize the damaged roads
the damaged zones. CONEFALL is designed to within the damaged zones. This query process
estimate the zones prone to rock falls. It works on needs the damaged zones data and terrain data
the same principle as the avalanche simulation. In to generate query parameters. We combine the
this process, the avalanche simulation component query component with the avalanche simulation
needs to communicate with the legacy simulator component to find the damaged roads in the dam-
CONEFALL by exchanging files. It also needs aged zones as shown in Figure 8 (B) (3). The query
the dangerous slope and 3D terrain data as pa- component obtains the damaged zones from the
rameters. Therefore, we combine the avalanche avalanche simulation component shown in Figure
simulation component with the dangerous-slope 8 (A) (iv), and the damaged zones are used to
search component (shown in Figure 8 (B) (2)). generate the query parameters of the legacy GIS.
As shown in Figure 8 (A) (iv), the avalanche The query parameters are held in slot #queryAPI.
simulation component obtains the dangerous Using this slot, we integrate the query function
slope and terrain data from the dangerous-slope of a legacy 2D GIS with the query integration
search component through slot #geo_data and slot mechanism to search for damaged roads in the
#result. The files are exchanged with CONEFALL damaged zone, as shown in Figure 8 (A) (vi). The
through slot #input and slot #output. The result query result (damaged roads) is held in slot #result.
of this simulation is visualized on the 3D map as This component displays the query result in the
shown in Figure 8 (B) (c). 3D environment as shown in Figure 8 (B) (d).
Figure 9 shows the avalanche simulation with The simulation result is usually represented as
CONEFALL. The simulation is carried out as fol- a set of grid cells (as shown in Figure 9 (B)). The
lows. First, the avalanche simulation component point (X00min,Y00min) is the lower-left corner
obtains the terrain data and the dangerous-slope of cell 00, while the value d is the side length of
list from the dangerous-slope search component one cell. The query functions of legacy 2D GIS are
and keeps these data in slot #geo_data as shown usually specified as a vertex coordinate pair, i.e.,
in Figure 9 (A) (i). Then, the avalanche simulation the lower-left corner and the upper-right corner
component generates an input I/O file using the of a damaged cell, as a range parameter. We can
data held in slot #geo_data and saves this file in a calculate the vertex coordinate pair (Xijmin,Yijmin)
path specified via slot #input shown in Figure 9 (A) and (Xijmax,Yijmax) of any cell ij by the formulae
(ii). After that, CONEFALL simulates the damaged (Xijmin,Yijmin) = (X00min + i × d, Y00min + j
zones using the file, and sends the result file of × d) and (Xijmin,Yijmin) = (X00min +(i+ 1)×
its simulation calculation to a specified file path. d, Y00min + (j+1) × d). We can convert all the
The output of CONEFALL (simulation result) is damaged cells to a coordinate pair set in this way.
a list of 1s and 0s in which the 1s correspond to The coordinate pair set can be used as the range
the damaged zones. Then, the avalanche simula- parameter of the legacy 2D GIS query function.
tion component reads in the simulation result file Therefore, a query component can use the coor-
from the path specified in slot #output, as shown dinate pair set as a range parameter for calling the
in Figure 9 (A) (iii). Finally, the simulation result query function of the legacy 2D GIS to find objects
is held in slot #result and the simulation result is in them. For instance, we search for damaged

1227
A Component-Based 3D Geographic Simulation Framework and its Integration with a Legacy GIS

roads, those roads overlapped with the damaged rain_data as a 3D terrain model and to visualize
cells, by calling the query function SearchForO a legacy GIS map on the 3D terrain surface with
bjects(Within((Xijmin, Yijmin),(Xijmax,Yijmax)), view integration, as shown in Figure 10 (A) (i).
From(road layer)). The map range parameter is specified through slot
#displayAPI. This generates a 3D map, as shown
A Flood Simulation and in Figure 10 (B).
its Disaster Query In the second step, we use the flood simula-
tion component to calculate the flooded zone.
Using our framework, we can perform a different In this process, the flood simulation component
simulation simply by changing the simulator com- needs 3D terrain data. Therefore, we combine
ponents. In this section, we show a flood simulation the flood simulation component with the 3D ter-
application and its disaster query using the same rain component shown in Figure 10 (B) (1). The
3D terrain and query components. The scenario flood simulation component obtains the terrain
is a flood hazard evaluation. A risk management data from the 3D terrain component through slot
expert is analyzing the road damaged that would #terrain_data and keeps it in slot #geo_data as
be caused by a flood. The analysis consists of three shown in Figure 10 (A) (ii). The flood simulation
phases: (1) visualizing a 3D map as the simulation component changes the height of the water surface
base, (2) calculating the flooded zone, and (3) based on the simulation results, taking into account
searching for damaged roads within the flooded the total amount of rainfall. It enables users to vi-
zone. The flood simulation and its disaster query sually find the submerged areas of the simulation
process require three components: a 3D terrain as shown in Figure 10 (B). The flood simulation
component, a flood simulation component, and result data is a set of 0s and 1s. The cells under
a query component. The 3D terrain component the water surface are set as 1, as shown in Figure
visualizes the 3D terrain data as a 3D model and 10 (A) (iii). So that another query or simulation
displays a map of 2D legacy GIS on its surface. It process can use the same terrain data, the flood
serves as the basic component for the simulation simulation component keeps the simulation result
and visualization. The flood simulation component data and terrain data in its slot #result.
simulates the flood and visualizes the simulation In the third step, we use the query component
result (water surface). The query component to search for damaged roads within the flooded
searches for damaged roads within the flooded zone. In this process, the query component needs
zone by integrating the query function of a legacy the data for the flooded zone and the 3D terrain.
GIS and visualizing the query result in the same Therefore, we combine the query component
3D simulation environment. with the flood simulation component as shown
Figure 10 shows a flood simulation and its di- in Figure 10 (B) (2). As shown in Figure 10 (A)
saster query. Figure 10 (A) shows the component- (iv), the query component obtains the data for the
based application framework, while Figure 10 (B) flooded zone and the terrain from the flood simu-
shows a screenshot of the simulation and the query lation component through slot #result and keeps
evaluation. This system consists of a 3D terrain it in slot #geo_data. When the query component
component, a flood simulation component, and a receives these data, it searches the flooded zone
query component. Using these components, the contained geometrically in its cuboids’ shape, and
flood simulation and its disaster query are carried the flooded zone is used as a condition of a query
out in three steps. to the legacy GIS. We integrated the query function
In the first step, we use the 3D terrain compo- of the legacy 2D GIS with the query integration
nent to visualize the terrain data held in slot #ter- mechanism as shown in Figure 10 (A) (v). As

1228
A Component-Based 3D Geographic Simulation Framework and its Integration with a Legacy GIS

shown in Figure 10 (B), the query component of a legacy GIS with the simulation system, we also
displays the flooded road IDs (the roads within propose two fundamental integration mechanisms:
the flooded zone as shown in Figure 10 (A) (vi)) view integration and query integration mechanisms.
in the 3D environment. The view integration technology can integrate the
In addition, using our framework, we can GUI of a legacy 2D GIS on the surface of the 3D
perform the following process simply. We can terrain used in a 3D visual geographical simulator
interactively locate the desired areas by moving and then dispatch every event on its surface to the
the query component or by changing the size of original 2D GIS. The query integration technology
the component. To investigate another area of the can integrate the query function of a legacy 2D
3D terrain, we need simply to change the input GIS by automatically converting each 3D simula-
data of the 3D terrain component. To search for tion result to a corresponding regional query on
different objects in the same area, we can also a 2D GIS. As a result, our approach enabled us
combine another query component with the same to construct 3D geographic simulation systems
simulation component at the same time. interactively, moreover, to integrate the GUI and
the query function into a 3D simulation environ-
ment simply through the composition with their 3D
CONCLUSION display objects by direct manipulation on a com-
puter screen. This framework also allows users to
In this chapter we have proposed a new component- integrate a legacy GIS with a legacy geo-simulator
based 3D geographic simulation framework. Using that is independently developed and not assumed
this framework, component integrators can easily to integrate with it.
develop 3D geographic simulation systems just In this chapter we have proposed an integra-
by combining the required components and by tion technique for legacy geographic simulators
defining data flow among these components. The whose I/O text file formats are open to the public.
end users of this framework only need to replace However, in a future, it will become necessary to
some component with another of the same func- propose a new integration technique for various
tionality, using mouse manipulation, to carry out a other simulators that do not satisfy this condition.
3D geographic simulation integrated with a legacy To cope with such integration it is necessary to
GIS. Our framework incorporates the fundamental provide different query components for different
components of 3D geographic simulation systems GIS APIs. The method of dynamic API slots may
and two legacy GIS integration mechanisms into an become necessary. Such a solution should provide
integrated environment. The framework provides a wizard system to help users to semi-automatically
a set of 3D visual components required for the create the required slots from the API list of the
development of a new interactive system that can legacy system.
integrate legacy 2D GIS and legacy geographical
simulators to make 3D geographic simulation sys-
tems. The visual component technology provides REFERENCES
a set of 3D visual components required for the
development of new interactive 3D visual geo- Alves, L. L., & Davis, C. A., Jr. (2006) Interoper-
graphic simulations integrated with GIS functions. ability through Web services: evaluating OGC
In the proposed framework, constructing a new 3D standards in client development for spatial data
geographic simulation system only requires the infrastructures. VIII Brazilian Symposium on
component integrators to combine the necessary GeoInformatics, 2006, INPE,193–208. ArcGIS.
3D visual components. To support the integration Retrieved from http://www.esri.com/products.html

1229
A Component-Based 3D Geographic Simulation Framework and its Integration with a Legacy GIS

GRASS (Geographic Resources Analysis Support Okada, Y., & Tanaka, Y. (1995). IntelligentBox: A
System). Retrieved from http://grass.itc.it/ constructive visual software development system
for interactive 3D graphic applications. Proc. of
Jaboyedoff, M. & Labiouse. V. CONEFALL: a
Computer Animation, 95, 114–125.
program for the quick preliminary estimation
of the rock-fall potential of propagation zones,. OpenGIS Standards and Specifications. Retrieved
Computer & Geosciences. from http://www.opengeospatial.org/standards
JCMA.(1988). Protection Against Snow Hand- Takatsuka, M., & Gahegan, M. (2002). Ge-
book. oVISTA Studio: a codeless visual programming
environment for geoscientific data analysis and
Maeno, K. & Fukuda, M. (2000). Avalanche and
visualization. Computers & Geosciences, 28(10),
Snowstorm.
1131–1144. doi:10.1016/S0098-3004(02)00031-6
Ohigashi, M., Guo, Z. S., & Tanaka, Y. (2006).
Tanaka, Y. (2003). Meme Media and Meme
Integration of a 2D legacy GIS, legacy simula-
Market Architectures: Knowledge Media for
tions, and legacy databases into a 3D geographic
Editing, Distributing, and Managing Intellec-
simulation, SIGDOC ’06: Proceedings of the 24th
tual Resources. IEEE Press, John Wiley & Sons.
Annual Conference on Design of Communication,
doi:10.1002/047172307X
pp.149–156.
Upson, C., & Faulhaber, J. Thomas, Kamins,
Ohigashi, M., & Tanaka, Y. (2005). Shadows on
D., Laidlaw, D.H., Schlegel, D., Vroom, J., Gur-
3D: Hosting 2D legacy applications into a 3D
witz, R. & van Dam, A. (1989). The application
meme media environment. In Leipziger Informa-
visualization system: A computational environ-
tik-Tage 2005(LNI 72, pp. 401–410).
ment for scientific visualization. IEEE Com-
puter Graphics and Applications, 9(4), 30–42.
doi:10.1109/38.31462

This work was previously published in Interdisciplinary Advances in Adaptive and Intelligent Assistant Systems: Concepts,
Techniques, Applications, and Use, edited by Gunther Kreuzberger, Aran Lunzer and Roland Kaschek, pp. 63-82, copyright
2011 by Information Science Reference (an imprint of IGI Global).

1230
1231

Chapter 74
Collaborative Mapping and GIS:
An Alternative Geographic
Information Framework

Edward Mac Gillavry


Webmapper, The Netherlands

ABSTRACT
The collection and dissemination of geographic information has long been the prerogative of national
mapping agencies. Nowadays, location-aware mobile devices could potentially turn everyone into a
mapmaker. Collaborative mapping is an initiative to collectively produce models of real-world loca-
tions online that people can then access and use to virtually annotate locations in space. This chapter
describes the technical and social developments that underpin this revolution in mapmaking. It presents
a framework for an alternative geographic information infrastructure that draws from collaborative
mapping initiatives and builds on established Web technologies. Storing geographic information in
machine-readable formats and exchanging geographic information through Web services, collaborative
mapping may enable the “napsterisation” of geographic information, thus providing complementary
and alternative geographic information from the products created by national mapping agencies.

INTRODUCTION govern the territory. Away from home, maps were


an instrument for colonisation, when Africa and
Since the Enlightenment, mapping and the pro- Asia were split among the European nation-states.
duction of geographic information have been During the last few decades, there has been
institutionalised: the map is the power. At home, rapid democratisation of geographic information
maps were used as an instrument for nation build- and maps. Sawicki and Craig (1996) distinguish
ing as nation states emerged: a legitimisation three ways in which this movement is apparent.
device (McHaffie, 1995). People learned about First, the locus of computing power and data access
their country and administrations needed a tool to is broadening. Second, the level of skills to turn
raw geospatial data into geographic information
DOI: 10.4018/978-1-4666-2038-4.ch074 has become less demanding. Third, the locus of

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Collaborative Mapping and GIS

applications has moved closer to the citizenry. requiring new visualisations as the screen size,
Geographic information systems moved from resolution, and use patterns differ significantly.
mainframes and the UNIX operating system onto Collaborative mapping is an initiative to col-
personal computers and the Windows operating lectively create models of real-world locations
system. From research and government, GIS online that anyone can access and use to virtually
spread into the business sector. The PARC Xerox annotate locations in space (McClellan, 2003). The
Map Server and Virtual Tourist brought maps to value of the annotations is determined by physi-
everyone’s PC in the late 1990s, followed by online cal and social proximity, the former expressed
map Web sites such as MapQuest and Multimap. in distance, the latter in “degrees of separation.”
In 1997, Brandon Plewe noted that “the Internet Thus, the informational value and the pertinence
holds promise for exponential increases in the of spatial annotations is not only dependent on
efficiency and effectiveness of the ways in which physical distance, but also dependent on the
we obtain, use and share geographic information trust relationship between individuals or groups
in all its forms” (Plewe, 1997). In July 2002, 7.1 of people through social networks: the “Web of
million European users visited one of the many Trust” (Espinoza, Persson, Sandin, Nystrom,
online map Web sites (Nielsen//NetRatings, 2002). Cacciatore, & Bylund, 2001).
Google Maps, introduced in February 2005, However, there is a discrepancy between physi-
reached almost 1.7 million visitors in that month cal and social proximity. Privacy and personal
(Buchwalter, 2005). freedom become highly important issues when
Although maps are more widely used than ever, one’s location is related to their social behav-
the production of geographic information, and iour. On the other hand, the fear of surveillance
especially mapping, is still highly concentrated that accompanies positioning is already gradu-
among national mapping agencies and the GI in- ally reducing in society (Ahas & Mark, 2005).
dustry. But this oligarchy is soon to be dissolved, Furthermore, this discrepancy can be mediated
for we see the third aspect of the democratisation by users themselves by storing annotations and
of geographic information–the locus of applica- tracks locally, thus creating distributed reposito-
tions moving closer to the citizenry–becomes ries, and by explicitly setting the level of privacy
apparent now that location-aware mobile devices on each of these annotations and tracks. Finally,
are coming within everyone’s reach. GPS units users themselves remain in control of their so-
are not only available to surveyors anymore, as cial identification–their preferences and social
cheaper devices are sold for outdoor recreation. network–while they make use of collaborative
Also, small GPS antennae can communicate mapping services, whereas, for example, the so-
with other devices over Bluetooth, and there cial positioning method aggregates these social
are already mobile phones and personal digital characteristics to study the space-time behaviour
assistants (PDAs) for the consumer market that of society (Ahas & Mark, 2005). Collaborative
have GPS-chips built in. mapping services are therefore less pervasive in
At the same time, digital maps have become the privacy of their users because users negotiate
portable. Various mobile phone operators have the trade-off between the benefits of the service
started to deliver location-based services to mobile and their privacy concerns.
devices. Mobile phones come with route planning Collaborative mapping is to location-based
applications, thus making in-car navigation sys- services (LBS) what blogging is to the Web: a
tems redundant. Maps are not only delivered to mechanism for authoring one’s own location-
the desktop, but also to mobile phones and PDAs, based experiences (Bleecker, 2005). The notion
of “social proximity” as a measure of relevancy

1232
Collaborative Mapping and GIS

of geographic information sets collaborative Governments spend billions to create large-scale


mapping apart from corporate LBS-providers. geographic data to fulfill their civil and military
Requesting the nearest Italian restaurants, the responsibilities, and only governments can afford
service not only takes into account the distance the hardware and software, as well as the labour
to the restaurant from the current position of the costs of trained personnel involved in creating
user, but it also navigates his/her social network large-scale geographic data (Harris et al., 1996).
to find a suitable result. People are willing to Nevertheless, geographic data created by US
travel farther because the particular restaurant is government agencies such as the US Geological
a favourite haunt of one of their relations. Fur- Survey (USGS) and the Census Bureau is available
thermore, geographic information is not merely copyright free and made available at the nominal
broadcast to users, but users can actively contribute cost of copying it, whereas in most other countries,
to the service: geographic information flows back geographic data has to be paid for and is subject
to the service. A collaborative mapping service to strict copyright laws.
then works like a spatially enabled notebook or Since the market position of national map-
message board, depending on the privacy settings ping agencies (NMAs) is changing rapidly, their
people attach to their postings. budgets being cut and status changed, their geo-
Collaborative mapping may even become a graphic databases have become their main capital
vehicle for community-based participatory plan- assets. In the new economy, the digital elevation
ning. As users share their thoughts, opinions, and model and digital line graphs are the new currency
ideas about physical places, local knowledge, (McHaffie, 1996). To survive in their new role,
community needs, and specific social histories they have to protect their assets with copyright
may thus be appreciated and incorporated into the restrictions and inhibiting pricing structures.
process (Harris, Weiner, Warner, & Levin, 1996). Whereas other government agencies and academic
In this way, the necessary information is produced institutions in these countries obtain geographic
in an operation whereby the local knowledge aris- data at a discounted rate, nonprofit organisations,
ing from the social narratives, the mental map, is (small) businesses, and individual citizens have
converted into data within a GIS for research and found these protective measures inhibiting. Not
policy formulation. only pricing and copyright put up barriers. As the
This chapter aims to describe the various de- GI industry is evolving rapidly, patents have been
velopments that have led to the necessity and fea- granted to various key players in the market. This
sibility of collaborative mapping. Based on current practice is a delicate trade-off between encour-
practice, it goes on to describe an approach for an aging innovation and encouraging monopolies.
alternative geographic information infrastructure Collaborative mapping has therefore become a
that draws from collaborative mapping initiatives practice to create copyright-free geographic data
and builds on established Web technologies. for free and to bring together existing freely avail-
able sources of geographic data.

BACKGROUND Positioning Technology

Free Geographic Data There are various developments that have fa-
cilitated the viability of collaborative mapping.
Collaborative mapping as a practice to create First of all, the democratisation of mapping has
geographic data was born out of the lack of af- not only brought maps to everyone’s desktop or
fordable and copyright-free geographic data. hand-held device, the availability of position-

1233
Collaborative Mapping and GIS

Figure 1. Various trends drive the uptake of collaborative mapping

ing technology contributes to this trend as well. Location Awareness


GPS devices used to be the domain of surveyors
and geodetic engineers, but nowadays there are Publishing on the Web has changed over the last
many cheap GPS devices available for outdoor few years. There is no need to have your own
enthusiasts, which become smaller and smaller. Web server or even know how to code HTML:
As of 1 May 2000, selective availability has been nowadays everyone can publish on their Weblog.
turned off so public users can locate themselves These Web based publications consist primarily
using GPS with “military precision.” of periodic articles in reverse chronological order.
With Bluetooth, separate GPS devices commu- Admittedly, most early Weblogs were updated
nicate with palmtops and mobile phones. Recently, manually, but tools to automate the maintenance
there are palmtops and even mobile phones that of such sites made them accessible to a much
come with a built-in GPS antenna. Furthermore, larger population.
the Enhanced 911 (E-911) US federal mandate Not only Weblog authors (bloggers) writing
requires mobile phone operators provide precise about similar topics started to form communities,
location information of mobile devices within 50 but there also started to appear listings of Weblogs
to 300 meters. The ability to establish one’s loca- that were from the same country (e.g., the now
tion is unprecedented. The data that is obtained defunct Gblogs) or even located in the same city
from location-aware devices indicates the actual (BrightonBloggers). The next step came when
location and movement of people, with increas- NYCBloggers (http://www.nycbloggers.com/)
ing precision in real-time (Ahas & Mark, 2005). and London Bloggers (http://londonbloggers.
These location-aware devices turn everyone iamcal.com/) started indexing bloggers by their
into a producer of geographic information, thus nearest subway stop or tube stop respectively.
bridging the cartographic labour process. Ever Mapping the virtual space of the blogosphere on
since the first differentiation of cartographic to the physical one, these listings put a sense of
labourers that occurred with the invention and place back into cyberspace.
adoption of the printing press, when compila- In a final step, the positional accuracy increased
tion and reproduction of the map were separated from country, town, and subway stop to the level
(McHaffie, 1996), mappers are “slogging” into of geographic coordinates, while the geographic
the messy reality of the field again in order to extent increased to cover the world with GeoURL
produce the “map.” (http://geourl.org/). Bloggers are encouraged to

1234
Collaborative Mapping and GIS

add HTML meta tags to their Web pages to inform means of networking in virtual communities, and
visitors about their location: became popular in 2003 with the advent of Web
sites such as Friendster, Tribe.net and LinkedIn.
<meta name=”ICBM”conte While Friendster allows members to chart their
nt=”52.09022,5.08159”> network of friends, LinkedIn aims at profession-
<meta name=”geo.position”conte als to keep track of their business contacts. The
nt=”52.09022;5.08159”> popularity of these sites rapidly grew, and major
<meta name=”DC.title” companies have entered the Internet social net-
content=”Webmapper: what the map can working space.
be.”> Whereas these online social networks require
visitors to subscribe to a particular network before
GeoURL then spiders the Web and indexes using the service, and people end up subscribing
Weblogs that have these meta tags, and puts pins to all these Web sites, thus becoming a member
on the world map indicating the locations of all of all separate networks. XFN and FOAF are
the Weblogs. Some bloggers travel regularly, and distributed networks. People define their own
Weblogs can be updated using mobile devices, networks and can identify their online friends
so-called moblogging. Location thus becomes and the nature of their relationship. As FOAF is
another means to structure Weblog entries. Also, based on RDF, it can be mixed with other RDF
bloggers frequently write about locations they vocabularies, so resources can be shared among
are visiting. Developers have created extensions networks of people. Thus, sharing information
for popular blogging tools such as Movable Type among a group of people based on the nature of
for authors to attach a location to their Weblogs relationships is a feasible thing.
and individual postings, and to keep track of their
travels, nearby Weblogs, and the locations of their Open Source GIS
visitors. Other tools such as IndyJunior (http://
www.bryanboyer.com/indyjunior/) and World66 The final trend contributing to the birth of col-
(http://www.world66.com/) allow bloggers to laborative mapping is the adoption of open source
show their travels and the countries visited. The in the field of GIS. The IT industry has a long
World as a Blog (http://brainoff.com/geoblog/) history of open source projects, from operating
shows the locations of recently updated We- systems, Web servers, and programming languages
blogs. All in all, Weblogs have certainly raised to applications. Only recently, the GI industry
the awareness of location as a means to structure is catching up, with open source databases that
information sources. support spatial features, for example, MySQL and
PostGIS. There are several libraries, for example,
Online Communities to convert between common spatial data formats,
or to change projections that are included in vari-
Online communities have been around since the ous open source projects. GRASS has become
start of the Internet. More and more, it is not only a well-known open source desktop GIS. For
important to advocate that you belong to a certain online GIS applications, there are various open
community, but also who you know in various source Web-mapping servers such as Deegree,
communities. Search engines rate Web sites based GeoServer, and the UMN MapServer. Web clients
upon the number of hyperlinks that link to a Web such as Chameleon and ka-Map allow developers
site. Online social networks started appearing in to build compelling interfaces to work with maps
2002, when the term was used to describe the in your Web browser. Most of the architectures

1235
Collaborative Mapping and GIS

are built around OGC (open geospatial consor- them for a week. During that time, their locations
tium) standards such as the Web Feature Server were uploaded in real time to a Web server and
and Geography Markup Language specifications. beamed onto a screen in one of the exhibition
Online vector-based mapping used to be the rooms. Visitors of the exhibition saw a new map
domain of proprietary plug-ins from leading GIS of Amsterdam gradually appear as the individual
vendors. As the XML-based scalable vector graph- GPS tracks joined up and started to overlap.
ics (SVG) file format gains popularity, vector- These base maps typically consist of road
based mapping is not anymore the monopoly networks in urban areas that have been traced
of the GIS industry and government agencies. moving around the area with a GPS device. Most
Many individual Web developers have started community mapping projects seem to follow
to create compelling vector-based Web-mapping four steps: collection, editing, augmentation,
applications. and distribution (Walsh, 2005). In the following
With geographic data being transferred based paragraphs, these steps of community mapping
upon open standards and clear interfaces, it is easy projects are discussed. Although the paragraphs
for developers to combine various data sources are provided in this sequence, community map-
and make interesting Web-mapping applications, ping is an iterative process. The steps are regularly
with no resources spent on buying software. iterated as new data becomes available, and these
steps may take place concurrently.

COLLABORATIVE MAPPING Collection of Base Map


Data and GPS Tracks
Several projects are underway to collect geo-
graphic data for the creation of base maps. Amster- There are several ways to kick-start a collabora-
dam RealTime (http://www.waag.nl/realtime/), tive mapping project. One approach is to scan
conceived as an art project, has been a trigger for and georeference out-of-copyright paper maps
many people to realise the potential of collabora- or global satellite imagery, and incorporate these
tive mapping. As part of an exhibition on historical as a background to provide a geographic context
maps of Amsterdam in late 2002, visitors were for the project. The advantage of global satellite
invited to participate and take a GPS device with imagery over scanned paper maps is the flexibility

Figure 2. The iterative collaborative mapping process consists of four steps

1236
Collaborative Mapping and GIS

in the projection. Opting for scanned paper maps, pedia. GPS tracks are converted into PostScript
there is not this flexibility, as the map image may files that are merged, using a drawing package,
become distorted when changing the projection. into a GIF file that is put on the Web site. Apart
Another approach is to use readily available from showing the road networks in some of the
vector-based geographic data, having the flex- larger towns in Britain, the Web site allows visi-
ibility of projecting the data while maintaining the tors to enter information about pubs and shops,
sharpness of vectors. For example, the TIGER/ with personal reviews.
Line data from the US Bureau of the Census pro- Whereas Geowiki uses a rather onerous produc-
vides a good source for geographic data to start a tion process to convert the GPS tracks into a base
community mapping project. As discussed previ- map, recent initiatives build on the MapServer
ously, copyright restrictions may prevent projects Open Source WebGIS package to serve online
from incorporating these sources. For the Mumbai maps on the Web. The MapServer was set up in
Free Map project (http://crit.org.in/projects/gis), compliance with standards from the OGC, par-
the Collective Initiative Research Trust (CIRT) ticularly the Web feature server with transactional
builds the base map from the cumulative basis extension (WFS-T) to support not only serving
of the urban research, design, and development maps on the Web, but also to edit the maps re-
studies and interventions conducted previously motely. The London Free Map (http://uo.space.
by CIRT. Thus, CIRT develops an open-access frot.org/freemap/) initiative finds pedestrians,
spatial data infrastructure, and a set of simple tools bikers, and skaters in London volunteering to
and applications localised in Indian languages, collect GPS tracks. Of course, these data are dif-
for knowledge transfer and participatory urban ficult to integrate with the road networks for car
planning by communities and citizens in Mumbai. navigation, as these tracks are not representative
The third approach to collect base map data is for cars. However, as personal navigation becomes
the most labour intensive. This low-level approach more important, it is useful to have navigation
requires a large group of community members to systems that include cycle tracks and walk paths.
travel the area to be mapped with a GPS receiver, The London Free Map cooperates closely with
and follow the tracks, paths, and streets to trace the Open Street Map (http://www.openstreetmap.
the network. On their travels, members may add org) initiative. Steve Coast bikes around London
additional waypoints for landmarks in the area. carrying a GPS device (Dodson, 2005). Coming
Returning from these surveys, the community home, he downloads the GPS tracks onto his lap-
members upload their GPS tracks over the Web top to subsequently make them available online
to a central server, or equally make their tracks to share the tracks with others. Street by street, a
available online for others to harvest. Previously copyright-free map of London is built.
collected GPS data created while engaging in a
game of geocaching or as a form of GPS-drawing, Editing Waypoint Data
can be equally made available for integration.
In the United Kingdom, GPSdrawing (http:// This step involves the actual creation of the base
www.gpsdrawing.com/) uses a GPS receiver to map from the raw GPS tracks previously collected.
make pictorial drawings while moving around. First, it has to be decided which tracks or individual
Conceived as an art project, base mapping is not GPS points to consider for inclusion in this step.
the objective here, but the accumulation of GPS Next, geometric and statistical algorithms have
tracks makes a nice base map. Geowiki (http:// to be applied to decide when several very close
www.geowiki.com/) is a cartographic variation lines are actually the same line and thus need to be
of the interactive encyclopaedia Web site, Wiki- averaged together, or are actual parallel segments.

1237
Collaborative Mapping and GIS

Intersections have to be modelled into junctions at Edelkamp and Schrödl describe a “map refine-
which tracks have to be split in segments to ensure ment process” to create a traversable road network
topology. Other points may belong to traffic situ- from GPS tracks (Edelkamp & Schrödl, 2003).
ations such as squares, roundabouts, and parks. In the context of normalising and synthesising
Quantitative mechanisms relying on geometric GPS tracks, the road segment clustering stage
algorithms, or qualitative mechanisms requiring and the road centreline generation stage of the
human intervention, may decide which GPS tracks map refinement process are relevant. The road
are relevant at this stage in the process. Geometric segment clustering stage identifies segments that
algorithms may filter out GPS points that have are common between different GPS tracks and
a significantly different accuracy from the rest. locates intersections of these segments, based on
This may be important in areas with bad satellite the contiguity information and temporal order of
reception such as urban canyons, or at the start of the GPS points. The road centreline generation
a GPS track due to cold start: sometimes it takes creates a geometric construct that captures the road
rather long for a GPS device to establish con- geometry. The centreline is generated from a set
nection with an appropriate number of satellites. of GPS points using a weighted least-squares fit.
Equally, GPS points, tracks, or repositories can The relative weight of each GPS point is derived
be included or excluded from the GPS tracks that from the measurement error, which is usually
are the input for the base map based on qualita- available from the GPS receiver.
tive mechanisms (Shorter, 2005). There may be
a hierarchy of users, with trusted users given a Annotation of Geometry with
higher moderation status based on the quality of Real-World Semantics
their contribution to the community, to review con-
tributions. To ensure community members remain This step ensures the geometry of the network
interested and help reviewing, RSS feeds of latest makes sense for people that use the network to
updates and additional repositories of GPS data locate themselves and navigate around the area.
can be made available to trigger people to help The driving restrictions at one-way streets and
moderate the GPS tracks. Moderators and com- turning restrictions at the intersection of segments
munity members can discuss among themselves have to be added by ground truthing: surveying
about the value of GPS repositories on discussion the actual location to interpret the data correctly.
forums that are provided at a central server, or that Surveyors go out with a printed base map to an-
can be left as comments on distributed repositories notate and upload annotations at a later time, or
and be amalgamated using RSS readers. may well carry PDAs connected to a WFS-T map
As a save approach, GPS data is not avail- server and add annotations on the fly.
able for postproduction editing unless one com- The Downhill Map of Bristol (http://locate.ira-
munity member has reviewed the contribution. tional.org/bristol_map/) is a map for skateboard-
The strength of collaborative mapping is that ers, cyclists, and walkers. Artist Heath Bunting
relationships within the community can be made and the Bristol Skate Survey group surveyed the
explicit. Therefore, members may only want to streets of Bristol, the road surface quality, and the
include GPS tracks and landmarks in their base locations of fruit trees. Like Geowiki, the Downhill
map that were left by people within their personal Maps of Bristol incorporates landmarks along the
network of friends. Finally, previous moderations street network created. This initiative not only
of the GPS data remain available for revision of simply records the longitude and latitude, but
the individual member, and they can roll back also maps the attributes of the map object such
moderations on request, just like a wiki. as height and road surface quality. From this data,

1238
Collaborative Mapping and GIS

a functional online route planning service can be This strategy not only applies to point loca-
created for cyclists, skateboarders, and skaters, tions, but to streets segments and areas as well.
which no national mapping agency would be able The Neighborhood Project (http://hood.theory.
to offer (Albert, 2004). The Dutch GeoSkating org/) uses the TIGER/Line US Census data for
project (http://www.geoskating.com/) aims to cre- mapping city neighbourhoods based on the col-
ate a similar map of the outskirts of Amsterdam, lective opinions of Internet users. Addresses and
uploading the GPS tracks and notes on road surface neighbourhood data collected from housing posts
quality to a Web server in real time. on Craigslist are translated into latitude and lon-
Not only navigation attributes such as road gitude values and drawn on the map. The blobs
type, turn restrictions, and one-way streets are col- on the map thus give a notion of the spatial extent
lected in this step, but also toponymy is important of neighbourhoods derived from the collective
here: the geographic names of roads, streets, and mental map of Internet users.
landmarks. In the context of collaborative mapping
projects, geographic names are collected using Distribution
the notion of “consensus-based location,” a term
coined by Alasdair Turner at the PLAN meeting Once the geometry of the street network and the
in London, early 2005. The name of a location is locations of landmarks have been created, they
established through consensus among a group of have to be made available to a wide audience,
people. It is not determined a priori, but evolves without copyright or financial restrictions. In the
as people in a community add new annotations in previous steps, various projects were highlighted,
response to the needs of the moment. with the aim to build geographic databases along.
For example, in the GeoNotes service (Espi- While they may coexist to make up a distributed
noza et al., 2001), a “location tag” determines the repository of GPS tracks, it may be advisable
exact location of an annotation in the service in to cooperate, to ensure GPS tracks from these
case the accuracy of the positioning technology of repositories can be easily incorporated into any
either the creator or the reader is too coarse. Thus, of the other repositories,and base maps can be
other users can still verify the correct location of exchanged without any problem. Therefore, it
the note without having to rely on the accuracy is advisable for these projects to work together,
of the positioning technology. Furthermore, the but also to transcend boundaries and work with
positional accuracy of the location tag may evolve other initiatives that may benefit from geographic
with improving positioning technology, and de- data, for example, civic initiatives. Already, the
pending on what labels are defined. London Free Map, Geowiki, and OpenStreetMap
The communal character is emphasised as are cooperating to build a street network for
these location tags are shared in the community. London together.
The set of location tags is stored in the service for There are various sets of geographic data cre-
other people’s perusal. Upon tagging a note with ated as part of a collaborative mapping project. The
a location, other notes in the vicinity are scanned scanned and georeferenced base maps, the global
for their location tags, and presented as a list of satellite imagery, incorporated as background,
options for location tags, sorted according to can be made available, as well as the geographic
popularity. The creator selects a tag from the list vector data. For example, the London Free Map
or creates a new location tag. This service may be initiative hosts various BitTorrents to distribute
bootstrapped with predefined labels taken from free geographic datasets in a cost effective way.
gazetteers, the yellow pages, or tourist databases. Also, the individual GPS tracks collected during
the project can be made available as GPX files, a

1239
Collaborative Mapping and GIS

lightweight XML grammar to express GPS data, cheap GPS-devices rather than location-aware
or encoded using the geography markup language mobile technology.
(GML) published online as a Web feature service Since the information value and pertinence
(WFS). Also, for applications that do not require of collaborative mapping services is character-
the original vectors, raster images of the street ised not only by physical proximity, but also by
network can be published as a Web coverage social proximity, participation becomes another
service (WCS) that can be used as tiles to plot challenge when only a small group has access
other data on top of. to location-aware, mobile devices. Especially in
the case of applying collaborative mapping as a
Challenges and Opportunities form of community-based participatory planning
(Harris et al., 1996) or as a means to obtain data
Sawicki and Craig (1996) identify two contra- for the social positioning method as input for
dictory currents in information technology that urban planning (Ahas & Mark, 2005), one has to
equally apply to collaborative mapping services. be aware that it is only based on the space-time
On the one hand, there is a movement towards behaviour and mental map of only a small group
more accessibility by more users: online mapping within society, with specific social identifica-
has experienced a steady uptake, and more and tions. The dependency of collaborative mapping
more people turn to Web maps to find their way in services on social proximity may well expect a
the real world. Also, mobile phones and PDAs are wider participation, as indicated by the popularity
becoming new platforms for delivering geographic of online social networking tools, thus possibly
information. Since the mid-1990s, more than 50% pushing the uptake of LBS in general.
of the citizens of the developed countries have Current collaborative mapping initiatives rely
become owners of mobile phones, while mobile on GPS-devices to capture road network geometry
communication has also been remarkably suc- and open source GIS technology to serve maps
cessful in poorer countries (Ahas & Mark, 2005). and edit the raw data. The ubiquity of location-
On the other hand, there is a movement towards aware devices will increase over the next few
less accessibility. Although more users have ac- years. Not only will the pool of possible volunteers
cess to more data and information, the entry cost to participate in collaborative mapping projects
of the education, training, capital cost, time, and therefore expand accordingly, but the collabora-
experience of the users to take advantage of in- tive mapping process will also benefit from these
formation technology is becoming much greater technological advances as more raw positional
(Sawicki & Craig, 1996). As an example of this data is being captured with increasing accuracy
movement, Ahas and Mark (2005) write that the and in real time (Ahas & Mark, 2005). Finally,
uptake of LBS has slowed down, as the services the distributed nature will be reinforced, as more
proved too difficult for the public to obtain, and collaborative mapping services will derive spatial
users have difficulties imagining the potential of annotations from more repositories, and will have
spatial information. For collaborative mapping to take into account wider social networks when
initiatives to become successful, it is advisable establishing the pertinence of these annotations.
to start off with the lowest common technical
denominator, but to design and develop architec-
tures to cater for more advanced technology in the FUTURE TRENDS
future as location-aware devices become cheaper.
Already, projects such as the London Free Map Ben Russell (2003) suggests a “street server” as
use Web-based open source GIS technology and a vehicle for distributed collaborative mapping.

1240
Collaborative Mapping and GIS

Street servers can be thought of as a computer people being able to contribute geographic data
network with both private and public functions and access localised information. Various collab-
for a community of residents of a street. In the orative mapping projects are currently collecting
context of collaborative mapping, these street GPS tracks to derive street network data. The
servers operate as a GPS tracks database that can framework of steps described in this chapter was
be linked with other street servers to form a map based on experiences from these projects, and will
covering a wider area, as far as GPS tracks match stimulate the exchange of geographic information
up. Thus, mobile devices ask stationary street among these projects. Hopefully, collaborative
servers about their position and how to travel mapping will become a complimentary input for
around the neighborhood. This network approach corporate location-based services.
contrasts with the current practice of serving GPS
tracks from a single, centralized Web server.
As geographic information is created collab- REFERENCES
oratively, people upload their annotations to a
central repository, or store their annotations and Ahas, R., & Mark, Ü. (2005). Location-based
tracks on their personal Web sites, thus creating services – New challenges for planning and
a distributed framework for sharing geographic public administration? Futures, 37(6), 547–561.
information. With increasing storage capacity of doi:10.1016/j.futures.2004.10.012
mobile devices, people may even put their GPS Albert, S. (2004). Crossing the line between
tracks on their location-aware mobile devices and mapping and mapmaking: Artistic approaches to
share the data with their peers in their social net- cartography. Society of Cartographers Bulletin,
work, just as digital music files are shared online 38(1-2), 41–44.
or made available as podcasts. Ed Parsons, CTO
of the Ordnance Survey, spoke in an interview Bleecker, J. (2005). A design approach for the geo-
about the “napsterisation” of geographic informa- spatial Web. Retrieved February 2006, from http://
tion, claiming that “the experience of the music www.oreillynet.com/pub/a/network/2005/06/07/
industry may also apply to geographic informa- geospatialweb.html
tion” (Westell, 2003).
Buchwalter, C. (2005). Integrated interactive
Thus, as people roam around the city, their
marketing:Quantifying the evolution of online
mobile devices pick up blocks of geographic data
engagement. Retrieved from http://www.nielsen-
to locate themselves and navigate the area sur-
netratings.com/pr/wp_050401.pdf
rounding them. That geographic data represents
the street network and landmarks of the city blocks Dodson, S. (2005, April 7). Get mapping. The
where they find themselves, as derived from col- Guardian. Retrieved from http://www.guardian.
laboratively collected GPS tracks. co.uk/online/story/0,3605,1453293,00.html
Edelkamp, S., & Schrödl, S. (2003). Route plan-
ning and map inference with global positioning
CONCLUSION
traces. In R. Klein, H. W. Six, & L. Wegner (Eds.),
Lecure notes in computer science 2598. Com-
Collaborative mapping is the latest development
puter science in perspective: Essays dedicated to
in the democratisation of geographic informa-
Thomas Ottmann (pp. 128-151). Berlin: Springer.
tion. The proliferation of positioning technology,
location-awareness, online communities, and open
source GIS technology have paved the way for

1241
Collaborative Mapping and GIS

Espinoza, F., Persson, P., Sandin, A., & Nyström, Nielsen//NetRatings. (2002) Seven million Eu-
H. Cacciatore. E., & Bylund, M. (2001, September ropeans look to the Web for maps, guides &
30-October 2). GeoNotes: Social and navigational directions. Retrieved from http://www.nielsen-
aspects of location-based information systems. In netratings.com/pr/pr_020920_uk.pdf
Abowd, Brumitt, & Shafer (Eds.), Ubicomp 2001:
Plewe, B. (1997). GIS online: Information re-
Ubiquitous computing, International Conference,
trieval, mapping, and the Internet. Santa Fe:
Atlanta, Georgia (pp. 2-17). Berlin: Springer.
OnWord Press.
Harris, T. M., Weiner, D., Warner, T. A., &
Russell, B. (2003). Street servers – An outline of an
Levin, R. (1996). Pursuing social goals through
open source networked neigbourhood. Retrieved
particatory geographic information systems. Re-
from http://www.headmap.com/street-servers/
dressing South Africa’s historical political ecol-
street-servers.htm
ogy. In J. Pickles (Ed.), Ground truth. The social
implications of geographic information systems Sawicki, D. S., & Craig, W. J. (1996). The democ-
(pp. 196-222). New York: The Guildford Press. ratisation of data: Bridging the gap for community
groups. Journal of the American Planning As-
Mac Gillavry, E. (2004). Collaborative mapping:
sociation. American Planning Association, 62(4),
By the people, for the people. Society of Cartog-
512–523. doi:10.1080/01944369608975715
raphers Bulletin, 37(2), 43–45.
Shorter, C. (2005). Bicycle map builder white
McClellan, J. (2003, March 27). Get caught
paper. Retrieved from http://bikemap.openearth.
mapping. The Guardian. Retrieved from
com.au/docs/whitepaper/
http://technology.guardian.co.uk.online/sto-
ry/0,3605,922337,00.html Walsh, J. (2005). London free map. Retrieved from
http://uo.space.frot.org/?node=LondonFreeMap
McHaffie, P. H. (1995). Manufacturing metaphors:
Public cartography, the market, and democracy. Westell, S. (2003, July/August). The napsterisation
In Pickles, J. (Ed.), Ground truth. The social of geographic information. GI News.
implications of geographic information systems
(pp. 113–129). New York: The Guildford Press.

This work was previously published in Collaborative Geographic Information Systems, edited by S. Balram and S. Dragicevic,
pp. 103-120, copyright 2006 by IGI Publishing (an imprint of IGI Global).

1242
1243

Chapter 75
Provision of Web 2.0 Services
by Interoperable GIS-
Powered Local Administration
Portal Systems
Anastasios Tsitsanis
National Technical University of Athens, Greece

Sotirios Koussouris
National Technical University of Athens, Greece

Rob Peters
University of Amsterdam, The Netherlands

ABSTRACT
eGovernment for municipalities, a rather mature field when talking about the traditional and plain service
delivery to citizens, is nowadays seeking ways to provide more quality based services to the citizens and
to become more direct in terms of communication and interaction. In this direction, the most modern
implementations are already coming up with the utilisation of various Web 2.0 services in an attempt to
become more attractive to the users and to gain a larger user base. However, the introduction of Web 2.0
application to eGovernment portals does not seem to be the solution on its sole; in contrast, when these
solutions are accompanied with GIS powered technology, participation and utilisation of those portals
seems to reach the anticipated figures, as interoperable GIS systems are able to offer added-value and
more “personalised” services to users.

INTRODUCTION to the actual end users and “clients” of public


administrations; the citizens. The modern era of
During the last years, as a natural follow-up of eBusiness and electronic service delivery, has
the evolution of back office local administration touched, apart from the private sector, also the
information systems, significant progress has governmental services, not only at the levels of
been made towards electronic service delivery central governments, but also at the levels of
regional and public administrations, as the major-
DOI: 10.4018/978-1-4666-2038-4.ch075

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Provision of Web 2.0 Services by Interoperable GIS-Powered Local Administration Portal Systems

ity of the by citizens demanded services, lies in from basic application filling (e.g. pinpoint road
their jurisdiction. Therefore, all over this period, malfunctions) to highly sophisticated services
eGovernment was a keyword in the ICT agenda, as public debates, discussions and consultations
as this new uptrend market was quite open and (about issues such as spatial planning, green IT,
populated with billions of potential customers (the advertisement, etc) which can take advantage of
citizens), and huge investments were placed in GIS technology for presenting the as-is situation,
this domain, either by private firms or by govern- but also for presenting forecasting models based
ments (van Ark, Melka, Mulder, Timmer, & Ypma, on the decisions that will be drawn.
2003), in order to overcome the main obstacles
which had to do with system’s interoperability,
data harmonization and of course re-engineering BACKGROUND
of the public sector’s processes in order to achieve,
along with the maximization of the use of ICT, The rapid development in Information Technol-
lower lead times and higher service rates. ogy is nowadays opening new horizons regard-
Nowadays, with eGovernment being a rather ing the facilitation of everyday life of computer
mature field, where the issues of the past have users. As the Internet becomes a daily activity of
been solved up to an extent, public administrations people’s life, more and more organizations tend
and governments in general are seeking ways of to offer internet based services, replacing their
engaging their citizens more in their everyday traditional front-desk transactions. Following this
operation, in order to transform themselves trend, governmental organizations, such as local
from a single service system, to a collaborative governments, local administration and various
process environment, where citizens are not only public service offices are constantly launching
seen as clients, but are also taking decisions or eGovernment portals that are not only offering
can pinpoint various issues as a feedback. This information to the public, but offer electronic
stream is strengthen with the use of various flavoured services as well, by promoting interop-
Web 2.0 technologies, that are able to offer the erability with the present underlying systems that
advantages of building consensus amongst the are currently serving the public (Jaeger, 2003).
citizens of the same region (Nations). This is far Local Administration Entities, such as munici-
more empowered with the use of Geographic palities (Capgemini, 2006), are topping the list of
Information Systems (GIS), as the representation such organizations, as they have high figures of
of areas is a powerful way to attract citizens to everyday transactions numbers with citizens and
public administration platforms, as through the also possess a large number of “clients”, which
photorealistic representation not only can they use are naturally the inhabitants and the locally based
such platforms better, as they are offered with a enterprises. The benefits that rise from such a
life-like user interface, but they also realize that portal are the following (Charalabidis, Askounis,
the area shown is a part of their everyday life and Gionis, Lampathaki, & Metaxiotis, 2006):
therefore they tend to increase their interaction
with the various services offered. • Alternative service channels for the citi-
In this context, this chapter presents how GIS zens and enterprises such as Internet,
systems can interoperate with already established Mobile phone access and also voice access
public administration portals and with other exist- with the use of Interactive Voice Response
ing and globally accepted technologies, in order (IVR) systems.
to offer to the citizens, but also to the public • Optimization of the service levels as the
officers, various mash-up services that extend on-site presence becomes unnecessary.

1244
Provision of Web 2.0 Services by Interoperable GIS-Powered Local Administration Portal Systems

• Exclusive services for facilitating different • The European Interoperability Framework


groups such as disabled persons, the elder- (IDABC, 2008)
ly, the youth, etc. • The Greek eGovernment Interoperability
• Workflow optimization and automation in Framework (Information Society S.A.,
the highest possible degree by standardiz- 2008)
ing processes and documents
• Effective cooperation between different in- International Standards and state-of-the-art
house departments of local administration, Modeling Languages and technologies are used
by intergrading and interconnecting servic- in eGovernment portals as they preserve the fea-
es and processes that take part in common sibility, the accessibility, the accessibility and the
transactions security of the end product which is the portal.
Service provision can be dived into four differ-
Nowadays, municipal eGovernment Portals, ent levels, regarding the completion rate of each
which will offer automated services and would service based on electronic means. Those levels,
address the public, it is essential to comply with as also adopted by the European Commission
international standards and system design tech- (IDABC, 2006) are the following (Layne & Lee,
niques that should guarantee the end system’s 2001):
functionality and feasibility. This section presents
the most important frameworks and standards • Level 1-Information: This level contains
that should be followed when designing and only information about the corresponding
implementing eGovernment Portals (Pavlichev & service.
Garson, 2004). Various Frameworks are nowadays • Level 2-One-way interaction: A 2nd level
present, offering the guidelines that should be fol- service provides to the end user download-
lowed when designing systems and applications able material, such as.pdf forms or similar
seeking interoperability with underlying systems. documentations which have to be filled in
Those frameworks define in detail: by the user and handed to the correspond-
ing office.
• Certification Frameworks for Public • Level 3-Two way interaction: This level
Services web sites provides on line tools where a user can fill
• Interoperability structures for interconnect- in his request and initiate the process of the
ing systems and developing applications transaction. For completing the transac-
• Digital Authentication structures for the tion, the end user must appear at the ser-
end-users vice office and collect his receipt. Services
• Standardization Meta-Data and XML offered in level 3 require the authentication
Schemas for data entities of the user.
• Level 4-Transaction: Services in the 4th
The most known frameworks are the following: level are fully automated and the end user
gets the service’s output in electronic for-
• The UK Electronic Government mat, after imitating the transaction. Such
Interoperability Framework (Cabinet services include steps as authentication,
Office, 2005) decision, notification and delivery of
• The German SAGA (Bundesministerium receipt.
des Innern, 2003)

1245
Provision of Web 2.0 Services by Interoperable GIS-Powered Local Administration Portal Systems

System Architecture be-spoke components. The core platform is a


Content Management Platform System (CMS).
Most eGovernment portal architectures are based This system handles the presentation of the in-
on n-tier structures (data layer, application layers, formation and offers out-of-the-box tools for the
presentation layer). This particular approach al- implementation of services belonging to levels 1
lows the scalability of modules according to the and 2. Other systems are GIS systems, Workflow
portal needs, as the portal size and its functional Engines, Citizen Relationship Management Sys-
requirements will guide the total implementation. tems and IVR Systems for enabling voice access.
Moreover, the sorting of operations in distinc- The Content Management Platform the Citizen
tive levels in order to avoid unequal burdens of Relationship Management System (CRM) and
particular sources, or of the whole system is also the Workflow Engine all cooperate as the core
guaranteed by the 3-tier architecture chosen and transactional components of the system. The CRM
last but not least this architecture offers the option serves the user authorization and identification
of selective expandability with no changes in the and tracks down all user activities, namely from
system’s infrastructure. simple queries or questions asked, to the current
Figure 1 depicts a conceptualisation of the status of an online submitted application. This
logical architecture model (Koussouris, Tsitsanis, way, the end-user is constantly aware of his opens
Gionis, & Psarras, 2008) of a system containing issues and on the other hand, the administration
parameterisable Common Off-the Self Com- authority is able to generate the end-users profile
ponents (COTS), open source components and in order to target the most needed services, an

Figure 1. Logical architecture of the system

1246
Provision of Web 2.0 Services by Interoperable GIS-Powered Local Administration Portal Systems

important issue in eGovernment portals (Medja- proach. Those interfaces are connected with the
hed, Rezgui, Bouguettaya, & Ouzzani, 2003). wrappers that facilitate the information flow to
Finally, the Workflow Engine offers the flex- and from the portal with the use of Web Services
ibility of adding, replacing and updating working (Soon, Vijayalakshmi, & Nabil, 2002). This ap-
processes, without requiring great code-writing proach enables the interconnection of the different
efforts. This way the process flow is constantly subsystems and guarantees the high performance
managed and the system guarantees the flow of as only the required interfaces are used. In order
documents to the appropriate users even at heavy to implement this architecture, the following steps
loads, surpassing the operation of manual systems are needed:
in Local Administration (Vassilakis, Laskaridis,
Lepoura, Rouvas, & Georgiadis, 2003). • Discover the inputs and the outputs of the
back-office systems
Interoperability Layer • Modeling the data that is transferred within
the system using XML
Speaking of interoperable platforms, mechanisms • Protocol and Communication channels de-
are implied that will facilitate the transformation, velopment. This refers to the wrappers and
mapping and transportation of information data the web services development by defining
packets between the various systems in order to the communication ways with the back-
enable their direct, errorless and flawless commu- office systems (.Net Calls, RPC Calls,
nication and collaboration. Therefore, concepts as Intermediate Tables, Direct DB Calls) and
interoperability layers are essential for eGovern- the portal (XML Schemas, Service Calls)
ment Portals, if services of level 3 and 4 are to be • Definition of workflow and application
offered to the public. Such a portal will operate calls. The application call can either be
as a front-end interface for the internet users, in triggered by the portal (in case of a request
order to be served by the Local Administration submission) where the portal is initiating
back-office systems and among them the GIS
services that are offered though them. As those
systems are of different technologies and of dif- Figure 2. Interoperability layer architecture
ferent generation, interoperability is the key which
adds value to the eGovernment portal by achieving
the interconnection and the cooperation between
heterogeneous systems (Koussouris, Charalabidis,
Gionis, Tsitsanis, & John Psarras, 2007).
An interoperability layer is designed in such a
way that future enhancements are possible and that
system and platform independence is preserved.
It contains “Encapsulation Software Components
(Wrappers)” that are responsible for the data
transportation between the Back Office systems
and the Portal, through specific interfaces.
As depicted in Figure 2, from each back-office
system only the required input and output inter-
faces, that are brought to life during a transaction,
are selected in a purely “follow the service” ap-

1247
Provision of Web 2.0 Services by Interoperable GIS-Powered Local Administration Portal Systems

the call and waits for a reply but can also the use of GIS systems and their interoperation
be trigger by the back-office system itself and co-existence with the various Web 2.0 services
(in case of a notification for a fee payment) adds the winning argument to those systems, as it
• Development of Security and manages to literally lift degree of user satisfaction
Authentication mechanisms and user popularity, e.g. having users satisfied
constantly using the system and not letting it die
(Smith, 2007). This winning point if no other than
GIS POWERED WEB 2.0 SERVICES the visual representation of the region a citizen
FOR LOCAL ADMINISTRATION lives, a fact which not only makes the citizen more
familiar with the layout that is unfolded in front
In the recent year, the trend of Web 2.0 brought of him, but also motivates him to use the system
many new applications in the surface, which at as he can perceive at once that he is in a position
first glance looked like fancy implementations to call the shots on what is happening in his own
that targeted mostly the entertainment factor of area, his “home”.
the internet. Those were web based GIS imple- Having in mind all the above, public admin-
mentation like Google Maps or Live Maps, social istrations are starting to pull various attempts
networks like Facebook, MSN, and various other to utilise GIS-powered WEB 2.0 services and
plain services or mash-ups that became popular mash-ups in order to satisfy the needs of their
due to the ever-growing user bases of them. This citizens, to manage internal affairs and increase
fact got the attention of many governmental or- their productivity and to open-up their “closed”
ganisations, such as public administrations, which modus-operandi which is seen a one of the most
slowly started to adopt such solutions in the search important factors contributing to the huge problem
of establishing a more direct and straightforward of democratic deficit and of people not taking ac-
communication with the citizens and offering the tively part in the decision making process, which
advanced tools that are able to increase their online lead to misbelieve in politics and politicians as
experience and satisfy their user needs. well. In this context, a list of possibly services
More precisely, in the case of public admin- powered by GIS is presented below:
istrations one can see that WEB 2.0 service are
constantly used as they can easily not only satisfy • Interactive eGovernment platforms of-
the needs of the direct information flow and the fering public administration services:
bi-directional communication between the citizens These are systems that interact with citi-
and the administration, but are able to facilitate zens for trivial issues such as data form
the establishment of virtual online communities, filling with the use of the exact location as
quite closely to the idea of a neighbourhood, where extracted from the GIS infrastructure but
citizens are granted the possibility to interact with also for more advanced issues that require
each other, express their opinions publicly and the fastest possible reaction of the munici-
in more general terms operate within the virtual pality’s officials like in cases of damages
boundaries of these communities similarly to in vital infrastructures of the city such as
the “agora” situations that existed in past direct- water supply grid, vehicular grid, sewage
democracy political environments. All the above system as well as issues that deal with the
mentioned capabilities spring of course directly citizens’ security which maybe be in the ju-
of the well-know advantages of the modern In- risdiction of Municipal Police.
formation Technology and are also encountered • Traffic Management System: Systems
every day in various on-line initiatives, however using GIS and telematics infrastructures

1248
Provision of Web 2.0 Services by Interoperable GIS-Powered Local Administration Portal Systems

in order to advise drivers about the fastest sion making process and make use of Web
and optimum routes throughout the border 2.0 technologies and tools (forums, social
of the city, targeting the highest possible networks, information provision, RSS
convenience of drivers and the protection feeds, audio and video streaming tools,
of the environment through the reduce legislation data, ePetitions, ePolls) in order
of CO2 emitted by the vehicles that pass to give citizens the opportunity to actively
through the city. participate in the various steps of the local
• GIS services for internal general admin- decision making processes and make their
istration use: Those systems are able to voice heard by the city’s officials.
operate using multiple layers containing • Entrepreneurship and general informa-
geographical information of many kinds tion portals powered by GIS technolo-
such as: fire zones, earthquake zones, spa- gies: These systems refer to GIS supported
tial data, building data, digital background platforms where enterprises have the op-
of the area, satellite photos and legal data portunity to pinpoint their presence to-
connected to several points of the GIS map. gether with their services and needs and in
• Green Areas Management System: such a way virtual marketplaces are creat-
These systems give the opportunity to the ed that enable the citizens to have an online
city’s officials to manage in an efficient yellow-page map of the business develop-
way the green areas of the city and include ments, the services and possibly the career
them in their spatial planning programs opportunities in their region.
(creation of new green areas (e.g. parks) in
the city, recreation and exploitation of un- The above mentioned list is of course end-
tended city locations, etc). These systems less, as modern technology is able to adopt GIS
are based on GIS solutions able to map in solutions in almost any case, and the advantages
detail the present situation in a public ad- of such breeding are quite visible from the few
ministration and offers the pubic servants examples discussed before, as users are treated
an in-a-glance overview of how green ar- with a familiar environment and this contributes
eas are allocated and how they should be greatly to their willingness to utilise the systems
managed. Moreover, these systems can be at a higher rate. Based on this discussion, the next
accompanied by sophisticated artificial in- sections of the chapter is devoted to the show-
telligence forecasting algorithms in order case of a real-life application of a GIS powered
to visualise what if scenarios based on the portal that was implemented in the Netherlands
data that correspond to each region. in order to help public administration to engage
• Crisis management systems: These refer stakeholder in the spatial planning process of the
to systems utilized as high-level manage- region of Flevoland.
ment tools that are able to elaborate several
civil protection scenarios in cases emer- Using GIS-Powered Portals
gencies (fire, earthquake, flood, ground for Conflict Resolutions and
frost and broiling weather) for achieving Public Consultation
the highest possible degree of protection of
the citizens in a fast and efficient way. System Architecture and Features
• eDemocracy GIS powered platforms:
Those are platforms targeting the enhance- Legal Atlas is a software application which in
ment of citizens’ participation to the deci- the first steps of prototyping included navigation

1249
Provision of Web 2.0 Services by Interoperable GIS-Powered Local Administration Portal Systems

techniques that helped to navigate through four user would be able to identify all objects or map
layers of laws and regulations: European, national, area’s where this regulation applies by means of
regional and municipal. a visualisation mechanism.
Early results from using and demonstrating While this may seem a small extension of the
the system in the Netherlands (Wilson & Peters, first version, the reader should realise that there
2004) showed not only high user satisfaction but is a big difference between the question: “what
also gave the confidence that this type of solution will be allowed here? and: “where will this be
would be feasible in other Europe’s countries. As allowed?”. The second inverse question demands
a natural consequence, a system named Legal At- a complete semantic definition of the world and
las II was implemented. The Legal Atlas systems complete legal coverage (Boer, van Engers,
should enable the ultimate transparency of local Peters, & Winkels, 2007) of that world as previ-
regulations in the sense that the citizen could ously described. With additional legal reasoning
ask “where can I do this?”, instead of: “can I do (Deontic) arithmetic, such a functionality (Boer,
this in place such and such?” (Wilson & Peters, van Engers, & Winkels, Using Ontologies for
2004). Such an application consequently requires Comparing and Harmonizing Legislation, 2003)
a unique coding for all local and regional legal enabled questions like: return all contours on the
functionalities affecting citizens and a mechanism map where regulation “X” and “Y” are both ap-
for semantically mapping (Boer, Hoekstra, Win- plicable, and with exception of rule “Z”.
kels, van Engers, & Willaert, 2002) between the After some prototype evaluation actions, a
legal terms and legal procedures affecting those real-life prototype was implemented using IMRO
required functionalities of citizens. In that way the in combination with the GEMET (“GEneral Mul-
client of the service could view his case in the light tilingual Environmental Thesaurus”). GEMET is
of aggregated data and monitor the consequences increasingly relevant because of INSPIRE. The
of decisions and opposition. This brings a new regional layers of data and most of the content
perspective on maintainability and consistency were provided by The Province of Flevoland
of the legal system as a whole. These are of great who maintains a number of databases (Thematic
importance, since regulations will change over Spatial Data Infrastructure) about the economy, the
time. The Legal Atlas makes use of the IMRO environment, our cultural heritage and the public
(information model spatial planning) XML/GML infrastructure. Legal Atlas III is the evolution of
(Open Geospacial Consortium) coding scheme the above named systems and for the purpose of
to enable the user to query the underlying legal European transferability Google maps technology
system using very detailed coloured maps repre- is used next to GIS technology to support “contour
senting legal constraints and possibilities (supports editing” by visitors of the website. The visitor can
explanation and understanding by users). indicate an area by drawing a line on the map to
The succeeding Legal Atlas II application was discuss and to support his case with documents,
based upon a (Winkels & Hupkes, 2007) open research reports, legislation and plans that can be
and extensible architecture using semantic web uploaded, stored and downloaded for that drawn
technologies and was designed using the expe- area. The interface of the prototype shown above
riences from the first version. The codes were is linked to the database of the regional authority
translated to GML syntax rules which helps to Flevoland. All map layers available can be pro-
create a more flexible and maintainable codifica- jected on this Google map screen. This way the
tion system. The main challenge was to extend conflict resolution system will maintain dynamic
the retrieval possibilities of law from text to map, input from legal constraints and map layer updates.
in other words; by clicking on a regulation the

1250
Provision of Web 2.0 Services by Interoperable GIS-Powered Local Administration Portal Systems

Regarding the conflict annotation engine, it OPTIONAL {


was mostly based on Semantic Web Technology ?keyword skos:inScheme ?scheme.
(Boer, van Engers, Peters, & Winkels, 2007). }
The knowledge models about the Cross Border }
Deliberation models and the domain knowledge
of the working scenarios are all described with The return is a gigantic list of all the concepts
RDF/OWL (W3C, 2004). The reason for this that can be used for annotation. Such a huge list
choice was that RDF/OWL is able to inference might not be convenient in a user interface. There-
and reasoning with these models. To publish this fore it is better to tone this list a bit down with a
information as a service the OpenRDF Sesame practical list. Pruning this list down to a domain
server (OpenRDF.org) is used, which has a web is one way to limit the amount of concepts. The
accessible SPARQL-endpoint which is an access following example shows the SPARQL query for
point to which SPARQL (W3C, 2008) queries a specific domain, namely the IMRO2006 SKOS
can be sent. The RDF that is stored within the (Geonovum, 2006) vocabulary.
OpenRDF Sesame server is processed with
OWLIM (Ontotext AD, 2009), a high-performance SELECT ?keyword ?label
semantic repository which is packaged as a Stor- WHERE {
age and Inference Layer (SAIL) for the Sesame ?keyword a skos:Concept.
RDF database. It reasons about the RDF data and OPTIONAL {
propagates this by means of rule-entailment. The ?keyword rdfs:label ?label.
SPARQL endpoint is used to fill up the Legal Atlas FILTER (lang(?label) = ‘nl’)
III with information. The Legal Atlas III can be }
seen as an interface for the OpenRDF Sesame ?keyword skos:inScheme <http://
server, and the SPARQL-endpoint is the interface www.ravi.nl/imro2006/skos#imro2006>.
between them. }
The SPARQL queries are based on the sche-
mata of the RDF/OWL models. This means that Figure 3 describes the architecture of the
they are independent on the actual content. This interaction between the municipal portal and
ensures that different content should be anno- the Annotation engine. The portal containing
tated with the RDF/OWL models to makes sure Legal Atlas gets its information by sending the
that the SPARQL queries are able to retrieve the appropriate queries to the SPARQL Endpoint.
content. Here is a simple query that retrieves all The SPARQL endpoint is an access point to the
the concepts with which content can be tagged OpenRDF Sesame server. The OWLIM is an infer-
with. The same concepts will also be used for ence layer within the OpenRDF Sesame server.
searching the OpenRDF Sesame server for the This is filled with the Cross Border Deliberation
appropriate content. models and the domain models and the content
that is part of the public administration portal. The
SELECT ?keyword ?label ?scheme Cross border deliberation models are schematic
WHERE { representations of the decision process around a
?keyword a skos:Concept. theme like spatial planning or enforcement agree-
OPTIONAL { ments. The standardised schema’s are required to
?keyword rdfs:label ?label. facilitate cross border negotiations and to transfer
FILTER (lang(?label) = ‘nl’) a formal decision process into (automated) legal
} modelling and constraint analyses.

1251
Provision of Web 2.0 Services by Interoperable GIS-Powered Local Administration Portal Systems

Figure 3. Architecture of the annotation engine of a map based conflict resolution portal

The portal also provides an interface for add- Problems with INSPIRE legal Mapping
ing data. Which adds RDF/OWL to the models. for Dutch regional authorities
These are then processed by the OWLIM module
and the result is propagated to the Portal by the The approach of the example of using a GIS
same SPARQL endpoint. powered portal in Flevoland is based upon stan-
An important part of the interface is the geo- dardized legal knowledge representations and
graphical interface with Google Earth and Google maps and allows the stakeholders involved to cope
maps. The connection between the geographical with legal pluralism and the complexity caused
data and the metadata that describes them can by the interaction between the building blocks
be seen in Figure 4. The Map has a number of of European laws and local regulations. In this
different regions which are of some type of land context, this implementation helps users to have
use. These regions have a particular geographical access to the regulations relevant to the problem
representations in the form of layers with polygons. at hand, in a concise and comprehensible way.
The map-based interface providing access to the
applicable regulations offers a powerful mean to
ensure usability and client satisfaction. Before

Figure 4. The interaction between the maps, its regions with particular landuse and the geographical
representation

1252
Provision of Web 2.0 Services by Interoperable GIS-Powered Local Administration Portal Systems

explaining in more detail the various steps, it protected the Regional authority will lose those
should be stressed that although most of the tests “sites” it wants to protect. The latter occurs be-
have been conducted in the domain of spatial cause of the hierarchical legal effect of European
planning our approach is also applicable in other directives. “Lesser” legal regimes become less
legal domains. All laws have a jurisdiction and are useful to oppose against economic interests. The
consequently location bound. While a legislative Ecological main infrastructure in the Netherlands,
maps approach has advantages in the domain of that was meant to become the vehicle to ensure
spatial planning it is not limited to that domain. less fragmentation of green area’s in an urban
Spatial planning is an area where many differ- country, may well become negligible in it’s effect
ent interests have to be weighed and balanced, given the greater impact of Natura2000 area’s.
like many other area’s. When proposing realistic The Dutch provinces have several examples of
solutions one should also take into account the non-intended effects. The regulation of salt and
massive volumes of existing regulations putting sweat water balance, for example, indicated that
constraints on the possible alternative solutions one Province was obliged to subsidise farmers for
to the problem at hand. lack of sweat water while those farmers had one
INSPIRE (INSPIRE, 2007) is the European of the best crops in Europe. The Dutch regions
wide standardisation effort to model the envi- do not want to lose this need for flexibility due
ronmental policies and vulnerable area’s in such to inflexible standardisation. These findings are
a way that monitoring and enforcement will relevant for our research on Legal Atlas.
become much easier. The goal is to meet Kyoto It appears that many times the visualisation of
targets and the European parliament is serious map layers for environmental purposes, like with
about it. The enforcement is likely to spark a INSPIRE, creates a legal challenge; the feeding
range of whole range law suits, since all NGO’s time of Goose in the Netherlands is six months at
will have immediate access to any building effort most. Then they fly back to the North. The map
or economic activity that threatens vulnerable showing area’s that are protected for the sake of
ecologies. It will be hard to change anything in those goose should convey the temporal factor
Spatial Europe without serious deliberations. Re- which is unusual in legal constraints mapping.
garding INSPIRE (EUROSTAT & DGENV, K.U. The biggest problem with European standar-
Leuven, 2003), the Dutch Provinces have carried disation efforts like INSPIRE for Dutch regions
out an impact analysis, which mentions several are those that concern the harmonisation of cal-
ambiguities with the new infrastructure, starting culating methods behind the layers. The Dutch
with the definition of “Protected sites” according regional authorities are not sure that all envi-
to the International Union for the Conservation of ronmental indicators are calculated in the same
Nature: “A Protected Site is an area of land and/ way in Europe. This envisioned level of variety
or sea especially dedicated to the protection and requires more explanation, flexibility and enough
maintenance of biological diversity, and of natural administrative room for local interpretation. Even
and associated cultural resources, and managed within the same country it takes time and effort to
through legal or other effective means”. understand the different ways to calculate sound
The scope of this definition does raise a dis- level measurements, visualisations and mappings.
cussion among policy advisors how to apply the Water quality differs with temperature. Ecological
INSPIRE directive wisely. If too many “protected complexity defies simple arithmetic’s. There is a
sites” are defined it tends to weaken enforcement relationship between legislation and methods of
possibilities and reporting to Europe may become calculation; changes in calculation methods at
tedious. If too few area’s are being identified as the European level may cause changes in Dutch

1253
Provision of Web 2.0 Services by Interoperable GIS-Powered Local Administration Portal Systems

and regional legislation. INSPIRE may well be people into the court room needlessly and too early.
one of the directives with the highest impact on The groups involved in protection of birds and
regional authorities. those involved in building the infrastructure have
been in court for the last 20 years. The chairman
Conflict Resolution and the Use of the bird protection stated that the courtroom
Case INSPIRE in Flevoland does not help “his” food chain to develop in a
natural way. Pressure groups require more insight
For the real life trial of Flevoland, it was chosen in each other’s negotiation space. Another reason
to take the example of INPIRE annex III layer to have dynamic maps fuelled by dynamic con-
11 and 18: areas with legal constraints, regulated straint analyses is the fact that no protected area
areas and habitat/Natura2000 areas. Figure 5 de- is fixed in the way some maps tend to indicate.
picts the Natura2000 map layer as it is now part INSPIRE has an explicit limitation to its scope
of the information infrastructure of the region. as support mechanism for transparency: it states
The map is represented by the vertical stripes in article 13,1: “information services should not
covering the entire Markermeer and Ijssel Lake. be published by means of INSPIRE if this action
In reality this is not the case. The Lake Marker- jeopardizes the very environment it tries to pro-
meer requires numerous ecological measurements tect. We argue here that map-layers represented
to save the area from further deterioration. The in static ways powered by static legal- and policy
water management is suffering for slib that cre- constraints do exactly that. They do not support
ates uninhabitable murky waters. Everyone agrees conflict resolution and negotiation but rather sug-
something has to be done. At the same time the gest more inflexibility and legal rigidity.
already stressed ecological balance of the region Feed back on the tests with the Flevoland
is even more threatened by the obligation to host regional development plan prove that questions
another 60.000 houses with accompanying rec- like: return all contours on the map that fulfils the
reational and transport infrastructure. Calling the legal constraints “X” and “Y”, but not “Z” are
region “locked under Natura2000” as indicated by indeed answerable. How does one provide such
the map layer is a simplification that would bring type of opportunity finding for the user in a mean-

Figure 5. Example of Inspire type map layers

1254
Provision of Web 2.0 Services by Interoperable GIS-Powered Local Administration Portal Systems

ingful representation?. The answer may have been and the surrounding area “lights up” problem
developed already by Simcity game developers areas (striped red, in this case a 5000 Hectare
who created manoeuvrability using a cursor and swamp that is envisioned to be the ecological
“tiles” with fixed business rules in 1985 when motor for the lake) and areas up for negotiation
Simcity was still called “Micropolis” (Koster & (orange). When the cursor is dragged further into
Wright, 2004). Simcity allowed the virtual the protected area it turns red to indicate the legal
Mayor of a city to plan different types of objects absolute negative. The dragging turned out to be
on a tiled map. The objects could be hotels, in- crucial since it resembles human negotiations
dustry, infrastructure or housing, for example. about proximity and the likelihood of damage
The “Mayor” was informed if his planned object to the ecology in a user friendly way. Dragging
did fit with the designated area because those tiles affords negotiations of NGO’s with conflicting
would lit up in red or green if the business rules interests in line with Gibsons theory of usability
defining those tiles in the database would match (Gibson, 1979).
the object dragged by the Mayor ‘s cursor. A
housing block could not be set on a highway. Early Results
The research in 2009 indicated that the most
ergonomical lay-out for a use case would be, that The upload and retrieval functionality has been
the user drags the cursor (Blue, for op sailing area) judged to be part of the Atlas behind the decision

Figure 6. Simcity approach to conflict resolution and opportunity finding

1255
Provision of Web 2.0 Services by Interoperable GIS-Powered Local Administration Portal Systems

process. The Legal annotation engine is currently Atlas as described has been evaluated by 25
being embedded behind the Spatial context (in real life stakeholders in the redesign process of
Dutch: “Omgevingsplan “) of Flevoland. Ijmeer/Markermeer. Early results indicate that
The interesting difference between a screen they regard the system as a positive contribution
showing map layers already there and those that to the decision making process. The critics state
“turn up” while moving a qualified cursor (like that the systems is not complete and that transpar-
the one seeking space for open sailing area’s) is ency is not always desirable. Most participants in
the affordance of opportunity finding in contrast the evaluation state that it would definitely help
with the annotation of an area that is “ locked up”. stakeholders to inform their constituencies about
The regional authorities call this functionality a the interests at stake. In international context the
“seeking area”. They have created the unusual system is criticised for implying a naïve perception
legal term “seeking area” to obtain legal degrees of regulatory systems. Different cultures would
of freedom in development plans that do not occur treat legal constrains in different ways.
with fixed parameterisation.
The map layer turning up while approaching
with the qualified cursor simulates the “seeking CONCLUSION
area” degree of freedom the authorities require to
state intentions with a certain level of flexibility, As discussed in this chapter, the new emerging
like moving 2 km to the right or left if this helps trends of the Internet, like Web 2.0 services are
with the original ecological goals. In Figure 7 constantly gaining space in the eGovernment
a bordered “seeking area” is indicating a new agenda, as municipalities, and generally public
island that is being planned. The Red Polygon administration entities, have realised that these
can be moved and changes colour based on the new technologies are supported and followed
legal business rules set. This is called the Simcity by a massive enthusiastic audience, which can,
approach of Legal Atlas, as it allows for playing if treated carefully, increase the degree of par-
around until a solution is found fit for purposes ticipation of eGovenrment platforms. In parallel,
which is similar to the Simcity game. The Legal through the last years, the use of GIS systems and

Figure 7. The“seeking area” near Lelystad, Flevoland

1256
Provision of Web 2.0 Services by Interoperable GIS-Powered Local Administration Portal Systems

more specifically GIS powered applications are REFERENCES


as well a very useful tool for visualising various
aspects from the everyday life and help users to W3C. (2004, 2 10). OWL Web Ontology Language.
conceptualise in visual terms the data they are Retrieved 5 18, 2009 from http://www.w3.org/
provided with. TR/owl-features/
The most important result that comes out W3C. (2008, 1 15). SPARQL Query Language
from the use of GIS data and infrastructures in for RDF. Retrieved 5 12, 2009 from http://www.
cooperation with web 2.0 tools, is that users seem w3.org/TR/rdf-sparql-query/
to find especially fascinating the fact that they
can interact with the system, the administration Boer, A., van Engers, T., Peters, R., & Winkels,
officers and with each other, not only for enter- R. (2007). Separating Law from Geography in
tainment reasons, but for real life events as well, GIS-based eGovernment Services. Artificial
targeting the improvement of their everyday life, Intelligence and Law, 15(1), 49–76. doi:10.1007/
their active participation in the decision making s10506-007-9042-4
processes, the most effective management of di-
Boer, A., van Engers, T., & Winkels, R. (2003).
sasters that can affect human life, the promotion
Using Ontologies for Comparing and Harmoniz-
of environmental policies, the reinforcement of
ing Legislation. Proceedings of the International
entrepreneurship in a location based perspective
Conference on Artificial Intelligence and Law
and last, but not least, the structured planning of
(ICAIL) (pp. 60-69). Edinburgh: ACM Press.
strategic actions for local public administrations
with the participation of elected representatives, Boer, A., Hoekstra, R., Winkels, R., van Engers,
public administration officers, NGOs, citizens and T., & Willaert, F. (2002). METAlex: Legislation in
other stakeholders involved in the overall process. XML. Legal Knowledge and Information Systems
As such, GIS powered portals are able to (Jurix 2002), 1-10.
provide information services but also to visualise
Bundesministerium des Innern. (2003). SAGA:
the application field of various issues, directives
Standards and Architectures for eGovernment
and various decisions. To this fact, the interactive
Applications Version 2.0 (Vol. 59). KBSt Publi-
character of new GIS applications plays a major
cation Series.
role, as the different layers are able to interoperate
with the underlying infrastructures of the public Cabinet Office. (2005, 3 18). e-GIF. Retrieved 5
administrations, and through mash up technologies 12, 2009 from http://www.govtalk.gov.uk/sche-
to present useful and interactive functionalities, masstandards/egif.asp
offering to the user a more user-friendly, familiar
Capgemini. (2006, 6). Online Availability of Web
and enhanced online experience. Summing up,
Public Services: How is Europe progressing? Web
one can clearly see that the use of GIS systems
based survey on Electronic Public Services. Report
alongside public administration portals is no longer
of the Sixth Measurement. Retrieved 5 12, 2009
a fairytale, but a reality and the advantages that
from http://ec.europa.eu/information_society/
these hybrid solutions offer are highly respected
eeurope/i2010/docs/benchmarking/online_avail-
by users, as they are presented with the best
ability_2006.pdf
possible interface which targets exactly on their
living regions.

1257
Provision of Web 2.0 Services by Interoperable GIS-Powered Local Administration Portal Systems

Charalabidis, Y., Askounis, D., Gionis, G., Lam- Koussouris, S., Charalabidis, Y., Gionis, G., Ts-
pathaki, F., & Metaxiotis, K. (2006). Organizing itsanis, T., & John Psarras, J. (2007). Building a
Municipal eGovernment Systems: A Multi-facet Local Administration Services Portal for Citizens
Taxonomy of e-Services for Citizens and Busi- and Businesses: Service Composition, Archi-
nesses. In M. Wimmer, H. Scholl, Å. Grönlund, tecture and Back-Office Interoperability Issues.
& K. Andersen (Ed.), Lecture Notes in Computer Lecture Notes in Computer Science - Electronic
Science, 5th International Conference of Elec- Government, 4656, 80-91.
tronic Government. (vol. 4084, pp. 195-206).
Koussouris, S., Tsitsanis, A., Gionis, G., & Psarras,
Krakow: Springer.
J. (2008). Designing Generic Municipal Services
EUROSTAT & DGENV, K.U. Leuven. (2003). Process Models towards eGovernment Interoper-
EC-INSPIRE: Spatial Data Infrastructures in ability Infrastructures. The Electronic Journal for
Europe: State of play. E-Commerce Tools & Applications, 2 (3).
Geonovum. (2006). IMRO: Informatiemodel Rui- Layne, K., & Lee, J. (2001). Developing fully func-
mtelijke Ordening (Information model for spatial tional EGovernment: A four stage Model Govern-
planning). Retrieved 5 20, 2009 from http://www. ment. Government Information Quarterly, 18(2),
geonovum.nl/informatiemodellen/imro-2006/ 122–136. doi:10.1016/S0740-624X(01)00066-1
Gibson, J. (1979). The Ecological Approach to Medjahed, B., Rezgui, A., Bouguettaya, A., &
Visual Perception. Boston: Houghton Mifflin. Ouzzani, M. (2003). Infrastructure for EGovern-
ment Web Services. IEEE Internet Computing, 7,
IDABC. (2006). eGovernment in the Member
58–65. doi:10.1109/MIC.2003.1167340
States of the European Union.
Nations, U. UN eGovernment survey 2008: from
IDABC. (2008, 11). European Interoperability
eGovernment to connected governance. United
Framework for pan-European eGovernment ser-
Nations. Department of Economic and Social
vices. Retrieved 5 12, 2009 from http://ec.europa.
Affairs. Division for Public Administration and
eu/idabc/en/document/2319
Development Management Administration.
Information Society S.A. (2008, 10). Greek eGov-
Ontotext, A. D. (2009). OWLIM Semantic Re-
ernment Interoperability Framework. Retrieved 5
pository. Retrieved 5 12, 2009 from http://www.
12, 2009 from http://www.e-gif.gov.gr/
ontotext.com/owlim/
INSPIRE. (2007). INSPIRE DIRECTIVE. Re-
Open Geospacial Consortium. (n.d.). Geography
trieved from INSPIRE DIRECTIVE: http://
Markup Language. Retrieved 5 12, 2009 from
inspire.jrc.ec.europa.eu/
http://www.opengeospatial.org/standards/gml
Jaeger, P. (2003). The endless wire: EGovernment
OpenRDF.org. (n.d.). OpenRDF.org. Retrieved 5
as global phenomenon. Government Informa-
18, 2009 from http://www.openrdf.org/
tion Quarterly, 20(4), 323–331. doi:10.1016/j.
giq.2003.08.003 Pavlichev, A., & Garson, G. (2004). Digital Gov-
ernment: Principles and Best Practices. Hershey,
Koster, R., & Wright, W. (2004). A Theory of Fun
PA: IDEA Group Publishing.
for Game Design. Paraglyph Press.

1258
Provision of Web 2.0 Services by Interoperable GIS-Powered Local Administration Portal Systems

Smith, A. (2007). Achieving competitive advan- Vassilakis, C., Laskaridis, G., Lepoura, G., Rou-
tage through the use of geographic information vas, S., & Georgiadis, P. (2003). A framework for
systems. Electronic Government, an International managing the lifecycle of transactional eGovern-
Journal, 4 (3), 326-344. ment services. Telematics and Informatics, 20(4),
315–329. doi:10.1016/S0736-5853(03)00011-X
Soon, A., Vijayalakshmi, A., & Nabil, A. (2002).
Dynamic composition of workflows for custom- Wilson, F., & Peters, R. (2004). Mapping the law:
ized eGovernment service delivery. ACM Inter- Knowledge support for business development
national Conference Proceeding Series, 129. Los enquiry. Proceedings of the eChallenges 2004
Angeles, California. Conference. Vienna.
van Ark, B., Melka, J., Mulder, N., Timmer, M., Winkels, R., & Hupkes, E. (2007). Legal Atlas: Ac-
& Ypma, G. (2003). ICT Investments and Growth cess to Legal Sources through Maps. Proceedings
Accounts for the European Union. Groningen on the 11th International Conference on Artificial
Growth and Development Centre September. Intelligence and Law (ICAIL 2007). Amsterdam:
AAIL, ACM.

This work was previously published in Interoperability in Digital Public Services and Administration: Bridging E-Government
and E-Business edited by Yannis Charalabidis, pp. 201-217, copyright 2011 by Information Science Reference (an imprint of
IGI Global).

1259
1260

Chapter 76
Sharing Environmental
Data through GEOSS
Gregory Giuliani Hy Dao
University of Geneva and UNEP, Switzerland University of Geneva and UNEP, Switzerland

Nicolas Ray Jaap Van Woerden


University of Geneva and UNEP, Switzerland UNEP, Switzerland

Stefan Schwarzer Ron Witt


UNEP, Switzerland UNEP, Switzerland

Andrea De Bono Martin Beniston


University of Geneva and UNEP, Switzerland University of Geneva, Switzerland

Pascal Peduzzi Anthony Lehmann


UNEP, Switzerland University of Geneva and UNEP, Switzerland

ABSTRACT
Understanding the complexity of earth-system processes is crucial to convey improved information on the
environment to decision-makers and the general public. Addressing this need by sharing environmental
data is challenging because it requires a common agreed framework that allows easy and seamless in-
tegration of data from different sources. In this regard, the Global Earth Observation System of Systems
(GEOSS) portends major benefits through various sharing mechanisms and by giving access to services
that could be linked together to process and generate new understandable knowledge and information.
Various United Nations projects could greatly benefit from the GEOSS approach.

INTRODUCTION have expanded to take on a remarkable place in


our society, allowing us to access an enormous
Today we are living in a globalized world with and continuous flow of information.
rapidly evolving processes including climate In the last 30 years, the availability of geospatial
change, population growth or environmental data has grown dramatically following the evolu-
degradation. In parallel, means of communication tion of communication technologies supported
by the rapid development of spatial data capture
DOI: 10.4018/978-1-4666-2038-4.ch076 means such as remote sensing imagery, sensors and

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Sharing Environmental Data through GEOSS

GPS (Philips, Williamson, & Ezigbalike, 1999). the interest of all citizens (Ryttersgaard, 2001),
One of the challenges we are facing today is to this huge amount of geospatial data is stored in
make sense of this vast amount of data in order to different places, by different organizations and
turn them into understandable knowledge (Gore, the vast majority of these data are not being used
1998). Concrete actions can be taken only on the as effectively as they should. In consequence,
basis of knowledge and understanding, but often a framework allowing one to discover, access,
we know too little about the state of our planet’s publish, share, maintain and integrate geospatial
environment to take informed and sound decisions data appears to be essential. Such a framework is
about how it should be managed. commonly known as a Spatial Data Infrastructure
Our planet is a multi-dimensional system made (SDI).
of complex interactions highly interconnected Different initiatives at the regional and global
and continuously evolving at many spatial and levels are influencing and promoting the creation of
temporal scales (GEO secretariat, 2007b). This SDIs allowing data providers to share and publish
means that to understand these interactions, we their data in an interoperable manner. These initia-
need to gather and integrate different sets of data tives coordinate actions that promote awareness
about physical, chemical and biological systems. and implementation of complementary policies,
Altogether, these sets of data constitute environ- common standards and effective mechanisms for
mental data sets or data related to the environment. the development and availability of interoperable
These data are often georeferenced, describing a geospatial data and technologies to support deci-
geographical location through a set of attributes sion making at all scales for multiple purposes.
and thus could be understood as being part of geo- These initiatives are related to data access, harmo-
spatial data. An environmental data set is seldom nization, standardization, interoperability, seam-
interesting in itself, but rather displays its full less integration and services. Such an initiative is
information potential when used in conjunction the Global Earth Observation System of Systems
with other data sets, allowing one to monitor and (GEOSS) which is a worldwide voluntary effort,
assess the actual status of the global, regional or coordinated by the Group on Earth Observation
local environments, to discover complex relation- secretariat, to connect already existing SDIs and
ships between them and to model future changes. Earth Observation infrastructures. GEOSS is
In 1998, the former vice-president of the United foreseen to act as a gateway between producers
States, Al Gore, presented his visionary concept of of geospatial data and end users, with the aim of
a Digital Earth (Gore, 1998), a representation of enhancing the relevance of Earth observations
the Earth embedding a vast amount of geospatial for the global issues and offering public access to
data and allowing to make better sense of it. To comprehensive information and analyses on the
achieve this vision, Gore highlighted the need for environment (GEO secretariat, 2005, 2007a). The
a collaborative effort (from government, indus- GEOSS Common Infrastructure (GCI) provides
try, academia and citizens) and pointed out the core capabilities that allow users to search, access
different technologies required: computational and use data, information, tools and services, and
power, mass storage, satellite imagery, broadband is made of five components: GEO portal (web
network, interoperability and metadata. portal to access GEOSS and search registries),
Despite the fact that administrations and GEOSS clearinghouse (connects the different
governments are recognizing that geospatial data components), GEOSS components and services
are an important component of an information registry (catalogue of services and components),
infrastructure (such as e-governement) that needs GEOSS standards and interoperability registry
to be efficiently coordinated and managed for (catalogue of standards to use allowing users to set

1261
Sharing Environmental Data through GEOSS

up and configure an interoperable system), and a in isolation, which made it difficult to easily dis-
best practices wiki (offers a single space to share, cover, access and use the data content of these sys-
discuss, propose and exchange ideas and best tems due to incompatibilities and inconsistencies
practices within the community). These compo- of formats and data models (Bernard & Craglia,
nents are dependent on the voluntary contributions 2005). In addition, there is typically insufficient
of members and participating organizations. To data exchange among different stakeholders,
support the nine defined Societal Benefit Areas which is partially due to differing data policies.
(SBAs) (disasters, health, energy, climate, water, Other important impediments to the flow of data
weather, ecosystems, agriculture, biodiversity), are the delays in accessing data that prevent timely
the mechanisms for data sharing and dissemination use of information, duplication and redundancy of
are presented in a 10-year Implementation Plan data acquisition, potential high costs associated
Reference Document (GEO secretariat, 2005) pro- with data creation and access, and unclear access
viding data sharing principles that any volunteer rights and licensing policies (GEO secretariat,
member must endorse. The key element o share 2005). Altogether, these difficulties lead to a
data through GEOSS is to agree on “interoper- fragmentation of data sources, impeding their
ability arrangements” (GEO secretariat, 2007a) effective and efficient use, requiring much more
allowing different components of the system to time than necessary for data collection (Open
communicate with each other. Geospatial Consortium, 2004). All the previous
Turning data into understandable knowledge considerations highlight the growing need to share
requires that data coming from different sources data in an interoperable way and to ensure that
be easily and seamlessly integrated. With the data are easily accessible and discoverable, so that
capabilities offered by standards like the one they can be used as often and widely as possible
proposed by the Open Geospatial Consortium (Arzberger et al., 2004). Moreover, the adoption
(OGC), geospatial community can not only dis- of the Agenda 21 resolution, a United Nations
cover, access and publish interoperable geospatial initiative proposing a set of actions to be taken at
data but also services that can be linked together, different scales to promote a sustainable develop-
in chains of services, to process data and gener- ment, fostered the importance of geospatial data to
ate new information. Moreover, by registering support decision-making and management related
services into GEOSS, these different resources to degradation and threats affecting the environ-
are now accessible in a standardized way and are ment (Nebert, 2005). Availability and access to
reusable for many different purposes. appropriate information, and the related develop-
The aim of this paper is to present experiences ment of interoperable databases, are the necessary
gathered through different United Nations (UN) conditions for creating the basis for supporting the
and European research projects and to discuss information management needs of implementing
promises and challenges envisioned in partici- and monitoring sustainable development policies
pating to an initiative like GEOSS, both in term and goals, such as the United Nations Millen-
of building chains of services and sharing data. nium Development Goals (MDGs) (Henricksen,
2007). The MDGs are eight development objec-
tives (eradicate extreme poverty and hunger,
THE NEED FOR DATA SHARING achieve universal primary education, promote
AND INTEGRATION gender equality, reduce child mortality, improve
maternal health, combat different diseases, ensure
Until very recently, the different systems used to environmental sustainability, and develop and
acquire environmental data were mostly operating

1262
Sharing Environmental Data through GEOSS

global partnership for development) that all UN different political/institutional levels (Rajabifard
members have agreed to achieve by 2015. & Williamson, 2004). In this sense, interoperabil-
Over the last twenty years, the emergence and ity appears to be a key element enhancing data
evolution of Geographic Information Systems sharing, communication and efficiency.
(GIS) technology and the advent of applications The great advantage of interoperability is
such as Google Earth or OpenStreetMap (Craglia that it describes the ability of locally managed
et al., 2008), allowed for a clear change on how and distributed heterogeneous systems (different
geospatial data are handled and incorporated into operating systems, different databases, different
regular workflows of organizations and agencies data formats) to exchange data in real time to
in the governmental, private and public sectors provide a service (OGC, 2004). The shift towards
(Booz, Allen, & Hamilton, 2005). Highlighting a processed-based infrastructure offering reusable
these changes, Masser (2007) stated that to real- and standardized components responsive to user
ize the full potential and benefits of geospatial needs and requests is supported by the Service
data, access must be maximized with the help Oriented Architecture (SOA) concept. In a SOA,
of Spatial Data Infrastructures (SDIs), that al- services are the elementary components represent-
low users to share, discover, visualize, evaluate ing a set of operations that could be invoked by
and retrieve geospatial data. Moreover, the vast users allowing them to access, in the case of the
amount of data needed to run a complex model geospatial community, distributed geospatial data
(e.g., in climatology or ecology), and the recogni- as well as geoprocessing services. To implement
tion that organizations and/or agencies need more and deploy geo-enabled services, the OGC pro-
data than they can afford financially (Rajabifard poses a suite of standards that use services over
& Williamson, 2001), reinforce the concept that the Internet, so-called web services, giving access
once a particular set of geospatial data has been to distributed data and services through Uniform
created, it should be accessible to potential users in Resource Locators (URLs). This allows data
both the public and private sectors (Ryttersgaard, providers to publish standardized services inde-
2001). This reinforces the need to store such data pendently on how it is implemented and on which
in databases that are made widely accessible for platform it is executed. This emphasizes the full
various purposes (Philips et al., 1999). As a con- potential of interoperability allowing an organiza-
sequence, geospatial data can be seen as a shared tion to maximize the value and reusability of data
resource which is maintained continuously. under its control and giving the ability to exchange
To remove the barriers that block and impede these data with other interoperable systems. Using
a wide use of geospatial data and related infor- such OGC web services offers the possibility to
mation, Masser (2005, 2007) identified different seamlessly couple and reuse them in a variety of
needs such as eliminating or reducing restrictions applications. By chaining together a series of web
on data access and availability (but protecting services, users can perform a set of operations to
intellectual property rights), promoting interoper- process data whereby new knowledge emerges
ability between different data sets and different from relationships that were not envisioned before
systems, and disseminating the information about (Open Geospatial Consortium, 2004). Granell et
data (metadata). Altogether these objectives are al. (2009) define service chaining as a mechanism
designed to create an environment that fosters for combining individual geospatial web services
activities for using, managing, producing and to create customized web applications. Although
sharing geospatial data in which all stakeholders current SDIs mostly offer the abilities to search,
can cooperate with each other and interact with view and access data, with the support of interop-
technology, to better achieve their objectives at erable services and SOA related concepts it is

1263
Sharing Environmental Data through GEOSS

now possible to build new applications based on disseminating information about the state of the
distributed services (Friis-Christensen, et al., 2007; world’s environment in a timely and understand-
Diaz et al., 2008). When services are organized able manner. To provide reliable environmental
through a coherent chain, combined services can assessments and early warnings, GRID-Europe
achieve a larger task (Di, 2004). The International specialized in handling and analyzing spatial
Organization for Standardization (ISO) through and statistical data on environmental and natural
its ISO 19119 standard (ISO, 2005) defines three resource issues through computerized GIS and
types of chaining: remotely-sensed imagery. Over the years, GRID-
Europe has compiled an archive of global, Euro-
• Transparent (user-defined): the work- pean and other geospatial databases as part of its
flow is defined and managed by users. information management function. The experience
• Translucent (workflow-managed): users and in-house capabilities of GRID-Europe offer
invoke a service that manages the chain. a great potential to make geospatial and tabular
Users are aware of atomic services that databases compiled over the years available to a
constitute the chain. large array of users. Since its foundation, the Ge-
• Opaque (aggregated): users invoke an ag- neva office has received considerable support from
gregated service that carries out the chain. Swiss and local authorities as well. This supporting
Users have no awareness of the atomic ser- was significantly reinforced, and GRID-Europe’s
vices that constitute the chain. institutional base broadened, with the signing of
a “Partnership Agreement” between UNEP, the
In this paper, we will focus on the transparent Federal Office for the Environment (FOEN) and
chaining either by hard coding or by using OGC the University of Geneva in June 1998.
Web Processing Service (WPS) specification GRID-Europe closely monitors developments
(OGC, 2007). in information technologies and examines their
Through its online catalogue of registered utility for environmental monitoring and policy
services, GEOSS is an interesting and promising formulation and thus is extending and developing
entry-point to discover and access services that its field of activities using SDIs. Moreover, the
could be integrated into service chaining process. “Partnership Agreement” provides a major op-
It offers a framework to share data, expose them portunity to work at different geographic scales
through interoperable services and allow the pro- ranging from global, to regional (Europe) and
duction and dissemination of timely and accurate national (Swiss) and finally local (Geneva). Such
data needed by decision makers and the public a specificity allows GRID-Europe to participate to
(GEO secretariat, 2005). different applied research projects funded either by
the United Nations or the European Commission.
A common ground for these projects is to serve
SERVING DATA INTO GEOSS and share data through the European Directive
on Infrastructure for Spatial Information in the
In 1985, the United Nations Environment Pro- European Community (INSPIRE) (European
gramme (UNEP)/Division of Early Warning and Commission, 2007), the United Nations Spatial
Assessments/GRID-Europe was founded as one Data Infrastructure (UNSDI) (Henricksen, 2007),
of the first two centres of the Global Resource as well as GEOSS.
Information Database (GRID) network to support
environmental decision-making within UNEP
and the UN system as a whole, by generating and

1264
Sharing Environmental Data through GEOSS

PREVIEW Global Risk Data Platform Climate, Disasters, Health and Gross Domestic
Product (GDP). The data can be displayed and
The PREVIEW (Project of Risk Evaluation, Vul- explored on-the-fly through maps, graphs, data
nerability, Information, and Early Warning) Global tables, downloaded in various popular formats,
Risk Data Platform (http://preview.grid.unep.ch) or copied and pasted into word processors. All
is a collaborative effort of UNEP, United Na- information products in the GEO Data Portal can
tions Development Programme (UNDP/BCPR), be accessed and used as web services as well.
United Nations International Strategy for Disaster The retrieval of statistical and country-wide in-
Reduction (UNISDR) and the World Bank to formation has been enabled via a Simple Object
share geospatial data on global risk from natural Access Protocol (SOAP) connection; data from
hazards. Users can freely visualize, download the database can be retrieved as maps via WMS
or extract data on past hazardous events, human or WFS; graphs can be displayed via a direct
and economical hazard exposure and risk from Uniform Resource Locator (URL) usage.
natural hazards. The platform covers nine types
of natural hazard: tropical cyclones and related enviroGRIDS
storm surges, drought, earthquakes, biomass fires,
floods, landslides, tsunamis and volcanic erup- EnviroGRIDS (http://www.envirogrids.net) is
tions. The collection of data is made via a wide a European research project that will last from
range of partners. This geoportal was developed as 2009 until 2013 and is funded under the seventh
a support to the 2009 Global Assessment Report on framework programme (FP7). The Black Sea
Disaster Risk Reduction (United Nations Interna- Catchment is largely following an ecologically
tional Strategy for Disaster Reduction Secretariat, unsustainable pathway based on inadequate re-
2009), replacing the previous PREVIEW platform source management that could lead to severe
initially designed by UNEP/GRID-Europe and environmental, social and economical problems,
already available since 2000. The new PREVIEW especially in a changing climate (WWF, 2008).
platform is fully compliant with the OGC Web The aim of the project is to build capacities in the
Services (OWS) to access data using Web Map Black Sea region to use new international standards
Service (WMS), Web Feature Service (WFS), to gather, store, distribute, analyze, visualize and
Web Coverage Service (WCS), geo-enabled Re- disseminate crucial information on past, present
ally Simple Syndication (GeoRSS) or Keyhole and future states of this region, in order to assess
Markup Language (KML) as well as metadata its sustainability and vulnerability. EnviroGRIDS
using Catalogue Service for the Web (CS-W). objective is to federate and strengthen existing
Observation Systems to address several GEOSS
GEO Data Portal Societal Benefit Areas within a changing climate
framework. The expected result will be a shared
The GEO Data Portal (http://geodata.grid.unep. information system that operates on the boundary
ch) is the authoritative source for data sets used by of scientific/technical partners, stakeholders and
UNEP and its partners in the Global Environment the public. It will contain early warning systems
Outlook (GEO) report and other integrated envi- able to inform in advance decision-makers and the
ronment assessments. Its online database holds public about risks to human health, biodiversity
more than 550 different variables, as national, and ecosystems integrity, agriculture production
sub-regional, regional and global statistics or as or energy supply caused by climatic, demographic
geospatial data sets (maps), covering themes such and land cover changes on a 50-year time horizon.
as Freshwater, Population, Forests, Emissions, To achieve and support the enviroGRIDS vision

1265
Sharing Environmental Data through GEOSS

and objectives, a grid-enabled Spatial Data Infra- changing or conflicting water resource demands.
structure (gSDI) is under construction. The aim Adaptation and policy options will be elaborated
of the gSDI is to host and analyze the data for the on the basis of the results. The chain of processes
assessment of GEOSS Societal Benefit Areas, as involved in climatic, cryospheric and hydrologic
well as the data produced within the project. These models is complex because each process impacts
data must be gathered and stored in an organized on different compartments of human and natural
form and accessible in an interoperable way on systems. Different types of data covering various
the grid infrastructure in order to provide a high geographical regions are therefore necessary to
performance and reliable access through standard- build different sets of scenarios, which translates
ized interfaces. into substantial amount of data.

ACQWA
TECHNICAL COMPARISON
ACQWA (http://www.acqwa.ch) stands for As- AND COMMON GROUNDS
sessing Climate impacts on the Quantity and
quality of Water. It is also a FP7 European re- All these projects have in common that they already
search project lasting from 2008 until 2013. As share (or will share in a near future) their data and
the evidence for human induced climate change metadata into the GCI. As a pre-requisite all the
becomes clearer, so does the realization that its registered services have to be interoperable using
effects will have impacts on natural environment mainly standards proposed by the OGC, but also
and socio-economic systems. Some regions are other protocols like the Simple Access Object
more vulnerable than others, both to physical Protocol (SOAP). A short comparison of these
changes and to the consequences for ways of life. different projects is given in Table 1, indicating
According to the description of work, the project which services are available, which software are
will assess the impacts of a changing climate on the used to publish these services, what are the types
quantity and quality of water in mountain regions of models used to chain these layers, and finally
which are particularly affected by rapidly rising what are the challenges and difficulties raised
temperatures, prolonged droughts and extreme while integrating these services.
precipitation. Modeling techniques will be used Most of these projects make use of free and
to project the influence of climatic change on the open source software (PostgreSQL/PostGIS,
major determinants of river discharge at various MapServer, GeoServer, GeoNetwork and PyWPS)
time and space scales. Regional climate models because it can ease the portability and replicabil-
will provide the essential information on shifting ity of tools developed. Indeed, many countries
precipitation and temperature patterns. Snow, ice, with low to moderate incomes are often also af-
and biosphere models will feed into hydrological fected by natural hazards, environmental threats
models in order to assess the changes in seasonal- or degradation, and these countries are espe-
ity, amount, and incidence of extreme events in cially interested to manage and share their geo-
various catchment areas. Environmental and socio- spatial data using free and open sources software.
economic responses to changes in hydrological Having tools readily available to be deployed in
regimes will be analyzed in terms of hazards, these countries is a strong incentive for capacity
aquatic ecosystems, hydropower, tourism, agri- building, knowledge transfer, and sharing of
culture, and the health implications of changing expertise.
water quality. Attention will also be devoted to the These projects are also strongly related to ca-
interactions between land use/cover changes, and pacity building in order to enhance an “open and

1266
Sharing Environmental Data through GEOSS

Table 1. Technical comparison of enviroGRIDS, ACQWA, GEO Data Portal and PREVIEW projects

Project name enviroGRIDS ACQWA GEO Data Portal PREVIEW


Services WMS, WFS, WCS, CS-W, WMS, WFS, WCS, CS-W, WMS, WFS, WCS, CS-W, WMS, WFS, WCS, CS-W,
KML, GeoRSS, WPS, WPS, KML, GeoRSS SOAP KML, GeoRSS
grid services
Software GeoServer, ArcGIS GeoServer, GeoNetwork, GeoServer, GeoNetwork, GeoServer, GeoNetwork,
Server, PyWPS, GeoNet- PyWPS MapServer MapServer
work, gLite
Type of models - Hydrological models - Snow cover mapping - providing base layers - providing base layers
(socio-economic,...) (events, risk,...)
Challenges & - linking SDI and grid - capacity building - data integration - data integration
difficulties infrastructure - data integration - data/metadata harmoni- - data/metadata harmoni-
- capacity building zation zation
- authorization/ authenti- - different standard imple- - capacity building
cation mentation
- portal integration - capacity building

sharing spirit”. It is necessary to show and prove Through simple data integrating scenarios
the benefits of data sharing through appropriate (integration into other web portals or applications)
examples, to communicate best practices as much GEO Data Portal and PREVIEW have pointed out
as possible and to develop guidelines and poli- different issues. Integrating some socio-economic
cies. Altogether this will help to reach agreement data sets coming from the GEO Data Portal with
and endorsement on the use of new standards. natural hazards maps of the PREVIEW project to
Such a participative approach will certainly compute, for example, economical exposition of a
stimulate data providers to be more “open” and country to a specific hazard, was impossible. This
in consequence to share their data. The different problem comes from the different implementations
projects presented before will organize different of OGC specifications between Mapserver (used
workshops and develop various teaching mate- by the GEO Data Portal) and Geoserver (used by
rial allowing participants, ranging from students PREVIEW). Indeed, it appears that Mapserver use
to members of government, to learn how to use an argument “MAP” that is not standardized and
the specific applications to share large amount of not recognized by all clients. This problem will
data. Rajabifard and Williamson (2004) believe be solved by migrating to Geoserver all the data
that building capacities is an important challenge services of the GEO Data Portal. Thus implemen-
for SDIs concepts to be accepted and adopted at tation of a same specification can differ from one
a large extent. For these authors, the best way to software to another and can impend a consistent
reach this objective is to establish a long-term com- integration of services.
mitment to education and research: otherwise the Another issue raised by data integration process
SDI vision will remain unclear and unachievable. was raised by the United Nations High Commis-
Through these projects, the objective is to build sioner for Refugees (UNHCR) while trying to
the capacity of scientists to share and document integrate WMS data coming from the PREVIEW
their data in order to strengthen existing observa- project in order to identify area that are not suitable
tion systems, the capacity of decision-makers to to install refugees camps. It appears that the only
use it, and the capacity of the general public to projection available was EPSG:4326 (Geographic)
understand the important environmental, social whilst UNHCR geoportal makes use of Google
and economic issues at stake. maps in EPSG:900913 (Spherical Mercator). This

1267
Sharing Environmental Data through GEOSS

experience showed us that it is important, while (MODIS) and Shuttle Radar Topography Mis-
publishing data services, to support at least the sion (SRTM). Project partners that are currently
most frequent projection types. Geoserver sup- working to produce such estimations are working
ports natively all types of projections and it is with PCI Geomatica, doing all the process chain
easy to reproject on-the-fly data stored in another manually. Our objective is to help our partners
projection so that it can be integrated with data by publishing a WPS geoprocessing service that
with other projections. In addition, following the allows them to automatize this analysis (Figure
size of the data set, an important processing over- 1). Once retrieved by FTP, MODIS images are
load has been observed caused by the on-the-fly saved on a server that store also SRTM tiles. All
reprojection process. This can slow the service data are in EPSG:4326 and will be available using
chain and impend and efficient data integration. WCS standard published by Geoserver. Finally,
In the ACQWA project, a specific constraint the WPS service, currently under development
is the important number of partners involved and using PyWPS, will implement the different steps
their different scientific backgrounds (climatolo- to process the data. A major difficulty encountered
gists, hydrologists, glaciologists, ecologists). It is until now is to “translate” the PCI functionalities
quite challenging to raise awareness on new tools by finding the equivalent in Geographic Resources
and way of gathering and exchanging data with- Analysis Support System (GRASS) software.
out strongly influencing the way these different Indeed, PyWPS does not process data by itself
communities are working with geospatial data. and instead uses GRASS as a backend to access
For that reason, the aim is to concentrate on the all the geoprocessing functionalities.
promotion of GEOSS as an interesting and useful Once the snow cover process is successfully
framework to handle and discover scientific data. achieved, our hope is to convince other commu-
Obviously, a dedicated geoportal is under develop- nities within the project to benefit from such an
ment to register the main outputs of the ACQWA approach and to develop other scenarios espe-
project into GEOSS using OGC web services. cially making use of climate data.
Nevertheless to show the benefits of working with In the process of turning data into understand-
interoperable services, we are currently developing able information and knowledge by chaining
a scenario to make estimation of snow cover from data services a new challenge has emerged. The
remote sensing imagery using data coming from ever-increasing spatial and temporal resolution
the Moderate Resolution Imaging Spectrometer of geospatial data are causing a tremendous in-

Figure 1. Data sources and processing steps for a geoprocessing service estimating snow cover

1268
Sharing Environmental Data through GEOSS

crease in term of data volumes and the limits of The main scientific and technological chal-
the processing capacities of traditional GIS and lenge of the enviroGRIDS project will be to link
SDI are being reached. With the advent of grid an SDI with a grid infrastructure to benefit from
computing and the progressive deployment of the processing capacities offered by grids. Indeed,
large grid infrastructure projects (e.g., Enabling WPS appears to be an adequate candidate to be
Grids for E-sciencE) many scientific disciplines grid-enabled because, first, it supports SOAP
now have access to sizable computing resources protocol and, second, geospatial community has a
and new opportunities are emerging. For Foster et growing processing need that current SDIs cannot
al. (2008) grid aims to federate resource sharing deliver. A grid-enabled SDI will allow users to
in a dynamic and distributed environment across a model high resolution hydrological models (e.g.,
network allowing to access unused CPUs and stor- Soil and Water Assessment Tool) of the Black Sea
age space to all participating computers. Currently, catchment under various climate, land cover and
SDIs are lacking processing power and should demographic scenarios. In order to develop such
therefore be made interoperable with grid infra- a gSDI to support the development of Black Sea
structures, which are offering large storage and portal functionalities, the different components
computing capacities. Recent studies (Muresan et of the enviroGRIDS architecture are currently
al., 2008; Di, Chen, Yang, & Zhao, 2003) applied being defined to highlight the main issues emerg-
a successful approach to extend grid computing ing from different conceptual and technological
to the remote sensing community and to make solutions (Figure 2).
OGC web services grid-enabled. Both studies These issues concern the choices of software
considered that the grid has a great potential for components, data repositories, data management,
the geospatial disciplines. Padeberg and Greve grid-oriented processing, grid portal, and interop-
(2009) have identified several differences between erability between SDIs and grid infrastructures.
OGC-compliant SDIs and grid infrastructures Although the use of grid-enabled web services to
concerning service description, service interface, access data sets stored in the SDI will also be
service state and security. In particular, grid infra- explored (Maué & Kiehle, 2009), bridging archi-
structures are based on SOAP messaging protocol tectural gaps between grids and SDIs remains
to invoke operations and Web Service Description very challenging (Padberg & Greve, 2009) with-
Language (WSDL) to describe services. OGC- out extensions and customizations. For example,
compliant does not support neither SOAP nor an important question concerns the location of
WSDL, except WPS, and thus chaining geospatial geospatial data repositories: inside or outside the
services with grid services could be problematic. grid? The answer is not trivial and will greatly
In addition, OGC standards do not provide any influence services and in particular chains of
security mechanisms (authentication, encrypted services to process data. In the one hand, being
communication between resources) which is a outside the grid, all OGC-compliant services
major concern in grid infrastructures. Finally, Di functionalities remain the same and grid services
et al. (2003) showed that the current grid metadata are only used to process the data. On the other
catalog system is not good enough to answer the hand, being inside the grid, all OGC web ser-
needs of the geospatial community, especially vices have to be modified to support grid environ-
the requirements of the ISO19115 standard. All ment, becoming grid-enabled. The latter would
these differences must be overcome in order to allow benefiting from all the advantages of the
allow traditional SDIs to benefit from the power grid (security, replicability, scalability, storage
of grid computing, and consequently to offer new and processing capacities) but would obviously
services to GEOSS. require a lot of developments for adapting already

1269
Sharing Environmental Data through GEOSS

Figure 2. EnviroGRIDS grid-enabled SDI components supporting Black Sea portal

existing SDIs. In consequence, an incremental development requires the integration of a large


development and implementation strategy will be number of different data from various sources.
developed taking into account different integration Through agreed common standards and a clear
scenarios aiming to hide the complexity of the political will, these data can be integrated in an
grid while preserving OGC interfaces. interoperable way, leading to a new collaborative
approach to decision-making.
Having environmental data in digital form
CHALLENGES AND PROMISES allows easy storage and dissemination, facilitate
data exchange and sharing, faster and easier up-
From the experience acquired, or being acquired, date and corrections, ability to integrate data from
through these different projects, it is obvious that multiple source (see Figure 1), and customization
many challenges remain both tangible (e.g., tech- of products and services (Henricksen, 2007). In
nology) or less tangible (e.g., culture, behavior). this sense SDIs appear to be a good choice to
Nevertheless, it is critical to overcome them in encompass the sources, systems, network link-
order to improve our knowledge, share our experi- age standards and institutional issues involved
ence and attempt to strive towards a society that is in delivering geospatial data from many data
better informed. Achieving the goal of sustainable sources to the widest possible group of potential

1270
Sharing Environmental Data through GEOSS

users (Coleman, McLaughlin, & Nichols, 1997). levels. In particular, building a system of sys-
The fact that, during the last years, multiple tems like GEOSS is highly dependent on a clear
SDIs initiatives have been developed all around governance structure that is understandable and
the world, ranging from local to regional levels, acceptable by the volunteer participants in order
is a good sign. It appears that there is a grow- to develop a shared vision of the system and to
ing recognition that geospatial data is a critical allow users to feel a common sense of ownership
element underpinning decision making in many (Masser, 2007). As it is reminded in the Strategic
disciplines (Rajabifard & Williamson, 2001) and Guidance document (GEO secretariat, 2007a), the
as such needs to be effectively managed. success of GEOSS will depend on interoperability
The SDI hierarchy model proposed by Rajabi- arrangements that data providers agree to endorse.
fard (2002) is composed of inter-connected SDIs As a provider of environmental data, GRID-
developed at different levels (from local to global). Europe is continuously facing the challenge of
Each SDI of a higher level is formed by the integra- encouraging data providers to go “open” and to
tion of data developed and made available by the share their data in an interoperable and OGC-
lower level. Such a hierarchy can be approached compliant way. At present, technology is no longer
though two views: on one hand, it is an umbrella a problem because solutions based on a variety
in which the SDI at a higher level encompasses of software can be proposed and/or developed
all SDI components from lower levels. On the depending on the requirements and the technical
other hand, it can be seen as the building blocks capabilities available. The most difficult task is
supporting the access of data needed by SDIs at to create an environment allowing wide agree-
higher levels. This hierarchy allows creating an ment on data sharing principles. In this particular
environment in which users working at any level regard, the GEOSS “best practices wiki” could be
can rely on data from other levels and integrate data of great benefit to help people promote sharing
from different sources (Mohammadi & Rajabifard, principles. A lesson learned from our experience
2009). Such a hierarchy is clearly envisioned in is that once users can discover data they need,
the concept of the system of systems on which their most important preoccupation is to know
GEOSS relies, integrating systems together into what is the quality of the data they are going to
an information highway which both links together access and whether they can trust this data. We
environmental, socio-economic and institutional are convinced that sharing data is an efficient
databases and provides a movement of information way to eventually recognize whether this data is
from local to global levels. For Masser (2006), of sufficient quality. By submitting/exposing the
the SDI hierarchy poses the challenge of multi- data to the judgment of the broader community,
stakeholder participation in SDI implementation, one can know if it is useful or not. Through data
because the bottom-up approach differs a lot from sharing, one can also benefit from the interaction
the top-down approach. The top-down vision, with end users by receiving feedbacks and then
common in the SDI literature, emphasizes the improve the data sets accordingly. Sharing data and
need for standardization and uniformity while participating to GEOSS can therefore contribute
the bottom-up view stresses the importance of to the improvement of data, which in turn allows
diversity and heterogeneity caused by the dif- better information and eventually better decisions.
ferent needs of the various stakeholders. As a In the current climate of economic constraints,
consequence, it is necessary to find a consensus interoperability and standardization have never
ensuring sufficient standardization and uniformity been so important because a non-interoperable
while recognizing the diversity and heterogeneity system impedes the sharing of data, information
of the different stakeholders acting at different and resources, which increase the risk for a system

1271
Sharing Environmental Data through GEOSS

to fail in delivering its expected benefits and to necessary data) especially if this service is chained
remain unused (Open Geospatial Consortium, with other services.
2004). Geospatial data can be an expensive and Consequently, GEOSS represents a very
time consuming resource to produce, and for promising and potentially powerful framework to
this reason, it is of high importance to improve share and expose data. In particular, the fact that
accessibility and availability and promote its a good governance structure is already in place
reuse. Many decisions that organizations need to allows a clear vision that can be easily shared
make depend on good quality and consistent data, and endorsed by the participants. The fact that
readily available and accessible (Rajabifard & participating to GEOSS is on a voluntary basis
Williamson, 2001). The process of reuse does not could be seen either as a great opportunity or
only concern the data itself, but also encompasses as a risk. Indeed, the voluntary aspect poses the
the capabilities, skills developed, invested effort threat that only a few data providers join such an
and capital. This process allows an organization initiative and, as a consequence, the system could
to share the costs of data, people, and technology, miss its objectives. Nevertheless, the growing
which helps realize more rapid returns on invest- number of components and services registered
ment. By reusing data, one can avoid duplication through GEOSS is a good sign for optimism. In
of efforts and expenses and enable users to save particular, we think that international organizations
resources, time and effort when trying to acquire such as UNEP could play a major role by paving
or maintain data sets (Rajabifard & Williamson, the way toward a broader acceptance by similar
2001). organizations. The fact that GEOSS is based on
Percivall (2006) claimed that in a distributed distributed systems that can operate, evolve and
environment, the help of open standards such be managed in a relative independence appears
as OGC can help scientists to rapidly find and to be a good choice to find a consensus ensuring
evaluate a lot of different data sets and processing sufficient standardization and uniformity, while
approaches, providing a flexible and coopera- recognizing the diversity and heterogeneity of
tive environment that foster collaboration in the the different stakeholders. Finally, GEOSS offers
different scientific communities that work with a unique characteristic that justifies by itself its
geospatial data. Thus, organizing the workflows existence, which is the possibility to see emergent
using standard-based web services could provide a properties. For Béjar et al. (2009), this emergence
great benefit in term of productivity to address the is the main objective of a system of systems, where
nine SBAs of GEOSS. OGC standards provide a users perform functions that cannot be made with
solid ground for interoperability between services any single component. This means that such a
within distributed geoprocessing environment system is more than the sum of its parts and offers
offered by SDIs (Friis-Christensen et al., 2007). the possibility to better understand the complex
In particular, the fact that these services can be relationships between the different components
reused and chained within other applications is of the Earth system.
a very useful aspect offering the opportunity to
solve specific problems in a more flexible way
than with stand-alone applications. Nevertheless, CONCLUSION
some performance issues can appear with services
that need to access and move large amount of data. Geospatial data is a critical element underpin-
This can negatively impact the execution time ning decision-making for many disciplines and
of this service (e.g., huge overload in gathering is indispensable to make sound decisions at all
levels, from global to local. Experiences from

1272
Sharing Environmental Data through GEOSS

developed countries show that more than two- ment Goals and to offer a unique framework to
thirds of human decision-making are affected by share data and collaborate for a better society. In
spatially-referenced data (Ryttersgaard, 2001). this sense, organizations such as UNEP can act
Even if the technology exists, organizations and as a “catalyst”, contributing to GEOSS, building
agencies around the world are still spending bil- capacities and ensuring that environmental data
lions of dollars every year to produce, manage and are easily accessible. This is a necessary step to
use geospatial data, but they still do not have the ensure better-informed decision-making for the
information they need to answer the challenges our more sustainable development of our planet.
world is facing (Rajabifard & Williamson, 2001).
The web service model proposed by the OGC
appears to be suitable to allow users to combine ACKNOWLEDGMENT
different services to solve a specific problem in a
scalable and flexible way. Nevertheless, through The authors would like to acknowledge the
simple examples of services chaining, we have European Commission “seventh framework pro-
highlighted different issues that could potentially gramme” that funded the enviroGRIDS (Grant
impede an easy integration: problems with dif- Agreement n° 226740) and ACQWA (Grant
ferent implementation of a same specification, Agreement n°212250) projects, and UNEP for
problems regarding different projections used its support. The views expressed in the paper are
in different web applications, overload caused those of the authors and do not necessarily reflect
by on-the-fly reprojection using large data sets. the views of the institutions they belong to.
Moreover, working with different communities
that are not necessarily aware of the possibilities
offered by OGC web services could limit the REFERENCES
diffusion of such approach outside the geospatial
community. These communities need to be con- Arzberger, P., Schroeder, P., Beaulieu, A., Bowker,
vinced, through simple examples, which working G., Casey, K., & Laaksonen, L. (2004). Promoting
with chained services can bring benefits in their Access to Public Research Data for Scientific,
own working flows. Finally, grid computing ap- Economic and Social Development. Data Science
pears to be a promising complement of traditional Journal, 3, 17. doi:10.2481/dsj.3.135
SDIs capabilities to build WPS services for pro- Béjar, R., Latre, M. A., Nogueras-Iso, J., Muro-
cessing large data sets. To achieve this objective, Medrano, P. R., & Zarazaga-Soria, F. J. (2009).
implementation of SOAP protocol into OGC Systems of Systems as a Conceptual Framework
specifications is a pre-requisite in order to allow for Spatial Data Infrastructures. International
the two types of infrastructures to communicate Journal of Spatial Data Infrastructures Research,
(interoperability) and to ease the combination of 4, 17.
OGC and grid services in efficient chains.
Ten years after, GEOSS could be seen as an Bernard, L., & Craglia, M. (2005). SDI - From
initial step to achieve Gore’s vision, because the Spatial Data Infrastructure to Service Driven
relevant technologies are available and there is Infrastructure. Paper presented at the 1st Research
growing recognition that countries can benefit both Workshop on Cross-learning on Spatial Data Infra-
economically and environmentally from better structures (SDI) and Information Infrastructures
access to data. GEOSS has the potential to sup- (II), Enschende, The Netherlands.
port the achievement of sustainable development
initiatives such as the UN Millennium Develop-

1273
Sharing Environmental Data through GEOSS

Booz, A. (2005). Geospatial Interoperability Friis-Christensen, A., Lutz, M., Ostländer, N., &
Return on Investment. Hamilton: NASA. Bernard, L. (2007). Designing Service Architec-
ture for Distributed Geoprocessing: Challenges
Coleman, D. J., McLaughlin, J. D., & Nichols,
and Future Directions. Transactions in GIS, 11(6),
S. (1997). Building a Spatial Data Infrastructure.
799–818. doi:10.1111/j.1467-9671.2007.01075.x
Paper presented at the 64th Permanent Congress
Meeting of the Fédération Internationale des GEO secretariat. (2005). Global Earth Observa-
Géomètres (FIG), Singapore. tion System of Systems 10-Year Implementation
Plan Reference Document. Geneva, Switzerland:
Craglia, M., Goodchild, M. F., Annoni, A., Ca-
GEO.
mara, G., Gould, M., & Kuhn, W. (2008). Next-
Generation Digital Earth: A position paper from GEO secretariat. (2007a). Strategic Guidance for
the Vespucci Initiative for the Advancement of Current and Potential Contributors to GEOSS.
Geographic Information Science. International Geneva, Switzerland: GEO.
Journal of Spatial Data Infrastructures Research,
GEO secretariat. (2007b). The Full Picture. Ge-
3, 22.
neva, Switzerland: GEO.
Di, L. (2004). Distributed Geospatial Information
Gore, A. (1998). The Digital Earth: Understanding
Services – Architectures, Standards, and Research
our planet in the 21st Century (p. 4).
Issues. In Proceedings of the International Ar-
chives of Photogrammetry, Remote Sensing, and Gorgan, D., Bacu, V., Ray, N., & Maier, A. (2009).
Spatial Information Sciences, Istanbul, Turkey EnviroGRIDS data storage guideline (p. 67).
(Vol. 35, Part 2, Commission II).
Granell, C., Gould, M., & Esbri, M. A. (2009).
Di, L., Chen, A., Yang, W., & Zhao, P. (2003). Geospatial Web Service Chaining. In Karimi, H. A.
The Integration of Grid Technology with OGC (Ed.), Handbook of Research on Geoinformatics
Web Services (OWS) in NWGISS for NASA EOS (pp. 189–195). Hershey, PA: IGI Global.
Data. Paper presented at the GGF8 & HPFC12,
Henricksen, B. (2007). UNSDI Compendium:
Seattle, WA.
A UNSDI Vision, Implementation Strategy, and
Diaz, L., Granell, C., & Gould, M. (2008). Case Reference Architecture. UNGIWG.
Study: Geospatial Processing Services for Web-
ISO. (2005). ISO 19119: Geographic Information
based Hydrological Applications. Geospatial
– Services. Gland, Switzerland: ISO.
Services and Applications for the Internet, 31-47.
Masser, I. (2005). The Future of Spatial Data
European Commission. (2007). Proceedings of the
Infrastructures. Paper presented at the ISPRS
Directive 2007/2/EC of the European Parliament
Workshop on Service and Application of Spatial
and the Council of 14 March 2007 establishing an
Data Infrastructure, Hangzhou, China.
Infrastructure for Spatial Information in the Euro-
pean Community (INSPIRE), Brussels, Belgium. Masser, I. (2006). Multi-level Implementation of
SDIs. GIM International, 20, 4.
Foster, I., Yong, Z., Raicu, I., & Lu, S. (2008).
Cloud computing and grid computing 360-degree Masser, I. (2007). Building European Spatial
compared. Paper presented at the 2008 Grid Data Infrastructure. Redlands, CA: ESRI Press.
Computing Environment Workshop, Austin, TX.

1274
Sharing Environmental Data through GEOSS

Maué, P., & Kiehle, C. (2009). Grid technologies Philips, A., Williamson, I., & Ezigbalike, C.
for geospatial applications: an overview. GIS (1999). Spatial Data Infrastructure concepts. The
Science, 3, 65–67. Australian Surveyor, 44(1), 8.
Mohammadi, H., & Rajabifard, A. (2009). Multi- Rajabifard, A. (2002). Proceedings of Diffusion
source Spatial Data Integration within the Context of Regional Spatial Data Infrastructures: with
of SDI Initiatives. International Journal of Spatial Particular Reference to Asia and the Pacific,
Data Infrastructures Research, 4, 18. Melbourne, Australia.
Muresan, O., Pop, F., Gorgan, D., & Cristea, V. Rajabifard, A., & Williamson, I. P. (2001). Spatial
(2008). Satellite Image Processing Applications Data Infrastructures: Concept, SDI Hierarchy
in MedioGRID. Paper presented at the 5th Inter- and Future Directions. Paper presented at the
national Symposium on Parallel and Distributed Geomatics’80, Tehran, Iran.
Computing.
Rajabifard, A., & Williamson, I. P. (2004). SDI
Nebert, D. D. (2005). Developing Spatial Data Development and Capacity Building. Paper pre-
Infrastructure: The SDI Cookbook. sented at the GSDI-7, Bangalore, India.
(2004). Open Geospatial Consortium (p. 7). The Ryttersgaard, J. (2001). Spatial Data Infrastruc-
Havoc of Non-Interoperability. ture: Developing Trends and Challenges. Paper
presented at the International Conference on
Open Geospatial Consortium. (2007). OpenGIS
Spatial Information for Sustainable Development,
Web Processing Service (p. 87).
Nairoibi.
Padberg, A., & Greve, K. (2009). Gridification
United Nations International Strategy for Disaster
of OGC web services: challenges and potential.
Reduction Secretariat (UNISDR). (2009). Global
GIS Science, 3, 77–81.
Assessment Report on Disaster Risk Reduction
Percivall, G. (2006). GEOSS to Benefit from 2009. Geneva, Switzerland: UN.
“Service Chaining” Based on OGC® Standards
World Wide Fund. (2008). Greening the Black
(p. 5). Geoinformatics.
Sea Synergy. Brussels, Belgium: WWF.

This work was previously published in the International Journal of Applied Geospatial Research (IJAGR), Volume 2, Issue 1,
edited by Donald Patrick Albert, pp. 1-17, copyright 2011 by IGI Publishing (an imprint of IGI Global).

1275
1276

Chapter 77
Agriculture and Conservation
in the Natura 2000 Network:
A Sustainable Development
Approach of the European Union

Cristian Ioja Maria Patroescu


University of Bucharest, Romania University of Bucharest, Romania

Laurenţiu Rozylowicz Mihai Niţă


University of Bucharest, Romania University of Bucharest, Romania
Diana Onose
University of Bucharest, Romania

ABSTRACT
The Natura 2000 network represents a new approach in the sustainable spatial planning promoted at
the European Union level. Agricultural landscapes comprise 28.6% of the surface area of the Natura
2000 sites, many of which have significant conservation values. Plant and animal species, and ap-
proximately 30% of the natural habitats of community interest are directly influenced by the presence of
certain agricultural activities. This chapter presents a GIS analysis of the European Union Natura 2000
ecological network: spatial distribution of Natura 2000 sites in EU-27, dynamic agricultural surfaces
in Natura 2000 sites, and GIS tools in managing process. GIS techniques must represent the tool by
which the efficiency of this ecological network is monitored, as it must be permanently nourished with
important financial resources.

INTRODUCTION ing biological communities (Baur et al., 2006;


Kuemmerle, Muller, Griffiths & Rusu, 2009;
Ecologically, agriculture has a dual nature, being Pykala, 2003; Ruprecht, Enyedi, Eckstein &
considered the main risk affecting biodiversity at Donath, 2010). Globally, protected areas embody
a global level (Primack, Patroescu, Rozylowicz the most frequently used instrument for limiting
& Ioja, 2008) but also the support for sustain- threats upon representative samples of species
and natural ecosystems (Balmford et al., 2002),
DOI: 10.4018/978-1-4666-2038-4.ch077

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Agriculture and Conservation in the Natura 2000 Network

but also for preserving and valorising certain The projected benefits of the Natura 2000
traditional agricultural landscapes (Plieninger, network regarding risk control can confer long-
Hochtl & Spek, 2006). term protection to a greater number of species and
The intensity of biodiversity threats induced habitats, and a more efficient use of the available
by agriculture is extremely high in EU-27, where natural resources (European Commission, 2009;
agricultural fields comprise 47.4% of the land Ioja et al., 2010). Beside their inherent ecological
area (European Commission, 2009). Habitat value (Gaston, Jackson, Nagy, Cantu-Salazar &
disturbance, overexploitation, pollution, invasive Johnson, 2008) in underdeveloped areas many
alien species, and disease characterize the most Natura 2000 sites are created for the poten-
common threats encountered in the European tial to alleviate social and/or economic issues
space in which agricultural activities affect bio- (Dimitrakopoulos, Memtsas & Troumbis, 2004;
diversity (Plieninger et al., 2006; Primack et al., Mauerhofer, 2010; Rauschmayer, van den Hove
2008; Schmitt & Rakosy, 2007). & Koetz, 2009).
Agricultural techniques are becoming increas- The Natura 2000 network is perceived as
ingly destructive, as the methods are devoted a “social network” fit for the European Union
mainly to an increase in productivity (Stoate et landscape that intertwines the preservation of
al., 2009). This destruction can be minimized by nature with the maintenance of a sustainable tra-
the new orientations of the Common Agricultural ditional lifestyle for the local communities (e.g.,
Policy (CAP), promoting agri-environmental supply of locally grown products/produce, local
schemes (EEA, 2005; Piorr et al., 2009). The CAP employment opportunities, and/or eco-tourism)
will moderate the negative effects of agriculture (Bladt, Strange, Abildtrup, Svenning & Skov,
upon biodiversity (Henle et al., 2008), and reac- 2009; Maiorano, Falcucci, Garton & Boitani,
tivate benefits generated by ecosystem services 2007; Young et al., 2007).
(Hockings, 2003). The new schemes may also be Integrating the socio-economic elements
crucial in maintaining the efficiency of agricultural among the management objectives of the Natura
activities (EEA, 2009; Piorr et al., 2009). Illustra- 2000 sites significantly complicates the manage-
tive of this issue is the promotion of incentives ment methods (Pullin et al., 2009), the interest
for high nature value farmland, with the purpose in utilizing natural resources, and rendering
of increasing the conservation of avian species services being often of higher importance value
dependant on agricultural habitats (EEA, 2004). than conservation (Anthon, Garcia & Stenger,
In this framework, the Natura 2000 network 2010; Berentsen, Hendriksen, Heijman & van
represents an European Union goal, aimed at pro- Vlokhoven, 2007).
moting the conservation of natural habitats of flora To achieve the proposed conservation and
and fauna species, without excluding populated social goals, the Natura 2000 network requires
communities and local economy (Mucher, Hen- a considerable capital infusion of 5.58 billion €/
nekens, Bunce, Schaminee & Schaepman, 2009; year for administration in EU developed coun-
Pullin et al., 2009; Silva, 2009). The enforcement tries only (Bladt et al., 2009). For comparison,
of the Natura 2000 network appeared as a necessity approximately the same amount is invested
in fulfilling the objectives of reducing species and yearly in conservation worldwide, with develop-
habitats loss (Pullin et al., 2009). The European ing countries accounting for less than 12% of the
Union, through the Convention on Biodiversity, total (Bruner, Gullison & Balmford, 2004; James,
Habitats and Birds Directives, assumed these Gaston & Balmford, 2001).
objectives (Cogalniceanu & Cogalniceanu, 2010). The management process for the Natura 2000
sites is one aimed at transparency in decision mak-

1277
Agriculture and Conservation in the Natura 2000 Network

ing and public disclosure (Maiorano et al., 2007; This chapter, evidencing the dual nature of
Papp & Tóth, 2007), and mandates a significant agriculture, is structured into four sections. In the
volume of spatial information for sustaining introduction, the theoretical general framework
restrictive decisions made in a region (Stolon, illustrates the relationship among Natura 2000
2008). Information relating to the distribution sites and agricultural activities, as presented at
of protected species and habitats (Margules & an international level. Section one, Background,
Pressey, 2000; Moilanen, Arponen, Stokland & presents a structured account of the Natura 2000
Cabeza, 2009; Wilson, Cabeza & Klein, 2009), network, and correlates conservation activities
the manner in which risks are present (Hockings, with agriculture generated issues in the Natura
2003), the land use (Feranec, Jaffrain, Soukup & 2000 sites (abandonment of agriculture fields,
Hazeu, 2010), or the habitat carrying capacity are intensification of agricultural activities, conver-
essential in avoiding the emergence of conflicts sion of agricultural fields, development of inad-
among the protected areas managers and land equate utilization practices, and conflict with land
owners (Mauerhofer, 2010; Rauschmayer et al., owners), at the same time listing the main data
2009; Young et al., 2007). sources used in assessing the relationship among
In this framework, GIS techniques represent agriculture and Natura 2000. Section two identity
an important and integrated support for the pro- tools used in land-use analysis in the European
cess of protected areas management, allowing a Union. The third section presents results obtained
systematic conservation planning (Margules & by using GIS techniques concerning the spatial
Pressey, 2000; Wilson et al., 2009). For that reason, distribution of Natura 2000 sites in EU-27 and the
the existence of spatial databases relating to the dynamics of agricultural surfaces in the sites. The
distribution of species and habitats, the natural and last section of the article presents proposals and
human risk characteristics, conflict or functional future study areas for increasing the knowledge
areas in which specific rules must be adhered to, level of the agricultural-conservation activities
are critical in fundament decisions formulated at relationship in Natura 2000 sites.
local, regional and global level in protected areas
(Bock et al., 2005; Boteva, Griffiths & Dimopou-
los, 2004; Mucher et al., 2009). GIS techniques BACKGROUND
are essential in biodiversity management, as the
dimension of environmental changes are becoming Globally, a large percentage of the population and
increasingly dynamic, and decisions must be made their agricultural activities, depend directly and/or
by connecting diverse information (Feranec et al., indirectly of biodiversity (Primack et al., 2008).
2010). The failure of global and national politics The accelerated loss of biodiversity impacts the
of biodiversity conservation is proportional to the population’s ability to produce food in balance,
deficit of vital information (Gaston et al., 2008; promotes poverty and associated problems, de-
Ioja et al., 2010). creases the productivity of natural, semi-natural
The objectives of this study are: a) assessing and artificial ecosystems, broadens natural risks,
the spatial distribution of the Natura 2000 network and commonly creates political instability (Balm-
by using GIS techniques; b) identifying methods ford et al., 2002; Cogalniceanu & Cogalniceanu,
of analysing the relationship among agricultural 2010; Mace & Baillie, 2007).
and conservation activities within Natura 2000, At the international level, the main instrument
and c) identification of digital tools for assessment used by countries is the Convention on Biological
and management of conflict situations among Diversity (CBD), with the purpose of increasing
agricultural activities and conservation activities. the efficiency of conservation measures at a local,

1278
Agriculture and Conservation in the Natura 2000 Network

regional, and global level (Bock et al., 2005; Mace Directive) and SCI (Sites of Community Impor-
& Baillie, 2007). CBD is aimed at identifying and tance, established under the Habitats Directive),
applying stimulants for preserving biodiversity, transforming after the letter of advice from the
promoting instruments and mechanisms of bio- European Commission in SAC (Special Areas
logical resources use, and the equitable distribu- of Conservation) (European Commission, 2009;
tion of the benefits of using biological resources Papp & Tóth, 2007). All Natura 2000 sites follow
(Boisvert & Vivien, 2005). a maintenance and/or restoration program in a
beneficial conservation manner consistent with
What is Natura 2000 Network? the state of natural habitat and/or populations of
species for which the site is designated (EEA,
The European Union’s conservation policy is 2002, 2009).
directed to create a framework allowing member
states the organization of an ecological network Problems in Natura 2000 Network
consisting of protected areas. Its purpose is to Due to Agriculture Activities
optimize the efficiency of protection and manage-
ment actions for species and habitats of community The Natura 2000 is identified as a “social network”,
importance (Araujo, Lobo & Moreno, 2007; Pullin not only with regards to habitats and commu-
et al., 2009; WG, 2002). nity interest species conservation, but also to the
The Natura 2000 ecological network is the sustainable development of human settlements
cornerstone of conservation politics within the (Cogalniceanu & Cogalniceanu, 2010). More to
European Union, being enacted through the the point, a series of economic problems, related
Habitats and Birds Directives (79/409/EEC, 1979; mainly to agricultural activities, are seen as sig-
92/43/EEC, 1992; 2006/105/EC, 2006). Through nificant threats for species and natural habitats
the Natura 2000 network, conservation status within the Natura 2000 sites (EEA, 2005; Hartel,
is offered to 198 habitats, 1288 plants species Schweiger, Öllerer, Cogalniceanu & Arntzen,
(Chiarucci, Bacaro & Rocchini, 2008) and 1140 2010; Rozylowicz & Dobre, 2010).
animal species, consisting of 986 vertebrates and
154 invertebrates (Fontaine et al., 2007; Ioja et • The abandonment of agricultural fields
al., 2010). From the total habitats of community affects between 10% and 50% of the agri-
interest listed in Annex I of the Habitats Directive, cultural land mass of EU-27 member states
16.16% are directly dependent on the continuation (Henle et al., 2008), with an increased
of traditional agricultural practices, such as grass propensity in countries such as the Czech
cutting, pasturage, apiculture, etc., and others Republic, Ireland, Latvia, Romania, and
(Henle et al., 2008). As an example, the lack of Bulgaria (Stoate et al., 2009). The aban-
pasturage activities can determine the progression donment of agricultural fields determines
of areas from pastures to forests, resulting in the the loss of traditional landscapes, increases
loss of numerous species of plants and animals the risks of soil erosion, and deteriorates
adapted to those prior habitats (Baur et al., 2006; habitats for numerous species (Baur et al.,
Plieninger et al., 2006; Ruprecht et al., 2010). 2006; Muller, 2002). EEA (2004) estimates
Additionally, 65 natural habitats and 102 avian that between 5% and 65% of Important
species of community importance have an imperil Birds Areas are threatened by the abandon-
status due to agricultural practices (EEA, 2004). ment of agricultural fields, principally by
The Natura 2000 network includes SPA (Spe- the transformation of pastures and grass-
cial Protected Areas, established under the Birds lands into shrubs and forests.

1279
Agriculture and Conservation in the Natura 2000 Network

• The intensification of agricultural ac- where the resource utilization level was
tivities does not signify a normal reac- relatively high (Henle et al., 2008).
tion to the integration of a territory in the
Natura 2000 ecological network, but still Data for Natura 2000 Management
is an important risk at the EU-27 level (es-
pecially in Ireland, Portugal, Spain and The management of Natura 2000 sites requires a
the Baltic countries). Overutilization of a thorough knowledge of these threats, which must
region’s resources can produce a lessen- be removed, minimized, or at worst controlled.
ing in the natural and semi-natural ecosys- Towards this objective, GIS techniques present
tems’ capacity of support to biodiversity, a solid support, already being utilised in this
through an excessive infusion of pollutants goal. Boteva et al. (2004), based on a multilevel-
or through the depletion of nutritive sub- criteria, analyse the significance of plant forma-
stances (Kuemmerle et al., 2009; Schmitt tions included in the Natura 2000 sites from the
& Rakosy, 2007; Stoate et al., 2009). island of Crete. Mucher et al. (2009) models the
• The conversion of agricultural fields into spatial distribution of 27 Natura 2000 habitats in
dissimilar agricultural systems (as an ex- Europe, identifying, at a continental scale, their
ample, from perennial crops into annual) specific elements. Bock et al. (2005) uses the GIS
or other categories (particularly forestry, techniques in delimiting representative indicators
man-made permanent structures or surfac- for monitoring biological diversity.
es, and similar categories) demonstrates a Amongst the notable data sources that may
serious risk to the biodiversity in EU-27, be used in analysing the relationship agriculture-
as these changes suggest the destruction of Natura 2000 network, using GIS techniques, the
natural habitats (Stoate et al., 2009). This following are well-known:
demands sound recognition of the species’
distribution in the area and their key habitat 1. EUNIS (http://eunis.eea.europa.eu): EUNIS
requirements, which can be realised by us- comprises a database created and maintained
ing GIS techniques (Feranec et al., 2010). by the European Topic Centre on Biological
• The development of ineffective utiliza- Diversity for the European Environmental
tion practices represent a risk difficult to Agency and European Environmental
formulate without integrating prior experi- Information Observation Network, whose
ences concerning spatial databases, which principal intent is to facilitate access to
would then assist in the means for decision- information for environmental reports and
making. Ineffective practices involve any for the creation of the Natura 2000 network.
manner of intervention affecting the stabil- The database facilitates the extraction of data
ity of natural ecosystems (as an example, such as the coherent description of species,
the premature mowing of grassland, mech- habitats and sites from the Natura 2000 net-
anized intervention, etc.) (Kallimanis et work, a description of species, habitats, sites
al., 2008; Muller, 2002; Piorr et al., 2009). present in international conventions relevant
• The conflicts with farmers represent a for biodiversity, and offers up-to-date data
major issue amplified by the inclusion of about EEA activity in this field. EUNIS pro-
agricultural fields in the Natura 2000 net- vides information for over 275,000 European
work. Interdictions and restrictions that taxa. The degree of derived detail varies,
resulted from those inclusions amplified from taxonomic information to populations
territorial conflicts, particularly in regions and distribution. The database provides com-

1280
Agriculture and Conservation in the Natura 2000 Network

prehensive information regarding various Centre on Land Use and Spatial Information
habitats (natural or artificial, terrestrial or that allows for the spatial analysis of land
aquatic) included in the EUNIS classifica- cover change throughout the European
tion and in the Annex I Habitats of the EU Union territory. This process describes
Habitats Directive. EUNIS also provides how, from a quantitative and qualitative
information regarding the protected areas viewpoint, the ecosystems change over time.
at the European Union level: Natura 2000 To perform such an analysis, three online
sites, nationally designated protected areas, databases are utilized:
and internationally designated protected ◦◦ Land Cover Accounts (LEAC) based
areas – European Nature Diploma Areas on CORINE land cover changes da-
and Biogenetic Reserves. tabase. This database contains the
2. Corine (Coordination of Information on information on changes provided by
the Environment): Corine Land Cover CLC2000 and CLC Changes 1990-
(CLC) is the first European wide compila- 2000. This database is available for
tion of land cover information. The CLC Croatia, Bulgaria, Romania, Austria,
project is currently implemented in all Belgium, Czech Republic, Denmark,
EU countries, and in Central and Eastern Estonia, France, Germany, Greece,
European non-member countries (Belarus, Hungary, Italy, Ireland, Latvia,
Ukraine, Moldova, etc.). The first compi- Lithuania, Luxembourg, Netherlands,
lation reflects the situation as it existed in Poland, Portugal, Slovakia, Slovenia,
1990 (CLC1990), with the next two decades Spain, and the United Kingdom.
being updated and improved (CLC2000 ◦◦ PHARE 1975 – 1990 database. A
and CLC2006). The compilations resulted database that contains information
in several versions that eliminated the pro- on changes identified using two time
liferation of errors (geometric and thematic horizons (late 1970’s – early 1990’s)
mistakes). The scale (1:100,000), minimum at the second level of CLC nomen-
mapping unit (25 ha), and minimum width clature, covering Bulgaria, Hungary,
of linear elements (100 meters) for CLC Romania, Slovakia and the Czech
mapping, represents parameters common Republic.
for CLC1990, CLC2000, and subsequently ◦◦ LaCoast 1975 - 1990 - Eastern Europe
CLC2006. The standard CLC nomenclature Extension. LaCoast aims to quan-
includes 44 land cover classes, grouped into tify the changes of land cover types
three-levels, wherein the main level consists in a 10 km land strip inland from a
of: 1) artificial surfaces, 2) agricultural coastline. LaCoast uses CLC1990 as
areas, 3) forests and semi-natural areas, 4) its reference dataset and tracks dif-
wetlands, and 5) water bodies. CLC 2000 ferences of land cover changes using
produced a new dataset whose objective LANDSAT satellite images from the
was to detect land cover changes (LCC) in mid-1970s. The dataset will be avail-
Europe occurring between 1990 and 2000, able for Belgium, Denmark, France,
and 2000 and 2006 compilations. The LCC Germany, Greece, Ireland, Italy,
datasets aim to identify changes bigger than Netherlands, Portugal, and portions
5 ha. of Spain. An extension for Bulgaria,
3. Land and Ecosystem Accounting (LEAC): Estonia, Latvia, Lithuania, Poland,
is a means provided by the European Topic Slovenia, and Romania is avail-

1281
Agriculture and Conservation in the Natura 2000 Network

able (LaCoast 1975 - 1990 - Eastern est percentages are in countries such as United
Europe Extension). Kingdom (7.1%) and Denmark (8.9%).
4. EURURALIS (http://www.eururalis.eu): The area of SCI’s (Sites of Community Impor-
EURURALIS is a study consisting of four tance) varies from 6.8% in the United Kingdom to
distinct world scenarios (Global Economy, 31.4% in Slovenia, the average value being 12.7%
Global Cooperation, Continental Markets, (±6.02%). New EU members have percentages
and Regional Communities). It has a time of their national territory devoted to SCI of over
horizon of three decades, in 10-year time 10%, with lowest values in the Czech Republic
steps: 2000-2010, 2010-2020, 2020-2030, (10%), Poland (11%), Latvia (11.3%), Cyprus
and encompasses EU27 countries. The (11.5%), Slovakia (11.7%), with the highest in
projects have two objectives: to provide a Slovenia (31.4%), Bulgaria (29.6%), and Spain
means for a structured strategic discussion (24.5%). In countries with a high economic level,
at a European scale, and to develop and/or the percent of SCI’s is significantly lower, being
improve EURURALIS, by which a scientifi- generally under 10% of their national territory.
cally sound framework for integrated assess- Numerically, the largest number of SCI’s ap-
ment at the European level may be realized. pears in Germany (4622) and Sweden (3983),
The results should enable meaningful nego- and the lowest in smaller countries (Cyprus,
tiations among policy makers, stakeholders Luxemburg, Malta). This represents an important
and scientists from various domains and/ characteristic, particularly for understanding the
or countries. The results of EURURALIS management effort for these protected areas at
should potentially summarize what may the national level, as they have also a notable
occur in rural Europe towards 2030, based conservative value (Gaston et al., 2008).
on conditions that differ in nature, course, Some countries prefer having a greater number
duration, and/or place. of SCI’s, with smaller areas distributed on a larger
territory (e.g. Germany, the Czech Republic, Italy,
Sweden, Slovakia), while other countries choose
NATURA 2000: AGRICULTURE SCI’s with larger surfaces, in many cases concen-
RELATION USING GIS trated in regions with lower population densities
(Portugal, Bulgaria, Greece, and Romania).
Natura 2000 National Distribution In Romania, the average surface area of an
SCI is 120.3 km2. The highest concentration of
The current dimension of the Natura 2000 net- these SCI’s is found in the Carpathian Mountains
work is extraordinary: 22,419 sites of community and the Danube Delta areas, where the highest
importance covering 716,992 km2 (13.6% of the preservation degree is recorded (Ioja, et al., 2010).
continental surface), and 5242 special protected Thus, in a 0 m (delta area) and over 800 m areas
areas, covering 574,819 km2 (11.1% of the con- (Carpathian Mountain area), the protection of the
tinental surface) (European Commission, 2009). territory by the Natura 2000 sites exceeds 35%.
The network is not distributed homogenously Lowest values (below 20%) occur at altitudes
in member states, their percentage of national ter- below 600 m, where the suitability for human
ritory varying from the average value of 17.4%. activity is much higher (Rey, Groza, Ianoş &
Thus, at a national level, the largest percentage Patroescu, 2007) (Figure 1).
of fields managed in a regime of NATURA 2000 The distribution of SPA’s (Special Protected
site appear in Slovenia (35.5%), Bulgaria (34%), Areas) is dependent of the existence of avian
Slovakia (28.9%), and Spain (20.7%). The small- preferred habitats, such as wetlands, forestry areas,

1282
Agriculture and Conservation in the Natura 2000 Network

Figure 1. Elevation distribution of NATURA 2000 sites in Romania

and/or widespread agricultural fields (Table 1). In Romania, with regards to species classes,
The percent of SPA’s of the continental surface it was observed that most of the problems occur
of EU-27 is 11.9%, ranging from 2.9% of the within invertebrates and plants, where 40.9%
national territory in Ireland to 25.2% in Slovakia. protected invertebrate species, respectively 17.9%
The smallest percentages (under 10%) are re- protected plant species do not benefit from protec-
corded in smaller countries (Malta, Cyprus, tion inside a SCI (Ioja et al., 2010; Patroescu, Ioja,
Luxemburg) or those with a high percentage of Patroescu-Klotz & Necsuliu, 2006) (Figure 2).
human altered habitats (Ireland, France). The This is of considerable importance in the network
highest percentage occurs in countries that also expansion process or in its re-evaluation (European
have large areas devoted to SCI’s, respectively Commission, 2009; Papp & Tóth, 2007).
Slovakia (25.1%), Slovenia (23%), Bulgaria With the use of GIS techniques, the distribution
(20.4%), and Spain (20.6%). Regarding the dis- of Natura 2000 sites may be observed in the nine
tribution of SPA’s average area at a national biogeographic regions of the European Union
level, it is observed that the lowest values occur (EEA, 2002), resulting in important information
in Malta (1.2 km2), Luxemburg (11.2 km2), Bel- in estimating their degree of conservation. Thus,
gium (14 km2), with the highest in Poland (391 the best inclusion rate in Natura 2000 sites is
km2), Slovakia (322 km2), and Romania (292.7 recorded in the Macaronesian and Black Sea
km2) (Table 1). bioregions (100%), with the lowest inclusion rates
Using GIS techniques, the sufficiency degree in the steppic, pannonian and continental biore-
in comprising natural habitats and species of gions (under 90%), the latter supporting the most
community interest within the Natura 2000 net- intense agricultural activities (Feranec et al., 2010;
work is revealed. Thus, at a national level, the Mucher et al., 2009).
40%-60% rule (Papp & Tóth, 2007) is embraced Furthermore, to judge management effective-
by five countries (Denmark, Netherland, Italia, ness it is important to understand the overlapping
Belgium, and Finland). Therefore, values under levels of SCI’s, SPA’s, and the protected areas of
100% assume a new expansion of the Natura 2000 national interest. The overlapping of SCI’s and
network at a national level, with impacts on land SPA’s varies in EU-27 from 0% (Latvia) up to
use dynamics. 52.8% (Slovenia). In many countries, overlaps

1283
Agriculture and Conservation in the Natura 2000 Network

Table 1. NATURA 2000 sites distribution in relation to agriculture in EU-27

Average Average
% NATURA SCIs surfaces surface per % SCI SPA surfaces surfaces per % SPA
Country 2000 areas of NT km2 SCI km2 of NT km2 SPA km2 of NT
Austria 13,8 8978 53,4 10,7 9869 102,8 11,8
Belgium 12,7 3269 11,7 10,1 3282 14,0 9,7
Bulgaria 34 33430 146,6 29,6 23217 203,7 20,4
Cyprus 17,5 711 19,8 11,5 788 112,6 13,4
Czech 13,9 7854 7,3 10 9684 248,3 12,3
Deutschland 15,4 54342 11,8 9,7 59784 81,0 12,2
Denmark 8,9 19319 74,0 7,4 14718 130,2 5,9
Estonia 17,7 11321 21,3 16,7 12592 190,8 13,5
Spain 28,7 131434 90,8 24,5 105032 175,3 20,6
Finland 14,4 48552 28,3 12,7 30838 65,9 7,5
France 12,5 72418 53,0 8,5 76297 200,3 7,9
Greece 21 27798 116,3 16,3 16740 102,7 11,9
Hungary 21,1 13973 29,9 15 13512 245,7 14,5
Ireland 11,1 13558 32,0 10,7 2827 21,6 2,9
Italy 19,1 45309 19,8 14,3 43777 73,3 13,6
Lithuania 13,9 8822 31,6 13,2 6013 74,2 9
Luxemburg 17,8 399 8,3 15,4 145 11,2 5,6
Latvia 11,3 7856 24,2 11,3 6999 73,7 10
Malta 13 50 1,8 13,3 16 1,2 5,1
Netherland 13,8 14342 98,2 8,4 10125 131,5 12,6
Poland 19,4 38003 46,2 11 55228 391,7 15,6
Portugal 20,9 16788 174,9 17,4 10438 176,9 10,7
Romania 17,9 32833 120,3 13,2 31905 292,7 13,4
Sweden 13,8 64468 16,2 13,7 29874 56,3 6,2
Slovenia 35,5 6360 24,6 31,4 4656 172,4 23
Slovakia 28,9 5739 15,0 11,7 12236 322,0 25,1
United Kingdom 7,1 29066 46,7 6,8 16114 62,7 6,2
Total/ Average 17,6 716992 32,0 12,7 606706 115,7 11,9

of SCI’s and SPA’s occur on over 30% of their Dynamics of Agriculture Areas
total area (Belgium, Bulgaria, Slovenia, Romania, in Natura 2000 Sites in EU
United Kingdom, and Denmark), however, in
many circumstances, this is not total overlapping. Conservation activities are in direct competition
Thus, areas benefiting from several simultaneous with activities for territorial resources utilization.
conservation status (protected areas of national In Natura 2000 sites, all activities or actions that
interest, SCI, SPA), are subject to difficulties in may affect the conservation state of habitats and
their administration, for the management of hu- species of community interest for which they were
man activities is inherently dissimilar in different designated are restricted or prohibited (Dimitra-
categories (Ioja et al., 2010). kopoulos et al., 2004; Mauerhofer, 2010).

1284
Agriculture and Conservation in the Natura 2000 Network

Figure 2. Representation degree of group species in the Romania SCIs

CORINE Land Cover and EUNIS databases, manual grasslands mowing, pasturing, extensive
with the aid of the ArcGis Desktop 9.3, were used plants culture, sustainable exploitation of natural
to determine the overlapping degrees of agricul- resources (wood, forest fruits, mushrooms, fish-
tural fields in Natura 2000 sites. For identifying ing, or hunting) (Bladt et al., 2009; Gaston et al.,
the dynamics of agricultural fields between 2000 2008; Phillips, 1994), any protected area imposes
and 2006, CORINE Land Cover database for the narrow to broad restrictions in resources utilization
respective years was employed, observing the (Cogalniceanu & Cogalniceanu, 2010).
evolution for the Natura 2000 sites. For that reason, Natura 2000 sites are areas in
Agricultural fields with high natural conser- which most of the technologies for intensive
vation values represent 32% of the agricultural agricultural utilization are incompatible with
areas of EU-27, with reduced percentages in conservation objectives (mechanization, the use
Baltic countries and Benelux, and with higher of fertilisers and pesticides, pest species control,
percentages in Mediterranean countries, Austria, genetic modified organism) (Petrovan, Barrio,
Bulgaria, and Finland. Ward & Wheeler, 2010; Plieninger et al., 2006;
It is essential to understand the dynamics of Pullin et al., 2009).
agricultural activities in order to administer Natura Within EU-27, 15.8% of arable land and 22.8%
2000 sites that comprise 9.2% of the agricultural of pastures and grasslands are utilized in an inten-
areas of EU-27 (from 0.9% in Finland to 22.3% in sive regimen, and therefore it is important to rec-
Slovenia) that were being integrated into Natura ognize their overlapping with currently designated
2000 sites (Table 2). The greatest percentage of Natura 2000 sites. In the European Union, there
agricultural fields in Natura 2000 sites are located are three types of agricultural utilization systems
in Mediterranean countries and in some countries recognized as compatible with Natura 2000 sites
from the last enjoinment stages (Bulgaria, Slo- (EEA, 2004, 2005): type 1 – with a high percent-
venia, and Slovakia). Even if the purpose of the age of semi-natural vegetation; type 2 – farms
NATURA 2000 network is not that of creating dominated by lowered intensity agriculture or the
wildlife sanctuaries, allowing economic activities existence of natural-cultivate mosaics, and type
that do not convey perceivable disadvantages, 3 – farms containing rare species.
or even assisting preserved elements (such as

1285
Agriculture and Conservation in the Natura 2000 Network

Table 2. Natura 2000 sites distribution in relation to agriculture in EU-27

Agricultural % agriculture Evolution of agriculture


% agriculture surfaces in land from land in NATURA 2000
land in % HVAF of NATURA 2000 NATURA 2000 sites between 2000 and
Country NATURA 2000 agricultural land km2 surfaces 2006
Austria 11,4 68 4079,6 35,4 No data
Belgia 7,3 19 1304,5 33,6 0,01
Bulgaria 22 37 14815,3 39,3 0,032
Cyprus 1,5 33 95,6 9,5 0,028
Czech 6,4 21 3168,6 28,8 0
Deutchland 9,1 15 19662,7 35,8 No data
Denmark 4,9 5 1688,6 44,0 -0,013
Estonia 5,5 22 932,7 11,6 +0.011
Spain 15,5 56 52760,3 36,5 No data
Finland 0,9 45 267,0 0,5 No data
France 8,2 22 28955,7 42,1 -0,011
Greece 14,2 59 12953,6 46,7 No data
Hungary 14,5 28 9893,2 50,3 +0.132
Ireland 3,1 20 1791,0 23,0 +0.130
Italy 10,3 33 18910,4 32,9 No data
Lithuania 3,8 15 1580,7 17,4 -0.033
Luxemburg 11 9 156,9 34,0 -0.013
Latvia 6,6 20 1883,4 25,9 0
Malta 5,8 no data no data no data 0.026
Netherland 4,6 14 1206,0 21,0 -0.18
Poland 10,7 24 21648,1 35,6 +0.003
Portugal 18,5 58 9316,4 48,4 +0.060
Romania 9,6 34 13856,6 32,5 +0.013
Sweden 4,3 24 2046,7 3,6 No data
Slovenia 22,3 78 1682,0 23,4 -0.064
Slovakia 16,3 22 4051,3 28,7 -0.353
United Kingdom 3 27 5810,5 33,6 No data
Total/Average 9,23 32 215619 28,6 -0.012

The importance of agricultural fields is ampli- management of these areas is made difficult, with
fied by the reality that 28.6% of the area of Natura agricultural sources framed as diffuse environ-
2000 sites is represented by agricultural fields, mental degradations and therefore, challenging
with the lowest values being recorded in Nordic to control and monitor (Stoate et al., 2009).
countries (under 5%), and the highest in Mediter- Additionally, by analysing the prevalence of
ranean countries or in those regions containing a Natura 2000 sites upon agricultural activities, a
high percentage of plains and low hills (Hungary, decreasing tendency is observed in the 2000-2006
Romania, Denmark, and Poland) (Table 2). The period, especially in countries in which the sites

1286
Agriculture and Conservation in the Natura 2000 Network

were established prior to 2006. Thus, in countries land), while countries with lowered development
such as Denmark, Netherlands, or France, reduc- do not portray the capacity of absorbing these
tion of agricultural areas within Natura 2000 sites funds (Bladt et al., 2009).
is already perceivable and forewarns of social The large numbers of species that typically
and economical incidences that may be deter- benefit from a strict preservation regime, coupled
mined by models such as land use, principally with the tendency of covering large areas, predict-
those without economic support (Mauerhofer, ably mystify management objectives and the huge
2010; Rauschmayer et al., 2009). In exchange, funds necessary for management actions (James
countries in which the rehabilitation of traditional et al., 2001). It is problematic to create a Natura
agricultural landscapes present an advantage for 2000 network with projected low probability of
conservation activities from Natura 2000 sites enforcement (Pullin et al., 2009) without a spatial
(Portugal) or countries demonstrating a higher extend reassessment based on the principles of
development of agriculture (Ireland, Hungary), spatial planning for conservation (Araujo et al.,
present expansion of agricultural areas (European 2007; Ioja et al., 2010; Wilson et al., 2009).
Commission, 2009) (Table 2). The efficiency of protected areas of European
regions should be linked to the design of protected
Solutions and Recommendations areas of controllable dimensions, which bestow
favourable habitats, principally for vulnerable,
The European Union’s policy in the field of endangered, and critically endangered plant and
biodiversity is in many aspects inordinate, af- animal species, and with mechanisms through by
fecting the already fragile networks of protected which actions affecting natural ecosystems are
areas, principally in new member states (Ioja et kept away from protected areas (Ioja et al., 2010).
al., 2010). The orientation towards social and Stimulating eco-labelling and the development
economic aspects of the management methods of a bio-agricultural product’s market is an ob-
does nothing more than sidetrack attention from jective that may be promoted in the Natura 2000
the priority objective of protected areas, to that sites, although the economic efficiency of these
of biodiversity conservation (Dimitrakopoulos et activities is uncertain (Bladt et al., 2009; EEA,
al., 2004; Rauschmayer et al., 2009). More to the 2004; Piorr et al., 2009; Plieninger et al., 2006;
point, the extensive development of the protected Stoate et al., 2009). Member states should have
area network does not contribute to alleviating greater freedom in prioritizing protected areas and
social and economic problems of disadvantaged in choosing the types of protected areas they can
communities, generating at their level strong efficiently manage (Rauschmayer et al., 2009;
regional tensions through imposed restriction Rozylowicz, L., Popescu, V. D., Patroescu, M.,
and the lack of financial compensation (Anthon & Chisamera, G., 2011).
et al., 2010; Mauerhofer, 2010; Rauschmayer et
al., 2009). The mechanism that offers, just for
conservation, financial compensation, although FUTURE RESEARCH DIRECTIONS
generous (200-450 €/ha/year) (Anthon et al., 2010;
Piorr et al., 2009), it is relatively difficult to ac- In order to benefit from the advantages of the Na-
cess, especially for disadvantaged communities; tura 2000 network, and for establishing a proper
where most of these problems converge (Young management organization, significant financial
et al., 2007). Thus, most financial compensations resources are required, estimated at 6.1 billion €
are directed to countries with lowered biodiversity each year (Bladt et al., 2009; WG, 2002), compa-
but a strong economy (France, Germany, and Hol- rable with those spent currently at a global level

1287
Agriculture and Conservation in the Natura 2000 Network

(Balmford, Gaston, Blyth, James & Kapos, 2003; in promoting agricultural activities accentuate
James et al., 2001). These financial resources are much of the tensions present in local communi-
necessary for the management process, but prin- ties, where important conflicts currently subsist
cipally for offering compensations and financial (i.e. Germany). Natura 2000 is an objective of the
incentives to landowners or other entities for the European Union, being defined as an ambitious
restrictions imposed on them (Anthon et al., 2010). conservation program, a position that shapes
The assignment of any compensation and/or negative-national conservation politics.
stimulant-necessitates sound awareness of field Therefore, GIS techniques must represent the
realities, respectively - the productivity of the tool by which the efficiency of this ecological
land, the presence of threatened habitats and spe- network is monitored, as it must be permanently
cies, losses recorded by various resource users, nourished with important financial resources.
etc. In this objective, GIS methods should have
an important role, as they permit correlations
of multiple parameters needed in the process of ACKNOWLEDGMENT
Natura 2000 sites management.
Spatial databases regarding the Natura 2000 We would like to thank Edward F. Rozylowicz
network are in an evolutionary process, and the for proofreading and valuable comments and
diversity of future research is extremely high, anonymous reviewers for their suggestions.
especially as a well-managed network based on
quality data. In this objective, the monitoring of
agricultural fields dynamics in Natura 2000 sites is REFERENCES
paramount, specifically regarding their conversion
and the agricultural methods used. These assess- 2006/105/EC. (2006). Council Directive 2006/105/
ments should be correlated with the state of natural EC of 20 November 2006 adapting Directives
habitats and all species of community importance 73/239/EEC, 74/557/EEC and 2002/83/EC in the
for which a particular model of agricultural use field of environment, by reason of the accession
is applied. This is particularly important because of Bulgaria and Romania: Official Journal L 363,
this conservation status may suggest acceleration 20/12/2006.
in the utilization of regional resources. 79/409/EEC. (1979). Council Directive 79/409/
Special attention in the managing process EEC of 2 April 1979 on the conservation of wild
should be given to the monitoring of social and birds (Vol. 79/409/EEC): Official Journal L 103,
economic benefits of communities from Natura 25/04/1979.
2000 sites, essential in maintaining this manner
of resource use. 92/43/EEC. (1992). Council Directive 92/43/EEC
of 21 May 1992 on the conservation of natural
habitats and of wild fauna and flora (Vol. 92/43/
CONCLUSION EEC): Official Journal of E.U., L 206, 22.7.1992.
Anthon, S., Garcia, S., & Stenger, A. (2010).
The process of managing Natura 2000 sites is
Incentive contracts for Natura 2000 implementa-
at its beginning in EU-27. Scepticisms concern-
tion in forest areas. Environmental and Resource
ing the success of this politics of biodiversity
Economics, 46(3), 281–302..doi:10.1007/s10640-
social conservation are elevated, as the financial
009-9341-1
resources necessary for the management process
are lacking. Additionally, restrictions imposed

1288
Agriculture and Conservation in the Natura 2000 Network

Araujo, M. B., Lobo, J. M., & Moreno, J. C. (2007). Boisvert, V., & Vivien, F. D. (2005). The conven-
The effectiveness of Iberian protected areas in tion on biological diversity: A conventionalist ap-
conserving terrestrial biodiversity. Conservation proach. Ecological Economics, 53(4), 461–472..
Biology, 21(6), 1423–1432..doi:10.1111/j.1523- doi:10.1016/j.ecolecon.2004.06.030
1739.2007.00827.x
Boteva, D., Griffiths, G., & Dimopoulos, P. (2004).
Balmford, A., Bruner, A., Cooper, P., Costanza, Evaluation and mapping of the conservation sig-
R., Farber, S., & Green, R. E. (2002). Ecology nificance of habitats using GIS: An example from
- economic reasons for conserving wild nature. Crete, Greece. Journal for Nature Conservation,
Science, 297(5583), 950–953..doi:10.1126/sci- 12(4), 237–250..doi:10.1016/j.jnc.2004.09.002
ence.1073947
Bruner, A. G., Gullison, R. E., & Balmford, A.
Balmford, A., Gaston, K. J., Blyth, S., James, A., (2004). Financial costs and shortfalls of managing
& Kapos, V. (2003). Global variation in terrestrial and expanding protected-area systems in develop-
conservation costs, conservation benefits, and ing countries. Bioscience, 54(12), 1119–1126..
unmet conservation needs. Proceedings of the doi:10.1641/0006-3568(2004)054[1119:FCAS
National Academy of Sciences of the United States OM]2.0.CO;2
of America, 100(3), 1046–1050..doi:10.1073/
Chiarucci, A., Bacaro, G., & Rocchini, D. (2008).
pnas.0236945100
Quantifying plant species diversity in a Natura
Baur, B., Cremene, C., Groza, G., Rakosy, L., 2000 network: Old ideas and new proposals.
Schileyko, A. A., & Baur, A. (2006). Effects of Biological Conservation, 141(10), 2608–2618..
abandonment of subalpine hay meadows on plant doi:10.1016/j.biocon.2008.07.024
and invertebrate diversity in Transylvania, Roma-
Cogalniceanu, D., & Cogalniceanu, G. C. (2010).
nia. Biological Conservation, 132(2), 261–273..
An enlarged European Union challenges prior-
doi:10.1016/j.biocon.2006.04.018
ity settings in conservation. Biodiversity and
Berentsen, P. B. M., Hendriksen, A., Heijman, W. Conservation, 19(5), 1471–1483..doi:10.1007/
J. M., & van Vlokhoven, H. A. (2007). Costs and s10531-010-9777-1
benefits of on-farm nature conservation. Ecologi-
Dimitrakopoulos, P. G., Memtsas, D., & Troumbis,
cal Economics, 62(3-4), 571–579..doi:10.1016/j.
A. Y. (2004). Questioning the effectiveness of the
ecolecon.2006.07.026
Natura 2000 special areas of conservation strategy:
Bladt, J., Strange, N., Abildtrup, J., Svenning, J. The case of Crete. Global Ecology and Bioge-
C., & Skov, F. (2009). Conservation efficiency of ography, 13(3), 199–207..doi:10.1111/j.1466-
geopolitical coordination in the EU. Journal for 822X.2004.00086.x
Nature Conservation, 17(2), 72–86..doi:10.1016/j.
EEA. (2002). Europe’s biodiversity—biogeo-
jnc.2008.12.003
graphical regions and seas (Vol. 1). Copenhagen,
Bock, M., Rossner, G., Wissen, M., Remm, K., Denmark: European Environmental Agency.
Langanke, T., & Lang, S. (2005). Spatial indicators
EEA. (2004). High nature value farmland - char-
for nature conservation from European to local
acteristics, trends and policy challenges (Vol. 1).
scale. Ecological Indicators, 5(4), 322–338..
Copenhagen, Denmark: European Environment
doi:10.1016/j.ecolind.2005.03.018
Agency.

1289
Agriculture and Conservation in the Natura 2000 Network

EEA. (2005). Agriculture and environment in Hockings, M. (2003). Systems for assessing the
EU-15—the IRENA indicator report (Vol. 6). effectiveness of management in protected areas.
Copenhagen, Denmark: European Environmental Bioscience, 53(9), 823–832..doi:10.1641/0006-
Agency. 3568(2003)053[0823:SFATEO]2.0.CO;2
EEA. (2009). Progress towards the European Ioja, C. I., Patroescu, M., Rozylowicz, L., Popescu,
2010 biodiversity target (Vol. 4). Copenhagen, D. V., Vergheleţ, M., & Zotta, M. I. (2010). The
Denmark: European Environmental Agency. efficacy of Romania’s protected areas network
in conserving biodiversity. Biological Con-
European Commission. (2009). Natura 2000 ba-
servation, 143(11), 2468–2476..doi:10.1016/j.
rometer - December 2008. Natura 2000. European
biocon.2010.06.013
Commission DG ENV Nature Newletter, 8-9.
James, A., Gaston, K. J., & Balmford, A.
Feranec, J., Jaffrain, G., Soukup, T., & Hazeu,
(2001). Can we afford to conserve biodiversity?
G. (2010). Determining changes and flows in
Bioscience, 51(1), 43–52..doi:10.1641/0006-
European landscapes 1990-2000 using CORINE
3568(2001)051[0043:CWATCB]2.0.CO;2
land cover data. Applied Geography (Seveno-
aks, England), 30(1), 19–35..doi:10.1016/j.ap- Kallimanis, A. S., Tsiafouli, M. A., Pantis, J. D.,
geog.2009.07.003 Mazaris, A. D., Matsinos, Y., & Sgardelis, S. P.
(2008). Arable land and habitat diversity in Na-
Fontaine, B., Bouchet, P., Van Achterberg, K.,
tura 2000 sites in Greece. Journal of Biological
Alonso-Zarazaga, M. A., Araujo, R., & Asche,
Research-Thessaloniki, 9, 55–66.
M. (2007). The European Union’s 2010 target:
Putting rare species in focus. Biological Con- Kuemmerle, T., Muller, D., Griffiths, P., & Rusu,
servation, 139(1-2), 167–185..doi:10.1016/j. M. (2009). Land use change in Southern Roma-
biocon.2007.06.012 nia after the collapse of socialism. Regional
Environmental Change, 9(1), 1–12..doi:10.1007/
Gaston, K. J., Jackson, S. E., Nagy, A., Cantu-
s10113-008-0050-z
Salazar, L., & Johnson, M. (2008). Protected
areas in Europe - principle and practice. Annals Mace, G. M., & Baillie, J. E. M. (2007). The 2010
of the New York Academy of Sciences, 1134(1), biodiversity indicators: Challenges for science and
97–119..doi:10.1196/annals.1439.006 policy. Conservation Biology, 21(6), 1406–1413..
doi:10.1111/j.1523-1739.2007.00830.x
Hartel, T., Schweiger, O., Öllerer, K., Cogalnicea-
nu, D., & Arntzen, J. W. (2010). Amphibian distri- Maiorano, L., Falcucci, A., Garton, E. O., &
bution in a traditionally managed rural landscape Boitani, L. (2007). Contribution of the Natura
of Eastern Europe: Probing the effect of landscape 2000 network to biodiversity conservation in
composition. Biological Conservation, 143(5), Italy. Conservation Biology, 21(6), 1433–1444..
1118–1124..doi:10.1016/j.biocon.2010.02.006 doi:10.1111/j.1523-1739.2007.00831.x
Henle, K., Alard, D., Clitherow, J., Cobb, P., Margules, C. R., & Pressey, R. L. (2000). System-
Firbank, L., & Kull, T. (2008). Identifying and atic conservation planning. Nature, 405(6783),
managing the conflicts between agriculture and 243–253. doi:10.1038/35012251
biodiversity conservation in Europe - a review.
Agriculture Ecosystems & Environment, 124(1-2),
60–71..doi:10.1016/j.agee.2007.09.005

1290
Agriculture and Conservation in the Natura 2000 Network

Mauerhofer, V. (2010). Missing links: How indi- Phillips, A. (1994). The IUCN action plan for
viduals can contribute to reserve policy enforce- protected areas in Europe. Conserving Europe’s
ment on the example of the European Union. Natural Heritage, 69-73.
Biodiversity and Conservation, 19(3), 601–618..
Piorr, A., Ungaro, F., Ciancaglini, A., Happe, K.,
doi:10.1007/s10531-009-9737-9
Sahrbacher, A., & Sattler, C. (2009). Integrated
Moilanen, A., Arponen, A., Stokland, J. N., & assessment of future CAP policies: Land use
Cabeza, M. (2009). Assessing replacement cost of changes, spatial patterns and targeting. Envi-
conservation areas: How does habitat loss influ- ronmental Science & Policy, 12(8), 1122–1136..
ence priorities? Biological Conservation, 142(3), doi:10.1016/j.envsci.2009.01.001
575–585..doi:10.1016/j.biocon.2008.11.011
Plieninger, T., Hochtl, F., & Spek, T. (2006).
Mucher, C. A., Hennekens, S. M., Bunce, R. G. Traditional land-use and nature conservation
H., Schaminee, J. H. J., & Schaepman, M. E. in European rural landscapes. Environmental
(2009). Modelling the spatial distribution of Na- Science & Policy, 9(4), 317–321..doi:10.1016/j.
tura 2000 habitats across Europe. Landscape and envsci.2006.03.001
Urban Planning, 92(2), 148–159..doi:10.1016/j.
Primack, R. B., Patroescu, M., Rozylowicz, L.,
landurbplan.2009.04.003
& Ioja, C. (2008). Fundamentele conservării
Muller, S. (2002). Appropriate agricultural man- diversităţii biologice. Bucureşti, Romania: AGIR.
agement practices required to ensure conservation
Pullin, A. S., Baldi, A., Can, O. E., Dieterich,
and biodiversity of environmentally sensitive
M., Kati, V., & Livoreil, B. (2009). Conserva-
grassland sites designated under Natura 2000.
tion focus on Europe: Major conservation policy
Agriculture Ecosystems & Environment, 89(3),
issues that need to be informed by conservation
261-266. doi: S0167-8809(01)00235-3
science. Conservation Biology, 23(4), 818–824..
Papp, D., & Tóth, C. (2007). Natura 2000 site doi:10.1111/j.1523-1739.2009.01283.x
designation process with a special focus on the
Pykala, J. (2003). Effects of restoration with
biogeographic seminars (2nd ed., p. 36). Budapest,
cattle grazing on plant species composition
Hungary: CEEWEB Office.
and richness of semi-natural grasslands. Biodi-
Patroescu, M., Ioja, I. C., Patroescu-Klotz, I., & versity and Conservation, 12(11), 2211–2226.
Necsuliu, R. (2006). Umweltqualitat in Rumanien. doi:10.1023/A:1024558617080
In Kahl, T., Metzeltin, M., & Ungureanu, R. (Eds.),
Rauschmayer, F., van den Hove, S., & Koetz, T.
Rumänien. Raum und Bevölkerung. Geschichte
(2009). Participation in EU biodiversity gover-
und Geschichtsbilder.Kultur. Gesellschaft und
nance: How far beyond rhetoric? Environment
Politik heute. Wirtschaft. Recht undVerfassung.
and Planning. C, Government & Policy, 27(1),
Historische Regionen (p. 984). Münster/Hamburg/
42–58..doi:10.1068/c0703j
Berlin/Wien, Germany; London, UK; Zürich,
Switzerland: LIT Verlag. Rey, V., Groza, O., Ianoş, I., & Patroescu, M.
(2007). Atlas de la Roumanie. Montpelier/Paris,
Petrovan, S. O., Barrio, I. C., Ward, A. I., &
France: Reclus.
Wheeler, P. M. (2010). Farming for pests? Local
and landscape-scale effects of grassland manage-
ment on rabbit densities. European Journal of
Wildlife Research, 57(1), 27–34..doi:10.1007/
s10344-010-0394-9

1291
Agriculture and Conservation in the Natura 2000 Network

Rozylowicz, L., & Dobre, M. (2010). Assessing Wilson, K. A., Cabeza, M., & Klein, C. J. (2009).
the threatened status of Testudo hermanni boettgeri Fundamental concepts of spatial conservation
Mojsisovics, 1889 (Reptilia: Testudines: Testudi- prioritization. In Moilanen, A., Wilson, K. A., &
nidae) population from Romania. North-Western Possingham, H. P. (Eds.), Spatial conservation
Journal of Zoology, 6(2), 190–202. prioritization. Quantitative methods & computa-
tional tools (pp. 16–28). New York, NY: Oxford
Rozylowicz, L., Popescu, V. D., Patroescu, M.,
University Press.
& Chisamera, G. (2011). The potential of large
carnivores as conservation surrogates in the Roma- Young, J., Richards, C., Fischer, A., Halada, L.,
nian Carpathians. Biodiversity and Conservation, Kull, T., & Kuzniar, A. (2007). Conflicts between
20(3), 561–579..doi:10.1007/s10531-010-9967-x biodiversity conservation and human activities
in the central and eastern European countries.
Ruprecht, E., Enyedi, M. Z., Eckstein, R. L., &
Ambio, 36(7), 545–550. doi:10.1579/0044-
Donath, T. W. (2010). Restorative removal of plant
7447(2007)36[545:CBBCAH]2.0.CO;2
litter and vegetation 40 years after abandonment
enhances re-emergence of steppe grassland vegeta-
tion. Biological Conservation, 143(2), 449–456..
ADDITIONAL READING
doi:10.1016/j.biocon.2009.11.012
Schmitt, T., & Rakosy, L. (2007). Changes of Baylis, K., Peplow, S., Rausser, G., & Simon, L.
traditional agrarian landscapes and their conser- (2008). Agri-environmental policies in the EU
vation implications: A case study of butterflies in and United States: A comparison. Ecological
Romania. Diversity & Distributions, 13, 855–862.. Economics, 65(4), 753–764..doi:10.1016/j.eco-
doi:10.1111/j.1472-4642.2007.00347.x lecon.2007.07.034

Silva, C. N. (2009). Protected areas and regional Brooke, C. (2008). Conservation and Adapta-
development in Europe: Towards a new model tion to Climate Change. Conservation Biol-
for the 21st Century. Tijdschrift voor Econo- ogy, 22(6), 1471–1476..doi:10.1111/j.1523-
mische en Sociale Geografie, 100(1), 129–131.. 1739.2008.01031.x
doi:10.1111/j.1467-9663.2009.514_2.x Bunce, R. G. H., Metzger, M. J., Jongman, R. H.
Stoate, C., Baldi, A., Beja, P., Boatman, N. D., G., Brandt, J., De Blust, G., & Elena-Rossello, R.
Herzon, I., & van Doorn, A. (2009). Ecological (2008). A standardized procedure for surveillance
impacts of early 21st century agricultural change and monitoring European habitats and provision of
in Europe - a review. Journal of Environmental spatial data. Landscape Ecology, 23(1), 11–25..
Management, 91(1), 22–46..doi:10.1016/j.jenv- doi:10.1007/s10980-007-9173-8
man.2009.07.005 Costanza, R., dArge, R., deGroot, R., Farber, S.,
Stolon, S. (Ed.). (2008). Assessment of manage- Grasso, M., Hannon, B.,... vandenBelt, M. (1997).
ment effectiveness in European protected areas. The value of the world’s ecosystem services and
Sharing experiences and promoting good manage- natural capital. Nature, 387(6630), 253–260.
ment. Bonn, Germany: Bundesant für Naturschutz. doi:10.1038/387253a0

WG. (2002). Working Group on Article 8 of the Engel, S., Pagiola, S., & Wunder, S. (2008).
Habitats Directive. Final Report on Financing Designing payments for environmental services
Natura 2000. Retrieved from www.eeb.org/activi- in theory and practice: An overview of the is-
ties /biodiversity/Financing-Natura- 2000-WG- sues. Ecological Economics, 65(4), 663–674..
final-report-art8.pdf doi:10.1016/j.ecolecon.2008.03.011

1292
Agriculture and Conservation in the Natura 2000 Network

Garrod, G. (2009). Greening the CAP: how the Hynes, S., & Garvey, E. (2009). Modelling
improved design and implementation of agri- Farmers’ Participation in an Agri-environmental
environment schemes can enhance the delivery of Scheme using Panel Data: An Application to the
environmental benefits. Journal of Environmental Rural Environment Protection Scheme in Ire-
Planning and Management, 52(5), 571–574.. land. Journal of Agricultural Economics, 60(3),
doi:10.1080/09640560902958099 546–562..doi:10.1111/j.1477-9552.2009.00210.x
Gerard, F., Petit, S., Smith, G., Thomson, A., Keulartz, J. (2009). European Nature Conservation
Brown, N., & Manchester, S. (2010). Land and Restoration Policy-Problems and Perspec-
cover change in Europe between 1950 and 2000 tives. Restoration Ecology, 17(4), 446–450..
determined employing aerial photography. Prog- doi:10.1111/j.1526-100X.2009.00566.x
ress in Physical Geography, 34(2), 183–205..
Kleijn, D., Baquero, R. A., Clough, Y., Diaz, M.,
doi:10.1177/0309133309360141
De Esteban, J., & Fernandez, F. (2006). Mixed
Gibbs, J. P., Hunter, M. L. Jr, & Sterling, E. J. biodiversity benefits of agri-environment schemes
(Eds.). (2008). Problem-Solving in Conserva- in five European countries. Ecology Letters, 9(3),
tion Biology and Wildlife Management. Wiley- 243–254..doi:10.1111/j.1461-0248.2005.00869.x
Blackwell. doi:10.1002/9781444319576
Knickel, K., Kroger, M., Bruckmeier, K., &
Gohin, A. (2006). Assessing CAP reform: Sensi- Engwall, Y. (2009). The Challenge of Evaluating
tivity of modelling decoupled policies. Journal Policies for Promoting the Multifunctionality of
of Agricultural Economics, 57(3), 415–440. Agriculture: When ‘Good’ Questions Cannot
doi:10.1111/j.1477-9552.2006.00058.x be Addressed Quantitatively and ‘Quantitative
Answers are not that Good’. Journal of Envi-
Guzy, M. R., Smith, C. L., Bolte, J. P., Hulse, D.
ronmental Policy and Planning, 11(4), 347–367..
W., & Gregory, S. V. (2008). Policy Research
doi:10.1080/15239080903033945
Using Agent-Based Modeling to Assess Future
Impacts of Urban Expansion into Farmlands and Ladle, R. J., & Whittaker, R. J. (Eds.). (2011).
Forests. Ecology and Society, 13(1). Retrieved Conservation biogeography. Oxford: Wiley-
from: http://www.ecologyandsociety.org /vol13/ Blackwell.
iss1/art37/
Lobianco, A., & Esposti, R. (2010). The Re-
Haberl, H., Gaube, V., Diaz-Delgado, R., Krauze, gional Multi-Agent Simulator (RegMAS): An
K., Neuner, A., & Peterseil, J. (2009). Towards open-source spatially explicit model to assess
an integrated model of socioeconomic biodiver- the impact of agricultural policies. Computers
sity drivers, pressures and impacts. A feasibil- and Electronics in Agriculture, 72(1), 14–26..
ity study based on three European long-term doi:10.1016/j.compag.2010.02.006
socio-ecological research platforms. Ecological
Louette, G., Maes, D., Alkemade, J. R. M., Boitani,
Economics, 68(6), 1797–1812..doi:10.1016/j.
L., de Knegt, B., & Eggers, J. (2010). BioScore-
ecolecon.2008.11.013
Cost-effective assessment of policy impact on
Herzog, F. (2005). Agri-environment schemes biodiversity. Journal for Nature Conservation,
as landscape experiments - Preface. Agriculture 18(2), 142–148..doi:10.1016/j.jnc.2009.08.002
Ecosystems & Environment, 108(3), 175–177..
doi:10.1016/j.agee.2005.02.001

1293
Agriculture and Conservation in the Natura 2000 Network

Macedo-Sousa, J. A., Soares, A., & Tarazona, J. Pereira, J. M. C., & Duckstein, L. (1993). A
V. (2009). A conceptual model for assessing risks multiple criteria decision-making approach to gis-
in a Mediterranean Natura 2000 Network site. based land suitability evaluation. International
The Science of the Total Environment, 407(3), Journal of Geographical Information Systems,
1224–1231..doi:10.1016/j.scitotenv.2008.09.052 7(5), 407–424. doi:10.1080/02693799308901971
Maguire, D., Batty, M., & Goodchild, M. (Eds.). Piorr, A., Ungaro, F., Ciancaglini, A., Happe, K.,
(2005). GIS, Spatial Analysis, and Modeling. Sahrbacher, A., & Sattler, C. (2009). Integrated as-
Esri Press. sessment of future CAP policies: land use changes,
spatial patterns and targeting. Environmental Sci-
Manly, B. F. J. (2008). Statistics for Environmen-
ence & Policy, 12(8), 1122–1136..doi:10.1016/j.
tal Science and Management, Second Edition
envsci.2009.01.001
(Chapman & Hall/CRC Applied Environmental
Statistics): Chapman and Hall/CRC. Primack, R. B. (2010). Essentials of Conservation
Biology (5th ed.). Sinauer Associates, Inc.
Margules, C. R., & Sarkar, S. (2007). Systematic
conservation planning. Cambridge: Cambridge Primdahl, J., Vesterager, J. P., Finn, J. A., Vlahos,
University Press. G., Kristensen, L., & Vejre, H. (2010). Current use
of impact models for agri-environment schemes
Metzger, M. J., Bunce, R. G. H., Leemans, R.,
and potential for improvements of policy design
& Viner, D. (2008). Projected environmental
and assessment. Journal of Environmental
shifts under climate change: European trends and
Management, 91(6), 1245–1254..doi:10.1016/j.
regional impacts. Environmental Conservation,
jenvman.2009.12.012
35(1), 64–75..doi:10.1017/S0376892908004529
Randolph, J. (2003). Environmental Land Use
Mouro, C., & Castro, P. (2010). Local Commu-
Planning and Management. Island Press.
nities Responding to Ecological Challenges-A
Psycho-social Approach to the Natura 2000 Net- Reidsma, P., Tekelenburg, T., van den Berg, M., &
work. Journal of Community & Applied Social Alkemade, R. (2006). Impacts of land-use change
Psychology, 20(2), 139–155. doi:.doi:10.1002/ on biodiversity: An assessment of agricultural
casp.1025 biodiversity in the European Union. Agriculture
Ecosystems & Environment, 114(1), 86–102..
Mucher, C. A., Klijn, J. A., Wascher, D. M., &
doi:10.1016/j.agee.2005.11.026
Schaminee, J. H. J. (2010). A new European Land-
scape Classification (LANMAP): A transparent, Scott, J. M., Davis, F., Csuti, B., Noss, R., But-
flexible and user-oriented methodology to distin- terfield, B., & Groves, C. (1993). Gap analysis - a
guish landscapes. Ecological Indicators, 10(1), geographic approach to protection of biological
87–103..doi:10.1016/j.ecolind.2009.03.018 diversity. Wildlife Monographs, (123): 1–41.
Opdam, P. F. M., Broekmeyer, M. E. A., & Stephan, J. Goetz, F. B. (Ed.). (2009). New
Kistenkas, F. H. (2009). Identifying uncertain- Perspectives on Agri-environmental Policies:
ties in judging the significance of human impacts A multidisciplinary and transatlantic approach
on Natura 2000 sites. Environmental Science (Routledge Explorations in Environmental Eco-
& Policy, 12(7), 912–921..doi:10.1016/j.en- nomics): Routledge.
vsci.2009.04.006

1294
Agriculture and Conservation in the Natura 2000 Network

Stokes, D. L., Hanson, M. F., Oaks, D. D., Straub, Westhoek, H. J., van den Berg, M., & Bakkes, J.
J. E., & Ponio, A. V. (2010). Local Land-Use A. (2006). Scenario development to explore the
Planning to Conserve Biodiversity: Planners’ future of Europe’s rural areas. Agriculture Ecosys-
Perspectives on What Works. Conservation tems & Environment, 114(1), 7–20..doi:10.1016/j.
Biology, 24(2), 450–460..doi:10.1111/j.1523- agee.2005.11.005
1739.2009.01356.x
Yang, W. H., Bryan, B. A., MacDonald, D. H.,
Thiel, A. (2009). The use of ex-ante modelling Ward, J. R., Wells, G., Crossman, N. D., & Connor,
tools in European Impact Assessment: What role J. D. (2010). A conservation industry for sustaining
does land use play? Land Use Policy, 26(4), 1138– natural capital and ecosystem services in agricul-
1148..doi:10.1016/j.landusepol.2009.02.005 tural landscapes. Ecological Economics, 69(4),
680–689..doi:10.1016/j.ecolecon.2009.11.028
Vandermeulen, V., Gellynck, X., Van Huylen-
broeck, G., Van Orshoven, J., & Bomans, K.
(2009). Farmland for tomorrow in densely popu-
KEY TERMS AND DEFINITIONS
lated areas. Land Use Policy, 26(4), 859–868..
doi:10.1016/j.landusepol.2008.10.014 Agriculture Land Use Change: process that
Verboom, J., Alkemade, R., Klijn, J., Metzger, M. contribute to habitat degradation through which
J., & Reijnen, R. (2007). Combining biodiversity one habitat-type is removed and replaced with
modeling with political and economic develop- another habitat-type due to agriculture expansion.
ment scenarios for 25 EU countries. Ecological In the process of land-use change, plants and ani-
Economics, 62(2), 267–276..doi:10.1016/j.eco- mals which previously used the site are displaced
lecon.2006.04.009 or destroyed, reducing biodiversity.
Agri-Environmental Schemes: term used to
Verburg, P. H., Eickhout, B., & van Meijl, H. describe communitarian, national or local schemes
(2008). A multi-scale, multi-model approach for that pay farmers to farm in an environmentally
analyzing the future dynamics of European land sensitive way.
use. The Annals of Regional Science, 42(1), Biodiversity Conservation: how to protect
57–77..doi:10.1007/s00168-007-0136-4 and restore biodiversity, or the diversity of life
Verburg, P. H., van de Steeg, J., Veldkamp, A., on Earth. To preserve biodiversity, scientists must
& Willemen, L. (2009). From land cover change answer three general questions. First, how is the
to land function dynamics: A major challenge diversity of life distributed around the planet?
to improve land characterization. Journal of Second, what threats does this diversity face?
Environmental Management, 90(3), 1327–1335.. Third, what can people do to reduce or eliminate
doi:10.1016/j.jenvman.2008.08.005 these threats and, when possible, restore biological
diversity and ecosystem health?
Vereijken, P. H., & Hermans, C. M. L. (2010). A Birds Directive: the EU Directive on the
quick scan tool to assess the relative prospects of conservation of wild birds (79/49/EEC) seeks to
European regions for sustainable agriculture in a protect all wild birds and the habitats of listed
liberal market. Land Use Policy, 27(2), 440–448.. species, in particular through the designation of
doi:10.1016/j.landusepol.2009.06.002 special protection areas (SPA).

1295
Agriculture and Conservation in the Natura 2000 Network

Geographical Informational System: any biodiversity through the conservation of natural


system that captures, stores, analyzes, manages, habitats and of wild fauna and flora in the Eu-
and presents data that are linked to location. The ropean territory of the Member States to which
term describes any information system that inte- the Convention on Biological Diversity applies.
grate, store, edits, analyzes, shares, and displays NATURA 2000: the key instrument to protect
geographic information. biodiversity in the European Union based on the
Habitats Directive: Council Directive 92/43/ 1979 Birds Directive and the 1992 Habitats Di-
EEC of 21 May 1992 on the conservation of natural rective. It is an ecological network of protected
habitats and of wild fauna and flora. The aim of areas, set up to ensure the survival of Europe’s
this directive is to contribute towards ensuring most valuable species and habitats.

This work was previously published in Agricultural and Environmental Informatics, Governance and Management: Emerg-
ing Research Applications, edited by Zacharoula Andreopoulou, Basil Manos, Nico Polman and Davide Viaggi, pp. 339-358,
copyright 2011 by Information Science Reference (an imprint of IGI Global).

1296
1297

Chapter 78
A Neural Network for
Modeling Multicategorical
Parcel Use Change
Kang Shou Lu
Towson University, USA

John Morgan III


Towson University, USA

Jeffery Allen
Clemson University, USA

ABSTRACT
This paper presents an artificial neural network (ANN) for modeling multicategorical land use changes.
Compared to conventional statistical models and cellular automata models, ANNs have both the ar-
chitecture appropriate for addressing complex problems and the power for spatio-temporal prediction.
The model consists of two layers with multiple input and output units. Bayesian regularization was used
for network training in order to select an optimal model that avoids over-fitting problem. When trained
and applied to predict changes in parcel use in a coastal county from 1990 to 2008, the ANN model
performed well as measured by high prediction accuracy (82.0-98.5%) and high Kappa coefficient (81.4-
97.5%) with only slight variation across five different land use categories. ANN also outperformed the
benchmark multinomial logistic regression by average 17.5 percentage points in categorical accuracy
and by 9.2 percentage points in overall accuracy. The authors used the ANN model to predict future
parcel use change from 2007 to 2030.

INTRODUCTION economic development. Degraded environments,


however, threaten sustainable development. As
Land use change has been a major cause of many more states and local governments are willing to
environmental problems. Changes are often nec- create and implement smart growth strategies,
essary to accommodate population growth and planners and policy makers need to know the
impacts of future land use change and related
DOI: 10.4018/978-1-4666-2038-4.ch078

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
A Neural Network for Modeling Multicategorical Parcel Use Change

regulatory decisions. Predictive modeling is not all the models examined by Pontius et al. (2008).
only a learning tool to understand causal factors Although there is no suggested threshold for how
and dynamic changes of a land use system, but also many predictive variables are deemed appropri-
an important mechanism to predict future changes ate, a good model should include variables that
and simulate the potential effects under different measure key drivers of changes in land use such
growth scenarios (Pijanowski et al., 2002). as population, economy, and technology (Turner
Despite the emergence of numerous predictive & Meyer, 1994). More importantly, the model
models over the last half-century, our ability to should be able to generate meaningful predictions,
accurately predict land use change remains lim- and pass the reality test.
ited. Recently, 18 scientists jointly conducted an From the syntax perspective, most existing
evaluation of the performance of nine land cover/ models, particularly conventional statistical
land use models applied in 13 regions (Pontius et models, lack the multilayered, hierarchical,
al., 2008). These models included conventional interconnected structure needed for modeling
logistic regression (McConnell, Sweeney, & Mul- complex land use systems as Batty and Torrens
ley, 2004), cellular automata based SLEUTH (2005) have suggested. Multiple layers mimic
(Dietzel & Clarke, 2004; Silva & Clarke, 2002), the processes of land use changes that involve
multiple agent based SAMMBA (Boissau & land transaction, speculation, and development;
Castella, 2003; Castella, Trung & Boissau, 2005), whereas interconnections through these layers
multistage model CLUE and its variant CLUE-S capture the interrelationships and interactions
(Duan et al., 2004; Verburg et al., 2002; Veldkamp between dependent and independent variables,
& Fresco, 1996; Verburg & Veldkamp, 2004), and whether they are observable, quantifiable or
multifunctional LTM (Pijanowski et al., 2005), comprehensible. The CLUE (-S) model is to some
multiple objective GEOMOD (Pontius, Cornell degree structured in this way and thus performed
& Hall, 2001; Pontius & Malanson, 2005; Pontius relatively well (Pontius et al., 2008). Batty and Tor-
& Spencer, 2005). It was found that prediction rens (2005) did not provide specific suggestions
accuracy, as measured in percent of the correctly about what algorithms to use and how to derive
predicted against the observed, falls within a range them in order to describe such interrelationships
of 1-73 percent, falling below 30 percent in seven and implement the framework. The structure of
cases and above 50 percent only in three cases. the conceptual model, however, implicitly points
Some of the assessments were against the sample to an artificial neural network (ANN) approach
datasets used for model calibration or training. which does offer such capabilities.
But judging on these results, how much can we This paper presents an artificial neural network
trust the predictions of the models? (ANN) for modeling multi-categorical parcel use
Pontius et al. (2008) point out that something changes. We intended to improve the model by: 1)
must be wrong with the mechanics of these models. incorporating more effective predictor variables
We attribute the poor performances more to the for a better semantic representation; 2) expand-
oversimplification of complex land use systems ing the model structure for a multiclass land use
in both semantics and syntax. Semantically, most system; 3) using a multilayer neural network
models, particularly cellular automata models, to enhance predictive power; and 4) applying a
tend to overemphasize “parsimony” and use Bayesian regularization (BR) training algorithm to
too few predictive variables to truly represent avoid the over-fit problem. We applied the model
complex reality. It may not be a coincident that in Beaufort County, South Carolina, to predict
SLEUTH, constructed with only three effective and simulate future land use changes from 2000
predictive variables, is the worst performer among to 2030 under different growth scenarios.

1298
A Neural Network for Modeling Multicategorical Parcel Use Change

BACKGROUND resent the destination and source of a connector.


The connectors between hidden units and output
ANNs have been around since the early 1940s, units (second layer) are layer weights, denoted by
but it was not until the mid 1990s that they were LW2,1, which is a S1x S2 matrix. Each unit in the
introduced into the geosciences for spatial interac- hidden layer or output layer has a bias term, b1 or
tion, interregional telecommunication, resource b2. They together form a bias vector, b1 or b2. The
management, suitability assessment, and image input vector is p, whereas a1 and a2 are the output
classification (Fischer & Gopal, 1994; Openshaw, vectors for the hidden layer and output layer, re-
1993). Openshaw and Openshaw (1997) identified spectively. The vector a2 is also the output of the
four major benefits of neural networks for model- whole network; f1and f2 are transfer functions for
ing: better performance, greater representational the hidden layer and output layer, respectively. A
flexibility and freedom from current model design logistic function is often used as transfer function
constrains, the opportunity to handle explicitly for classification problems.
noisy data, and the incorporation of spatial de- The prediction starts from left to right. First,
pendency in the net representation (which is each hidden unit sums with its own bias b1 the
currently ignored). product of the input value and corresponding
Researchers in land use modeling have been weight lw1,1 from every input unit p. Then, a
a half-decade slower in realizing such benefits. transfer function, f1, is applied to the sum n1 before
Pioneering studies mainly focus the use of ANNs its output a1 is sent to every unit of the output
for simulating the change in urbanization patterns layer. Each unit of the output layer repeats the
(Li & Yeh, 2001; Pijanowski, Gage, & Long, process, but uses the output of every hidden unit
2000; Pijanowski et al., 2002; Yeh & Li, 2002). as the input. The result is the output, a2, or predic-
Pijanowski et al. (2005) later incorporated an tion of the network. This process is also called
ANN as one of the three components of the Land feedforward prediction. For this approach, the
Transformation Model (LTM), and applied it in determination of the weights and biases for mul-
several other areas. Allen and Lu (2004, 2006) tilayer networks used to be a challenge until re-
found that ANN models not only attained fair to cently.
high prediction accuracy but also outperformed A backpropagation training algorithm (Ru-
the logistic regression in several applications in melhart et al., 1986) makes it possible to auto-
coastal regions. They also found that ANN mod- matically derive the weights and bias for a neural
els were capable of discerning isolated, sparsely network by learning from a sample. It starts with
distributed features or events that other models random weights and bias and proceeds with the
often fail to predict. Land use systems modeled feedforward prediction. The prediction error, the
in these studies are all binary. It remains to be difference between the output value and the ob-
tested whether ANNs can achieve similar success served value (often called the target value), is then
in predicting multi-category land use change. calculated. The error is redistributed backward
Figure 1 provides both a graphic illustration to every hidden unit based on a gradient descent
and mathematic notations (MatLab) of a two-layer rule. The information is then used for updating
feedforward neural network appropriate for land the corresponding weight and bias for every unit
use prediction. The network has R input units, S1 at a certain correction rate before starting another
hidden units and S2 output units. The connectors cycle for the next input-target pair. The entire
between the input units and the hidden units are process that goes through every pair in a sample
input weights, collectively denoted by IW1,1, dataset is an epoch. It usually takes many epochs
which is a R x S1 matrix. The superscripts rep- before a network achieves a desirable goal and

1299
A Neural Network for Modeling Multicategorical Parcel Use Change

Figure1. A two-layer feedforward neural network for modeling multi-categorical land uses

stops training. The backpropagation method has testing, and validation as an essential technique
revitalized neural network sciences and marks the to avoid over-fitting problems.
second generation of neural computation (Penny
et al., 1999).
Many new algorithms have been developed METHODS
for increasing the training speed, maximizing
approximation capability, or avoiding over-fitting Study Area
problems. One such innovative algorithm is the
Bayesian Regularization (BR) developed by Beaufort County, South Carolina, was selected
MacKay (1992). Instead of minimizing just sum as the study area. It is a well-known coastal
of squared error (SSE) as the training objective, tourist destination. The county covers 923 mi2 in
BR incorporates sum of squared weights (SSW) area, with 63 percent land and 37 percent water.
into the performance function by using two ad- Driven by tourism development, new immigration,
ditional parameters (δ and γ) for regularizing service sector expansion, and military demand,
SSE and SSW, respectively. Built upon Bayes- Beaufort County has seen tremendous growth in
ian probability framework, BR is capable of 1) population, urban area, and impervious surface
determining all parameters based on the posterior over the past two decades. From 1990 to 2000,
probabilities conveyed in the sample data, 2) its population increased by 39.93 percent, from
automatically selecting an optimal model that 86,425 to 120,935. This growth rate triples the
avoids over-fitting problems for better generaliza- national average (13 percent) and leads all coun-
tion, and 3) retaining great approximation power ties in South Carolina. From 1990 to 2008, the
once sufficiently trained. It is no longer necessary total number of built parcels increased by 136.4
to split a sample into three subsets for training, percent from 23,446 to 56,929. Impervious sur-
face almost quadrupled (380 percent) in the fast

1300
A Neural Network for Modeling Multicategorical Parcel Use Change

growing Bluffton area. The land conversion rate We have also assessed the effects of predictor
has been high, with an average of 2,575 parcels variables.
(1,500 acres) per year over the last decade. As one
of the top seven retiree communities in the US, the Data Preparation
county is expected to continue growing at a rapid
pace. These changes impose tremendous pressure Parcel use in 2008 was derived from the parcel
on natural environments, local infrastructure, and shape file (ESRI, Inc., Redlands, California)
public services. There is a tremendous demand and Tax Assessor’s Table provided by Beaufort
for predictions and impact assessments needed County. We grouped parcel uses into five general
for smart growth decisions. categories: 1) commercial, including all com-
mercial, industrial, institutional, and service/
Neural Network Specification utility uses; 2) urban residential, including all
developed accommodation use < 5 ac; 3) rural
We used a two-layer, feedforward neural network residential, with parcel size falling between 5-10
with 18 x 18 x 5 architecture to model a five-class ac; 4) recreational (only golf courses); and 5)
parcel use system. The five output units included undeveloped, including wetlands, open space,
commercial, urban residential, rural residential, forestlands, croplands, parks and reserves, and
recreation (golf course), and undeveloped, all of other preserved and protected lands. Of a total
which are binary (1 or 0). Each parcel is assigned 99,781 parcels, 33,250 (33.32 percent) were de-
to one type of use, and each parcel is encoded veloped or built between 1990 and 2008. We call
by a vector of five elements. For example, if this ‘new development’ or ‘net change’ for model
a parcel was developed for commercial use, it validation. Because of rich temporal information,
was coded as (1, 0, 0, 0, 0). Input units represent we were able to derive changes on an annual or
18 independent variables listed in Table 1. The decadal basis for different land uses, periods, and
number of hidden units was initially set to 18, but
may change, depending on the BR training result.
Table 1. Predictor variables
We used a logistic function, f = 1 / (1+ exp
(- x)), as the transfer function for all layers, and Group Independent Variables
a softmax function, yj = exp (- xj) / ∑ exp (- xk), Physical properties • land area
for land transition probability and output classi- • wetland ratio
• slope
fication. The former allows us to simulate parcel • distance to water front
changes under different scenarios. The BR algo- • forest wetland
rithm is available in the Neural Network Toolbox Road accessibility • cost distance to major road
• distance to local street
of Simulink 8 (MatLabtm). We wrote scripts to
Proximity to utilities • distance to waterline
enable data input, classification, and error assess- • and distance to sewer line
ment. Several ArcGIS-based submodules were
Neighborhood • cost distance to central business
developed for sample extraction, data processing, effects district
and spatial prediction. • developed parcels
• distance to built parcels
Performance of the ANN was assessed in mea- • agricultural use
sures of: 1) mean squared error (MSE), sum of Demographic • block population
squared error (SSE), and sum of squared weights features • block neighborhood population
(SSW); 2) prediction accuracy; and 3) the Kappa Land value • parcel land value
• total property values
coefficient. Multinomial logistic regression was
Land ownership • private ownership
used as a benchmark model for comparisons.

1301
A Neural Network for Modeling Multicategorical Parcel Use Change

regions. We used the growth rates for three differ- entire training set was estimated based on the
ent lengths of periods to simulate future changes: formula S=Nw / Et where Nw is the total number
1980-2008, 1990-2008, and 2000-2008. of weights including biases, and Et is the targeted
We prepared two sets of data for all predictive error threshold. The final training sample includes
variables, 1990 for model training, and 2008 for 7,812 parcels, accounting for nearly 8 percent in
future prediction. National Wetland Inventory number and 17 percent in area of all parcels in the
data from 1989 were used to extract forested county. We also prepared four more datasets for
wetland, land area, and wetland ratio. Slope was model validation and testing. They included two
derived from the 30 m DEM downloaded from sample sets, a net change set, and the full dataset.
USGS’s National Map Seamless Server. Block The first three sets accounted for 20 percent, 10
population data were from Census Data 1990 and percent, and 33 percent of the total number of
2000. Beaufort County provided the updated data parcels in the county respectively.
for roads and streets from which proximity and
coast distance variables were derived. Waterline
and sewerline layers were from South Carolina RESULTS
Department of Commerce. Protected lands were
downloaded from the Natural Resources GIS Data Model Performance
Clearinghouse of the South Carolina Department
of and updated by using the 2008 parcel data. We performed multinomial regression analysis in
The tax assessor file was the source for building a preliminary study to screen predictive variables.
counts, built year, parcel size, land price, land Eighteen of the variables included in the prelimi-
value, and land ownership at the finest spatial nary study were found statistically significant and
units. Since the coastal area is highly fragmented thus used to construct the ANN model. Based
by water bodies and wetlands, we constructed two on the 4:2:2 ratios, the training sample was split
cost distance variables to measure the influences into three subsets for training, validation, and
of major roads and central business district over testing, respectively, during the training process.
a cost surface created by weighting these natural The training process stopped within 100 epochs
barriers more as frictional factors. in several runs. MSE is 0.018 for the training
All data were aggregated in parcel units and the subset and 0.038 for both testing and validation
parcel shape file was joined with the tax assessor’s subsets. Results of BR suggest that an optimal
table. Most distances were measured from the model with 415 effective parameters (18 x 17 x
centroid of a parcel, but distance to waterfront is 5) would be adequate.
the zonal average of the grid’s Euclidian distances The ANN model performed well against the
to a parcel. Data for each independent variable training dataset (Table 2). The prediction accu-
were scaled to a range between 0 and 1 by using racy, overall or categorical, is quite impressive as
the same set of min-max values for both training measured in terms of correct percent (82.0-98.5
and prediction sets. percent) and Kappa coefficients (0.81-0.98), par-
The training dataset was extracted by using ticularly for a change of 33.3 percent over newly
a pseudo-stratified systematic random sampling two decades. Kappa coefficient is considered a
method. The strata are five land use classes, but conservative measure of agreement with chance
subsample size for each stratum is not purely corrected, and a value above 0.80 suggests an
proportional to the total number of parcels, but excellent prediction. Although both accuracy and
proximately proportional to the total area of the Kappa tend to increase with the total number of
stratum in concern. The sample size (S) for the

1302
A Neural Network for Modeling Multicategorical Parcel Use Change

Table 2. Prediction accuracy of the ANN model against the training sample

Predicted
Observed Row Total Accuracy (%) Kappa
I II III IV V
I 511 24 24 9 6 574 89.0 0.88
II 12 2815 30 6 14 2877 97.8 0.97
III 10 22 866 7 38 943 91.8 0.91
IV 4 5 5 218 34 266 82.0 0.81
V 16 11 18 2 3105 3152 98.5 0.98
Column total 553 2877 943 242 3197 7812 96.2
a
0.94b
Notes: I=Commercial, II=Urban residential, III=Rural residential, IV=Golf Course, and V=Other Use.
a. Overall prediction accuracy, b. Overall kappa coefficient.

parcels within the category, their variations across Results of model validation against four dif-
different land uses are relatively small. ferent datasets are summarized in Table 4. Over-
The benchmark MLR was also statistically all accuracy is high in all cases (> 94.9 percent)
significant based on the likelihood ratio tests with only little variation (< 1.3 percentage points)
(Chi-square = 11,100, df = 76, p < 0.001). It was, across different cases. For all but net change
however, outperformed by the ANN model in all validation, prediction accuracy dropped only by
measurements (Tables 2 and 3). The ANN model average 2.4 percentage points in all 19 measure-
improved prediction accuracy by an average of ments with a range of 0.8-2.6 percentage points.
17.5 percentage points across the five uses and The net change case is an exception because we
by 9.2 percentage points in overall. The two observed three extreme discrepancies: the largest
numbers are even more impressive for Kappa jump (+12.2), the smallest drop (-1.2), and the
coefficients (0.19 and 0.14). The greatest improve- largest drop (-11.1) in prediction accuracy from
ment occurred where MLR failed to generate those for the training set. ANN’s predictive abil-
satisfactory results, by 33.6 and 32.2 percentage ity did not show a consistent decay pattern which
points for commercial and rural residential re- was not expected.
spectively.

Table 3. Prediction accuracy of the MLR model against the training sample

Predicted
Observed Row Total Accuracy (%) Kappa
I II III IV V
I 318 197 12 12 35 574 55.4 0.53
II 19 2659 50 3 146 2877 92.4 0.88
III 74 147 562 4 156 943 59.6 0.56
IV 13 24 5 177 47 266 66.5 0.66
V 5 52 8 4 3083 3152 97.8 0.96
Column total 429 3079 637 200 3467 7812 87.0
a
0.80b
Notes: I=Commercial, II=Urban residential, III=Rural residential, IV=Golf Course, and V=Other Use.
a. Overall prediction accuracy, b. Overall kappa coefficient.

1303
A Neural Network for Modeling Multicategorical Parcel Use Change

Table 4. Prediction accuracy (%) of the ANN model in four validations

Parcel Use Validation Sets


Training Set
Category Use-based Area-based Net Change a Full Set
I 89.0 86.6 83.8 79.2 87.1
II 97.8 96.5 96.5 96.6 96.2
III 91.8 88.5 84.9 80.7 87.4
IV 82.0 79.8 77.6 94.2 80.6
V 98.5 97.7 97.6 97.7
Overall 96.2 96.4 96.1 94.9 96.2
Notes: I=Commercial, II=Urban residential, III=Rural residential, IV=Golf Course, and V=Other use.
a
Category V or Other Use is treated as the background category and is thus not applicable in this case.

Predicted Changes straints. Table 5 provides a quantitative summary


of simulated regional development with and with-
The predicted long-term trajectories for parcel out a growth boundary constraint.
development are shown in Figure 2. They reflect
quantitative demands at three different growth
rates. We used 2008 data as the new input of the DISCUSSION AND CONCLUSION
trained ANN model to predict land transition
probabilities for five parcel uses. These two types A neural network was applied to predict parcel
of information were used to map future change use change in Beaufort County, South Carolina.
through 2030. Figure 3 shows the distribution of The model is not only theoretically appropriate
predicted parcel use changes for selected classes for addressing complex problems, but also has
and areas under selected growth rates and con- demonstrated improved predictive capability. The

Figure 2. Predicted trajectories of parcel development in Beaufort County at three different growth rates

1304
A Neural Network for Modeling Multicategorical Parcel Use Change

Figure 3. Predicted parcel use changes in Beaufort County from 2000 to 2030 under different growth
scenarios

Table 5. Potential impact of the growth boundary on regional development in Beaufort County

Growth Boundary Gain (or Loss)


Region Yes No Yes-No
N Area N Area N% Area%
St Henna Island 0 0 5132 4592 -100 -100
S Beaufort Islands 2726 2038 1664 875 64 133
Hilton Head 5239 3968 4884 3035 7 31
NE Corner 0 0 958 1341 -100 -100
Port Royal-City of Beaufort 10332 8148 9631 5945 7 37
Bluffton 9404 7318 8687 5677 8 29
Entire County 27701 21472 30956 21466 -11a <1a
Notes: Prediction is based on the 1990-2008 growth rate. Area units are acres.
a
Discrepancies are caused by variable parcel sizes due to the use of area-based demand.

1305
A Neural Network for Modeling Multicategorical Parcel Use Change

accuracy ranges from fairly good to excellent in urban fringe areas, infill development is expected
all 29 measurements, regardless of variations in to intensify in previously sprawled regions. Over
land use category, sample set, parcel size, and time, development will change the image of the
time span. The ANN outperformed both bench- county as a well vegetated, properly landscaped
mark MLR and models evaluated by Pontius et tourist destination. The simulation results suggest
al. (2008). A critique of ANNs is the over-fitting caution in implementation of a growth boundary.
problem. But this problem, if present, might have The policy may be able to preserve natural areas
been kept to minimal in our case, due to the use of and farm lands in North Beaufort, but it has a
BR and large training sample. Cross-validations potential risk of driving land conversion and en-
do not show any substantial drops in prediction vironmental change in many peripheral islands in
accuracy nor suggest any pattern of variations South Beaufort too fast for coastal terrestrial and
across different sample sets and land uses. It aquatic life to adapt. Incentives should be in place
was found again that ANNs enable us to predict in order to regulate such developments.
rare, sparsely distributed events with the level of
accuracy other models often fail to reach (Allen
& Lu, 2006). ACKNOWLEDGMENT
The use of parcels as objects to be modeled
and as units of analysis has brought both benefits We would like to thank South Carolina Sea Grant
and issues. This approach adds more realism to and NOAA’s National Centers for Coastal Ocean
our modeling practice because of the use of real Science for funding this project.
world units for land transaction and develop-
ment (Landis, 1994). Many parcel attributes are
predictive variables with measurements that can REFERENCES
be readily incorporated into a model to improve
model semantics and prediction accuracy. Also, Allen, J. S., & Lu, K. S. (2006). Predicting trajec-
many property variables are spatially well-defined tories of urban growth in the coastal Southeast.
by parcel boundaries and no additional error is In G. S. Kleppel, M. R. Devoe & M. V. Rawson
induced by aggregation or disaggregation. For (Eds.), Changing land use patterns in the coastal
the same reason, it is less difficult to interpret the zone: Managing environmental quality in rapidly
modeling results. Four issues we have encountered developing regions (pp. 47-67). New York, NY:
are related to large parcel partitioning, vector Springer.
data processing, parcel use classification, and Allen, J. S., & Lu, K. S. (2003). Modeling and
data availability. How to model the large parcel prediction of future urban growth in the Charleston
partitioning process remains to be a technical region of South Carolina: A GIS-based integrated
challenge. approach. Conservation Ecology, 8(2), 1–20.
The predicted trajectories and growth patterns
have policy implications. The anticipated urban Allen, J. S., & Lu, K. S. (2004). Artificial neu-
growth ratio is relatively moderate (2:1-3:1) com- ral network vs. binary logistic regression: Two
pared to that (6:1) for the Charleston area (Allen alternative models for predicting urban growth
& Lu, 2003). Potential impacts of land develop- in the Myrtle Beach region. Paper presented at
ment can be relatively severe because Beaufort’s the NOAA’s Land Use-Coastal Ecosystem Study
physical environment is very fragmented and (LU-CES) Program.
ecological systems are quite vulnerable. While
new development is continuously spreading in the

1306
A Neural Network for Modeling Multicategorical Parcel Use Change

Batty, M., & Torrens, P. M. (2005). Modeling and Lu, K. S., & Allen, J. S. (in press). Neural networks
prediction in a complex world. Futures, 37(7), for modeling urban growth patterns and potential
745–766. doi:10.1016/j.futures.2004.11.003 ecosystem impact in the coastal zone. In Brouwer,
F., & Goetz, S. J. (Eds.), The dynamics of land
Boissau, S., & Castella, J. (2003). Constructing a
use and ecosystem interactions: A transatlantic,
common representation of local institutions and
multidisciplinary and comparative approach. New
land use systems through simulation-gaming and
York, NY: Springer.
multi-agent modeling in rural areas of North-
ern Vietnam: The SAMBA-Week methodol- MacKay, D. J. C. (1992). Bayesian interpolation.
ogy. Simulation & Gaming, 34(3), 342–347. Neural Computation, 4(3), 415–447. doi:10.1162/
doi:10.1177/1046878103255789 neco.1992.4.3.415
Castella, J., Boissau, S., Trung, T. N., & Quang, Openshaw, S. (1993). Modeling spatial interaction
D. D. (2005). Agrarian transition and lowland- using a neural net. In Fischer, M. M., & Nijkamp,
upland interactions in mountain areas in northern P. (Eds.), GIS spatial modeling and policy (pp.
Vietnam: Application of a multi-agent simulation 147–164). Berlin, Germany: Springer-Verlag.
model. Agricultural Systems, 86(3), 312–332.
Openshaw, S., & Openshaw, C. (1997). Artificial
doi:10.1016/j.agsy.2004.11.001
intelligence in geography. Chichester, UK: John
Dietzel, C. K., & Clarke, K. C. (2004). Spatial Wiley & Sons.
differences in multi-resolution urban automata
Penny, W. D., Husmeier, D., & Roberts, S. J.
modeling. Transactions in GIS, 8, 479–492.
(1999). Introduction. In Lisboa, P. J. G., Ifeachor,
doi:10.1111/j.1467-9671.2004.00197.x
E. C., & Szczepaniak, P. S. (Eds.), Artificial neu-
Duan, Z., Verburg, P. H., Zhang, F., & Yu, Z. ral networks in biomedicine (pp. 1–9). Berlin,
(2004). Construction of a land-use change simu- Germany: Springer-Verlag.
lation model and its application in Haidian Dis-
Pijanowski, B. C., Brown, D. G., Manik, G., &
trict, Beijing. Acta Geographica Sinica, 59(6),
Shellito, B. A. (2002). Using neural nets and GIS
1037–1046.
to forecast land use changes: A land transforma-
Fischer, M. M., & Gopal, S. (1994). Artificial tion model. Computers, Environment and Urban
neural networks: A new approach to model- Systems, 26(6), 553–575. doi:10.1016/S0198-
ing interregional telecommunication flows. 9715(01)00015-1
Journal of Regional Science, 34(4), 503–527.
Pijanowski, B. C., Pithadia, S., Shellito, B. A.,
doi:10.1111/j.1467-9787.1994.tb00880.x
& Alexandrids, K. (2005). Calibrating a neural
Landis, J. (1994). The California urban futures network-based urban change model for two metro-
model: A new generation of metropolitan simu- politan areas of the Upper Midwest of the United
lation models. Environment and Planning. B, States. International Journal of Geographical
Planning & Design, 21, 399–420. doi:10.1068/ Information Science, 19, 197–215. doi:10.1080
b210399 /13658810410001713416
Li, X., & Yeh, A. G. O. (2001). Calibration of Pontius, R. G. Jr, Boersma, W., Castella, J., Clarke,
cellular automata by using neural networks for K., de Nijs, T., & Dietzel, C. (2008). Comparing
the simulation of complex urban systems. En- the input, output, and validation maps for several
vironment & Planning A, 33(8), 1445–1462. models of land change. The Annals of Regional
doi:10.1068/a33210 Science, 24(1), 11–37. doi:10.1007/s00168-007-
0138-2

1307
A Neural Network for Modeling Multicategorical Parcel Use Change

Pontius, R. G. Jr, Cornell, J., & Hall, C. (2001). Veldkamp, T. A., & Fresco, L. (1996). CLUE-CR:
Modeling the spatial pattern of land-use change An integrated multi-scale model to simulate land
with GEOMOD2: Application and validation for use change scenarios in Costa Rica. Ecological
Costa Rica. Agriculture Ecosystems & Environ- Modelling, 91, 231–248. doi:10.1016/0304-
ment, 85(1-3), 191–203. doi:10.1016/S0167- 3800(95)00158-1
8809(01)00183-9
Verburg, P. H., Soepboer, W., Veldkamp, T. A.,
Pontius, R. G. Jr, & Spencer, J. (2005). Uncertainty Limpiada, R., Espaldon, V., & Mastura, S. A. S.
in extrapolations of predictive land change mod- (2002). Modeling the spatial dynamics of regional
els. Environment and Planning B, 32, 211–230. land use: The CLUE-S model. Environmental
doi:10.1068/b31152 Management, 30(3), 391–405. doi:10.1007/
s00267-002-2630-x
Rumelhart, D. E., Hinton, G. E., & Williams, R.
J. (1986). Learning internal representations by Verburg, P. H., & Veldkamp, T. A. (2004).
error propagation. In J. L. McClelland & D. E. Projecting land use transitions at forest fring-
Rumelhart (Eds.), Parallel distributed processing: es in the Philippines at two spatial scales.
Explorations in the microstructure of cognition: Landscape Ecology, 19, 77–98. doi:10.1023/
Vol. 1. Foundations (pp. 318-362). Cambridge, B:LAND.0000018370.57457.58
MA: MIT Press.
Yeh, A. G. O., & Li, X. (2002). Urban simulation
Silva, E., & Clarke, K. C. (2002). Calibration using neural networks and cellular automata for
of the SLEUTH urban growth model for Lisbon land use planning. In Richardson, D., & van Os-
and Porto, Portugal. Computers, Environment terom, P. (Eds.), Advances in spatial data handling
and Urban Systems, 26, 525–552. doi:10.1016/ (pp. 451–464). Berlin, Germany: Springer-Verlag.
S0198-9715(01)00014-X
Turner, B. L., & Meyer, W. B. (1994). Global
land-se and land-cover change: An overview. In
Meyer, W. B., & Turner, B. L. II, (Eds.), Change
in land use and land cover: A global perspective
(pp. 1–10). Cambridge, UK: Cambridge Univer-
sity Press.

This work was previously published in the International Journal of Applied Geospatial Research (IJAGR), Volume 2, Issue
3, edited by Donald Patrick Albert, pp. Donald Patrick Albert, copyright 2011 by IGI Publishing (an imprint of IGI Global).

1308
1309

Chapter 79
Historical GIS as a Platform for
Public Memory at Mammoth
Cave National Park
Katie Algeo
Western Kentucky University, USA

Ann Epperson
Western Kentucky University, USA

Matthew Brunt
Western Kentucky University, USA

ABSTRACT
The Mammoth Cave Historical GIS (MCHGIS) fosters new understandings of a national park landscape
as a historic farming community and offers a web-based platform for public memory of pre-park inhabit-
ants. It maps the 1920 manuscript census at the household level over a streaming topographic map and
georeferences Civilian Conservation Corps photographs of dwellings for visualization and analysis of
the area’s population on the eve of creation of Mammoth Cave National Park. A web interface to the
MCHGIS permits broader dissemination of archival holdings. Public participation GIS techniques are
adapted to initiate a virtual site of public memory to supplement the history presented by institutionally-
held materials with those donated from private holdings.

INTRODUCTION tools to explore them (Gregory & Healey, 2007).


This article describes a historical GIS created to
GIS has proven a valuable tool for historical ge- document and enhance understanding of the his-
ographers in facilitating integration of data from tory of the pre-park inhabitants of Mammoth Cave
diverse sources, permitting visualization and National Park (Figure 1). The national park was
analysis of past places, and allowing dissemination authorized by U.S. Congress in 1926, the same
via the Internet of both digital databases and the year as Great Smoky Mountains and Shenandoah
National Parks, part of a wave of park-creation
DOI: 10.4018/978-1-4666-2038-4.ch079 intended to meet the recreational and psychologi-

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Historical GIS as a Platform for Public Memory at Mammoth Cave National Park

cal needs for wilderness among the core of U.S. areas in the U.S. are rare because rural street ad-
population located east of the Mississippi River dressing was not standardized until recently, and
(Ise, 1961). All three new parks had resident many rural areas lack alternative data sets, such
populations, variables mixes of EuroAmericans, as city directories, tax lists, or utility records, that
African Americans and Native Americans. The help locate urban populations. DeBat’s (2009)
Mammoth Cave region had, by far, the highest GIS for rural Washington County, Oregon, used
population density of the three new parks, yet it federal plat maps created under the aegis of the
has received the least scholarly treatment of its Donation Land Claim Act, a data set unique to
displaced population.1 As a first step in addressing the Oregon Territory. Thomas and Ayers’ (2003)
this lacuna, the Mammoth Cave Historical GIS Valley of the Shadow project, investigating the
(MCHGIS) provides a snapshot of the region’s impacts of slavery at the time of the Civil War,
habitation on the eve of the national park’s cre- created a detailed GIS of one Northern and one
ation. It combines 1920 manuscript census data Southern county from census and military service
for individual households with photographs of records, letters, and newspaper articles. The web
the dwellings where families lived, geolocating site for this project, however, presents only static
both to known house sites. It thus provides not maps.
only the basis for quantitative analysis of the pre- The MCHGIS is methodologically innovative
park population, but a framework for qualitative in its use of qualitative techniques to map census
understanding of landscape and living conditions data for an entire rural community at the household
in the region. level and for its creation of an interactive web
A number of GIS map populations at the interface that allows users to explore the GIS. A
household level for small cities or portions of detailed topographic map prepared in 1930 as part
larger urban areas (see, for example, DeBats, of the land acquisition process in the Mammoth
2008; Schlichting & Tuckel, 2006). Historical Cave region showed house locations, but the Park
GIS at the individual or household level for rural retained no information on who lived in each

Figure 1. Mammoth Cave National Park is located in the karst region of central Kentucky and is easily
accessible from Interstate 65, a modern roadway that roughly follows the route of the old Dixie Highway
of the 1920s

1310
Historical GIS as a Platform for Public Memory at Mammoth Cave National Park

house. The authors developed a three-prong ap- created in 1872 as the first U.S. National Park, es-
proach to matching census households with known tablished the model of national park as uninhabited
house sites, comparing head-of-household house wilderness, yet the delineation of its boundaries
ownership data from the census with national antecedent to European American settlement
park parcel purchase records, relying on local was the exception rather than the rule (Dilsaver
knowledge of key informants, and interpolating & Wyckoff, 2005). When Glacier National Park
between existing matches based on the presumed was created in 1910, sixteen years before the
path of the census taker. Eastern trio of Great Smokies, Shenandoah,
The MCHGIS is also innovative in its adoption and Mammoth, private inholdings amounting
of techniques from web-based public participa- to 10,000 acres were allowed to remain inside
tion GIS (PPGIS) to foster a collaborative site of Glacier’s boundaries. Moreover, homesteaders
public memory for the pre-park population of the and private land claimants were guaranteed both
Mammoth Cave region. Users may interact with rights to their land and access corridors to use it
the MCHGIS through an easy-to-use web GIS (Dilsaver & Wyckoff, 2005). In contrast, residents
interface, viewing house and parcel layers overlaid inside the boundaries of the eastern trio of parks
on a hill shaded topographic map. Clicking on a authorized in 1926 faced displacement, through
house site brings up the 1920 census data for that condemnation, if necessary.
household where a match is known, as well as These three Eastern parks have adopted differ-
any photographs of family members, the dwell- ent approaches through the course of the twentieth
ing or farm outbuildings in the GIS. Layers are century to interpretation of their cultural land-
fully searchable to assist in the location of family scapes and displaced communities, differences
names. Members of the public are encouraged to that affect the visitor experience in the parks today
submit additional photographs and other historic as well as public understandings of their history.
information for inclusion in the MCHGIS. Great Smoky Mountains National Park was the
The goals of this paper are to demonstrate the first of the three to embrace interpretation of its
utility of historical GIS in achieving new under- displaced community, through the (re-)construc-
standings of a national park landscape, document tion of Cades Cove, owing largely to the interest
the progress to date of the MCHGIS in creating a and initiative of one of the park’s rangers (Young,
geospatial database of the pre-park population and 2006). While initial approaches to interpretation
a platform for dissemination of pre-park history, created a romanticized vision of an Appalachian
and describe the creation of a geospatial frame- community in harmony with nature and embody-
work for public memory through a participatory ing a mythological primordial society, subsequent
collaboration with former park residents and their refinements in interpretation were more culturally
descendents. sensitive and environmentally sustainable (Young,
2006). What is significant is that, from the 1940s,
this park embraced its settlement history as part
TOWARDS A NEW UNDERSTANDING of the story told to visitors. Shenandoah National
OF A NATIONAL PARK LANDSCAPE Park has also seen contestation about interpretive
materials concerning its displaced population and
Individual U.S. national parks have varied in the responded by altering displays most offensive
policies applied to resident populations at the to descendents (Horning, 2001). Archaeological
time of park creation and in their subsequent ap- surveys begun during the 1990s have furthered
proaches to interpretation of cultural landscapes interpretation of its pre-park communities, some
within their borders. Yellowstone National Park, of which results are included on the park’s offi-

1311
Historical GIS as a Platform for Public Memory at Mammoth Cave National Park

cial web site in a section titled “The Displaced” As might be expected, however, the passing of
that explores cultural and economic differences time has greatly tempered bitter feelings. The
between various former communities within the former residents who are still alive were children
park and addresses the history of cultural resource when they left the park and, in discussions with
interpretation (Horning, 2001). In contrast to the authors, stress that they had a child’s view
these efforts at interpretation, similar efforts at of the situation. They and their descendents,
Mammoth Cave National Park are notably lack- while retaining feelings of attachment to former
ing. A brand new visitor center opened in 2010, homesteads, do not express ill will towards the
featuring displays on the physical environment park, but an interest in preserving memories of
and the development of tourism, but virtually their ancestors and their communities. This seems
nothing on the pre-park population. The “History an appropriate time, then, to open a dialog about
and Culture” section of the park’s official web appropriate and effective means of interpreting
site contains a database of individuals buried in the pre-park communities at Mammoth Cave and
over 80 cemeteries located throughout the park, foster a new understanding of this national park
a section on black history at Mammoth Cave that landscape. The MCHGIS, an academic initiative
recognizes the contributions of the many African launched by the authors, contributes to this effort
American cave guides, and a link to a selection of by recovering and making accessible primary
digitized photographs taken by a Civilian Conser- materials that document the history of the people
vation Corps photographer documenting the early and communities that were removed to make the
years of park administration. The “Brief History national park.
of Mammoth Cave” that makes up the remainder Most visitors to Mammoth Cave National Park
of this section discusses the national park idea, come to see a portion of the world’s longest known
the legislation behind the park’s creation, the cost cave system, the raison d’etre for the park, but
of land purchases, and the work of the CCC in many also enjoy camping, hiking a forest trail,
transforming the landscape, but the communities or boating on the Green River. In the visitor’s
and people who were displaced are not mentioned center or on a cave tour, they may learn about
at all. Thus, the official approach to interpretation the region’s distinctive karst geology, the delicate
of pre-park communities at Mammoth Cave has cave ecosystem, or the history of cave exploration.
been resounding silence. In short, the visitor’s experience is primarily an
Scholars have begun to probe such silences in encounter with nature writ large. While several
park histories. As William O’Brien (2007) points cemeteries along roadways are well preserved,
out in his work on segregation in Southern state there are few other reminders of the population
park systems, silences may initially occur because that used to live here and little interpretation of
these aspects of social history are painful or dis- the park’s human history. It is safe to say that the
turbing to recall, but lack of discussion leads to a typical visitor remains unaware that the iconic
loss of institutional memory that perpetuates the national park landscape that they encounter, one
silences. This is a historical trajectory that seems dominated by forest and devoid of human habi-
to be operating at Mammoth Cave. Oral histories tation, is a recent creation. Less than a century
by the Mammoth Cave’s first rangers document ago the area was occupied by rural hamlets and
a high level of bitterness among some of the scattered farmsteads, with much of the land in
displaced, leading, in one instance, to a ranger open pasture and cropland. Mammoth Cave was
being shot (Lally, 1987). Such incidents might a private tourism attraction with absentee owners
understandably lead to avoidance of dealing with run by a local community of guides, hotel workers,
the issue of displacement in park interpretation. and other tourism support staff (Algeo, 2004).

1312
Historical GIS as a Platform for Public Memory at Mammoth Cave National Park

The MCHGIS reveals the extent to which the social construction of the Mammoth Cave
the national park landscape has been socially National Park landscape was both material and
constructed by mapping the pre-park popula- ideational (Demeritt, 2002). Since interpretive
tion, revealing a lightly, but extensively settled material presented at the park favors a discourse of
landscape (Figure 2). Unlike the Great Smoky nature, science, and cave exploration, with human
Mountains, where vast tracts of unpopulated elements being most prominent in the latter, inter-
land were held by timber companies (Campbell, subjective encounters between visitors and park
1960), the Mammoth Cave region was thoroughly guides discursively reinforce notions of the park
settled and farmed. Photographs taken during the as nature over the park as historic human habitat
early 1930s by a Civilian Conservation Corps (Berger & Luckmann, 1966). By contributing to
photographer reveal extensive open pastures and a greater understanding of the area’s settlement
fields. Early park administrators followed a dual history and highlighting the social as well as
policy of removing signs of human habitation and environmental changes wrought by the national
re-creating a “natural” landscape via a massive park, the MCHGIS creates “new insights into the
tree planting campaign to compose a landscape geographies of the past,” thus fulfilling Gregory
that accorded with the institutional vision of and Healey’s (2007) criterion for “the real test for
what a national park should be. Subsequent visi- historical GIS as a discipline” (p. 644).
tors encountering the re-natured landscape, now Specific outcomes of the MCHGIS include
devoid of farm fields, pastures, and houses, have data integration, visualization, and analysis, which
reinforced for them the concept of a national park Gregory et al. (2001) identify as the three advan-
as a wilderness devoid of human imprint. Thus, tages of using GIS in historical research. Public

Figure 2. The Mammoth Cave Historical GIS with parcel and house layers shown over a modern
variable-resolution streaming topographic map. The MCHGIS reveals an extensive pre-park settlement
pattern (identified occupants from 1920 are shown as dark gray box with white outline in print edition)

1313
Historical GIS as a Platform for Public Memory at Mammoth Cave National Park

records and archival photographs are correlated, Preliminary demographic analysis of informa-
using location as an organizing framework (Fig- tion assembled in the MCHGIS suggests that a
ure 3). Linking the manuscript census record for new interpretation of forces behind the park’s
a household to the same location as a photograph creation may be in order. Brunt (2009)2 paints a
of a farmstead or the structure in which the fam- portrait of a relatively stable agrarian society in
ily lived adds a qualitative understanding of living the Mammoth Cave region of 1920. Farming was
conditions that cannot be discerned from census overwhelmingly the most common occupation
data alone, making possible new understandings among the slightly more than 2100 individuals
based on associations among data sets. Superim- living within what would become the park bound-
posing house locations on a topographic map ary. Nearly two-thirds of families owned the homes
rendered with hill shading permits visualization where they lived, in contrast to much higher
of settlement patterns and a better understanding tenancy rates in other parts of the South. Thus,
of how homesteads were situated in the landscape. the removal of the Mammoth Cave population
As more records and photographs are added to should not be confused with 1930s-era rural re-
the MCHGIS, it will become a data-rich explor- settlement programs in which struggling farm
atory environment that will assist genealogists communities were relocated to model settlements.
and family historians, as well as more academi- Such analysis helps frame the creation of Mam-
cally-oriented analysis (Ruvane & Dobbs, 2008). moth Cave National Park as a political approach
to state-wide economic development rather than

Figure 3. The Mammoth Cave Historical GIS combines census, house site, and parcel data with CCC
photographs of dwellings that were razed to make way for the national park. Although the streaming
topographic map has resolution limitations at extremely large scales, it is useful for identifying landscape
features such as the African American cemetery on Jennie Mansfield’s land

1314
Historical GIS as a Platform for Public Memory at Mammoth Cave National Park

a humanitarian response to a failing agricultural create historical databases on individuals in rural


sector. areas of the U.S. are few because of the greater
The example of Brunt (2009) places the MCH- difficulty of attributing location to rural house-
GIS within the emerging field of digital humani- holds. Household census records in many U.S.
ties, establishing it as more than a digital archive, cities included street addresses as early as 1880,
but also a tool, study object, exploratory laboratory but rural areas lagged in systematic use of street
and activist venue for engaging with and better names and numbers (DeBats, 2008). As part of a
understanding our world (Svensson, 2010). While series of four case studies creating individual-level
early phases of the project focus on assembling GIS for different settlements in North America,
a digital library of materials and making them DeBats (2009) mapped 71% of inhabitants of
publically accessible through innovative and ex- rural Washington County, Oregon, to a place of
perimental geospatial techniques, their potential as residence for the 1850s. This was the only rural
an object of study and analysis is exemplified by U.S. case study among the four, and the high level
Brunt (2009). The activist element of the project is of mapping was made possible by a set of federal
embodied in the preservation and dissemination of land plats unique to the Oregon Territory.
public memory about Mammoth Cave’s displaced The MCHGIS attempts to map all households
communities in the face of institutional silences. from the 1920 manuscript census for the area that
The authors’ continued engagement with National would become Mammoth Cave National Park.
Park personnel to encourage interpretation of this U.S. Census Bureau policy mandates that data for
aspect of the park’s cultural history may likewise recent censuses be aggregated because of privacy
be considered an activist endeavor. concerns, but individual data is released after 72
years. Thus, images of the original forms filled
out by census takers as they went from house to
MAPPING RURAL HOUSEHOLDS house are available for both the 1920 and 1930
FROM THE CENSUS censuses. Of the two, the 1920 census provides
the better snapshot of the pre-park population for
Mapping entire communities at the household Mammoth Cave. Although most land purchases
level rather than using aggregated data or sample for the national park were made during the 1930s,
populations not only allows more precise measure- land speculators became active after Congress
ment and visualization, but also permits analysis authorized the park in 1926, and some families re-
of within-group variation and individual-group located during the late 1920s in anticipation of the
relationships to better reveal social dynamics park’s creation. The manuscript census contains a
(DeBats, 2008). “In the historical world, how- wealth of socio-economic data about individuals,
ever, the scarcity of places fully mapped at the including age, occupation, homeownership, and
individual level remains a limiting feature for the household composition. Mapping the population
spatial analysis of past communities,” in large part at the household level rather than aggregated by
because the kind of spatial data about individuals census tract or other administrative unit both
that we take for granted in contemporary society permits fine-grained demographic analysis and
is lacking (DeBats, 2008, pp. 17-18). Historical allows the household to serve as a container for
GIS that have mapped individuals or households other types of data as the GIS expands.
are almost exclusively for urban areas. Urban A significant challenge in using the 1920 census
households are often possible to locate precisely for rural Kentucky is that no location information
thanks to Sanborn fire insurance maps or plat maps below the level of the census district is available.
created for utility or tax purposes. Attempts to Street names and house numbers were not then

1315
Historical GIS as a Platform for Public Memory at Mammoth Cave National Park

in use in the rural portions of the counties (Bar- properties, and thus, were not residential. When
ren, Edmonson, and Hart) from which Mammoth overlaid with the house site layer, 152 tracts were
Cave National Park was carved. While manuscript found to contain no houses, leaving 377 residential
census forms for towns such as Brownsville, the tracts. Over a third of the residential parcels (142)
Edmonson County seat, have marginal notations contained multiple dwellings. In this region, the
naming the street on which a house lies, forms level of absentee ownership was relatively low,
for rural areas simply number houses sequentially, but because extended family members or tenants
with no notation when the census worker left one typically occupied additional houses on a family
road for another or of which side of the road a farmstead, knowing a parcel’s owner was in many
house occupied. Moreover, other forms of spatial cases not sufficient to pinpoint the house occupied.
residential information, such as city directories, A three-prong methodology was developed to
fire insurance maps, plat maps, and utility records match households listed in the 1920 manuscript
are lacking for this area. census to house sites: 1) probable residency based
Two other data sets were useful for placing on the owner of a tract with a single house on it
households on the landscape – known house sites being listed in the census as a head-of-household
and parcels. House sites were digitized by Park and a home-owner; 2) local historical knowledge of
Service personnel from a topographic map created a key informant; and 3) interpolation of household
by a 1930 U.S. Department of the Interior survey records along a road between two existing matches.
as part of the land acquisition process for the park. It is recognized that error may be introduced
This resulted in a point data layer identifying during each phase of the methodology and that
664 former residences, but with no information error propagation may occur when interpolating
about who lived in the houses. These structures during the third phase from presumed matches
were dismantled during the 1930s by the Civilian made during the first two phases. As Gregory and
Conservation Corps, which supplied manpower for Healy (2007) note, “Historical GIS databases are
most of the landscape alterations needed to ready rarely a simple digital facsimile of a single source.
the park for opening, including removing fences, Instead they take data from multiple sources,
filling stock ponds, and a massive tree-planting integrate them in a manner that is sympathetic to
campaign. The challenge for the MCHGIS was to the sources’ limitations, and create metadata and
match these former house sites with the families documentation to record the sources and standards
who occupied them in 1920. The temporal offset used” (p. 639). In order to track uncertainty, an
between the 1920 census and 1930 survey of house attribute was added to the MCHGIS house site
sites must be kept in mind. A few houses extant in layer indicating the basis on which a match was
1920 may not appear in the 1930 data set because made with a census household and the level of
of their destruction, e.g. by fire, and any houses certainty of that match. This is what Gregory and
constructed after 1920 will not have a matching Healy (2007) term the “documentary” approach
census household. The latter are likely to be few, to handling error in historical GIS databases (p.
however, particularly after the 1926 authorization 641). As new sources of data are uncovered or
of the national park when it was known that the additional informants provide more information,
area was to be cleared of inhabitants. we will be able to reevaluate and revise matches
Park Service personnel also digitized parcel based on the best available information.
boundaries of the 532 tracts purchased from The first phase of the methodology compared
private landholders to form the park. This layer land tract owners with census heads-of-house-
includes the name of the former land owner as an holds. Of the 235 tracts containing a single house
attribute. Thirteen tracts were church or school site, 67 owners were located in the census as

1316
Historical GIS as a Platform for Public Memory at Mammoth Cave National Park

heads-of-households who owned their own homes. participation phase of the project, which will make
A simplifying assumption was made that these the MCHGIS accessible to many more potential
homeowners likely lived on the tract they owned local informants and potentially uncover new
and the census household was matched with the data sources.
house site. The second phase of the methodology
involved interviews with a key informant, Norman
Warnell, who has extensive local knowledge of the DISSEMINATING PRE-PARK
Mammoth Cave region and of the families who HISTORY
used to live there. Mr. Warnell taught science for
many years in the Edmonson County School Sys- A web-based interface was developed for the
tem, authored a folk history of the Mammoth Cave MCHGIS to disseminate materials documenting
region (Warnell, 2006), served as an officer of the the historic settlement history of the Mammoth
Edmonson County Historical Society, and spends Cave region. While the primary data used in the
much of his free time hiking off-trail through the MCHGIS is publicly accessible, there are practi-
park, lending him an intimate familiarity with the cal barriers to access by the general public. For
remnant roads and traces of old homesteads. Mr. instance, the 1920 manuscript census for Kentucky
Warnell verified many of the household-house site is available on microfilm in a number of federal
matches made during the first phase of the project, depository libraries (or on subscription-based
and was able to supply, based on his knowledge services such as Ancestry.com), but one has had
of the region, the names of residents for a number to travel to the library to use the microfilm (or pay
of additional house sites. Some of his matches a subscription fee for online access) and then one
were subsequently verified with former residents simply has the raw census data. It is the integration
and their descendants during a public exhibition of census data in geographic context with topo-
of the MCHGIS at a Homecoming event held at graphic maps and photographs that fosters a richer
the national park, giving the authors a high level understanding of the pre-park communities. Given
of confidence in the matches Mr. Warnell made. the common availability of the internet, a web ver-
The third phase of the methodology located sion of the MCHGIS seems the best way to reach
households within the manuscript census that were the widest possible target audience. The visual
already matched to house sites in close geographic orientation of GIS aligns well with that of the web;
proximity and sought to interpolate the gaps Map-based internet services, such as Mapquest,
both in household records and house sites. That YahooMaps, and GoogleEarth, are ubiquitous, as
is, if the manuscript census shows three records are the concepts of zooming in and out to alter the
between two census household records that are scale of a map, layers of information that can be
already matched to house sites A and B, and the turned on and off, and keyword searches. Thus, the
house site layer shows three houses along the road skills needed to operate a simple web-based GIS
between houses A and B, then we assumed the have become part of American popular culture.
path of the census worker ran between houses A Moreover, the cost-of-entry for the internet has
and B and matched the intervening houses to the decreased dramatically, with cell phones as well
intervening census records (Figure 4). Through as personal computers providing access, and most
the combination of the three methods, 230 of 664 schools and public libraries providing computers
house sites (35%) were matched to 1920 census with internet access.
household records (Brunt, 2009). Although this The web interface was created with ESRI
match rate seems low, we are hopeful of being ArcGIS Server (version 9.3.1) and is hosted on a
able to make additional matches during the public university computer at http://161.6.109.206/mam-

1317
Historical GIS as a Platform for Public Memory at Mammoth Cave National Park

Figure 4. Interpolation of census records between known matches allowed additional houses to be matched
to census households. In this figure, the residents of houses labeled “2” and “5” were determined by
key informants. The corresponding manuscript census records were located. Note that a generational
transfer is evident in the Ritter match, reflecting the temporal disjunction between the 1920 census and
1936 parcel data. When a slight misalignment of layers is taken into account (the Carwell house should
be on the Carwell parcel and the Hogan house on the Hogan parcel), it was possible to match three
additional houses with census records based on an inferred linear route by the census taker

moth_HGIS/. The internet mapping application family name and returns a sorted list of matching
displays MCHGIS data including house, parcel records from the parcel and house layers (Figure
and roads layers on top of a streaming topographic 5). Users can right click on a result and zoom to
map provided by the ESRI Resource Center. Two the matched feature. The search results include
advantages of the streaming map are its seamless- key attributes for matched parcels, such as acre-
ness and its variable resolution, allowing the image age, and for houses that have been matched to a
to remain clear at any zoom level. A high quality census household, including head-of-household
topographic map facilitates user interpretation of name, age, and occupation. A help page explains
the MCHGIS, as many former residents and their all of these features.
descendants associate their memories in refer- Custom HTML code was added to the map
ence to topographic features such as a streams, application to allow hyperlinking of photographs.
ridges, and sinkholes. Layers may be turned on Fields in the parcel and house layers specify the
and off, and standard map navigation tools al- physical file location of a high resolution.jpg and
low the user to zoom and pan. An inset overview metadata documenting the photo’s source. The
map shows the extent of the main frame as the hyperlink is displayed along with other attribute
user changes map scale. The “Search for Family information for a parcel or house, and when a user
Name” tool allows users to enter part or all of a follows the link, a new window displaying the

1318
Historical GIS as a Platform for Public Memory at Mammoth Cave National Park

Figure 5. The parcel and house site layers are searchable by name, allowing users of the Mammoth
Cave Historical GIS to locate properties occupied by ancestors

photograph is opened. Captions written on the a name, or the local use of variant spellings. In
verso of the CCC photographs, presumably by most cases, enough other information was present
the photographer or his assistant, in many cases to make a match. For instance, a photo captioned
identified the parcel number of a farm or the name “House to be razed on tract #281 T. M. Durham”
of a house’s owner or resident, allowing photo- clearly is the house of Tom M. Denham on parcel
graphs to be matched to parcel and house features. #281.
The CCC photographs had been scanned as high A pilot version of the MCHGIS was loaded
resolution.jpgs and the captions entered into a onto a laptop equipped with ESRI ArcMap soft-
database by Park Service personnel. Matching a ware and demonstrated at the 2009 Mammoth
photograph to a house or parcel sometimes re- Cave Homecoming, held on the Fourth of July
quired reconciling spelling variations. Since the weekend. This is an annual event whose origins
authors used processed versions of the captions, date back over a hundred years as a combination
they could not distinguish between data entry community celebration and tourism promotion
errors, cases when the photographer misrecorded during the era of private ownership of Mammoth

1319
Historical GIS as a Platform for Public Memory at Mammoth Cave National Park

Cave. It has been revived by the National Park significance is a key means of fostering national
Service in recent years for the same combination or group identity (Hoelscher & Alderman, 2004;
of purposes. Musical performances, guided hikes Leib et al., 2000), although separate groups may
and other events of interest to the general public not be in consensus about the meaning conveyed
are complemented by activities organized by and by a particular commemorative structure (Forest
for former residents of the park area and their et al., 2004; Linantud, 2008).
descendants, such as a genealogy room where The most obvious examples of public memory
people gather to reminisce and share family his- are statues, museums and other elements of the
tory. Here people display extensive collections built environment, but non-material culture, such
of photographs, records, and genealogy charts as commemorative naming practices, can also be
that document their family’s history. Reflecting key components of public memory (Alderman,
the diaspora of Mammoth Cave area residents, 2000). The evolution of the web has led to virtual
some participants came from as far as Kansas and forms of public memory, particularly web sites
Michigan. It was in this room that the MCHGIS that pay tribute to people and events (Hess, 2007).
was demonstrated. Virtual sites of commemoration have several ad-
Feedback for the pilot system was overwhelm- vantages over material ones. They are accessible
ingly positive from both Mammoth Cave area to a much larger potential audience, and access
descendants and national park personnel. Although is unconstrained by the exigencies of travel to a
the system was in the early stages of develop- physical site. Web sites can be developed rapidly,
ment, participants were receptive, even excited, while bricks-and-mortar monuments are often
to use the interactive map and see family names subject to design review and political contesta-
on parcels and houses and to see photographs ap- tion. For instance, while the official memorial to
pear upon clicking hyperlinks. Several attendees the victims of the September 11 terrorism attack
offered historic photographs from their private at the site of the World Trade Towers languished
collections for scanning and inclusion in the over competing visions, dozens of web tributes
MCHGIS. Additional photographs were submit- appeared. Although created by individuals, these
ted via email the following week after an article web sites opened “space for communal memo-
describing the Homecoming and mentioning the rializing” (Hess, 2007, p. 813) through means
MCHGIS appeared in the Bowling Green Daily such as digital guestbooks that allowed others to
News (Gaines, 2009). post tributes, reflect and grieve online, resulting
in “collections of vernacular voices, often with
contradictory and controversial stances on the
PARTICIPATORY GIS AS interpretations of public memory” (Hess 2007,
A FRAMEWORK FOR p. 828). Such multivocality arguably fulfills the
PUBLIC MEMORY “public” dimension of public memory better than
the elite visions of artists, government officials and
The final phase planned for the MCHGIS is a col- other experts who design and approve the typical
laborative site of public memory for the pre-park statue or museum. A memorial open to comment
communities. Public or collective memory resides and annotation, essentially allowing the public to
in consciously constructed representations of the co-construct it, closes a notable gap in meaning
past, such as monuments and museums that both between production and consumption of public
memorialize the past and, at the same time, con- memory, for a monument may be understood in
tribute to its ongoing social construction (Foote, ways very different from those intended by its
1997). Public memory linked to places of social designers, particularly when the passage of time

1320
Historical GIS as a Platform for Public Memory at Mammoth Cave National Park

has blurred other memories of the person or event Craig et al., 2002; Elwood, 2002; Ghose, 2001).
(Johnston & Ripmeester, 2007). Web-based GIS enhances public participation in
These commemorative web sites are vernacular some of the same ways that web-based memori-
forms of public memory, as opposed to the elite als do – by reaching a larger virtual audience
forms embodied in monuments and museums than could attend a physical meeting, by offering
designed by experts and approved by a political asynchronous access, and by the possibility of
process. Traditionally, vernacular forms of public anonymity, which reduces psychological barriers
memory have been less visible than elite forms, to participation (Wong & Chua, 2001). Although
by virtue of occupying less prominent positions technological barriers to participatory GIS still
in the landscape (e.g. roadside crosses marking exist in developing countries, among most of the
the deaths of ordinary citizens versus generals United States’ population, “GIS technologies are
memorialized with a statue in the town square) no longer elitist” (Wong & Chua, 2001, p. 65),
or because of use of ephemeral materials (e.g. a statement even more true a decade after Wong
flower tributes to Princess Diana placed outside and Chua wrote it.
the gates of Buckingham Palace), but the web, Web sites such as the Mark Twain’s Missis-
with its ability to draw a virtual flash mob of at- sippi Valley (Vandecreek, 2007) and the Civil
tention, through links and blogs, promises to be War-era site, Valley of the Shadow (Thomas &
something of an equalizer in terms of the promi- Ayers, 2003), foster public memory by making
nence of virtual real estate. available a huge variety of materials, text, images,
The methods of public participation GIS and video, for the public to explore. They use a
(PPGIS) will extend the MCHGIS web site to digital library model, presenting materials housed
a virtual site of public memory for the former at academic and historical institutions that have
residents of Mammoth Cave National Park and been selected for preservation by experts and
their descendents (Epperson, 2010). PPGIS, also digitized. This is essentially a top-down approach
called participatory GIS (PGIS) and community- to public memory. The MCHGIS mixes top-down
integrated GIS (Dunn, 2007), is well suited to and bottom-up approaches, allowing integration
fostering a digital form of collaborative public of institutional material selected by experts with
memory, although its use in this manner is rare information from private holdings deemed sig-
to date. The plethora of names for this form of nificant by members of the public. Most of the
GIS signal a technology in its infancy for which material currently available to the public about the
standards, including a name, have yet to be settled, pre-park population is institutional in origin, e.g.
but they also indicate its perceived usefulness, as the census and records collected by the nascent
many researchers are pursuing related forms of national park administration, yet a much greater
development. What their efforts have in common volume of historical material remains in private
is the intent to democratize GIS, making it more hands among former residents of the park area
socially-inclusive by widening access to it among and their descendants. Members of the public may
a variety of social groups and by cultivating those submit material for addition to the MCGIS by
groups’ participation in creation of geospatial data identifying the relevant household and submitting
represented in GIS. a digital file (e.g. a photograph or scanned docu-
Participatory GIS has been most often used in ment) and a signed donation agreement. Ethical
the areas of urban and environmental planning, to concerns require that potential contributors to the
solicit the perspectives and knowledge of residents MCHGIS be aware that their digital donations will
of a place whose lives will be impacted by decisions be freely accessible and downloadable over the
being made about infrastructure and resources (e.g. internet and that they must be free of copyright.

1321
Historical GIS as a Platform for Public Memory at Mammoth Cave National Park

Donors will be asked to sign a contribution from park area who is active in genealogy, and as of June
stipulating these conditions. They will be given a 2010, the Mammoth cave “family” had over 900
choice of being noted as the donor or having their members. Members use the site both for sharing
contribution listed as anonymous. The privacy historical information and for keeping up with
insured by the latter choice may help encourage the current lives of friends and family. Wong and
participation by individuals who would hesitate to Chua (2001) note that web-based PPGIS present
be seen as critical of the National Park Service. It barriers to use by some community groups, but
is typical for web GIS to employ a “gatekeeper” we anticipate that the Mammoth Cave MyFam-
for new submissions (Dunn, 2007) in contrast to ily group, which currently shares an abundance
the wiki model of community-created data sets of digital files, both image and text, and makes
where anyone can write or edit a wiki page. Filter- sophisticated use of online chat and blogging
ing public submissions is desirable for data quality facilities, is a good target audience for a PPGIS
control but also because the data structures behind application because a large number of interested
the web GIS are not directly modifiable by the individuals have demonstrated the kind of com-
public. A GIS expert will need to add hyperlinks puter skills necessary to use a simplified web GIS
for new photographs and incorporate other data interface. Hundreds of photographs have been
into existing structures. uploaded to the MyFamily Mammoth Cave web
Although participatory GIS is generally char- site, but without a strong organizational frame-
acterized as inclusive and democratizing, actual work, images of a particular family or place can
projects generally originate in technologically- be difficult to locate. The geospatial framework
informed settings, especially academia and gov- of the MCHGIS offers a distinct advantage by
ernment. One danger, then, is that because the grouping materials related to a family, neighbor-
projects are conceived by technocrats rather hood, and community. Moreover, as a publicly
than by members of the target communities, “‘a accessible web site, anyone can use the MCHGIS,
web-based PPGIS may run the risk of providing not just those who have been invited to join, as
services that the public does not want” (Wong & with MyFamily.com. In keeping with Dunn’s
Chua, 2001, p. 626). Since the MCHGIS was de- (2007) view of community-integrated GIS, the
veloped by an academic group, we should explain MCHGIS “acknowledges the ‘expert’ nature of
our perception of the need for it. First, the family GIS as a technology but enhances citizen access
history activities at the Mammoth Cave Fourth and participation and, hence, the democratic
of July Homecomings demonstrate a significant potential” (p. 620).
constituency for public remembrance of the pre-
park inhabitants. Second, interest in digital forms
of public remembrance and a willingness to share REPRESENTATION AND
privately-held historical materials are manifest REPRESENTATIVENESS
in the section of the web site MyFamily.com de-
voted to Mammoth Cave. MyFamily is a social One hope of the MCHGIS developers is to reach
networking site run by Ancestry.com intended to former park area residents and descendants who
allow sharing of photographs, videos, and other do not currently participate in Mammoth Cave
files, host discussions, and share an event calendar Homecomings or MyFamily.com and, thus, to
among members of a “family,” with membership broaden the pre-park population represented in the
defined by the person who creates a particular memory-making process. Alderman and Modlin
family group. The Mammoth Cave “family” was (2008) point out that commemorative web sites
created by a descendant of former residents of the can inadvertently create biased understandings by

1322
Historical GIS as a Platform for Public Memory at Mammoth Cave National Park

focusing on the lives of a small number of atypical must await a future study, but if true, differential
individuals who might be taken as representative migration patterns are a historical factor that likely
of a larger group. Their recommended strategy affects participation in Homecoming events and
of documenting the wider community is at the local historical societies.
heart of the MCHGIS methodology. There are
substantial challenges to achieving this goal; how-
ever that is historical and institutional in nature. FUTURE WORK
Participants in the Mammoth Cave Homecom-
ings, the MyFamily web site, and local histori- It is too early to know how successful a participa-
cal societies are the easiest to reach and invite tory MCHGIS will be, but it holds the potential
to become engaged with the MCHGIS project. for broadening representations of the history of
Participation in these venues is self-selected, to the Mammoth Cave region, letting the residents
a degree, but also influenced by existing social and their descendants assert a presence that, in
networks and residential proximity. For instance, many ways, has been officially minimized (e.g.
although some Homecoming attendees traveled by the removal of most human artifacts from
from distant states, most of those the authors the landscape and from the lack of interpretive
met lived within a short drive of the park. As an emphasis by park rangers). While it is no doubt
example of non-representative participation, the true that some segments of the former resident
authors did not observe any African Americans population will benefit more from a historical
in the genealogy room during the 2009 or 2010 GIS than others, owing to differences in internet
homecomings, despite the fact that most of the use, interest in genealogy, and amount of family
Mammoth Cave guides prior to park creation history preserved, the MCHGIS offers the poten-
were African American and most lived with their tial for participation to all with a minimum set
families in the Flint Ridge area that is now part of technology resources which are now widely
of the park. Jerry Bransford, a descendent of the available through internet-connected computers
storied Bransford family, many of whom were in homes, public libraries, and schools.
guides during the nineteenth and early twentieth To gauge who is using the web site and how
century’s, continues his family’s guiding tradition they are using it, a survey enables users to provide
as a part-time ranger at the national park. Thus, anonymous feedback. The “Take a Survey” link
the dearth of African American participants in the opens a new window with a ten-question survey
genealogy room activities does not stem from a created in the Qualtrics online survey software.
disinterest in their Mammoth Cave heritage, but The survey, which was approved by the authors’
from other, unknown factors. No systematic study university Human Subjects Review Board, queries
has been done of where the displaced population users about their background, prior experience
went after leaving the Mammoth Cave area. Anec- with web mapping applications, ease in using the
dotal evidence is that some moved to nearby towns, GIS application, and their perception of the site’s
including Cave City, Glasgow, and Louisville, and usefulness. It also provides an opportunity to report
that some who had particularly strong ties to place problems with the web site and leave additional
and the financial wherewithal to do so purchased open-ended feedback. Analysis of survey results
new homes just outside the park boundary. The after the web site has been online for a while will
authors suspect that differences in socioeconomic allow the authors to assess their success in dis-
status, especially home and property ownership, seminating knowledge of the site and tweak its
and race may have influenced migration decisions interface to resolve any usability issues.
following displacement. Testing of this hypothesis

1323
Historical GIS as a Platform for Public Memory at Mammoth Cave National Park

The work of adding historical materials to the privately-held stores of historical materials, thus
MCHGIS will be ongoing. User contributions will increasing publicly accessible materials about the
be incorporated as they are emailed to project region’s people and history.
staff. Based on a few initial contributions already We believe, with Bodenhamer (2007), that “a
received, we anticipate that a certain amount of GIS-facilitated landscape of memory… is a vision
email back-and-forth will be needed to best place worth pursuing” (p. 108) because of its potential for
contributed photographs on the landscape or to fostering a more inclusive construction of history.
pinpoint which house or family a contribution By welcoming multiple voices and privately-held
belongs with. Future plans include expanding memories, the MCHGIS can reveal a much richer
the multimedia capabilities of the MCHGIS to past than can be read from institutional holdings
include audio clips selected from oral history alone, a past that will inform expert interpreta-
interviews that have been recorded with former tions, offer alternative visions, and fill some of the
residents over the years and that reside in the silences surrounding a marginalized group. The
Mammoth Cave National Park collection. Since web-based public participation GIS will integrate
GIS-produced maps have the power to prompt diverse forms of data in a geospatial framework,
recollections, stories, and personal and family serve as a repository of local knowledge, provide
narratives (Bodenhamer, 2007), we also plan to persistent access to historical data, and serve as a
explore using the MCHGIS as a prompt during model for collaborative public history.
future oral history interviews, essentially to talk
with subjects about the materials available on
the site in hopes of evoking thick ethnographic ACKNOWLEDGMENT
descriptions of communities and localities.
The authors would like to thank the generosity
of Charles Silbereisen and the Silbereisen Fund
CONCLUSION in providing financial support that contributed to
this research project. They gratefully acknowledge
The applied value of the MCHGIS stems from the efforts of Kevin Cary, GIS Lab Manager at
its preservation and dissemination of the human Western Kentucky University, for his tireless as-
history of Mammoth Cave National Park. It will sistance in keeping the GIS servers running and
help the general public understand the landscape his instruction in internet GIS. They appreciate
transformation required to create the iconic “wil- the support and assistance of Mammoth Cave Na-
derness” they encounter when they visit the park tional Park personnel in locating archival material,
today, it will help rangers and other park personnel including Bob Ward, Joy Lyons, Terry Langford,
interpret the area’s history, and it will help former Chuck DeCroix, and Lillian Scoggins. And they
residents of the Mammoth Cave region and their are deeply indebted to all those who shared their
descendants retain a valued cultural identity, one memories, family histories, and photographs with
that has been marginalized through neglect in of- us, but especially Normal Warnell for the hours he
ficial histories and interpretations. The MCHGIS spent with us “talking Mammoth Cave.”
integrates and makes accessible data from multiple
archival sources, including the 1920 manuscript
census and a collection of photographs of many of
the pre-park dwellings taken by a CCC photogra-
pher. It also provides a platform for integrating in
the same geospatial framework some of the vast

1324
Historical GIS as a Platform for Public Memory at Mammoth Cave National Park

REFERENCES DeBats, D. A. (2008). A tale of two cities: Using


tax records to develop GIS files for mapping and
Alderman, D. (2000). A street fit for a king: Naming understanding nineteenth-century U.S. cities.
places and commemoration in the American South. Historical Methods, 41(1), 17–38. doi:10.3200/
The Professional Geographer, 52(4), 672–684. HMTS.41.1.17-38
Alderman, D., & Modlin, E. A. Jr. (2008). (In) DeBats, D. A. (2009). Using GIS and individual-
visibility of the enslaved within online planta- level data for whole communities: A path toward
tion tourism marketing: A textual analysis of the reconciliation of political and social history.
North Carolina websites. Journal of Travel Social Science Computer Review, 27(3), 313–330.
& Tourism Marketing, 25(3-4), 265–281. doi:10.1177/0894439308329759
doi:10.1080/10548400802508333
Demeritt, D. (2002). What is the ‘social con-
Algeo, K. (2004). Mammoth Cave and the mak- struction of nature’? A typology and sympathetic
ing of place. Southeastern Geographer, 44(1), critique. Progress in Human Geography, 26(6),
27–47. doi:10.1353/sgo.2004.0002 767–790. doi:10.1191/0309132502ph402oa
Berger, P., & Luckmann, T. (1966). The social Dilsaver, L. M., & Wyckoff, W. (2005). The
construction of reality: A treatise in the sociol- political geography of national parks. Pacific
ogy of knowledge. Garden City, NY: Doubleday. Historical Review, 74(2), 237–266. doi:10.1525/
Bodenhamer, D. J. (2007). Creating a land- phr.2005.74.2.237
scape of memory: The potential of humanities Dunn, C. E. (2007). Participatory GIS -- A people’s
GIS. International Journal of Humanities and GIS? Progress in Human Geography, 31(5),
Arts Computing, 1(2), 97–110. doi:10.3366/ 616–637. doi:10.1177/0309132507081493
E1753854808000207
Dunn, D. (1988). Cades Cove: The life and death
Brown, M. L. (2000). The wild East: A biography of a southern Appalachian community, 1818-1937.
of the Great Smoky Mountains. Gainesville, FL: Knoxville, TN: University of Tennessee Press.
University Press of Florida.
Elwood, S. A. (2002). GIS use in community
Brunt, M. (2009). Analysis of Mammoth Cave planning: A multidimensional analysis of em-
pre-park communities. Unpublished master’s powerment. Environment & Planning A, 34(5),
thesis, Western Kentucky University, Bowling 905–922. doi:10.1068/a34117
Green, KY.
Epperson, A. (2010). Internet GIS as a historic
Campbell, C. (1960). Birth of a national park place-making tool for Mammoth Cave National
in the Great Smoky Mountains. Knoxville, TN: Park. Unpublished master’s thesis, Western Ken-
University of Tennessee Press. tucky University, Bowling Green, KY.
Craig, W. J., Harris, T. M., & Weiner, D. (2002). Foote, K. (1997). Shadowed ground: America’s
Community participation and geographic infor- landscapes of violence and tragedy. Austin, TX:
mation systems. London, UK: Taylor & Francis. University of Texas Press.
Forest, B., Johnson, J., & Till, K. (2004). Post-total-
itarian national identity: Public memory in Germa-
ny and Russia. Social & Cultural Geography, 5(3),
357–379. doi:10.1080/1464936042000252778

1325
Historical GIS as a Platform for Public Memory at Mammoth Cave National Park

Frome, M. (1966). Strangers in high places: The Johnston, R., & Ripmeester, M. (2007). A monu-
story of the Great Smoky Mountains. Knoxville, ment’s work is never done: The Watson monument,
TN: University of Tennessee Press. memory, and forgetting in a small Canadian city.
International Journal of Heritage Studies, 13(2),
Gaines, J. (2009, July 4). Mammoth homecom-
117–135. doi:10.1080/13527250601118951
ing: National park hosts gathering for hundreds
of former residents of the area, once known as Lally, K. (1987). Unpublished oral history
Northtown. Bowling Green Daily News. interview with Vernon Wells. Bowling Green,
KY: Mammoth Cave National Park Curatorial
Ghose, R. (2001). Use of information technology
Archives.
for community empowerment: Transforming
geographic information systems into community Lambert, D. (1989). The undying past of Shenan-
information systems. Transactions in GIS, 5(2), doah National Park. Lanham, MD: Roberts
141–163. doi:10.1111/1467-9671.00073 Rinehart.
Gregory, I. N., & Healey, R. G. (2007). Historical Leib, J. I. (2004). Robert E. Lee, ‘Race,’ repre-
GIS: Structuring, mapping, and analysing geogra- sentation and redevelopment along Richmond,
phies of the past. Progress in Human Geography, Virginia’s Canal Walk. Southeastern Geographer,
31(5), 638–653. doi:10.1177/0309132507081495 44(2), 236–262. doi:10.1353/sgo.2004.0025
Gregory, I. N., Kemp, K., & Mostern, R. (2001). Leib, J. I., Webster, G. R., & Webster, R. H. (2000).
Geographical information and historical research: Rebel with a cause? Iconography and public
Current progress and future directions. His- memory in the Southern United States. GeoJour-
tory & Computing, 13(1), 7–21. doi:10.3366/ nal, 52, 303–310. doi:10.1023/A:1014358204037
hac.2001.13.1.7
Linantub, J. L. (2008). War memorials and memo-
Hess, A. (2007). In digital remembrance: Ver- ries: Comparing the Philippines and South Korea.
nacular memory and the rhetorical construction International Journal of Heritage Studies, 14(4),
of web memorials. Media Culture & Society, 347–361. doi:10.1080/13527250802155869
29(5), 812–830. doi:10.1177/0163443707080539
O’Brien, W. (2007). The strange career of a Florida
Hoelscher, S., & Alderman, D. H. (2004). Memory state park: Uncovering a Jim Crow past. Histori-
and place: Geographies of a critical relationship. cal Geography, 35(1), 160–184.
Social & Cultural Geography, 5(3), 347–355.
Pierce, D. S. (2000). The Great Smokies: From
doi:10.1080/1464936042000252769
natural habitat to national park. Knoxville, TN:
Horning, A. J. (2000). Shenandoah’s secret history. University of Tennessee Press.
Archeology, 53(1), 44–52.
Powell, K. M. (2007). The anguish of displace-
Horning, A. J. (2001). When past is present: ment: The politics of literacy in the letters of
Archaeology of the displaced in Shenandoah mountain families in Shenandoah National Park.
National Park. Retrieved from http://www.nps. Charlottesville, VA: University of Virginia Press.
gov/shen/historyculture/displaced.htm
Reader, C., & Reader, J. (1978). Shenandoah
Ise, J. (1961). Our National Park policy: A critical Heritage: The story of the people before the
history. Baltimore, MD: Johns Hopkins Press. park. Washington, DC: Potomac Appalachian
Trail Club.

1326
Historical GIS as a Platform for Public Memory at Mammoth Cave National Park

Ruvane, M. B., & Dobbs, R. (2008). Genealogy, Wong, S., & Chua, Y. L. (2001). Data intermedia-
historical geography, and GIS: Parcel mapping, tion and beyond: Issues for web-based PPGIS.
information synergies, and collaborative oppor- Cartographica, 38(3-4), 63–80.
tunities. In Timothy, D. J., & Guelke, J. K. (Eds.),
Young, T. (2006). False, cheap and degraded:
Geography and genealogy: Locating personal
When history, economy and environment collided
pasts (pp. 43–62). Burlington, VT: Ashgate.
at Cades Cove, Great Smoky Mountains National
Schlichting, K., & Tuckel, P. (2006). Residential Park. Journal of Historical Geography, 32(1),
segregation and the beginning of the Great Mi- 169–189. doi:10.1016/j.jhg.2004.06.001
gration of African Americans to Hartford, Con-
necticut. Historical Methods, 39(3), 132–143.
doi:10.3200/HMTS.39.3.132-144 ENDNOTES
Svensson, P. (2010). The landscape of digital 1
For scholarly treatments of the Great Smoky
humanities. Digital Humanities Quarterly, 4(1). Mountains National Park and its displaced
Thomas, W. G., & Ayers, E. L. (2003). An over- population see Brown (2000), Dunn (1988),
view: The differences slavery made: A close Frome (1966), and Pierce 2000; for Shenan-
analysis of two American communities. The doah National Park, see Horning (2000),
American Historical Review, 108(5), 1299–1307. Lambert (1989), Powell (2007), and Reader
doi:10.1086/529967 and Reader (1978).
2
Brunt’s (2009) analysis is for the portion of
Vandecreek, D. E. (2007). Digital history and the national park that falls within Edmonson
the public: Envisioning the Mississippi Valley of County, which comprises the bulk of the
the nineteenth century. International Journal of park. A very small portion of the national
Humanities and Arts Computing, 1(2), 123–136. park lies within Barren and Hart Counties.
doi:10.3366/E1753854808000220
Warnell, N. (2006). Mammoth Cave: Forgotten
stories of its people. Fort Washington, PA: Eastern
National.

This work was previously published in the International Journal of Applied Geospatial Research (IJAGR), Volume 2, Issue 4,
edited by Donald Patrick Albert, pp. 19-37, copyright 2011 by IGI Publishing (an imprint of IGI Global).

1327
1328

Chapter 80
Applied Geospatial Perspectives
on the Rock Art of the
Lake of the Woods Region
of Ontario, Canada
John W. Norder
Michigan State University, USA

Jon W. Carroll
Michigan State University, USA

ABSTRACT
This study examines the role of rock art in the construction of Woodland Period (300 BC to AD 1700)
hunter-gatherer landscapes in the Lake of the Woods region of northwestern Ontario. The authors ex-
amine the distribution of documented pictograph sites relative to the locations of rock formations where
the geologic conditions would have favored the placement of pictographic rock art but are absent. Point
pattern analysis, Monte Carlo simulation, and least cost path analysis were used to analyze the findings.
The authors suggest that pictograph sites were placed at points on the landscape along water routes to
facilitate information exchange among highly mobile hunter-gatherers.

INTRODUCTION sites as records of the ideologies, histories, poli-


tics, and religions of early societies ranging in
The study of rock art has been an area of archaeo- complexity from hunter-gatherers of the Upper
logical interest for well over a century. However, Paleolithic to the empires of Ancient Egypt and
it has only been within the past three decades that Rome. In addition, rock art has been an area of
this heritage resource has gained mainstream greater efforts in heritage management plans as
disciplinary attention. Researchers have become increased development, air pollution, and other
increasingly aware of the significance of these human-related factors have either destroyed,
degraded, or currently threaten these sites. In
DOI: 10.4018/978-1-4666-2038-4.ch080 this context, the study of rock art sites takes on

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Applied Geospatial Perspectives on the Rock Art of the Lake of the Woods Region

greater urgency, not only for archaeologists, but components of the spiritual landscape of historic
for people with connected histories to these places and ancestral Algonquian Indian communities.
such as those in this study. Recent and current research has focused more
This study examines the role of rock art in the on this latter topic further linking these places to
construction of Woodland Period (300 BC to AD descendant First Nations aboriginal communities
1700) hunter-gatherer landscapes in the Lake of as part of their contemporary cultural landscapes
the Woods region of northwestern Ontario (Fig- (Arsenault, 2004; Norder, 2007, in press-a).
ure 1). Across the broader Canadian Shield, in This paper builds on work conducted by one
which the study area is located, rock art has long of the authors (Norder, 2003) that examines an
been suggested as a means by which ancestral additional area of study by focusing on the po-
Algonquian Indian communities communicated tential significance of pictographic rock art site
information. The majority of research has focused placement not just as part of a spiritual landscape,
on images at these sites and has been informed by but as part of the broader social landscape of the
ethnohistoric research and ethnographic fieldwork makers. Fieldwork conducted in 2000 (Norder,
among Algonquian speaking peoples of the region 2003) and 2005 (Norder, in press-b) strongly
– the Cree, Ojibwe, and Salteaux First Nations suggested that locations of pictographs, or paint-
(Dewdney & Kidd, 1972; Conway, 1993; Moly- ed rock art sites (Figure 2), were intentionally
neaux, 1980; Rajnovich, 1989, 1994; Steinbring, selected and placed on the landscape to foster
1992; Vastokas & Vastokas, 1973). Interpretations forms of secular information exchange related to
derived from this research have emphasized reli- population mobility, and seasonal aggregation.
gious and ritual meanings of these sites in terms of Despite these initial findings, it has remained to
cosmological significance, ritual practice, and as be determined more conclusively whether the

Figure 1. Example of a CSRAT pictograph site (photograph by J. Norder)

1329
Applied Geospatial Perspectives on the Rock Art of the Lake of the Woods Region

spatial distribution on the landscape is a result of upper Great Lakes region and westward up through
random behavior or if there is some form of pat- Manitoba into Saskatchewan. This geographic
terning associated with their production. Using range is roughly equivalent to the spread of mem-
point pattern analysis, Monte Carlo simulation, bers of sub-Arctic Algonguian speaking peoples
and least cost path analysis, we propose that that include, as noted above, groups of Ojibwe,
pictograph sites were placed in strategic locations Cree, Saulteaux and a variety of other culturally
along specific water routes, which served to fos- and linguistically related groups. As a result, and
ter information exchange and guide travel and with supporting archaeological evidence, CSRAT
other forms of secular human activities and be- sites are considered to be Algonquian in origin
haviors. Understanding this intentional placement and this recognition has subsequently been used
of pictographs speaks to how these pre-Contact as justification for application of historic and
aboriginal communities actively used the land- ethnohistoric accounts to the interpretations of
scape to facilitate information exchange predict- these locations.
ably across space and time in structured and so- Rock art in the CSRAT occurs in four forms.
phisticated manners that have previously been The first, and the subject of the authors’ research
overlooked. here, are pictographs (Figure 2). These also are
the dominant rock art form of the CSRAT. The
second are petroglyphs, which are images that
PICTOGRAPHIC ROCK ART IN are carved, incised or pecked into a stone surface.
NORTHWESTERN ONTARIO The third are petroforms, and specifically refer to
arrangements or alignments of boulders that either
Rock art within the study area is considered to depict recognizable human/animal figures or form
be part of a larger regional expression of rock art abstract geometric patterns. The fourth and last
referred to as the Canadian Shield Rock Art Tradi- are an unusual form of geoglyph, referred to in
tion (CSRAT). The CSRAT has a site distribution the region as lichenographs, and are made by the
that spreads from Quebec in the east through the

Figure 2. Location of study area

1330
Applied Geospatial Perspectives on the Rock Art of the Lake of the Woods Region

scraping away of surface molds and lichens from & Conway, 1990; Dewdney, 1975; Dewdney &
a rock surface to create an image. Kidd, 1967; Vastokas & Vastokas, 1973).
While the majority of pictographic rock art The results of these studies have shed a great
in northwestern Ontario is likely pre-European deal of insight into elements of cosmological and
in origin the exact time span of its production is ritual behaviors and meanings associated with
not known with any certainty. Part of this stems these sites, but as with dating, the precise mean-
from the nature of the production of the art itself. ing of any given composition remains elusive.
The primary pigment used in the production of As noted by many of the researchers (Rajnovich,
CSRAT pictographs, derived from the ground 1994; Dewdney & Kidd, 1967), despite insights
mineral red ocher, is not dateable by any direct provided by ethnographic and ethnohistoric
means (Keyser & Klassen, 2001; Nieves, 2006). sources, any single image could represent any
However, it should be noted that based on eth- number of real world phenomena or cultural
nohistoric and ethnographic sources red ocher ideological concepts. As a result, without direct
was likely mixed with an organic binding agent, transmission of a composition’s meaning from
usually some form of animal fat or fish oil that maker to audience and transgenerationally from
would have been dateable. Nevertheless, direct that point onward the meaning of a rock art site
exposure to the extreme seasonal weathering becomes altered or lost. As ethnographic field-
processes of the region would have dissolved work conducted with contemporary Algonquian
this organic component after a relatively short descendants of the makers in 2005 indicated, this
period of time. Regardless, there is sufficient was much the case (Norder, 2007, in press-a).
circumstantial evidence based on multiple lines To date, systematic documentation and
of ethnographic, ethnohistoric, and archaeological research of the rock art of Lake of the Woods
evidence to suggest an antiquity consistent with has been limited, with the majority of surveys
the Woodland Period of the region (Rajnovich, conducted either by provincial government ar-
1994; Arsenault et al., 1995), during which the chaeologists in a patchwork fashion throughout
most developed hunting and gathering societies the region (Lambert, 1985; Pastershank, 1989;
inhabited the region (Reid, 1977). Rusak, 1992) or as part of single or specific site
As with rock art globally, difficulty in direct comparisons (Conway, 1989; Rajnovich, 1980a,
dating has focused most research on the pursuit of 1980b; Steinbring & Simpson, 1983). The first
the meaning of rock art imagery (Chippendale & systematic survey of the region was begun by
Nash, 2004; Chippendale & Taçon, 1998; Clotte Norder in 2000 and elaborated substantially in
& Lewis-Williams, 1998; Coulson & Campbell, 2005. The goal of the initial survey in 2000 was
2001; David, 2001; Diaz-Granados & Duncan, to research the potential for identifying functional
2004; Domingo-Sanz, Fiore, & May, 2008; sites types based upon information encoded in a
Hays-Gilpin, 2003; Helskog, 2001; Keyser & given rock art site, a determination based on the
Klassen, 2001; Layton, 1992; Loendorf, Chip- use of the Shannon Information Measure (Hartley,
pendale, & Whitley 2005; Nash & Chippendale, 1992; Hartley & Vawser, 1998; Shannon, 1949).
2002; Schaafsma, 1980). Within the study region The measure determines the relative ease with
considerable effort has been made to interpret the which an observer would be able to interpret a
rock art imagery using methods of standard stylis- given message based on the redundancy of the
tic analysis combined with ethnographic studies content of the composition. Initial results sug-
of the Algonquian descendants of the makers gested that there were at least three different types
(Rajnovich, 1989, 1994; Conway, 1993; Conway of sites that communicated messages of particular
significance to anyone with a basic familiarity with

1331
Applied Geospatial Perspectives on the Rock Art of the Lake of the Woods Region

the regional iconography. In this context, most on locating pictographic and other types of rock
northern Algonquian-speaking peoples would art sites, the methodology employed by Norder
have had this knowledge since they shared similar (2003) was to locate not only pictographic rock
cosmological, ritual, and iconographic systems art sites but to also document locations where
(Norder, 2003, in press). pictographic art could have been placed, referred
Turning to the research presented here, there to from this point forward as ‘sterile’ locations.
were several questions that remained after analy- The identification of sterile locations was based
sis of the results of the 2000 survey data that we on two main criteria. First, they had to be clean
have since sought to address with the survey work surfaces relatively free of the ubiquitous growths
conducted in 2005. First and foremost was to of regional lichens and molds and, as a result,
try and develop a more complete understanding would have been suitable ‘canvases’ for rock art
of the distribution of pictograph sites in a given imagery. Second, they would need to be relatively
area. The work of prior researchers, including easy to see from the water, a common feature of
that conducted by Norder in 2000 (Norder, nearly all documented CSRAT pictograph sites.
2003), remained limited and opportunistic in its As a result, the surface angle of the rock face to
regional coverage. As a result, it was not pos- the water surface had to fall within a range of
sible to determine if the distribution of the sites between 120 to 60 degrees with the ideal, based
was the result of documentary bias or not. Much on normative criteria of other CSRAT pictograph
of the analysis that follows addresses this issue, sites, falling between 90 and 70 degrees. These
but it also examines additional factors related to sterile locations were documented in order to
issues of site placement. As we will demonstrate, demonstrate the relative availability of ‘canvases’
ancestral First Nations populations of the region well in excess of those actually selected (Norder,
clearly had numerous choices available to them but 2003, in press).
there was intentional selection of particular loca- In addition, a series of geomorphological
tions over others. The identification of intentional variables were documented for both rock art sites
spatial placement on the landscape and selection and sterile locations. Regional archaeologists have
of specific locations due to culturally significant suggested that certain geophysical properties of
geomorphological features will hopefully facili- the sites where pictographs and petroglyphs may
tate interpretations regarding the broader social be found were also typically located in conjunc-
significance of CSRAT rock art and potentially in tion with certain surface features such as rock
other comparable socio-environmental contexts. overhangs, places to sit, the presence of cracks,
quartz or other crystalline mineral veins, orienta-
tion along cardinal or ordinal directions, angle to
SURVEY METHODOLOGY water, etc. Such features were demonstrated to
have cultural significance to Algonquian com-
One of the primary features of the survey meth- munities (Arsenault, 2004; Conway, 1993; Mo-
odology employed in both 2000 and 2005 was a lyneaux, 1980; Steinbring, 1992). Basic statistical
100% shoreline survey of a selection of bodies analysis determined that there were several of
of water and navigable river systems in the study these variables, typically visual elements such
region. While this methodology was a standard as the presence of quartz veins and crevices in
feature of provincial archaeological projects done the rock surface, that were directly related to the
during the preceding decades (Adams, 1983; production of rock art sites, confirming elements
Halverson, 1988; Lambert, 1985; Pastershank, of these observations (Norder, 2003).
1989; Rusak, 1989, 1992) where the focus was

1332
Applied Geospatial Perspectives on the Rock Art of the Lake of the Woods Region

The shoreline surveys conducted in 2000 and a two-tiered survey was undertaken. All lakes
2005 documented 51 pictograph sites (44 previ- with shorelines greater than 5 km in length were
ously documented sites and seven new pictograph surveyed in these two watersheds, while a 20
sites identified in the 2000 survey) and 610 percent random sample of those with smaller cir-
sterile locations (Figure 3). Because of the large cumferences were surveyed. Each watershed was
number of lakes within the study area, n>2000, further divided into four quadrants for a total of
a systematic strategy was employed in 2005 that eight survey regions. Six of these were surveyed
focused on two of the major watersheds that empty using this methodology. The remaining two were
into Lake of the Woods (Figure 3). Within these not due to time and access limitations. Two of the
watersheds, comprised of slightly more than 300 lakes surveyed in 2000 fell into the 2005 survey
navigable lakes and interconnecting waterways, area and were resurveyed to check for consistency

Figure 3. Locations of pictograph sites, sterile locations, and 2005 survey areas

1333
Applied Geospatial Perspectives on the Rock Art of the Lake of the Woods Region

of documentation procedures between the field water bodies is relatively ubiquitous across the
seasons. Overall, the combined shoreline surveyed survey area and the availability of surfaces with
represented approximately 60 – 70% of the total both rock art and sterile locations on any given
available shoreline of all bodies of water in the body of water was generally only limited by the
surveyed quadrants. size of the water body. In other words, every lake
In addition to the documentation procedures surveyed had available and/or utilized surfaces.
noted above sites were also coded for particular As clustering was demonstrated for both location
image types based on a hybrid classification types we decided to take another approach toward
system of those previously developed for Cana- understanding the patterning of both sites and
dian Shield pictographic art (Dewdney & Kidd, sterile formations.
1967; Maurer & Whelan, 1977; Whelan, 1983)
and consisted of 10 major image classes and 57 Global and Local Moran’s I
sub-classes. This information was used in the
information calculation noted above. Point loca- The global Moran’s I statistic (Mitchell, 2005;
tions for both pictographs and sterile locations O’Sullivan & Unwin, 2003) was used because of
were collected using a Garmin GPSMAP 60 and its ability to analyze point patterns and assess dis-
then imported into ArcGIS 9.3. A Microsoft Ac- tributions across a region based on specific criteria.
cess personal database was created and populated In this case, the variable that we were concerned
to manage survey data that was later tied to point with was the presence or absence of rock art. A
locations. local Moran’s I (LISA) test (Anselin, 1995) was
subsequently run to identify any further cluster-
ing at a local scale (O’Sullivan & Unwin, 2003).
ANALYSIS These calculations were both run to determine
the degree to which rock art sites (represented as
Our main object here is to determine the degree points in the GIS) were either clustered, dispersed,
to which rock art sites occur in discrete, non- or distributed randomly throughout the study
random spatial patterns on the landscape. Point area (represented as polygons) (Conolly & Lake,
pattern analysis was selected first to determine 2006; O’Sullivan & Unwin, 2003; Wong & Lee,
if these sites were arranged in an ordered man- 2005). Both documented pictograph sites (n=51)
ner or if they were scattered randomly across the and sterile formations (n=610) were grouped and
landscape. The nearest neighbor analysis (Table included in the calculation, with the presence of
1) indicates clustering for both sites and sterile pictographs serving as the variable used to assess
formations at the 0.01 significance level. This spatial autocorrelation (Wheatley & Gillings,
was not unexpected given that the distribution of 2002). Our null hypothesis for these two tests was
that there was no significant statistical difference
in spatial autocorrelation between pictograph sites
Table 1. Nearest neighbor analysis for both sites and sterile formations.
and sterile formations The results of the global Moran’s I trial using
the values from the survey datasets resulted in a
Sites Sterile Formations
Moran’s I index of 0.11 at the 0.05 significance
N 51 610
level with a z score of 2.01 and a critical value of
Nearest Neighbor Ratio .67 .35
1.96. These scores indicate that rock art sites in
Z-score -4.38 -30.83
the survey area occur in a clustered pattern with
Critical Value -2.58 -2.58 less than a 5% chance that the clustering is due

1334
Applied Geospatial Perspectives on the Rock Art of the Lake of the Woods Region

to random chance. Where the global Moran’s I is chance. To address this issue we thought it ap-
able to characterize spatial autocorrelation across propriate to assess the likelihood that a global
the study area as a whole the LISA test is able to Moran’s I calculation would return a similar result
pinpoint whether clustering is occurring at local if the site presence variable was ‘reshuffled’. For
scales. The results indicated that 74.5% (n=38) this we chose to use a Monte Carlo approach to
of the pictograph sites demonstrated clustering test randomized datasets and to compare those
at the 0.05 significance level, demonstrating a results to the original global Moran’s I analysis.
high positive z score meaning that these sites
are clustered around other sites (Mitchell, 2005; Monte Carlo Simulation
O’Sullivan & Unwin, 2003). There is a notice-
able fall-off in site placement as distance between Monte Carlo simulation methods assess the likeli-
pictograph sites increase. Forty-nine percent hood that a particular phenomenon does or does
(n=25) of sites are located within 1 kilometer of not occur due to random chance (O’Sullivan &
another site with a mean distance of 3.5 kilometers Unwin, 2003; Steinberg & Steinberg, 2006). A
between sites (Figure 4). One smaller subset of random number generator in Microsoft Excel 2007
pictograph sites (15.6%, n=8) were clustered in was used to create 99 simulated datasets that were
an area where predominately sterile formations then evaluated for spatial autocorrelation using
exist. Low negative z scores associated with these the global Moran’s I statistical analysis. Each
sites indicate that they are surrounded by locations dataset contained the same number of sites (51)
with dissimilar values (Mitchell, 2005; O’Sullivan and sterile formations (610), and the location for
& Unwin, 2003). all 661 points remained the same; however, the
The results of both spatial analyses are not designation for the presence or absence of a rock
conclusive enough to support an interpretation art site at any particular location was randomly
without further exploration at the global scale. varied in all 99 simulations.
Taken as a static distribution of sites, there remains The simulated patterning from these trials
the possibility that clustering is a result of random ranged from random (n=89) to somewhat dispersed

Figure 4. Distance between nearest neighboring pictograph sites in meters

1335
Applied Geospatial Perspectives on the Rock Art of the Lake of the Woods Region

(n=10). The mean value for the ninety-nine simu- for site selection because open-air pictograph sites
lated global Moran’s I datasets was 0.017 with a – versus sheltered, hidden, or cave sites – need to
standard deviation of 0.02. It is important to note be created in places where people will encounter
that none of the values for these simulated datasets them if they are to facilitate communication. Rock
displayed a clustered distribution as in the case face angle (measured in degrees) for both sites and
of the actual survey dataset. The results of these sterile formations was compared for statistically
simulations and subsequent point pattern analysis significant differences between the two groups
further support our assertion that pictographs were using an independent samples t test performed
deliberately placed at locations for reasons that in SPSS 16.0 (Sirkin, 1999). The results of the
were not explainable by the earlier research of t test (two- tailed significance = 0.192) indicate
others, leaving the question remaining of “why that there is no significant difference at the 0.05
these places?” level (Norusis, 2008) between the pictograph sites
and sterile formations in terms of rock face angle
Site Choice (Table 2), and this is consistent with Norder’s
(2003) analysis.
The original study done by Norder (2003) assessed
the relationships between several geomorpho- Least Cost Path Analysis
logical variables including the following: panel
orientation, rock face angle, and outcrop surface Norder (2003) observed that all documented
area. For these he found no statistically significant rock art sites are easily observable from passing
relationship to pictograph sites suggesting they watercraft. This is especially important to con-
were not variables considered in the choice of sider when understanding how hunter-gatherers
pictograph creation. We further analyzed a subset interact with and within this particular environ-
of the pictograph sites (n=26) and sterile locations ment. Patterns of hunter-gatherer socio-spatial
(n=139) compiled from the 2000 survey dataset organization differ across the world and the need
(Norder, 2003). One geomorphological factor to move across the landscape varies depending on
crucial to both image creation and visibility, the physical geography, subsistence strategies, territo-
angle of the rock face in relation to water surface, rial behaviors, and a host of other considerations
was used for the trial. This variable is significant (Binford, 2001; Kelly, 1995; Wendrich & Barnard,

Table 2. Results of independent samples t test for rock face angle to water surface

Group Statistics
Site Pres- Std. Devia- Std. Error
ent N Mean tion Mean
No 139 88.5612 4.74894 .40280
Yes 26 90.0000 6.92820 1.35873
Assumptions=Equal variances assumed

Levene’s Test for Equality of


Variances t Test for Equality of Means
95% Confidence Interval of
the Difference
Sig. Mean Differ- Std. Error Dif-
F Sig. t df (2-tailed) ence ference Lower Upper
2.264 0.134 -1.309 163 0.192 -1.43885 1.09902 -3.60900 .73130

1336
Applied Geospatial Perspectives on the Rock Art of the Lake of the Woods Region

2008). In the Lake of the Woods region travel by tions of computer resources to analyze the overall
water or alongside waterways was the primary dataset. Therefore, a subset of sites was selected
means of travel year-round for any sort of long for testing the least cost paths (Figures 6 and 7).
distance movement. Consequently, this had a clear The sites selected for the cost calculation were
influence on how ancestral Algonquian peoples selected from the area where the aforementioned
viewed and interacted with the landscape in terms LISA test revealed a cluster of sites conspicuously
of social practice and ideological belief (Norder, located in an area predominately surrounded by
2003). It is also no coincidence that water travel sterile locations.
is the easiest way to negotiate this rugged terrain. The Spatial Analyst tool in ArcGIS 9.3 was
A further critical step in developing our un- used to calculate the least cost path for travel in
derstanding of how pictograph sites function is to relation to the elevation of the surrounding terrain.
analyze least cost paths and their relation to known Figure 6 illustrates that if a traveler moves from
transportation routes (Lock, 2000; Wheatley & the source point in the east (designated 70105AD)
Gillings, 2002). To this end, topographic and to the destination point (designated 62805Z) in
hydrologic GIS data in both vector and raster for- the west the least cost path for travel is by water.
mats were obtained from the Geography Network Note the four pictograph sites located along the
Canada (ESRI Canada, 2010) and values were shoreline adjacent to the least cost path. A second
combined to form a cost weighted distance raster least cost path was then calculated for a source
(McCoy & Johnston, 2002) with a pictograph site (designated T1) located 7.5 kilometers to the
serving as the source for the cost calculations northeast of the first source point (70105AD).
(Figure 5). The cost raster was assigned a 10-meter Figure 7 illustrates the location of the pictographs
cell size to reflect the same contour intervals as relative to the least cost path for the second trial.
the vector data. It was not possible to create cost This statistical and geospatial analysis pre-
weighted distance rasters for every pictograph sented above suggests an answer to the question
site due to the size of the study area and limita- of why pictographs appear in clustered patterns

Figure 5. Location of pictographs relative to first least cost path trial

1337
Applied Geospatial Perspectives on the Rock Art of the Lake of the Woods Region

Figure 6. Location of pictographs relative to second least cost path trial

Figure 7. Location of the pictographs relative to the least cost path for the second trial

1338
Applied Geospatial Perspectives on the Rock Art of the Lake of the Woods Region

throughout the study area when they could be that required decisions regarding placement and
produced in many locations. If the purpose of what type of information to communicate, which
these images is to convey information then they could be either sacred, secular or, more likely, a
have the requirement to be observable. Pictograph combination of the two.
sites are located where individuals could see them In regards to the potential secular information
from the water, the least cost mode of transporta- encoded in these pictograph sites Norder (2003,
tion through this rugged area. In the case of the in press-b) proposed that there were at least three
example presented in the figures above it is also different classes of pictographs that communicated
important to note that the least cost path matches different types of information, each ranging from
for much of its length a known travel route used by general to specialized in their content. Drawing
the historic First Nations people as they traveled from both information theory and ethnohistorical
through the region (C.S. Reid, personal commu- accounts, he suggested that these sites and their
nication, June 23, 2005), and it is also a modern active use facilitated the exchange of information
recreational canoe route (ESRI Canada, 2010). relating to factors including group identity, land
tenure, the locations of economically significant
resources, ritual practice and belief, and broader
ROCK ART AS MEDIUM TO Algonquian cosmology. Within a landscape of
STRUCTURE MOBILITY abundant lacustrine and riverine features, the
primary mode of travel was via water either by
Our point pattern analysis indicates that pictograph canoe from late spring to fall or by foot when they
sites occur in spatially discrete patterns in the sur- were frozen over.
vey area. This finding supports previous research In this environmental context the placement
interpretations by Norder (2003) that pictograph of pictographs throughout the region in specific
site locations were deliberately selected. That is locations would have been, according to Norder
to say, while the creators of rock art had many (2003), strategic in both facilitating population
options in terms of where they could place these movements and providing information to travel-
images, the results presented above indicate that ers as they passed by specific site types. Given
these locations were selected for reasons that were the nature of Woodland period communities
not defined by just the availability of surfaces. inhabiting the region at large people were not
Some of these choices were defined by certain ignorant of territorial considerations and other
aspects of surface geomorphology associated social obligations (Howey, 2007). Rather, the
with Algonquian religious and cosmological be- people who created these images were hunting,
liefs (Conway, 1993; Norder, 2003; Steinbring, gathering and fishing specialists who needed to
1992), but given the patterning demonstrated by plan and coordinate strategic movements across
our analysis there is reason to suggest that other the landscape in order to access the diverse and
social factors were also a consideration. In this seasonably abundant resource base. Examples
case, linear patterning and clustering of sites along would include, but not be limited to, wild rice
least cost paths, which, in some documented cases fields, spring fish spawning areas (Reid, 1977),
follow known historic Indigenous travel routes, and intercept locations for migratory caribou
indicates that Woodland period peoples may have herds (John O’Shea, personal communication,
used these sites to structure population movements January 6, 2010).
at the regional scale, at least. In sum, the creation These strategic movements, known as residen-
and subsequent use of pictographic rock art in the tial and logistical population mobility, are prac-
study area were the results of social processes ticed in diverse manners by small scale hunting and

1339
Applied Geospatial Perspectives on the Rock Art of the Lake of the Woods Region

gathering communities across the globe (Barnard were both available and could be accessed in a
& Wendrich, 2008; Binford, 1980; Kelly, 1995). landscape where others had prior claim.
For our purposes we refer to population mobility
in the sense defined by Wendrich and Barnard
(2008) where the term refers to the requirement, CONCLUSION
capability, and choice of humans to access cul-
turally defined resources across the landscape. The purpose of this investigation has been to
The social context of First Nations communities develop a broader understanding for both the
was one where mobile Algonquian communities placement and function of pictograph sites in the
practiced alternating patterns of aggregation and Lake of the Woods region of Ontario. Point pattern
dispersal, typically at the band and family level, analysis, Monte Carlo simulation and least cost
at different points during the year for social, path analysis indicate that pictograph sites were
economic, and ideological purposes (Norder, placed throughout the study area in linear and non-
2003; Reid, 1977). In this sense, social interac- linear clusters that were located along likely travel
tion between hunter-gather communities served routes and locations were people were likely to
to buffer the uncertainty of resources and share aggregate. These pictograph distributions afforded
risk with other groups, ensuring that families and ancestral Algonquian peoples the opportunity to
communities survived during periods of resource maximize information exchange while minimiz-
scarcity (Halstead & O’Shea, 1982). ing the cost of transportation by having clearly
In this setting, pictographic rock art in their marked routes of travel. This is an important
easily visible locations would have provided in- advantage for highly mobile, spatially dispersed
formation not only in the absence of face-to-face hunter-gatherers who needed to access resources
communication, the primary means of obtaining and maintain social ties across a relatively large
the above information needed for scheduling and landscape, such as that of the study area.
exploitation of regional resources, but would have The application of these results to contem-
indicated through their content, placement, and porary research has important implications for
active use where people could likely be found. considering how rock art is perceived in the main-
Active use could have been through the renewal stream literature. Less than a handful of studies
or addition of images at a pictograph site, but of rock art have applied geospatial methodologies
the most expedient method of demonstrating use to any significant degree (Bradley, 1997; Hartley
would have been to leave offerings of some type &Vawser, 1998; McCall, 2010; Norder, 2003; Rob-
that would have been easily visible to those pass- inson, 2010; Smith, 2001) despite the recognition
ing by. This latter practice was docmented in both of landscape as a significant aspect of the overall
the ethnohistoric and ethnographic literature in the social context of these sites (Chippendale & Nash,
region (Norder, 2003; Rajnovich, 1994) and is a 2004; David, 2001; Nash & Chippendale, 2002).
practice still done today by Algonquian communi- The work here contributes to an expanded view
ties living in the region (Norder, 2007, in press-a). of rock art as part of a more holistic conception
Overall, we would argue that pictograph sites of cultural landscapes in mobile hunter-gatherer
served as points on the landscape where people societies, and recognizes the significance of these
could access information that was spatial, tempo- sites in secular as well as sacred contexts.
ral, social, and spiritual in nature, which provided As a final note, we want to reiterate the im-
guidance and minimized the potential for social portance of these sites not only to research but
miscommunication in regards to how resources as heritage resources important to contemporary
community and national histories and identities.

1340
Applied Geospatial Perspectives on the Rock Art of the Lake of the Woods Region

Within northwestern Ontario, because of human- Balac, A. M., Chapdelaine, C., Clermont, N., &
induced environmental change, these sites are Duguay, F. (Eds.). (1986). Archeologies Que-
vulnerable as are rock art sites worldwide. For becoises (pp. 17–57). Montreal, QC, Canada:
example, since one of the one of the authors be- Recherches Amerindiennes au Quebec.
gan visiting these sites in 1998 in northwestern
Barnard, H., & Wendrich, W. (Eds.). (2008). The
Ontario, the past 12 years has seen the degradation
archaeology of mobility: Old World and New
of several of them at a pace that belies their great
World nomadism. Los Angeles, CA. Los Angeles:
antiquity. In other areas of northwestern Ontario,
University of California.
humans have defaced these sites with graffiti or
removed portions of sites for private collections Binford, L. (1980). Willow smoke and dogs’ tails:
or sale. These sites are valuable cultural resources Hunter-gatherer settlement systems and archaeo-
that need to be preserved for future generations. logical site formation. American Antiquity, 45(1),
Without them, we will be missing important in- 4–20. doi:10.2307/279653
formation that helps us to understand how people
Binford, L. (2001). Constructing frames of refer-
have interacted across space and time and the rich
ence: An analytical method for archaeological
heritage of First Nations communities.
theory building using hunter-gatherer and envi-
ronmental data sets. Berkeley, CA: University of
California Press.
REFERENCES
Bradley, R. (1997). Rock art and the prehistory
Adams, N. (1983). An archaeological survey of of Atlantic Europe: Signing the land. New York,
the Nakina Lakes area near Geralton, Ontario. NY: Routledge Press.
Toronto, ON: Ontario Ministry of Citizenship
and Culture. Chippendale, C., & Nash, G. (Eds.). (2004).
The figured landscapes of Rock-Art: looking at
Anselin, L. (1995). Local indicators of spatial pictures in place. New York, NY: Cambridge
association-LISA. Geographical Analysis, 27(2), University Press.
93–115. doi:10.1111/j.1538-4632.1995.tb00338.x
Chippendale, C., & Taçon, P. (Eds.). (1998). The
Arsenault, D. (2004). From natural settings to archaeology of rock-art. New York, NY: Cam-
spiritual places in the Algonkian sacred land- bridge University Press.
scape: An archaeological, ethnohistorical and
ethnographic analysis of Canadian Shield rock- Clotte, J., & Lewis-Williams, D. (1998). Shamans
art sites. In Chippendale, C., & Nash, G. (Eds.), of prehistory. New York, NY: Harry N. Abrams
The figured landscapes of rock-art: Looking at Publishing.
pictures in place (pp. 289–317). New York, NY: Conolly, J., & Lake, M. (2006). Geographical
Cambridge University Press. information systems in archaeology. Cambridge,
Arsenault, D., Gagnon, G., Martijn, C., & Watch- UK: Cambridge University Press.
man, A. (1995). Le projet Nisula: recherché plu- Conway, T. (1989). Scotia Lake pictograph site:
ridisciplinaire autour d’un site á pictogrammes shamanic rock art in northeastern Ontario. Man
(DeEh-1) en Haute-Côte-Nord. Archaéologies in the Northeast, 37, 1–23.
québécoises, série Paléo-Québec, 23, 17-57.
Conway, T. (1993). Painted dreams: Native Ameri-
can rock art. Minocqua, WI: Northword Press.

1341
Applied Geospatial Perspectives on the Rock Art of the Lake of the Woods Region

Conway, T., & Conway, J. (1990). Spirits on stone: Hartley, R. (1992). Rock art on the northern Colo-
The Agawa pictographs. Echo Bay, ON, Canada: rado Plateau: Variability in content and context.
Heritage Discovery Books. Brookfield, VT: Avebury Press.
Coulson, D., & Campbell, A. (2001). African rock Hartley, R., & Vawser, A. (1998). Spatial behavior
art: paintings and engravings on stone. New York, and learning in the prehistoric environment of the
NY: Harry N. Abrams. Colorado River drainage (southeastern Utah),
western North America. In Chippendale, C., &
David, B. (2001). Landscapes, rock-art and the
Taçon, P. S. C. (Eds.), The archaeology of rock-
dreaming: An archaeology of preunderstanding.
art (pp. 185–211). New York, NY: Cambridge
New York, NY: Continuum.
University Press.
Dewdney, S. (1975). The sacred scrolls of the
Hayes-Gilpin, K. (2003). Ambiguous Images:
southern Ojibway. Toronto, ON, Canada: Uni-
gender and rock art. Walnut Creek, CA: Alta
versity of Toronto Press.
Mira Press.
Dewdney, S., & Kidd, K. (1967). Indian rock
Helskog, K. (Ed.). (2001). Theoretical perspec-
paintings of the Great Lakes (2nd ed.). Toronto,
tives in rock art research. Oslo, Norway: The
ON, Canada: University of Toronto Press.
Institute for Comparative Research in Human
Diaz-Granados, C., & Duncan, J. (Eds.). (2004). Culture.
The rock-art of Eastern North America: Capturing
Howey, M. (2007). Using multi-criteria cost
images and insight. Tuscaloosa, AL: University
surface analysis to explore past regional land-
of Alabama Press.
scapes: A case study of ritual activity and social
Domingo-Sanz, I., Fiore, D., & May, S. (Eds.). interaction in Michigan, AD 1200-1600. Journal
(2008). Archaeologies of art: Time, place, identity. of Archaeological Science, 34(11), 1830–1846.
Walnut Creek, CA: Left Coast Press. doi:10.1016/j.jas.2007.01.002
ESRI Canada. (2010). Geography Network Keiser, J., & Klassen, M. (2001). Plains Indian
Canada. Retrieved from http://www.geogra- rock art. Seattle, WA: University of Washington
phynetwork.ca Press.
Halstead, P., & O’Shea, J. (1982). A friend in need Kelly, R. (1995). The foraging spectrum: Diversity
is a friend indeed: Social storage and the origins in hunter-gatherer lifeways. Washington, DC:
of social ranking. In Renfrew, C., & Shennan, S. Smithsonian Institution Press.
(Eds.), Ranking, resource, and exchange: Aspects
Lambert, P. (1985). The Northwestern Ontario
of the archaeology of early European society (pp.
rock art project: The 1984 results. Kenora, ON,
92–100). Cambridge, UK: Cambridge University
Canada: Ontario Ministry of Citizenship and
Press.
Culture.
Halverson, C. (1988). An archaeological survey
Layton, R. (1992). Australian rock art: a new
of Stephen, Cameron, Flint and Kakagi Lakes
synthesis. New York, NY: Cambridge University
in Northwestern Ontario. Kenora, ON, Canada:
Press.
Ontario Ministry of Culture and Communications.
Lock, G. (2000). Beyond the map: Archaeology
and spatial technologies. Oxford, UK: IOS Press.

1342
Applied Geospatial Perspectives on the Rock Art of the Lake of the Woods Region

Loendorf, L., Chippendale, C., & Whitley, D. Norder, J. (in press). -a). The invention and endur-
(Eds.). (2005). Discovering North American Rock ance of memory and place among First Nations
Art. Tucson, AZ: University of Arizona Press. of northwestern Ontario, Canada. International
Journal of Historical Archaeology.
Maurer, J., & Whelan, J. (1977). A Canadian
Shield pictograph classification design. American Norder, J. (in press-b). Landscapes of memory
Antiquity, 42(2), 196–202. doi:10.2307/278980 and presence in the Canadian Shield. In L. Sund-
strom & W. DeBoer (Eds.), Enduring motives:
McCall, G. (2010). Changing views of Drak-
The archaeology of tradition and religion in
ensberg San rock art: Examining landscape use,
Native America. Tuscaloosa, AL: University of
ritual activity, and contact through multivariate
Alabama Press.
content-based spatial analysis. American Antiq-
uity, 75(4), 773–791. Norusis, M. (2008). SPSS 16.0 statistical pro-
cedures companion. Upper Saddle River, NJ:
McCoy, J., & Johnston, K. (2002). Using ArcGIS
Prentice Hall.
spatial analyst. Redlands, CA: ESRI Press.
O’Sullivan, D., & Unwin, D. (2003). Geographic
Mitchell, A. (2005). The ESRI guide to GIS analy-
information analysis. Hoboken, NJ: Wiley.
sis: Vol. 2. Spatial measurements and statistics.
Redlands, CA: ESRI Press. Pastershank, G. (1989). An archaeological survey
of Sabaskong Bay, Lake of the Woods. Kenora,
Molyneaux, B. (1980). Landscape images: rock
ON, Canada: Ontario Ministry of Culture and
paintings in the Canadian Shield. Rotunda: The
Communications.
Magazine of the Royal Ontario Museum, 13, 6–11.
Rajnovich, G. (1980a). Paired morphs at Cuttle
Nash, G., & Chippendale, C. (Eds.). (2002). Eu-
Lake in northwestern Ontario. Canadian Rock
ropean landscapes of rock art. New York, NY:
Art Research Association Newsletter, 13, 46–50.
Routledge Press.
Rajnovich, G. (1980b). Two pictographs near
Nieves, A. (2006). Reconstructing ritual: Some
Ignace, northwestern Ontario. Canadian Rock
thoughts on the location of petroglyph groups
Art Research Association Newsletter, 14, 34–38.
in the Nasca Valley, Peru. In Robertson, E. C.,
Seibert, J. D., Fernandez, D. C., & Zender, M. U. Rajnovich, G. (1989). Visions in the quest for
(Eds.), Space and spatial analysis in archaeology medicine: an interpretation of the Indian picto-
(pp. 217–226). Calgary, AB, Canada: University graphs of the Canadian Shield. Midcontinental
of Calgary Press. Journal of Archaeology, 14(2), 179–225.
Norder, J. (2003). Marking place and creating Rajnovich, G. (1994). Reading Rock Art: The
space in northern Algonquian landscapes: The Indian Rock Paintings of the Canadian Shield.
rock-art of the Lake of the Woods region, Ontario. Toronto, ON, Canada: Natural Heritage/Natural
Unpublished doctoral dissertation, University of History Press.
Michigan, Ann Arbor, MI.
Reid, C. (1977). Environment, man, and Maymay-
Norder, J. (2007). Iktomi in the land of the May- gwayshi: The dimensions of past human cultural
maygwayshi: Understanding lived experience in impact in the Lake of the Woods area. Toronto,
the practice of archaeology among American Indi- ON, Canada: Ministry of Culture and Recreation.
ans/First Nations. Archaeologies, 3(3), 230–248.
doi:10.1007/s11759-007-9044-6

1343
Applied Geospatial Perspectives on the Rock Art of the Lake of the Woods Region

Robinson, D. (2010). Land use, land ideology: Steinbring, J. (1992). Phenomenal attributes: sites
an integrated geographic information systems selection factors in rock art. American Indian
analysis of rock art within south-central California. Rock Art, 17, 102–108.
American Antiquity, 75(4), 792–818.
Steinbring, J., & Simpson, D. (1983). The Mud
Rusak, J. (1989). An archaeological survey on Portage site (DkKr-4): a preceramic context for
a portion of the Winnipeg River, Northwestern petroglyphs in Northwestern Ontario. American
Ontario. Kenora, ON, Canada: Ontario Ministry Indian Rock Art, 10, 145–156.
of Culture and Communication.
Vastokas, J., & Vastokas, R. (1973). Sacred art
Rusak, J. (1992). The White Otter Lake pictograph of the Algonkians: A study of the Peterborough
project: The 1991 results. Kenora, ON, Canada: petroglyphs. Peterborough, ON: Mansard Press.
Ontario Rock Art Conservation Association.
Wendrich, W., & Barnard, H. (2008). The ar-
Schaafsma, P. (1980). Indian rock art of the chaeology of mobility: Definitions and research
Southwest. Santa Fe, NM: School of American approaches. In The archaeology of mobility: Old
Research. World and New World nomadism (pp. 1–21).
Los Angeles, CA: University of California, Los
Shannon, C. (1949). The mathematical theory
Angeles.
of communication. Urbana, IL: University of
Illinois Press. Wheatley, D., & Gillings, M. (2002). Spatial
technology and archaeology: The archaeological
Sirkin, R. (1999). Statistics for the social sciences.
applications of GIS. New York, NY: Taylor &
Thousand Oaks, CA: Sage.
Francis. doi:10.4324/9780203302392
Smith, H. (2001). Rock art and the shape of land-
Whelan, J. (1983). Context and association in
scape. In Szabo, J. (Ed.), Painters, patrons and
Canadian Shield rock art systematic. Canadian
identity: Essays in Native American art to honor
Journal of Archaeology, 7(1), 77–84.
J. J. Brody (pp. 211–240). Albuquerque, NM:
University of New Mexico Press. Wong, D., & Lee, J. (2005). Statistical analysis
of geographic information with ArcView GIS and
Steinberg, S., & Steinberg, S. (2006). GIS: Geo-
ArcGIS. Hoboken, NJ: John Wiley & Sons, Inc.
graphic information systems for the social sci-
ences: Investigating space and place. Thousand
Oaks, CA: Sage.

This work was previously published in the International Journal of Applied Geospatial Research (IJAGR), Volume 2, Issue 4,
edited by Donald Patrick Albert, pp. 77-92, copyright 2011 by IGI Publishing (an imprint of IGI Global).

1344
1345

Chapter 81
Cartography and Stability to
Enhance and Self Regulate
Proactive Routing in MANETs
Mohamed Amine Abid
University of Manouba, Tunisia

Abdelfettah Belghith
University of Manouba, Tunisia

ABSTRACT
In this chapter, the authors propose a novel proactive routing protocol where routing advertisement
frequency is self regulated locally at and by each node depending on its mobility level. The proposal
relays on an asynchronous distributed cartography gathering algorithm and a link stability criterion.
The cartography of the network is kept constantly valid in time and for all applied speeds. Each node
senses its own dynamics and locally chooses an appropriate routing advertisement period size. As such
stationary nodes generate little signaling traffic while fast moving nodes prefer small routing periods
to mitigate the effect of their mobility.

INTRODUCTION percent routing validity while that of OLSR can


hardly deliver more than 60 percent at moderate to
The performances of our proposed routing pro- high speeds and workloads. Finally, we show that
tocol are evaluated and compared to the known our proposed protocol provides better throughput
Optimized Link State Routing (OLSR) protocol than OLSR reaching a 50% increase at moderate
through extensive simulations. We first show that to high speeds and workloads and far much less
the collected network cartography maintains over end to end delays.
time a validity ratio near one hundred percent Routing in multi hop mobile ad hoc networks
even for high node speeds. We then show that (MANETs) plays a central and vital role, yet it
our proposed routing protocol provides around 97 stands out as one of the most and fundamental
issues to attain adequate performances and viable
DOI: 10.4018/978-1-4666-2038-4.ch081 deployments. The crux of routing in such networks

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Cartography and Stability to Enhance and Self Regulate Proactive Routing in MANETs

lies in its ability to maintain and provide valid choosing routes based on positions, battery level,
routes without which traffic may wander inside etc. of the nodes would make selected paths more
the network without being able to be delivered resilient to topological changes.
to its ultimate destinations. Worse yet, on such Numerous simulation studies were conducted
invalid routes, the underlying Media Access Con- on different scenarios to evaluate the performance
trol (MAC) protocol retransmits unacknowledged of conventional protocols: for both proactive and
unicast frames for several (by default seven) reactive routing protocols as those conducted by
times which consequently amounts to over using Clausen, Jacquet and Viennot (2002), Novatnack,
the wireless communication medium and hence Greenwald and Arora (2005), or Mbarushimana
yields poor performances. We may stipulate here Amine and Shahrabi (2002). Nevertheless, due to
that the basic reason of the non proliferation of the huge number of relevant and complex events
such spontaneous networks, despite their practical that can take place in mobile wireless ad hoc
usefulness and need, is mainly due to their poor networks and their impacts on the performance of
performances caused by the lack of viable low the protocols, results do not necessarily agree as
signaling and high validity routing algorithms. to which protocol yields satisfying performances
In mobile ad hoc networks, nodes and links and lower control traffic and overhead. It is now
can appear and disappear spontaneously as a understood that proactive protocols in particular,
consequence of several facts such as the behavior such as OLSR which is the main representative
of users, the depletion of energy resources, but of this family, cannot withstand high mobility in
more inherently and subtly the random mobility an appropriate manner since the validity of the
of the different nodes. These aspects imply a routes decreases rapidly as we get farther from
dynamically and randomly evolving topology in the start of the routing period (Belghith and Abid,
both time and space making the routing function a 2009-1, 2009-2).
real difficult and challenging task. As a result, and In this paper, we propose a novel cartography-
as the elapsed time since the start of the routing based protocol that builds the network cartography
period gets farther, the topological information in an asynchronous distributed fashion, i.e., no
collected at the beginning of the routing period synchronization is required among the nodes in
becomes inaccurate and obsolete leading patently the network. This network cartography provides
to invalid paths. better vision of the network connectivity and al-
Conventional protocols, whether reactive lows selecting viable links in a way that improves
- such as the Dynamic Source Routing (DSR) the routing pertinence as a function of the network
protocol (Johnson, Maltz and Yih-Chun Hu, dynamics. Yet, the signaling overhead required to
2004) and the Ad-hoc On-demand Distance Vec- build the network cartography and to establish
tor (AODV) routing protocol (Perkins, Royer the routing tables consumes less bandwidth than
and Chakeres, 2003) - or proactive - like OLSR that of the well known and adopted OLSR. Our
(Clausen and Jacquet, 2003) and the Destination proposed routing protocol allows different nodes
Sequenced Distance Vector (DSDV) routing to have different routing period sizes. The size
protocol (Perkins and Bhagwat, 1999) -, use in of the routing period adopted by a mobile node
general the hop-count metric to compute shortest is decided locally and dynamically. Conducted
paths towards destinations. However, shortest simulations brought out the betterments achieved
paths are not always reliable especially in case of by our routing proposal in terms of a much better
dynamic networks. Finding stable routes is rather throughput, much less end to end delay and less
the main concern for dynamic multi-hop ad hoc signaling traffic rate than those provided using
networks. Several works already established that OLSR.

1346
Cartography and Stability to Enhance and Self Regulate Proactive Routing in MANETs

The paper is organized as follows: ing is then made through paths maximizing the
received signal strength.
1. In section II, we present some of the relevant Toh (1997) proposed the Associativity-Based
research work done in the field. Routing (ABR) which uses a new metric called
2. Section III presents a mechanism to col- associativity which defines the stability of a link
lect the cartography of the network in a between any given two nodes. ABR considers that
distributed way, and that can be integrated the longer the two nodes have being neighbors,
to common proactive routing protocols. the longer they would keep connected. To express
3. In section IV, we present our routing protocol its associativity, each node broadcasts periodi-
and describe its functioning. cally a Beacon to indicate its presence. Upon the
4. Section V is devoted to the conducted simu- reception of a Beacon, a counter associated to the
lations. We compare the results given by generating node is updated. The counter is reset to
our proposed protocol to that of the OLSR zero if the associated node is no longer accessible.
(Clausen and Jacquet, 2003) routing proto- The optimal route towards a destination is the one
col and we show the remarkable efficiency maximizing its cumulative associativity metric.
achieved by our proposal. Further stability-based routing techniques
5. Finally, in section VI, we present concluding can be found in (Beraldi, Querzoni and Baldoni,
remarks and some future directions. 2006), (Dubois-Ferriere, Grossglauser and Vet-
terli, 2003), (Roth and Wicker, 2003), (Yu, Li
and Gruber, 2003), (Camp, Boleng and Davies,
RELATED WORK 2002) and (Hui Zhang and Yu-Ning Dong, 2007).
In 2009, Belghith, Ben Mnaouer and Abid
To withstand the network dynamics, stability- presented the Probabilistic Source (or flow ori-
based routing is an approach aiming essentially at ented) Routing (PSR) which is a routing protocol
choosing routes which are more stable over time using localization information gathered at the start
and hence more resilient to dynamic changes in of the routing period, along with a probabilistic
the network topology. technique to compute the most stable paths. This
In 2004, Chung proposed to classify links protocol focuses rather on the existence prob-
based on the mobility behavior of their end point ability of complete routes. The stability of routes
nodes. Links between stationary or very slowly may change as time progresses, and used paths
moving nodes are considered as stationary links. selection is made accordingly. The PSR protocol
Links which exist only for a short period of time requires, however, synchronization among all
are handled as transient links. Newly formed links nodes in the network.
are also considered to be transient as they are more Along the same line, other probabilistic tech-
likely to break down. Routing should favor the niques were proposed to cope with the unpredict-
use of stationary links whenever this is possible. able behavior of ad hoc topologies. In (Tseng,
A ticket-based probing procedure is proposed to Li and Chang, 2003), a probabilistic technique
find stable routes. is proposed to estimate the residual lifetime of
Classification of links is previously adopted routes. Routing is then made through the ones
in Dube, Rais, Wang and Tripathi (1997) work. with maximum residual lifetime. The lifetime of
Authors used the strength of the received signal a route is computed as a function of the existence
from each neighboring node to determine whether probability of each link which is derived from the
the associated link is either weak or strong. Rout- distribution governed by the underlying mobility
model.

1347
Cartography and Stability to Enhance and Self Regulate Proactive Routing in MANETs

In contrast to the above protocols, our current Capkun, Hamdi and Hubaux, 2001; Niculescu
routing proposal does not require any synchroniza- and Nath, 2003; Nagpal, Shrobe, and Bachrach,
tion between nodes and defines a proactive routing 2003; Saad, Benslimane and Jean-Claude, 2008;
protocol based on the collected cartography of the Boukerche, de Oliveira, Nakamura and Loureiro,
network. It allows each node to compute its own 2007; and Takenaka, Mineno, Tokunaga, Miyauchi
routing period and routing period size in a local and Mizuno, 2007).
and autonomic way as a function of the locally In this current work, we assume that nodes are
perceived network dynamics. location capable, relaying on GPS or one of the
already cited localization techniques. A forwarded
hello contains then both of its originating node’s
NETWORK CARTOGRAPHY position and its forwarding node’s position. The
network cartography is built by every node through
Network cartography is the set of coordinates of the dissemination of the geographic information
the different nodes within the network area and integrated in the routing announcements (i.e.; the
as such it is much richer than the mere network Hello messages). What we need, for our current
topology that just gives the connectivity within the purpose, is a collected cartography that reflects
network. Our aim here is to design a distributed adequately and accurately the actual real network
algorithm that collects the cartography of the cartography when the routing table calculation
network in such a way that it can be integrated in is needed. To evaluate the correctness of the
any proactive routing protocol with no additional collected cartography, we compare it against the
signaling traffic apart from a very few additional real current cartography of the network at the
fields in routing announcement messages (i.e., instant of the observation. Note that the actual
Hello messages). instantaneous cartography of the network can be
First of all, we intend to get a correct and valid easily extracted from the simulator but it cannot
cartography and therefore we will not tolerate be known in practice. This is of no concern to
delayed routing information. As such, we pro- us here since all what we are looking for is to be
pose to distinguish between control (i.e., hello) sure that the collected cartography does represent
messages and data packets. Hello messages are adequately the real actual cartography. For now,
to be transmitted as soon as possible before any we will just define the validity of the Cartography;
other awaiting data packets. As such, received its evaluation will be differed to when we propose
or locally generated Hellos are put at the head our routing protocol.
of the IP sending queue in front of any awaiting Consider a target mobile node “N”. When N
data packet. firstly advertised itself (i.e., sent its own generated
Secondly, we assume that each node is capable Hello), it was at position (x0; y0). In the Collected
of knowing its own geographical location. Such Cartography of a node “A” that had already heard
an assumption can be justified by the recent avail- N’s Hello, a new entry for N was created, show-
ability of small and inexpensive low power GPS ing (x0; y0) as N’s coordinates. Since node N is
receiver devices (Giordano and Stojmenovic, mobile, its position varies as a function of time,
2003). In case where these devices are not avail- and consequently it will be at position (xt; yt) at
able or are not applicable (in-door networks case instant “t” as time progresses. We say that N’s
for instance), many other approaches for inducing position, as indicated by A’s current cartography, is
relative coordinates still could be used. We can find valid as long as the distance between the recorded
a large set of GPS-less or restrained techniques for position (x0; y0) and the actual current position
localization (Bulusu, Heidemann and Estrin, 2000;

1348
Cartography and Stability to Enhance and Self Regulate Proactive Routing in MANETs

(xt; yt) is less than a tolerated predefined value dynamics, PSize may vary from the smallest de-
denoted by “d”. That is: fined updating period denoted by “Tmin” to the
largest updating period denoted by “Tmax”. The
target node next announcement is then scheduled
(x −x0 ) + ( yt −y0 ) ≤ d
2 2
t at instant “NextUpdate”.
Furthermore, the proposed protocol makes
The validity of the cartography, as perceived by any each node track its remoteness, from the position
given node, say node A, represents the percentage where it was at the instant of its last announcement,
of nodes having valid positions among all nodes every Tmin seconds (i.e., an observation instant
in the network. Parameter d is a tuning parameter every Tmin seconds). This is needed by each node
whose value is relative to the transmission range to correctly and appropriately follow its dynamic
used, and is in general a very small fraction of evolution in time. Let “Last_Update_t” be the
this range. instant at which the target node has made its last
announcement. Let “∆d” be the remoteness of
The Proposed Routing Protocol the target node at instant (Last_Update_t + Tmin).
The target node instantaneous remoteness speed
The proposed routing protocol is an asynchronous is then Sd = ∆d/Tmin.
self adjustable routing protocol since it is based At this Speed, the cartography tolerance value
on an asynchronous distributed cartography d will be exceeded after (d / Sd) seconds. The target
algorithm and each node regulates locally and node routing period size is then updated as follows:
dynamically its own routing period size according
to its own dynamics. • PSize = Tmax : if Sd = 0 m/s
• PSize = min(Tmax; max(Tmin; d / Sd)) :
The Protocol Description otherwise

Given a tolerance value d, our aim is to collect a This procedure is then repeated each Tmin
valid cartography. That means, if node N’s speed, seconds. The target node will consequently re-
or more exactly its remoteness from the position announce itself at:
where it was at its last announcement, is such that
this tolerance value d is exceeded (Position of NextUpdate = min(Last_Update_t + PSize ;
Node N may be no longer valid in all other nodes’ NextUpdate)
collected cartographies), N has to re-announce
itself to enforce the update of its position in the As previously pointed out, the minimum period
collected cartographies at the other nodes. So, size that can be used is Tmin. Each node periodi-
each node has to compute the instant, at which cally tracks its remoteness each Tmin seconds, and
it has to re-announce itself based on its currently if the tolerance limit is exceeded, an immediate
observed dynamics, so that it will not exceed the announcement is performed.
tolerance d and keeps the validity of collected As a result, each node will adapt its rout-
cartographies at a high acceptable level at all times. ing period according to its own sensed velocity
The period size is then locally computed by each level. As such, its position is updated in all other
node according to its own perceived dynamics. Let nodes’ collected cartographies only when needed.
“PSize” be the period size computed by a given Each node includes its own perceived position
target node. Depending on its currently observed in its locally generated Hello message, and then
broadcasts it to its neighbors. Upon the reception

1349
Cartography and Stability to Enhance and Self Regulate Proactive Routing in MANETs

of this Hello, a node updates its perceived car- 3. Upon receiving a Hello message, a node
tography, inserts its current position in the Hello first consults the sender’s relative fields. If
and forwards the message. As such, a forwarded no entry exists in the cartography pointing
hello contains both its originating node’s posi- out the position of the next hop address field
tion as well as its forwarding node’s one to avoid then an entry is added. Otherwise if the entry
inconsistency problems already pointed out in is relative to a previous period (old sequence
(Abid and Belghith, 2009-3). Number compared to the ”Next hop sequence
The proposed cartography based routing pro- number” field), then the entry is updated ac-
tocol works as follow: cordingly. A Hello is automatically created
to advertise this new entry to other nodes in
1. It is through Hello messages that a node the network. Otherwise, the information is
can maintain its Cartography up to date. silently ignored.
The Cartography includes entries to already 4. Then, the node checks whether it is the
heard destinations. Each entry includes a originator of this Hello, in which case it
sequence Number (i.e., the period’s number), just discards the message. Otherwise, the
an entry local start instant (recall that an node tests the usefulness of the received
entry is deleted after a maximum duration hello: if its Cartography already contains
period (“Max_duration_period”), the cur- an entry for the ”Source Address” (which
rent perceived position of the destination represents the destination to be reached),
node and the destination node address. the current sequence number of this entry
2. Each node generates locally a Hello mes- and the sequence number included in the
sage every computed period (at instant Hello are compared. The entry is updated
NextUpdate). The Hello message contains if it is older. A new entry is automatically
the following fields: created in the Cartography, if the message
◦◦ Source address: the address of the announces a destination not yet known.
node that originated this Hello. 5. If the received message is useful, the node
◦◦ Source sequence number: the se- forwards it to its neighbors. It puts its ad-
quence number that the originator of dress in the ”Next hop Address” field, puts
the Hello message is currently using. its position in the corresponding field, puts
◦◦ Next hop Address: the Next hop to its currently used sequence Number in the
reach the Source Address, this is just ”Next hop sequence number”, and then
the address of the node that sent this broadcasts it to its neighbors after a certain
Hello. delay as explained below.
◦◦ Next hop sequence number: the se- 6. We attempt to make the new information
quence number that the node for- spread in the network as fast as possible, to
warding the Hello message is cur- ensure that all nodes in the network are using
rently using. the same collected cartography along time.
◦◦ Source position: the source node For that, and before forwarding the new heard
coordinates at the Hello generation useful Hello, a node computes the distance
instant. “D”, separating him from the ”Next hop
◦◦ Next hop position: the sender node Address” node (the node bringing the new
current coordinates perceived at the information). The bigger this distance is (0
very last self announcement it made. ≤ D ≤ Transmission Range = R), the fastest
the forwarding must be accomplished: we

1350
Cartography and Stability to Enhance and Self Regulate Proactive Routing in MANETs

proposed that each node forwards the new Simulation Set Up


information after a delay time:
To ascertain the validity of the cartography as a
1 − min ( R , D ) / R function of the mobility, traffic load and time, we
delayTime =  seconds conducted an extensive set of simulations. We
2
have considered a simulation area of 1000m by
1000m, with 100 mobile nodes using a modified
New information forwarding is distributed over Random Waypoint mobility model (Camp, Boleng
one half a second, accordingly to the distance and Davies, 2002). The simulation surface was
between the sender and the forwarder, to lower divided into 9 areas, each of which contains from
the chance of collisions. 11 to 12 nodes. The Random-Waypoint mobility
Once the cartography is collected, a node can is then applied separately inside each area (Figure
compute its routing table using any shortest path 1). This is basically to mitigate the known middle
algorithm executed on the corresponding cartog- concentration effect of the Random WayPoint
raphy. Note here that the cartography collection is mobility model.
made in an asynchronous way since no synchro-
nization is required between nodes. Recall that in The Effectiveness of the Restricted
ad hoc networks, a wireless link exists whenever Random Waypoint Mobility Model
its two end point nodes are within the transmis-
sion range R. In the proposed routing protocol, As proven in several previous works (Bettstetter,
however, a node does not re-announce itself unless Lassila, and Virtamo, 2006 ; Hyytia, Lassila and
the tolerance value d is exceeded. Furthermore, if Virtamo, 2006 ; and Yoon, Liu and Noble, 2003),
we consider the worst case where two neighbor the Random Waypoint Mobility model suffers
nodes can only move away linearly, then we must from the problem of concentration of the mobile
reduce the neighborhood range by twice the value nodes towards the middle of the simulation area.
of this tolerance. This will make our routes more This behavior would result in a reduced average
stable over time since a node can’t move farther number of hops and a smaller signaling than in
more than 2×d meters. In this work, we suppose actual networks.
that this extreme case has low chance to happen,
and as such, we limit ourselves to an acceptable Figure 1. Restricted random waypoint: Applied
threshold as it will be shown later, by applying random waypoint inside each area
the chosen shortest path algorithm to our collected
cartography using an effective range between
nodes limited to R - d. Next, we investigate the
validity of the collected cartography which must
be kept at its highest levels at all applied speeds,
all loads and all times. The load effect, however,
can be overcome thanks to the adopted prioritiza-
tion IP model.

1351
Cartography and Stability to Enhance and Self Regulate Proactive Routing in MANETs

In the quest to mitigate such an undesirable To better ascertain the odd behavior of the
effect, we propose dividing the simulation area random waypoint mobility model and its impact
into smaller zones, where the RWP mobility on simulation results, we propose next to inspect
model is applied independently. The concentration the average length of the shortest paths in the
problem will always be there, but instead inside network as a function of time. The statistics are
every sub-area. As such, we will guarantee the made by a randomly selected mobile node. For all
presence of nodes at the edges of our area as time the applied scenarios, we’ve started by the same
progresses, and therefore the average number of initial network with a shortest paths’ mean length
hops in the network will be kept enough high equal to 3.8 hops. We observe through figure s
during the whole simulation. 4 to 8 that the use of the non restricted random
Before going any further, we need to better see waypoint mobility model leads to smaller average
whether the problem is solved, or at least reduced, path lengths than when we use our restricted model.
using our proposed restricted random waypoint The mean length of the shortest paths from the
mobility model. It is obvious that this concentra- node making the statistics towards all the remain-
tion comes out deeper proportionally to the speed ing nodes in the network is smaller when using the
of the nodes. Starting the simulation scenarios with non restricted model; and this for all the applied
the initial distribution of nodes given by Figure maximum speeds. Notice here that, as pointed out
2, Figure 3 highlight the success of our model in in (Yoon, Liu and Noble, 2003), the hysteresis ef-
avoiding the concentration problem by showing fect taking place when using high speeds is due to
the nodes’ geographical distribution after 1500s the large simulation time needed by the random
of simulation time. waypoint model to reach the permanent regime,
As expected, we observe as the simulation as well as some of the initial parameters of this
time runs that the nodes using our restricted mo- model, such as the use of a non null pause time
bility model occupy the entire simulation area, and a null lower limit of the interval from which
though some concentration within each sub area. the nodes’ speed is uniformly picked up.

Figure 2. Initial geographical distribution of mobile nodes

1352
Cartography and Stability to Enhance and Self Regulate Proactive Routing in MANETs

Figure 3. The geographical distribution of the mobile nodes after simulation running time = 1500s

The Rest of the Simulation Set Up MAC retransmission count equals to 3. We used a
priority IP module at the network layer to enforce
We used a transmission range of 250 m, a toler- that Hellos are treated before any awaiting data
ance d of 25 meters that is a tolerance equals packet. The priority queue maximum size is set
to one tenth of the used transmission range, a to 100 packets where data packets are admitted
network capacity of 54 Mbps and a maximum only if less than 70 packets are within the queue.

Figure 4. The mean length of the shortest paths as a function of time at a maximum speed of 1m/s

1353
Cartography and Stability to Enhance and Self Regulate Proactive Routing in MANETs

Figure 5. The mean length of the shortest paths as a function of time at a maximum speed of 2m/s

Figure 6. The mean length of the shortest paths as a function of time at a maximum speed of 5m/s

The Maximum updating period size Tmax is Simulation Results


set to 24 seconds and the minimum, Tmin, is fixed
to 3 seconds. Finally, the Max_duration_period, Figure 9 portrays the validity of the cartography as
representing the life time of an entry in the rout- a function of time and for different node speeds.
ing table, is set to 30 seconds. All required Recall that we are using a priority IP handling and
modifications are ported on the OMNeT++ net- therefore the network load has a very little impact
work simulator. on the validity of the cartography. For a null node
speed (no mobility) as well as a speed of 1 m/sec,
we get a validity of one hundred percent. As the
speed gets higher the validity of the cartography

1354
Cartography and Stability to Enhance and Self Regulate Proactive Routing in MANETs

Figure 7. The mean length of the shortest paths as a function of time at a maximum speed of 10m/s

Figure 8. The mean length of the shortest paths as a function of time at a maximum speed of 20m/s

declines a little bit but still stays sufficiently high proposed protocol, using the cartography collect-
around 97 percent even for a speed of 10 m/s. ing algorithm and the Priority IP handling. We also
integrated OLSR under the same prioritization IP
model. Our simulation set up is as defined previ-
SUITABILITY OF THE PROPOSED ously. The OLSR TC period is set to 8 seconds and
ROUTING PROTOCOL its Hello period to 2 seconds. The initial topology
of 100 nodes is chosen randomly, but in a way to
To experiment our new stability-based routing have 11 to 12 nodes in each area (in each area, we
proposal, we conducted a set of simulations using place 11 nodes uniformly). We immobilized 10
the OMNeT++ simulator where we integrated our source nodes at the left edge of the simulation area

1355
Cartography and Stability to Enhance and Self Regulate Proactive Routing in MANETs

Figure 9. Validity of the cartography as a function of time and speed

(randomly distributed over the three sub-areas at 12, 13, and 14 portray the validity of the routes
the left edge) and similarly 10 destination nodes as a function of time within an observation Time
at the right edge. We also consider ten traffic window of 25 seconds, for the two defined proto-
flows using these fixed source-destination pairs cols with a Null applied data load and respectively
(pairs are chosen randomly). As such, we insured for the speeds 0 m/sec, 1 m/sec, 2 m/sec, 5m/sec
multi-hop routes (4 to 5 hops). Finally, the data and 10 m/sec.
packet size used is 200 bytes. We readily observe that for a null speed the
Let us now investigate the correctness or the validity provided by both protocols is, as ex-
validity of the routes given by each protocol pected, equal to one hundred percent. As node
against the actual real network. The routing period speed gets higher, the OLSR routing protocol
is divided into equally spaced observation points; starts losing its efficiency by delivering the low-
namely an observation point each second. At est route validity. More interestingly, we notice
each observation time point, we have then three that our proposed protocol succeeds in keeping
different views of the network: the actual real this validity near the 100% over the time even for
view provided by the underlying mobility model high mobility levels.
used in our OMNeT++ simulator (to which we Having such a result, we can expect that our
compare the routing protocols), the OLSR rout- proposed routing protocol is going to completely
ing table and the routing table provided by our outperform OLSR. Figures 15, 16, 17, 18 and
proposed protocol. A route from a source node 19 represent the number of received packets per
to a destination node is termed valid if it does flow (the network throughput) as a function of
exist in the real network regardless whether it is the applied data load rate per flow and for dif-
optimal (a shortest route) or not. ferent speeds. We clearly observe the sensitivity
To ascertain the efficiency and accuracy of of OLSR to the network dynamics. As the node
our proposed routing algorithm, we compute the speed gets higher, our protocol delivers more
percentage of valid routes from a designated node throughput than OLSR. While the superiority
(node 1) to all the other 99 destinations given by of our protocol is clear over the entire range of
each one of the two protocols. Figure s 10, 11, traffic loads, it gets larger as the traffic load gets

1356
Cartography and Stability to Enhance and Self Regulate Proactive Routing in MANETs

Figure 10. Validity of the routes as a function of time: our proposal vs OLSR, Speed=0 m/sec

Figure 11. Validity of the routes as a function of time: our proposal vs OLSR, Speed=1 m/sec

Figure 12. Validity of the routes as a function of time: our proposal vs OLSR, Speed=2 m/sec

1357
Cartography and Stability to Enhance and Self Regulate Proactive Routing in MANETs

Figure 13. Validity of the routes as a function of time: our proposal vs OLSR, Speed=5 m/sec

Figure 14. Validity of the routes as a function of time: our proposal vs OLSR, Speed=10 m/sec

higher. Indeed, at moderate to high workloads, ets over all defined flow. Figures 20, 21, 22, 23,
OLSR is unable to cope with the loss of the rout- and 24 portray this average end to end delay as a
ing validity as its routes validity gets deteriorated function of the traffic load per flow and for dif-
as shown in the previous Figures (11 12, 13 and ferent speeds. We clearly observe, except for the
14) and consequently more and more packets Null mobility case, that our routing protocol deliv-
wander through the network wasting resources ers packets to their ultimate destinations in much
otherwise used to correctly forward data traffic. less time than OLSR does. As the node speed gets
For a sustained traffic load of 50 generated data higher, OLSR amounts to very high average end
packets per second and a high mobility of 10 m/s to end delay attaining hundreds of milliseconds
(Figure 19), our protocol yields more than 50% (Figure 24) for even moderate traffic loads. Our
additional throughput than that of OLSR. protocol, at the opposite, sustains both high speeds
Let us now, investigate the performance of and workload and maintains a rather very adequate
each protocol in terms of the end to end delay. average end to end delay under few tens millisec-
This metric measures the average end to end onds even for a traffic load of 30 packets per
delay in seconds spent by the received data pack- second.

1358
Cartography and Stability to Enhance and Self Regulate Proactive Routing in MANETs

Figure 15. Average number of received packets / flow as a function of the applied load: this proposal
vs OLSR, Speed=0 m/sec

Figure 16. Average number of received packets / flow as a function of the applied load: this proposal
vs OLSR, Speed=1 m/sec

Figure 17. Average number of received packets / flow as a function of the applied load: this proposal
vs OLSR, Speed=2 m/sec

1359
Cartography and Stability to Enhance and Self Regulate Proactive Routing in MANETs

Figure 18. Average number of received packets / flow as a function of the applied load: this proposal
vs OLSR, Speed=5 m/sec

Figure 19. Average number of received packets / flow as a function of the applied load: this proposal
vs OLSR, Speed=10 m/sec

Figure 20. Average delay as a function of the applied load: this proposal vs OLSR, speed=0 m/sec

1360
Cartography and Stability to Enhance and Self Regulate Proactive Routing in MANETs

Figure 21. Average delay as a function of the applied load: this proposal vs OLSR, speed=1 m/sec

Figure 22. Average delay as a function of the applied load: this proposal vs OLSR, speed=2 m/sec

Figure 23. Average delay as a function of the applied load: this proposal vs OLSR, speed=5 m/sec

1361
Cartography and Stability to Enhance and Self Regulate Proactive Routing in MANETs

Figure 24. Average delay as a function of the applied load: this proposal vs OLSR, speed=10 m/sec

When nodes are stationary, OLSR has more case, the network is stationary; the shortest routes
chance to slightly outperform our protocol as can remain valid until the end of the period.
be observed from Figure 20. Indeed our protocol Now, we turn to investigate the signaling cost
builds slightly longer routes than OLSR as can required for our proposal to work and compare it
be seen on Figure 25. Recall that for the sake of to that of OLSR. Recall that our routing protocol
stability, we choose to use shorter links (R - d). exchanges extended Hello messages to set up and
As Shown in Figure 25, the average number maintain an up to date cartography on which it
of hops that data packets have made before reach- builds its routing decisions and period sizes. These
ing their destinations under our protocol is Hello messages are rather very short compared
slightly larger than the average length of the routes to the huge TC messages that OLSR builds and
built by OLSR. The latter provides exactly the distributes among the different MPRs. As such, it
shortest routes computed on the real network at is paramount to investigate not only the average
the beginning of the routing period. Since, in this number of signaling messages exchanged by each

Figure 25. Average length of used routes by received packets: this proposal Vs OLSR, with null speed

1362
Cartography and Stability to Enhance and Self Regulate Proactive Routing in MANETs

node but instead the average number of signaling the fact that they can no more fulfill their status,
bytes per second transmitted by each node within they persist being MPRs for a period of 3×TC
the network. The later provides more insights as seconds (that is 24 seconds in our case) as speci-
to how much signaling capacity per second is fied in the OLSR protocol.
required from the underlying network. Figure 26 shows that OLSR transmits less
Figure 26 portrays the average number of routing messages than our proposed protocol, but
signaling messages (Hello and TC messages recall that each MPR accumulates all received
for OLSR and Hello messages for our proposal) signaling messages (Hello and TC messages) and
transmitted by each node during a simulation time aggregates them into one large TC packet, while
of 100 seconds, as a function of the node speed our algorithm sends each individual topological
and for a null traffic load. Recall that the signal- change as a single message. This explains the low
ing traffic is virtually independent of the traffic validity of routes used by OLSR especially when
load as we have adopted an IP priority mechanism the velocity is high. In OLSR, a collided or lost
for both implementations of OLSR and our rout- routing packet leads patently to the loss of a large
ing protocol. As expected, OLSR transmits less amount of topological information otherwise re-
routing messages than our proposed protocol. quired to update routing tables. More interestingly,
Furthermore, as the node speed gets higher, our OLSR utilizes much more capacity to transmit
protocol requires more routing messages to be its huge signaling messages to the opposite of
transmitted as the routing periods get smaller. The our proposed protocol. Figure 27 portrays the
signaling cost of OLSR, on the other hand, remains average number of signaling bytes transmitted by
virtually insensitive to the node speed as both of second by each node which represents the average
its Hello and TC periods are maintained constant. effective bandwidth consumed by the signaling
We may also notice on Figure 26 that the curve traffic transmitted by each node. We observe that
relative to OLSR is in fact slightly increasing with our proposed protocol outperforms OLSR even
the node speed. Indeed, the number of MPR nodes at null speed. Indeed, in our proposed protocol,
in the network gets slightly higher as the speed each node adapts its routing period to the perceived
increases. Since the network topology changes degree of its mobility. With a static network, our
quickly at higher speeds, new nodes are selected proposal adopts an updating period of 24 seconds
to be MPRs. But some of the older MPRs, despite (Tmax) for all the nodes, while OLSR maintains

Figure 26. Average number of signaling messages as a function of the network maximum speed: this
proposal Vs OLSR, null load

1363
Cartography and Stability to Enhance and Self Regulate Proactive Routing in MANETs

a Hello period of 2 seconds and a TC period of 8 CONCLUSION


seconds regardless of node speeds. This amounts,
as portrayed on Figure 27 to almost doubling the In this paper, we proposed a new asynchronous-
signaling traffic for OLSR. As the node speed cartography-based-routing protocol that allows
increases, the signaling overhead gets bigger for each mobile node in the network to calibrate its
both protocols but the difference between the own routing period size according to its perceived
two protocols gets smaller as our proposal adapts local dynamics. This calibration allows each node
the routing period sizes to the mobility degree. to re-announce itself once it exceeds d meters from
For speeds of 5 m/s, respectively 10 m/s, OLSR its previously announced position. This tolerance
requires around 43%, respectively 19%, more distance represents a real tuning parameter that
signaling traffic than our proposed protocol. needs to be appropriately gauged. Increasing d
Recall here that our proposed routing protocol leads to reducing the control overhead, but simul-
generates control traffic proportionally to the taneously increases the number of hops composing
sensed node’s speed. A node never gets farther the computed routes.
than d meters from its last announced position To evaluate our local self regulating routing
without spreading its new position. As a conse- and compare its performance to that of the seminal
quence by willingly fixing the node’s speed, the OLSR, we conducted extensive simulations. As a
tolerance parameter d will directly affect the first step, we focused on the underling used mobil-
signaling overhead. At a fixed speed, the com- ity model: the random waypoint. This particular
puted routing period size of a node increases as model suffers from some known problems, mainly
the tolerance value d increases and vice versa. the middle concentration problem. We proposed a
The other side of the coin is, though, an undesir- new restricted random waypoint model to precisely
able increase in the number of hops of the se- solve this aforementioned problem. We showed
lected routes is obtained when the value of d gets through the conducted simulations that the effect
large. This leads also to a high increase of the of this problem is minimized using our proposed
number of retransmissions needed to deliver data mobility model.
packets to their ultimate destinations. As such, We then moved on to evaluate our proposed
our tolerance parameter becomes a tuning param- routing protocol. We showed in particular, through
eter that needs to be further investigated. different scenarios of node speeds and various

Figure 27. Average number of signaling bytes/sec as a function of the network maximum speed: this
proposal Vs OLSR, null load

1364
Cartography and Stability to Enhance and Self Regulate Proactive Routing in MANETs

workloads that our routing proposal outperforms Bettstetter, C., Lassila, P., & Virtamo, J. (2006).
OLSR in terms of a better routing validity, an Spatial node distribution of the random waypoint
enhanced throughput, a much lower end to end mobility model with applications. IEEE Trans-
delay and a lower signaling overhead. actions on Mobile Computing, 5(6), 680–694.
Currently, we are investigating the scalability Retrieved from http://ieeexplore.ieee.org/lp-
of our routing protocol and its performance under docs/epic03/ wrapper.htm?arnumber=1624340.
different mobility and propagation models. We are doi:10.1109/TMC.2006.86
also investigating the tuning tolerance parameter
Boukerche, A., de Oliveira, H. A. B. F., Nakamura,
d to best improve the network performances.
E. F., & Loureiro, A. A. F. (2007). Towards an
integrated solution for node localization and data
routing in sensor networks. Proceedings 12th IEEE
REFERENCES
Symposium on Computers and Communications
Abid, M. A., & Belghith, A. (2009-3). Functional (ISCC ‘07), July 2007.
enhancements of proactive routing in mobile ad Camp, T., Boleng, J., & Davies, V. (2002). A
hoc networks. Third IFIP International Conference survey of mobility models for ad hoc network
on New Technologies, Mobility and Security (IFIP research. Wireless Communication and Mobile
NTMS 2009), Cairo, Egypt, 20-23 December, Computing (WCMC): Special ISSUE on Mobile
2009. Ad Hoc Networking: Research. Trends and Ap-
Belghith, A., & Abid, M. A. (2009-1). Autonomic plications, 2(5), 483–502.
self tunable proactive routing in mobile ad hoc Capkun, S., Hamdi, M., & Hubaux, J. P. (2001).
networks. The 5th IEEE International Conference GPS-free positioning in mobile ad-hoc networks.
on Wireless and Mobile Computing, Networking Proceedings of the 34th Hawaii International
and Communications (IEEE WiMob’09), Mar- Conference on System Sciences (HICSS), 2001.
rakech, Morocco, October 12-14. 2009.
Chung, W. H. (2004). Probabilistic analysis of
Belghith, A., & Abid, M. A. (2009-2). Dynami- routes on mobile ad hoc networks. IEEE Com-
cally self adjustable proactive routing protocols munications Letters, 8(8), 506–508. doi:10.1109/
for mobile ad hoc networks. The 34th Annual LCOMM.2004.833841
IEEE Conference on Local Area Networks (IEEE
LCN’09), Zurich, Switzerlan, October 20-23. Clausen, T., & Jacquet, P. (2003). Optimized link
state routing protocol (OLSR). Request for Com-
Belghith, A., Mnaouer, A. B., & Abid, M. A. ment 3626. MANET Working Group.
(2009). Suitability analysis of probabilistic rout-
ing for dynamic ad-hoc networks. The 5th IEEE Clausen, T., Jacquet, P., & Viennot, L. (2002).
International Conference on Wireless and Mobile Comparative study of routing protocols for mobile
Computing, Networking and Communications ad hoc networks. The First Annual Mediterranean
(IEEE WiMob 2009), Marrakech, Morocco, Oc- Ad HOC Networking Workshop, September 2002.
tober 12-14, 2009. Dube, R., Rais, C. D., Wang, K. Y., & Tripathi, S.
Beraldi, R., Querzoni, L., & Baldoni, R. (2006). K. (1997). Signal stability-based adaptive routing
A hint-based probabilistic protocol for unicast (SSA) for ad hoc mobile networks. IEEE Per-
communications in MANETs. Ad Hoc Networks, sonal Communications Magazine, 4(1), 36–45.
4, 547-566. Elsevier, 2006. doi:10.1109/98.575990

1365
Cartography and Stability to Enhance and Self Regulate Proactive Routing in MANETs

Dubois-Ferriere, H., Grossglauser, M., & Vetterli, Novatnack, J., Greenwald, L., & Arora, H. (2005).
M. (2003). Age matters: Efficient route discovery Evaluating ad hoc routing protocols with respect
in mobile ad hoc networks using encounter ages. to quality of service. Wireless and Mobile Com-
Proc. of ACM MobiHoc03, 1-3 Annapolis, MD, puting Networking and Communications WiMob,
USA, 2003. 05, 2005.
Giordano, S., & Stojmenovic, I. (2003). Position- Perkins, C. E., & Bhagwat, P. (1999). Highly
based ad hoc routes in ad hoc networks (pp. dynamic destination-sequenced distance-vector
287–300). Boca Raton, FL: CRC Press, Inc. routing (DSDV) for mobile computers. ACM
SIGCOMM’94 Conference on Communications
Hyytia, E., Lassila, P., & Virtamo, J. (2006). Spatial
Architectures, Protocols and Applications, 1999.
node distribution of the random waypoint mobility
model with applications. IEEE Transactions on Perkins, C. E., Royer, E. M., & Chakeres, I. E.
Mobile Computing, 5(6), 680–694. doi:10.1109/ (2003). Ad hoc on demand distance vector rout-
TMC.2006.86 ing protocol. Internet draft, MANET Working
Group 2003.
Johnson, D. B., Maltz, D. A., & Hu, Y.-C. (2004).
The dynamic source routing protocol for mobile Roth, M., & Wicker, S. (2003). Termite: Emergent
ad hoc networks (DSR). Internet draft, IETF ad-hoc networking. Proceedings of the Second
MANET Working Group 2004. Mediterranean Workshop on Ad-Hoc Networks,
Mehdia, Tunisia, 2003.
Kleinrock, L. (1975). Queueing systems, volume
1: Theory. In Wiley Interscience (Ed.), Queueing Saad, C., Benslimane, A., & Konig, J.-C. (2008).
systems. AT-dist: A distributed method for localization with
high accuracy in sensor networks. International
Mbarushimana Amine, C., & Shahrabi, A. (2007).
Journal Studia Informatica Universalis, Special
Comparative study of reactive and proactive
Issue on Wireless Ad Hoc and Sensor Networks,
routing protools performance in mobile ad hoc
6(1).
networks. The 21st International Conference on
Advanced Information Networking and Applica- Sridhar, K. N., & Chan, M. C. (2005). Stability and
tions Workshop, AINA’07, IEEE 2007. hop-count based approach for route computation
in MANET. In Proceedings of ICCCN05.
Nagpal, R., Shrobe, H., & Bachrach, J. (2003).
Organizing a global coordinate system from local Takenaka, T., Mineno, H., Tokunaga, Y., Miyauchi,
information on an ad hoc sensor network. Pro- N., & Mizuno, T. (2007). Performance evaluation
ceedings of the Second International Workshop of optimized link state routing-based localiza-
Information Processing in Sensor Networks tion. 2007 IEEE International Performance,
(IPSN), 2003. Computing, and Communications Conference,
(pp. 150-159).
Niculescu, D., & Nath, B. (2003). DV based
positioning in ad hoc networks. [Springer Neth- Toh, C.-K. (1997). Associativity-based routing for
erlands.]. Telecommunication Systems, 22(1-4), ad hoc mobile networks. International Journal
267–280. doi:10.1023/A:1023403323460 on Wireless Personal Communications, 4(2),
103–139. doi:10.1023/A:1008812928561

1366
Cartography and Stability to Enhance and Self Regulate Proactive Routing in MANETs

Tseng, Y., Li, Y. F., & Chang, Y. (2003). On route Yuan, X. (1999). On the extended Bellman-Ford
life time in multi-hop mobile ad hoc networks. algorithm to solve two-constrained quality of
IEEE TOC, 2(4). service routing problems. Proceedings of The
Eighth International Conference on Computer
Yoon, J., Liu, M., & Noble, B. (2003). Random
Communications and Networks (IC3N’99), Bos-
waypoint considered harmful. IEEE INFOCOM
ton, USA, 1999.
2003 Twenty-Second Annual Joint Conference
of the IEEE Computer and Communications Zhang, H., & Dong, Y.-N. (2007). A novel path
Societies (pp. 1312-1321). IEEE. Retrieved from stability computation model for wireless ad hoc
http://ieeexplore.ieee.org/lpdocs/epic03/ wrapper. networks. IEEE Signal Processing Letters, 14(12).
htm?arnumber=1208967 doi:10.1109/LSP.2007.907994
Yu, D., Li, H., & Gruber, I. (2003). Path avail- Zygmunt, J., Haas, M. R., & Pearlman, P. S. (2002).
ability in ad hoc network. Proceedings of the 10th The interzone routing protocol (IERP) for ad hoc
International Conference on Telecommunications networks. Internet Draft, work in progress.
(ICT03), volume 1, (pp. 383–387).

This work was previously published in International Journal of Business Data Communications and Networking, Volume 6, Issue
4, edited by Debashis Saha and Varadharajan Sridhar, pp. 21-37 , copyright 2010 by IGI Publishing (an imprint of IGI Global).

1367
1368

Chapter 82
Learning Geospatial Concepts
as Part of a Non-Formal
Education Robotics Experience
Viacheslav Adamchuk Neal Grandgenett
McGill University, Canada University of Nebraska at Omaha, USA

Bradley Barker Megan Patent-Nygren


University of Nebraska-Lincoln, USA University of Nebraska-Lincoln, USA

Gwen Nugent Collin Lutz


University of Nebraska-Lincoln, USA Virginia Polytechnic Institute and State
University, USA
Kathy Morgan
University of Nebraska-Lincoln, USA

ABSTRACT
In the increasingly modern and technological world, it has become common to use global navigation
satellite system (GNSS), such as Global Positioning System (GPS), receivers, and Geographic Information
Systems (GIS) in everyday life. GPS-equipped mobile devices and various Web services help users
worldwide to determine their locations in real-time and to explore unfamiliar land areas using virtual
tools. From the beginning, geospatial technologies have been driven by the need to make efficient use
of natural resources. More recently, GPS-equipped autonomous vehicles and aircraft have been under
development to facilitate technological processes, such as agricultural operations, transportation, or
scouting, with limited or virtual human control. As outdoor robotics relies upon a number of principles
related to science, technology, engineering, and mathematics (STEM), using such an instructional context
for non-formal education has been promising. As a result, the Geospatial and Robotics Technologies
for the 21st Century program discussed in this chapter integrates educational robotics and GPS/GIS
technologies to provide educational experiences through summer camps, 4-H clubs, and afterschool
programs. The project’s impact was assessed in terms of: a) youth learning of computer programming,
mathematics, geospatial and engineering/robotics concepts as well as b) youth attitudes and motivation
towards STEM-related disciplines. An increase in robotics, GPS, and GIS learning questionnaire scores
and a stronger self-efficacy in relevant STEM areas have been found through a set of project-related
assessment instruments.

DOI: 10.4018/978-1-4666-2038-4.ch082

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Learning Geospatial Concepts as Part of a Non-Formal Education Robotics Experience

INTRODUCTION the STEM fields to stay competitive in the global


economy, there is also the need for universal un-
There is growing national concern that the United derstanding of STEM concepts for everyone. For
States’ educational efforts are not producing example, the National Research Council (2011)
enough STEM (Science, Technology, Engineering recently reported that:
and Mathematics) professionals to meet the needs
of our increasingly sophisticated and technological a compelling case can also be made that under-
society, especially when compared to many other standing science and engineering, now more
countries around the world. National reports such than ever, is essential for every American citizen.
as the 2010 Rising Above the Gathering Storm Science, engineering, and the technologies they
Revisited (National Academy of Science, 2010), influence permeate every aspect of modern life.
focus on the problem of U.S. competitiveness in Indeed, some knowledge of science and engineer-
STEM areas as summarized across educational ing is required to engage with the major public
reports and statistics. This diminishing STEM policy issues of today as well as to make informed
competiveness directly threatens our economy. For everyday decisions, such as selecting among al-
example, a sample from these business statistics ternative medical treatments or determining how
indicates that, of Walmart’s 6000 suppliers, at least to invest public funds for water supply options
5000 now reside in China (Goodman & Pan, 2004). (National Research Council, 2011 p1-1).
Engineering statistics are particularly frightening.
As an example, during a recent period when two What makes an effective context for STEM
high-rise buildings were built in Los Angeles, a education is steadily changing as computer tech-
total of more than 5000 similar buildings were nology influences the ways in which we can teach
built in Shanghai (Fineberg, 2006). and learn about these important disciplines (Heid,
Educational statistics from national and 2005; Hegedus & Kaput, 2004). Instructional
international reports further illustrate the need tools such as tablet computers, online simulations,
for transformative national reforms in STEM robots, graphing calculators, and various other
education, and the need to pursue more innova- technologies are allowing schools and teachers to
tive educational strategies. In 2010, the World more effectively provide the instructional depth to
Economic Forum ranked the United States as STEM coursework and to teach core concepts in
48th in the overall quality of mathematics and sci- the context of advanced technologies (Anderson,
ence education (World Economic Forum, 2010) et. al, 2011; Heid & Edwards, 2001; Whitemeyer,
and the ACT recently reported that 78 percent of Nicoletti & DePaor, 2010). Integrated technology
high school graduates did not meet the readiness environments such as educational robotics may be
benchmark levels for one or more entry-level col- particularly engaging and motivating for students
lege courses in mathematics, science, reading and in the instruction of STEM concepts (Gura & King,
English (American College Testing, 2008). The 2007). Students engaged in STEM lessons using
United States government is becoming increas- such platforms have the opportunity to become
ingly concerned about this STEM educational involved in various interdisciplinary connections
gap, and in response to these alarming national between science, technology, engineering, and
statistics, President Obama, in his 2011 State of mathematics that cover numerous achievement
the Union Address, identified this STEM education levels (Avanzato, 2000; Heer, Traylor & Fiez,
concern as our nation’s new “Sputnik moment.” 2003; Thompson, Heer, Brown, Traylor & Fiez,
Finally, while there is little doubt that the United 2004).
States will have to develop human potential in

1369
Learning Geospatial Concepts as Part of a Non-Formal Education Robotics Experience

An effective, relevant and motivating student learning occurs in three distinct venues including
context for STEM education is indeed at the everyday learning, designed learning environ-
foundational core of effective STEM education, ments, and programs (Table 1).
and represents a key feature of a desired national Everyday informal learning can happen just
response to this growing crisis (National Science by watching a science program on TV, participat-
Board, 2009). Such a national response at the lo- ing in on-line games, or reading a book or maga-
cal level would seem to suggest a collaborative zine (Fenichel & Schweingruber, 2010). These
effort to make STEM learning more exciting and common learning experiences are usually chosen
effective for students, within a context of real life and led by the learner, and are short in duration.
applications and relevance. Formal educational Moreover, informal learning can also take place
organizations, such as K12 school districts and at institutions outside of the classroom including
universities, must come together with informal museums, zoos, botanical gardens, libraries, and
educational organizations, such as zoos and mu- planetariums. These institutions provide designed
seums, to make STEM concepts come alive, and environments that are characterized by the use of
to help students see the relevance and excitement artifacts, media, signage, and interpretation by
in what they study, and to learn STEM concepts the educators to guide youth through the learning,
more effectively. while at the same time these experiences are
STEM learning can and does take place dur- short-term and completed in less than a day. While
ing out-of-school hours (Eshach, 2007; Martin, more structured than the everyday learning set-
2004; Wellington, 1990), which is often referred tings, the designed environments allow partici-
to as informal education (Fenichel & Schwein- pants to chose their own learning experiences.
gruber, 2010; Martin, 2004; Wellington, 1990;). Finally, informal learning programs, which are
Generally informal learning environments provide also referred to as non-formal learning environ-
experiences that are open ended, learner-centered, ments, are usually adult led and supported by a
voluntary, and nonlinear (Falk & Dierking, 2000; curriculum that can be sequential or non-sequen-
Griffin, 1998; National Research Council, 2009). tial (Fenichel & Schweingruber, 2010). Programs
Moreover, assessment in informal learning envi- include experiences such as 4-H clubs, afterschool
ronments is often comprised of immediate per- programs, summer camps, museum programs and
formance feedback from the activity. According are typically longer in duration than designed
to Fenichel & Schweingruber (2010), informal learning environments.

Table 1. Types and characteristics of informal learning environments

Characteristics Informal Learning Environments


Everyday learning Designed environments Non-formal
Social context Individual Usually group-based learning Group or individual
Place Public or private spaces Museum, zoo, library, planetarium, Public or private space
with media and artifacts including designed spaces and
schools
Impetus of participation Self Self, parent or teacher Self, parent or teacher
Time-scale Short-term Short-term, sporadic Short-term to sustained long-
term
Facilitation Learner Facilitator–led then learner choice Facilitator led
Curriculum None Media and artifacts Formal curriculum

1370
Learning Geospatial Concepts as Part of a Non-Formal Education Robotics Experience

While formal learning environments are ar- report stresses the need for coherence in this
guably the most important centers of learning, kind of learning and an adequate supply of well-
research shows that time spent in formal class- prepared and effective STEM teachers... Informal
rooms pales in comparison with the amount of education is described as an essential conduit to
time spent outside of the classroom in the K-12 increase public interest in and understanding and
learning years. While it is hard to determine exactly appreciation of science, technology, engineering,
how much time students spend in the classroom, and mathematics. (National Research Council,
estimates range from 9% to 18.5% (Fenichel & 2010 p.13).
Schweingruber, 2010; Sosniak, 2001). Medrich,
Roizen, Rubin and Buckley (1982) estimated that It seems clear that research supports the use
individuals spend 11,000 hours in the classroom of STEM activities in the informal and the non-
and 65,000 hours outside of school during their formal learning environments to enhance, bolster
K-12 years or roughly 85% of their waking hours and support formal learning environments.
outside of the classroom (Eshach, 2007). One promising approach to increasing STEM
Because youth spend so much time outside of attitudes, knowledge, skills and workforce capac-
the classroom and because “schools cannot act ity in a non-formal learning environment is to blend
alone, and society must better understand and draw the use of robotics with geospatial technologies
on the full range of science learning experiences as an instructional platform. The University of
to improve science education broadly” (Fenichel Nebraska, with grant funding from the National
& Schweingruber, 2010, p.12) informal science Science Foundation (NSF), has developed the
programs play an increasingly vital role in STEM Geospatial and Robotics Program for the 21St
learning. In the 2009 report, Learning Science Century (GEAR-Tech-21) program to increase
in Informal Environments: People, Places, and science, technology, engineering and mathemat-
Pursuits, the National Academy of Sciences states: ics (STEM) achievement and create interest by
coalescing robotics and geospatial technolo-
Learning science in informal environments has gies. Through hands-on experimentation, such
the potential to bolster science education broadly technologies help youth to transform abstract
on a national scale. This is evident in reports from mathematics and science concepts into concrete,
national initiatives to improve education in sci- real-world applications. Recent improvements
ence, technology, engineering, and mathematics in cost and complexity make it possible for even
(STEM) in the United States... The council’s report relatively young children to engage in hands-on
cites informal education as one of three integral experimentation with robots, GPS receivers and
pieces of the U.S. education system (the other GIS software.
two being K-12 education and higher education) Robots have the potential to change and en-
needed to ensure “U.S. economic competitiveness, hance the learning process in education (Cham-
particularly the future ability of the nation’s edu- bers & Carbonaro, 2003). It is most remarkable
cation institutions to produce citizens literate in that computer-based technologies can be used
STEM concepts and to produce future scientists, as “mindtools” which involve students in using
engineers, mathematicians, and technologists” modern technologies to solve problems (Jonas-
(U.S. Department of Education, 2007, p. 5). sen, 2000). Traditional education practices typi-
Federal interest in informal environments is also cally utilize a relatively direct form of instruction
reflected in the National Science Board’s report whereby students learn from the “technology”
on the critical needs in STEM education (National (e.g. drill and practice exercises), while in con-
Science Board, 2007). The National Science Board

1371
Learning Geospatial Concepts as Part of a Non-Formal Education Robotics Experience

trast, when using “mindtools” such as robotics, a national project. The purpose of the GEAR-
students learn with the “technology”. TECH-21 program is to improve STEM learning
When considering spatial technologies like outcomes and attitudes of youth aged 10 to 15 by
GPS and GIS, similar to robotics, there is grow- combining educational robotics and geospatial
ing potential for educational applications (Kerski, (GPS/GIS) technologies. The combination of
2003). GPS/GIS have become important tools these technologies is widely used in precision
in precision agriculture and natural resources agriculture and natural resources management and
management, which are relevant to youth in rural has many implications for improved sustainable
communities; these students often suffer directly farming methods and conservation. Information-
from poor performance in STEM-related achieve- based management has allowed producers around
ment areas. For example, GIS maps allow users to the world to optimize agriculture inputs, reduce
visualize geographic data to aid agriculture-related negative environmental impact and improve the
decision-making processes. Data used in the GIS social status of relevant professions. The program
maps can also be obtained using small handheld provides youth with lessons and experiences in
GPS receivers, which are easy to use, even for each of the fields culminating in experiences and
middle school aged youth. Combined, the GPS and challenges that combine the technologies. The goal
GIS technologies provide a powerful set of tools is to spread the GEAR-Tech-21 program to 5000
to analyze and interpret spatial information and youth in 450 clubs and camps over five years.
to represent a useful context for applied learning GEAR-Tech-21 is designed for delivery by
in the mindtools context. adult facilitators in non-formal learning environ-
The long-term goal of the program described ments such as 4-H clubs, summer camps, and
in this chapter is to improve STEM learning out- afterschool programs. In addition, GEAR-Tech-21
comes and attitudes for youth who are 11-15 years is an intensive two-year program that usually
of age. Using robotics and GPS/GIS concepts, begins with a 40-hour summer camp experience.
the program seeks to: 1) promote their interest in The camp activities include the building and pro-
STEM fields, 2) introduce basic technology skills, gramming of robots, working with handheld GPS
3) foster problem solving and inquiry skills, and receivers to explore and collect geo-referenced
4) encourage teamwork. information, and the development of GIS maps.
Youth then receive an additional 80-hours of
Program Overview hands-on robotics and GPS/GIS training during
the school year in their afterschool programs
The widespread accessibility of technologies such or in 4-H clubs. In year two, youth attend an
as the LEGO NXT Mindstorm robotics kit, GPS advanced summer camp followed by 80 more
receivers, and GIS software like GoogleEarth hours of hands-on instruction during the school
and ArcMap make it possible for youth (ages 11 year. In total, participating youth receive at least
to 15) to explore and practice STEM concepts 240 hours of focused, hands-on experience over
through the integration of these technologies. two years. Many curriculum activities have been
The project started in 2006 as the Nebraska 4-H published by the project and can be found at the
Robotics and GPS/GIS in 4-H: Workforce Skills project website http://www.gt21.org.
for the 21st Century (NSF contract #ESI-0624591), In educational lexicon bringing an intervention
and was designed as a regional project for Ne- to scale usually involves increasing the number of
braska. However, in 2008 the program received students, teachers, and locations that participate
additional funding (NSF contract #0833403) to with the goal of producing similar results (Baker,
further scale-up the program from a regional to 2005; Coburn, 2003; Fishman, 2005; McDonald,

1372
Learning Geospatial Concepts as Part of a Non-Formal Education Robotics Experience

Keesler, Kauffman, & Schneider, 2006). More- step, in a scale-up process, is ongoing evaluation
over, discussion of scale usually relates to formal to determine efficacy and sustainability and to
educational environments and focuses on effective find an institutional mechanism to accomplish
classroom interventions that can be brought to the the scale-up (Horwitz, 2007; McDonald, et al.,
district level or interventions developed by outside 2006; Motiff, 2007).
vendors brought in to be used at a district level In summary, traditional scale-up frameworks
(Blumenfeld, Kempler, Krajcik, & Blumenfeld, focus primarily on expanding the number of youth
2006; Cohen & Ball, 2007a). Ultimately, the and sites. However, traditional scaling efforts have
goal of scaling an intervention is to successfully not typically explored additional dimensions of
replicate or adopt a program with more students, scale such as the nature of change envisioned for
in a variety of settings, and produce similar out- the intervention, how it is sustained, and the role
comes as the original intervention (Baker, 2005; of educators and sites in the scaling effort (Co-
Dede, 2006). burn, 2003). Moreover, (Dede, 2006) argues that
Research on programmatic scaling expands traditional scaling efforts need to avoid replicat-
on the definition and adds multiple dimensions ing a program intervention that worked locally
to the general notion of scale. According to scal- without taking into consideration the variations
ing is about the “adoption and diffusion of ideas and complexities of new locations. Addressing
across contexts.” Much of the research devoted the variation and complexities of a new target
to scale-up is concerned with increasing the num- population or setting, by carefully reflecting on
bers (Baker, 2005; Coburn, 2003; McDonald et scale, can lead to a better understanding of what
al., 2006). Constas and Brown, (2007) expand is needed to make a difference in teaching and
the definition to include magnitude (how many learning for a wider or larger context.
individuals and locations), breadth (what indi- Early in 2009, we began to apply the scaling
viduals and locations), and program complexity framework developed by Coburn (2003) and Dede
(how many iterations of the intervention will (2006) to the Geospatial and Robotics Technolo-
be supported). Coburn (2003) points out that in gies for the 21st Century (GEAR-Tech-21) project
some contexts the original intervention will be to help our project grow and evolve into a na-
replicated in different settings or the intervention tional effort. The guiding principle of the scaling
may be adopted and modified for use in a local framework is that scaling efforts need to avoid
setting. Furthermore, Dede, (2006) separates the replicating an intervention that worked locally
scaling of an individual intervention from the idea without taking into consideration the variations
of systemic reform. Whereas a single intervention and complexities of new locations. Addressing
can be scaled to multiple classrooms, a systemic the variation and complexities of a new target
reform typically involves a suite of interventions population or setting, by carefully reflecting on
that potentially transforms an entire school district. scale, can lead to a better understanding of what
Research also suggests that the scale-up is needed to make a difference in teaching and
process, when undertaken successfully, moves learning for a wider or larger context. Overall the
through at least three different stages. The first scaling framework is successfully being applied to
stage is to conduct individual studies or efficacy the GEAR-Tech-21 project as the project moves
trials to determine the effectiveness of the inter- from a statewide initiative to a national program.
vention in a particular setting. The second stage Clearly there are benefits and tradeoffs between
tests an intervention that has shown to be effec- scaling in an informal environment as compared
tive in the individual study in multiple settings to the formal education environment typically
with multiple populations of students. The final occurring within schools and school districts.

1373
Learning Geospatial Concepts as Part of a Non-Formal Education Robotics Experience

One apparent benefit is that there is no systemic for leaders whereby they are trained to deliver the
reform movement in the non-formal environment. program in their local community. For example
Often scaling up an intervention fails in the schools in 2009, training was held in seven states in more
because a) they fail to take root or b) classroom than 19 different locations. In addition, not every
teachers and other educators and staff find the site that implements the program has the means
interventions inconsistent with their local practices to easily acquire the robot kits and GPS units.
and norms (Blumenfeld, et al., 2006). Furthermore, Therefore, equipment is shared amongst sites for
teachers are often resistant to changes in practice a small fee. Equipment sharing has proved to be
that originate at the district or state level and are beneficial in getting new sites started. Moreover,
filtered down to the classroom level (Blumenfeld, the program has had success in scaling to new
et al., 2006). In addition, organizational rigidity in sites and additional audiences (Table 2).
the formal education system may leave little room
for innovation (Cohen & Ball, 2007b). Conversely, Curriculum
in the non-formal learning environments there is
little rigidity in organization structure that would Foundation-building activities in Year 1 allow
prevent innovation. More importantly, non-formal youth to develop fundamental skills and to learn
organizations, like 4-H, rely on volunteers to lead the vocabulary associated with each technology.
education experiences. To support innovations, In Year 1, youth are introduced to the basics of
new volunteers can be recruited and existing robotics, review underlying geography concepts,
volunteers retrained. Volunteers enjoy a degree gain experience with GPS-based navigation and
of flexibility not found in formal education. Un- data collection and are introduced to using and
like in the formal education system, volunteers manipulating GIS maps. Year 2 foci include
lack the similarity in preparatory training and more advanced robotics programming, GIS skill
past experiences. Many volunteers do not pos- development and integration of the technologies
ses the typical background content knowledge while exploring their potential in the world of
and educational skills of certified teachers while agriculture and natural resource management.
some are professionals in STEM fields but lack In addition to the technology-focused concepts,
teaching experience. This breadth requires greater activities are also designed to develop 21st Century
and more diverse instructional resources are fre- Skills in critical thinking and problem solving,
quently needed for the training and development communication and collaboration, and informa-
of volunteers. Finally, unlike formal learning tion media and technology (Partnership for 21st
environments, where criterion-reference tests are Century Skills, 2008; Pearson & Young, 2002).
routine, data collection in a non-formal environ-
ment is an ongoing challenge to implement, due
Table 2. Project camps and club participation
to time, support and training constraints.
Overall the framework has been helpful in Year Camps Camp Clubs Club
scaling the GEAR-Tech-21 program. However, the Participants Participants
process of scaling the program from a regional to a 2007 2 38 4 46
national program has encountered some challenges 2008 6 147 13 200
including professional development and logistics. 2009 5 178 19 297
To move the program to a national scale, the de- 2010 22 450 46 732
livery approach had to be modified from GEAR- 2011* 40 800 80 1200
Tech-21 staff leading the activities and camps to *Projected numbers of camps, clubs, and participants based on
a sustainable professional development program 2011 camp requests and projected club growth.

1374
Learning Geospatial Concepts as Part of a Non-Formal Education Robotics Experience

Designed for a non-formal learning envi- audience where these industries are the primary
ronment using Experiential (Kolb, 1984) and economic drivers. These industries are also of in-
Understanding by Design (Wiggins & McTighe, terest to non-rural youth as they are allengaged as
2005) learning models, project activities offer consumers on a daily basis. The technologies used
youth multiple experiences with each concept to in the curriculum are integrated in many formats
deepen their understanding and skill development. and are already in use by professionals working
The project uses experiential learning strate- in the field. Farmers utilize precision agriculture
gies combined with on-line eLearning delivery technology and methods to manage production in
methods. The 4-H experiential learning model a way that allows for more efficient use of reduced
provides learners with a concrete experience, a inputs, equipment automation and maximized crop
chance to reflect and observe the phenomenon quality and quantity. Environmental professionals
being studied, and then to apply the knowledge use geospatial and robotics technologies to track
to a new situation (Woffinden & Packham, 2001). and manage populations and habitats. The food
Each activity consists of an introductory, a guided processing industry uses robotics in processing
large group exercise, a prescribed exercise, an facilities and geospatial technologies in distribu-
on-line exercise where youth research real-world tion and strategic planning.
problems, industry applications and an additional The curriculum is published in a series of ac-
open-ended challenge applying the skills acquired tivities using Adobe Captivate (Adobe Systems
earlier in the program. At the conclusion, a series Inc., San Jose, California) to produce flash-based
of questions are provided for journaling, facilitated animations allowing for the inclusion of text,
group discussion or small group work to recap imagery, videos and software animations. Direct
the learning experience and help youth make the teaching of programming and mapping concepts
connections to outside, real-world applications in through click-through software animations allows
their own lives and career fields. youth to see actual settings in the software (Figure.
Such activities allow the program participants 1). This format permits participant interaction with
to relate their findings to contemporary challenges the media as well as for dissemination online, via
being solved using technology. Instead of focus- CD-ROM or in mobile formats. Adult facilitators
ing upon specific pieces of knowledge related are supported with guides and worksheets avail-
to a given activity, our emphasis has been on able online to be downloaded in PDF format, on
timing the introducion of technology to help in CD-ROM and in print. The curriculum is divided
the learning process and to determine what chal- into single-skill, flash-based modules. To maxi-
lenges outside the material of activity could be mize usage the curriculum can be accessed on-line
addressed. Eventually, this motivates participants or for sites with limited or no Internet connection,
to learn more related to the technologies and to the curriculum can be installed on the computer
extend their perceptions of the environment in and run locally. When running locally, learners
which they live. This approach also strengthens are not able to utilize the social networking tools
the connections youth participants have to career including the online discussion boards.
possibilities in related technological fields. A series of integrated activities that combine
GEAR-Tech-21 provides all youth with a basic geospatial and robotics technologies make up the
literacy in agriculture and natural resource systems. core of the advanced year of the curriculum. For
Including applications in agriculture and natural example, in the Robot Tracking activity youth
resource systems meets several curricular needs. program, the robot follows a planned course or
The agricultural and natural resources industries path while recording a track with a GPS sensor
have a particular appeal to 4-H’s traditional rural or GPS data logger. Recorded data is then im-

1375
Learning Geospatial Concepts as Part of a Non-Formal Education Robotics Experience

Figure 1. A screenshot of a software animation from the web-based robot tracking activity

ported into a GIS or digital mapping application Measurements activity requires youth to use their
such as ArcGIS (ESRI, Redlands, California) or robot as a data collection device at predetermined
Google Earth so youth can see an aerial image waypoints, measuring reflected light of foliage
overlaid with their robot’s path. With the Robot (darker green colors indicate greater nitrogen
Sensor Exploration activity, youth use a remote availability and absorption) and height of vegeta-
control program or run a programmed course tion (more vegetative mass indicates greater nutri-
while collecting and storing sensor data for view- ent and moisture availability) with the light and
ing in a GIS or in table format generated by a ultrasonic sensors, respectively (Figure 2). The
spreadsheet application. When working on the data is then imported into GIS software and in-
Boundary Mapping activity, youth rely upon the terpolated to create a field map showing vari-
robot’s sensors to identify a field’s natural bound- ability of the measured characteristic (Figure 2).
aries while recording the track data with a GPS In the Robotic Navigation activity, participating
sensor or data logger to import to GIS software youth identify a location using a handheld GPS
to build a base field file. The Surfacing Land receiver and input the location into software de-

Figure 2. Summer camp activities: collecting data with the robot’s sensors and presenting findings

1376
Learning Geospatial Concepts as Part of a Non-Formal Education Robotics Experience

veloped by an undergraduate student with the the input ports are used to equip the robot with
GEAR-Tech-21 program. Using a GPS data log- sound, ultrasonic distance, light reflectance and
ger and Bluetooth, youth can then autonomously mechanical switch sensors. LEGO Mindstorms
direct their robot to the designated geographic NXT software is used to program the robots and
coordinates. Or, if youth have a GPS sensor avail- to upload the code via either a USB, or a Blue-
able, the destination coordinates can be entered tooth connection.
into variable blocks within the LEGO NXT soft- Garmin eTrex Legend and Rino 110 (Garmin
ware. The robot can then autonomously find the International, Inc., Olathe, Kansas) GPS receivers
waypoint location. The sensor allows for a larger have been used to log waypoints and track paths
field to be used since Bluetooth is no longer a as well as for navigation. GoogleEarth (Google,
limiting factor to do this exercise. A follow-up Inc., Mountain View, California) software has been
challenge that is made possible with the sensor is used to introduce youth to the world of digital
for the robot to truly geocache by using its other geographic data. ArcGIS software has helped
sensors to find an object meeting given specifica- to further develop digital mapping awareness
tions within the range of error provided by the and provide freedom to customize geographic
GPS navigation. Finally, the curriculum includes data (Figure 3a). Using the DNRGarmin module
a series of open-ended challenges representing (Department of Natural Resources, St. Paul, Min-
real-world agricultural applications including nesota) allowed for a data interface between GPS
automated harvesting of row, vegetable and tree receivers and GIS software.
crops, rounding-up wayward livestock, applying Through a set of sequential stand-alone ac-
fertilizers and pesticides to particular areas of a tivities, youth gain perspective of the relationships
field, field scouting, and assessing wildlife popu- between angular measures of geographic latitude
lations. and longitude and physical distances. From the
early part of the program they learn to define
Hardware and Software travel speed and direction and to compute the
distance between two nearby locations with known
The LEGO Mindstorms NXT Education kit geographic coordinates. Later these primary skills
(LEGO Systems, Inc., Enfield, Connecticut) has are used to gain an understanding of the main
been selected as the main robotics platform for principles of navigation.
the curriculum. The unit has three output and four Blending GPS and robotic technologies results
input ports. Our activities use the output ports in a better experience for the camp participants.
to operate electrical motors (two for maneuver- When the two technologies are combined, program
ing and one for a mechanical manipulator), and participants have a greater opportunity to improve

Figure 3. Summer camp activities: digital map development and robot navigation

1377
Learning Geospatial Concepts as Part of a Non-Formal Education Robotics Experience

their perception of navigation principles that fre- the Peter Kiewit Institute (PKI, University of
quently require both logic and creative thinking Nebraska-Lincoln, Omaha, Nebraska) has been
skills to complete the prescribed challenges. considered as an alternative to the LEGO robotic
For the integrated robotics and GPS/GIS ac- kit. The CEENBoT will be capable of operating
tivities an i-Blue747 GPS Logger (Transystem, in a non-smooth surface environment (Figure 4).
Inc., Hsinchu, Taiwan) has been used to record An integrated GPS receiver for the LEGO kit is
geographic positions of the robot. To implement another option under development.
robot navigation capability (Year 2 curriculum), Custom hardware, software, and firmware
several stand-alone applications have been de- have been developed to create an integrated GPS
veloped using LabVIEW (National Instruments solution for the LEGO Mindstorms NXT robot
Corp., Austin, Texas). Our software allows simul- (Figure 5). The integrated GPS receiver operates
taneous wireless communication between a laptop seamlessly with the LEGO Mindstorms NXT
computer and both the LEGO Mindstorms NXT robot using one of its standard input ports. The
and i-Blue747 GPS Logger in order for robot NXT brick can simultaneously take input from
motion to be adjusted automatically based upon up to four sensors. Each sensor port uses a 6-pin
its relative position to a targeted, georeferenced modular connector that offers both analog and
marker (Figure 3b). The relatively short range digital interfaces. The integrated GPS receiver
of Bluetooth communication (approximately takes advantage of a digital protocol to commu-
10 meters) and low GPS positioning accuracy nicate with the NXT brick.
(approximately 3 meters) have been the main The integrated GPS sensor is based on the
limitations at this point in the project. However, USGlobalSat EM-406A GPS receiver and the
discovering and quantifying these limitations popular Arduino microcontroller platform. The
have become a focal point of robot navigation Arduino platform offers two advantages. First,
activities. Students have the ability to experiment the hardware platform facilitates rapid prototyp-
with different parameters of our navigation algo- ing for sensor development. Second, the Arduino
rithms to appreciate the importance of raw signal is a very popular platform that simplifies micro-
averaging and relating expected outcomes with controller programming significantly. Thus, the
the technical hardware capabilities. For example, Arduino by itself could potentially be used as a
they can change parameters in relation to the speed teaching tool in future extensions of the program
with which the robot reacts to changes in desired curriculum.
travel direction and what proximity to the target
is considered acceptable. Figure 4. New educational materials: one of
A recently funded curriculum development CEENBot prototypes
project by the National Science Foundation (NSF
#0733228) relates to the use of educational robot-
ics in formal educational settings and has added
university partners and broadened the scope of the
educational programming. The use of an educa-
tional cyber-infrastructure and a train-the-trainer
model to administer the curriculum are inherent
challenges being addressed by the expanding pro-
gram, but the project is indeed steadily growing
in scale and curriculum resources. In addition,
a robotics platform (CEENBoT) developed at

1378
Learning Geospatial Concepts as Part of a Non-Formal Education Robotics Experience

Figure 5. New educational materials: add-on


2010; Nugent et al., 2010). For example, in the
LEGO Mindstorm NXT GPS module
summer of 2010, 20 GEAR-Tech-21 camps were
held throughout the United States. A total of 371
youth participated. The overall mean age for youth
participating in the camps was 11.4, with 76%
male and 24% female. 74% were Caucasian or
Asian; 26% were from minority groups.
To measure STEM learning, a 41-item,
paper-and-pencil, multiple-choice assessment,
covering a variety of topics including computer
programming, mathematics, geospatial concepts,
engineering/robotics and engineering design was
developed. The STEM concept test was admin-
istered on the first day of the camp prior to the
start of program activities. The same assessment
The USGlobalSat EM-406A GPS receiver was also administered on the last day of camp as
features high tracking sensitivity, the ability to a post-test.
track up to 20 satellites at one time, and respect- There was a significant overall increase in
able accuracy for an inexpensive receiver. GPS learning from the pretest to the post-test (Table
information can be accessed within NXT-G, 2). There was also a significant increase in scores
which is the graphical programming environment on the questions dealing with geospatial concepts.
bundled with the LEGO Mindstorms NXT kit. A These results suggest that the 4-H robotics and
graphical programming “block” is used to request geospatial summer camp program is a promising
GPS data from the sensor and to integrate into the approach for supporting robotics and geospatial
overall program. learning in a non-formal education setting.
Using a directly connected GPS sensor removes To capture changes in attitudes of the 4-H Ro-
many of the shortcomings associated with using botics and GPS/GIS youth participants, an assess-
the Bluetooth-based i-Blue747. The Bluetooth ment instrument was developed, which included
range restrictions are effectively eliminated, al- two scales dealing with GPS/GIS. Specifically, the
lowing the robot to explore areas well outside of instrument measured a youth’s perceived value of
a 10-meter radius. Furthermore, all processing GPS/GIS technologies and their self-efficacy in
can be done locally on the robot as opposed to performing GPS/GIS tasks. A sample task value
a laptop. Custom data processing and filtering question was “I like learning new technologies
algorithms can also be utilized to improve the like GIS.” A sample self-efficacy question was: “I
accuracy of the raw GPS data stream. am confident that I can use GPS technologies to
get to places that I have never been before.” Youth
Learning Impact Research responded using a 5-point Likert scale from 1 =
strongly disagree to 5 = strongly agree. By focusing
Various assessment instruments have been used on performance tasks, these scales complemented
to document participant learning and attitudinal the multiple-choice content test, which assessed
impact during summer camps, afterschool club general comprehension and knowledge. Partici-
gatherings and other relevant activities such as pants scored higher on the post-test than on the
short-term interventions, virtual competitions, pretest for both of these GPS/GIS scales, although
etc. (Barker, Grandgenett, Nugent & Adamchuk, the task value scale was not significant (Table 3).

1379
Learning Geospatial Concepts as Part of a Non-Formal Education Robotics Experience

Table 3. Student increases in STEM learning and attitudes

Pre Mean Post Mean t(df) P


STEM Learning
Overall 15.62 19.00 12.70 (306) p <.001
GPS/GIS 2.65 3.05 4.19 (306) p <.001
STEM attitudes
GIS/GPS task value 3.90 3.96 1.59(289) p =.057
GIS/GPS self-efficacy 3.90 4.30 8.50(279) p <.001

Future Developments In 2011, GEAR-Tech-21 expects 40 camp


sessions serving about 800 youth. The continued
The GEAR-Tech-21 program has experienced growth demands additional training opportuni-
significant and steady growth since the first pilot ties for camp coordinators. This need will be
camps and clubs in 2007. Camps have grown from met through the implementation of a train-the-
two sessions with 38 participants in 2007 to a pro- trainer model. The first group of GEAR-Tech-21
jected 40 camps with 800 participants in the summer “Regional Trainers” includes six 4-H staff and
of 2011. Similar growth patterns exist with clubs, volunteers who will provide training to camp co-
beginning with four clubs with 46 participants in ordinators and other interested educators in three
2007 to 46 clubs with 732 participants in the 2010- states. Regional Trainers were offered a stipend
2011 school year. The program’s goals are to scale and the equipment required for the program. They
up to serve at least 5000 youth in 450 clubs and are requested to train a minimum of 40 people per
camps cumulatively over five years. state. Regional Trainers will provide training to
Shifts in the delivery methods of the GEAR- at least five camps in three states while GEAR-
Tech-21 program have facilitated the growth Tech-21 staff will travel as well as use distance
from 2007 to 2011 and beyond. For the first three education technologies to reach the remaining
years, project staff directly planned, coordinated, camps. The project will continue to add three or
and taught the camp programs. In 2010, GEAR- more new Regional Trainers each year to assist
Tech-21 staff trained local camp coordinators to with training all interested GEAR-Tech-21 camp
lead the activities with youth using a camp-specific coordinators and club leaders.
curriculum. As part of the camp training sessions, The primary barriers to creating new clubs and
GEAR-Tech-21 staff openly invited all interested camps are the availability of confident, trained
educators in the area. With this method, many adult leaders and the equipment required. While
more potential GEAR-Tech-21 volunteers were the need to educate adults is well addressed through
trained than if only the camp coordinators were the training system, assisting groups in getting
trained. Camp training sessions were held at 15 equipment is more challenging to scale. GEAR-
locations across the country and reached a total Tech-21 currently lends nearly 100 robotics kits
of 180 attendees. out to clubs and camps each year for a small fee.
The increase in adult volunteers and staff This resource helps the local GEAR-Tech-21
trained to lead GEAR-Tech-21 camp programs program get started. After the initial interest is
allowed multiple sessions to occur at the same generated, equipment resources or funding are
times in different locations. Over four times as best acquired locally. GEAR-Tech-21 professional
many camp sessions (22 sessions with 450 par- development sessions address the challenge of
ticipants) were held in 2010 as compared to the finding equipment by providing ideas for com-
previous years (5 sessions with 178 participants).

1380
Learning Geospatial Concepts as Part of a Non-Formal Education Robotics Experience

munity connections as well as brainstorming for Anderson, W., Banerjee, U., Drennan, C. L., El-
additional local resources among the attendees. gin, S., Epstein, I. R., & Handelsman, J. (2011).
Although the National Science Foundation has Changing the culture of science education at re-
made GEAR-Tech-21’s curriculum development search universities. Science, 331(6014), 152–153.
and growth possible, project leaders intend to doi:10.1126/science.1198280
make the project sustainable beyond additional
Avanzato, R. (2000). Mobile robotics for fresh-
grant funding. Initial steps toward sustaining the
man design, research, and high-school outreach.
project include designing curriculum components
Proceedings of the IEEE, International Confer-
for volunteers and participants to purchase (printed
ence Systems, Man, and Cybernetics, 1, 736–738.
leaders’ guides, youth workbooks, curriculum
DVDs, supporting camp materials, t-shirts, etc.). Baker, E. (2005). Principles for scaling up:
GEAR-Tech-21 also expects to continue to de- Choosing, measuring effects, and promoting the
velop the integrated GPS sensor for the LEGO widespread use of educational innovation. In
Mindstorms NXT kit into a commercially avail- Schneider, B., & McDonald, S. (Eds.), Scale-up
able product. In addition, GEAR-Tech-21 staff in education: Ideas in principle (pp. 37–54).
will continue to seek grant funding to assist with Lanham, MD: Rowman & Littlefield.
sustaining the project.
Barker, B., Grandgenett, N., Nugent, G., & Ad-
amchuk, V. (2010). Pairing educational robotics
with geospatial technologies in informal learning
SUMMARY
environments. Journal of Youth Development,
5(2), 48–56.
Improving STEM education is a steadily growing
and increasingly alarming national challenge, and Blumenfeld, P. C., Kempler, T. M., Krajcik, J. S.,
the Robotics and GPS/GIS in 4-H: Workforce Skills & Blumenfeld, P. (2006). Motivation and cogni-
for the 21st Century project has been initiated to tive engagement in learning environments. In
help meet that challenge and to provide a non- Sawyer, R. K. (Ed.), The Cambridge handbook of
formal education experience for 11-15 year old the learning sciences. New York, NY. Cambridge.
youth to encourage their consideration of STEM
Chambers, J., & Carbonaro, M. (2003). Design-
areas for their future academic and post-graduate
ing, developing, and implementing a course on
careers. The use of robotics, GPS, and GIS tech-
LEGO robotics for technology teachers education.
nologies under the theme of precision agriculture
Journal of Technology and Teacher Education,
and natural resource management appears to ap-
11, 209–241.
peal to many groups of youth when participating
in summer camps. Students participating in the Coburn, C. E. (2003). Rethinking scale: Mov-
program showed increased short-term STEM ing beyond numbers to deep and lasting
learning and attitudes. change. Educational Researcher, 32(6), 3–12.
doi:10.3102/0013189X032006003
Cohen, D., & Ball, D. (2007a). Conceptualizing
REFERENCES
scale-up: An educational perspective. In Sch-
American College Testing. (2008). ACT college neider, B., & McDonald, S. (Eds.), Scaling-up in
readiness report news release. Retrieved from education. Lanham, MD: Rowman & Littlefield.
http://www.act.org/newsroom/releases/view.ph
p?year=archive&p=314&lang=english

1381
Learning Geospatial Concepts as Part of a Non-Formal Education Robotics Experience

Cohen, D., & Ball, D. (2007b). Educational in- Gura, M. M., & King, K. P. (2007). Classroom
novation and the problem of scale. In Schneider, robotics: Case stories of 21st century instruction
B., & McDonald, S. (Eds.), Scale-up in education. for millennial students. Charlotte, NC: Informa-
Lanham, MD: Rowman & Littlefield Publishers tion Age Publishing.
Inc.
Heer, R. L., Traylor, T. T., & Fiez, T. S. (2003).
Constas, M., & Brown, K. (2007). Toward a pro- Enhancing the freshman and sophomore ECE
gram of research on scale-up: Analytical require- student experience using a platform for learn-
ments and theoretical possibilities. In Schneider, ing. IEEE Transactions on Education, 46(4).
B., & McDonald, S. (Eds.), Scale-up in education: doi:10.1109/TE.2003.818752
Ideas in principle (pp. 247–258). Lanham, MD:
Hegedus, S. J., & Kaput, J. J. (2004). An introduc-
Rowman & Littlefield.
tion to the profound potential of connected algebra
Dede, C. (2006). Scaling up. In Sawyer, R. K. activities: Issues of representation, engagement
(Ed.), The Cambridge handbook of the learning and pedagogy. Proceedings of the 28th Conference
sciences (pp. 551–566). New York, NY: Cam- of the International Group for the Psychology of
bridge University Press. Mathematics Education, 3(1), 129-136.
Eshach, H. (2007). Bridging in-school and out- Heid, M. K. (2005). Technology in mathematics
of-school learning: Formal, non-formal, and in- education: Tapping into visions of the future. In
formal education. Journal of Science Education Masalski, W. J., & Elliott, P. C. (Eds.), Technology-
and Technology, 16(2), 171–190. doi:10.1007/ supported mathematics learning environments, the
s10956-006-9027-1 sixty-seventh yearbook of the National Council of
Teachers of Mathematics. Reston, VA: National
Fenichel, M., & Schweingruber, H. (2010). Sur-
Council of Teachers of Mathematics.
rounded by science: Learning science in informal
environments. Board on Science Education, Center Heid, M. K., & Edwards, M. T. (2001). Computer
for Education, Division of Behavioral and Social algebra systems: Revolution or retrofit for today’s
Sciences and Education. Washington, DC: The classrooms? Theory into Practice, 40(2), 128–136.
National Academies Press. doi:10.1207/s15430421tip4002_7
Fineberg, H. (2006). Address to The Academy of Horwitz, P. (2007). Successful scale-up in three
Medicine, Engineering & Science of Texas Annual stages. In Schneider, B., & McDonald, S.-K.
Meeting. January 5, 2006. (Eds.), Scale-up in education: Ideas in principle
(pp. 259–268). Lanham, MD: Rowman and
Fishman, B. (2005). Adapting innovations to par-
Littlefield.
ticular contexts of use: A collaborative framework.
In Dede, C., Honan, J., & Peters, L. (Eds.), Scaling Jonassen, D. H. (2000). Computers as mindtools
up success: Lessons learned from technology- for schools: Engaging critical thinking (2nd ed.).
based educational improvement. San Francisco, Upper Saddle River, NJ: Merrill.
CA: Jossey-Bass.
Kerski, J. J. (2003). The implementation and ef-
Goodman, P., & Pan, P. (2004, February 8). fectiveness of geographic information systems
Chinese workers pay for Wal-Mart’s low prices: technology and methods in secondary educa-
Retailer squeezes its Asian suppliers to cut costs. tion. The Journal of Geography, 102, 128–137.
The Washington Post. doi:10.1080/00221340308978534

1382
Learning Geospatial Concepts as Part of a Non-Formal Education Robotics Experience

Kolb, D. A. (1984). Experiential learning: Experi- National Science Board. (2007). Science, tech-
ence as the source of learning and development. nology, engineering, and mathematics (STEM)
Englewood Cliffs, New Jersey: Prentice-Hall. education issues and legislative options. In Nata,
R. (Ed.), Progress in education (Vol. 14, pp.
Martin, D. W. (2004). Doing psychology experi-
161–189). Washington, DC.
ments (6th ed.). Wadsworth.
National Science Board. (2009). National Science
McDonald, S.-K., Keesler, V. A., Kauffman, N.
Board STEM education recommendations for the
J., & Schneider, B. (2006). Scaling-up exemplary
President–Elect Obama administration. NSB-
interventions. Educational Researcher, 35(3),
0901. Retrieved on March 3, 2011, from http://
15–24. doi:10.3102/0013189X035003015
www.nsf.gov/nsb/publications/2009/01_10_
Medrich, E. A., Roizen, J., Rubin, V., & Buckley, stem_rec_obama.pdf.
S. (1982). The serious business of growing up.
Nugent, G., Barker, B., Grandgenett, N., & Adam-
Berkeley, CA: University of California Press.
chuk, V. (2010). Impact of robotics and geospatial
Motiff, R. (2007). Forecasting the effects of technology interventions on youth STEM learning
scaling up social programs. In Schneider, B., and attitudes. Journal of Research on Technology
& McDonald, S. (Eds.), Scale-up in education: in Education, 42(4), 391–408.
Ideas in principle (pp. 173–186). Lanham, MD:
Organization for Economic Cooperation and
Rowman & Littlefield.
Development. (2010). Pisa 2009 results: What
National Academy of Sciences. (2010). Rising students know and can do - Student performance
above the gathering storm, revisited: Rapidly in reading, matchematics and science. Retrieved
approaching category 5. 2005 “Rising Above the on August 11, 2011, from http://www.oecd.org/
Gathering Storm” Committee. Washington, DC: edu/pisa/2009
National Academies Press. Retrieved from http://
Partnership for 21st Century Skills. (2008). 21st
www.nap.edu/catalog/12999.html.
century skills education and competitiveness
National Research Council. (2009). Learning sci- guide. Retrieved from http://www.p21.org/
ence in informal environments: People, places, and documents/21st_century_skills_education_and_
pursuits (P. Bell, B. Lewenstein, A. W. Shouse, competitiveness_guide.pdf
& M. A. Feder, Eds). Committee on Learning
Person, G., & Young, A. (2002). Technically
Science in Informal Environments. Board on Sci-
speaking, why all Americans need to know more
ence Education, Center for Education. Division
about technology. Washington, DC: National
of Behavioral and Social Sciences and Education.
Academy Press. Retrieved from http://www.nap.
Washington, DC: The National Academies Press.
edu/openbook.php?isbn=0309082625
National Research Council. (2011). A framework
Sosniak, L. (2001). The 9% challenge: Education
for K-12 science education: Practices, crosscut-
in school and society. Teachers College Record,
ting concepts, and core ideas. Committee on a
103, 15.
Conceptual Framework for New K-12 Science
Education Standards. Board on Science Educa- Thompson, T., Heer, D., Brown, S., Traylor, R., &
tion, Division of Behavioral and Social Sciences Fiez, T. S. (2004). Educational design, evaluation,
and Education. Washington, DC: The National and development of platforms for learning. Paper
Academies Press. presented at the ASEE/IEEE Frontiers in Educa-
tion Conference, Savanna, Georgia, October, 2004.

1383
Learning Geospatial Concepts as Part of a Non-Formal Education Robotics Experience

U.S. Department of Education. (2007). Report of Wiggins, G., & McTighe, J. (2005). Understand-
the Academic Competitiveness Council. Washing- ing by design (2nd ed.). Upper Saddle River, NJ:
ton, DC: Author. Merrill.
Wellington, J. (1990). Formal and informal learn- Woffinden, S., & Packham, J. (2001). Experiential
ing in science: The role of interactive science learning, just do it! The Agriculture Education
centers. Physics Education, 25(5), 247–252. Magazine, 73(6), 8–9.
doi:10.1088/0031-9120/25/5/307
World Economic Forum. (2010). The global com-
Whitmeyer, S., Nicoletti, J., & De Paor, D. petitiveness report 2009-2010. Retrieved from
(2010). The digital revolution in geologic map- http://www.weforum.org/pdf/GCR09/Report/
ping and presentation. GSA Today, 20(4/5), 4–10. Countries/United%20States.pdf
doi:10.1130/GSATG70A.1

This work was previously published in Robots in K-12 Education: A New Technology for Learning, edited by Bradley S. Barker,
Gwen Nugent, Neal Grandgenett and Viacheslav I. Adamchuk, pp. 284-300, copyright 2012 by Information Science Reference
(an imprint of IGI Global).

1384
1385

Chapter 83
A Reflection on the Ph.D.
Program in Spatially
Integrated Social Science at
the University of Toledo
Bhuiyan Monwar Alam
The University of Toledo, USA

Jeanette Eckert
The University of Toledo, USA

Peter S. Lindquist
The University of Toledo, USA

ABSTRACT
The use of spatial analysis tools is on the rise in many academic fields and practical applications. These
tools enhance the ability to examine data from spatial perspectives. Though the study of place and space
has traditionally been the domain of the field of geography, growing numbers of researchers are turning
to these tools in the social sciences and beyond. The University of Toledo has established a unique Ph.D.
granting program to encompass the theories, tools, and applications of spatially integrated social sci-
ence. In the first couple of years of its inception the program has attracted students from different places
and diverse backgrounds. It is expected that the program will continue to thrive in attracting diverse
students, securing external grants, and positively impacting on the economy of Northwest Ohio. This
paper is a personal reflection of the views of the authors on the Ph.D. program in Spatially Integrated
Social Science at the University of Toledo two years after its inception in fall 2009. The views, by no
means, are of the University of Toledo, its SISS program, or any of the participating departments and
faculty members.

DOI: 10.4018/978-1-4666-2038-4.ch083

Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
A Reflection on the Ph.D. Program in Spatially Integrated Social Science at the University of Toledo

INTRODUCTION (Goodchild & Janelle, 2004). Anselin (2006)


categorizes spatial analysis techniques into three
The use of a variety of spatial analysis tools is general groups: exploratory spatial data analysis
increasing in social science research. While social (searching for patterns), visualization (methods
problems and study areas span multiple disciplines, of showing the patterns), and spatial modeling
the question of location has historically been the (methods of explaining and predicting patterns).
domain of geographers. The prevalence of cutting- Recently, the development of the GeoDa software
edge technologies like Geographic Information has created a program capable of both advanced
Systems (GIS) and Global Positioning Systems spatial statistics and mapping (Anselin et al.,
(GPS) in the social sciences is facilitating the 2006).
study of the role of place in the society. Thus,
geography can be a unifying field for the social
sciences, wherein traditional data analysis can be WHY SPATIALLY INTEGRATED
conducted within the context of place. SOCIAL SCIENCE?
While each discipline has historically pos-
sessed its own set of analytic tools, the increasing As much as GIS has revolutionized many disci-
ability to measure processes in a spatiotemporal plines and practices, including the social sciences,
context facilitates sharing of these new tools, a GIS itself does not fully account for or measure
general development that can then be applied to the complexities and relationships inherent in
local and unique conditions (Goodchild et al., spatial data (Páez & Scott, 2004). Páez and Scott
2000). Human demographic data, for example, has (2004) argue that current GIS software, even with
historically been presented in a series of tables. the recent inclusion of spatial modeling, do not
However, Weeks (2004) argues that such data is adequately account for heterogeneity, interdepen-
spatial in nature because it deals with character- dence or spatial association, and thus are not as
istics of human populations in specific regions. It accurate or useful as it could be if more advanced
varies based on time, space, and scale. The recent spatial statistics were used. Logan et al. (2010)
shift towards locational information in data sets argue that GIS gives way to more complex spatial
and displaying data in interactive maps in addition analysis tools when patterns on a map lead to ad-
to traditional tables and charts has brought to light ditional questions that simple visualization cannot
the importance and usefulness of spatial ways of answer. For example, a traditional chloropleth
looking at data in the social sciences. map made in GIS does not take into account the
As GIS tools have become more powerful and outliers or an uneven population. The longstanding
more widely available, they have been increasingly method of collecting data in tables and translating
used in the application side of the social sciences. it to maps only scratches the surface of displaying
More and more it is being realized that these tools and analyzing spatial data.
are equally powerful for theoretical queries and The last decade has seen the development of
social research. As Sui (2004) asks, GIS is the a cohesive shift towards spatial ways of thinking
answer, but what is the question? Sui (2004) goes about social data and a demand for tools that facili-
on to argue that advances in geocomputation will tate this approach (Voss, 2007). This area of study
continue to bring social science disciplines into the is increasingly referred to as Spatially Integrated
trend of spatializing previously non-spatial data. Social Science (SISS). A few key programs have
Spatial analysis is both an inductive and de- progressed SISS techniques, including the Uni-
ductive tool, as it can reveal unforeseen patterns versity of California at Santa Barbara (UCSB),
and test existing theories about expected patterns which is home to many centers and programs

1386
A Reflection on the Ph.D. Program in Spatially Integrated Social Science at the University of Toledo

related to the field, perhaps most notably the criminal justice and social science researchers
Center for Spatially Integrated Social Sciences (Inter-University Consortium for Political and
(CSISS, 2011). CSISS focuses on integrating the Social Research at the University of Michigan,
social sciences via the context of place, and also 2011). This is a short sampling of some of the
holds workshops for students, researchers, and centers that have created spatial analysis tools
instructors (CSISS, 2011). Also at UCSB is the and are focusing on an interdisciplinary approach
National Center for Geographic Information and to furthering spatial research. Many universities
Analysis and the UCSB Spatial Studies Center, across the United States offer classes, certificates,
making the University a vital hub of spatial infor- and degrees in GIS. The SISS program at the
mation research. CSISS reports that the majority University of Toledo establishes a Ph.D. degree
of participants in their offered workshops, for that focuses on the theories, tools, and approaches
which they receive far more applicants than they of advanced spatial analysis.
can accept, are from the field of geography, though
the ratio is decreasing, indicating greater interest
from other disciplines (CSISS, 2003). CSISS also THE SISS PROGRAM AT THE
produces sample syllabi to be used to teach spatial UNIVERSITY OF TOLEDO
analysis in a variety of disciplines.
The Pennsylvania State University (PSU) The SISS program is the only Ph.D. granting pro-
has a strong GIS component as well, hosting the gram of this nature in the Midwest. The Master’s
multidisciplinary Geospatial Information Systems Certificate Program in GIS at the University of
Council. PSU’s Population Research Institute Toledo is housed within the Department of Ge-
(PRI) focuses on demography in the context of ography and Planning, however students from
a variety of disciplines, and conducts statistical several disciplines, ranging from public health
and GIS analyses of population data. The PRI to civil engineering, continuously enroll in GIS
also offers an interdisciplinary dual Ph.D. de- classes, demonstrating the wide application of
gree in demography (http://www.pop.psu.edu/). this technology. Contrarily, the SISS program is
PSU’s partnership with CSISS has encouraged an interdisciplinary program incorporating the de-
participation in workshops on spatial analysis partments of Geography and Planning, Economics,
by demography students and faculty, helping to Political Science and Public Administration, and
further the integration of the social sciences, GIS, Sociology and Anthropology (Lindquist, 2009).
and spatial statistics. The SISS program at the University of Toledo
The University of Illinois at Urbana-Cham- incorporates many of the approaches outlined by
paign has established the University Consortium CSISS. It is centered on spatial analysis tools, theo-
of Geographic Information Science, another ries, and problems as well as spatial information
prominent GIS program to promote the interdis- processing technologies, including GIS, remote
ciplinary use of geospatial techniques. The GeoDa sensing, and digital image analysis (SISS Ph.D.
Center for Spatial Analysis and Computation at Graduate Program Handbook, 2009).
Arizona State University is the current home of The Department of Geography and Planning at
the GeoDa software (http://geodacenter.asu.edu/). the University of Toledo was already home to the
Another spatial analysis software tool is CrimeStat, Center for Geographic Information Sciences and
developed at the Inter-University Consortium for Applied Geographics (GISAG), a well-equipped
Political and Social Research at the University research center which conducts much of the
of Michigan. CrimeStat is in use for practical University of Toledo’s GIS work (Czajkowski et
application by police departments as well as by al., 2003). Through the GISAG, there have been

1387
A Reflection on the Ph.D. Program in Spatially Integrated Social Science at the University of Toledo

many joint projects between the departments of Qualitative Analysis in SISS, Policy Evaluation
Geography and Planning, Economics, Political and SISS, Space and Society: Critical Theory in
Science, Environmental Sciences, Business Ad- SISS, Discrete Choice Spatial Process Modeling,
ministration, and Civil Engineering. Thus, the Advanced Modeling Methods and Techniques in
Ph.D. program in SISS is a natural extension of SISS, Spatial Perspectives on the Environment,
this collaboration and the increasing applicability Spatial Transport Modeling and Planning, Directed
of spatial analysis (SISS, 2011). It is expected that Readings in SISS, Seminar in Special Topics, and
this degree program will solidify such research Doctoral Dissertation. The elective courses can be
focus, attracting faculty and students of diverse taken from any of the participating departments. In
background, and strengthen the university’s role addition, she/he has to complete 24 credit hours of
as a research institution. dissertation research and successfully defend the
The original proposal for the SISS program research as the requirements to receive the Ph.D.
was submitted to the Ohio Board of Regents degree (Lindquist, 2009; SISS, 2011).
in 2007, and the program was approved by the Currently the core instructors are comprised
Board of Regents and the University of Toledo’s of nine faculty members representing the par-
Board of Trustees in September 2008 (Lindquist, ticipating departments. Because the core faculty
2009). The first and second classes of four and members have diverse background and expertise,
five students entered into the program in fall 2009 and that the admitted students come with similar
and fall 2010. Although the application process diverse backgrounds, the students have wide open
for admission is on the move, it is anticipated that options to expertise in different subfields of SISS:
the number of students who would enroll into the transportation, housing, environment, economics,
program in fall 2011 will be stable compared to public health, demography, and such. The list of
last two years. As such one can be optimistic about core faculty members and their expertise are avail-
the sustainability of the program. The admitted able from the SISS program website (SISS, 2011).
students come from a diverse background: geog-
raphy, planning, health education, engineering,
sociology, economics, and such. The applicants EXPECTED CONTRIBUTION TO
have to meet three pre-requisites for entering NORTHWEST OHIO’S ECONOMY
into the program. The first of those is that they
have to have a master’s or equivalent in a Social Northwest Ohio is, like many older industrial
Science discipline. The second and third pre- areas, in a period of self-reinvention. The City of
requites include prior successful completion of two Toledo has a strong legacy in manufacturing and
courses in GIS and one course in graduate-level supplying glass and parts to Detroit auto industry.
multivariate statistics. A typical student would The region continues to shift away from traditional
successfully complete six core courses (18 credit manufacturing due to decline in that industry, but
hours), three advanced seminar courses (nine is hopeful to make use of the skilled labor pool
credit hours) and three elective courses (nine to focus on high tech manufacturing and alterna-
credit hours) to fulfill the requirement of 36 credit tive energies. The University of Toledo is very
hours of coursework. The core courses consist of involved in this transition, and plays a vital role
Spatial Statistics, Geographical Information Sci- in the regional economy. As such, the creation
ence in SISS, Foundations of SISS, SISS Theory, of the Ph.D. program in SISS at the University
Advanced Spatial Data Analysis, and Research of Toledo is expected to contribute to this by at-
Design. The advanced seminar courses, on the tracting a unique group of students, faculty, and
other hand, include Geo-Computation, Advanced collaborators. The SISS program is also expected

1388
A Reflection on the Ph.D. Program in Spatially Integrated Social Science at the University of Toledo

to contribute to the local economy by working as taking several unique variables into account, as
a research center assisting local organizations, well as to be able to measure and understand the
governments, and businesses. relationships and connections of different places
One specific area where the SISS program ex- and scales. There are several reasons, therefore,
pects to have a significant impact is in the area of to be optimistic that the Ph.D. program in SISS
transportation research. The Greater Toledo region at the University of Toledo will

You might also like