Professional Documents
Culture Documents
Bartolino Etal2011
Bartolino Etal2011
DOI 10.1007/s10144-010-0229-2
ORIGINAL ARTICLE
Francesco Colloca
Received: 9 January 2010 / Accepted: 22 June 2010 / Published online: 24 July 2010
Ó The Society of Population Ecology and Springer 2010
123
352 Popul Ecol (2011) 53:351–359
richness values that should be managed or conserved. and Gi* to handle non-normality (e.g., Tiefelsdorf 2002;
Usually the selected threshold ranges between 1 and 5% Griffith 2010). Simulation approaches are also commonly
(Myers et al. 2000; Gjerde et al. 2004; Grand et al. 2004; used to assess significance in relation to a null hypothesis
Orme et al. 2005), but it has also been raised to 25% when of spatial randomization (Anselin 1995). Moreover, local
the identification of wider areas was required (Garcia methods are affected by the identification of the appropri-
2006). ate range of influence, i.e., by the smoothing factor in
Spatially local methods compare the values measured in kernel analyses (Horne and Garton 2006) and by the
particular locations with those measured in the immediate neighborhood structure in local measures of spatial auto-
surrounding areas. In this case, the definition of what correlation (Fortin and Dale 2005; Nelson and Boots 2008).
represents the neighborhood for the observation is usually Even though some statistical procedures can help in the
determined by subjective decision (Atkinson and Unwin choice of an appropriate neighborhood/range of influence
2002; Haining 2003; Nelson and Boots 2008). The most (Haining 2003; Nelson and Boots 2008), an important
commonly used spatially local methods are kernel esti- component of arbitrariness still remains that potentially
mation (Fortin and Dale 2005) and local measures of affects both size and location of the identified hotspots.
spatial autocorrelation, like Moran’s Ii and Getis’ Gi
(Nelson and Boots 2008). Kernel estimators have been
widely used in ecological studies, mainly for home range
Materials and methods
detection (e.g., Horne and Garton 2006), while Getis’ Gi*
and Moran’s Ii have been used especially in socio-eco-
A frequency distribution approach to hotspots
nomic studies (e.g., Anselin et al. 2005; Moons et al. 2008).
identification
Partially local methods use a combination of local and
global approaches, with a spatially global threshold that is
We propose a new spatially global method which can be
applied to a spatially local measure (Nelson and Boots
used to identify hotspots for conservation or ecological
2008). For example, Grand et al. (2004) used a 5% global
purposes. Our approach can be easily used for any phe-
threshold to identify bird and moth hotspots inside their
nomenon that is distributed over spatially autocorrelated
study area; however, in areas of particularly high species
surfaces, and is based on cumulative relative frequency
richness, they raised the threshold value to capture in their
distribution (CRFD) curves. A CRFD curve f(x) is
hotspots only the highest species counts, introducing a
approximated plotting the relative value of the variable
local component.
z (i.e., density, biodiversity, etc.), against the frequency
Spatially local methods are usually more effective for
distribution of the same variable. The two axes of the plot
hotspot analyses compared to spatially global ones, espe-
are given by:
cially when the area considered is large and the processes
being mapped are non-stationary (Nelson and Boots 2008). z
x¼ ð1Þ
Moreover, the smoothing effect of spatially local methods zm
can be important when the neighborhood structure has a where z is the variable of interest and zm is the maximum
biological or practical relevance (e.g., structural connec- value that z assumes in the study area, and
tivity in a metapopulation context, known level of uncer-
tainty in the population data, etc.). Moran’s Ii and Getis’ Gi*, MðxÞ
y¼ ð2Þ
in particular, are useful to identify clusters of values that are N
high relative to a global mean; moreover, they are usually where M(x) is the number of cases (i.e., points, pixels,
more robust than kernel estimators, being less affected by areas) a value smaller than x is encountered, and N is the
the choice of neighborhood and thus more appropriate when total number of cases. The resulting f(x) curve is charac-
only limited knowledge is available about the phenomenon terized by having both axes ranging from 0 to 1.
under consideration (Nelson and Boots 2008). The tangent to the curve f(x) at a point (x0, f(x0)) has
Although the most appropriate method should be slope f 0 (x0), corresponding to the derivative of f(x) in x0,
selected according to the specific objectives and applica- and is calculated as:
tions considered, all the approaches briefly described above
y f ðx0 Þ
suffer from one or more elements of subjectivity. Global f 0 ðx0 Þ ¼ ð3Þ
x x0
and partially local methods are based on the selection of a
pre-defined threshold that has little, if any, ecological jus- A tangent with \45° slope (f 0 (x0) \ 1) will indicate
tification, and that heavily influences the results. Local areas where the relative increase along the y-axis (the
methods, in general, work better with normally distributed cumulative relative frequency) is smaller than the relative
data although several corrections have been suggested for Ii increase along the x-axis (the variable of interest), i.e.,
123
Popul Ecol (2011) 53:351–359 353
those areas where the variable considered can be found at detect biodiversity hotspots for terrestrial vertebrates on the
high densities. A tangent with [45° slope (f 0 (x0) [ 1) will Italian peninsula.
indicate areas where the relative increase along the y-axis
is greater than the relative increase along the x-axis, i.e., Case studies
those areas where the variable considered can be found at
low densities. A tangent with 45° slope (f 0 (x0) = 1) will We obtained a dataset on fish density and one on terrestrial
indicate areas where the relative increase along the y-axis vertebrate species richness, respectively, from the pro-
is equal to the relative increase along the x-axis, i.e., those ject BECAUSE (http://www1.uni-hamburg.de/BECAUSE,
areas where the variable considered can be found at accessed on 2 March 2009; Colloca et al. 2009) and
densities proportional to the frequency. Maiorano et al. (2007). The project BECAUSE provided a
In practice, the tangent to the curve f(x) at each point can Bayesian kriging estimate of European hake recruit log-
be seen as the accumulation rate for the phenomenon of densities (n/km2) off the central-western coast of Italy (FAO
interest. The CRFD curve generated by a random process geographical sub-area 9, covering 42,419 km2; Fig. 1) over
will approximate a 45° line, but almost any biological a 2 km by 2 km estimation grid (7,290 cells). Maiorano et al.
process will produce a complex CRFD curve approaching (2007) provided validated distribution models for 468
an upper asymptote. We suggest that the highest x0 corre- species of terrestrial vertebrates occurring in the Italian
sponding to a 45° slope tangent to the curve can be used as peninsula (Fig. 1). The original models have a spatial reso-
a global threshold to identify hotspots. In fact, the 45° slope lution of 100 m by 100 m (more than 30 million cells); for
tangent, and no other arbitrary threshold, will discriminate computational reasons, we re-sampled the entire dataset with
the areas characterized by the highest values of the variable a resolution of 10 km, obtaining a total of 3,115 cells. We
z and at the same time by a rate of accumulation along the calculated species richness as the number of species occur-
x-axis higher than that along the y-axis. Other spatially ring in each cell.
global methods, and also the frequency distribution We used the two datasets to compare hotspots identified
approach, compare local values with the distribution of by the frequency distribution approach that we proposed,
values across the whole study area, regardless of the local with hotspots identified through some of the most widely
neighborhood structure. However, under the assumption of used techniques in ecology and conservation: Moran’s Ii,
spatial autocorrelation, the high values selected as hotspots used to calculate spatially local hotspots, Getis’ Gi*, for
will not occur in isolation, and the frequency distribution partially local hotspots, and a set of predefined thresholds
method will thus identify all the areas with the highest for global hotspots.
values of the phenomenon of interest (i.e., species diversity To implement the spatially local and partially local
and/or species density). methods, we set for both datasets three neighborhood lev-
We show in the remainder of the paper two applications els: 8, 24, and 48 neighboring cells. For each of the three
of the frequency distribution approach presented, one to neighborhood levels in the spatially local method we
detect density hotspots for European hake (Merluccius defined as hotspots all those cells with a positive local
merluccius) along the western coast of Italy, and another to Moran’s Ii and P \ 0.01. In the partially local method, and
123
354 Popul Ecol (2011) 53:351–359
a b c
Cumulative frequency
Frequency
Latiitude (°N)
Relative density Density Longitude (°E)
Fig. 2 a Cumulative relative frequency distribution curve of hake with the proportion of area (shade area) above the identified density
recruit density showing the 45° slope tangent to the curve (dotted line) threshold (dotted line). c Spatial distribution of hake recruit density
at the highest density value. b Frequency distribution of fish density hotspots from the frequency distribution method
Fig. 3 Spatial distribution of hake recruit density hotspots from Moran’s Ii (with 8, 24 and 48 neighboring cells), Getis’ Gi* (with 8, 24 and 48
neighboring cells), and global predefined thresholds (5, 10 and 15%)
for each of the three neighborhood levels, we applied a approach, we applied three arbitrary commonly used global
global threshold to select as hotspots all cells in the upper thresholds, considering as hotspots, respectively, the 5, 10,
5% of the Gi*-statistics distribution. In the spatially global and 15% highest values.
123
Popul Ecol (2011) 53:351–359 355
To calculate the hotspots according to the frequency Table 1 Total number of hotspots of hake recruit density, percent-
distribution approach, we built for both datasets a CRFD age of the Tyrrhenian and Ligurian Seas identified as hotspots, and
percentage of overlap with the hotspots identified by the frequency
curve and we calculated the related 45° slope tangent.
distribution method (FD)
Because the curves were approximated by discrete fre-
quencies, the tangents to the curves were calculated as the % Hotspot % Overlap with FD Number of hotspots
lines passing through pairs of adjacent points. G8 5.0 100.0 9
G24 5.0 100.0 5
G48 5.0 100.0 4
Results Glob.thr 5 5.0 100.0 10
Glob.thr 10 10.0 100.0 8
The density hotspots for European hake recruits calculated Glob.thr 15 15.0 100.0 10
with the frequency distribution approach (Fig. 2) occupy
FD 15.3 – 9
approximately the same areas as those identified by the
I8 14.6 100.0 8
other three methods (Fig. 3; Table 1). All the methods
I24 19.8 77.1 9
identified density hotspots that are consistent with previous
I48 23.5 64.7 8
knowledge on the bathymetric and spatial distribution of
hake recruits in the study area (Ardizzone and Corsi 1997; Results obtained with three different neighborhood levels (8, 24 and
48 cells) for the Getis’ Gi* and the Moran’s Ii, three global thresholds
Abella et al. 2005; Bartolino et al. 2008). Hake recruits
(Glob.thr 5, Glob.thr 10 and Glob.thr 15%), and the frequency dis-
concentrate at the limit of the continental shelf and the tribution approach
upper slope, where elevated local productivity supports the
rich demersal communities of the shelf-break (Colloca Table 2 Total number of hotspots of vertebrate species richness,
et al. 2004). However, the number and size of the hotspots percentage of the Italian peninsula and islands identified as hotspots,
being defined was different for different approaches and percentage of overlap with the hotspots identified by the fre-
quency distribution methods (FD)
(Tables 1 and 2). We identified a minimum of 4 (using
Getis’ Gi* with 48 neighboring cells) and a maximum of 10 % Hotspot % Overlap with FD Number of hotspots
hotspots (using the 5 and 15% global thresholds) for hake
G8 5.0 31.4 15
recruits. The high percentage of overlap between the hot-
G24 5.0 26.3 8
spots identified with our method and all the other local,
G48 5.0 22.4 4
partially local, and global approaches (Table 1) suggested
Glob.thr 5 4.6 21.6 27
that the same peaks of fish density were detected as hot-
Glob.thr 10 9.8 32.7 45
spots. Hotspots identified through the frequency distribu-
Glob.thr 15 14.8 69.4 44
tion approach occupied 15.3% of the study area. Similar
FD 3.2 – 22
estimates for the area occupied by the hotspots were
I8 6.2 42.7 14
obtained with Moran’s Ii with 8 neighborhood cells
I24 15.5 18.7 10
(14.6%), and with the 15% global threshold (Table 1).
I48 22.2 13.7 14
The frequency distribution biodiversity hotspots for
terrestrial vertebrates in Italy (Fig. 4) covered 3.2% of the Results obtained with three different neighborhood levels (8, 24 and
study area and clearly identified the middle elevation areas 48 cells) for the Getis’ Gi* and the Moran’s Ii, three global thresholds
(Glob.thr 5, Glob.thr 10 and Glob.thr 15%) and the frequency dis-
along the two main mountain ranges, the Alps and the tribution approach
Apennines, as already identified by Maiorano et al. (2006,
2007). All the other methods identify the same geograph- last 40 years (Falcucci et al. 2007), and some of them are
ical regions as important (Fig. 5), with generally larger under some level of protection (Maiorano et al. 2008).
areas (Table 2). In this case, the variability in area and As expected, the results obtained with the ‘‘traditional’’
number of hotspots as identified with the ‘‘traditional’’ methods (Getis’ Gi*, Moran Ii, and global threshold) were
methods was high (Table 2). Moran’s Ii with 8 neighbor- highly dependent on the particular setting that was chosen,
hood cells and the 5% global threshold identified as with hotspots appearing and disappearing (Figs. 3 and 5)
hotspots 6.2 and 4.6% of the study area, respectively. depending on the threshold or neighborhood level selected.
A maximum of 22.2% of the study area was selected as a In general, an increase in the neighborhood levels or
hotspot by the Moran’s Ii with 48 neighborhood cells thresholds corresponded to larger areas, while the number
(Fig. 5). All methods identified as important the areas of identified hotspots showed no clear relationship with the
along the lower mountain slopes in the Italian peninsula. particular settings adopted, with the exception of the Getis’
These areas, especially along the northern Apennine range, Gi* hotspots, whose number decreased with increasing
have been abandoned by agriculture and reforested in the neighborhood levels.
123
356 Popul Ecol (2011) 53:351–359
a b c
Cumulative frequency
Latiitude (°N)
Frequency
Relative species richness Species richness Longitude (°E)
Fig. 4 a Cumulative relative frequency distribution curve of area (shade area) above the identified species richness threshold
terrestrial vertebrate species richness showing the 45° slope tangent (dotted line). c Spatial distribution of terrestrial vertebrate biodiver-
to the curve (dotted line) at the highest species richness value. sity hotspots in the Italian peninsula from the frequency distribution
b Frequency distribution of species richness with the proportion of method
123
Popul Ecol (2011) 53:351–359 357
Fig. 5 Spatial distribution of terrestrial vertebrate biodiversity hotspots from Moran’s Ii (with 8, 24 and 48 neighboring cells), Getis’ Gi* (with 8,
24 and 48 neighboring cells), and global predefined thresholds (5, 10 and 15%)
across the whole spatial process, and consequently it does application of all spatially global methods, including the
not specifically search for local structures. This makes the frequency distribution approach. Consequently, the method
geometric approach presented here and all the other global can be effectively applied only in the case of spatially
measures attractive for their low computational costs, and autocorrelated processes, where high values have a low
particularly easy to be applied on high-resolution or large- probability to occur in isolation. Moreover, the spatially
scale studies. At the same time, global approaches are not global methods should be employed under the assumption
able to account for the local context in which high values of homogeneous processes (i.e., hake recruits in our study
occur. This represents an important limitation for the area are part of a single fish population). Because this
123
358 Popul Ecol (2011) 53:351–359
condition rarely occurs, large areas characterized by non- Leptometra phalangium (Echinodermata, Crinoidea). Mar Biol
stationarity should be partitioned into smaller regions of 145:1129–1142
Colloca F, Bartolino V, Jona Lasinio G, Maiorano L, Sartor P,
investigation before spatially global thresholds should be Ardizzone G (2009) Identifying fish nurseries using density and
applied without the risk of losing locally relevant hotspots. persistence measures. Mar Ecol Prog Ser 381:287–296
The example on terrestrial vertebrates that we provided Falcucci A, Maiorano L, Boitani L (2007) Changes in land-use/land-
is particular in this respect. In fact, species diversity in cover patterns in Italy and their implications for biodiversity
conservation. Landscape Ecol 22:617–631
the Italian peninsula cannot be assumed to represent an Fortin MJ, Dale MRT (2005) Spatial analysis: a guide for ecologists.
homogeneous process (Italy hosts at least 6 different bio- Cambridge University Press, Cambridge
geographic regions; Falcucci et al. 2007), but the same Garcia A (2006) Using ecological niche modeling to identify diversity
peninsula is a single management/conservation unit at the hotspots for the herpetofauna of Pacific lowlands and adjacent
interior valleys of Mexico. Biol Conserv 130:25–46
political level, necessarily asking for analyses that do not Gjerde I, Rolstad J, Blom HH, Storaunet KO (2004) Fine-scale
consider hotspots that are important only locally. diversity and rarity hotspots in northern forests. Conserv Biol
An interesting future development of our approach 18:1032–1042
would be represented by the implementation of a rational Grand J, Buonaccorsi J, Cushman SA, Griffin CR, Neel MC (2004) A
multiscale landscape approach to predicting bird and moth rarity
framework to include local analyses based on the use of hotspots in a threatened pitch pine-scrub oak community.
CRFD curves, or some other related curve (e.g., geosta- Conserv Biol 18:1063–1077
tistical aggregation curves; Petitgas 1998), and their asso- Griffith DA (2010) The Moran coefficient for non-normal data. J Stat
ciated tangent with 45° slope. In this case, the method that Plan Infer. doi:10.1016/j.jspi.2010.03.045
Haining R (2003) Spatial data analysis: theory and practice.
we propose could be specifically tailored towards a more Cambridge University Press, Cambridge
detailed definition of the boundaries of hotspots that have Horne JS, Garton EO (2006) Likelihood cross-validation versus least
been identified considering also the local spatial context. squares cross-validation for choosing the smoothing parameter in
Further testing is required using different datasets and kernel home-range analysis. J Wildl Manage 70:641–648
Houghton JDR, Doyle TK, Wilson MW, Davenport J, Hays GC (2006)
spatial processes; however, we are confident that the fre- Jellyfish aggregations and leatherback turtle foraging patterns in a
quency distribution approach can be successfully intro- temperate coastal environment. Ecology 87:1967–1972
duced in ecology and conservation studies as an effective Kerswell AP (2006) Global biodiversity patterns of benthic marine
contribute to the current set of methods available for the algae. Ecology 87:2479–2488
Kissling ML, Reid M, Lukacs PM, Gende SM, Lewis SB (2007)
identification of hotspots. Understanding abundance patterns of a declining seabird:
implications for monitoring. Ecol Appl 17:2164–2174
Acknowledgments We especially thank Nathan M. Bacheler for his Legendre P, Fortin MJ (1989) Spatial pattern and ecological analysis.
helpful comments on an earlier draft of the manuscript, and two Plant Ecol 80:107–138
anonymous reviewers for a useful discussion. Maiorano L, Falcucci A, Boitani L (2006) Gap analysis of terrestrial
vertebrates in Italy: priorities for conservation planning in a
human dominated landscape. Biol Conserv 133:455–473
Maiorano L, Falcucci A, Garton EO, Boitani L (2007) Contribution of
References the Natura 2000 network to biodiversity conservation in Italy.
Conserv Biol 21:1433–1444
Abella A, Serena F, Ria M (2005) Distributional response to Maiorano L, Falcucci A, Boitani L (2008) Size-dependent resistance
variations in abundance over spatial and temporal scales for of protected areas to land-use change. Proc R Soc Lond B
juveniles of European hake (Merluccius merluccius) in the 275:1297–1304
Western Mediterranean Sea. Fish Res 71:295–310 Marshall CT, Frank KT (1994) Geographic responses of groundfish to
Aldstadt J, Getis A (2006) Using AMOEBA to create a spatial weights variation in abundance: methods of detection and their imple-
matrix and identify spatial clusters. Geogr Anal 38:327–343 mentation. Can J Fish Aquat Sci 51:808–816
Anselin L (1995) Local indicators of spatial association––LISA. Moons E, Brijs T, Wets G (2008) Hot spots analysis: improving a
Geogr Anal 27:93–115 local indicator of spatial association for application in traffic
Anselin L, Syabri I, Kho Y (2005) GeoDa: and introduction to spatial safety. In: Gervasi O, Murgante B (eds) Computational science
data analysis. Geogr Anal 38:5–22 and its applications. Proceedings of ICCSA 2008, pp 221–231
Ardizzone GD, Corsi F (1997) Atlas of Italian demersal fishery Myers N, Mittermeier RA, Mittermeier CG, da Fonseca GAB, Kent J
resources. Biol Mar Medit 4:1–479 (2000) Biodiversity hotspots for conservation priorities. Nature
Atkinson PJ, Unwin DJ (2002) Density and local attribute estimation 403:853–858
of an infectious disease using MapInfo. Comput Geosci Nelson TA, Boots B (2008) Detecting spatial hot spots in landscape
28:1095–1105 ecology. Ecography 31:556–566
Bartolino V, Ottavi A, Colloca F, Ardizzone GD, Stefánsson G (2008) Orme CDL, Davies RG, Burgess M, Eigenbrod F, Pickup N, Olson
Bathymetric preferences of juvenile European hake (Merluccius VA, Webster AJ, Ding TS, Rasmussen PC, Ridgely RS,
merluccius). ICES J Mar Sci 65:963–969 Statterfield AJ, Bennett PM, Blackburn TM, Gaston KJ, Owen
Bonn A, Rodrigues ASL, Gaston KJ (2002) Threatened and endemic IPF (2005) Global hotspots of species richness are not congruent
species: are they good indicators of patterns of biodiversity on a with endemism or threat. Nature 436:1016–1019
national scale? Ecol Lett 5:733–741 Petitgas P (1998) Biomass-dependent dynamics of fish spatial
Colloca F, Carpentieri P, Balestri E, Ardizzone GD (2004) A critical distributions characterized by geostatistical aggregation curves.
habitat for Mediterranean fish resources: shelf break areas with ICES J Mar Sci 55:443–453
123
Popul Ecol (2011) 53:351–359 359
Roberts CM, McClean CJ, Veron JEN, Hawkins JP, Allen GR, Weathers KC, Levett GM, Likens GE, Lathrop R (2000) The effect of
McAllister DE, Mittermeier CG, Schueler FW, Spalding M, landscape features on deposition to Hunter mountain, Catskill
Wells F, Vynne C, Werner TB (2002) Marine biodiversity Mountains, New York. Ecol Appl 10:528–540
hotspots and conservation priorities for tropical reefs. Science Williams P, Gibbons D, Margules C, Rebelo A, Humphries C, Pressey
295:1280–1284 R (1996) A comparison of richness hotspots, rarity hotspots, and
Swain DP, Sinclair AF (1994) Fish distribution and catchability: what complementary areas for conserving diversity of British birds.
is the appropriate measure of distribution? Can J Fish Aquat Sci Conserv Biol 10:155–174
51:1046–1054 Worm B, Lotze HK, Myers RA (2003) Predator diversity hotspots in
Swain DP, Wade EJ (1993) Density-dependent geographic distribu- the blue ocean. Proc Natl Acad Sci USA 100:9884–9888
tion of Atlantic cod in the southern Gulf of St Lawrence. Can J
Fish Aquat Sci 50:725–733
Tiefelsdorf M (2002) The saddlepoint approximation of Moran’s I’s
and local Moran’s Ii’s reference distributions and their numerical
evaluation. Geogr Anal 34:187–206
123