Professional Documents
Culture Documents
10
12
13Abstract
14Geological Image Analysis Software (GIAS) combines basic tools for calculating
16location, with the first automated method for characterizing the aerial distribution of
18analyses include tests for: (1) Poisson, (2) Normalized Poisson, (3) Scavenged k = 1,
20Graphical User Interface (GUI) that is available as pre-parsed pseudocode for use
22systems. GIAS can process raster data (e.g., satellite imagery, photomicrographs, etc.)
23and tables of object coordinates to characterize the size, geometry, orientation, and
1 1
26use of stereology to transform 2D object sections into 3D models, and establishes a
29
32
331. Introduction
38spatial analysis may be used to quantify object distributions and investigate their
39formation processes.
42Health (NIH). Custom plug-in modules enable ImageJ to solve numerous tasks,
44sections (e.g., Szramek et al, 2006, Polacci et al., 2007). However, ImageJ and similar
45programs for Windows (e.g., Scion Image and ImageTool) and Macintosh (e.g., NIH
46Image) are not specifically designed for geological applications nor do they provide
50program combines (1) an image processing module for calculating and visualizing
2 2
51object areas, abundance, radii, perimeters, eccentricities, orientations, and centroid
52locations, and (2) a spatial distribution module that automates sample-size dependent
53nearest neighbor (NN) analyses. Although other programs can perform the basic
54functions in the “Image Analysis" module, GIAS is the first program to automate
56
572. Motivation
61sensing imagery include the study of volcanic landforms (Bruno et al., 2004; 2006;
62Baloga et al., 2007; Bishop, 2008; Hamilton et al., 2009; Bleacher et al., 2009),
63sedimentary mud volcanoes (Burr et al., 2009), periglacial ice-cored mounds (Bruno
64et al., 2006), glaciofluvial features (Burr et al., 2009), dune fields (Wilkins and Ford,
652007), and impact craters (Bruno et al., 2006). Despite widespread utilization of NN
66analyses, its value as a remote sensing tool and effectiveness as basis for comparison
71objects in photomicrographs such as crystals (Jerram et al., 1996; 2003; Jerram and
73analyses to vesicle distributions to demonstrate how GIAS can be used to validate (or
3 3
75stereological techniques to derive vesicle volumes from photomicrographs and for
78magmas and can be used to investigate the dynamics of explosive and effusive
79volcanic eruptions (e.g., Mangan et al., 1993; Cashman et al., 1994; Mangan and
80Cashman 1996; Cashman and Kauahikaua, 1997; Polacci and Papale 1997; Blower et
81al., 2001; Blower et al., 2003; Gaonac'h et al., 2005; Shin et al., 2005; Lautze and
82Houghton, 2005; Adams et al., 2006; Polacci et al., 2006; Sable et al., 2006; Gurioli
83et al., 2008). Quantitative vesicle analyses stem from Marsh (1988), who explored the
85for crystal size distributions. This research was then applied to numerous bubble size-
86frequency distribution studies (e.g., Sarda and Graham, 1990; Cashman and Mangan,
871994; Cashman et al., 1994; Blower et al., 2003). Early studies of vesicle distributions
88(e.g., Cashman and Mangan, 1994) were limited by their inability to characterize the
89full range of vesicle sizes because their methodology could not resolve the smallest
942D cross-sections into representative vesicle volumes (Mangan et al., 1993; Sahagian
95and Proussevitch, 1998; Higgins, 2000; Jerram and Cheadle, 2000); and development
97functions and their spatial characteristics (Morgan and Jerram, 2007; Proussevitch et
98al., 2007a). These difficulties have been partially addressed by improved statistical
4 4
100cannot be used to reconstruct a representative 3D vesicle distribution unless the
102known 3D distributions (Jerram et al., 1996; 2003; Jerram and Cheadle 2000;
104being used to directly generate 3D vesicle distributions (e.g., Gualda and Rivers,
1052006; Polacci et al., 2007; Proussevitch et al., 2007b), nested datasets containing
106multiple Scanning Electron Microscope (SEM) images remain the most common
107input for vesicle studies because of their superior spatial resolution relative to X-ray
108tomography. To facilitate the analysis of vesicles in SEM imagery, GIAS can be used
109to determine the geometric properties of vesicles within the plane of a given
112
114Clark and Evans (1954) proposed a simple test for spatial randomness in which the
117equivalent density. Following Clark and Evans (1954), re and expected standard error
1 0.26136
119 re ; e , [1; 2]
2 0 N 0
120where the input population density, ρ0, equals the number of objects (N) divided by
121the area (A) of the feature field (ρ0 = N / A). The following two test statistics (termed
ra ra re
123 R ; c . [3; 4]
re e
5 5
124 If a test population exhibits a Poisson random distribution, R should ideally
126test population may have a Poisson random distribution. If R > 1, then the test
127population exhibits greater than random NN spacing (i.e., tends towards a maximum
128packing arrangement), whereas if R < 1, then the NN distances in the test case are
129more closely spaced than expected within a random distribution and thus exhibit
131 To identify statistically significant departures from randomness at the 0.95 and
1320.99 confidence levels, |c| must exceed the critical values of 1.96 and 2.58,
133respectively (Clark and Evans, 1954); however, these critical values implicitly assume
134large sample populations (N > 104). Jerram et al. (1996) and Baloga et al. (2007) note
135that finite-sampling effects introduce biases into the variation of NN statistics. These
136biases become significant for small populations (N < 300) and thereby necessitate the
138 The NN methodology of Clark and Evans (1954) assumes that objects can be
143for R, which they term the random sphere distribution line (RSDL). With increasing N
144(i.e., decreasing porosity), the RSDL threshold increases from a value just above 1 to
145a maximum of 1.65 at 37% porosity, based on the experimental spherical packing
146model of Finney (1970). Jerram et al. (1996, 2003) also note that compaction,
147overgrowth, and sorting can introduce systematic variations in R relative to the RSDL
6 6
148and thus, when correctly identified, can provide information relating to the relative
152
154 require normalization of re, σe, skewness, and kurtosis values based on a user-
157 resources in their surroundings, which affects the kth nearest neighbor by
158 increasing re relative to the standard Poisson NN model. The order of the
159 scavenging model is given by the Poisson index, k, which identifies the
162 3. Logistic NN distributions apply the classical probability distribution for self-
163 limiting population growth due to the use of available resources. Under this
165 smaller skewness, but larger kurtosis compared to the Poisson NN model.
166
168skewness versus kurtosis values, allow for robust statistical tests of spatial
170method have not been automated and freely available for use.
171
1724. Method
7 7
1734.1. Input data and processing parameters
174GIAS is designed to process two input data types: images and tables of object
175coordinates. In the first instance, images may include rectangular raster files (e.g.,
176*.tiff, *.jpg, *.gif, etc.) encoded in 8-bit grey-scale formats. All inputs are assumed to
177have an orthogonal projection with equal x and y pixel dimensions. If the input image
178pixels are not square, their area can be uniquely specified. To appropriately scale the
179output results, the pixel size must be defined by the analyst for each input image.
181such that inputs are converted to a binary threshold logical matrix. GIAS assumes
182input grey-scale pixel values >250 are white; however, the upper and lower thresholds
183can be adjusted in the input options. Within input images, black pixels are assumed to
184be the objects of interest (e.g., vesicles) whereas white pixels are assumed to be part
185of the background (e.g., non-vesicles). An option is provided for image inversion.
186 Other than performing the binary conversion, GIAS does not modify input
188segment objects into feature classes. To facilitate image analysis and pre-processing
189of inputs to the NN module, GIAS supplies labeling information and metadata for
190each object in the image analysis output file; however, the level of image modification
191will depend upon the particular scientific investigation and the preferences of the
192analyst.
193 Inputs to the NN module can be passed from the image analysis module or
194loaded as a two column table (tab- or space- delimited) with x-coordinates in the first
195column and y-coordinates in the second column. GIAS automatically checks input
196files for duplicate entries and retains only unique coordinate pairs.
8 8
197 Slider bars control the detection thresholds for the minimum and maximum
198object areas, in addition to the distance threshold used within the Normalized Poisson
199tests. A check-box excludes objects that touch the periphery of the image from
201sectional properties (e.g., area, geometry, and orientation) cannot be calculated for
203area) are based on total image properties and are unaffected by peripheral object
204exclusion. Once the input options are satisfactorily set, the analyst simply presses the
206
208GIAS outputs its results to on-screen graphs and text boxes, which are presented in
209two tabbed panels. Statistics relating to the object area, geometry, and orientation are
210shown in the first panel, labeled “Image Analysis” (Figure 1). The frequency axis can
211be plotted in linear or log units with on-screen options for defining custom
213histograms, whereas object orientations are displayed using a rose plot. Object radii
214are estimated by calculating the radii of equal area circles. In addition to the graphical
215output, there are text-box summaries of the total number of objects, their abundance,
216minimum, maximum, mean, standard deviation (at 1 σ), skewness, and kurtosis based
218 Results of the spatial analyses are summarized in a second panel, labeled
219“Nearest Neighbor Analysis” (Figure 2). These results include histograms of the
221(Baloga et al., 2007). The R and c values are presented with sample-size-dependent
9 9
222thresholds for rejecting the null hypothesis (i.e., a given model of re) at significance
223levels of 1 and 2 σ. By default, the program assumes a Poisson random model for the
224expected spatial distribution; however, the user may also view results based on
229statistics, including the total number of objects, number within the convex hull, and
230NN distance statistics (minimum, maximum, mean, and standard deviation at 1 and 2
231σ). The convex hull is a polygon defined by the outermost points in a feature field
232such that each interior angle of the polygon measurers less than 180º (Graham, 1972).
233The convex hull thus forms a boundary around a feature field with the minimum
235 The input parameters and computed statistics from both tabbed panels can be
236output in separate user-named text files. The data are saved in a tab-delimited format,
237allowing the files to be viewed using a text editor or imported into a spreadsheet.
238
2394.3. Implementation
240GIAS is written and implemented in MATLAB with a Graphical User Interface (GUI)
241that is designed for use within the geological community. The program is available as
242preparsed pseudocode for use with MATLAB (provided both “Image Processing” and
244Windows and Unix systems without requiring MATLAB or its auxiliary toolboxes.
245These software products may be downloaded from the internet for free via
246http://www.geoanalysis.org.
10 10
247 The regionprops function within the MATLAB Image Processing toolbox
248calculates the following key properties of each object: area, centroid position,
250pixel positions within each vesicle. The pixel list is used to determine which objects
251are in contact with the image edge and the equal area circle diameters are used to
252calculate object radii. The eccentricity and orientation are calculated by fitting an
253ellipse about the object. The angle between the major axis of the ellipse and the x-axis
255 By default, the NN analysis utilizes centroid positions calculated in the image
258identify the vertices of the convex hull for all of the input objects. The function
259convhull is also used to calculate the area of the convex hull (Ahull) and to separate
260interior objects (Ni) from the hull vertices (see Appendix 1 for examples).
262each interior object (centroid) and all other objects within the dataset, including the
263vertices of the hull. The results for each interior point are sorted in ascending order to
264identify the minimum NN distance and calculate the actual mean NN distance (ra) by
265averaging the NN distances for all interior objects (Ni). The interior population density
266(ρi = Ni / Ahull), provides a statistical estimate of the true density within an infinite
267domain and thus substitutes for ρ0. We then calculate R as the ratio of ra-to-re.
269significance, we may define the following scenarios. If the values of R and c fall
270within ±2 σ of their expected values, then the results suggest that the model correctly
271describes the input distribution. If the value of c is outside the range of ±2 σ, then the
11 11
272inference is that the input distribution is inconsistent with the model, regardless of the
273value of R. If c is outside ±2 σ, and R is greater than the +2 σ threshold, then the input
275Conversely, if c is outside the ±2 σ range, and R less then than the -2 σ threshold, then
277outside ±2 σ, and c is within the ±2 σ range, then the test results are ambiguous,
278which generally indicates that there is insufficient data to perform the analysis and/or
282(2007) described how finite sampling would affect several NN models; however, they
283only implemented a solution for the Poisson NN case. In our application, we have
284automated the Poisson NN procedure, extended it to include the three other models of
285Baloga et al. (2007), and derived the k = 2 case for the scavenged NN distribution
286(Appendix 2). Automation of the NN method within GIAS enables users to rapidly
287determine if their data fulfills the size-dependent criteria of statistical significance for
288the following five spatial distribution models: (1) Poisson, (2) Normalized Poisson,
290these models.
292we generated simulations in MATLAB. Following the method of Baloga et al. (2007),
293confidence limits for the Poisson and the Scavenged k = 1 and 2 models were
294simulated using 1000 random draws from a modeled distribution. However, for the
295Normalized Poisson model, limits for the standard Poisson case are employed because
12 12
297intensive in real-time (e.g., requiring approximately four minutes to perform the
299 GIAS does not include the logistic density distribution described by Baloga et
300al. (2007) because its parameters (i.e., the logistic mean (m) and the standard
301deviation (b), see Table 2) are estimated using a best-fit to the input data. The logistic
302case therefore differs substantially from the other NN analyses because they involve a
304Additionally, the fit of a logistic curve to an input distribution does not necessarily
306distributions that better fits the data. Thus given the differences between the logistic
307density distribution described by Baloga et al. (2007) and the other Poisson NN
308models, we have omitted the logistic case from the geospatial analysis tools
311to recognize that the formulation of Baloga et al. (2007) is a scale-free distribution.
313participating in the scavenging process and not on the location of the objects.
314Consequently, the mean and standard deviation statistics are not directly related to the
317scenarios, with a user-defined scavenging radius, may be more appropriate for some
13 13
322separation of the Poisson and k = 1 Scavenged Poisson models, at the 95% confidence
323level, requires a minimum 50 to 60 objects for ρ0 values of 0.5 to 0.0005 objects / unit
324area, whereas 100 to 150 objects are required to distinguish between k = 1 and 2
325models.
329confidence intervals—at 90%, 95% and 99%—are plotted within GIAS to illustrate
330the expected ranges of skewness versus kurtosis for each model. Plots of skewness
331versus kurtosis are also useful for discriminating between populations with otherwise
332similar R and c values, such as ice-cored mounds (e.g., pingos) and volcanic rootless
334
3355. Results
336To demonstrate the utility of GIAS, we have applied the program to vesicle patterns
337within proximal magmatic tephra deposits from Fissure 3 of the 1783-1784 Laki cone
338row, Iceland. Tables 3 and 4 summarize the results of the “Image Analysis” and
340additional examples of geospatial distributions that are clustered and repelled relative
341to the Poisson NN model. These examples are based on the Hamilton et al. (2010a,
3422010b) and demonstrate how statistical NN analyses can be combined with field
345how the geospatial analysis tools that are provided within GIAS can be applied to
14 14
347different planetary environments, supply non-morphological evidence to discriminate
349systems.
350
353magmas, volatile exsolution dynamics, and conduit ascent processes—all of which are
355Quantification of these parameters relies largely upon the use of vesicle size, number,
356and volume distributions to establish links between natural volcanic samples and
357experimental data (Adams et al., 2006). Nested SEM images at several magnifications
360and analysis of a complete nested SEM dataset is outside the scope of this study, we
361present an example of how GIAS can be used to analyze vesicle properties within a
362single image. For this example, we have chosen a magmatic tephra sample (LAK-t-
36323c, Emma Passmore, unpublished data) from 1783-1784 Laki eruption in Iceland.
364The resolution of the image is 3.81 μm / pixel and we have followed the method of
365Gurioli et al. (2008) by using a minimum object area threshold of 20 pixels (76.2
366μm2) to avoid complications associated with poorly resolved objects near the limit of
368 The output of the "Image Analysis" module (Table 3 and Figure 1) reveals that
369the sample has a vesicularity of 65.48% and—excluding objects in contact with the
370periphery of the image—there are 537 objects larger than the detection threshold of
37176.2 μm2. Assuming the properties of an equal area circle, the vesicle diameters range
15 15
372from 19.21 μm to 2.85 × 103 μm, with a mean of 3.01 × 102 μm ±6.89 × 102 (at 1 σ).
373The ratio of equal area circle circumference to object perimeter is 0.77 ±1.53 (at 1 σ);
374however; for the 21 vesicles larger than 3.10 × 105 μm2 the ratio decreases to 0.55
375±0.21, which indicates that the largest vesicles strongly deviate from a circular
376geometry (see the graph of “Objects Perimeters”). The vesicles have a mean
377eccentricity of 0.71 ± 0.18 and a mean orientation trending 7.27° to 187.27° ±43.91°
378(n.b., 90° points towards the top of the image; see “Object Eccentricity” and “Object
380 The NN module results (Table 4 and Figure 2) show that NN distances
381between vesicle centroids range from 28.05 to 1.53 ×103 μm, with a mean (ra) of 1.75
382× 102 ±1.15 ×102 (see “Nearest Neighbor Frequency Distribution” in Figure 2). The
383sample has R and c values within the 2 σ thresholds of significance (Figure 2) and
384thus we conclude that the vesicle centroids exhibit a Poisson (random) distribution.
385We have also analyzed the sample using a Normalized Poisson threshold of 5 pixels
386(19.24 μm), which corresponds to the diameter of a circle that is equal in area to our
387object area threshold of 20 pixels. Given this threshold, the sample exhibits a repelled
388distribution with R and c values greater than their respective upper 2 σ limits. The
389Scavenged k = 1 and 2 cases overestimate re relative to ra, and thus result in clustered
391 The GIAS outputs suggest that vesicles within this sample exhibit an overall
392Poisson distribution despite the presence of a vesicle fabric trending 7.27° to 187.27°
395105 μm2 to strongly deviate from a circular geometry. Consequently, if the vesicles
16 16
397stereology, it would be necessary to acknowledge propagating errors associated with
400have applied a minimum object area threshold of 20 pixels, which excludes all
401vesicles <76.2 μm2 (i.e., 630 of 1167 objects). A detection threshold of 20 pixels is
405insufficient for filtering artifacts associated with the geometric properties of the
406smallest vesicles in the input image. For instance, eccentricity values calculated for
407vesicles within the Laki magmatic tephra sample appear to be very high (0.71 ± 0.18),
409large population of small vesicles. Resolution limitations also affect the Normalized
410Poisson NN distribution because the large number of objects below the detection
412distribution relative to the model. In instances where the majority of objects are below
413the detection limit of analysis, it is therefore imperative to use nested image datasets,
414and/or high magnification mosaics to resolve the properties of the smallest objects in
416
4176. Discussion
420Higgins (2000) and Morgan and Jerram (2007) have also applied 2D to 3D
421transformations for crystal shapes such as cubes and prisms based upon a large
17 17
422number of observed cross-sectional areas. Nevertheless, for irregular object
424single 2D image (Mock and Jerram, 2005; Sable et al., 2006). For vesicle studies,
427Gualda and Rivers, 2006; Polacci et al., 2007; Proussevitch et al., 2007b), but the
429sized vesicle walls, which can result in erroneously large estimates of vesicle
433et al., 2008). Consequently, even given the availability of 3D imaging techniques,
438provided (e.g., Jerram et al., 1996; 2003; Jerram and Cheadle, 2000). Instead, 2D
439methods characterize the aerial distribution of objects within the plane of the image.
441remember that object centroids are unlikely to coincide with the center of a vesicle in
4423D, especially when vesicles are irregularly shaped. To obtain a more meaningful
445the principal component directions of the vesicle fabric. Vertical offsets in the position
446of objects in orthometric aerial photographs and satellite imagery may similarly
18 18
447introduce discrepancies between projected object separations and true NN distances,
448but the vertical offsets are generally small relative to the separation of objects in the
450
4517. Conclusions
452We have developed Geological Image Analysis Software (GIAS) for calculating the
453geometric properties of objects and quantifying their spatial distribution. The program
455combining image processing tools for calculating object areas, abundance, radii,
457NN tests for (1) Poisson; (2) Normalized Poisson; (3) Scavenged, k = 1; (4) and
459in other software packages and, consequently, GIAS represents a novel approach to
464 Future work will focus on developing the following aspects of GIAS: (1)
4661999); (2) automated object recognition (e.g., Proussevitch and Sahagian, 2001;
467Lindquist et al., 1996; Shin et al, 2005); (3) geometric cluster analysis (e.g., Still et
468al., 2004); (4) statistical analyses to address truncated and multimodal populations
469(e.g., Proussevitch et al., 2007a); (5) processing nested image datasets (e.g., Gurioli et
470al., 2008); (6) incorporating stereological techniques for converting 2D data into 3D
471distributions and then using the 3D models to calculate vesicle number density (e.g.,
19 19
472Sahagian and Proussevitch, 1998; Proussevitch et al., 2007a); (7) developing NN
473analyses for 3D datasets (Jerram and Cheadle, 2000; Mock and Jerram, 2005); and (8)
475
476Acknowledgements
477We thank Alexander Proussevitch, Steve Baloga, Sarah Fagents, Bruce Houghton,
478Lucia Gurioli, and Mark Naylor for insightful discussions relating to vesicle
480geological thin-sections examined within this study; and Dougal Jerram and
481Guilherme Gualda for their thorough reviews, which have greatly improved this
483funding for software purchases. CDB was funded under NERC studentship award
485(RANNÍS) and the NASA Mars Data Analysis Program (MDAP) grant
487
488
489References
490Adams, N.K., Houghton, B.F., Fagents, S.A., Hildreth, W., 2006. The transition from
491explosive to effusive eruptive regime: The example of the 1912 Novarupta eruption,
492Alaska. Geological Society of America Bulletin 118, 620–634.
493doi:10.1130/B25768.1.
494
20 20
495Baloga, S.M., Glaze, L.S., Bruno, B.C., 2007. Nearest-neighbor analysis of small
498
499Bishop, M.A., 2008. Higher-order neighbor analysis of the Tartarus Colles cone
502
503Bleacher, J.E., Glaze, L.S., Greeley, R., Hauber E., Baloga, S.M., Sakimoto, S.E.H.,
504Williams, D.A., Glotch, T.D., 2009. Spatial and alignment analyses for a field of small
505volcanic vents south of Pavonis Mons and implications for the Tharsis province,
507doi:10.1016/j.jvolgeores.2009.04.008.
508
509Blower, J.D., Keating, J.P., Mader, H.M., Phillips, J.C., 2001. Inferring
512
513Blower, J.D., Keating, J.P., Mader, H.M., Phillips, J.C., 2003. The
516
517Bruno B.C., Fagents S.A., Thordarson T., Baloga S.M., Pilger E., 2004.
21 21
519nonrandom processes. Journal of Geophysical Research 109,
521
522Bruno, B.C., Fagents, S.A., Hamilton, C.W., Burr, D.M., Baloga, S.M.,
5261-16. doi:10.1029/2005JE002510.
527
528Burr, D.M., Bruno, B.C., Lanagan, P.D., Glaze, L.S., Jaeger, W.L., Soare, R.J., Wan
529Bun Tseung, J.M., Skinner, J.A., Baloga, S.M., 2009. Mesoscale raised-rim
531distributions of analogs for Mars. Planetary and Space Science 57, 579-596.
532
533Cashman, K.V., Mangan, M.T., 1994. Physical aspects of magmatic degassing: II.
537
540
541Clark, P.J., Evans, F.C., 1954. Distance to nearest neighbor as a measure of spatial
543
22 22
544Finney J., 1970. The geometry of random packing. Proceedings of the Royal Society
546
549357.
550
554Gurioli, L., Harris, A.J.L., Houghton, B.F., Polacci, M., Ripepe, M., 2008. Textural
555and geophysical characterization of explosive basaltic activity at Villarrica volcano,
556Journal of Geophysical Research 113, B08206, 116. doi:10.1029/2007JB005328.
557
560groups in the 1783-1784 Laki lava flow, Iceland. Bulletin of Volcanology, (in press)
561
562Hamilton, C.W., Fagents, S.A., Thordarson, T., 2010b, Explosive lavawater
563interactions II: selforganization processes among volcanic rootless eruption sites in
564the 17831784 Laki lava flow, Iceland. Bulletin of Volcanology, (in press).
565
23 23
566Higgins, M.D., 2000. Measurement of crystal size distributions. American
568
569Jerram, D.A., Cheadle, M.J., Hunter, R.H., Elliot, M.T., 1996. The spatial distribution
571
572Jerram, D. A., Cheadle, M. J., 2000. On the cluster analysis of rocks. American
574
575Jerram, D.A., Cheadle, M.J., Philpotts, A.R., 2003. Quantifying the building blocks of
576igneous rocks: are clustered crystal frameworks the foundation? Journal of Petrology
578
579Lautze, N.C., Houghton, B.F., 2005. Physical mingling of magma and complex
581425–428.
582
583Lindquist, W.B., Lee, S-M., Coker, D.A., Jones, K.W., Spanne, P., 1996. Medial axis
586
587Mangan, M.T., Cashman, K.V., 1996. The structure of basaltic scoria and reticulite
588and inferences for vesiculation, foam formation, and fragmentation in lava fountains.
590
24 24
591Mangan, M.T., Cashman, K.V., Newman S., 1993. Vesiculation of basaltic magma
593
594Mangan, M.T., Mastin, M., Sisson, T., 2004. Gas evolution in eruptive conduits:
597129, 23–36.
598
599Marsh, B.D., 1988. Crystal size distributions (CSD) in rocks and the
602
603Mock, A., Jerram, DA., 2005. Crystal size distributions (CSD) in three dimensions:
606
607Morgan, D., Jerram, D.A., 2006. On estimating crystal shape for crystal size
609
610Oh W., Lindquist W., 1999. Image thresholding by indicator Kriging. IEEE
612
613Polacci, M., Baker, D.R., Bai, L., Mancini L., 2007. Large vesicles record pathways
61510.1007/s00445-007-0184-8.
25 25
616
617Polacci, M., Baker, D.R., Mancini, L., Tromba, G., Zanini, F., 2006.
621doi:10.1029/2006GL026241.
622
623Polacci, M., Papale, P., 1997. The evolution of lava flows from
627
631
632Proussevitch, A., Sahagian, D., Carlson, W., 2007b. Statistical analysis of bubble and
635
636Proussevitch, A., Sahagian, D., 2001. Recognition and separation of discrete objects
638
26 26
641
645
646Sable, J., Houghton, B.F., Del Carlo, P., Coltelli, M., 2006. Changing conditions of
647magma ascent and fragmentation during the Etna 122 BC basaltic Plinian eruption:
649158, 333–354.
650
655Shin, H., Lindquist, W.B., Sahagian, D.L., Song, S-R., 2005. Analysis of the vesicular
657
658Stephenson, G., 1992. Mathematical Methods for Science Students, 2nd edn.,
660
661Still, S., Bialek, W., Bottou, L., 2004. Geometric clustering using the Information
662Bottleneck method. In: Thrun, S., Saul, L., and Schölkopf, B. (Eds.), Advances in
663Neural Information Processing Systems, 16, MIT Press, Cambridge, MA, pp. 1165-
6641172.
665
27 27
666Szramek, L., Gardner, J.E., Larsen, J., 2006. Degassing and microlite crystallization
667of basaltic andesite magma erupting at Arenal Volcano, Costa Rica. Journal of
669
670Wilkins, D.E., Ford, R.L., 2007. Nearest neighbor methods applied to dune field
671organization: The Coral Pink Sand Dunes, Kane County, Utah, USA. Geomorphology
67283, 48-57.
28 28
673Appendix 1: Additional nearest neighbor analysis examples
674Explosive interactions between lava and groundwater can generate volcanic rootless
675cones (VRCs). VRCs are well suited to nearest neighbor (NN) analyses because they
676occur in groups, which consist of a large number of landforms that share a common
678analysis tools that were used by Hamilton et al. (2010b) to investigate self-
679organization processes within VRC groups and quantitatively compare the spatial
680distribution of VRCs in the Laki lava flow in Iceland to analogous landforms in the
682 Figure 1 presents examples of rootless eruption sites within the Hnúta and
683Hrossatungur groups of the Laki lava flow. In the field, Differential Global
684Positioning System (DGPS) measurements were used to delimit rootless crater floors
685and their centroids were assumed to represent the surface projection of underlying
686rootless eruption sites (Hamilton et al., 2010a, 2010b). Hamilton et al. (2010a) used
688(Figure 1A) into chronologically and spatially distinct groups, domains, and
6901.1). Figures 1C and D show the convex hull boundaries of the complete study area
691and Subdomain 1.1 of the Hnúta Group, respectively. Figure 1 also shows the
692locations of the convex hull vertices (Nv) and interior rootless eruption sites (Ni).
694groups allowed Hamilton et al. (2010b) to examine the effects of scale and resource
696the Poisson NN model, rootless eruption sites in both the complete study area (Ni =
6972193) and Hnúta Subdomain 1.1 (Ni = 98) have |c| and |R| values that exceed their
29 29
698respective sample-size-dependent limits of confidence at 2 σ. Consequently, neither of
699these distributions are constant with a Poisson random model. Within the complete
700study area, R is 0.72, which is lower than the 2 σ confidence limit of R (0.98) and
701implies that the distribution is clustered with respect to the Poisson NN model. In
702Hnúta Subdomain 1.1, R is 1.23, which is above the upper 2 σ confidence limit of
7031.16 and implies that the distribution is repelled relative to a Poisson NN model.
704Hamilton et al. (2010b) interpret these patterns of spatial distribution within the
706regional scales, VRCs cluster in locations that contain sufficient lava and groundwater
707to initiate rootless eruptions, but on local scales rootless eruption sites tend to self-
711regions of lava inundation. In contrast, they attribute local repelling to the non-
715 Baloga et al. (2007) showed that rootless eruption sites in the Tartarus Colles
716Region of Mars have R = 1.22 and c = 6.14. These NN statistics are similar to those
717observed within Hnúta Subdomain 1.1 (R = 1.23 and c = 4.45), which suggests that a
718similar formation process in both environments. Thus the statistical NN analysis tools
719provided within GIAS can be combined with field observations and remote sensing to
722
30 30
723Appendix 1 Figure 1:
724A and B depict rootless eruption sites (red circles) superimposed on a 0.5 m/pixel
725color aerial photograph showing 1783-1784 Laki eruption lava flows in Iceland. A:
726Full extent of Hnúta and Hrossatungur groups and an inset (white rectangle) denoting
727Hnúta Subdomain 1.1 (shown in B). C and D correspond to identical regions shown in
728A and B, respectively, and provide examples of convex hull boundaries, rootless
729eruption sites that form convex hull vertices (Nv), and interior points (Ni).
730
731Appendix 1 Figure 1
31 31
732Appendix 2: Derivation for the Scavenged Nearest Neighbor (k = 2) case
733The probability of finding exactly k points within a circle of radius r on a plane can be
( 0 r 2 ) k 0 r 2
735 P(r , k ) e , [0]
k!
736where ρ0 is the spatial density—defined by the number of objects (N) divided by the
737area of the feature field (A). If the process of creating a point on the plane consumes
738(or “scavenges”), resources that would have formed the kth nearest neighbor, then the
741 [0]
746To find the mean of this distribution, we multiply by r, integrate from 0 to ∞, and
748
3 r r 5 e r dr
2
749 r 0
. [0]
r
r 2
5
e dr
0
750
751 The integrals can be solved using integration by parts and two mathematical
752identities for the odd and even terms of r (e.g., Stevenson, 1992). For the even terms,
754
32 32
755
1.3.5 (2n 1)
r
2
756 2n
e r dr , [0]
2 n 1 n 2
1
r
2
758
2 n 1
e r dr [0]
0
759can be solved using the substitution x2 = u and solving using integration by parts,
760e.g.,
1
r e dr ue
2
3 r u
761 2
du . [0]
0 0
762This leads to the following solution for the mean distribution of the nearest neighbour
763distances for k = 2:
1 3 5
7
2 15 0 15
4 3 3
764 2 . [0]
r
1 3 7 7 1 16 0
16 0 2 2
r
2
2
r 2 r 3 e r dr
3
766 r2 0
. [0]
0
r e dr
2
5 r
2
3 15 0.27572
768 2 . [0]
0 16 0 0
0.27572
770 e , [0]
N 0
33 33
771where N is the number of points in the area of interest. The third and fourth moments
773
r
2
3
r 5 e r dr
105
774 r3 0
3 ; [0]
32 0 2
r
r 2
5
e dr
0
r
2
4
r 5 e r dr
12
775 r4 0
. [14]
( 0 ) 2
r e dr
2
5 r
778 [15]
779 we obtain:
0.006663 0.0174838
780 skew 3 ; kurt . [16; 17]
4 0
2
0
3 2
34 34
781Appendix 3: Separation of population means
783significant separation of two Gaussian distributions, where the expected means and
784standard deviations are known, a commonly-used test is to examine if zero lies within
786
787 P s 0 s1 1.96 0 1 0 1 s 0 s 1 1.96 0 1 0.95 , [1]
N N N N
788where s0 and s1 are the sample means, μ0 and μ1 are the expected means and σ0 and σ1
789are the expected standard deviations. N can be varied to estimate the number of
791interval, then there is no discernible difference between the means of the distributions
793 This method is applied to simulate two different means based upon the
796 Simulation from two normal distributions with increasing numbers of N was
800
ρ0 Minimum N required to separate
(objects / unit area) Poisson NN and Scavenged NN (k = 1)
0.5 50
0.05 60
0.005 60
0.0005 60
801
35 35
ρ0 Minimum N required to separate
(objects / unit area) Poisson NN and Scavenged NN (k = 2)
0.5 100
0.05 120
0.005 130
0.0005 150
803
804 Given that these simulations assume Normal distributions, it would be wise to
805regard them as indicative of the minimum number of data points required to separate
806the competing models. A much larger number of samples should be used separate the
807Scavenged NN, k = 1 and k = 2 models because their expected means are relatively
808close together.
36 36
809Electronic Supplements
810(1) MATLAB preparsed pseudo code and standalone MATLAB executable file,
813
815Tables
816Table 1: Notation.
817
819
821
823
824Figures
825Figure 1: Image analysis program in “Image Analysis” mode, with maximum object
826areas truncated to ~104 pixels for visualization purposes which excludes very large
827(and least circular) vesicles from displayed plots and histograms. Note that object
829
830Figure 2: Image analysis program in “Nearest Neighbor Analysis” mode with outputs
37 37
832Table 1.
Variable Definition
A Area of a feature field
Ahull Area of the convex hull
C Statistic for comparing observed (actual) to expected mean NN distances
k Poisson index
N Number of features within a sample population
Ni Number of features within the convex hull
Nv Number of features forming the vertices of the convex hull
NN Abbreviation for nearest neighbor
P Probability
R Statistic for comparing actual (ra) to expected (re) mean NN distances
R Radial distance
ro Threshold distance used to calculate Normalized Poisson NN distributions
ra Mean NN distance observed in the data
re Mean NN distance calculated from a given NN model (e.g., Poisson)
P Probability density
ρ0 Population (spatial) density of the input features (ρ0 = N / A)
ρi Population (spatial) density of the input features (ρi = Ni / Ahull)
Σ Standard deviation
σe Standard error
833
38 38
834Table 2.
835
Statistic Poisson Scavenge k = 1 Scavenge k = 2 Normalized NN Logistic
Probability 2 0 re 0r
2 2
2(0 ) 2 r 3e 0r 2(0 ) 3 r 5 e 0r
2 2 2
e( r m) b
Density (P)
20 re 0 ( r0 r )
b[1 e ( r m ) b ]2
Mean (re) 1 3 15 2 0 m
r e 0 dr
2
2 r
2 0 4 0 16 0 2
e 0 r0 r0
Mode (rmode) 1 3 5 1 m
2 0 2 0 2 0 2 0
Standard Error 0.26136 0.272249 0.27572 r02 r 2 1 0 N b
(σe) N 0 N 0 N 0 3
Skewness ( 3) 0.008187 0.0066638 1 2 3
r (r0 3r ) r
2r 2
0
3 0
3 3 3 20
4 3 0 2 30 2 30 2
39 39
836Table 3.
40 40
837Table 4.
Poisson model
Expected mean NN distance (re) 1.79 ×102
R with thresholds at 2 σ 0.98 (0.97 and 1.07)
c with thresholds at 2 σ -0.91 (-1.33 and 2.83)
Conclusions (c value) Model supported (<2 σ)
41 41
839Figure 1
840
42 42
841Figure 2
842
43 43