You are on page 1of 11

ARTICLE IN PRESS

Computers & Geosciences 30 (2004) 21–31

AgeDisplay: an EXCEL workbook to evaluate and display


univariate geochronological data using binned frequency
histograms and probability density distributions$
Keith N. Sircombe*
Department of Geology and Geophysics, Tectonics Special Research Centre, School of Earth and Geographical Sciences, M.004,
University of Western Australia, 35 Stirling Highway, Crawley WA 6009, Australia
Received 17 January 2003; received in revised form 15 September 2003; accepted 16 September 2003

Abstract

Univariate diagrams such as binned frequency histograms and probability density distributions are often used for the
initial assessment and communication of geochronological data. Both diagram types are estimates of the sample
distribution and both have inherent limitations that are not widely appreciated. Binned frequency histograms are
effective at conveying frequency information, but analytical error is discarded and appearance is vulnerable to bias due
to arbitrary decisions about bin width. A method for assessing the efficiency of bin widths is presented. Probability
density distributions use a variable Gaussian kernel method that accounts for analytical error of individual datum.
While providing standardization of the display, these diagrams are limited by the lack of visually accessible frequency
information. An approach combining elements of both histograms and probability density distributions is proposed.
All methods are applied in an Excel workbook and the procedures for using this are explained.
r 2003 Elsevier Ltd. All rights reserved.

Keywords: Geochronology; Data display; Histogram; Probability density distribution

1. Introduction precision and display the geochronological data in


terms of univariate absolute ages calculated from an
Uranium–lead isotopic data are typically displayed isotopic ratio.
using either a Wetherill- or Tera-Wasserburg-style The most common univariate displays are the binned
bivariate concordia plot. These concordia plots frequency histogram and probability density distribu-
convey information about the analytical process: sample tion. The use of such diagrams is most common in
size, accuracy (relationship to concordia and repeat sedimentary provenance studies involving relatively
analyses) and precision (analytical error displayed large sets of age data (e.g. Morton et al., 1996; Pell
as ellipses or equivalent). However, as sample size et al., 1997; Rainbird et al., 1997; Sircombe, 1999;
increases, concordia diagrams become visually Fergusson et al., 2001), but it is also appropriate for the
cluttered, making graphical assessment of modes, ranges analysis of metamorphic and/or igneous rocks where the
and proportions difficult. Therefore, it is often advanta- display of complex age patterns is required (e.g. Harley
geous to filter the data based on accuracy and/or and Black, 1997). Although widely used, the limitations
of these displays are often not appreciated and one aim
$
Code on server at http://www.iamg.org/CDEditor.index.
of this paper is to outline the mathematical basis of these
htm methods.
*Tel.: +61-8-9830-7871; fax: +61-8-9380-7848. This paper builds on previous reviews (e.g. Sircombe,
E-mail address: ksircombe@tsrc.uwa.edu.au 2000) and includes an EXCEL workbook, AgeDisplay,
(K.N. Sircombe). that processes a given set of age data.

0098-3004/$ - see front matter r 2003 Elsevier Ltd. All rights reserved.
doi:10.1016/j.cageo.2003.09.006
ARTICLE IN PRESS
22 K.N. Sircombe / Computers & Geosciences 30 (2004) 21–31

2. Filtering bivariate isotopic data to provide univariate


age data

The display of univariate age data means the visual


information regarding accuracy in a concordia diagram
is lost. Therefore, prior to generating diagrams of age
data, it is usual for the data to be systematically filtered
for accuracy, particularly in sensitive high-resolution ion
microprobe (SHRIMP) studies. Concordance is defined
as the 206Pb/238U value ratioed to the 207Pb/206Pb value
and a typical constraint may be between 95% and
105%. For material younger than B1500 Ma, the Fig. 1. Example of two age estimates of same mean, but
207
Pb-235U ratio and thus the 207Pb-206Pb ratio become different variance, representing age data cut by a histogram bin.
increasingly imprecise due to poorer counting statistics. Depending on analytical error inherent in age data and
In those instances, the concordance constraint and the histogram bin width, bin may only represent a proportion of
age estimate.
accuracy filtering has to be judged on a case-by-case
basis.
Filtering of data raises an important issue for both histogram, the estimate is
binned frequency histograms and probability density
# ¼ nj ; ðxA þ jhpxi oxA þ ðj þ 1ÞhÞ;
fðxÞ ð1Þ
distributions because the process can remove discordant
results that could contain significant geological and
where nj is the number of age data, xi ; within the jth bin,
analytical information. Care should be taken to be clear
ðx0 þ jh; x0 þ ðj þ 1ÞhÞ: The value nj can then be divided
about what sort of age data are being displayed and
by the total count N to provide a relative frequency
why. In some cases, it may be beneficial to include both
histogram.
concordant and discordant results within the age display
While the histogram display is straightforward and
diagram (e.g. Roback and Walker, 1995, Fig. 8; Morton
widely understood, there are two critical limitations in
et al., 1996, Fig. 3) to illustrate that discordant data
applying binned frequency histograms to absolute age
exists and has been considered. These decisions depend
data: discarding age errors and arbitrary bin size. These
on the purpose of data display. At the very minimum,
are discussed below.
the filtering method and the type of data displayed
should be clearly stated.
3.2. Age error limitation

3. Binned frequency histograms The principal limitation with binned frequency


histograms is that the analytical error inherent in any
3.1. Mathematical definition geochronological analysis is discarded. The sample
distribution seen in the histogram is therefore estimated
The traditional method for displaying univariate age on the basis of the absolute age values alone. If an age
data is a binned frequency histogram. Modes, ranges data histogram is the principal method for communicat-
and proportions displayed in a binned frequency ing results, discarding the errors may potentially lead to
histogram may relate to the timing, duration and misinterpretation. For instance, as shown in Fig. 1, two
relative significance of geologic events. Binned frequency analyses have the same age value of 2015 Ma, but one
histograms provide an important step in the graphical has an analytical error of 20 Myr and the other 2 Myr
analysis and communication of age data, therefore it is (both at 1 s.e.). If the bin limits are 2000 and 2020 Ma,
equally important that the analyst be aware of the assuming a Gaussian distribution for the age estimate,1
definitions and inherent limitations of the method. only 29.02% of the distribution defined by the first
With a set of age data, typically measured in Ma, analysis (2015720 Ma) lies within the bin limits. In
{x1 ; y; xN }, a histogram is generated by dividing the
1
age range of interest, anchored at an origin x0 ; into a set For the purposes of discussion here, the distribution
of K equal sized bins each with a bin width h; defined by associated with an individual age calculation will be referred
to as the age estimate, and defined as a Gaussian distribution
the bin limits {x0 þ jh; x0 þ ðj þ 1Þh} for positive integer
defined by a mean xi (calculated age) and standard deviation ei
values of j up to K (Silverman, 1986; Scott, 1992; (analytical error). In the strictest sense, not all absolute age
Simonoff and Udina, 1997). An important mathematical calculations based on U–Pb isotopic measurements are
definition regarding histograms, is that a histogram is an symmetrical Gaussian distributions. For the sake of discussion
estimate of the distribution of a sample of univariate here, it is assumed that most calculations will be based on such
data or f ðxÞ: In the simplest case, the frequency distributions.
ARTICLE IN PRESS
K.N. Sircombe / Computers & Geosciences 30 (2004) 21–31 23

effect, although the mean of the age estimate distribu- bin width of 20 Myr is needed for 50% efficiency and
tion is 2015 Ma and that value lies within the bin limits over 75 Myr for 90% efficiency. Table 1 lists further bin
of 2000 and 2020 Ma, there is a 70.98% probability that widths required for 50% and 90% efficiency levels
the ‘‘true’’ value of the age lies outside the bin limits. In depending upon mean error.
comparison, 99.38% of the age estimate defined by the
second analysis (201572 Ma) lies within the bin limits,
so the bin may be considered representative. However, if 3.3. Bin size limitation
the bin width was the same, but the limits were, for
example, 1996 and 2016 Ma, then even the relatively The illustration of a histogram bin’s efficiency at
more precise second analysis would have 30.85% of its capturing age estimates also highlights the second
age estimate outside the bin limits. limitation of histogram use, i.e. the size and location
The ‘‘efficiency’’ of how well each age estimate is of the bins themselves. A histograms appearance, and
captured by a particular bin can be derived by thus its potential interpretation, is a balance between too
calculating the proportion of the Gaussian distribution much detail with narrow bin widths (undersmoothing)
within the bounds of the bin: and too little detail with wide bin widths (oversmooth-
Z zU ing). In a variety of published cases (Table 2) the choice
1 i 2=2
of bin width ranges from 5 to 100 Myr. Using SHRIMP
Ei ¼ pffiffiffiffiffiffi expx dx; ð2Þ
2p zLi derived data, Morton et al. (1996, p. 917) defined bin
where the bounds of bin j about xi are given by
ðx0 þ jhÞ  xi ðx0 þ ðj þ 1ÞhÞ  xi
zLi ¼ ; zU
i ¼ ; ð3Þ Table 1
ei ei Bin widths (in Myr) required to reach 50% and 90% efficiency
where ei is the standard deviation of each Gaussian in a set of age data with various mean errors based on empirical
distribution. analysis described in text
The mean of the individual efficiency values is defined Mean error (Myr) Bin width required for
as a proxy for the efficiency of a particular bin width at
representing the age data. Fig. 2 illustrates the relation- >50% Efficiency >90% Efficiency
ship between histogram efficiency and bin width for a 1 2 10
variety of age data, both randomly generated with given 2 5 20
mean errors and real data from the Slave Province 5 10 40
(Sircombe et al., 2001). For data with B1 Myr errors 10 15 100
(more typical of thermal ionization mass spectrometry) a 15 20 120
bin width of 10 Myr is sufficient to capture 90% of the 20 30 160
age estimates. For the Dwyer Lake and George Lake 30 45 220
samples, representing typical SHRIMP zircon data, a 50 75 360

Fig. 2. Relationship between bin width and bin width efficiency (mean proportion of age estimates within same bin as age) for a variety
of real and simulated age data with mean errors ranging from 1 to 30 Myr.
ARTICLE IN PRESS
24 K.N. Sircombe / Computers & Geosciences 30 (2004) 21–31

Table 2
Examples of bin width and other details of binned frequency histogram displays from a variety of references

Bin width (Myr) Range (Myr) Bin width as % range Mean error (Myr) Efficiency (E) Reference
a
5 70 7.14 2 70% Davis et al. (1994), Fig. 8B
20 3400 0.588 5a 80% Gehrels and Dickinson (1995), Fig. 6
25 4000 0.625 20b 60% Morton et al. (1996), Fig. 3
33.33 1900 1.75 50c 30% Scott and Gauthier (1996), Fig. 4
100 2800 3.57 3a 98% Roback and Walker (1995), Fig. 8d

Mean error and efficiency (E; explained in text) calculated for a subset of presented data and intended for only for broad indicative
purposes.
a
Thermal ionization mass-spectrometer analyses.
b
SHRIMP analyses.
c
Laser-ablation microprobe inductively coupled mass spectrometer analyses.
d
Roback and Walker (1995) histograms compiled from data in Ross et al. (1991, 1992).

widths of 25 Myr on the basis of the precision of the


(a)
analyses, although no explanation is given of how this
was done.
The subtle variation in appearance and interpretation
for a variety of bin widths is illustrated in Fig. 3.
Although all the histograms show the age distribution is
generally dispersive with two prominent modes (B2950 (b)
and B3150 Ma), the relative prominence of the two
modes change; this could alter interpretations about the
relative importance of these modes.
There are a number of methods for calculating the
optimal bin width based on the data itself. Doane (1985)
proposed a modification of the earlier Sturges (1926) (c)
method:
maxfx1 ; y; xN g  minfx1 ; y; xN g
h# ¼ ; ð4Þ
1 þ log2 N þ Ke
where h# is the estimate of the optimal bin width and N is
the sample size. Ke is the extra classes proposed by (d)
Doane (1985) to account for potential skewness in the
data:
pffiffiffiffiffi !
b1
Ke ¼ log2 1 þ pffiffiffiffiffi ; ð5Þ
s b1
pffiffiffiffiffi (e)
where b1 is a measure of skewness:
pffiffiffiffiffi PN
i¼1 ðxi  xÞ % 3
b1 ¼ h i3=2 ð6Þ
PN
i¼1 ðxi  xÞ %2
pffiffiffiffiffi
and s b1 is defined by the sample size:
sffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
pffiffiffiffiffi 6ðN  2Þ Fig. 3. (a–e) A series of histograms based on the same data
s b1 ¼ : ð7Þ
ðN þ 1ÞðN þ 3Þ (Dwyer Lake quartzite, 207Pb/206Pb ages, 95–105% concordant,
n ¼ 68; Sircombe et al., 2001) illustrating visual and potential
Many statistical packages use this rule, or a modifica- interpretation differences due to changes in bin width.
tion of it, to calculate the default bin width for a set of E=efficiency of bin width at capturing individual age estimates
data (Wand, 1996). Recognizing that Sturges’s rule and as described in text.
modifications may lead to oversmoothing, more recent
formulations include Scott (1979) normal reference rule: where s# is an estimate of the standard deviation;
typically, the minimum of either the sample standard
h# ¼ 3:49 sN
# 1=3 ; ð8Þ deviation or the inter-quartile range (Wand, 1996). For
ARTICLE IN PRESS
K.N. Sircombe / Computers & Geosciences 30 (2004) 21–31 25

Fig. 4. Example of individual and accumulated age estimates. (a) Individual age estimate with a small error, (b) age estimate with a
large error, (c) accumulated density distribution with six contributing age estimates in two modes. Although it is area beneath the curve
that is important (see text) probability distribution can be read as follows: probability of an age of 255 Ma within distribution is
0.015% or 1.5%.

the Dwyer Lake sample data discussed here, Eqs. (4) and probability density distribution (PDD) diagram (e.g.
(8) yields optimal bin widths of 136 and 210 Myr. In Fig. 3; Dodson et al., 1988).2 Technically, the age PDD
comparison, another example set of detrital zircon age is another estimate of the ‘‘true’’ sample distribution.
data (George Lake metagreywacke) yields optimal bin The PDD produces a density estimate of the sample
widths of 36 and 22 Myr respectively. This illustrates distribution using a Gaussian kernel (Silverman, 1986)
that such optimal binning methods do not necessarily that varies with each individual age estimate. The shape
produce a standard bin width for easy visual comparison of these age estimates, and thus the kernel, will vary
between different sets of data. Critically, because these from a narrow, tall distribution if the error is small
calculations assume an underlying Gaussian distribution (Fig. 4a), to a wide shallow distribution if the error is
in the sample data, they may not be applicable to sets of large (Fig. 4b). These individual distributions are
data that have non-Gaussian distributions (Scott, 1979, summed together to form the age PDD function, f ðtÞ;
1992). Detrital zircon analyses are typically widely for the sample being examined (e.g. Fig. 4c) using the
dispersed and the optimal bin width calculations may following formula:
only be relevant for zircon analyses producing single age
modes. X
N
1 2 2
f ðtÞ ¼ pffiffiffiffiffiffi expðtxi Þ =2ei ; ð9Þ
Finally, the selection of an origin in a binned i¼1 ei 2p
histogram display can also significantly alter the
appearance and potential interpretation of the data where xi is the ith age measurement and ei is the ith
(Simonoff and Udina, 1997). This effect should be analytical error, t is the age and N is the sample size. In
minimal for age data because, for the general aesthetic practice, the distribution function can be approximated
sense of ‘rounded’ limits proposed by Doane (1985), the by assessing the value of f ðtÞ in fixed increments
origin should either be 0 Ma or an integer multiple of the (typically 1 Myr) across a range that encompasses the
bin width. For example, with a bin width of 25 Myr and required data. Because geochronological results are
minimum value at 2037 Ma, the histogram origin would typically reported to a round Ma value, standard
not be 2036 Ma, rather it would be 2000 or 2025 Ma. increments of 1 Myr are recommended as a suitable
Any selection of a less orthodox origin would require estimation for the distribution function. In some cases,
detailed justification in terms of potential instability in for instance younger ages, the size of these increments
the appearance of the histogram (Simonoff and Udina, may be reduced further to ensure a smooth appearance.
1997). For the distribution to be a true probability distribution

2
The original probability plots applied to detrital zircon age
data were presented in Dodson et al. (1988) and were the
4. Probability density distributions
product of a technique and program (‘‘Nouveau Stats’’)
developed by Dr. P. Zeitler then at the Research School of
4.1. Mathematical definition Earth Sciences of the Australian National University (I.S.
Williams, written comment). The application of the method to
A graphical approach that attempts to address the 40
Ar–39Ar data had been demonstrated earlier by Jessberger
limitations of binned frequency histograms is the et al. (1980).
ARTICLE IN PRESS
26 K.N. Sircombe / Computers & Geosciences 30 (2004) 21–31

it must be scaled so that the cumulative total sums to proportion information. The height of the curve in a
one, i.e. the distribution function Eq. (9) should be PDD diagram is both a function of quantity and
divided by N: This approach also ensures that the precision rather than simply quantity, i.e. a precise
diagram is standardized for comparative purposes. It is analysis will have a tall peak (e.g. Fig. 4a) that may
recommended that the probability scale on the y-axis is compare, height-wise, with a peak of accumulated less
retained to allow meaningful comparison between sets of precise analyses (e.g. Fig. 4c). Area is not an easily
data. The number of individual analyses contributing to recognized attribute of a diagram and thus frequency
the distribution should also be clearly indicated on the information may be lost to the observer. For instance in
diagram. Fig. 4c, the distribution contains six individual analyses,
three in each mode, but the left-hand peak is higher
4.2. PDD application suggesting that it is the dominant mode.

The age PDD diagram counters the two limitations of


the binned frequency histogram. Firstly, the individual 4.4. Heterogeneity calculation
age errors are used in calculating the PDD and a visual
assessment of the relative precision of the set of data can The standardized PDD diagram also provides the
be made. Secondly, by standardizing the sample basis for calculating the homogeneity/heterogeneity of
distribution in increments of 1 Myr, the potential for the age distribution. A sample strongly dominated by a
altering the appearance of the diagram as bin width single mode would have a low age distribution hetero-
varies is eliminated. geneity whereas a sample with several modes would have
PDD diagrams are good at displaying the modality of a relatively higher age distribution heterogeneity.
a set of age data, as revealed by the peaks of the Heterogeneity can be quantified using the information
distribution curve. Nevertheless, care must be taken in function (Shannon and Weaver, 1949; Pelto, 1954):
drawing geological interpretations of the peaks. Over- X
lapping modes may slightly skew the location of H¼ pi ln pi ; ð10Þ
neighboring peaks, and a mode with few datum may i
be visually indistinct near a larger mode (the same
limitations of visual interpretation also apply to binned where H is the absolute value of heterogeneity. pi is the
frequency histograms). proportion of the ith component within the total, for
Defining and assessing the significance of modes instance the probability density value calculated for a
within multimodal density distributions is a sophisti- given age when the total distribution is normalized to
cated process that is beyond the scope of this paper and one. To obtain a widely comparable value a relative
will only be discussed briefly. For the purposes of initial heterogeneity, Hr ; can be calculated by (Pelto, 1954;
data analysis, modes in an age PDD can be defined in Smosna et al., 1999):
terms of the function derivatives as those points where

f 0 ðtÞ ¼ 0 and f 00 ðtÞo0 (Scott, 1992). This approach will H
identify local maxima (peaks in the PDD), but not Hr ¼ 100 ; ð11Þ
Hmax
components seen as shoulders on larger peaks. For
objectively deriving modal ages from a set of age data,
where Hmax is the maximum value of H for the data set
the analyst should use a deconvolution methodology,
if all components occur in equal proportion, thus if n is
such as the maximum-likelihood mixture modeling of
the number of components:
Sambridge and Compston (1994). Because of the
mathematical basis of this approach, it avoids any
potential bias in visual inspection alone, although these Hmax ¼ nðð1=nÞlnð1=nÞÞ: ð12Þ
calculations often define components that closely match
the prominent peaks seen in the PDD. In sandstone composition applications (e.g. Smosna
et al., 1999), the number of components is typically
4.3. PDD limitation around 10 and the value of Hmax would be 2.30. It is
proposed here, that in order to standardize the assess-
A major limitation of PDD diagrams is the loss of ment of detrital zircon age distributions, that the total
frequency information that can be seen in traditional possible age range be subdivided into 1 million year
binned frequency histograms. In the latter, frequency divisions, hence making 4500 divisions and a Hmax
information is conveyed by the height of the columns, of 8.41.
but this paradigm does not follow for PDD diagrams. Based on this premise, Fig. 5 illustrates two distribu-
Because PDD diagrams illustrate probability density it is tions—one strongly unimodal and the other polymo-
the area beneath the curve that conveys frequency and dal—and their respective relative heterogeneity values.
ARTICLE IN PRESS
K.N. Sircombe / Computers & Geosciences 30 (2004) 21–31 27

(a)

(b)

Fig. 5. Illustration of relative heterogeneity values. (a) Strongly unimodal sample, (b) polymodal sample.

5. Combined display

The simplest approach to the limitations discussed


above is to overlay a PDD diagram with an equivalent
binned frequency histogram of the same set of age data
(e.g. Fergusson et al., 2001; Nutman, 2001). The overlay
approach can also be extended to include discordant
analyses to allow comparison and validation with the
accepted data (e.g. Sircombe and Stern, 2002; DeGraaff-
Surpless et al., 2002). In practice this is achieved by
combining all data in a PDD and scaling the distribution
relative to the smaller, concordant only, distribution. An
example of this approach is illustrated in Fig. 6 using the
Dwyer Lake quartzite data. Fig. 6. Illustration of a Combined Display of histograms and
PDD diagrams using Dwyer Lake quartzite data (Sircombe
et al., 2001). Darker shaded PDD indicates age distribution of
6. AGEDISPLAY implementation concordance-filtered data at 95–105%. Histogram data also
filtered at this level. Lighter shaded PDD includes all data.
The AgeDisplay workbook contains five worksheets
* Data Entry. for output. The workbook is set up to handle up to 500
* Histogram Efficiency. age estimates and produce histograms and PDD with up
* Regular Histogram. to 5000 points.
* Probability Density Dist.
* Combined display. 6.1. Data entry worksheet

Calculations and chart production are a combination Age data is entered in the Data Entry worksheet as
of automatic and macro-based procedures. User input age, error and concordance (Fig. 7). A spot/analysis
cells are indicated by a white background and a yellow identification is optional. Error must be 1 s.e. The
background indicates unalterable cells where the appli- sample name can be added, and will be as the basis for
cation has made calculations. Newly generated charts chart titles and output filenames. Data is filtered at two
will replace previous charts on the same worksheet, and, stages. Firstly, the data will be filtered by the con-
after being produced, charts can be altered, if required, cordance value according to the value given in the Filter
ARTICLE IN PRESS
28 K.N. Sircombe / Computers & Geosciences 30 (2004) 21–31

Fig. 7. Screen shot of Data Entry worksheet illustrating areas for entering age data and errors along with automatic calculations.

cell. For instance, entering 90 will exclude all data with range displayed on the chart), end (Ma, end of the range
concordance below 90 or above 110 from further displayed on the chart) and binwidth (Myr). The Create
processing. The data is also filtered via the fifth data Histogram button is linked to a macro that uses the user-
column headed Use? Any character entered in this input values and the concordant age estimates in the
column will ensure the age data is used in further Data Entry worksheet to produce a binned frequency
calculations—assuming it passes the concordance filter. histogram chart.
This provides a means for the user to concentrate The validity of the user input values are checked prior
processing on data of interest without deleting to calculation. If any age estimates are either below the
information. origin or above the end of the user-specified range these
This worksheet automatically calculates parameters will be flagged immediately below the input cells. It is at
such as mean age and mean error, along with optimal the user’s discretion whether this indicates that the
bin width as calculated by the Doane (1985) and Scott specified range is not adequate for the data being
(1979) methods (Eqs. (8) and (4) respectively). The charted. The user-input values will also be overwritten
worksheet also contains a button Create Report linked to the Histogram Efficiency worksheet and an efficiency
to a macro that will create a copy of the workbook value calculated.
without the attached objects, formulae and macros. This
can be used when the user is satisfied with the results of
the analysis and wishes to capture a final version. 6.4. Probability Density Dist.

The Probability Density Dist. worksheet produces a


6.2. Histogram efficiency
probability density distribution chart based on the
discussion in Section 4 (Fig. 9). The user inputs origin
The Histogram Efficiency worksheet contains formu-
(Ma), end (Ma) and increment ( Myr). The increment is
lae to automatically calculate the efficiency of a
typically in 1 Myr steps, but this can be altered if
histogram at capturing the concordant age estimates
required. The Create PDD button is linked to a macro
provided in the Data Entry worksheet (based on
that performs the calculations for the PDD based on
Eq. (2)). The user enters the origin (Ma) and binwidth
concordant age estimate data and user input (including
(Myr) of a histogram and the mean efficiency is
concordance and usage filtering) in the Data Entry
automatically calculated. This allows exploration of
worksheet.
various bin widths to find a satisfactory setting for the
The resulting chart is scaled so the total area
data at the user’s discretion.
under the curve is one, thus representing a true
probability distribution. The number of age estimates
6.3. Regular histogram represented in the chart is also added to the chart
title as ‘‘n ¼ ’’ along with indication of concordance
The user can enter three values into the Regular filtering. If any part of the calculated distribution
Histogram worksheet (Fig. 8): origin (Ma, start of the is beyond the user-specified range, this will be
ARTICLE IN PRESS
K.N. Sircombe / Computers & Geosciences 30 (2004) 21–31 29

Fig. 8. Screen shot of Regular Histogram worksheet illustrating user-input cells for origin, end and bin-width, Create Histogram
button and automatic calculation of histogram efficiency.

Fig. 9. Screen shot of Probability Density Dist. worksheet illustrating user inputs origin, end and increment. Create PDD and Create
DXF buttons also shown.

flagged immediately below the user input cells. It is 6.5. Combined display
at the user’s discretion whether this indicates that
the specified range is not adequate for the data The Combined Display worksheet produces a display
being charted. The macro also calculates the local combining two PDDs of all data and concordance-
maxima of the distribution following the approach of filtered data along with a histogram of concordance-
Scott (1992) and outputs these results beside the filtered data (Fig. 10). The user inputs origin (Ma), end
distribution results. (Ma), increment (Myr), bin width (Myr) and level of
The worksheet also has a Create DXF button linked concordance filtering (%). The increment is typically in
to a macro that produces a rudimentary DXF format file 1 Myr steps, but this can be altered if required. Data is
of the PDD along with simple lines representing the x- copied from the Data Entry worksheet, but the user does
and y-axis based on the scale of the chart. This DXF file have the option of altering the Use? Column as required.
can be imported into a variety of graphing/drawing The Create combined diagram button is linked to a
packages. Because the PDD is represented as a single macro that performs the calculations based on user
and continuous curve, it can help avoid some of the input and concordant age estimate data in the Data
problems associated with directly cutting and pasting Entry worksheet. The left y-axis of the combined chart
the Excel chart. records the probability value of the PDDs and the right
ARTICLE IN PRESS
30 K.N. Sircombe / Computers & Geosciences 30 (2004) 21–31

Fig. 10. Screen shot of Combined Display worksheet illustrating user inputs: origin, end, increment and bin width. Create combined
diagram and Create DXF buttons also shown.

y-axis records the frequency value of the histogram. The Dodson, M.H., Compston, W., Williams, I.S., Wilson, J.F.,
title of the chart also records the concordant number of 1988. A search for ancient detrital zircons in Zimbabwean
data in the range against the total number of data sediments. Journal of the Geological Society, London 145,
(‘‘n ¼ ’’) and the concordance filter level. The macro 977–983.
also flags any data beyond the user-specified range. Fergusson, C.L., Carr, P.F., Fanning, C.M., Green, T.J., 2001.
Proterozoic-Cambrian detrital zircon and monazite ages
The worksheet also links to another macro via the
from the Anakie Inlier, central Queensland: Grenville and
Create DXF button to produce a rudimentary DXF Pacific-Gondwana signatures. Australian Journal of Earth
format file for importing into graphics applications. Sciences 48, 857–866.
Gehrels, G.E., Dickinson, W.R., 1995. Detrital zircon prove-
nance of Cambrian to Triassic miogeoclinal and eugeoclinal
strata in Nevada. American Journal of Science 295, 18–48.
Acknowledgements
Harley, S.L., Black, L.P., 1997. A revised Archaean chronology
for the Napier Complex, Enderby Land, from SHRIMP
Portions of this work were developed while supported ion-microprobe studies. Antarctic Science 9, 74–91.
by a NSERC Canadian Laboratories Visiting Fellow- Jessberger, E.K., Dominik, B., Staudacher, T., Herzog, G.F.,
ship at the Geological Survey of Canada, Ottawa (1998– 1980. 40Ar–39Ar Ages of Allende. Icarus 42, 380–405.
2000). Recent work and updates have been supported by Morton, A.C., Claou!e-Long, J.C., Berge, C., 1996. SHRIMP
a University of Western Australia Postdoctoral Fellow- constraints on sediment provenance and transport history in
ship (2001–2002) and Australian Research Council the Mesozoic Statfjord Formation, North Sea. Journal of
Discovery Grant DP0208797. P. Cawood and two the Geological Society, London 153, 915–929.
anonymous reviewers provided helpful comments. Nutman, A.P., 2001. On the scarcity of >3900 Ma detrital
TSRC Contribution no. 256. zircons in X3500 Ma metasediments. Precambrian Geology
105, 93–114.
Pell, S.D., Williams, I.S., Chivas, A.R., 1997. The use of
protolith zircon-age fingerprints in determining the proto-
References source areas for some Australian dunes sands. Sedimentary
Geology 109, 233–260.
Davis, D.W., Hirdes, W., Schaltegger, U., Nunoo, E.A., 1994. Pelto, C.R., 1954. Mapping of multicomponent systems.
U–Pb age constraints on deposition and provenance of Journal of Geology 62, 501–511.
Birimian and gold-bearing Tarkwaian sediments in Ghana. Rainbird, R.H., McNicoll, V.J., Th!eriault, R.J., Heaman,
West Africa; Precambrian Research 67, 89–107. L.M., Abbott, J.G., Long, D.G.F., Thorkelson, D.J.,
DeGraaff-Surpless, K., Graham, S.A., Wooden, J.L., McWil- 1997. Pan-continental river system draining Grenville
liams, M.O., 2002. Detrital zircon provenance analysis of Orogen recorded by U-Pb and Sm–Nd geochronology of
the Great Valley Group, California: evolution of an arc– Neoproterozoic quartzarenites and mudrocks, North-
forearc system. Geological Society of America Bulletin 114, western Canada. The Journal of Geology 105, 1–17.
1564–1580. Roback, R.C., Walker, N.W., 1995. Provenance, detrital zircon
Doane, D.P., 1985. Aesthetic frequency classifications. The U–Pb geochronology, and tectonic significance of Permian
American Statistician 30, 181–183. to Lower Triassic sandstone in southeastern Quesnellia,
ARTICLE IN PRESS
K.N. Sircombe / Computers & Geosciences 30 (2004) 21–31 31

British Columbia and Washington. Geological Society of Simonoff, J.S., Udina, F., 1997. Measuring the stability of
America Bulletin 107, 665–675. histogram appearance when the anchor position is changed.
Ross, G.M., Parrish, R.R., Dud!as, F.O.,. 1991. Provenance of Computational Statistics and Data Analysis 23, 335–353.
the Bonner Formation (Belt Supergroup), Montana: in- Sircombe, K.N., 1999. Tracing provenance through the isotope
sights from U-Pb and Sm-Nd analyses of detrital minerals. ages of littoral and sedimentary detrital zircon, eastern
Geology 19, 340–343. Australia. Sedimentary Geology 124, 47–67.
Ross, G.M., Parrish, R.R., Winston, D., 1992. Provenance and Sircombe, K.N., 2000. The usefulness and limitations of binned
U–Pb geochronology of the Mesoproterozoic Belt Super- frequency histograms and probability density distributions
group (northwestern United States): implications for age of for displaying absolute age data. Radiogenic age and
deposition and pre-panthalassa plate reconstructions. Earth isotopic studies, Report 13, Geological Survey of Canada,
and Planetary Science Letters 113, 57–76. Current Research 2000-F2, 11pp.
Sambridge, M.S., Compston, W., 1994. Mixture modelling of Sircombe, K.N., Bleeker, W., Stern, R.A., 2001. Detrital zircon
multi-component data sets with application to ion-probe geochronology and grain-size analysis of a B2800 Ma
zircon ages. Earth and Planetary Science Letters 128, Mesoarchean proto-cratonic cover succession, Slave Pro-
373–390. vince, Canada. Earth and Planetary Science Letters 189,
Scott, D.W., 1979. On optimal and data-based histograms. 207–220.
Biometrika 66, 605–610. Sircombe, K.N., Stern, R.A., 2002. An investigation of artificial
Scott, D.W., 1992. Multivariate Density Estimation: Theory, biasing in detrital zircon U-Pb geochronology due to
Practice and Visualization. Wiley, New York, 376pp. magnetic separation in sample preparation. Geochemica et
Scott, D.J., Gauthier, G., 1996. Comparison of TIMS (U–Pb) Cosmoschimica Acta 66, 2379–2397.
and laser ablation microprobe ICP-MS (Pb) techniques for Smosna, R., Bruner, K.R., Burns, A., 1999. Numerical analysis
age determination of detrital zircons from Paleoproterozoic of sandstone composition, provenance, and paleogeogra-
metasedimentary rocks from northeastern Laurentia, phy. Journal of Sedimentary Research 69, 1063–1070.
Canada, with tectonic implications. Chemical Geology Sturges, H.A., 1926. The choice of a class interval. Journal of
131, 127–142. the American Statistical Association 21, 65–66.
Shannon, C.E., Weaver, W., 1949. The Mathematical Theory of Wand, M.P., 1996. Data-based choice of histogram bin width.
Communication, University of Illinois Press, Illinois, 117pp. University of New South Wales, Australian Graduate
Silverman, B.W., 1986. Density Estimation for Statistics and School of Management Working Paper Series 95-011,
Data Analysis. Chapman and Hall, London, 175pp. 14pp.

You might also like