Professional Documents
Culture Documents
As discussed in Chapter 1, probabilistic seismic hazard analysis requires models for predicting
ground-motion intensity that can be coupled with the seismic source models addressed in Chapters 2
and 3. In this chapter, we focus on a particular class of models for ground-motion characterization
referred to as empirical ground-motion models (GMMs) . Empirical GMMs take a rupture scenario
(Chapter 3) and compute the probability distribution of an intensity measure that describes the ground
motion. This probability distribution is then used in the calculation of seismic hazard in Chapter 6.
Learning Objectives
By the end of this chapter, you will be able to do the following:
• Evaluate a range of intensity measures used in the engineering characterization of ground motion.
• Describe available ground-motion observations used in the development of empirical ground-
motion models.
• Describe the mathematical representation of empirical ground-motion models and their scaling
with key predictor variables.
• Compute the distribution of ground motion for a given rupture scenario and determine probabilities
of exceeding various levels of ground-motion intensity.
• Describe modeling uncertainty in empirical ground-motion models and apply multiple models and
parametric uncertainties to quantify these epistemic uncertainties.
4.1 Introduction
Empirical GMMs predict the distribution of ground-motion intensities and are principally calibrated
using observational data. Theoretical considerations also play a role in their development, but
ultimately their consistency with observational data is a primary consideration. Figure 4.1 illustrates
several of the key concepts in this chapter. The figure plots measures of ground-motion intensity
from a particular database against the distance between the earthquake rupture and the recording
station. Superimposed on these empirical data are predictions from an empirical GMM, represented
by median values and 16th and 84th percentiles of the ground-motion distribution.
Several features of empirical GMMs are notable with reference to Figure 4.1.
• An empirical GMM predicts a particular ground-motion intensity measure, such as S A(1 s), rather
than the ground-motion time series (i.e., the variation of acceleration with time).
• Because the focus is on intensity measures, little attention is given to the underlying nature of the
seismic waves (e.g., body or surface waves; see Section 5.4) that comprise the acceleration time
series.
147
• The ground-motion intensity values generally increase as the earthquake magnitude increases and
as the source-to-site distance decreases.
• Despite a large amount of data in total, there is a relative paucity of data for large magnitudes and
short source-to-site distances.
• There is large variability in the ground-motion intensity values for a given magnitude and source-
to-site distance, as illustrated by the significant difference between the 84th and 16th percentiles of
the empirical GMM distribution.
10 0 M7.5
M5.5
10 –1
10 –3
10 0 10 1 10 2
Source-to-site distance, R [km]
Fig. 4.1 Observed (pseudo) spectral acceleration, SA(1 s), data from the NGA-West2 ground-motion database and the empirical
prediction of the Boore et al. (1997) (BJF97) model. The observed data is colored based on binned magnitude ranges centered at
M = 5.5 (black) and M = 7.5 (gray), and with ±0.2 magnitude unit widths. The solid and dashed lines represent the median
and 16th/84th percentiles of the BJF97 prediction, respectively.
ground-motion modeling draw heavily upon physical theory and empirical calibration. As discussed
further in Chapter 5, so-called physics-based methods seek to explicitly address many aspects
of the underlying physics that give rise to ground motions, and predict the ground-motion time
series directly. Despite this, physics-based methods require constitutive relationships that have been
calibrated using empirical observations. In the same vein, as discussed later in this chapter, empirical
models adopt mathematical forms guided by considerations of earthquake source, wave propagation,
and site response physics. General advances in GMM prediction will, therefore, occur as a result of
symbiotic progress in both empirical and physics-based methods.
Figure 4.2 illustrates several ground-motion acceleration time series that have been recorded from
historical earthquakes. Because earthquake-induced ground motions are a complex seismic signal, it
is often desirable to describe them through parameters, or summary statistics, referred to as ground-
motion intensity measures (IMs). In this section, a variety of commonly used IMs are presented.
In general, ground-motion severity can be considered as a function of amplitude,1 frequency
content, and duration. All other things being equal, the larger the ground-motion amplitude, the more
damage it will cause. Frequency content is important because the dynamic response of a system is
a function of the similarity of the ground-motion excitation frequencies with the natural frequencies
of the system of interest. Finally, ground-motion duration influences demands for both linear and
nonlinear systems. For example, if the ground-motion amplitude and frequency content cause
nonlinear response of the system, a longer duration will generally result in a greater accumulation of
damage.
Fig. 4.2 Examples of representative ground-motion acceleration time series recorded in past earthquakes, illustrating their complexity.
1 Intensity is sometimes used as a synonym for amplitude in this context, but we use the latter to avoid confusion with the
broad definition of intensity measure (IM).
Ground-motion IMs typically represent one or more of the above three general aspects of ground-
motion severity, and the following subsections outline commonly used IMs. From the outset, it is
important to appreciate that various IMs focus on different aspects of the ground-motion time series.
Therefore, multiple IMs are generally considered necessary to provide a comprehensive description
of ground-motion severity.
Acceleration, a [g]
PGA = 0.029 g 0.1 PGA = 0.1 g
0.02
0 0
–0.02
–0.1
–0.04
0 20 40 0 20 40
5 20
PGV = 4.2 cm/s
Velocity, v [cm/s]
10
0 0
–10
PGV = 16 cm/s
–5 –20
0 20 40 0 20 40
2 5
Displacement, d [cm]
0 0
–1
PGD = 1.4 cm
PGD = 4.4 cm
–2 –5
0 20 40 0 20 40
Time, t [s] Time, t [s]
Fig. 4.3 Illustration of peak ground acceleration (PGA), velocity (PGV), and displacement (PGD) intensity measures computed for the
north-south (NS) component ground motions recorded at Yerba Buena Island (rock) and Treasure Island (soil) during the 1989
Loma Prieta earthquake. The two sites are 75–77 km from the earthquake source and separated by a distance of approximately
2 km (Finn et al., 1993).
The pseudo-spectral acceleration, SA(T ), is then computed from the spectral displacement as
SA(T ) = ωn2 SD(T )
(4.3)
= ωn2 max |x(t)| .
t
Spectral displacement and pseudo-spectral acceleration are both a function of the angular natural
frequency and damping ratio. Therefore, strictly speaking, we should define them as SD(T, ξ) and
SA(T, ξ), respectively. Because most spectral acceleration ordinates are computed for ξ = 0.05
(i.e., 5% of critical damping), we generally suppress the explicit notation of dependence on critical
damping ratio, and sometimes even the dependence on T for brevity.
2
SA(T ) = n max |x(t)|
t
Fig. 4.4 Illustration of the response spectrum concept based on the peak relative displacement response, x(t), of a SDOF system to the
ground-motion excitation, a(t).
5 As T → 0, the system is very stiff, k → ∞, and the mass effectively moves as though it were rigidly connected to the
ground. Therefore, the maximum absolute displacement of the system mass and the ground become equivalent.
10 0
10 –2
10 –3
10 –2 10 –1 10 0 10 1
Period, T [s]
Fig. 4.5 Spectral acceleration (ξ = 0.05) intensity measures computed for the north-south component ground motions recorded at
Yerba Buena Island and Treasure Island during the 1989 Loma Prieta earthquake.
response is dominated by the first-mode vibration period, T1 , then SA(T1 ) is often a valuable IM.
However, shorter vibration periods are of interest for considering higher modes of vibration, and
nonlinear seismic response leads to an elongation of the effective period associated with the first
mode of vibration, so vibration periods greater than T1 are also of interest.
where a( f ) is the Fourier transform of a(t), often referred to as the “frequency-domain” representa-
tion of the acceleration. In practice, this equation is solved using the discrete Fourier transform:
Δt
NT −1
a( f k ) = a(nΔt)e−i(2π fk )nΔt (4.5)
NT n=0
where f k are the discrete frequencies at which the Fourier transform is computed, and Δt and NT
are the time increment and number of time steps, respectively. Rather than viewing the ground
motion as an ordered set of acceleration values at particular times, in the frequency domain, we
view the ground motion as the superposition of sinusoidal components with particular amplitudes
and phase angles at each frequency.6 Just as considering the entire set of acceleration–time pairs
6 Noting Euler’s identity, e i π + 1 = 0, implies that a(t) = k |a fk |sin[2π fk t + φ fk ].
10 1
10 –1
10 –2
10 –3
Yerba Buena Island
Treasure Island
10 –4
10 –1 10 0 10 1
Frequency, f [Hz]
Fig. 4.6 Illustration of the Fourier amplitude spectra (|a(f)|) computed for the north-south component ground motions recorded at
Yerba Buena Island and Treasure Island during the 1989 Loma Prieta earthquake. Thin lines result from the direct application of
Equation 4.5, with frequency-smoothed values in bold lines.
provides a complete description of the ground motion, so too does the consideration of the entire
set of amplitude–phase–frequency triplets. Because a( f ) is complex-valued, the absolute value (i.e.,
complex modulus, |a( f )|) corresponds to the amplitude of the sinusoid for a given frequency, while
its complex argument, arg(a( f )), is related to the phase angle associated with that frequency.
Figure 4.6 illustrates the Fourier amplitude spectra of the Yerba Buena Island and Treasure Island
ground motions. The “raw” amplitude spectra vary significantly with small changes in frequency, and
therefore it is conventional to compute frequency-smoothed spectra,7 which are shown in boldface
lines in the figure.
Fourier amplitude spectra and response spectra are similar in that they both indicate the ground-
motion intensity as a function of frequency or oscillator period. However, there are some important
differences between the two.
A first difference is that the Fourier spectrum is a property of the ground-motion time series,
as compared with the response spectrum that represents the peak response of a SDOF system to
the ground-motion excitation. Because SDOF response is a simple proxy for the response of a
more complex multi-degree-of-freedom (MDOF) structure, the use of response spectra by structural
engineers is common. However, because response spectra are not a direct reflection of ground-
motion acceleration records, comparison of response spectra can pose difficulties. In particular,
when comparing empirical GMMs from different regions, the direct link between seismological
7 Using the smoothing function of Konno and Ohmachi (1998) with b = 40.
theory and Fourier spectra make it more informative than response spectra (Bora et al., 2016).
This is why Fourier spectra are more commonly adopted in physics-based simulation approaches
(see Section 5.5.3).
A second difference is that Fourier spectral amplitudes vary significantly over small frequency
ranges—reflecting physical phenomena and/or the finite frequency resolution for a given Δt.
Therefore, smoothing is implemented for some applications. Smoothed spectra are illustrated in
Figure 4.6. On the other hand, with response spectra, damped oscillators respond to excitation over
a range of frequencies, producing an inherent smoothing of amplitudes with frequency.
4.2.4 Duration
Strong-motion duration is an important factor in ground-motion severity. All other things equal, a
longer duration of shaking will produce greater resonance and result in greater cumulative damage
during nonlinear dynamic response. Despite this, strong-motion duration is a secondary intensity
measure, because it is only important if the amplitude and frequency content of the ground motion
produce a large (i.e., nonlinear) response.
There are many definitions of duration, and a detailed discussion of their merits can be found in
Bommer and Martinez-Pereira (1999). Here, three common duration measures are discussed.
where t |a(t) |> AT signifies time instants where the condition |a(t)| > AT is satisfied.
The uniform duration, DU , is defined as the cumulative time over which the acceleration amplitude
exceeds the threshold acceleration, which can be mathematically expressed as
DU = I[|a(t)| > AT ]dt (4.7)
t
where I[.] is an indicator function, which equals 1 if the condition is true, and 0 otherwise.
Figure 4.7a illustrates uniform and bracketed durations for the Treasure Island ground motion
using a threshold of AT = 0.05 g. In practice, AT should be specified based on the system being
analyzed (e.g., the yield acceleration in a slope stability context; Bray and Travasarou, 2007). In this
example, the bracketed duration is DB = 3.99 s, and the uniform duration is DU = 1.09 s. Because
the Yerba Buena Island ground motion amplitude is less than 0.05 g (see Figure 4.3), it has zero
uniform and bracketed durations for this threshold acceleration.
Significant Duration
An alternative definition of duration that does not require the specification of a threshold acceleration
is to consider the time over which a specific percentage of the total energy imparted by the ground-
motion time series occurs. It can be shown that the energy contained in the ground-motion time series
(a)
Bracketed Duration,
0.1
DB = 3.99 s
Acceleration, a [g]
0.05
–0.05
0 5 10 15 20 25 30 35 40
Time, t [s]
(b)
1
accumulation (normalized)
0.8
Arias Intensity
0.2
0
0 5 10 15 20 25 30 35 40
Time, t [s]
Fig. 4.7 Illustration of (a) uniform and bracketed, and (b) significant duration for the Treasure Island ground motion during the 1989
Loma Prieta earthquake. The uniform and bracketed durations use a threshold of AT = 0.05 g, while significant duration is
based on 5% and 95% thresholds.
then the significant duration, DS , is the time difference between the accumulation of two different
percentages of the cumulative squared acceleration, X% and Y % (where Y > X), mathematically:
DSX−Y = tY − t X . (4.9)
The value of X is almost universally considered to be 5%, and the two most common values of Y
are 75% and 95%. These two metrics are subsequently referred to as the 5–75% significant duration,
DS5−75 , and the 5–95% significant duration, DS5−95 , respectively. Although the specific percentages
(X and Y ) considered are empirical, there is a loose physical basis for their selection. Generally,
the P-wave arrivals for ground motions of engineering interest comprise 5% or less of the total
accumulated squared acceleration, so t 5 approximately coincides with the S-wave arrival time. The
time at which 75% of the total squared acceleration is accumulated, t 75 , generally includes the arrival
of the majority of S-wave phases, and t 95 includes the majority of the S-wave and surface wave
phases (Bommer et al., 2009). Hence DS5−75 and DS5−95 loosely relate to the duration of significant
body and body+surface wave arrivals, respectively.
Figure 4.7b illustrates the 5–95% significant duration for the Treasure Island ground motion. In this
instance, the value of DS5−95 = 5.83 s is similar to the bracketed duration in Figure 4.7a, except the
time interval starts approximately 2 s earlier. Looking at the acceleration time series in Figure 4.7a,
DS5−95 approximately reflects the duration of significant body and surface waves (see Section 5.4),
with the surface waves after t = 15 s small in amplitude.
Other than their difference in the use of squared acceleration (AI) versus the absolute value of
acceleration (C AV ), these IMs vary in two other ways. First, AI has a scalar coefficient in front
of the integral because it physically represents the amount of energy, per unit weight, of an infinite
number of undamped SDOF oscillators with frequencies uniformly distributed in (0, ∞); whereas
C AV has no direct theoretical representation. Second, AI has historically been computed using
Equation 4.10 with acceleration in m/s2 , giving AI units of m/s; whereas, when originally proposed,
C AV utilized acceleration in g, giving C AV units of g-s. Since the metrics could potentially have the
same units despite representing different aspects of ground-motion severity, this arbitrary difference
in unit ensures that the resulting numbers have different orders of magnitude.
Both AI and C AV consider duration because of the use of time integration. They both also
consider frequency content because, for a given amplitude, a lower frequency will lead to a larger
value of the integral over a single cycle. Figure 4.8 illustrates the temporal evolution of AI and C AV
for the two ground motions from the Loma Prieta earthquake previously considered. The Treasure
Island record has larger AI and C AV because of its larger acceleration amplitudes over a greater
time interval and broad range of vibration frequencies. Because AI is based on squared acceleration,
and acceleration peaks tend to be of high frequency, then AI is more strongly correlated to high-
frequency ground-motion IMs, whereas C AV is more correlated to moderate-frequency ground-
motion IMs (Bradley, 2012a, 2015). Figure 4.8 illustrates this dependence via the large accumulation
of AI during the strongest accelerations, while the C AV accumulation occurs over a longer time
period. Therefore, AI is a more important IM for quantifying the seismic response of structures
with higher natural vibration frequencies, while C AV has greater utility for structures with lower
natural frequencies.
8 From Parseval’s theorem this integral is also equivalent to that over the squared Fourier amplitude in the frequency domain.
(a)
0.15
0.05
(b)
Cum. Abs. Velocity, CAV [g-s]
0.3
0.25
0.2
0.15
0.1
0.05
0
0 5 10 15 20 25 30 35 40
Time, t [s]
Fig. 4.8 Illustration of the temporal accumulation of (a) Arias intensity, and (b) cumulative absolute velocity for the north-south
components of the Yerba Buena Island and Treasure Island ground motions during the 1989 Loma Prieta earthquake.
Finally, while AI and C AV consider all three aspects of ground-motion severity in a qualitative
sense, this does not mean that they are superior to the previous IMs that consider only amplitude or
frequency content. For example, neither AI nor C AV provides a precise IM for systems that respond
to a narrow band of excitation frequencies.
orthogonal horizontal directions, the IM in each recorded direction was computed and the geometric
mean of the two values was taken as the IM for the ground motion:
0.5
I MGM = I Mx × I My (4.12)
where I Mx and I My are the IM values for the two orthogonal components (x and y) of the
recorded ground motion, and I MGM is the geometric mean IM value. This approach is simple,
and the averaging of the two IM values reduces the ground motion database size by half, which
is advantageous for empirical GMM calibration (Boore et al., 1997).
A limitation of the geometric mean calculation is that the I MGM value depends on the in-situ
orientation of the ground-motion recordings.9 If the recording instrument were rotated by several
degrees, the I Mx and I My values would change and consequently I MGM also. The preferred
refinement to address this is to compute the IM value for all horizontal orientations of the ground
motion. Two common definitions are used when considering orientation-dependent ground-motion
IMs. The first is the median IM value obtained across all orientations, which yields the so-called
RotD50 metric:
I MRotD50 = median [I M (θ)] (4.13)
θ
where I M (θ) is the IM value of the horizontal ground motion along orientation θ. The “RotD”
subscript denotes that the computation is being performed over a range of rotation angles,10 and
the “50” denotes the 50th percentile (i.e., median) value over these orientations is being computed
(Boore, 2010). Alternatively, the maximum value over all orientations is:
I MRotD100 = max [I M (θ)] . (4.14)
θ
To illustrate, Figure 4.9 shows the SDOF oscillator response in the horizontal plane, for two
different oscillator periods, when subjected to the Yerba Buena ground motion from the 1989 Loma
Prieta earthquake. The spectral acceleration based on the time series for orientation θ, SAθ , divided
by SARotD100 , is also shown with a dashed line. The orientation-dependence of the ground-motion
intensity is referred to as directionality. Comparing the two figure panels, the response is very
polarized for SA(T = 2.0 s) in Figure 4.9b, and relatively less so for SA(T = 0.3 s) in Figure 4.9a.
One numerical measure of polarity is the ratio I MRotD100 /I MRotD50 , which has values of 1.17 and 1.41
for the two examples in Figure 4.9. An unpolarized ground motion would have the same intensity in
all orientations, and therefore a ratio of 1.0. A perfectly polarized ground
√ motion would have spectral
response along a single orientation, and this would yield a ratio of 2 = 1.41 (Boore, 2010; Shahi
and Baker, 2014).
While the rotation-independence of both I MRotD50 and I MRotD100 avoid limitations of past
formulations, the decision of which definition to use will be a function of the properties of the
particular structure being considered (e.g., Stewart et al., 2011). Importantly, the specific definition
used in seismic hazard and subsequent seismic response analysis calculations must be consistent
(Baker and Cornell, 2006c).
9 The arithmetic mean for AI (i.e., ( AI x + AIy )/2), and quadratic mean for Fourier amplitude (i.e., ([a x ( f )]2 +[a y ( f )]2 ) 0.5 ),
are both orientation-independent and therefore commonly adopted. Otherwise, a RotD-type definition (e.g., Equation 4.13)
is most common for other IMs.
10 The “D” term indicates that the orientation varies for the different IMs calculated, specifically in the context of SA ordinates
for different oscillator periods (Boore et al., 2006; Boore, 2010).
SA/SA RotD100
90 90
x(t )/SD RotD100
1 1
0.8 0.8
0.6 0.6
0.4 0.4
0.2 0.2
180 0 0 180 0 0
270 270
(a) (b)
Fig. 4.9 Directionality of the Yerba Buena Island ground-motion spectral response in the horizontal plane as a function of time, and
orientation-dependent spectral acceleration: (a) T = 0.3 s (IMRotD100 /IMRotD50 = 1.17) and (b) T = 2.0 s
(IMRotD100 /IMRotD50 = 1.41). Orientations are in degrees, and the spectral accelerations (SA) are normalized by the maximum
value over all orientations SARotD100 , while the oscillator displacement response x(t) is normalized by the maximum spectral
displacement SDRotD100 .
• PG A: Simplified liquefaction and other geotechnical assessments (e.g., Idriss and Boulanger,
2008)
• PGV : Deformation of underground structures and infrastructure subject to transient ground
displacements (e.g., Jeon and O’Rourke, 2005)
• S A(T ): The most common intensity measure, ubiquitous in structural applications, with the period
T based on the first-mode of vibration (e.g., Shome et al., 1998)
• DS : Quantification of the effects of long-duration ground motions on structural response (e.g.,
Hancock and Bommer, 2006)
• AI: Seismic slope displacements (e.g., Bray and Travasarou, 2007)
• C AV : Triggering of liquefaction (specifically, a modified version, C AV5 ) (e.g., Kramer and
Mitchell, 2006).
The above are common choices for their respective problems, but ultimately the analyst has the
freedom to adopt alternatives. An additional consideration is that IMs are simplified representations
of ground-motion severity. It is possible to develop more specific IMs that contain additional
information about the problem of interest, and therefore predict a specific risk metric with lower
uncertainty (e.g., Cordova et al., 2001; Luco and Cornell, 2007; Marafi et al., 2016). However, as
these IMs become more problem-specific, they also become less generally applicable. As a result,
problem-specific IMs may not have empirical GMMs necessary to characterize the hazard in this
form. Furthermore, GMMs often require extrapolation beyond the constraints of empirical data (e.g.,
to large magnitudes and small source-to-site distances), and problem-specific IMs are less likely to
have a seismological basis for their extrapolation (see Section 4.7.2). Thus, there is a tension between
using general IMs (which are widely used, and for which empirical GMMs exist and are constrained
by theoretical considerations) and more specific IMs (for which empirical GMMs may need to be
specifically developed, and may have extrapolation complications).11 In the discussion and examples
that follow, emphasis is given to general IMs for these reasons.
Given the limitation of simplified IMs for describing a complex ground-motion time series, another
option is to utilize a vector of IMs to describe ground-motion severity. Ideally, each component of
the vector would enhance the prediction of response and describe a unique aspect of ground-motion
severity. Seismic hazard analysis can be performed directly for a vector of IMs (Chapter 7). Then
fragility or vulnerability functions that utilize this vector of IMs can be used to compute seismic risk
metrics (Chapter 9). Alternatively, seismic hazard analysis can be performed for a single (scalar) IM
(Chapter 6). Then, the distribution of a vector of IMs conditioned on the scalar IM can be utilized to
obtain seismic fragility or vulnerability functions (most commonly via ground-motion selection and
response-history analyses, discussed in Chapter 10).
Instruments capable of recording strong motions for quantitative analysis, first developed in the early
1900s, have been critical to characterizing ground motions. Recordings from historical earthquakes
are archived in several databases, such as the following:
1. Pacific Earthquake Engineering Research Center (PEER): Several databases of worldwide earth-
quakes for shallow crustal, stable continental, and subduction zone earthquakes (http://peer
.berkeley.edu/peer ground motion database)
2. National Research Institute for Earth Science and Disaster Resilience (NIED): Surface
(K-Net) and subsurface (Kik-Net) databases of earthquakes occurring in Japan (http://www
.kyoshin.bosai.go.jp/)
3. Pan-European Engineering Strong Motion (ESM): Earthquakes from the European-Mediterranean
and Middle-East regions (https://esm-db.eu)
4. New Zealand GeoNet: Earthquakes occurring in New Zealand (http://www.geonet.org
.nz/)
5. Center for Engineering Strong Motion Data (CESMD): Ground motions from several different
national networks (https://strongmotioncenter.org/).
The availability in the past two decades of public strong-motion databases has enabled a significant
increase in research on empirical ground-motion characterization, through ease of access and greater
consistency in the underlying metadata. The process by which “raw” measurements from instruments
are signal-processed to optimize the signal (from the earthquake-induced ground-motion excitation)
11 One avenue is the development of conditional GMMs, which construct a model for a specific IM based on an existing
model of a more general IM (e.g., Tothong and Cornell, 2006; Macedo et al., 2019).
from the noise (all other forms of vibration) is a specialized topic (e.g., Boore and Bommer, 2005)
that is particularly important for the use of older recordings from analog instruments, but less of a
concern for modern digital instruments. Many (but not all) databases now provide ground-motion
time series and IMs that have already been signal-processed following best-practice procedures
(e.g., Ancheta et al., 2014). As a result, we do not discuss ground-motion processing and filtering
explicitly in this chapter.
As discussed in subsequent sections, ground motions observed in different locations worldwide are
often grouped to develop empirical GMMs. Because there is often a paucity of data in a single
region, grouping provides more data to constrain GMMs. However, this does come at the cost of
the developed model being generic for a type of region, and not necessarily representing the specific
conditions near the site of interest (see Chapter 8).
Figure 4.10 shows the global distribution of earthquakes from four of the databases listed in the
previous subsection. In viewing Figure 4.10, it is noted that ground-motion databases for engineering
applications are a subset of all earthquakes that have been recorded in that region. Specifically,
observations from small-magnitude earthquakes may be excluded if such earthquake events are
deemed unimportant for engineering applications. Similarly, small-amplitude ground motions may
also be excluded (for having low signal-to-noise ratios). The minimum magnitude and ground-
motion amplitude limits of interest have, however, been decreasing over time as instruments become
increasingly sensitive and small-amplitude ground motions are used to understand regional variations
in ground-motion prediction.
12 The large increase in 1999 results from the dense recording of the 1999 Chi-Chi, Taiwan, earthquake sequence.
60˚
(a) (c)
(b)
30˚
0˚
NGA East
−30˚ NGA West2
Kik−Net (d)
GeoNet
ESM
50˚
(a) (b)
45˚
45˚
40˚
40˚
35˚
35˚
−150 −100 −50 0 30˚
30˚
50˚ −35˚
(d)
(c) M8
M7
M6
45˚
M5
−40˚ M4
M3
40˚
35˚
−45˚
30˚
Fig. 4.10 Earthquake locations of several publicly available earthquake-induced ground-motion databases. In panels (a)–(d) earthquake
depth is indicated by the amount of shading, and magnitude by the marker size.
10 5
NGA-West2
10 4
10 years
10 2
10 1
10 0
1930 1940 1950 1960 1970 1980 1990 2000 2010 2020
Year
Fig. 4.11 Increase in the size of empirical ground-motion databases over time illustrated for the NGA-West2 database.
2.2 u10 4
2 All records
PGA > 0.01 g
1.8 PGA > 0.05 g
Number of ground motions
1.6
1.4
1.2
1
0.8
0.6
0.4
0.2
0
1930 1940 1950 1960 1970 1980 1990 2000 2010 2020
Year
Fig. 4.12 Number of ground motions in the NGA-West2 database as a function of the year recorded and their PGA value. The majority of
ground motions have small amplitudes, with only 2993 of 21,298 ground motions in the database having PGA > 0.05 g.
Magnitude, M 6
PGA = 0.05 g
4
PGA = 0.01 g
3
10 –1 10 0 10 1 10 2
Source-to-site distance, R [km]
Fig. 4.13 Magnitude and source-to-site distance distribution of recorded ground motions in the NGA-West2 database.
Magnitude-distance thresholds of PGA = 0.01 and 0.05 g are based on the median predictions from Chiou and Youngs (2014)
for the following reasons. First, we know that large-magnitude events are less likely to occur,13 so
the likelihood of recording a ground motion from a large magnitude earthquake is relatively low.
Second, as the source-to-site distance doubles, the area on the ground surface increases by a factor
of four. As a result, it is much more likely to observe a ground motion at a large distance from an
earthquake than at a small distance. These trends imply that we will always be data-poor in the top-
left-hand corner of a magnitude-distance distribution such as Figure 4.13, and our GMMs will have
more uncertainty for these conditions. These are also often the scenarios which dominate the seismic
hazard, as we will see in Chapter 6.
13 As discussed in Chapter 3, generally, there is a 10-fold decrease in the rate of occurrence of events for every unit increase
in magnitude.
10 4
2.2
2 All records
M > 5.0
1.6
1.4
1.2
1
0.8
0.6
0.4
0.2
0
10 –2 10 –1 10 0 10 1
Period, T [s]
Fig. 4.14 Available number of ground-motion recordings as a function of oscillator period, T, from the NGA-West2 database. Ground
motions from larger magnitude earthquakes are more likely to be usable at longer oscillator periods.
considerations. The usable period range is related to properties of the bandpass filter applied to
isolate signal from noise (e.g., Akkar and Bommer, 2006; Douglas and Boore, 2011). For example,
Figure 4.14 illustrates the number of usable ground motions as a function of the SA period of interest
for the NGA-West2 database. For T < 1 s, generally, all ground motions provide usable SA values,
but for T > 1 s this number decreases rapidly, with only ≈20% of ground motions providing usable
S A values for T = 10 s. However, the maximum usable period is strongly linked to the earthquake
magnitude.14 The trend for M > 5.0 and M > 6.5 events in Figure 4.14 illustrates the reduction in
usable ground motions is less significant for moderate- and large-magnitude earthquakes.
Having discussed ground-motion metrics of interest, and available empirical data, we now discuss
the calibration and use of empirical GMMs.
14 Specifically, the ground-motion corner frequency, which is correlated with earthquake magnitude; see Section 5.5.3.
which states that I M is a lognormal random variable with mean and standard deviation of μln I M and
σln I M , respectively. The lognormal distribution is almost universally adopted for empirical GMMs
because empirical studies have shown that this is an appropriate assumption (Section 4.4.3).
Because of the relationship between lognormal and normal distributions (Section A.5.2), ln I M has
a normal distribution. The normal random variable ln I M is commonly expressed in the following
form:
ln I M = μln I M (rup, site) + σln I M (rup, site) · ε (4.16)
where μln I M and σln I M are explicitly noted as functions that depend on the earthquake rupture,
rup, and the site considered, site; and, ε is a standard normal random variable that represents the
variability in ln I M.
The functional dependence of Equation 4.16 on rup and site is specific to individual empirical
models (see Section 4.5), but as concrete examples, rup can be represented with variables such as
magnitude (M), site by variables such as the 30-m time-averaged shear-wave velocity (VS,30 ), and
the geographical relationship between rup and site by the source-to-site distance (R). Positive and
negative values of ε result in larger- and smaller-than-average values of ln I M, respectively.
A property of the lognormal distribution is that exp(μln I M ) is the median of the distribution of I M
(Section A.5.2). As a result, reference is often made to GMMs providing models for “the median and
the standard deviation.” This notion may appear initially confusing because it is the median of I M
and the standard deviation of ln I M. However, this property of lognormal distributions provides
equivalence between the mean of ln I M and the median of I M. Because of the assumption of
lognormality, these two parameters are sufficient to provide a complete description of the distribution
of I M for any given rupture scenario and site characterization. Further discussion on this apparent
variability is provided in Section 4.4.3.
σln S A = a6 (4.18)
15 Different variable names for the coefficients than in the original BJF97 have been used to maintain simplicity. In this
instance, R is the Joyner–Boore definition on source-to-site distance (R j b ; see Section 3.7.3).
where a0 − a6 are SA period-dependent empirical coefficients which are obtained from regression
analysis on the ground-motion database that was used by BJF97. Thus, there are three predictor
variables (M, R, and VS,30 ) and seven empirical coefficients at each period.
To simplify the use of this model even further for the prediction of SA(1 s) from strike-slip
earthquakes,16 substituting in the values of the coefficients from Boore et al. (1997) gives:
√
μln S A(1 s) = −3.4415 + 1.42M − 0.032M 2 − 0.798 ln R2 + 8.41 − 0.698 ln(VS,30 ) (4.19)
Figure 4.1, and other comparisons in this chapter, shows that there is significant scatter in observed
ground-motion intensities, even after accounting for the effect of predictor variables such as magni-
tude, source-to-site distance, and site conditions, among others. In the context of Equation 4.16, the
difference between a ground-motion observation (ln im) and the mean prediction (μln I M ) is referred
to as the prediction residual, Δ = σln I M · ε. Visually this can be thought of as the vertical difference
between the data and the median model prediction in Figure 4.1.
An important characteristic of prediction residuals is the distribution that they conform to.
Figure 4.15 illustrates a typical quantile-quantile plot of normalized prediction residuals (ε) in
comparison to the quantiles from a standard normal distribution for two different IMs. The data
plot approximately on one-to-one lines beyond two standard deviations from the mean, and within
the statistical rejection bounds to nearly four standard deviations. Such empirical results indicate
that the assumption of a normal distribution for ln I M is justified up to 3.5–4 standard deviations
(Bommer et al., 2004; Jayaram and Baker, 2008; Strasser et al., 2008).
Because the lognormal distribution is theoretically unbounded, truncation of the prediction
distribution is sometimes employed for practical convenience. Figure 4.15, however, illustrates that
there is no empirical basis for truncation below approximately four standard deviations. The details
of the distribution so far into its tails are practically immaterial for most applications that are not
concerned with seismic hazard at very low exceedance rates.
(a) 4 (b) 4
3 3
1 1
0 0
–1 –1
–2 –2
–3 –3
–4 –4
–4 –2 0 2 4 –4 –2 0 2 4
Standard normal quantiles Standard normal quantiles
Fig. 4.15 Quantile-quantile plots comparing normalized residuals (ε) against standard normal quantiles, illustrating the appropriateness
of the normality assumption of ln SA(T) for (a) T = 0.2 s and (b) T = 1 s. The data lying within the gray “acceptance”region
indicate the normality assumption cannot be rejected at the 95% confidence level, using the Kolmogorov–Smirnov
criterion (Aldor-Noiman et al., 2013). Data are from M > 5 earthquakes in the NGA-West2 database.
0.9
CY14 M < 5.0
0.8
0.7
Standard deviation, s lnSA(T )
0.5
CY14 M > 6.5
0.4
0.3
0.2
0.1 BJF97
CY14
0
10 –2 10 –1 10 0 10 1
Period, T [s]
Fig. 4.16 Apparent aleatory standard deviation of the BJF97 and CY14 GMMs as a function of oscillator period. The BJF97 model σln SA is
only a function of oscillator period, T, whereas the CY14 model is a function of several variables, principally T and M.
In addition to the distribution type, the size (i.e., standard deviation) of the variability is also of
primary interest. Figure 4.16 illustrates the standard deviation of the BJF97 and CY14 GMMs as a
function of oscillator period. The (lognormal) standard deviation of the BJF97 model varies from
approximately 0.44 to 0.57. To get a sense for the significance of this level of variability, if the
median ground-motion intensity was I M50 = exp(μln I M ) = 0.5 g and the standard deviation was
σln I M = 0.6, then 68% (i.e., μ ± 1σ) of intensity measure values would be expected to fall in the
interval:
exp(μln I M ± σln I M ) = [0.275 g, 0.911 g] (4.21)
which represents approximately a factor of 3.3 in the ground-motion intensity. Extending the example
to cover 95% of IM values (i.e., μ ± 1.96σ) leads to a factor of 10.5. Given the size of this variability,
it is critical to consider it in the predictive model. Hence, these models must provide a probability
distribution, rather than just a single value.17 Later PSHA calculations in Chapter 6 will account for
the possibility of unlikely outcomes, such as extreme intensities much larger than the predicted mean.
The BJF97 model considers standard deviation solely as a function of the oscillator period.
Contemporary GMMs predict standard deviations that also vary as a function of magnitude, source-
to-site distance, and site parameters. For example, the CY14 standard deviation is a function of
earthquake magnitude, as shown in Figure 4.16, with larger variability for smaller magnitude events.
This larger variability for smaller magnitudes can be attributed to several causes: (1) the larger
uncertainty in the estimate of earthquake location and size for smaller events, (2) more “averaging”
of randomness in the source rupture properties for larger-sized events, and (3) the increased influence
of near-surface attenuation at smaller magnitudes.
In this section, the influence of rupture and site properties on IMs is considered. We will
consider these effects qualitatively by comparing several ground-motion time series, and then more
quantitatively by comparing median empirical GMM predictions with observed data. The intent of
this section is to develop an empirical understanding of the effects of these parameters on ground-
motion intensity. Additional physics-based insights into the reasons for these trends are presented in
Chapter 5.
4.5.1 Magnitude
All other factors equal, ground-motion amplitude increases with increasing magnitude, since a larger
amount of energy is released from the earthquake source. Ground-motion duration also increases as
a result of a longer duration of rupture on the causative fault plane. Figure 4.17 illustrates several
ground-motion time series corresponding to earthquake events with different magnitudes, in which
the acceleration amplitude and duration increase as magnitude increases. Although not apparent
from the acceleration time series, the longer rupture duration of larger magnitude earthquakes also
decreases the so-called corner frequency of the radiated ground-motion spectrum, resulting in an
increase in the low-frequency ground-motion amplitudes (see Section 5.5.3).
Building on the intuition gained from Figure 4.17, Figure 4.18 illustrates the variation in SA(1 s)
amplitudes of a subset of ground motions in the NGA-West2 database as a function of M, for two dif-
ferent source-to-site distance ranges. In addition to the SA(1 s) values from observations, the median
17 This is one of the fundamental problems with so-called ‘deterministic’ seismic hazard analysis, see Sections 1.3 and 6.10.3
M=8
M=6
0.2 g
M=5
0 10 20 30 40 50 60 70
Time, t [s]
Fig. 4.17 Effect of magnitude on ground-motion time series characteristics illustrated through example recorded ground motions. Other
than the variation in magnitude, the recordings have source-to-site distances of approximately R = 20 km and soil conditions
associated with VS,30 = 500 m/s.
10 0
10 –1
Spectral acceleration, SA(1 s) [g]
10 –2
Data
10 –3 BJF97
CY14
10 –4
10 –5
10 –6
3 4 5 6 7 8
Magnitude, M
Fig. 4.18 Effect of magnitude on SA(1 s) amplitudes from the NGA-West2 database for two source-to-site distance ranges of
R = 10–20 km (black) and 80–100 km (gray), and VS,30 = 400–600 m/s. The median empirical predictions from the BJF97
and CY14 models are shown as lines for the mid-point distances of the data (R = 15 and 90 km), VS,30 = 500 m/s, and
strike-slip faulting conditions. The BJF97 model is shown only over its stated M5.5–7.5 range of applicability.
values of the BJF97 and CY14 models are also depicted. Over the M = 3–5 range, there is a relatively
steep gradient in the ln SA(1 s)–M trend, which then decreases for larger M values. This decrease in
the gradient of ln I M with M occurs due to the variation in the corner frequency of the ground-motion
spectrum with magnitude (Fukushima, 1996), and further saturation for large-magnitude near-source
scenarios (Bommer et al., 2007). The extent of this slope change is a function of the IM considered,
with the slope reduction most significant for high-frequency ground-motion IMs.
Figure 4.18 shows that both models are consistent with the data. However, the median of the
BJF97 model is shown only for the magnitude range for which this model was developed (5.5–7.5).
The approximately linear nature of the model in ln SA(1 s)–M space (the quadratic M scaling in
Equation 4.19 is very weak at T = 1 s) means that it would lead to overprediction of the SA(1 s)
amplitudes if extended to lower and higher M values. In contrast, the CY14 model exhibits the
“concave from below” trend that is seen in the empirical data and is also applicable over the entire
M3–8 range of the data.
R = 100 km
R = 50 km
Acceleration, a [g]
R = 25 km
R = 5 km
0.2 g
0 10 20 30 40 50 60 70
Time, t [s]
Fig. 4.19 Effect of source-to-site distance on ground-motion time series characteristics illustrated through example recorded ground
motions. Other than the variation in source-to-site distance, the recordings have magnitudes of approximately M = 6.2 and
soil conditions associated with VS,30 = 500 m/s.
Data
BJF97
CY14
10 –1
10 –2
10 –3
10 0 10 1 10 2
Source-to-site distance, R [km]
Fig. 4.20 Effect of source-to-site distance on SA(1 s) amplitudes from the NGA-West2 database for magnitude ranges of M = 5.3–5.7
(black) and 7.3–7.7 (gray), and VS,30 = 400–600 m/s. The median empirical predictions from the BJF97 and CY14 models are
shown as lines for M = 5.5 and 7.5, VS,30 = 500 m/s, and strike-slip faulting. The BJF97 model is shown only over its stated
range of applicability of R < 80 km.
illustrates several ground-motion time series with different source-to-site distances, for which the
acceleration amplitude decreases and duration increases with increasing source-to-site distance.
Although not apparent from the acceleration time series, a larger source-to-site distance also changes
the ground motion’s relative frequency content. High-frequency components of motion experience a
greater number of oscillations for a given distance and are consequently attenuated more significantly
due to energy loss per cycle (referred to as anelastic attenuation; see Section 5.4.4).
Figure 4.20 illustrates the variation in SA(1 s) amplitudes of a subset of ground motions in the
NGA-West2 database as a function of R for two different magnitude ranges. The median values
of the BJF97 and CY14 models are also depicted. As will be discussed in Chapter 5, from a
fundamental physics perspective, there are three principal effects of source-to-site distance reflected
in the slope of the empirical model predictions in Figure 4.20. First, for the approximate range
of R = 10–100 km, geometric spreading (see Section 5.4.4) is dominant and ln SA decays linearly
with ln R. Second, at longer source-to-site distances, anelastic attenuation becomes increasingly
important, and the rate of amplitude decay with distance increases. Because anelastic attenuation
occurs due to material adsorption, it is a function of the number of vibration cycles and therefore
increases with distance and frequency. As a result, the effects of anelastic attenuation are not
pronounced in this figure for SA(1 s) but are increasingly significant for SA at shorter oscillator
periods. Finally, at short source-to-site distances, the finite rupture geometry is important, and there
is a “saturation” of the ground-motion amplitude as the source-to-site distance approaches zero (e.g.,
Abrahamson et al., 2008).
In viewing the empirical functional form of the BJF97 model given in Equation 4.19 the simple
linear scaling of ln SA with ln R18 illustrates that geometric spreading is accounted for, but there is
no anelastic attenuation term represented.19 The a4 term is responsible for the near-source distance
saturation. Contemporary models such as CY14 explicitly account for anelastic attenuation. They
also have a more complex functional form for near-source saturation as a function of the earthquake
magnitude (because this indicates the size of the finite fault), and lower geometric spreading rates
at larger distances due to the increasing contribution of surface waves and significant body wave
reflections (e.g., Moho reflection).
Acceleration, a [g]
V S,30 = 400 m/s
0.2 g
V S,30 = 150 m/s
0 5 10 15 20 25 30 35 40
Time, t [s]
Fig. 4.21 Effect of site conditions on recorded ground motions from the 1989 Loma Prieta (M6.9) earthquake at four nearly adjacent
locations. Other than the variation in 30-m time-averaged shear wave velocity, VS,30 , the recordings all have source-to-site
distances of approximately R = 75 km.
10 0
Spectral acceleration, SA(1 s) [g]
10 –1
10 –2
Data
BJF97
CY14
10 –3
10 2 10 3
30-m time-averaged shear wave velocity, V S,30 [m/s]
Fig. 4.22 Effect of near-surface site conditions (i.e., VS,30 ) on SA(1 s) amplitudes from the NGA-West2 database for larger and smaller
amplitude ground motions. The larger motions are from events with M = 6.8–7.2 and distances R = 0–15 km, and are
depicted in gray. The smaller motions are from events with M = 5.3–5.7 and distances R = 40–60 km, and are depicted in
black. The median empirical predictions from the BJF97 and CY14 models are shown as lines for M = 7.0/R = 5 km and
M = 5.5/R = 50 km, respectively, and a vertical strike-slip fault.
(a) (b)
M=8
M=7
M=6
M=5
10 –2 10 –2
10 –1 10 0 10 1 10 2 10 –1 10 0 10 1 10 2
Source-to-site distance, R [km] Source-to-site distance, R [km]
Fig. 4.23 Scaling of response spectral ordinates with magnitude and source-to-site distance for (a) SA(0.2 s), and (b) SA(1 s). Results are
based on the CY14 model for a vertical strike-slip fault observed at a stiff soil site (VS,30 = 500 m/s).
(associated with different magnitude and source-to-site distance ranges). In addition to the SA(1 s)
values from observations, the median values of the BJF97 and CY14 models are also depicted.
In general, ln SA amplitudes scale linearly with ln VS,30 . The CY14 model departs from this linear
scaling for larger amplitude ground motions, which occur at low VS,30 values during strong shaking
(i.e., for the M = 7.0/R = 5 km scenario). This departure occurs because the CY14 model
considers nonlinear near-surface site response, which is not represented in the BJF97 model
(Equation 4.19).
(a) 10 0 (b) 10 0
R = 10 km R = 50 km
10 –2 10 –2
10 –3 10 –3
M=8
M=7
M=6
M=5
10 –4 10 –4
10 –2 10 –1 10 0 10 1 10 –2 10 –1 10 0 10 1
Period, T [s] Period, T [s]
Fig. 4.24 Scaling of response spectral ordinates as a function of magnitude for source-to-site distances of (a) R = 10 km and
(b) R = 50 km. Results are based on the CY14 model for a vertical strike-slip fault observed at a stiff soil site
(VS,30 = 500 m/s).
effect is most pronounced for small source-to-site distances, and is stronger for SA(0.2 s).
For example, at R < 1 km the median SA(0.2 s) amplitudes are practically insensitive to magnitude
in the range of M = 6–8.
Figure 4.24 provides similar results to Figure 4.23, but instead via plotting response spectral
ordinates for oscillator periods from T = 0.01–10 s, for two different source-to-site distances.
Figure 4.24a further illustrates that the effect of magnitude on SA amplitudes is most pronounced
at long periods, with the SA(T = 10 s) amplitudes spanning a factor of nearly 200 from M = 5–8,
while the SA(T = 0.01 s) amplitudes span a factor of less than 10. This trend is also seen in
Figure 4.24b for R = 50 km, but the difference is less pronounced, illustrating the link between
magnitude and distance saturation.
10 –1
Ztor = 0 km d = 45 °
°
3 km 70
°
6 km 90
10 –2 10 –1
10 –2 10 –1 10 0 10 1 –20 –10 0 10 20
Period, T [s] Distance metric, R x [km]
Spectral acceleration, SA(T) [g]
10 –1 10 –1
D DPP = 0 Z1.0 = 50 m
+1 300 m
-1 800 m
10 –2 10 –2
10 –2 10–1
10 0
10 1
10 –2 10 –1 10 0 10 1
Period, T [s] Period, T [s]
Fig. 4.25 Illustration of the effects of additional factors on predicted ground-motion amplitudes in the CY14 GMM. (a) Depth to
top-of-rupture (Ztor ) effect as a function of vibration period. (b) Hanging-wall effect on SA(1 s) amplitudes as a function of
source-to-site distance, Rx , for three different fault dip angles δ. (c) Rupture directivity effect on SA amplitudes for neutral,
forward, and backward directivity conditions (ΔDPP = 0, +1, −1, respectively). (d) Effect of depth to 1 km/s shear wave
velocity horizon, Z1.0 , on SA amplitudes. Unless noted, the primary predictor variable values used are M = 7.5, Rrup = 20 km,
and VS,30 = 500 m/s, with other distance metrics (e.g., Rjb , Rx ) calculated geometrically from these values.
With an understanding from previous sections on the mathematics of empirical GMMs, their general
features relative to observed ground-motion data, and the variability in the observations that is
considered as apparent variability in the models, we are now in a position to use them for ground-
motion prediction.
Table 4.1. Probability of SA(1 s) > 0.4 g for three source-to-site distances
R [km] μln I M im50 [g] σln I M P [S A(1 s) > 0.4 g]
where Φ () is the standard normal cumulative distribution function, and μln I M , σln I M , and ε are as
defined in Equation 4.16.
Equation 4.23 uses the cumulative distribution function to compute P(I M > x | rup, site), but
sometimes it may be useful to use an alternate formulation incorporating the probability density
function (PDF) for I M. Noting that the cumulative distribution function is equivalent to an integral
of the PDF (Equation A.22), we can also write:
∞
P(I M > x | rup, site) = f I M (u) du (4.24)
x
where f I M (u) is the PDF of I M, given rup and site. Unlike the cumulative distribution function Φ (),
f I M (u) can be written out analytically. Substituting in this PDF gives an integral form, equivalent to
Equation 4.23, that can be evaluated using contemporary programming languages:
∞ 2
1 1 ln u − μln I M
P(I M > x | rup, site) = √ exp − du. (4.25)
σln I M 2π 2 σln I M
x
To connect these equations to a visual display of ground-motion predictions, consider Figure 4.26,
which shows SA(1 s) predictions on stiff soil (VS,30 = 500 m/s) for an M6.5 strike-slip earthquake,
as a function of source-to-site distance, R. Suppose we were interested in the probability of
S A(1 s) > 0.4 g. The median and ± one standard deviation of the BJF97 prediction are plotted
for distances between 1 and 100 km. At distances of R = 3, 10, and 30 km, the PDF of the
predicted distribution of SA(1 s) is also superimposed.21 At those three distances, Equation 4.19
gives predicted means of ln SA(1 s) = −1.041, −1.771, and −2.619, respectively.22 At all three
distances, the standard deviation of ln S A(1 s) is 0.52.
With knowledge of the mean and standard deviation for each scenario, we can use Equation 4.23
to compute the probabilities of exceedance. For example, at a distance of R = 3 km,
ln(0.4) − (−1.041)
P [SA(1 s) > 0.4 g | M = 6.5, R = 3 km] = 1 − Φ = 0.4050. (4.26)
0.52
Table 4.1 provides details of the resulting probabilities for all three source-to-site distances. These
probabilities correspond to the fraction of the corresponding PDFs in Figure 4.26 that are shaded. To
aid visual comparison with Figure 4.26, the BJF97 median prediction, im50 = exp(μln I M ), is also
provided in the table.
21 Because the y-axis is in log-scale, the lognormal distribution appears as a symmetric normal distribution.
22 All of the example calculations will provide answers with more significant figures than should reasonably be used
or reported. This is done to allow reproduction of the example calculations exactly, and because many answers are
intermediate results for later calculations.
10 0
10 –1
SA
10 –2
10 0 10 1 10 2
Source-to-site distance, R [km]
Fig. 4.26 Graphical depiction of the example ground-motion model for a magnitude M6.5 earthquake, and the probability of
SA(1 s) > 0.4 g at several source-to-site distances R annotated via the PDF.
Consider a second related example, which will provide intermediate results for calculations that
follow in Section 6.3.1. We want to predict the ground-motion probability distribution for a site with
VS,30 = 500 m/s that is a distance of R = 10 km from an M6.5 earthquake. The problem is similar
to the previous example, but we now consider only a single source-to-site distance and examine how
the probability of exceedance varies due to changing the threshold value x.
Using the mean and standard deviation values of the BJF97 model in the previous example, the
first and second columns of Table 4.2 provide a range of threshold values, x = 0–0.6 g, in increments
of 0.05 g, and the corresponding exceedance probabilities. As would be expected, the values of
P(I M > x i ) decrease as x increases, and vary in the range of [0, 1].
Further intuition can be gained by considering the difference between exceedance probabilities for
sequential threshold values, which yields a result proportional to the distribution PDF:
P(x i < I M ≤ x i+1 ) values from this equation are shown in the third column of Table 4.2. The largest
P(x i < I M ≤ x i+1 ) values occur near the median value of the distribution, im50 = 0.170 g, as would
be anticipated.
Residual Partitioning
For comparison of predictions against observations it is conventional to recast the predictive
functional form of Equation 4.16 as
ln imi,j = μln I M (rupi , site j ) + Δi,j
(4.28)
Δi,j = δBi + δWi,j
where ln imi,j is the natural logarithm of the observed ground-motion IM from the ith earthquake
(rupi ) at the jth site (site j ); μln I M (rupi , site j ) is the prediction of the mean ln I M value; and Δi,j is
the total residual for the jth site from the ith earthquake. By comparing Equations 4.16 and 4.28 we
can see that σln I M · ε = Δi,j .
The total residual is then further partitioned into δBi , the between-event residual for the ith
earthquake; and δWi,j , the within-event residual of the observation at the jth site from the
ith earthquake. Denoting the standard deviations of δBi and δWi,j as τ and φ, respectively, they
are related to the standard deviation of the total residual, σln I M (as defined in Equation 4.16) by
σln
2
IM = τ + φ .
2 2
(4.29)
The reason for the “partitioning” of the total residual into between- and within-event terms is that the
observed ground motions from a single earthquake i are not independent observations (they are all
causally related to the same earthquake).23 In order to achieve statistically independent data from the
direct observations, the between-event residual for the specific IM and event i is determined, and the
within-event residuals are then obtained by rearranging Equation 4.28 as
δWi,j = Δi,j − δBi . (4.30)
23 Chapter 8 provides further extensions of the form given by Equation 4.28 in which other effects are considered.
(a) (b)
10 0 10 0
10 –1 10 –1
10 –2 10 –2
0 2
10 10 10 0 10 2
Source-to-site distance, R [km] Source-to-site distance, R [km]
Fig. 4.27 SA(1 s) values of observed ground motions from the 1989 Loma Prieta earthquake compared with the (a) BJF97 model and (b)
including adjustment to the mean and standard deviation resulting from the between-event residual.
24 Such as the 1999 Chi-Chi earthquake (Figure 1.1), whose contribution to ground-motion databases is visible in Figures 4.11
and 4.13.
To compare observations and prediction in Figure 4.27 more robustly, we partition the total
residuals into their corresponding between- and within-event terms (via Equation 4.28). To do
so we use the between- and within-event standard deviations for the BJF97 model, which are
τ = 0.214 and φ = 0.474, respectively (Boore et al., 1997). Because both τ and φ are
constants, Equation B.42 (with ni = 82) can be used to determine the between-event residual
as δBi = 0.245. Equation B.42 shows that for an increasing number of observations for a given
earthquake, the between-event residual approaches the arithmetic mean of the total residuals (with a
5% difference between 0.245 and the arithmetic mean of 0.26 in this example).
Note that these ni = 82 observed ground motions yield only a single value for the between-
event residual, δBi = 0.245. Thus, ground motions from many earthquakes are required to estimate
δBi values and ascertain whether a predictive model is consistent with observed event-to-event
variability, as well as examine biases with parameters such as M and ZT OR (see Section 12.5.2).
With the between-event residual now computed for the specific earthquake, Equation 4.28 can be
reordered as:
ln imi,j = μln I M (rupi,j , site j ) + δBi + δWi,j . (4.31)
μln I M and δBi are now grouped together as known terms in what is commonly referred to as an
“event-adjusted” mean, and δWi,j represents the residual between the observations and the event-
adjusted mean.
Figure 4.27b illustrates the comparison between the observed SA(1 s) amplitudes and the event-
adjusted prediction using the BJF97 model. Relative to the (unadjusted) BJF97 model (Figure 4.27a),
the adjusted model has a median which is larger by the factor exp(δBi ) = 1.278, and because
the event-term is a constant (rather than a random variable with standard deviation τ) the standard
deviation of the event-adjusted prediction is φ, the standard deviation of δWi,j (which is 9% smaller
than σln SA(1 s) = 0.52). The event-adjusted model shown in Figure 4.27b provides a visibly good
representation of the observed data. The mean of the within-event residuals is 0.0147, and the
standard deviation is 0.477. Both of these values are very similar to the assumption that the within-
event residual is a random variable with zero mean and standard deviation of φ = 0.47 and can be
formally rejected as different using conventional hypothesis testing (Ang and Tang, 2007).
Despite the distribution of the observed within-event residuals being consistent with the model, we
can also explore the dependence of these observed residuals with other variables. Such explorations
are instructive because the BJF97 model assumes that all the parametric dependencies are accounted
for in the mean model and that the within-event residual is a random variable without any parameter
dependence. Figure 4.28 illustrates the dependence of the within-event residuals as a function of
source-to-site distance, R, and 30-m time-averaged shear-wave velocity, VS,30 . In both figures, the
residuals cluster relatively evenly about the zero residual lines, but display dependence on the x-axis
variable, as indicated by the least-squares regression fit. The slope of the lines25 suggests that the
functional dependence with R and VS,30 leads to a relative overestimation of observations at smaller
parameter values and an underestimation at larger parameter values.
25 P-values of 0.0066 and 0.0049, respectively, imply that the null hypothesis of zero slope can be rejected at the 99%
confidence level.
1 1
0 0
–0.5 –0.5
–1 –1
–1.5 –1.5
10 –1 10 0 10 1 10 2 100 200 500 1000
Source-to-site distance, R [km] 30-m time-averaged shear
wave velocity, V S,30 [m/s]
Fig. 4.28 Within-event residuals of observed ground motions in the 1989 Loma Prieta earthquake with respect to the BJF97 model as a
function of (a) R and (b) VS,30 .
Primary conclusions from this comparison of the BJF97 prediction with the Loma Prieta S A(1 s)
amplitudes are as follows:
1. The observations are, on average, larger than the mean prediction, with a between-event residual
of δBi = 0.26. Given that τ = 0.214, δBi is 1.2 standard deviations from the zero mean of the
between-event random variable.
2. The within-event residuals, δWi,j , are consistent with the theoretical distribution of mean zero and
standard deviation φ.
3. There are statistically significant dependencies of the within-event residuals with R and VS,30 .
4. Whether these dependencies in δWi,j reflect a flaw in the model for use in general conditions, or
are specific to this earthquake event and regional setting, would require additional observations to
confirm.
It is important to point out that the biases and imprecisions evident through the above comparison
are the result of the simplistic BJF97 model adopted, and are often less pronounced in contemporary
models (such as the CY14 model presented previously). Larger datasets are also generally required
to identify systematic biases in contemporary models, so the above example and discussion should
be viewed as simply illustrating relevant considerations (discussed further in Section 12.5.2).
is not correct to simply compare the 8 out of 82 ratio with the 2.3% prediction. Once the event-
adjustment is made, then Figure 4.27b illustrates that only five ground motions are above the +2σ
prediction.
While the model treats all variation about the mean prediction as purely random, there are
ultimately physical reasons for such observations departing from the average by such an amount.
Careful examination can help identify why such observations occur, leading to the discovery of
important physical phenomena (see Chapter 5). Alternatively, there may be effects associated with
the particular earthquake source, path, or site that deviate from the “global” mean and that can be
incorporated into non-ergodic models (see Chapter 8). In either case, the study of observational data
is critical to advancing our understanding of strong ground motions.
Empirical GMMs, described in the form of Equation 4.16, provide a prediction as a function of
their mean and standard deviation. The standard deviation represents the apparent variability in
the model prediction due to the complex phenomenon and inherent simplifications in any model.
In addition to the representation of apparent variability, we have another form of uncertainty as
a result of knowledge limitations, epistemic uncertainty (Section 1.7), which is uncertainty that the
adopted empirical GMM adequately represents reality. In this section, we consider the manifestations
of such uncertainty, its root sources, and methods to account for it. Section 6.8 provides additional
discussion on epistemic uncertainties in the context of PSHA and necessitates the issues raised in
this section.
(a) (b)
0 0
10 10
M=6 M = 7.5
lnSA(1 s) [g]
–1 –1
10 10
Median, e
CY14
BSSA14
ASK14
CB14
–2 –2
10 10
10 –1 10 0 10 1 10 2 10 –1 10 0 10 1 10 2
lnSA
(c) (d)
0.8 0.8
M=6 M = 7.5
Standard deviation,
0.75 0.75
0.7 0.7
0.65 0.65
0.6 0.6
10 –1 10 0 10 1 10 2 10 –1 10 0 10 1 10 2
Source-to-site distance, R [km] Source-to-site distance, R [km]
Fig. 4.29 Illustration of epistemic uncertainty in ground-motion prediction via four plausible models for active shallow crustal
earthquakes from the NGA-West2 project. Panels (a) and (b) are median SA(1 s) predictions, while (c) and (d) standard
deviations for two different magnitude values.
calculated exceedance probabilities. For example, the probabilities of S A(1 s) > 1.0 g for the R =
0.1 km scenario range from 0.17 to 0.37, whereas the probabilities of S A(1 s) > 0.3 g for the R = 20
km scenario range from 0.26 to 0.30.
This example illustrates that model uncertainty often varies with the scenario being considered.
Observational data are more abundant for some scenarios than others. Hence, we should expect
the range of model predictions to vary depending on the amount of data constraining the models.
Figure 4.13 illustrated the sparsity of ground-motion data in the magnitude-distance parameter space
around M = 7.5, R = 0.1 km, which could explain the large variation in model predictions for this
scenario (Figure 4.30a). However, this same logic does not explain why there is little variation in
the four model predictions for the R = 20 km scenario relative to the R = 100 km scenario, since
Figure 4.13 illustrates that there is appreciably more data at larger source-to-site distances. This same
issue is present in the context of the larger difference between median predictions for the M6 event
at R = 20 km versus M7.5 at R = 20 km, with more observational data for the former than the
latter scenarios. The consideration of alternative models alone is not sufficient to capture epistemic
uncertainty.
0 0 0
0 1 2 3 0 0.5 1 0 0.1 0.2
Spectral acceleration, SA(1 s) [g]
Fig. 4.30 Probability density functions (PDFs) from model predictions in Figure 4.29 at three source-to-site distances.
Intensity measure, IM
Intensity measure, IM
(a) Ground motion dataset (b) Model formulation
Variable X Variable X
Intensity measure, IM
Variable X
Fig. 4.31 Conceptual illustration of sources of epistemic uncertainty in empirical ground-motion modeling through the dependence of IM
prediction with respect to a generic variable X: (a) ground-motion dataset selection, (b) alternative model parametric forms,
and (c) parametric uncertainty from regression analysis. In (a) and (b), the black and gray lines illustrate two alternative
models. In (c), the dashed lines indicate a range of possible models based on parametric uncertainty.
Ground-Motion Dataset
Ground-motion time series and their metadata are of variable quality and relevance to a given
application area, which leads to subjective decisions regarding the specific data that should be used
for empirical GMM development. Figure 4.31a illustrates the effect of different dataset inclusion
criteria. The larger data subset indicates no apparent trend between I M and X, whereas a smaller
subset indicates that I M increases with X. Thus, the choice of the subset will ultimately influence
the model regression results.
Factors that may determine whether a ground motion is retained for model development include
the quality of the ground-motion time series, the “free-field” nature of the instrument location,
availability and quality of metadata (e.g., M, R, VS,30 ), maximum allowable source-to-site distance
(generally a function of magnitude), classification of an event as a “mainshock” or “aftershock,”
the minimum allowable number of records from a specific event, and exclusion of events that are
“overrepresented” or considered as outliers. The four models depicted in Figures 4.29 and 4.30
illustrate the differences that result from alternative views on the above considerations. All
four models started with the same database (Ancheta et al., 2014) of 21,366 potential ground
motions from 600 earthquakes. Through exercising different decisions with respect to the above
factors, they ended with datasets having 12,244–15,750 ground motions from 300–326 earthquakes
(Abrahamson et al., 2014; Boore et al., 2014; Campbell and Bozorgnia, 2014; Chiou and Youngs,
2014; Gregor et al., 2014). Decisions on retaining data are particularly challenging where data of
marginal quality exist for conditions that are poorly covered in the observational dataset, as these
strongly influence decisions on model functional form and consequent regression parameters, as
discussed next.
Model Formulation
Figure 4.31b illustrates the effect of different model formulations (i.e., functional forms) on the
prediction of a ground-motion IM for a fixed dataset. While physics-based insights are considered in
the interpretation of ground-motion observations (such as described in Section 4.5), there is a range
of functional forms that can fit the data to a similar degree. Figure 4.29 provides an example of the
effect of different functional forms for four alternative models. The effect of the functional forms
is particularly evident in how the medians and standard deviations vary with source-to-site distance
and magnitude.
The differences between alternative formulations become particularly significant as a model is
used for predictions poorly constrained by data or extrapolating beyond the data.26 Examples of
poorly constrained phenomena include large-magnitude scaling, hanging-wall effects, directivity,
sedimentary basin effects, and nonlinear site response. Because sufficient observational data cannot
be obtained quickly, insights from theoretical considerations and numerical simulations can guide
the nature of functional forms. For example, the following theory or simulations have been used
by NGA-West2 models: Chiou et al. (2010) (magnitude scaling), Donahue and Abrahamson (2014)
(hanging-wall), Spudich and Chiou (2008) (directivity), Day et al. (2008) (sedimentary basin), and
Seyhan et al. (2014) and Kamai et al. (2014) (nonlinear site response).
The adopted functional form and dataset also influence the extent to which an empirical GMM is
generic for use globally or tailored for region- or site-specific application. As discussed in Chapter 8,
contemporary empirical GMMs seek to incorporate additional data and model components that
reflect the specific conditions at the site of interest. Epistemic uncertainty will exist in the application
of a model to a specific site (Campbell, 2003).
26 The schematic example in Figure 4.31b is oversimplified in being one-dimensional. In practice, models have many input
parameters and a high-dimension parameter space, meaning that the extrapolation problem is pervasive.
Meta-Models
The above two approaches consider that a collection of existing models represents model uncertainty,
but does not directly focus on representing the uncertainty in ground-motion exceedance probability.
Instead of weighting models, suppose we focus on this overall objective of characterizing the
exceedance probability uncertainty. We can represent the uncertainty in both the model mean and
aleatory standard deviation through a meta-model, also referred to as a “backbone” model (Atkinson
et al., 2014). A meta-model, or backbone model, can be thought of as a new model that can be
configured to directly represent the uncertainty in the probability of exceedance. Its functional form
can be adopted or adapted from an existing model (Bommer and Stafford, 2020), or can be defined
to represent the scaling implied by an ensemble of models (Abrahamson et al., 2019). The benefit of
the meta-model approach is that it allows for consideration of all three primary sources of modeling
uncertainty, not simply the variation among existing models. A practical example of using meta-
models for handling GMM uncertainty can be found in Abrahamson et al. (2019).
Meta-modeling approaches are particularly useful in cases where we are trying to tailor predictions
to a target site. As empirical GMMs are often generic (i.e., they contain few site-specific features),
adjustments are needed to make them more region- and site-specific in order to obtain predictions
with greater accuracy and precision (Bommer and Stafford, 2020). This is more easily achieved with
a single meta-model, rather than a collection of alternative existing models. Further details on site-
specific modeling are provided in Chapter 8.
Empirical GMMs have been developed for over 50 years to predict earthquake-induced ground
motions. Despite the increasing refinement of empirical GMMs over time, they are still a highly
simplified representation of reality. Relative to a more comprehensive attempt to model the physics
of ground motions (Chapter 5), notable simplifications include:
1. The earthquake rupture is represented by simply the earthquake magnitude, and possibly its style
of faulting and depth. Explicit consideration is typically not given to the location of the hypocenter
(except when directivity models are included), or the spatial and temporal evolution of slip on the
fault.
2. The effects of wave propagation are represented simply via one or more source-to-site distance
metrics. No explicit consideration is given to the stratigraphy of the Earth’s crust in the region
considered. The geometry of sedimentary basins can be particularly important, but basin effects
are considered only by a simplified basin depth parameter (if at all).
3. The effect of surficial (local) site response is often considered simply in terms of the 30-m time-
averaged shear-wave velocity of the site, VS,30 . No consideration is given to the particular soil
stratigraphy in this top 30 m, or depths greater than this. The particular stress-strain properties of
the soils, which influence nonlinear response, are also not considered.
The consequence of the above simplifications (among others) is both a lack of precision in ground-
motion prediction for a specific location and large apparent aleatory variability. Strasser et al. (2009)
illustrate that over the approximately 40-year period until 2009, there has been almost no reduction in
the apparent aleatory variability in empirical GMMs. This trend can also be reflected in the outcomes
from the suite of models over the past decade, including the NGA-West2 models (Figure 4.29). This
lack of apparent aleatory variability reduction has occurred despite the massive growth in recorded
strong ground motions (Figure 4.11) and advancements in the scientific understanding of earthquake-
induced ground-motion phenomena.
However, stable apparent aleatory variability does not imply a lack of progress in empirical
GMM development. As noted in Section 4.7.2, significant progress has been made using theoretical
or numerical simulations to constrain empirical GMMs for scenarios that are poorly represented
by empirical data. However, precisely because of a lack of empirical data, such advancements do
not have any practical impact on the resulting apparent aleatory variability of resulting models. In
emphasizing the importance of this progression, Abrahamson and Silva (2008) refer to the fact that
contemporary empirical GMM development is not merely “curve fitting” with observational data.
Instead, it is a “model-building” process that synthesizes such data with seismological and geotech-
nical information, part of the transition toward the use of physics-based simulations of ground motion
(Chapter 5).
The simplifications of an inherently complex problem and the use of global (rather than site-
specific) datasets are the principal reasons for the large apparent aleatory variability in the empirical
GMMs. Such models also have a lack of precision in the predicted median ground-motion intensity
at a given location as a result of being derived by historical data recorded from many earthquake
events at an even larger number of locations. Each earthquake, and each recording location, has its
own specific (source, path, and site) “features” that deviate from the group “average.” Those features
are repeatable effects that are not explicitly modeled with this approach. The explicit consideration of
such repeatable effects and the development of “site-specific” empirical GMMs is discussed further
in Chapter 8.
The use of physics-based ground-motion simulation methods (Chapter 5) is another alternative
with the potential to overcome many of the simplifications of empirical GMMs presented in this
chapter and enable predictions with greater accuracy and precision. Achieving this, however, requires
that the simulation input models and parameters be probabilistically characterized. Otherwise, the
physics-based models will simply transfer the empirical GMMs’ apparent aleatory variability into
equivalent parametric uncertainty in the simulation model parameters (equivalently described in
Figure 2.16 for seismic source modeling). There would be no benefits to such an exercise. With
careful characterization, physics-based models do hold great promise, however, both for informing
empirical GMMs and as predictive models on their own.
Exercises
4.1 Fourier spectra (F AS) and response spectra (SA) are intensity measures that both reflect
ground-motion amplitude as a function of frequency. Compare and contrast these two
measures, in particular, the situations in which one is perceived as advantageous over the other
from theoretical and/or practical perspectives. Select a ground motion from a public database
and compute F AS and SA values to support your answer.
4.2 Define the bracketed, DB , uniform, DU , and significant, DS , durations of a ground-motion
time series. As the absolute acceleration threshold for DB increases from zero, how would you
expect the values of bracketed and uniform duration to vary? For an earthquake of a particular
magnitude, explain how you would expect DB and DS to vary with distance?
4.3 Not all observed ground motions are suitable for the reliable determination of some intensity
measures (IMs). Section 4.3.4 discussed this idea in the context of the maximum period for
response spectra that are suitable. How does this concept apply for the determination of all the
common IMs discussed in Section 4.2?
4.4 Explain why empirical ground-motion databases will always have a relative paucity of
observational data from large-magnitude earthquakes at small source-to-site distances.
4.5 Name at least three factors that limit the number of ground motions in an empirical database.
4.6 An empirical ground-motion model for spectral acceleration, PG A [g], has a prediction defined
by the function:
μln PG A = c0 + c1 M + c2 ln(Rrup + c3 )
4.7 Explain why ground-motion models predict spectral amplitudes that tend to saturate at large
magnitudes, and why the degree of this saturation varies with spectral period.
4.8 When reviewing a seismic hazard report, you note that the authors have used a ground-motion
model for predicting the spectral acceleration, SA, that is defined as:
μln S A = 1.0 + 0.9M − 1.1 ln R jb + 0.4 ln VS,30 .
The equation for the total variance (reflecting the aleatory variability) is given as σ2 = τ 2 +
φ2 , with the between-event standard deviation being τ = 0.5 and the within-event standard
deviation being φ = 0.3.
Note that M is the moment magnitude, R jb is the Joyner–Boore distance, and VS,30 is the
average shear-wave velocity measured over the uppermost 30 m of the site.
There are at least four problems with this equation. Identify these problems and explain what
is wrong in each case.
4.9 The equation used for predicting PG A [g] in a particular region can be defined as:
μlog10 (PG A) = −1.34 + 0.77M − 0.074M 2 + (−3.16 + 0.32M) log10 R2jb + 7.72
with
where M is the moment magnitude, R jb is the closest distance to the surface projection of the
rupture.