Professional Documents
Culture Documents
Morten Jakobsen (University of Bergen) and Ru-Shan Wu (University of California, Santa Cruz)
Researchers in the mathematical physics community have re- 1998; Pratt, 1999; Sirgue and Pratt, 2004). The method is a
cently proposed a conceptually new method for solving nonlin- cascade inversion starting from the lowest frequency available
ear inverse scattering problems which is inspired by the theory for the recovery of the largest scale possible. Recent devel-
of nonlocality of physical interactions. The conceptually new opment of low-frequency land source (down to 1.5 Hz) has
method, which may be referred to as the T-matrix completion allowed multi-scale FWI to use 1-D smooth starting model
method, is very interesting since it is not based on linearization (Baeten et al., 2013). They showed that the lowest frequency
at any stage. The TMC method is based on the observation that band (1.5-2.0 Hz) is crucial in recovering the correct long-
the scattered wavefield that one can observe at the surface of wavelength background velocity structure. However, in gen-
the Earth is linearly related with the so-called T-matrix, which, eral, the ultra-low frequency sources are very expensive and
in turn, is related with the unknown scattering potential by an usually not available. Therefore, a good starting model is still
integral equation of the Lippmann-Schwinger type, indepen- required for conventional FWI.
dent of the source-receiver configuration. The main physical
requirements are that the T-matrix should be data-compatible Shin and Cha (2009) developed a method for waveform inver-
and the scattering potential operator should be dominantly lo- sion in the Laplace-Fourier domain that can be used indepen-
cal; although a non-local scattering potential operator is al- dently or to generate a satisfactory starting model for conven-
lowed in the intermediate iterations. The convergence radius tional FWI. There are also some methods which operates with
of the original TMC method is seriously restricted by its use scattering angle filters or combinations of waveform inversion
of single-frequency scattering data only. In this study, we have with travel time inversion wave equation migration analysis or
therefore developed a multi-scale version of the TMC method tomography (see Luo and Wu, 2014). In addition, the seismic
with much better convergence properties, required for seismic envelope inversion method (SEI) has been introduced as a tool
FWI in the absence of a good starting model. Essentially, we for nonlinearly separating the response of large-scale structure
use a simplified real-space version of the original TMC method from the fine-scale structures Luo and Wu, 2014). However, it
within two nested loops over angular frequency and the target is not clear if the hybrid SEI + FWI will always converge in the
degree of non-locality. The loop over angular frequency pro- presence of strong contrasts (e.g., related to salt structures).
vides a first layer of multi-scale regularization similar to the Seismic FWI is essentially a nonlinear inverse scattering prob-
standard sequential frequency inversion method. The second lem similar to those in physics (Pike and Sabatier, 2002; We-
loop over the target degree of non-locality provides a second glein et al., 2003; Weglein, 2013; Jakobsen and Ursin, 2015;
layer of multi-scale regularization which is particularly impor- Wu and Zheng, 2014; Wu et al., 2015a,b). A concept from
tant in the absence of a good starting model and/or ultra-low quantum field theory called renormalization has recently been
frequencies. A formal proof of convergence is still lacking, used to derive convergent scattering series (Lesage et al., 2014;
but the results we have obtained in a series of numerical ex- Jakobsen and Wu, 2016a) and to justify and develop the en-
periments based on synthetic data for the strongly scattering velop inversion method (see Wu et al., 2014; Wu and Luo,
SEG/EAGE salt model are very encouraging. 2015). The so-called renormalization group has also been used
to obtain a new interpretation of the well-known Born and Ry-
tov approximations (Kirkinis, 2008) that are often used for the
INTRODUCTION calculation of sensitivities in traditional FWI. Multi-scale as-
pects of the present work is related with renormalization.
The full waveform inversion (FWI) method often produces im-
ages of higher quality and resolution than conventional seismic Inspired by the theory of nonlocality of physical interactions,
amplitude and/or traveltime analysis. However, there are still Levinson and Markel (2016a,b) proposed a conceptually new
several challenges related with this comprensive and emerging T-matrix completion (TMC) method for solving nonlinear in-
imaging method; including it’s huge computational cost and verse scattering problems. In contrast with established meth-
sensitivity to the starting model (Virieux and Operto, 2009). ods for FWI and nonlinear inverse scattering (Virieux and Op-
Traditional FWI is often implemented with a gradient-based erto, 2009; Jakobsen and Ursin, 2015), the TMC method is not
local optimization method (e.g. Pratt and Worthington, 1990; based on linearization at any stage. Since the more direct TMC
Pratt; 1999; Sirgue and Pratt, 2004). A faster convergence rate method does not involve any objective functions, the problem
may be achieved by using Newton-based methods (Pratt et al., with convergence to local minimae have aparently been elim-
1998; Pratt, 1999), at the expence of increasing the compu- inated. However, since the original TMC method is based on
tational cost. Since conventional FWI is based on the use of the use of single frequency data only, there are convergence
local optimization methods for solving a nonlinear optimiza- problems in strongly scattering media. Based on the work of
tion problem, it should not come as a surprise that there are Jakobsen and Wu (2016c), we have developed a multi-scale
often convergence problems associated with the application of version of the TMC method, which seem to converge even in
FWI to strongly scattering media (e.g., sub-salt structures). the absence of a good starting model or ultra-low frequencies.
Green’s functions (Jakobsen and Ursin, 2015). By decompos- describe a simplified version of the single-frequency T-matrix
ing the actual medium with velocity field c(x) into an arbitrary completion method of Levinson and Markel (2016a,b). If the
background medium with velocity field c(0) (x) and a corre- scattered wavefield δ GRS is assumed known (observed) then a
sponding perturbation, one can show that the corresponding first estimate of the T-matrix can be obtained from (Levinson
Green’s functions G(x, x0 ) and G(0) (x, x0 ) for the actual and and Markel, 2016a,b)
reference media are related by a volume integral equation of
(0) (0)
the Lippmann-Schwinger type (Jakobsen, 2012). After spatial T (1) = GV R δ GRS GSV (6)
discretization, this integral equation can be represented by two
coupled matrix equation (Jakobsen and Ursin, 2015): where
(0) (r) (0) (r)
GV R ≡ (GRV )+ , GSV ≡ (GV S )+ (7)
(0) (0)
GRS = GRS + GRV V GV S (1) and the +-symbol denotes the More-Penrose pseudoinverse
(Levinson and Markel, 2016a,b). Equation (6) can be regarded
(0) (0)
GV S = GV S + GVV V GV S (2) as a transformation of the scattered wavefield data from the
(0) (0) (0) data domain to the image domain (see Figure 1). In our direct
Here, GV S , GVV and GRV are the so-called source-volume, nonlinear inversion method, the data are used only one time;
volume-volume and volume-receiver Green’s function matri- that is, to calculate the experimental T-matrix from the scat-
ces, and V is the scattering potential matrix (associated with tered wavefield residuals using equation (6). Given the exper-
perturbations in the squared slowness). Following Jakobsen imental T-matrix, one can reconstruct the unknown scattering
(2012), we now define the T -matrix by potential using an iterative process that involves the interme-
(0) diate use of a non-local scattering potential operator. The ex-
V GV S = T GV S . (3)
perimental T-matrix can be used to find a first estimate of the
The T-matrix accounts for all nonlinear effects of multiple scat- scattering potential, denoted by V (1) . By inserting the experi-
tering (Jakobsen, 2012; Jakobsen and Ursin, 2015). From mental T-matrix into the exact relation (4) between the scatter-
equations (2) and (3) and the fact that the background medium ing potential V and the T-matrix, one obtains a first estimate of
arbitrary, we obtain (Jakobsen and Ursin, 2015) the unknown scattering potential:
(0)
T = V +V GVV T.
(0)
(4) V (1) = (I − T (1) GVV )−1 T (1) (8)
It follows from equations (1) and (3) that If the first estimate T (1) of the T-matrix was exactly equal to
the true T-matrix for the unknown true model then first es-
(0) (0) (0)
δ GRS ≡ GRS − GRS = GRV T GV S . (5) timate V (1) would exactly equal to the true scattering poten-
tialy which is perfectly diagonal. However, the first estimate
Since the observable scattered wavefield is given by the prod- of the T-matrix will generally be very different from the true
uct of δ GRS and the source wavelet (Jakobsen and Ursin, 2015), T-matrix, so it is clear that V (1) will generally be nonlocal. The
we regard δ GRS as data when the source wavelet is known. use of a nonlocal scattering potential in the intermediate steps
Equations (5) and (4) suggest that the nonlinear inverse scat- of an iterative algorithm is satisfactory from a mathematical
tering problem can potentially be solved in two steps: First point of view. However, our inversion algorithm should out-
one performes a linear inversion of δ GRS for the T-matrix. (2) put a local scattering potential. The local scattering potential
Then one solves equation (4) for the scattering potential under model corresponding to V (1) is given by
the assumption that the T-matrix is known or determined via
an experiment (See Figure 1). D(1) = diag(V (1) ). (9)
|V (1) − D(1) |
η (1) = . (10)
|V (1) |
tential. Thus, the second estimate of the T-matrix is given by each iteration within the two nested loops over ω and η. The
T (2) = (I − D(1) G(0) )−1V (1) (11) outer loop over ω provide a first layer of regularization similar
to the standard sequential frequency inversion method of Pratt
The first estimate of the T-matrix was clearly compatible with
(1999). The inner loop over η represents an additional layer of
the scattered wavefield data, since it was obtained by using the
regularization, which is particularly important in the absence
Moore-Penrose inverse directly in conjunction with the data
of a good starting model and/or ultra-low frequencies. In the
equation. However, the second estimate of the T-matrix will
inner loop over η, the target degree of non-locality ηtarget is
generally not be compatible with the observed data. Therefore,
gradually decreased from a relatively high value (correspond-
one needs to add a correction term ∆T (2) to the second estimate
ing to a very non-local scattering potential) to a much smaller
of the T-matrix in order to be consistent with the data;
value (corresponding to a dominantly local scattering poten-
T (2) → T̃ (2) = T (2) + ∆T (2) (12) tial). By allowing for a non-local scattering potential in the
intermediate iterations, we effectively increase the numbers of
Data compatibility implies that
freedom, so that we avoid cycle-skipping problems, even when
(0)
δ GRS = GRV T̃ (2) GV S (13) the starting frequency is so high that the standard sequential
frequency inversion method is garanteed to fail.
If one multiplies the above equation with the Moore-Penrose
(0) (0)
invers matrices GV R and GSV from the left and right, respec-
tively; then one can verify that the correction term is given by NUMERICAL EXAMPLE
(2) (1) (0) (0)
∆T =T − GV RRV ∆T (2) GV SSV (14)
We consider a resampled SEG/EAGE salt model (2D) which is
where 2088 m wide and 432 m deep. We assume 174 sources and 174
(0) (0) (0) (0) (0) (0) receivers uniformly located along a single line at the surface.
GV RRV ≡ GV R GRV , GV SSV ≡ GV S GSV (15)
Since the grid size is 24 m in each direction, there are 87 and
are N × N dimensional matrices, where N is the number of 18 grid blocks in the horizontal and vertical directions, respec-
pixel’s in the seismic image. tively. We assume a Ricker wavelet with a central frequency
equal to 7.5. We employ the multi-scale TMC method to a set
The kth estimate of T and V
of synthetic frequency domain waveform data without noise.
Following Levinson and Markel (2016), we consider the case In Figure 3, one can see that the inverted model is converging
when the iterations start from an initial guess. We first set k = 1 towards the true model.
and use equation (6) to calculate T (k) = T (1) . We then estimate
the kth approximation to the scattering potential (Step 1):
CONCLUDING REMARKS
(0)
V (k) = T (k) (I + GVV T (k) )−1 .
We have developed a conceptually new method for FWI in the
If V (k) is not sufficiently local/diagonal, we regard it as un-
absence of a good starting model and/or ultra-low frequencies,
physical and extract it’s diagonal (Step 2):
which is based on the use of a real-space T-matrix completion
D(k) = diag(V (k) ). algorithm within two nested (multi-scale regularization) loops
over the angular frequency and degree of non-locality of phys-
If the reconstructed scattering potential is dominantly local, in
ical interaction. A formal proof of convergence is still lacking,
the sense that η (k) ≡ |V (k) − D(k) |/|V (k) | << ηtarget , then we
but the new method has given encouraging results in numerical
exit the iterative loop. Otherwise, we proceed to (Step 3):
experiments based on synthetic data. The computational cost
0 (0)
T (k) = (I −V (k) GVV )−1V (k) . of our new multi-scale TMC method is relatively high, but it
may in principle be significantly reduced via the use of domain
Unlike T (k) , the new estimate Tk0 is no longer data compati- decomposition methods (e.g., Jakobsen and Wu, 2016b; Wang
ble. Therefore, we add a correction to make the T-matrix data et al., 2016) and/or lowest frequency inversion on a courser
compatible and advance the iterative index k by one (Step 4): grid for generation of starting models for conventional FWI.
0 0
(0)
T (k+1) = T (1) + T (k) − GV RRV T (k) GV SSV .
(0) The theoretical work presented here can in principle be gen-
eralized to anisotropic elastic media (see Jakobsen and Ursin,
Finally, we return to Step 1. The above workflow, which is 2015; Jakobsen et al., 2015).
illustrated in Figure 1, represents a simplified real space rep-
resentation of the original TMC algorithm of Levinson and Acknowledgments The work of Morten Jakobsen is associ-
Markel (2016a,b). We have found that the original TMC method ated with Petromaks 2 project 228357 funded by the Norwe-
can be applied to seismic FWI if and only if it is supplemented gian Research Council. Ru-Shan Wu acknowledge support
by some kind of multi-scale regularization. from the sponsors of the WTOPI consortium at the UCSC.
Figure 2: Workflow for single-frequency T-matrix completion. The multi-scale TMC algorithm was designed by locating the
single-frequency TMC within two nested loops over frequency and the degree of non-locality of the physical interaction. The exact
formal expressions for the T and V matrices can be replaced by direct and inverse cluster expansions or thin-slab propagators.
Figure 3: Numerical example of multi-scale TMC. The left column shows the inversion results we have obtained at 3 Hz by
gradually searching for an increasingly local scattering potential model. The right column shows the results of a sub-sequent
sequential frequency inversion. The starting model was laterally homogeneous and involved a linear increase of the velocity with
increasing depth from 2000 m/s to 4500 m/s; that is, very different from the true model.
REFERENCES
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Baeten, G., J. W. de Maag, R. E. Plessix, M. Klaassen, T. Qureshi, M. Kleemeyer, F. ten Kroode, and R.
Zhang, 2013, The use of the low frequencies in a full waveform inversion and impedance
inversion land seismic case study: Geophysical Prospecting, 61, 701–711,
http://doi.org/10.1111/1365-2478.12010.
Bunks, C., F. M. Salech, S. Zaleski, and G. Chavent, 1995, Multiscale seismic waveform inversion:
Geophysics, 60, 1457–1473, https://doi.org/10.1190/1.1443880.
Jakobsen, M., 2012, T-matrix approach to seismic forward modelling in the acoustic approximation:
Studia Geophysica et Geodaetica, 56, 1–20, http://doi.org/10.1007/s11200-010-9081-2.
Jakobsen, M., and B. Ursin, 2015, Full waveform inversion in the frequency domain using direct iterative
T-matrix methods: Journal of Geophysics and Engineering, 12, 400–418,
https://doi.org/10.1088/1742-2132/12/3/400.
Jakobsen, M., I. Pilskog, and M. Lopez, 2015, Generalized T-matrix approach to seismic modeling in
fractured reservoirs and related anisotropic systems: 77th Annual International Conference and
Exhibition, EAGE, Extended Abstracts, http://doi.org/10.3997/2214-4609.201412933.
Jakobsen M., and R. S. Wu, 2016a, Renormalized scattering series for frequency-domain waveform
modelling of strong velocity contrasts: Geophysical Journal International, 206, 880–899,
https://doi.org/10.1093/gji/ggw169.
Jakobsen, M., and R. S. Wu, 2016b, Domain decomposition method for efficient waveform inversion in
strongly scattering media: 86th Annual International Meeting, SEG, Expanded Abstracts, 1395–
1399, https://doi.org/10.1190/segam2016-13951062.1.
Jakobsen, M., and R. S. Wu, 2016c, Direct nonlinear inversion by multi-frequency T-matrix completion:
AGU Annual Fall meeting, Abstract and oral presentation.
Kirkinis, E., 2008, Renormalization group interpretation of the Born and Rytov approximations: Journal
of the Optical Society of America A, 25, 2499–2508, https://doi.org/10.1364/JOSAA.25.002499.
Lesage, A. C., J. Yao, F. Hussain, and D. J. Kouri, 2014, Multi-dimensional inverse scattering series
using the Volterra renormalization of the Lippmann-Schwinger equation: 84th Annual
International Meeting, SEG, Expanded Abstracts, 3118–3122,
https://doi.org/10.1190/segam2014-1349.1.
Levinson, H. W., and V. A. Markel, 2016a, Solution to the inverse scattering problem by T-matrix
completion I. Theory: Physical Review E, 94, 043317,
https://doi.org/10.1103/PhysRevE.94.043318.
Levinson, H. W., and V. A. Markel, 2016b, Solution to the inverse scattering problem by T-matrix
completion. II. Simulations: Physical Review E, 94, 043318,
https://doi.org/10.1103/PhysRevE.94.043318.
Luo, J., and R. S. Wu, 2014, Nonlinear scale separation and misfit configuration of envelope inversion:
84th Annual International Meeting, SEG, Expanded Abstracts, 1216–1221,
https://doi.org/10.1190/segam2014-0830.1.
Pike, R., and P. Sabatier, 2002, Scattering and inverse scattering in pure and applied science: Academic
Press.
Pratt, R. G., and M. H. Worthington, 1990, Inverse-theory applied to multisource cross-hole tomography:
Acoustic wave equation method: Geophysical Prospecting, 38, 287310,
https://doi.org/10.1111/j.1365-2478.1990.tb01846.x.
Shin, C., and Y. Cha, 2009, Waveform inversion in the Laplace-Fourier domain: Geophysical Journal
International, 177, 1067–1079, https://doi.org/10.1111/j.1365-246X.2009.04102.x.
Sirgue, L., and R. G. Pratt, 2004, Efficient waveform inversion and imaging: A strategy for selecting
temporal frequencies: Geophysics, 69, 231–248, https://doi.org/10.1190/1.1649391.
Virieux, J., and S. Operto, 2009, An overview of full waveform inversion in exploration geophysics:
Geophysics, 74, no. 6, WCC127–WCC152, https://doi.org/10.1190/1.3238367.
Weglein, A. B., 2013, A timely and necessary antidote to indirect methods and so-called P-wave FWI:
The Leading Edge, 32, 1192–1204, https://doi.org/10.1190/tle32101192.1.
Weglein, A. B., F. V. Araujo, P. M. Carvalho, R. H. Stolt, K. H. Matson, R. T. Coates, D. Corrigan, D. J.
Foster, and S. A. Shaw, 2003, Inverse scattering series and seismic exploration: Inverse
Problems, 19, R27–R83, https://doi.org/10.1088/0266-5611/19/6/R01.
Wu, R. S., B. Wang, and C. Hu, 2015a, Renormalized nonlinear sensitivity kernel and inverse thin-slab
propagator in T-matrix formalism for wave-equation tomography: Inverse Problems, 31, 115004,
https://doi.org/10.1088/0266-5611/31/11/115004.
Wu, R. S., B. Wang, and M. Jakobsen, 2015b, Greens function and T-matrix reconstruction using surface
data for direct nonlinear inversion: 85th Annual International Meeting, SEG, Expanded Abstracts,
1286–1291, https://doi.org/10.1190/segam2015-5926527.1.
Wu, R. S., and J. Luo, 2015, Nonlinear scale separation and a renormalization interpretation in seismic
envelope inversion: AGU meeting, Abstract.
Wu, R. S., J. Luo, and B. Wang, 2014, Seismic envelope inversion and modulation signal model:
Geophysics, 79, no. 3, WA13–WA24, https://doi.org/10.1190/geo2013-0294.1.
Wu, R. S., and Y. Zheng, 2014, Nonlinear partial derivative and its De Wolf approximation for nonlinear
seismic inversion: Geophysical Journal International, 196, 1827–1843,
https://doi.org/10.1093/gji/ggt496.
SUMMARY Using too few parameters can lead to under-fitting the data, and
estimating biased parameters. On the other hand, considering
Seismic data are used to generate high resolution subsurface too many model parameters can over-fit the data, which leads
images, which require detailed velocity models. Full Wave- to estimating under-determined parameters with enormous un-
form Inversion (FWI), has recently gathered immense popular- certainty (Dosso et al., 2014). Thus it makes sense to make the
ity in inverting for the elastic wave velocities from the seismic number of model parameters itself a parameter to be solved for.
data. FWI is a non-linear and non-unique inverse problem that One other limitation of FWI is the requirement of a good start-
uses complete time and amplitude information for estimating ing model. Although this problem has recently been addressed
the elastic properties. Typically FWI is performed using lo- using global optimization methods (e.g., Datta and Sen, 2016),
cal optimization methods in which the subsurface model is de- no attempt has yet been made to characterize uncertainty using
scribed by using a large number of grids. The number of model a fully nonlinear sampling method. Zhu et al. (2016) estimate
parameters is determined a priori. In addition, the convergence uncertainty assuming the posteior distribution (PPD) to be a
of the algorithm to the globally optimum answer is largely Gaussian. In this paper, we primarily address these two issues
dictated by the choice of a starting model. Here, we apply using a transdimensional approach.
a trans-dimensional approach, which is based on a Bayesian
framework to solve the waveform inversion problem. In our Monte Carlo Markov Chain (MCMC) based on the Metropolis-
approach, the number of model parameters is also treated as a Hastings update rule is the most common method for sam-
variable, which we hope to estimate. We use Voronoi cells and pling from a distribution that does not have simple analytic
represent our 2D velocity model using certain nuclei points form in fixed-dimensional problems. Its extension to prob-
and employ a recently developed method called the Reversible lems with variable dimensional model space (reversible jump
Jump Hamiltonian Monte Carlo (RJHMC). RJHMC is an ef- MCMC) was developed by Green (1995) and subsequently
fective tool for model exploration and uncertainty quantifica- applied to geophysical inverse problems by Malinverno and
tion. It combines the reversible jump MCMC with the gradient Leaney (2000, 2005); Malinverno (2002); Ray et al. (2016).
based Hamiltonian Monte Carlo (HMC). We solve our forward Most recently, Sen and Biswas (2017) combined RJMCMC
problem using time-domain finite difference method while ad- with the Hamiltoniam Monte-Carlo (RJHMC) to speed up con-
joint method is used to compute the gradient vector required at vergence and applied to 1D seismic inversion. Here we apply
the HMC stage. We demonstrate our algorithm with noisy syn- RJHMC to a computationally intesive 2D FWI problem and
thetic data for the well known Marmousi model. Convergence demonstrate its feasibility and usefulness using a noisy syn-
of the chain is attained in about 3000 iterations; marginal pos- thetic dataset.
terior density plots of velocity models demonstrate uncertainty
in the obtained velocity models.
THEORY
and the seismic P-wave velocity is given by the k vector v = move, we first perform the fixed dimensional HMC to change
(V1 , . . . ,Vk ). To generate the full scale velocity model from the state of model from m∗ to m0 having k0 number of nuclei
the nuclei, we first create the Voronoi connection between the and then perform the trans-D step to move model m0 having k0
nuclei and the grid point which falls in the Voronoi area gets nuclei to model state m having k nuclei. The transition from
the velocity from the respective nucleus. Figure 1 shows an model m (k) to model m0 (k0 ) to final model m∗ (k0 ) is given
example of the transformation from nuclei cells to a complete by the RJHMC acceptance probability.
velocity model.
First: Trans-dimensional step
Forward Modeling
For a first tarns-dimensional step the random moves can jump
Once a gridded model is obtained, we perform forward model- between the states having different model dimensions (number
ing using the constant density acoustic wave equation, where of nuclei here). Here we have the transition from model m
the only elastic parameter is the P-wave velocity. Here we with k nuclei to model m0 having k0 nuclei. To represent such
assume that the reflections are due to the contrast in P-wave a transition, the Bayes’ rule can be modified to accommodate
velocities only. Such type of wave equation is given by the variability in model parameterization so as to represent the
PPD and can be given as
1 ∂ 2P
= ∇2 P + S(x,t), (2) p(k)p(m|k)p(d|m, k)
c2 ∂t 2 π(k, m) = P R , (6)
k∈κ p(k)p(m|k)p(d|m, k)dm
where P is the pressure wavefield, c is the P-wave velocity, ∇2
2 2
is the Laplacian given by ∂∂x2 + ∂∂z2 , and S(x,t) is the source where κ represents all possible values of k. p(k) is the prior
term. distribution for choosing the model dimension. We define it to
be a bounded uniform distribution between the minimum num-
At each iteration, after the velocity model is perturbed, the ber of nuclei kmin and the maximum number of nuclei possible
wave equation is solved to compute new seismogram (dcal ) kmax . p(m|k) is the prior distribution on the parameter values
and the difference from the observed seismogram (dobs ) is cal- and is defined as a Gaussian prior as
culated using " #
E = kdobs − dcal k2 , (3) 1 1 T −1
p(m|k) = 1/2 exp − 2 (m− m̄) (C̄ p ) (m− m̄) ,
The gradient is calculated by the adjoint state method, where (2π)k det(C̄ p )
the zero lag cross correlation of forward propagating source (7)
and backward propagating data residuals in time is calculated where C̄ p is the prior covariance matrix and m̄ represents the
by prior mean model.
1 ∂ 2R
= ∇2 R + ∆S(x,t), (4) p(d|m, k) in equation 6 represents the likelihood, which is given
c2 ∂t 2 by the magnitude of the error in the predicted data. Assuming
where R is the adjoint wavefield, and ∆S(x,t) is the data resid- the error covariance matrix to be a identity matrix, the mag-
ual between the observed data (dobs ) and the calculated data nitude of error can be given by L2 norm of the difference in
(dcal ). The gradient of the misfit function is calculated by sum- seismogram as given in equation 3. Thus the likelihood can be
ming over the zero lag cross correlation summed over multiple represented as
shots
∂E 1 X 1 E2
h i
= P̈(x, z,t)R(x, z,t), (5) p(d|m, k) = N/2 exp − . (8)
∂ m c(x, z)3 2
shots 2π
where P and R are the source and adjoint wavefields in space
and time. Second: Fixed-dimensional step (HMC)
Inverse formulation This is a fixed Dimensional MCMC step, where the model in
FWI is a non-unique, non linear inverse problem. It is impos- state m0 with k0 nuclei moves to a new state m∗ with same
sible to get a unique solution in FWI. Therefore using a prob- k0 nuclei using Hamiltonian Monte Carlo. HMC uses Hamil-
abilistic approach seems to be a better way to address such a tonian Dynamics to produce distant proposal for Metropolis
problem. Here, we follow the Reversible Jump Hamiltonian algorithm and avoid slow exploration by using gradient infor-
Monte Carlo algorithm (Sen and Biswas, 2017). This is a mation while changing state. In HMC, we calculate the total
stochastic method which is based on Bayes’ rule. It consist Potential energy (U) and Kinetic energy (K) of the state, i.e.
of two step (Al-Awadhi et al., 2004). The first step is a trans- the Hamiltonian Energy. HMC augments the state space of the
dimensional step similar to RJMCMC. In this step the model target distribution by adding auxiliary sets of variables termed
20 2
18
3 5000
14
11
40
4000
Depth
60
80 7 3000
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
100
9
17
13 2000
120 10 12 8 16
Figure 1: Figure showing the velocity model created from nuclei cells represented in black dot. The white cross are the grid points
in the velocity model. The color in the Voronoi cells are the velocity of that respective nuclei in that area.
Iter=0
True Velocity NC=1000
a b 5000
Depth (km) 1 4000
2 3000
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
2000
3
2 4 6 8 2 4 6 8
Iter=1258 Iter=3969
NC=3010 NC=6850
c d 5000
Depth (km)
4000
3000
2000
2 4 6 8 2 4 6 8
X Dir. (km) X Dir. (km)
Figure 3: Figure showing velocity models at different iterations of RJHMC: (a) the true velocity model, (b)-(d) models at progressive
iterations. Plots show the iteration number and the number of nuclei count in the model represented as NC.
3000
Figure 4 shows the velocity model at X = 7140m. It shows
2000
1 4000
REFERENCES
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Summary (Luo and Sava, 2011), (Ma and Hale, 2013) and (Warner
and Guasch, 2014).
Full waveform inversion (FWI) with 𝐿𝐿2 norm objective
function often suffers from cycle skipping that causes the Recently, the Wasserstein distance has been proposed to
solution to be trapped in a local minimum, usually far from replace the 𝐿𝐿2 distance for the objective function in FWI
the true model. We introduce a new norm based on the (Engquist and Froese, 2014). The Wasserstein distance is a
optimal transport theory for measuring the data mismatch well-defined metric from the theory of optimal transport in
to overcome this problem. The new solution uses an mathematics. It was first brought up by Gaspard Monge in
exponential encoding scheme and enhances the phase 1781 (Monge, 1781) and more recently by Kantorovich
information when compared with the conventional 𝐿𝐿2 norm. (Kantorovich, 1942) seeking the optimal cost of
The adjoint source is calculated trace-wise based on the 1D rearranging one density into the other, where the
Wasserstein distance. It uses an explicit solution of the transportation cost per unit mass is the Euclidean distance
optimal transport over the real line. It results in an efficient or Manhattan distance.
implementation with a computational complexity of the
adjoint source proportional to the number of shots, Wasserstein distance has the ability to consider both phase
receivers and the length of recording time. We demonstrate shifts and amplitude differences It has been demonstrated
the effectiveness of our solution by using the Marmousi in (Engquist, Froese and Yang, 2016) that 𝑊𝑊 2 bears some
model. A second example, using the BP 2004 velocity advantageous mathematical properties, such as convexity
benchmark model, illustrates the benefit of the combination with respect to shift and dilation and insensitivity to noise.
of the new norm and Total Variation (TV) regularization. In (Yang Engquist, Sun and Froese 2016), 𝑊𝑊 2 on 2D data
is applied to FWI on synthetic benchmark models. The
Introduction calculation of the corresponding adjoint source requires
solving a Monge–Ampère equation that can be
FWI is formulated as a nonlinear inverse problem matching computationally demanding. Another popular optimal
modeled data to the recorded field data (Tarantola, 1984). transport metric used for FWI is the 1-Wasserstein distance
Usually, a least-square objective function is used for ( ), approximated by the Kantorovich Rubinstein (KR)
measuring the data misfit. This misfit is minimized with norm (Métivier, et al, 2016). For this metric the transport
respect to model parameter and the model update is map is not unique. The KR norm doesn't require data to be
computed using the adjoint state method. FWI can produce positive and mass preserved. Therefore it can be directly
high-resolution models of the subsurface when compared to applied to the seismic data without transferring them into
ray-based methods. Due to the large scale of the problem, probability density function (pdf). Both analysis and
local rather than global optimization methods are numerical results shows the potential of FWI with optimal
mandatory. However, FWI is often an ill posed problem transport to mitigate cycle-skipping problem.
due to the band-limited nature of the seismic data and the
limitations of the acquisition geometries. Furthermore, the The Wasserstein metric is designed to measure the distance
non-convexity resulting from the least-square objective between two pdfs. Thus, non-negativeness and unit mass
function causes the local minima, i.e., cycle-skipping are desired for the input. But, oscillation and sign-change
problem, especially with data lacking low frequency are typical features of the seismic data. Therefore, we need
information. a misfit function that takes the global features of data into
consideration and is robust to periodicity and sign-change.
It is well known that the least-square formulation of FWI Since seismic data are not naturally positive, a proper
tends to produce many local minima. This is because only normalization method is the key to Wasserstein distance
the pointwise amplitude difference is measured based inversion. Some previous methods may lead to non-
with 𝐿𝐿2 norm while the phase or travel-time information differentiable misfit function and are not compatible with
embedded in the data is more critical for the inversion. adjoint-state method, or lose information of original data
There are different approaches proposed to capture the during the normalization.
travel-time difference, such as dynamic time warping and
convolution based methods. This information is used in Here, we address the issue of how to transform seismic data
order to convexify the objective function or enlarge the true into pdfs. The new solution uses an exponential encoding
solution valley. In this direction, we mention the works in scheme and enhances the phase information when
compared with the conventional 𝐿𝐿2 norm. The algorithm
uses of the 1D Wasserstein metric. As a result, the To make use of the phase information from the negative
implementation of the adjoint source has the same order of part of the data, we balance this uneven encoding by also
computational complexity as of the conventional 𝐿𝐿2 norm. taking into account the data reformed by the map
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Numerical experiments
Figure 2: (a): True model, (b) Initial model, (c) FWI with
𝑳𝑳𝟐𝟐 (d) FWI with 𝑾𝑾𝟐𝟐 .
Conclusions
Aknowledgments
REFERENCES
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Billette, F. J., and S. Brandsberg-Dahl, 2005, The 2004 bp velocity benchmark: 67th Annual International
Conference and Exhibition, EAGE, Extended Abstracts.
Ramos-Martinez, J., S. Crawley, S. Kelly, and B. Tsimelzon, 2011, Full-waveform inversion by pseudo-
analytic extrapolation: 81st Annual International Meeting, SEG, Expanded Abstracts,
http://dx.doi.org/10.1190/1.3627750.
Tarantola, A., 1984, Inversion of seismic refection data in the acoustic approximation: Geophysics, 49,
1259–1266, http://doi.org/10.1190/1.1441754.
Engquist, B., and B. D. Froese, 2014, Application of the Wasserstein metric to seismic signals:
Communications in Mathematical Sciences, 12, 979–988,
http://dx.doi.org/10.4310/CMS.2014.v12.n5.a7.
Engquist, B., B. D. Froese, and Y. Yang, 2016, Optimal transport for seismic full waveform inversion:
Communications in Mathematical Sciences, 14, 2309–2330,
http://dx.doi.org/10.4310/CMS.2016.v14.n8.a9.
Yang, Y., B. Engquist, J. Sun, and B. D. Froese. Application of optimal transport and the quadratic
wasserstein metric to fullwaveform inversion, arXiv preprint arXiv:1612.05075, 2016.
Metivier, L., R. Brossier, Q. Merigot, E. Oudet, and J. Virieux, 2016, Measuring the mis t between
seismograms using an optimal transport distance: application to full waveform inversion:
Geophysical Journal International, 205, 345–377.
Warner, M., and L. Guasch, 2014, Adaptive waveform inversion: Theory: 84th Annual International
Meeting, SEG, Expanded Abstracts, 1089–1093, http://dx.doi.org/10.1190/segam2014-0371.1.
Luo, S., and P. Sava, 2011, A deconvolution-based objective function for wave-equation inversion: 81st
Annual International Meeting, SEG, Expanded Abstracts, 2788–2792,
http://dx.doi.org/10.1190/1.3627773.
Ma, Y., and D. Hale, 2013, Wave-equation refection traveltime inversion with dynamic warping and full-
waveform inversion: Geophysics, 78, R223-R233, https://doi.org/10.1190/geo2013-0004.1.
Monge, G., 1781, Memoire sur la theorie des deblais et des remblais: Del’Imprimerie Royale.
Kantorovich, L. V., 1942, On the translocation of masses: In Dokl. Akad. Nauk SSSR, 37, 199–201.
Qiu, L., N. Chemingui, Z. Zou, and A. Valenciano, 2016, Full-waveform inversion with steerable
variation regularization: 86th Annual International Meeting, SEG, Expanded Abstracts, 1174–
1178, http://dx.doi.org/10.1190/segam2016-13872436.1.
We summarize and compare four different misfit functions for We focus on two features in particular. One is integration of
full waveform inversion (FWI): the conventional least-squares data and the other is the need to rescale the data to be non-
norm, the integral wavefields misfit functional, the Normal- negative. Integration provides a global comparison between
ized Integration Method (NIM) and the quadratic Wasserstein observed and synthetic data and also shifts the focus to lower
metric. The integral wavefields misfit functional and NIM are frequencies. Nonnegativity further reduces the risk of cycle
equivalent to the norm for Soblev space, which has intrinsic skipping.
connections with the quadratic Wasserstein metric. We extract
two important features of optimal transport. The first one is
integration of data, which reduces high frequencies and glob- THEORY
ally compares observed and synthetic seismic waveforms. The
other is rescaling of the data to be nonnegative. Numerical Full waveform inversion is a PDE-constrained optimization
results illustrate that FWI with quadratic Wasserstein metric problem, minimizing the data misfit d( f , g) by updating the
can effectively overcome the cycle skipping problem. A math- model m, i.e. :
ematical study on the convexity of the four misfit functions
demonstrates the importance of data nonnegativity and inte- m� = argmin d( f (xr ,t; m), g(xr ,t)), (1)
m
gration in dealing with local minima in inversion.
where g is observed data, f is simulated data, xr are receiver
locations, and m is the model parameter. We get the modeled
data f (x,t; m) by solving a wave equation with a finite differ-
INTRODUCTION ence method (FDM) in both the space and time domain (Alford
et al., 1974).
Full waveform inversion (FWI) is a data-driven method to ob-
tain high resolution subsurface properties by minimizing the Generalized least squares functional is a weighted sum of the
difference between observed and synthetic seismic waveforms squared errors and hence a generalized version of the standard
(Virieux et al., 2017). In the past three decades, the least- least squares misfit function. The formulation is
squares norm (L2 ) has been widely used as a misfit function ��
(Tarantola and Valette, 1982; Lailly, 1983), which is known J1 (m) = |W ( f (xr ,t; m)) −W (g(xr ,t))|2 dt, (2)
to suffer from cycle skipping issues with local minimum trap- r
ping and sensitivity to noise (Virieux and Operto, 2009). Other
misfit functions proposed in literature, include the L1 norm where W is an operator. In the conventioinal L2 misfit, W = I,
(Brossier et al., 2010), the Huber norm (Ha et al., 2009), fil- the identity operator.
ter based misifit functions (Warner and Guasch, 2014; Zhu and The integral wavefields misfit functional (Huang et al., 2014) is
Fomel, 2016), seismic envelop inversion (Luo and Wu, 2015) a generalized least squares functional applied on full-waveform
and some others. �t
inversion (FWI) with weighting operator W (u) = 0 u(x, τ)dτ.
A recently introduced class of misfit functions are optimal- The objective function is defined as
transport related (Engquist and Froese, 2014; Métivier et al., � �2
� � �� t � t
�
2016; Engquist et al., 2016; Métivier et al., 2016; Yang et al., J2 (m) = � f (xr , τ; m)dτ − g(xr , τ)dτ � dt, (3)
� �
2016). As useful tools from the theory of optimal transport, the r 0 0
quadratic Wasserstein metric (W2 ) computes the optimal cost
of rearranging one distribution into another with a quadratic �t
If we define the integral wavefields U(x,t) = 0 u(x, τ)dτ, then
cost function, while 1-Wasserstein metric (W1 ) using absolute misfit function (3) is the ordinary least squares misfit
value cost function. � t between
the observed and predicted integral wavefields 0 g(xr , τ)dτ
�t
In this paper, we will also discuss about Normalized Intergra- and 0 f (xr , τ; m)dτ. The integral wavefields still satisfy the
tion Method (NIM) which computes the least-squares differ- original acoustic
�t wave equation with a different source term:
ence between two normalized data sets (Liu et al., 2012; Chau- δ (�x −�xs ) 0 s(τ)dτ = δ (�x −�xs )H(t) ∗ s(t), where s is the origi-
ris et al., 2012; Donno et al., 2013). If we consider the data are nal source term and H(t) is the Heaviside step function (Huang
properly rescaled, the misfit of NIM is the norm of Sobolev et al., 2014).
space H −1 in mathematics. The connection between W2 and
Normalized Integration Method (NIM) is another generalized
H −1 is not obvious from the optimal transport definition, but
least squares functional, similar to the integral wavefields mis-
is clear from the 1D closed solution formula. We shall also
fit functional. However, compared with integral wavefields
see that this is valid in higher dimensions even if there is no
misfit functional which directly integrates the observed and
explicit solution formula.
Conventional L2
where M is the set of all maps T f ,g that rearrange the distribu-
Integral L2
100
f
30 tion f into g (Villani, 2003).
f
g 20
g
50 10
The optimal transport formulation� requires non-negative
� dis-
0
tributions and equal total masses, f (x)dx = g(x)dx, which
0 -10
are not natural for seismic signals. Therefore a proper data
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
NIM/ W2/ W1 We can compare the data trace by trace and use the Wasserstein
1
F metric (Wp ) in 1D to measure the misfit. The overall misfit is
0.8 dist
G
then
W = |F -1 -G -1 | R
1 � p
0.6
dist 2
J4 (m) = Wp ( f (xr ,t; m), g(xr ,t)), (7)
2
W2 = |F -1 -G -1 | 2 r=1
0.4
where R is the total number of traces. In this paper, we mainly
0.2
dist' 2
discuss the quadratic Wasserstein metric (W2 ) when p = 2 in (6)
NIM = |F-G|
2 and (7).
0
0 2 4 6 8
4000 100
Misfit
0.5
2000 50 0.5
0 0 0
-2 -1 0 1 2 -2 -1 0 1 2 -2 -1 0 1 2
Shift Shift Shift
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
0
W2, p(x)=x*x W2,
-3
10
p(x)=ax+b W2, p(x)=exp(c*x) 1
4 4 0.015 0.8 1.6
0.6 1.8
2
3 3 0.4 2.2
0.01
2.4
v p,0
2 2
0.005 (a) L2
1 1
0 0 0
-2 -1 0
Shift
1 2 -2 -1 0
Shift
1 2 -2 -1 0
Shift
1 2
W2 misfit
Figure 2: The misfit between f (x) and f (x − s) by six differ- 1
Misfit
0.5
0.4 2.2
2.4
v p,0
(b) W2
where dµ = f (x)dx. In one word, the quadratic Wasserstein As demonstrated by Engquist et al. (2016), the squared Wasser-
metric is a weighted H −1 norm. stein metric has several properties that make it attractive as
a choice of misfit function. One highly desirable feature is
Besides, the dynamical characterization of the Wasserstein met- its convexity with respect to several parameterizations. How-
ric proposed by Benamou-Brenier (Benamou and Brenier, 2000) ever, the convexity highly depends on the data normalization
gives insights to consider that H −1 and W2 belongs to the same method to satisfy nonnegativity and mass balance. The curves
class of measures. One can refer to Dolbeault et al. (2009) and in the second row of Figure 2 are W22 distance with differ-
Cardaliaguet et al. (2012) for more theoretical details, and Pa- ent scaling functions: p1 (x) = x2 , and p2 (x) = ax + b and
padakis et al. (2014) for computational schemes. Mathemati- p3 (x) = exp(c · x). Theoretically p1 gives perfect convexity,
cally, the misfits computed by NIM and W2 are close also in but having difficulty in inversion with adjoint-state method.
higher dimensions. From Taylor expansion p3 is very close to p2 when c is small,
but easy to blow up with large c. Our current choice is to nor-
Convexity
malize data with p2 , but it is worth thinking a new normaliza-
In order to illustrate the convexity of different objective func- tion function that is able to preserve the convexity better.
tions, we borrow an example from Engquist and Froese (2014)
It is interesting to compare the graph for NIM (upper right)
that compares the misfit between a Ricker wavelet f and its
4
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
5
NUMERICAL EXAMPLE
6
(a)
In this section, we use a part of the BP 2004 benchmark ve-
0 2 4 6 8 10 12 14 16 locity model (Billette and Brandsberg-Dahl, 2005) (Figure 4a)
0
and an initial model without the upper salt part (Figure 4b)
1
to do inversion with W2 and L2 norm respectively. A fixed-
2 spread surface acquisition is used, involving 11 shots located
every 1.6km on top. A Ricker wavelet centered on 5Hz is used
z (km)
5
CONCLUSION
6
(c)
In this paper, we summarize and compare four misfit func-
0
0 2 4 6 8 10 12 14 16 tions: the conventional least-squares inversion (L2 ), the in-
tegral wavefields misfit function, the Normalized Integration
1
Method (NIM), and the quadratic Wasserstein metric (W2 ) from
2
optimal transport. The L2 norm is popular in general inverse
z (km)
3
problems, but suffers from cycle skipping in seismic inversion.
4 The other three methods all incorporate the idea of integra-
5 tion the waveforms. Integration helps in enhancing the lower
6 frequency component, but cannot avoid local minima coming
(d) from the oscillatory periodicity of the data. It is ideal to have
a preconditioning operator which can “break” the periodicity
Figure 4: (a) True model velocity (b) Initial velocity (c) Inver- and “record” the previous data information in time.
sion result using L2 (d) Inversion result using W2
One solution is to combine the nonnegativity and integration in
time together. Both NIM and the quadratic Wasserstein met-
ric include these ideas as essential steps. A detailed discussion
with the one of W2 (lower left) both of which are using the illustrates that the quadratic Wasserstein metric and the H −1
same normalization function (p1 ) and globally convex with re- norm which NIM computes belong to the same family of math-
spect to the shift s. When f (x) and f (x − s) are close (i.e. |s| ematical measures. Moreover, H −1 and W2 become equivalent
is small), W2 is a weighted H −1 as (10) states. Both curves when the two data sets are close. The analysis among these
have good convexity as O(s2 ) around zero. As |s| gets larger, misfit functions of FWI brings additional insights into the im-
W22 ( f , fs ) is still O(s2 ), while the misfit measured by NIM is portance of seismic data preconditioning, which also can be
O(s). The convexity of NIM becomes a bit weaker. seen in examples of large scale FWI.
REFERENCES
Alford, R., K. Kelly, and D.M. Boore, 1974, Accuracy of finite-difference modeling of the acoustic wave
equation: Geophysics, 39, 834–842, http://dx.doi.org/10.1190/1.1440470.
Benamou, J.-D., and Y. Brenier, 2000, A computational fluid mechanics solution to the monge-
kantorovich mass transfer problem: Numerische Mathematik, 84, 375–393,
http://dx.doi.org/10.1007/s002110050002.
Billette, F., and S. Brandsberg-Dahl, 2005, The 2004 bp velocity benchmark: Presented at the 67th EAGE
Conference & Exhibition.
Brenier, Y., 1991, Polar factorization and monotone rearrangement of vector-valued functions:
Communications on pure and applied mathematics, 44, 375–417,
http://dx.doi.org/10.1002/(ISSN)1097-0312.
Brossier, R., S. Operto, and J. Virieux, 2010, Which data residual norm for robust elastic frequency-
domain full waveform inversion?: Geophysics, 75, no. 3, R37–R46,
http://dx.doi.org/10.1190/1.3379323.
Cardaliaguet, P., G. Carlier, and B. Nazaret, 2012, Geodesics for a class of distances in the space of
probability measures: Calculus of Variations and Partial Differential Equations, 1–26,
https://doi.org/10.1007/s00526-012-0555-7.
Chauris, H., D. Donno, and H. Calandra, 2012, Velocity estimation with the normalized integration
method: Presented at the 74th EAGE Conference and Exhibition incorporating EUROPEC 2012,
https://doi.org/10.3997/2214-4609.20148721.
Dolbeault, J., B. Nazaret, and G. Savaré, 2009, A new class of transport distances between measures:
Calculus of Variations and Partial Differential Equations, 34, 193–231,
http://dx.doi.org/10.1007/s00526-008-0182-5.
Donno, D., H. Chauris, and H. Calandra, 2013, Estimating the background velocity model with the
normalized integration method: 75th Annual International Conference and Exhibition
incorporating SPE EUROPEC, EAGE, Extended Abstracts, http://dx.doi.org/10.3997/2214-
4609.20130411.
Engquist, B., and B. D. Froese, 2014, Application of the Wasserstein metric to seismic signals:
Communications in Mathematical Sciences 12, https://doi.org/10.4310/cms.2014.v12.n5.a7.
Engquist, B., B. D. Froese, and Y. Yang, 2016, Optimal transport for seismic full waveform inversion:
Communications in Mathematical Sciences, 14, 2309–2330,
http://dx.doi.org/10.4310/CMS.2016.v14.n8.a9.
Ha, T., W. Chung, and C. Shin, 2009, Waveform inversion using a back-propagation algorithm and a
huber function norm: Geophysics, 74, no. 3, R15–R24, http://dx.doi.org/10.1190/1.3112572.
Huang, G., H. Wang, and H. Ren, 2014, Two new gradient precondition schemes for full waveform
inversion: arXiv preprint arXiv:1406.1864.
Lailly, P., 1983, The seismic inverse problem as a sequence of before stack migrations: Conference on
inverse scattering: Theory and application: Society for Industrial and Applied Mathematics, 206–
220.
Liu, J., H. Chauris, and H. Calandra, 2012, The normalized integration method-an alternative to full
waveform inversion?: Presented at the 25th Symposium on the Application of Geophysics to
Engineering & Environmental Problems, https://doi.org/10.3997/2214-4609.20144373.
Métivier, L., R. Brossier, Q. Mérigot, E. Oudet, and J. Virieux, 2016, An optimal transport approach for
seismic tomography: application to 3d full waveform inversion: Inverse Problems, 32, 115008,
http://dx.doi.org/10.1088/0266-5611/32/11/115008.
Papadakis, N., G. Peyré, and E. Oudet, 2014, Optimal transport with proximal splitting: SIAM Journal on
Imaging Sciences, 7, 212–238, http://dx.doi.org/10.1137/130920058.
Tarantola, A., and B. Valette, 1982, Generalized nonlinear inverse problems solved using the least
squares criterion: Reviews of Geophysics, 20, 219–232,
http://dx.doi.org/10.1029/RG020i002p00219.
Villani, C., 2003, Topics in optimal transportation: American Mathematical Society, Graduate Studies in
Mathematics 58.
Virieux, J., A. Asnaashari, R. Brossier, L. Métivier, A. Ribodetti, and W. Zhou, 2017, In 6. An
introduction to full waveform inversion: Encyclopedia of Exploration Geophysics, R1-1–R1-40,
http://dx.doi.org/10.1190/1.9781560803027.entry6.
Virieux, J., and S. Operto, 2009, An overview of full-waveform inversion in exploration geophysics:
Geophysics, 74, no. 6, WCC1–WCC26, http://dx.doi.org/10.1190/1.3238367.
Warner, M., and L. Guasch, 2014, In Adaptive waveform inversion: Theory: 84th Annual International
Meeting, SEG, Expanded Abstracts, 1089–1093, http://dx.doi.org/10.1190/segam2014-0371.1.
Yang, Y., B. Engquist, J. Sun, and B.-D. Froese, 2016, Application of optimal transport and the quadratic
wasserstein metric to full-waveform inversion: arXiv preprint arXiv:1612.05075.
Zhu, H., and S. Fomel, 2016, Building good starting models for full-waveform inversion using adaptive
matching filtering misfit: Geophysics, 81, no. 5, U61–U72, http://dx.doi.org/10.1190/geo2015-
0596.1
Equation 1 provides a true “kinematic” objective function un- effects, resulting in a leakage of amplitude information into the
der the assumption of single-event asymptotic representations misfit. If forward modeling is not dynamically accurate, this
in Equations 2 and 3. Of course, instead of Equation 1 we can leakage may result in significant inversion errors.
use a simple frequency-domain normalized misfit
2
û dˆ EXAMPLES
− (5)
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
|û| ˆ
|d|
We conduct time-domain full-waveform inversion experiments
in either time or frequency-domain FWI. As with Equation 1, with the objective functions in Equations 1 and 7 using syn-
Equation 5 provides an amplitude-insensitive objective func- thetic data with and without amplitude attenuation. The adjoint
tion for fitting single events of Equations 2 and 3. However, source for the new objective function is computed as
our proposed objective function in Equation 1 provides a broad- n 2i
band inversion over an arbitrary range of frequencies with the o
−1
f (t, s, r) = Fω→t Im wû wû − dˆ (8)
data amplitude spectrum acting as a frequency-dependent mis- û
fit weight. Any desired shaping can be applied to the observed ˆ û| and Fω→t
−1
where w(ω) = |d|/| is the inverse Fourier trans-
data amplitude spectrum in a single data preprocessing step,
form. We generated synthetic data using acoustic modeling
for example, boosting lower frequencies to improve FWI con-
with density. The true velocity model used in our experiments
vergence and reduce sensitivity to cycle skips (Lazaratos et al.,
is shown in Figure 1, the true density model is obtained from
2011; Plessix and Li, 2013). Another advantage of the new ob-
the true velocity model by dividing it by 1500 (setting water
jective function is its ability to handle singularities in the nor-
density to 1). Both forward modeling and inversion are per-
malized observed wave field due to notches in the amplitude
formed on a 1000 (horizontal) by 800 (vertical) computational
spectrum in the presence of complex multipathing. In any re-
grid with a 10 m horizontal and 5 m vertical spacing. A Ricker
alistic experiment the observed wave field is the sum of multi-
wavelet centered at 10 Hz is used for source, and absorbing
ple events of Equation 2, representing various transmitted and
boundary conditions are applied at the surface to avoid surface-
reflected waves,
related multiples. A streamer acquisition is used with 39 shots
X h i
ˆ s, r) = j j and a 260 m shot spacing, with offsets ranging from 10 m to 10
d(ω, Ad (ω, s, r) exp iωτd (ω, s, r) , (6)
km. We use a starting velocity model obtained from the true
j
model using a 400 m smoothing filter.
j j
with various traveltimes τd and amplitudes Ad . Complex multi-
pathing may result in the amplitude spectrum of the multiple-
event trace of Equation 6 being close to zero, leading to errors
in the normalized wave fields in Equations 1 and 5. Weighting
of the integrand in Equation 1 by the observed data amplitude
spectrum effectively eliminates the contribution of such singu-
larities. It must be noted that when a single-event asymptotic
of Equation 2 is not valid, our method can no longer be re-
garded as a true kinematic inversion. Indeed, the phase spec-
trum of a multiple-event trace in Equation 6 is determined by
both traveltimes and amplitudes of the individual events. Am-
plitude errors in either forward-modeled or observed data leak
into the phase spectrum and inverted models, reducing contri-
bution of the weakest events to the inversion. This obvious
limitation of the proposed method can be partially overcome
using data masking for isolating individual events of interest
as, for example, in a target-oriented or time-lapse inversion.
One existing broadband alternative to the proposed method is
the normalized L2 objective function (Routh et al., 2011),
2 Z 2
u d
u d
kuk − kdk
= kuk − kdk dt. (7)
We use no frequency continuation and conduct a 0-25 Hz broad- locity model of Figure 1, however, velocity contrasts are over-
band time-domain full-waveform inversion to convergence, us- predicted. FWI using the normalized L2 objective function of
ing the objective functions of Equations 1 and 7. In our ex- Equation 7 produced a similar result with no clear advantage
periments we intentionally avoid density inversion in order to to any method, as demonstrated by the forward-modeled traces
study the effect of amplitude errors on our objective functions. shown in Figure 3. In the absence of frequency dispersion,
Since the true model generates multiple refraction and reflec- both approaches deliver similar results, and both suffer from
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
tion events, the neglected density effects are expected to leak amplitude leakage into the phase spectrum of multiple-event
into the velocity inversion. Indeed, for the acoustic reflection traces in Equation 6.
coefficient in the absence of elastic conversions, for small re-
flection angles less than ≈ 30◦ we have (Aki and Richards,
1980; Shuey, 1985):
1 ∆VP ∆ρ 1 ∆VP
R(θ ) ≈ + + sin2 θ , (9)
2 VP ρ 2 VP
ACKNOWLEDGMENTS
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
CONCLUSIONS
REFERENCES
Aki, K., and P. G. Richards, 1980, Quantitative seismology: Theory and methods: W. H. Freeman and
Co.
Bozdag, E., J. Trampert, and J. Tromp, 2011, Misfit functions for full waveform inversion based on
instantaneous phase and envelope measurements: Geophysical Journal International, 185, 845–
870, http://dx.doi.org/10.1111/j.1365-246X.2011.04970.x.
Fichtner, A., 2011, Full seismic modeling and inversion: Springer.
Fichtner, A., and H. Igel, 2008, Efficient numerical surface wave propagation through the optimization of
discrete crustal models — a technique based on non-linear dispersion curve matching (DCM):
Geophysical Journal International, 173, 519–533, http://dx.doi.org/10.1111/j.1365-
246X.2008.03746.x.
Gee, L. S., and T. H. Jordan, 1992, Generalized seismological data functionals: Geophysical Journal
International, 111, 363–390, http://dx.doi.org/10.1111/j.1365-246X.1992.tb00584.x.
Lazaratos, S., I. Chikichev, and K. Wang, 2011, Improving the convergence rate of full wavefield
inversion using spectral shaping: 81st Annual International Meeting, SEG, Expanded Abstracts,
2428–2432, http://dx.doi.org/10.1190/1.3627696.
Luo, Y., and G. T. Schuster, 1991, Wave-equation traveltime inversion: Geophysics, 56, 645–653,
http://dx.doi.org/10.1190/1.1443081.
Maharramov, M., B. L. Biondi, and M. A. Meadows, 2016, Time-lapse inverse theory with applications:
Geophysics, 81, no. 6, R485–R501, http://dx.doi.org/10.1190/geo2016-0131.1.
Plessix, R.-E., and Y. Li, 2013, Waveform acoustic impedance inversion with spectral shaping:
Geophysical Journal International, 195, 301–314, http://dx.doi.org/10.1093/gji/ggt233.
Routh, P. S., J. R. Krebs, S. Lazaratos, A. I. Baumstein, I. Chikichev, N. Downey, D. Hinkley, and J. E.
Anderson, 2011, Full-wavefield inversion of marine streamer data with the encoded simultaneous
source method: 73rd Annual International Conference and Exhibition, EAGE, Extended
Abstracts, F032, http://dx.doi.org/10.3997/2214-4609.20149730.
Shuey, R. T., 1985, A simplification of the Zoeppritz equations: Geophysics, 50, 609–614,
http://dx.doi.org/10.1190/1.1441936.
Van Leeuwen, T., and W. A. Mulder, 2010, A correlation-based misfit criterion for wave-equation
traveltime tomography: Geophysical Journal International, 182, 1383–1394,
http://dx.doi.org/10.1111/j.1365-246X.2010.04681.x.
Marmousi model
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
REFERENCES
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Agarwal, A., K. Sain, and S. Shalivahan, 2016, Traveltime and constrained AVO inversion using FDR
PSO: 86th Annual International Meeting, SEG, Expanded Abstracts, 577–581.
http://doi.org/10.1190/segam2016-13959236.1.
Clayton, R., and B. Enquist, 1977, Absorbing boundary conditions for acoustic and elastic wave
equations: Bulletin of the Seismological Society of America, 67, 1529–1540.
Engquist, B., and A. Majda, 1977, Absorbing boundary conditions for numerical simulation of waves:
Proceedings of the National Academy of Sciences, 74, 1765–1766,
http://doi.org/10.1073/pnas.74.5.1765.
Fernández-Martínez, J. L., and E. García-Gonzalo, 2009, The PSO family: Deduction, stochastic analysis
and comparison: Swarm Intelligence, 3,245–273, http://doi.org/10.1007/s11721-009-0034-8.
Gauthier, O., J. Virieux, and A. Tarantola, 1986, Two-dimensional nonlinear inversion of seismic
waveforms: Numerical results: Geophysics, 51, 1387–1403, http://doi.org/10.1190/1.1442188.
Hager, W. W., and H. Zhang, 2006, A survey of nonlinear conjugate gradient methods: Pacific Journal of
Optimization, 2, 35–58.
Kennedy, J., and R. C. Eberhart, 1995, Particle swarm optimization: Proceedings of the IEEE
International Conference on Neural Networks, 4, 1942–1948.
Luis, J., T. Mukerji, E. García- Gonzalo, and A. Suman, 2010, Reservoir characterization and inversion
uncertainty via a family of Particle Swarm Optimizers: 80th Annual International Meeting, SEG,
Expanded Abstracts, 2334–2339, http://doi.org/10.1190/1.3513319.
Onwunalu, J. E., & L. J. Durlofsky, 2010, Application of a particle swarm optimization algorithm for
determining optimum well location and type: Computational Geosciences, 14, 183–198,
http://doi.org/10.1007/s10596-009-9142-1.
Paasche, H., and J. Tronicke, 2014, Nonlinear joint inversion of tomographic data using swarm
intelligence: Geophysics, 79, no. 4, R133–R149, http://doi.org/10.1190/geo2013-0423.1.
Shaw, R., and S. Srivastava, 2007, Particle swarm optimization: A new tool to invert geophysical data:
Geophysics, 72, no. 2, F75–F83, http://doi.org/10.1190/1.2432481.
Shi, Y., and R. C. Eberhart, 1998, Parameter selection in particle swarm optimization: Proceedings of the
7th Annual Conference on Evolutionary Programming, 591–600,
http://doi.org/10.1007/BFb0040810.
Tarantola, A., 1984, Inversion of seismic reflection data in the acoustic approximation: Geophysics, 49,
1259–1266, http://doi.org/10.1190/1.1441754.
Tronicke, J., H. Paasche, and U. Böniger, 2012, Crosshole traveltime tomography using particle swarm
optimization: A near-surface field example: Geophysics, 77, no. 1, R19–R32,
http://doi.org/10.1190/geo2010-0411.1.
Van den Bergh, F., 2002, An analysis of particle swarm optimizers: Ph.D. thesis, University of Pretoria.
Yang, P., J. Gao, and B. Wang, 2015, A graphics processing unit implementation of time-domain full-
waveform inversion: Geophysics, 80, no. 3, F31–F39, http://doi.org/10.1190/geo2014-0283.1.
Zhang, Y., Y. Jun, G. Wei, and L. Wu, 2010, Find multi-objective paths in stochastic networks via
chaotic immune PSO: Expert Systems with Applications, 37, no. 3, 1911–1919,
http://doi.org/10.1016/j.eswa.2009.07.025.
SUMMARY (2011a); Métivier et al. (2013, 2014) allow assessing the effect
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
• • ? •
ma
k−1 ×
•
• ? a
mk+1 ×
•
• f
mk d(mk )
f
•
• alone with the same settings, starting from m0 . This also im-
Forecast
Observation
plies that our initial model was sufficiently good to ensure con-
(FWI)
dobs,k−1 ? (modeling)
dobs,k+1 ?
vergence.
step
The approximated covariance matrix is extracted as a low-
k−1 k k + 1 (modeling frequency)
rank version from the ensemble repartition. The covariance
for the velocity is given in m2 .s−2 and represents the local di-
Figure 1: EnKF algorithm schematics. Ensemble’s members versity through the ensemble members. The variance (diago-
are represented by dots, data by stars and ellipses represent nal of the covariance matrix) can be displayed as a 2D map
uncertainty. The forecast ensemble is denoted in blue, the an- in fig. 2-D. This result corresponds to expectations from the
alyzed ensemble in red, the observed data in green, and the theoretical understanding of the FWI problem with a surface
modeled data from the forecast are depicted in gray. In bold acquisition setup, in term of uncertainty quantification. The
we have the general EnKF operations while in parenthesis we variance map can be interpreted as the superimposition of a
have the associated case for our ETKF-FWI application. The low-wavenumber background and a high-wavenumber pertur-
dashed lines denotes the Analysis steps. bation. The low-wavenumber background has low variance
values near the acquisition and progressively increases with
distance and depth due to the decrease of wavefield coverage
which provides the model minimizing the l2 norm of the mis- and wave amplitude with the geometrical spreading. The high-
fit between modeled data H (mi,k ) = di,k and measured data wavenumber component of the variance map highlights the in-
dobs,k . The ETKF-FWI scheme is represented in figure 1. terfaces. This can be attributed to band-limited data, which
does not constrain the solution enough.
The common way to generate an ensemble with a given statis-
tics would be to first factorize the desired covariance matrix
with a Cholesky decomposition as P = LLT , and then build- DISCUSSION AND CONCLUSION
ing a vector v satisfying this covariance by
ETKF-FWI seems to be a powerful and straightforward
v = Lu, (14) method allowing uncertainty quantification in FWI. Variance
from a random vector u. However, targeting large-scale ap- maps are easily readable to evaluate inversion results, and res-
plications, a Cholesky decomposition is not achievable. A olution could be studied from lines of P. Still, this original
straightforward and pragmatical way to generate the initial en- application set-up many questions that will require extensive
semble is considered instead. The population is built from a work.
consistent starting model m0 , by considering Ne zero mean First, the actual meaning of “uncertainty” as extracted from the
random vectors ui (white noise), convolved with a 2D Gaus- ensemble must be understood. Working with finite-frequency
sian filter with realistic correlation lengths. Each ensemble waves propagation and limited coverage, cause a filter-like ef-
member can be considered as m0,i = m0 +G ui , with G the con- fect. Thus can the quantitative uncertainties be associated with
volution operator with the 2D Gaussian filter. Thus, the prior direct uncertainty on real physical parameters? Alternatively,
covariance obtained with this ensemble generation strategy is will it only be able to account for the apparent macro-model
a Gaussian squared as P = G G T . seen by the waves, as questioned by Capdeville et al. (2010)
for homogenization and down-scaling problems?
APPLICATION - MARMOUSI EXAMPLE Considering the current state of development of our method,
many points need to be explored: How to design the mea-
In this part, a synthetic experiment is conducted on the 2D surement noise matrix R? This parameter should be simple to
Marmousi model (fig.2-A) with our ETKF-FWI strategy. 2D consider and be related with the recording noise level and sen-
visco-acoustic frequency-domain is chosen for this applica- sor design. Up to now, the process noise matrix has been left
tion. The formulation and operators are set as described in the aside but should be associated with the noise and error of the
previous section and frequency evolution replace the dynamic forecasting operator. Classically, it is a troublesome parame-
evolution. 25 dB of white noise have been applied to the data, ter to estimate even for linear operators in DA. In the frame of
preventing noise-free inverse-crime and reviewing the sensibil- ETKF-FWI, for which the FWI process is considered as fore-
ity of the technique to noise. The acquisition is a fixed spread casting, estimating this matrix could be a challenge but will ul-
surface geometry with 144 sources and 660 receivers at a 25m timately be needed in practical applications. The initial ensem-
depth position under water surface. ETKF-FWI has been ap- ble repartition, directly linked to the prior covariance P, is also
plied from 3 to 15Hz, each 0.4Hz, from m0 initial model (fig.2- an open question. A pragmatical approach is to use a Gaus-
B). The 200 ensemble members are generated by using a 500m sian filter, leading to a Gaussian squared covariance. However,
Velocit (m.s⁻¹)
3200 bers for large scale problems. The optimal number of members
for ETKF-FWI scheme is still to be determined and may vary
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Velocit (m.s⁻¹)
(Krebs et al., 2009; Warner et al., 2013) may prove to be per-
3200
tinent. This would also result in a reduction of the technique’s
cost.
2400
A more global view of the approach also leads to questioning
1600 the variables and observations of the filter itself. Only the ve-
locity properties have been accounted for up to now, but multi-
parameters unknowns are inherently easy to consider. Well-
C log data, for instance, may be used as a direct observations
4000 or constraint in addition to the seismic wavefield. The entire
Velocit (m.s⁻¹)
0.2 How does the method compare to a Markov chain Monte Carlo
1500 sampling of the misfit function at the convergence point as pro-
2000 posed by Fang et al. (2014), or the random sampling of the
0.1
Hessian at the convergence point as suggested by Fichtner and
2500 van Leeuwen (2015)?
0.0
0 2000 4000 6000 8000 10000 12000 Of course, one point to carefully consider is the computational
Offset (m)
cost, as each ensemble members requires to solve the FWI
problem on its own, increasing the cost of one or two order of
Figure 2: Velocity models and variance map associated with magnitude compared to classical FWI. Nonetheless, remind-
the experiment depicted in the Application section. A) 2D ing that this scheme is embarrassingly parallel and thanks to
Marmousi true velocity model. B) ETKF-FWI initial model the development of hardware capacities towards the exascale
m0 . C) ETKF-FWI final mean model after 30 assimilation and the current trends in grid computing, ETKF-FWI appli-
steps from 3Hz to 15Hz each 0.4Hz. D) ETKF-FWI final vari- cations may be promptly achievable even for large scale FWI
ance map after 30 assimilation step from 3Hz to 15Hz each problems, as it is the case for actual DA problems.
0.4Hz.
ACKNOWLEDGMENTS
other filters as Laplace or Bessel filter (Trinh et al., 2017) may
be relevant if used with the same strategy. These filters directly This study was partially funded by the SEISCOPE consortium
(http://seiscope2.osug.fr), sponsored by CGG, CHEVRON, EXXON-
affect the spatial shape of the covariance between parameters,
MOBIL, JGI, SHELL, SINOPEC, STATOIL, TOTAL and WOOD-
but also its amplitude. This amplitude should be cautiously SIDE. This study was granted access to the HPC resources of the
set, as too large values could lead to significant kinematics dif- Froggy platform of the CIMENT infrastructure (https://ciment.ujf-
ferences in the data resulting in cycle-skipping. However, the grenoble.fr), which is supported by the Rhône-Alpes region (GRANT
values should be sufficiently large to ensure satisfying explo- CPER07 13 CIRA), the OSUG@2020 labex (reference ANR10
ration of the model space and provide meaningful information LABX56) and the Equip@Meso project (reference ANR-10-EQPX-
29-01).
about the misfit function’s local curvature. Dramatically low
REFERENCES
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Bishop, C. H., B. J. Etherton, and S. J. Majumdar, 2001, Adaptive sampling with the ensemble transform
kalman filter. Part I: Theoretical aspects: Monthly Weather Review, 129, 420–436,
http://doi.org/10.1175/1520-0493(2001)129<0420:ASWTET>2.0.CO;2.
Bunks, C., F. M. Salek, S. Zaleski, and G. Chavent, 1995, Multiscale seismic waveform inversion:
Geophysics, 60, 1457–1473, http://doi.org/10.1190/1.1443880.
Capdeville, Y., L. Guillot, and J.-J. Marigo, 2010, 2-D non-periodic homogenization to upscale elastic
media for P-SV waves: Geophysical Journal International, 182, 903–922,
http://doi.org/10.1111/j.1365-246X.2010.04636.x.
Du, Z., E. Querendez, and M. Jordan, 2012, Resolution and uncertainty in 3D stereotomographic
inversion: 74th Annual International Conference and Exhibition, EAGE, Extended Abstracts,
http://doi.org/10.3997/2214-4609.20148576.
Evensen, G., 1994, Sequential data assimilation with nonlinear quasi-geostrophic model using Monte
Carlo methods to forecast error statistics: Journal of Geophysical Research, 99, 143–162,
http://doi.org/10.1029/94JC00572.
Evensen, G., 2009, Data assimilation: The ensemble Kalman filter: Springer.
Fang, Z., F. J. Herrmann, and C. D. Silva, 2014, Fast uncertainty quantification of 2D full-waveform
inversion with randomized source subsampling: 76th Annual International Conference and
Exhibition, EAGE, Extended Abstract, http://doi.org/10.3997/2214-4609.20140715.
Fichtner, A., and J. Trampert, 2011a, Hessian kernels of seismic data functionals based upon adjoint
techniques: Geophysical Journal International, 185, 775–798, http://doi.org/10.1111/j.1365-
246X.2011.04966.x.
Fichtner, A., and J. Trampert, 2011b, Resolution analysis in full waveform inversion: Geophysical
Journal International, 187, 1604–1624, http://doi.org/10.1111/j.1365-246X.2011.05218.x.
Fichtner, A., J. Trampert, P. Cupillard, E. Saygin, T. Taymaz, Y. Capdeville, and A. V. nor, 2013,
Multiscale full waveform inversion: Geophysical Journal International, 194, 534–556,
http://doi.org/10.1093/gji/ggt118.
Fichtner, A., and T. van Leeuwen, 2015, Resolution analysis by random probing: Journal of Geophysical
Research: Solid Earth, 120, 5549–5573, http://doi.org/10.1002/2015JB012106.
Hunt, B., E. Kostelich, and I. Szunyogh, 2007, Efficient data assimilation for spatiotemporal chaos: A
local ensemble transform kalman filter: Physica D: Nonlinear Phenomena, 230, 112–126,
http://doi.org/10.1016/j.physd.2006.11.008.
Jin, L., M. K. Sen, and P. L. Stoffa, 2008, One-dimensional prestack seismic waveform inversion using
ensemble kalman filter: 78th Annual International Meeting, SEG, Expanded Abstracts, 1920–
1924, http://doi.org/10.1190/1.3063815.
Jordan, M., 2015, Estimation of spatial uncertainties in tomographic images: 77th Annual International
Conference and Exhibition, EAGE, Extended Abstracts, http://doi.org/10.3997/2214-
4609.201413555.
Kalman, R., 1960, A new approach to linear filtering and prediction problems: Journal of basic
Engineering, 82, 35–45, https://doi.org/10.1115/1.3662552.
Krebs, J., J. Anderson, D. Hinkley, R. Neelamani, S. Lee, A. Baumstein, and M. D. Lacasse, 2009, Fast
full-wavefield seismic inversion using encoded sources: Geophysics, 74, no. 6, WCC105–
WCC116, http://doi.org/10.1190/1.3230502.
Métivier, L., R. Brossier, J. Virieux, and S. Operto, 2013, Full waveform inversion and the truncated
Newton method: SIAM Journal On Scientific Computing, 35, B401–B437,
http://doi.org/10.1137/120877854.
Operto, S., A. Miniussi, R. Brossier, L. Combe, L. Métivier, V. Monteiller, A. Ribodetti, and J. Virieux,
2015, Efficient 3-D frequency-domain mono-parameter full-waveform inversion of ocean-bottom
cable data: application to Valhall in the visco-acoustic vertical transverse isotropic
approximation: Geophysical Journal International, 202, 1362–1391,
http://doi.org/10.1093/gji/ggv226.
Ott, E., B. R. Hunt, I. Szunyogh, A. V. Zimin, E. J. Kostelich, M. Corazza, E. Kalnay, D. Patil, and J. A.
Yorke, 2004, A local Ensemble Kalman filter for atmospheric data assimilation: Tellus A, 56,
415–428, http://doi.org/10.1111/j.1600-0870.2004.00076.x.
Plessix, R. E., 2009, Three-dimensional frequency-domain full-waveform inversion with an iterative
solver: Geophysics, 74, no. 6, WCC53–WCC61, http://doi.org/10.1190/1.3211198.
Plessix, R.-E., G. Baeten, J. W. de Maag, and F. ten Kroode, 2012, Full waveform inversion and distance
separated simultaneous sweeping: a study with a land seismic data set: Geophysical Prospecting,
60, 733–747, http://doi.org/10.1111/j.1365-2478.2011.01036.x.
Sirgue, L., O. I. Barkved, J. Dellinger, J. Etgen, U. Albertin, and J. H. Kommedal, 2010, Full waveform
inversion: The next leap forward in imaging at Valhall: First Break, 28, 65–70,
http://doi.org/10.3997/1365-2397.2010012.
Sirgue, L., and R. G. Pratt, 2004, Efficient waveform inversion and imaging: a strategy for selecting
temporal frequencies: Geophysics, 69, 231–248, http://doi.org/10.1190/1.1649391.
Tarantola, A., 1984, Linearized inversion of seismic reflection data: Geophysical Prospecting, 32, 998–
1015, http://doi.org/10.1111/j.1365-2478.1984.tb00751.x.
Tarantola, A., 2005, Inverse problem theory and methods for model parameter estimation: Society for
Industrial and Applied Mathematics.
Tippett, M. K., J. L. Anderson, C. H. Bishop, T. M. Hamill, and J. S. Whitaker, 2003, Ensemble square
root filters: Monthly Weather Review, 131, 1485–1490.
Trinh, P.-T., R. Brossier, L. Métivier, J. Virieux, and P. Wellington, 2017, Bessel smoothing filter for
spectral element mesh: Geophysical Journal International, 209, 1489-1512,
https://doi.org/10.1093/gji/ggx103.
van Leeuwen, T., and F. J. Herrmann, 2013, Mitigating local minima in full-waveform inversion by
expanding the search space: Geophysical Journal International, http://doi.org/10.1093/gji/ggt258.
Wang, X., C. H. Bishop, and S. J. Julier, 2004, Which is better, an ensemble of positive-negative pairs or
a centered spherical simplex ensemble?: Monthly Weather Review, 132, 1590–1605,
https://doi.org/10.1175/1520-0493(2004)132%3C1590:wibaeo%3E2.0.co;2.
Warner, M., A. Ratcliffe, T. Nangoo, J. Morgan, A. Umpleby, N. Shah, V. Vinje, I. Stekl, L. Guasch, C.
Win, G. Conroy, and A. Bertrand, 2013, Anisotropic 3D full-waveform inversion: Geophysics,
78, R59–R80, http://doi.org/10.1190/geo2012-0338.1.
Zhu, H., E. Bozdag, and J. Tromp, 2015, Seismic structure of the European upper mantle based on adjoint
tomography: Geophysical Journal International, 201, 18–52, https://doi.org/10.1093/gji/ggu492.
1. Key Laboratory of Shale Gas and Geoengineering, Institute of Geology and Geophysics, Chinese Academy of
Sciences
2. University of Chinese Academy of Sciences
respectively. The cross-correlation between observed and 𝐶𝑎𝑢𝑡𝑜 (𝜏) = ∫ 𝑝𝑜 (𝒓𝑔 , 𝑡; 𝒓𝑠 )𝑝𝑜 (𝒓𝑔 , 𝑡 + 𝜏; 𝒓𝑠 )𝑑𝑡 , (4)
calculated wavefields is given as
where 𝜏 ∈ [−𝑚𝑎𝑥𝑡𝑎𝑢, 𝑚𝑎𝑥𝑡𝑎𝑢] is a linear weight function
𝐶𝑐𝑟𝑜𝑠𝑠 (𝜏) = ∫ 𝑝𝑐 (𝒓𝑔 , 𝑡; 𝒓𝑠 )𝑝𝑜 (𝒓𝑔 , 𝑡 + 𝜏; 𝒓𝑠 )𝑑𝑡 . (1) in equation(3), and 𝐶𝑎𝑢𝑡𝑜 is the auto-correlation of observed
Conventional correlation-based misfit function for wave- wavefields. The new misfit function attempts to drive cross-
equation traveltime inversion (Van Leeuwen and Mulder, correlation to match auto-correlation due to the least-square
2010) is defined as connection between them. Finally, the desires solution will
1 locate at the global minimum, and the traveltime shift will
𝐸 = ∑ ∫{𝑊(𝑡) ∙ 𝐶𝑐𝑟𝑜𝑠𝑠 (𝜏)}2 𝑑𝜏 , (2) tend to zero. This new misfit function combines all
2
𝒓𝑔 ,𝒓𝑠 advantages of previous proposed approaches. There is no
where 𝑊(𝑡) is the weight function that penalize energy at need to pick traveltime as an optimization criterion, and it
non-zero time lags. We use a 1D seismogram test to illustrate can prompt the inversion to converge to global minimum.
the first problem of conventional correlation-based misfit
function. Figure 1 shows the calculated and observed Comparison between Born and Rytov approximation
seismogram and their auto-correlation and cross-correlation.
In the process of traveltime inversion, we try to minimize the Born and Rytov approximation both are derived under the
cross-correlation. However, if the auto-correlation of assumption of weak scattering of wavefields. However,
observed wavefields has some energy at non-zero traveltime Born approximation make use of the amplitude of
shift, when the cross-correlation is minimized, there still has wavefields, while Rytov approximation based on the
traveltime shift exist. Thus, the desired solution cannot complex wavefields phase.
locate at the global minimum (black solid line in Figure 1b),
and it will be stuck in the local minimum (black dotted line Born approximation: We give a function 𝑂(𝒓) = 𝑣0 /
in Figure 1b). 𝑣(𝒓) − 1 which represents the velocity perturbations.
(a) Substituting 𝑂(𝒓) into scalar acoustic wave equation in
frequency domain
𝜔2
(∆ + 2 ) 𝑝(𝒓) = 0 , (5)
𝑣 (𝒓)
we can obtain
∆𝑝 + 𝑘02 𝑝 = −2𝑘02 𝑂(𝒓)𝑝(𝒓) − 𝑘02 𝑂 2 (𝒓)𝑝(𝒓) . (6)
Assuming that the wavefields has series solution 𝑝 =
∑∞ 𝑡ℎ
𝑚=0 𝑝𝑚 , where 𝑝𝑚 is related to 𝑚 -order of 𝑂(𝒓). Then,
(b) substituting the series solution into equation (6) and
comparing each order of 𝑂(𝒓), we can get the first-order
Born approximation
𝑝1 (𝒓) = −2𝑘02 ∫ 𝑑𝒓′ 𝐺 (𝒓 − 𝒓′ )𝑂(𝒓′ )𝑝0 (𝒓′ ) , (7)
where 𝐺 is the Green’s function, 𝑝0 and 𝑘0 represent the
incident wavefields and background wavenumber,
respectively. When the scale of the object (it means the
velocity perturbations) is small, we can use the incident
Figure 1: (a) The observed and calculated seismogram, (b) wavefields 𝑝0 to replace the total wavefields 𝑝. However,
are their auto-correlation and cross-correlation. when the scale of the object is too large, Born approximation
is invalid. Born approximation can be applied only when the
magnitude of the differential field is smaller than the
The new least-square correlation-based misfit function incident wavefields, which implies the change in phase
between the incident field and the wave propagating through
To avoid the local minima caused by conventional the object is less than π (Woodward, 1992).
correlation-based misfit function, we take the auto-
correlation into consideration, and extend the optimized Rytov approximation: If we consider the incident wave as
correlation-based misfit function (Choi and Alkhalifah, 2016) a plane wave 𝑝 = 𝐴𝑒 Ψ , where Ψ = 𝑖𝜔𝑡 is the complex
to propose a new least-square correlation-based misfit wavefields phase. Assuming that the phase has series
function, which defined as solutionΨ = ∑∞
𝑚=0 Ψ𝑚 , and using the same derivation of
Born approximation, we can obtain the first-order Rytov with respect to the model parameter (slowness s(r)), we can
approximation obtain
2𝑘02 𝜕𝐸 𝜕𝐶𝑐𝑟𝑜𝑠𝑠 𝜕∆𝜏
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
EXAMPLES
Gradient and Adjoint source In the second example, we apply our method in a complex
near surface model shown in Figure 4b to illustrate the
From the above discussion, we recognize that Rytov capability of our approach in near surface velocity
approximation is a better choice for traveltime inversion. reconstruction. The grid size of the model is 833*100 with
Thereupon we utilize Rytov approximation to derive our grid interval of 10m. We use a source wavelet of peak
gradient. Following the idea of FTI, we ignore amplitude and frequency at 15Hz to model the data. Sources and receivers
put emphasis on more reliable traveltime information, are evenly distributed on the rugged surface. In order to
because amplitude can cause highly nonlinear when it comes obtain more accurate wavefields, we use Spectral Element
to traveltime inversion. We take the derivative of our new Method to simulate near surface seismic wave propagation.
least-square correlation-based misfit function in equation (3) Figure 4a is the irregular quadrilateral mesh for spectral
based method is shown in Figure 4c. it clearly shows its The conventional correlation-based misfit function for
gradient doesn’t produce the correct direction. Figure 4d traveltime inversion is hard to converge to global minimum.
displays the inverted model using our proposed method after Meanwhile, Born approximation based gradient is unable to
25 iterations, and it converges very well. The result of FWI obtain an accurate gradient direction. Therefore, we
which use our inverted result as initial model is shown in proposed a new least-square correlation-based misfit
Figure 4e, and it matches the true model very well. function to tackle the first problem. In order to solve the
second problem, we utilized Rytov approximation to derive
(a) (b) (c) the gradient and obtained a correct gradient direction. Based
on successful numerical experiments, we have validated our
method which could generate good convergent results, even
did not require an accurate initial model. Our approach
overcame or even eliminated the cycle-skipping problem,
and the inverted result could be a very good initial model for
FWI. Finally, we got a high resolution near surface velocity
model through the combination of our proposed method and
conventional FWI.
ACKNOWLEDGEMENTS
(a)
(b)
(c)
(d)
(e)
Figure 4: (a) The irregular quadrilateral mesh for spectral element method, the mesher is sparse for demonstration, (b) the true
complex near surface model, (c) the inversion result of conventional correlation-based traveltime inversion, (d) the inversion
result of our method, (e) the inversion result of FWI, using our inverted model as initial velocity model.
REFERENCES
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Chi, B., Y. Wang, Y. Liu, and L. Dong, 2012, Hybrid Born and Rytov scattering series and its application
in full waveform inversion: 82nd Annual International Meeting, SEG, Expanded Abstracts, 1–5,
http://doi.org/10.1190/segam2012-0934.1.
Choi, Y., and T. Alkhalifah, 2016, An optimized correlation-based full waveform inversion: 76th Annual
International Conference and Exhibition, EAGE, Extended Abstracts, Tu P1 13,
http://doi.org/10.3997/2214-4609.201600642.
Luo, Y., Y. Ma, Y. Wu, H. Liu, and C. Lei, 2016, Full-traveltime inversion: Geophysics, 81, no. 5, R261–
R274, http://doi.org/10.1190/geo2015-0353.1.
Luo, Y., and G. T. Schuster, 1991, Wave-equation traveltime inversion: Geophysics, 56, 645–663,
http://doi.org/10.1190/1.1443081.
Mulder, W., and R. E. Plessix, 2008, Exploring some issues in acoustic full-waveform inversion:
Geophysical Prospecting, 56, 827–841, http://doi.org/10.1111/j.1365-2478.2008.00708.x.
Nolet, G., 1987, Seismic tomography: Reidel.
Pratt, R. G., C. Shin, and G. J. Hicks, 1998, Gauss-Newton and full Newton methods in frequency domain
seismic waveform inversion: Geophysical Journal International, 133, 341–362,
https://doi.org/10.1046/j.1365-246X.1998.00498.x.
Shen, X., 2014, Early-arrival waveform inversion for near-surface velocity estimation: Ph.D. thesis,
Stanford Exploration Project, Geophysics Department.
Tarantola, A., 1984, Inversion of seismic reflection data in the acoustic approximation: Geophysics, 49,
1259–1266, http://doi.org/10.1190/1.1441754.
Tromp, J., D. Komatitsch, and Q. Liu, 2008, Full-traveltime inversion: Communications in
Computational Physics, 3, 1–32.
Van Leeuwen, T., and W. A. Mulder, 2010, A correlation-based misfit criterion for wave-equation
traveltime tomography: Geophysical Journal International, 182, 1383–1394,
https://doi.org/10.1111/j.1365-246X.2010.04681.x.
Woodward, M. J., 1992, Wave-equation tomography: Geophysics, 57, 15–26,
http://doi.org/10.1190/1.1443179.
Xia, J., R. D. Miller, C. B. Park, and G. Tian, 2002, Determining Q of near-surface materials from
Rayleigh waves: Journal of Applied Geophysics, 51, 121–129, http://doi.org/10.1016/S0926-
9851(02)00228-8.
Yuan, Y. O., F. J. Simons, and E. Bozdag, 2015, Multiscale adjoint waveform tomography for surface
and body waves: Geophysics, 80, no. 5, R281–R302, http://doi.org/10.1190/geo2014-0461.1.
Zhang, J., and M. N. Toksöz, 1998, Nonlinear refraction traveltime tomography: Geophysics, 63, 1726–
1737, http://doi.org/10.1190/1.1444468.
China University of Petroleum-Beijing, State Key Laboratory of Petroleum Resources and Prospecting, CNPC Key
Laboratory of Geophysical Exploration
Shifted-Wavelet Spectrum
1
the envelope data, G represents the Green’s function, and
denotes a temporal convolution. According to Equation
Amplitude
0
is shown in Figure 2b. The first-order acoustic wave
equation is used to produce the synthetic data. The forward
-0.5
modeling is performed with a high-order staggered grid
finite difference scheme in time domain. The grid interval
-1
-0.5 -0.4 -0.3 -0.2 -0.1 0
Time(s)
0.1 0.2 0.3 0.4 0.5 is 10m×10m in horizontal and vertical directions. When
generating synthetic seismograms, the Ricker source with
(b) Corresponding wavelets and their envelopes of (a) in time the peak frequency of 20 Hz was filtered with a 7-Hz low-
domain
cut taper. 45 sources are exploded at the depth of 50m with
Figure 1. Invariance of wavelet envelope under frequency shift the distance between adjacent sources of 100 m.
Then,
T u
p dt{Rres dcal E (d cal ) p 2 [ Rres d cal E (d cal ) p 2 ]h } , (7)
m 0 m
where
d cal Su S (G wTarget ) , (8)
(b)
Rres =E[d cal ] E[ P( dobs )] ,
p p
(9)
Figure 2. The true velocity model (a) and the linear initial model (b)
Figure 3 shows the behavior of the frequency shift filter, in The inverted model after 12 iterations of Marmousi by the
which Figure 3a and 3d demonstrate the original shot conventional envelope inversion is shown in Figure 4a. The
record generated by low-cut (<7 Hz) 20 Hz Ricker wavelet final result of EI + FWI is shown in Figure 4b. The velocity
and its envelope, respectively. Figure 3b shows the original model from the conventional EI method is seriously
shot record after filtered by the frequency shift filter using contaminated by high wavenumber information. In addition,
5 Hz Ricker wavelet as target wavelet with f 0 =7hz . the final result of EI + FWI is far away from the true model.
Compared with the original record, the filtered record in
Figure 3b has more events (black arrow) and a lower We firstly apply our proposed frequency shift filter on the
resolution. But Figure 3b and the reference shot record in data sets. Then, we take Equation 6 as the misfit function.
Figure 3c generated by 5 Hz Ricker wavelet have the same The result of our FSF method using 5 Hz Ricker wavelet as
envelope (Figure 3e and Figure 3f) or resolution (Knapp, target wavelet with frequency shift f 0 =7hz is shown in
1990). That is the relationship between low- and high- Figure 5a after 12 iterations. The final inverted model of
frequency information with constant frequency bandwidth FSF + FWI (Figure 5b) is closer to the true one than that
and the reason why we can use the filtered record to shown in Figure 4b. The experimental results demonstrate
recover the low-wavenumber components. that the long wavelength components recovered by FSF
Discussion
Conclusions
(b)
In this paper, we designed a frequency shift filter into the
Figure 4. (a) The result of conventional EI method. (b) The final
misfit function to build a relationship between low- and
result of EI + FWI.
high-frequency information, based on the invariance of
wavelet envelope under frequency shift. Therefore, we can
use the high-frequency signal in seismic data to recover the
low-wavenumber components in the model. The
effectiveness of the FSF method is proved through a 2D
synthetic data set generated from the Marmousi model. The
numerical experimental results demonstrate that the
proposed FSF method can update the long-wavelength
information effectively and build a better initial model for
FWI.
Acknowledgments
(b)
Figure 5. (a) The result of FSF method. (b) The final result of FSF
+ FWI.
REFERENCES
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Alkhalifah, T., 2015, Scattering-angle based filtering of the waveform inversion gradients: Geophysical
Journal International, 200, 363–373, http://dx.doi.org/10.1093/gji/ggu379.
Chi, B. X., L. G. Dong, and Y. Z Liu, 2015, Correlation-based reflection full-waveform inversion:
Geophysics, 80, no. 4, R189–R202, http://dx.doi.org/10.1190/geo2014-0345.1.
Hu, W., 2014, FWI without low frequency data-beat tone inversion: 84th Annual International Meeting,
SEG, Expanded Abstracts, 1116–1120, http://dx.doi.org/10.1190/segam2014-0978.1.
Knapp, R. W., 1990, Vertical resolution of thick beds, thin beds, and thin-bed cyclothems: Geophysics,
55, no.6, 1183–1190, http://dx.doi.org/10.1190/1.1442934.
Köhn, D., 2011, Time domain 2D elastic full waveform tomography: Ph.D. dissertation, Kiel University.
Ma, Y., and Hale, D., 2013, Wave-equation reflection traveltime inversion with dynamic warping and
full-waveform inversion: Geophysics, 78, no. 6, R223–R233, http://dx.doi.org/10.1190/geo2013-
0004.1.
Qu, Y. M., Z. C. Li, J. P. Huang, and J. L. Li, 2016, Viscoacoustic anisotropic full waveform inversion:
Journal of Applied Geophysics, 13, no. 3, 511–518,
http://dx.doi.org/10.1016/j.jappgeo.2016.12.001.
Shin, C., and Y. H. Cha, 2008, Waveform inversion in the Laplace domain: Geophysical Journal
International, 173, 922–931, http://dx.doi.org/10.1111/j.1365-246X.2008.03768.x.
Tarantola, A., 1984, Inversion of seismic reflection data in the acoustic approximation: Geophysics, 49,
1259–1266, http://dx.doi.org/10.1190/1.1441754.
Virieux, J., and S. Operto, 2009, An overview of full-waveform inversion in exploration geophysics:
Geophysics, 74, no. 6, WCC1–WCC26, http://dx.doi.org/10.1190/1.3238367.
Wang, G. C., S. X. Wang, and S. Y. Yuan, 2017, Multi-scale envelope inversion method based on scale
separation: 79th Annual International Meeting, EAGE, Expanded Abstracts.
Wu, R.-S., J. Luo, and B. Wu, 2014, Seismic envelope inversion and modulation signal model:
Geophysics, 79, no. 3, WA13–WA24, http://dx.doi.org/10.1190/geo2013-0294.1.
Xu, S., D. Wang, Y. Chen, Y. Zhang, and G. Lambare, 2012, Full waveform inversion for reflected
seismic data: 74th Annual International Conference and Exhibition, EAGE, Extended Abstracts,
http://dx.doi.org/10.3997/2214-4609.20148725.
Texas at Austin
Building accurate subsurface elastic parameters using FWI observed data d obs and the predicted data d pre , and the
has become increasingly popular in exploration application superscript † represents the adjoint operator. In FWI, d pre
(Tarantola 1984; Pratt 1999; Virieux and Operto 2009).
FWI in time (Mora 1987) and frequency domains (Pratt et has a non-linear relation with m . The best set of model
al., 1998) have been demonstrated to be effective in parameters describing the data can be found when the
recovering subsurface model parameters with the help of misfit function reaches its minimum.
low frequency and long offset data (Mora 1988; Bunks et
al., 1995; Xue et al., 2016). Assuming our model is close to the true model, the non-
linear problem can be linearized by expanding the misfit
FWI is typically formulated as a local optimization method function in the vicinity of the best solution (Pratt et al.,
where the gradient of the misfit function are required to 1998; Virieux and Operto 2009). We can apply the gradient
minimize the differences between observed and predicated method to update the model parameters iteratively by
data. Both SI (Chen et al., 2007; Tao and Sen 2013a) and
adjoint-state methods (Plessix 2006) can be used to m k+1 = m k + α∇ m E k , (2)
construct the gradient. In exploration application, where the
number of sources is generally significant smaller than the where k is the iteration number and α is the step length
number of receivers, implementing the adjoint-state method along the updating direction. ∇ m E is the gradient of the
is believed to be more efficient than the SI method.
Nevertheless, both methods become computationally misfit function with respect to the model parameter.
expensive with increasing seismic data volume. ∇ m E = Re(J †δ d), where J † is the conjugate transpose of
Simultaneous source (Herrmann et al., 2009; Ben-Hadj-Ali the Frechet derivative matrix. In frequency domain,
et al., 2011) and plane wave (Vigh and Starr 2008; Tao and assuming constant density acoustic media and weak
Sen 2013b) FWI methods are proposed to improve the scattering (Sirgue and Pratt 2004, Zhao 2017), the gradient
computation efficiency. can be written element wise as
ps pr
δ d (s, r, ω ) = ω 4 ∫∫ δ d (p s , p r , ω )exp[−iω Plane wave Green’s functions (i.e. Equation (8)) do not
(5)
(p s ⋅ (s − x ref ) + p r ⋅ (r − x ref ))]dp s dp r , depend on source or receiver locations. A plane wave
Green’s function can be used for either p s or p r . Based on
where p s and p r are the source and receiver plane wave plane waves ranges and model setups, the number of
ray-parameters, respectively. xref is the reference point for Green’s functions need to be computed for the entire model
is relatively small. Therefore, computational efficiency of
DPW transform, and δ d (p s , p r , ω ) is the difference computing the gradient can be greatly improved. Selecting
between observed and predicted DPW data. We will only one p s - p r pair from Equation (7), we identify that
explain how to compute δ d (p s , p r , ω ) later in this section. the sensitivity kernel can be written as
×dp s dp r dsdr). The number of plane wave Green’s functions required for
computing the gradient without aliasing depends on
Reorganizing terms and slant stacking Green’s functions in frequencies, source line and receiver line lengths. The
Equation (6), we obtain quantitative requirement for representing a spherical wave
with plane waves is given by (Zhang et al., 2005)
pmax − pmin 1
Δp = ≤ (11)
Np fL
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
NUMERICAL EXAMPLE
CONCLUSIONS
REFERENCES
Ben-Hadj-Ali, H., S. Operto, and J. Virieux, 2011, An efficient frequency-domain full waveform
inversion method using simultaneous encoded sources: Geophysics, 76, no. 4, R109–R124,
http://dx.doi.org/10.1190/1.3581357.
Brossier, R., S. Operto, and J. Virieux, 2009, Seismic imaging of complex onshore structures by 2D
elastic frequency-domain full-waveform inversion: Geophysics, 74, no. 6, WCC105–WCC118,
http://dx.doi.org/10.1190/1.3215771.
Bunks, C., F. M. Saleck, S. Zaleski, and G. Chavent, 1995, Multiscale seismic waveform inversion:
Geophysics, 60, 1457–1473, http://dx.doi.org/10.1190/1.1443880.
Chen, P., T. H. Jordan, and L. Zhao, 2007, Full three-dimensional tomography: A comparison between
the scattering-integral and adjoint-wavefield methods: Geophysical Journal International, 170,
175–181, http://dx.doi.org/10.1111/j.1365-246X.2007.03429.x.
Fokkema, J. T., and P. M. van den Berg, 1992, Reflector imaging: Geophysical Journal International, 110,
191–200, http://dx.doi.org/10.1111/j.1365-246X.1992.tb00721.x.
Herrmann, F. J., Y. A. Erlangga, and T. T. Y. Lin, 2009, Compressive simultaneous full-waveform
simulation: Geophysics, 74, no. 4, A35–A40, http://dx.doi.org/10.1190/1.3115122.
Mora, P., 1987, Nonlinear two-dimensional elastic inversion of multioffset seismic data: Geophysics, 52,
1211–1228, http://dx.doi.org/10.1190/1.1442384.
Mora, P., 1988, Elastic wave-field inversion of reflection and transmission data: Geophysics, 53, 750–
759, http://dx.doi.org/10.1190/1.1442510.
Plessix, R. E., 2006, A review of the adjoint-state method for computing the gradient of a functional with
geophysical applications: Geophysical Journal International, 167, 495–503,
http://dx.doi.org/10.1111/j.1365-246X.2006.02978.x.
Pratt, R. G., 1999, Seismic waveform inversion in the frequency domain: Part 1, theory and verification in
a physical scale model: Geophysics, 64, 888–901, http://dx.doi.org/10.1190/1.1444597.
Pratt, R. G., C. Shin, and G. J. Hick, 1998, Gauss–newton and full newton methods in frequency–space
seismic waveform inversion: Geophysical Journal International, 133, 341–362,
http://dx.doi.org/10.1046/j.1365-246X.1998.00498.x.
Sen, M. K., and L. N. Frazer, 1991, Multifold phase space path integral synthetic seismograms:
Geophysical Journal International, 104, 479–487, http://dx.doi.org/10.1111/j.1365-
246X.1991.tb05695.x.
Sirgue, L., and R. G. Pratt, 2004, Efficient waveform inversion and imaging: A strategy for selecting
temporal frequencies: Geophysics, 69, 231–248, http://dx.doi.org/10.1190/1.1649391.
Stoffa, P. L., M. K. Sen, R. K. Seifoullaev, R. C. Pestana, and J. T. Fokkema, 2006, Plane-wave depth
migration: Geophysics, 71, no. 6, S261–S272, http://dx.doi.org/10.1190/1.2357832.
Tao, Y., and M. K. Sen, 2013a, Frequency-domain full waveform inversion with a scattering-integral
approach and its sensitivity analysis: Journal of Geophysics and Engineering, 10, 065008.
Tao, Y., and M. K. Sen, 2013b, Frequency-domain full waveform inversion with plane-wave data:
Geophysics, 78, no. 1, R13–R23, http://dx.doi.org/10.1190/geo2012-0267.1.
Tarantola, A., 1984, Inversion of seismic reflection data in the acoustic approximation: Geophysics, 49,
1259–1266, http://dx.doi.org/10.1190/1.1441754.
Virieux, J., and S. Operto, 2009, An overview of full-waveform inversion in exploration geophysics:
Geophysics, 74, no. 6, WCC1–WCC26, http://dx.doi.org/10.1190/1.3238367.
Xue, Z., N. Alger, and S. Fomel, 2016, Full-waveform inversion using smoothing kernels: 86th Annual
International Meeting, SEG, Expanded Abstracts, 1358–1363,
http://dx.doi.org/10.1190/segam2016-13948739.1.
Zhang, Y., J. Sun, C. Notfors, S. H. Gray, L. Chernis, and J. Young, 2005, Delayed-shot 3D depth
migration: Geophysics, 70, no. 5, E21–E28, http://dx.doi.org/10.1190/1.2057980.
Zhao, Z., 2017, Sensitivity kernel for double plane wave full waveform inversion: 87th Annual
International Meeting, SEG, Expanded Abstracts, submitted.
Zhao, Z., M. K. Sen, and P. L. Stoffa, 2015a, Plane wave reverse time migration in VTI media using
Green’s function: 85th Annual International Meeting, SEG, Expanded Abstracts, 4002–4007,
http://dx.doi.org/10.1190/segam2015-5926023.1.
Zhao, Z., M. K. Sen, and P. L. Stoffa, 2016, Double-plane-wave reverse time migration in the frequency
domain: Geophysics, 81, no. 5, S367–S382, http://dx.doi.org/10.1190/geo2015-0687.1.
Zhao, Z., M. K. Sen, and P. L. Stoffa, 2017, Double plane-wave reverse-time migration: Geophysical
Prospecting, http://dx.doi.org/10.1111/1365-2478.12507.
Zhao, Z., M. K. Sen, P. L. Stoffa, and H. Zhu, 2015b, Double plane wave least squares reverse time
migration: 85th Annual International Meeting, SEG, Expanded Abstracts, 4170–4174,
http://dx.doi.org/10.1190/segam2015-5924229.1.
Calgary
SUMMARY
To establish a better-posed reconstruction of low
Although full waveform inversion (FWI) is a highly wavenumber model components, and help FWI to converge
nonlinear inverse problem, it is usually solved as a local to a global minimum, several approaches can be employed,
optimization problem under a linear approximation, where e.g., by retrieving the low frequency information in the data
small angle backscattered data (via the residuals) are as in Laplace-domain and Laplace-Fourier-domain
treated linearly. Adding nonlinearity within each update inversion (Shin and Cha, 2008, 2009), envelop inversion
may have important consequences for convergence rates (Wu, Luo, and Wu, 2014); by building the low
and parameter accuracy. One approach is to include higher- wavenumber background model through reflection
order scattering terms into the sensitivities during the information alone (Xu et al., 2012, Brossier, Operto, and
construction of the gradient, by varying not the current but Virieux, 2015), or, together with refractions (Wang et al.,
the updated model at each iteration. By applying inverse 2015, Zhou et al., 2015); and by building the background
scattering theory, this additional sensitivity term can be model and perturbation simultaneously, in the data domain
computed from the data residuals at the current iteration. A or mixed data/image domain (Sun and Symes, 2012,
nonlinear frequency-domain FWI inversion scheme, with Albertin, Shan, and Washbourne, 2013, Biondi and
an inner and an outer loop, implementing this idea is Almomin, 2014, Wu and Alkhalifah, 2015, Alkhalifah and
presented here. A perturbation is inverted from the data Wu, 2016).
residuals within the inner loop, and the descent direction
based on the nonlinear sensitivity to update the model is Mitigation of linearization errors within each iteration
computed involving this perturbation in the outer loop. We could have a significant impact on convergence rates and
test this nonlinear FWI on acoustic single-parameter inversion results. Using nonlinear sensitivities, e.g., in
Marmousi synthetics. The inverted results vary depending resistivity inversion (Mcgillivray and Oldenburg, 1990),
on data frequency ranges and initial models, but we optical imaging (Kwon and Yazici, 2010) and seismic
conclude that the nonlinear FWI has the capability to inversion (Wu and Zheng, 2014, Innanen, 2015) has been
generate high resolution model estimates in both shallow considered. Here, we consider an extension of frequency
and deep regions, and to converge rapidly, relative to a domain FWI (e.g., Sirgue and Pratt, 2004) to incorporate
benchmark FWI approach involving the standard gradient. nonlinear sensitivities, building on the procedure
introduced by Innanen (2015). A two-loop inversion
INTRODUCTION scheme is employed, in which, in the inner iterations, a
perturbation is determined from the data residuals using
Although FWI (Lailly, 1983, Tarantola, 1984, Virieux and linear inversion, and this is used, in the outer iterations, to
Operto, 2009) is a highly nonlinear inverse problem, it is determine a descent direction which in principle anticipates
usually solved as a local optimization problem in the some of the curvature of the objective function caused by
framework of the Born approximation. The fundamentally data-model nonlinearity. In wave physics terms, the
nonlinear data-model relationship is accounted mainly introduction of higher order sensitivities involves
through iteration and updating of linearized relationships. transmission wave paths from each scattering point to both
Linearization error in seismic inversion can be significant sources and receivers at the surface. The application on the
in many cases (Wu and Zheng, 2014), especially when Marmousi model shows that this nonlinear FWI converges
large-contrast and spatially sustained perturbations are more rapidly than does FWI with a conventional gradient,
involved. The former situation prevails when large-angle and appears to be adept at reconstructing low wavenumber
reflection information is being considered, which, in components even absent rich low frequencies.
reflection configurations, is the regime in which many
high-priority elastic and/or anisotropic parameters are THEORY
distinguished (Innanen, 2015). The Fréchet derivative, or
sensitivity, restricts the resolution of FWI and could cause a Nonlinear sensitivity
strong deficit of low wavenumber components in the
updated model due to the lack of large-aperture In this paper, the space/frequency-domain isotropic
illumination and low frequency information. Furthermore, acoustic wave equation with constant density is used to
cycle-skipping artifacts occur when the Born describe wave motion:
approximation is no longer valid, leading the optimization
to a local minimum (Virieux and Operto, 2009).
s r G r, r , r r ,
2 2
s s (1)
where is the frequency, s r is the squared slowness, rs is which includes multiple interactions with sn r and one
the source location, and G r, rs , is the Green’s function. interaction with s in the remaining terms in equation (5).
For simplicity, we hereafter omit in all wave
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
gn r Re d s G r , r | s P r, r
2 *
n g n s | sn outperforms SD FWI; the NFWI result is particularly close
to the true model.
rs , rg
, (12)
P rg , r | sn P r, rs | sn G rg , r | sn P r, rs | sn
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
P rg , r | sn 1 P r, rs | sn G rg , r | sn P r, rs | sn 1
.(14)
Re d s G r , r | s P r, r
rs ,rg
2 *
n 1 g n 1 s | sn 1
P rg , r | sn 1 P r, rs | sn 1
EXAMPLES
CONCLUSIONS
ACKNOWLEDGEMENTS
REFERENCES
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Albertin, U., G. Shan, and J. Washbourne, 2013, Gradient orthogonalization in adjoint scattering-series
inversion: 75th Annual International Conference and Exhibition, EAGE, Extended Abstracts,
1058–1062, https://doi.org/10.1190/segam2013-0580.1.
Alkhalifah, T., and Z. D. Wu, 2016, The natural combination of full and image-based waveform
inversion: Geophysical Prospecting, 64, 19–30, http://doi.org/10.1111/1365-2478.12264.
Biondi, B., and A. Almomin, 2014, Simultaneous inversion of full data bandwidth by tomographic full-
waveform inversion: Geophysics, 79, no. 3, WA129–WA140, https://doi.org/10.1190/geo2013-
0340.1.
Brossier, R., S. Operto, and J. Virieux, 2015, Velocity model building from seismic reflection data by
full-waveform inversion: Geophysical Prospecting, 63, 354–367, http://doi.org/10.1111/1365-
2478.12190.
Innanen, K. A., 2015, Full waveform inversion updating in the presence of high angle/high contrast
reflectivity: 85th Annual International Meeting, SEG, Expanded Abstracts, 1314–1319,
https://doi.org/10.1190/segam2015-5924279.1.
Kwon, K., and B. Yazici, 2010, Born expansion and Frechet derivatives in nonlinear diffuse optical
tomography: Computers & Mathematics with Applications, 59, 3377–3397,
https://doi.org/10.1016/j.camwa.2009.07.088.
Lailly, P., 1983, The seismic inverse problem as a sequence of before stack migrations: Conference on
Inverse Scattering, Theory and Application, Society of Industrial and Applied Mathematics,
Expanded Abstracts, 206–220.
Mcgillivray, P. R., and D. W. Oldenburg, 1990, Methods for calculating Frechet derivatives and
sensitivities for the nonlinear inverse problem — a comparative-study: Geophysical Prospecting,
38, 499–524, http://doi.org/10.1111/j.1365-2478.1990.tb01859.x.
Metivier, L., R. Brossier, J. Virieux, and S. Operto, 2013, Full waveform inversion and the truncated
Newton method: SIAM Journal on Scientific Computing, 35, B401–B437,
https://doi.org/10.1137/120877854.
Pan, W. Y., K. A. Innanen, and W. Y. Liao, 2017, Accelerating Hessian-free Gauss-Newton full-
waveform inversion via l-BFGS preconditioned conjugate-gradient algorithm: Geophysics, 82,
no. 2, R49–R64, https://doi.org/10.1190/geo2015-0595.1.
Shin, C., and Y. H. Cha, 2008, Waveform inversion in the Laplace domain: Geophysical Journal
International, 173, 922–931, http://doi.org/10.1111/j.1365-246X.2008.03768.x.
Shin, C., and Y. H. Cha, 2009, Waveform inversion in the Laplace-Fourier domain: Geophysical Journal
International, 177, 1067–1079, http://doi.org/10.1111/j.1365-246X.2009.04102.x.
Sirgue, L., and R. G. Pratt, 2004, Efficient waveform inversion and imaging: A strategy for selecting
temporal frequencies: Geophysics, 69, 231–248, https://doi.org/10.1190/1.1649391.
Sun, D., and W. W. Symes, 2012, Waveform inversion via non-linear differential semblance
optimization: 82nd Annual International Meeting, SEG, Expanded Abstracts, 1–7,
https://doi.org/10.1190/segam2012-1190.1.
Tarantola, A., 1984, Inversion of seismic-reflection data in the acoustic approximation: Geophysics, 49,
1259–1266, https://doi.org/10.1190/1.1441754.
Virieux, J., and S. Operto, 2009, An overview of full-waveform inversion in exploration geophysics:
Geophysics, 74, no. 6, WCC1–WCC26, https://doi.org/10.1190/1.3238367.
SUMMARY 2009) can be used. Ma and Hale (2013) used dynamic warp-
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
w(t) is the unknown local warping function. Once I solve the seismogram registration problem, the inverted
local time warping function quantifies travel time differences
Z T between observations and predictions. Then I can define a mis-
1
c(w) = [d(t + w(t)) p(t)]2 dt , (1) fit function based on the local warping function and apply it in
2 0 velocity model building procedure. Similar to Luo and Schus-
ter (1991) and Ma and Hale (2013), I choose the following
Local warping of a signal can be implemented by shifting the
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
misfit function
coordinate of input signal and then performing interpolation to
regular coordinate. In this study, I use cubic spline functions
for the interpolation. Z
Ns Nr T
1 XX
Considering the non-uniqueness and nonlinearity of this opti- J= [w(t)]2 dt , (6)
2 0
mization problem, I add regularization to the misfit function. s=1 r=1
The choices of regularization schemes depend on the a priori
where Ns and Nr are the number of sources and receivers, re-
assumption of solutions, either L2 or L1 regularization can be
spectively.
used. If the a priori assumption is that the solution should be
smooth and with finite energies, we can add Tikhonov regular- In order to use this misfit function is waveform inversion, I
ization to the misfit function c(w) as need to compute adjoint sources, which are used to drive ad-
joint wavefields and construct misfit gradients (Tromp et al.,
2005). Based on Plessix (2006), the misfit gradient can be
Z T 2 Z T
L2 dw(t) computed as
c (w) = c(w) + l1 dt + l2 [w(t)]2 dt ,
0 dt 0 Ns Nr Z Ns Nr Z
XX T XX T
(2) ∂J ∂ w(t) ∂ w(t) ∂ p(t)
= w(t) dt = w(t) dt ,
where l1 and l2 are the regularization parameters. ∂ c(x) 0 ∂ c(x) 0 ∂ p(t) ∂ c(x)
s=1 r=1 s=1 r=1
In this study, I choose sparsity constraints with the a priori as- (7)
∂ w(t)
sumption that the local time warping function should be sparse where c(x) is the velocity and I use chain rule for ∂ c(x)
. For
with some basis functions. Therefore, the misfit function used ∂ p(t)
∂ c(x)
,
I use the sensitivity kernel based on Born approxima-
for the seismogram registration problem is tion (Tarantola, 1984; Luo and Schuster, 1991)
Z T
c L1 (w) = c(w) + l3 |w(t)|dt . (3)
0 ∂ p(t) 2 0 ∂ 2 p(x0 ; xs )
= G(xr ; x ) ⇤ , (8)
where l3 is the regularization parameter which can be chosen ∂ c(x0 ) c3 ∂t 2
RT
based on the relative magnitudes of c(w) and 0 |w(t)|dt. Substitute the sensitivity kernel into Equation 7, and I can de-
Considering the potential local minima and cycle skipping prob- fine adjoint source f † (t) for each pair of shot and receiver as
lems of the seismogram registration problem (Baek et al., 2013), ∂ w(t)
in this study, I choose a MCMC method (Mosegaard and f † (t) = w(t) , (9)
∂ p(t)
Tarantola, 1995) to minimize the misfit function c L1 (w), which
allows us to directly sample the global minimum and avoid lo-
cal minima. The efficiency of the MCMC method depends Therefore, the key procedure is to compute the derivative of
on the size of model space. In order to reduce the number the local warping function with respect to the prediction. Luo
of unknowns and impose smoothness to the solution, I choose and Schuster (1991) provided a connectivity function method
the following model parameterization with cubic spline basis to compute such kind of derivative. I introduce a connectivity
function fk (t). Thus, the unknown model parameters are re- function F(t) and rewrite the previous expression as
duced to the coefficients of the spline functions, i.e., wk .
∂ F(t) ∂ F(t)
f † (t) = w(t) / , (10)
N
X ∂ p(t) ∂ w(t)
w(t) = fk (t)wk , (4) The connectivity function F(t) has to satisfy
k=1
= d(t , (13)
∂ p(t)
and
Figure 1: Seismogram registration with a random time warp-
Z ing function. (a) compares an input signal (red) and its time
∂F ⇥ ⇤2
= [d(t + w(t)) p(t)] d(t ˙ + w(t))
¨ +w(t))+ d(t dt , warped version (dashed blue) with a local warping function in
∂ w(t) (c). (b) compares the input (red) and unwarped (dashed blue)
(14) signals. (c) compares synthetic (blue) and recovered (dashed
Assuming that once I have obtained the correct warping func- red) local warping functions. (d) presents the evolution of mis-
tion, the warped data should approximate prediction, i.e., fit c L1 .
NUMERICAL EXAMPLES
c d
REFERENCES
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Baek, H., H. Calandra, and L. Demanet, 2013, Velocity estimation via registration-guided least-squares
inversion: Geophysics, 79, no. 2, R79–R89, http://doi.org/10.1190/geo2013-0146.1.
Bunks, C., F. M. Saleck, S. Zaleski, and G. Chavent, 1995, Multiscale seismic waveform inversion:
Geophysics, 60, 1457–1473, http://doi.org/10.1190/1.1443880.
Diaz, E., and P. Sava, 2017, Seismic tomography using local correlation functions: Geophysics, in press.
Engquist, B., and B. Froese, 2014, Application of the Wasserstein metric to seismic signals:
Communications in Mathematical Sciences, 12, no. 5, 979–988,
http://doi.org/10.4310/CMS.2014.v12.n5.a7.
Fichtner, A., B. Kennett, H. Bunge, and H. Igel, 2008, Theoretical background for continent and global
scale full-waveform inversion in the time-frequency domain: Geophysical Journal International,
175, 665–685, http://doi.org/10.1111/gji.2008.175.issue-2.
Fomel, S., and L. Jin, 2009, Time-lapse image registration using the local similarity attribute: Geophysics,
74, no. 2, A7–A11, http://doi.org/10.1190/1.3054136.
Hale, D., 2006, Fast local cross-correlation of images: 76th Annual International Meeting, SEG,
Expanded Abstracts, 3160–3163.
Hale, D., 2013, Dynamic warping of seismic images: Geophysics, 78, no. 2, S105–S115,
http://doi.org/10.1190/geo2012-0327.1.
Luo, Y., and G. Schuster, 1991, Wave-equation traveltime inversion: Geophysics, 56, 645–653,
http://doi.org/10.1190/1.1443081.
Ma, Y., and D. Hale, 2013, Wave-equation reflection traveltime inversion with dynamic warping and full-
waveform inversion: Geophysics, 78, no. 6, R223–R233, http://doi.org/10.1190/geo2013-0004.1.
Maggi, A., C. Tape, M. Chen, D. Chao, and J. Tromp, 2009, An automated time-window selection
algorithm for seismic tomography: Geophysical Journal International, 178, 257–281,
http://doi.org/10.1111/gji.2009.178.issue-1.
Metivier, L., R. Brossier, Q. Merigot, E. Oudet, and J. Virieux, 2016a, Measuring the misfit between
seismograms using an optimal transport distance: Application to full waveform inversion:
Geophysical Journal International, 205, 345–377, http://doi.org/10.1093/gji/ggw014.
Metivier, L., R. Brossier, Q. Merigot, E. Oudet, and J. Virieux, 2016b, An optimal transport approach for
seismic tomography: Application to 3D full waveform inversion: Inverse Problems, 32, 115008,
http://doi.org/10.1088/0266-5611/32/11/115008.
Mosegaard, K., and A. Tarantola, 1995, Monte Carlo sampling of solutions to inverse problems: Journal
of Geophysical Research, 100, 12431–12447, http://doi.org/10.1029/94JB03097.
Plessix, R., 2006, A review of the adjoint-state method for computing the gradient of a functional with
geophysical applications: Geophysical Journal International, 167, 495–503,
http://doi.org/10.1111/j.1365-246X.2006.02978.x.
Pratt, R. G., 1999, Seismic waveform inversion in the frequency domain, Part 1: Theory and verification
in a physical scale model: Geophysics, 64, 888–901, http://doi.org/10.1190/1.1444597.
Rothman, D., 1985, Nonlinear inversion, statistical mechanics, and residual statics estimation:
Geophysics, 50, 2784–2796, http://doi.org/10.1190/1.1441899.
Rothman, D., 1986, Automatic estimation of large residual statics corrections: Geophysics, 51, 332–346,
http://doi.org/10.1190/1.1442092.
Tomographic full waveform inversion (TFWI) (Symes, The conventional L2 objective function for TFWI can be
2008; Sun and Symes, 2012; Biondi and Almomin, 2012) is written as follows:
a new inversion procedure that retains all the advantages and
benefits of FWI while avoiding its strict initial model 1 2 𝜖
𝐽 = ‖𝐅̃(𝐦 ̃ ‖22 ,
̃ )𝐬 − 𝐝𝐨𝐛𝐬 ‖2 + ‖𝐀𝐦 (1)
requirement and cycle-skipping challenges. To achieve this 2 2
goal, TFWI modifies full waveform inversion (FWI) by
combining its classical form with a modified form of wave- where 𝐦̃ is the extended model, 𝐬 is the source function, 𝐅̃
equation migration-velocity analysis (WEMVA). This is the extended forward modeling operator, 𝐝𝐨𝐛𝐬 is the
combination manifests itself as an extension of the velocity observed surface data, and 𝐀 is the defocusing operator. The
model through virtual axes (Biondi and Almomin, 2013). ̃ ) can be written as follows:
gradient 𝐠(𝐦
There are a few convergence challenges that occur in The enhancing operator is required to create a kinematic
practice. First, the data-fitting term creates both the shift in the regularization term that can be back-projected
reflectors and tomographic updates, which makes it more into tomographic updates. In the case of differential
difficult to balance, or emphasize either amplitude or phase semblance optimization (DSO), the enhancing operator, i.e.,
fitting. Second, the data-fitting term takes several iterations the complement of the focusing operator, as described in
until it becomes sufficiently small mainly because of Equation 3, is:
amplitude differences resulting from using an adjoint instead
of an inverse. If the regularization term focuses the model 𝐪
̃ (𝐪) = (1 −
𝐄DSO 𝐦 ̃ (𝐪),
)𝐦 (5)
before the model data fits the observed data, then the data 𝐪max
residual does not produce tomographic updates. Third, the
regularization term continues to focus the model at a rate that where 𝐪 is the extended axis value, either in subsurface
only depends on the weighting term epsilon, regardless of offset or time lag. As described in the previous section, a
the data-fitting term, resulting in a strong sensitivity to proper enhancing operator should result in a residual that is
epsilon. If epsilon is too small, the focusing is too slow, and a 90 degree phase rotation of the modeled data. However,
the inversion might take thousands of iterations before the DSO operator creates a tomographic update without
producing any useful tomographic updates. If epsilon is too creating such a phase-rotated residual. Instead, we scale the
large, the model is focused too fast and results in cycle amplitude for the combined effect of all lags (or all offsets
skipping. in data space) to result in the desired tomographic update.
To solve the previous challenges, we want to modify the This DSO approach has a few shortcomings. First, DSO
regularization term for it to directly produce the tomographic assumes the amplitudes do not change significantly along the
updates instead of indirectly through the data-fitting term. reflectors or along offset. This assumption breaks down in
We can start by rewriting the regularization term as follows: many cases, such as in the presence of amplitude versus
offset (AVO) effects, complex geometry, or irregular
𝜖 𝜖 𝜖 acquisition. Second, DSO enhances the image by scaling
̃ ‖22 = ‖𝐦
‖𝐀𝐦 ̃ ‖22 = ‖𝐦
̃ − (𝐈 − 𝐀)𝐦 ̃ ‖22 , (3)
̃ − 𝐄𝐦
2 2 2 down the amplitudes along the extended axis. Because the
total energy in the data is conserved, this approach assumes
where 𝐄 is an enhancing or focusing operator which is the the energy reduction in the image is to be converted to
complement of 𝐀. It is easier to see that the regularization tomographic updates in the velocity model, therefore it
term minimizes the difference between the extended model indirectly focuses the reflection energy toward zero lag.
and an enhanced version of the extended model by focusing However, this assumption ignores the possibility that the
the model. This formulation makes it similar to the data- energy can simply be converted into reflectors in the null-
fitting term, however it is still missing the wave-equation space of the modeling operator, which are usually present at
operators. Therefore, we propose a new regularization term the edge of the model.
as follows:
We illustrate the effects of these shortcomings in a simple
1 2 𝜖 2 constant velocity synthetic example with a single reflector.
𝐽 = ‖𝐅̃(𝐦
̃ )𝐬 − 𝐝𝐨𝐛𝐬 ‖2 + ‖𝐅̃(𝐦
̃ )𝐬 − 𝐅̃(𝐄𝐦
̃ )𝐬‖2 , (4)
2 2 We calculate a shot profile from the modeled data with a
slow background velocity using an extended image. The
This new regularization term completely changes the extension of the image preserves all the kinematic
behavior of TFWI because it does not modify the model. information in the observed data. We then calculate a shot
Instead, it directly calculates a residual between the modeled profile from the modeled data with a slow background
data and focused modeled data, and back-projects it into a velocity using DSO operator regularization on the extended
tomographic update. This regularization residual is image. Figure 1 compares a modeled trace at 2 km offset
guaranteed to have the correct amount of kinematic shift (bottom) with the DSO regularization residual (middle). We
compared to the modeled data regardless of how well the can see there is no significant change in the phase when we
observed data is fit. Therefore, the data-fitting term only compare the DSO regularization residual to the modeled data
produces reflectors while the regularization term only and only a small amplitude change.
produces tomographic updates. Moreover, epsilon now
balances amplitude fitting and phase fitting without any We propose using a different enhancing operator that shifts
danger of cycle skipping at any value, making the inversion the energy toward the zero lag of the extended axis as
process less sensitive to epsilon. follows:
This shifting operator directly forces the energy to move range between amplitude-only methods (e.g. least-squares
toward the zero lag. The main advantage of this enhancing reverse time migration) and phase-only methods (e.g. phase-
operator compared to the DSO enhancing operator is that it only FWI).
rotates each trace by 90 degrees. This means that the
tomographic update does not depend on how the amplitudes ACKNOWLEDGMENTS
of different traces affect each other. In other words, we have
now removed the amplitude dependence. Furthermore, We wish to thank Saudi Aramco for the opportunity to
because the energy is directly focused, there are no artifacts publish this paper and the Stanford Exploration Project
at the edge of the model. affiliate companies for financial support.
CONCLUSIONS
Figure 3: The tomographic update of using DSO regularization of a Figure 4: The tomographic update of using shifting operator
fast anomaly (top) and a slow anomaly (bottom). regularization of a fast anomaly (top) and a slow anomaly (bottom).
REFERENCES
Almomin, A., and B. Biondi, 2013, Tomographic full-waveform inversion (TFWI) by successive
linearizations and scale separations: 83rd Annual International Meeting, SEG, Expanded
Abstracts, 1048–1052, http://dx.doi.org/10.1190/segam2013-1378.1.
Biondi, B., and A. Almomin, 2012, Tomographic full waveform inversion (TFWI) by combining full
waveform inversion with wave-equation migration velocity analysis: 82nd Annual International
Meeting, SEG, Expanded Abstracts, 1–5, http://dx.doi.org/10.1190/segam2012-0275.1.
Biondi, B., and A. Almomin, 2013, Tomographic full-waveform inversion (TFWI) by extending the
velocity model along the time-lag axis: 83rd Annual International Meeting, SEG, Expanded
Abstracts, 1031–1036, http://dx.doi.org/10.1190/segam2013-1255.1.
Fei, W., and P. Williamson, 2010, On the gradient artifacts in migration velocity analysis based on
differential semblance optimization: 80th Annual International Meeting, SEG, Expanded
Abstracts, 4071–4076, http://dx.doi.org/10.1190/1.3513710.
Pratt, R. G., 1999, Seismic waveform inversion in the frequency domain, Part 1: Theory and verification
in a physical scale model: Geophysics, 64, 888–901, http://dx.doi.org/10.1190/1.1444597.
Shen, P., and W. Symes, 2015, Horizontal contraction in image domain for velocity inversion:
Geophysics, 80, no. 3, R95–R110, http://dx.doi.org/10.1190/geo2014-0261.1.
Shin, C., and W. Ha, 2008, A comparison between the behavior of objective functions for waveform
inversion in the frequency and Laplace domains: Geophysics, 73, no. 5, VE119–VE133,
http://dx.doi.org/10.1190/1.2953978.
Sun, D., and W. Symes, 2012, Waveform inversion via non-linear differential semblance optimization:
82nd Annual International Meeting, SEG, Expanded Abstracts, 1–7,
http://dx.doi.org/10.1190/segam2012-1190.1.
Symes, W., 2008, Migration velocity analysis and waveform inversion: Geophysical Prospecting, 56,
765–790, http://dx.doi.org/10.1111/j.1365-2478.2008.00698.x.
than exponentially with traditional point-wise array storage, the second dimension, to form U(1,2) , and so on for the other
which makes it computationally tractable for parametrizing matrices. See (Da Silva and Herrmann, 2015) for more details.
high-dimensional problems. To help describe the HT format,
we give some preliminary definitions. By virtue of the recursive construction in Figure 1, it is not
necessary to store the intermediate matrices Usrc x, rec x or
Definition 1: The matricization X(l) of a tensor X 2 Usrc y, rec y . It is sufficient to reconstruct a d dimensional
Cn1 ⇥n2 ⇥···⇥nd reshapes the dimensions specified by full tensor by only storing d small matrices Ut and d 2
l = {l1 , l2 , . . . , li } ⇢ {1, 2, . . . , d} into the row indices small 3 dimensional transfer tensors Bt . Hence, the storage
and l c := {1, 2, . . . , d} \ l into the column indices of the matrix requirement is bounded above by dNk + (d 2)k3 + k2 ,
X(l) . where N = max{n1 , n2 , . . . , nd } and k is the maximum rank
(Hackbusch and Kühn, 2009). Compared to N d parameters
For example, given a 4 dimensional tensor X 2 Cn1 ⇥n2 ⇥n3 ⇥n4 , needed to store the full data, the HT format greatly reduces the
X (1,2) is an n1 n2 ⇥ n3 n4 matrix with the first two dimensions number of parameters needed to be stored and manipulated. For
along the rows and the last two dimensions along the columns. 3D seismic data, the internal ranks of tensor format increases as
Definition 2: The multilinear product of a three-dimensional temporal-frequency grows, so that lower frequencies compress
tensor X 2 Cn1 ⇥n2 ⇥n3 with the matrix Ai 2 Cmi ⇥ni for i = 1, 2, 3, more easily than higher frequencies, as shown in Table 1.
denoted A1 ⇥1 A2 ⇥2 A3 ⇥3 X, is defined in vectorized form as
vec(A1 ⇥1 A2 ⇥2 A3 ⇥3 X) := (A3 ⌦A2 ⌦A1 )vec(X). Intuitively,
this is simply multiplying the tensor X by Ai in the ith dimension Bsrc x,rec x,src y,rec y
for each i.
Definition 3: A dimension tree T for a d dimensional tensor Bsrc x,rec x Bsrc y,rec y
is a binary tree where each node is associated to a subset of
{1, 2, . . . , d}, the root node troot = {1, 2, . . . , d}, and each non-
leaf node t is the disjoint union of its left and right children,
t = tl [ tr , tl \ tr = 0./ We can think of a dimension tree as Usrc x Urec x Usrc y Urec y
defining a recursive partitioning of groups of dimensions, where
the dimensions present in the left child node are “separated”
from the dimensions in the right child node.
1
Figure 2: Non-canonical dimension tree for the HT format
T k34 T
B1234 U34 applied to seismic data
n 1 n2
n1 n2
X (1,2) = U12
n3 n 4
It is critical to note that the organization of the tensor has a
n3 n4 k12 major impact on its low-rank behaviour. In the seismic context,
we permute our data from the typical or canonical organization
B12 (source x, source y, receiver x, receiver y) into a non-canonical
n 1 n2
Table 1: Compression ratio comparison between non-canonical and canonical organizations with hierarchical Tucker truncation
method. Synthetic data is generated on the 3D Overthrust model with 502 sources and 3962 receivers, resulting in each frequency
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Given fully sampled seismic data in the non-canonical orga- Algorithm 1 Extracting a common shot gather from com-
nization, one can truncate the full tensor to HT form via the pressed hierarchical Tucker parameters
algorithm in (Tobler, 2012), given a prescribed error tolerance Input: Source position ix and iy , and dimension tree
and maximum inner rank. In the more realistic case when we 1. Extract the vector usrcx from the matrix Usrcx (ix , :)
have subsampled data, we can use the algorithm described in 2. Multiply Bsrcx,recx along the ksrcx dimension with the vec-
(Da Silva and Herrmann, 2015) to recover the full data volume tor usrcx (in the sense of Definition 2)
by solving 3. Multiply the matrix obtained from step 2 along the krecx di-
min kA f (x) bk22 . (2) mension (second dimension) with Urecx , resulting in a ma-
x
trix Usrcx,recx of size nrecx ⇥ k(srcx,recx)
Here x is the vectorized set of HT parameters (Ut , Bt ) from (1), 4. Repeat steps 1, 2, 3 along the y coordinate to obtain the ma-
f maps x to the fully-expanded tensor f (x), A is the subsam- trix Usrcy,recy of size nrecy ⇥ k(srcy,recy)
pling operator, and b is our subsampled data. This algorithm T
5. The product Usrcx,recx Bsrcx,recxUsrcy,recy results in the fi-
can interpolate each 4D monochromatic slice quickly and effi-
nal shot gather
ciently as it does not compute SVDs on large matrices.
ON-THE-FLY EXTRACTION OF SHOT/RECEIVER We generate our data from 4 frequencies between 3Hz and 6Hz
GATHERS with spacings of 1Hz using a 50 ⇥ 50 source grid with 400m
spacing and a 396 ⇥ 396 receiver grid with 50m spacing on
Irrespective of our sampling regime, once we have a representa- the ocean bottom. The size of each frequency slice is roughly
tion of our data volume in the HT format, we can greatly reduce 5.8GB. From the full data, we randomly remove 80% of re-
the computational costs of working with our data. In order to ceivers from each frequency slice. We then are able to obtain
make use of the data directly in its compressed form, we present our compressed data volume for each monochromatic slice by
a method for extracting a shot (or receiver) gather at a given interpolating in the HT format. Figure 3 demonstrates the suc-
position (ix , iy ) directly from the compressed parameters. We cessful interpolation of the data volume from a high level of
use Matlab colon notation A(i, :) to denote the extraction of the missing data, resulting in shot-gathers that are simple to extract
ith row of the matrix A, and similarly for column extraction. with (3).
The common shot gather can be reconstructed by computing
0 0
Usrcx,recx = Usrcx (ix , :) ⇥1 Urecx ⇥2 Bsrcx,recx
5 5
Usrcy,recy = Usrcy (iy , :) ⇥1 Urecy ⇥2 Bsrcy,recy (3)
Receiver Y [km]
Receiver Y [km]
15 15
vector products, outlined in Algorithm 1. Note that the interme- (a) True data (b) Missing 80% data
diate quantities can be constructed through efficient multilinear
0
products and are much smaller than the ambient dimensionality.
0
5 5
Receiver Y [km]
Receiver Y [km]
10 10
z [km]
z [km]
2.3 2.3
each, where each node has 20 CPU cores and 256GB of RAM.
z [km]
z [km]
2.3 2.3
We run the 3D FWI experiments for both the full data and com-
pressed HT data recovered from interpolation, fixing the total 3.5 3.5
y [km]
10 10
15 15
The techniques outlined in this work have the potential to sub-
stantially reduce data communication costs in distributed wave-
20
0 5 10 15 20
20
0 5 10 15 20 equation based inversion. In a parallel environment, we can
x [km] x [km]
cheaply store a compressed form of the full data volume at a
(a) (b)
given frequency on every node. This technique also lends itself
0 0 to generating simultaneous shots on-the-fly in a similar manner
5 5
to Algorithm (1), without the associated data communication
costs one would incur from distributing the full data volume
over the source dimension. The very high compression ratios
y [km]
y [km]
10 10
REFERENCES
Abma, R., C. Kelley, and J. Kaldy, 2007, Sources and treatments of migration-introduced artifacts and
noise: 77th Annual International Meeting, SEG, Expanded Abstracts, 2349–2353.
Aravkin, A., R. Kumar, H. Mansour, B. Recht, and F. J. Herrmann, 2014, Fast methods for denoising
matrix completion formulations, with applications to robust seismic data interpolation: SIAM
Journal on Scientific Computing, 36, S237–S266, http://dx.doi.org/10.1137/130919210.
Curry, W., 2010, Interpolation with Fourier-radial adaptive thresholding: Geophysics, 75, no. 6, WB95–
WB102, http://dx.doi.org/10.1190/1.3500977.
Da Silva, C., and F. J. Herrmann, 2015, Optimization on the hierarchical tucker manifold — Applications
to tensor completion: Linear Algebra and its Applications, 481, 131–173,
http://dx.doi.org/10.1016/j.laa.2015.04.015.
Da Silva, C., and F. Herrmann, 2016, A unified 2D/3D software environment for large-scale time-
harmonic full-waveform inversion: 86th Annual International Meeting, SEG, Expanded
Abstracts, 1169–1173, http://dx.doi.org/10.1190/segam2016-13869051.1.
Hackbusch, W., and S. Ku¨hn, 2009, A new scheme for the tensor representation: Journal of Fourier
analysis and applications, 15, 706–722, http://dx.doi.org/10.1007/s00041-009-9094-9.
Hennenfent, G., and F. J. Herrmann, 2006, Application of stable signal recovery to seismic data
interpolation: 76th Annual International Meeting, SEG, Expanded Abstracts, 2797–2801,
http://dx.doi.org/10.1190/1.2370105.
Herrmann, F. J., and G. Hennenfent, 2008, Non-parametric seismic data recovery with curvelet frames:
Geophysical Journal International, 173, 233–248, http://dx.doi.org/10.1111/gji.2008.173.issue-1.
Kabir, M. N., and D. Verschuur, 1995, Restoration of missing offsets by parabolic radon transform:
Geophysical Prospecting, 43, 347–368, http://dx.doi.org/10.1111/gpr.1995.43.issue-3.
Kreimer, N., and M. D. Sacchi, 2012, A tensor higher-order singular value decomposition for prestack
seismic data noise reduction and interpolation: Geophysics, 77, no. 3, V113–V122,
http://dx.doi.org/10.1190/geo2011-0399.1.
Kumar, R., C. Da Silva, O. Akalin, A. Y. Aravkin, H. Mansour, B. Recht, and F. J. Herrmann, 2015,
Efficient matrix completion for seismic data reconstruction: Geophysics, 80, no. 5, V97–V114,
http://dx.doi.org/10.1190/geo2014-0369.1.
Oropeza, V., and M. Sacchi, 2011, Simultaneous seismic data denoising and reconstruction via
multichannel singular spectrum analysis: Geophysics, 76, no. 3, V25–V32,
http://dx.doi.org/10.1190/1.3552706.
Sacchi, M., S. Kaplan, and M. Naghizadeh, 2009, Fx gabor seismic data reconstruction: 71st Annual
International Conference and Exhibition, EAGE, Extended Abstracts,
http://dx.doi.org/10.3997/2214-4609.201400441.
Schmidt, M. W., E. Van Den Berg, M. P. Friedlander, and K. P. Murphy, 2009, Optimizing costly
functions with simple constraints: A limited-memory projected quasi-Newton algorithm:
Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics
(AISTATS-09), 456–463.
Tobler, C., 2012, Low-rank tensor methods for linear systems and eigenvalue problems: Ph.D. thesis,
ETH Zu¨rich.
http://dx.doi.org/10.1109/DCC.1996.488345.
Wang, J., M. Ng, and M. Perz, 2010, Seismic data interpolation by greedy local radon transform:
Geophysics, 75, no. 6, WB225–WB234, http://dx.doi.org/10.1190/1.3484195.
2
Laboratory for Marine Mineral Resources, Qingdao National Laboratory for Marine Science and Technology,
Qingdao, China
Summary The VSP with small amount of data and the SFSA
algorithm with high efficiency provide the possibility for
In order to solve the problem of low computational this long time inversion. The result is the foundation of the
efficiency of the global optimization algorithm in the wave precise time-depth relationship and explanation and
equation inversion, we propose a segmented fast simulated prediction of oil and gas.
annealing (SFSA) algorithm for velocity full waveform
inversion of VSP data. The SFSA algorithm adopts a kind Disturbance model based on non-uniform mutation
of segmented strategy using different disturbance models
and annealing methods in different iterations. At the same In genetic algorithm, the mutation operator assists the
time, we generate model disturbance combined with the crossover operator to generate new individuals and decides
idea of non-uniform mutation on the later stage of the local search ability of the algorithm. The main idea of
algorithm in order to improve the convergence speed. We non-uniform mutation is to search the small area near the
perform zero offset VSP P-wave velocity inversion with original one. On the non-uniform mutation operation from
FWI using SFSA algorithm and very fast simulated X X 1 X 2 ... X k ... X l to X X 1 X 2 ...X k ' ...X l , the formula
annealing (VFSA) respectively on the same initial '
temperature and same markov chain length. The results for the generation of new genes X k is:
confirm the effectiveness and efficiency of the SFSA ' X (t , U max
k
X k )ifradom(0,1) 0
algorithm. Xk k k k (1)
X k (t , X U min )ifradom(0,1) 1
Introduction Here, (t , y ) represents a random number which is
consistent with the non-uniform distribution in the range of
As an effective nonlinear optimization algorithm, the [0, y]. Jiang et al (2007) based on this idea proposed a new
simulated annealing algorithm has been proved to be method to enhance the ability of local search. He used a
rigorous in theory and effective in practical applications non-uniform mutation strategy to generate new model
under the condition of enough model perturbation and parameters. That is:
iteration times as well as strict annealing scheme. However, xi' xi yi ( xi max xi min )
in practical applications, these requirements can not be
satisfied, and the efficiency is not high enough. Many yi r (1 T / N ) a sgn( r 0.5) (2)
scholars have improved the simulated annealing algorithm. Here, R is a random number between (0, 1), T is the
Ingber (1989) proposed a very fast simulated annealing current temperature and N represents the maximum number
algorithm, which has a certain ability to solve practical of iterations associated with the maximum temperature and
problems, but the efficiency is still low. In this paper, the the minimum temperature. a is a constant which
segmented fast simulated annealing algorithm based on determines the degree of non-uniformity.
non-uniform mutation is proposed to further improve the
efficiency of the algorithm. In order to further improve the ability of local search and
the search efficiency, we can make some constraints on the
The random search algorithms with global optimization model, and gradually reduce the perturbation space of the
performance, such as simulated annealing algorithm, are model, so as to quickly approximate the optimal solution.
not required for the initial model, and can search for the In this paper, we add a restriction factor m (k) which is
target in the global space randomly in order to ensure that relevant to iteration times (k) to the disturbance model and
the inversion result is the optimal solution. These the new formula can be expressed as:
algorithms have been widely used in seismic inversion. Liu
et al. (1995) used the synthetic simulated annealing yi r (1 T / N ) a sgn( r 0.5) / m(k ) (3)
algorithm to perform one-dimensional acoustic inversion. When k increases, m also increases and the search range
Zhang (2005) inversed wave impedance by using fast becomes smaller. Therefore, with the increase of the
simulated annealing algorithm. In this paper, we use the number of iterations, the perturbation model will be carried
SFSA algorithm to perform zero offset VSP wave equation out in a smaller and smaller interval around the current
inversion to get the precise P-wave velocity around the hole. model in order to find the optimal solution.
250
E1 is the energy function of current model, the
Here, the 500
300
1400
E is the energy function of new model, and E is the 600
350
relative energy, h, and are given nonnegative real 700 400
1200
numbers. The algorithm can have faster search ability with 800 450
the optimized criteria of acceptance probability. 500
900 Depth
20 40 60 80 100 20 40 60 80
(m)
The SFSA algorithm 200 400 600 800
(a) (b)
In this paper, we propose a segmented fast simulated Figure 1: P-wave velocity model (a) and the seismic record
annealing algorithm based on the non-uniform mutation. In (b)
the early stage of the algorithm when the number of
iterations is less than the truncated iteration number, K, the Zero offset VSP P-wave velocity inversion
model uses a global perturbation pattern and the entire
solution space is traversed. The method of model First of all, we need to determine the truncated iteration
perturbation is the same as the conventional fast simulated number, K. In this paper, several iteration numbers are
annealing algorithm (formula (5)). taken as the K, and corresponding experiments are carried
|2 1| out to choose the best number. The initial temperature of
, yi T sgn( 0.5)[(1 1 / T ) 1] (5) inversion is 10 degree centigrade and markov chain length
Here, is the random number on the (0,1) .The matching is 20. The total the number of iterations is 529. The
annealing plan is: inversion results of the velocity (Figure 2) , the change of
1/ N the temperature and the disturbance modulus (Figure 3) can
T (k ) T0 k (6) be seen that the smaller the K, the stronger the local search
Here, T0 is for the initial temperature, k is the number of ability algorithm will have, but the more easily the
iterations, N represents the number of parameters to be algorithm falls into local the local extremum. When the
inversed and is the attenuation coefficient. In this stage, truncation iteration number is larger, it is not conducive to
find the optimal solution either. Here, we can get better
we set to 0.98. The aim of the early stage is to search
inversion results when the truncation number is in the range
and lock the interval of the optimal solution.
of [100,150].
In the later stage of the algorithm when iterations times is
greater than the truncated iteration number, K, in order to
improve the local search ability of the algorithm, we use
the formula (3) to generate the perturbation of the model
and the formula (5) with =0.97 as the annealing plan.
The purpose is to search the optimal solution in the locked
interval quickly in this stage. The criterion of acceptance
probability of the whole algorithm is formula (4).
100 5
100
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Iteration
200 0
200
Disturbance 0 -3 100 200 300 400 500 600 times
modulus 5 x 10
300 300
400 400 0
500 500
Iteration
-5
0 100 200 300 400 500 600 times
600 600 (a)The variation of temperature (top) and disturbance
modulus (bottom) with the number of iterations when K=10
700 700
Temperature
10
800 800
2
200 200
0
300 300 -2
Iteration
-4
400 400 0 100 200 300 400 500 600 times
(b)The variation of temperature (top) and disturbance
500 500 modulus (bottom) with the number of iterations when K=50
Temperature
600 600
10
700 700
5
0
100 100 -2
Iteration
200 200
-4
0 100 200 300 400 500 600 times
(c)The variation of temperature (top) and disturbance
300 300 modulus (bottom) with the number of iterations when K=100
Temperature
400 400 10
500 500
5
Velocity Velocity
900 (m/s) 900
1000 1500 2000 2500 500 1000 1500 2000 2500 (m/s)
0
(e)K=250 (f)K=300
Iteration
Figure 2: Different velocity inversion results on the different -5 times
number as the truncated iteration number K. (The red line is 0 100 200 300 400 500 600
the true value, and the blue line is the result of inversion.) (d)The variation of temperature (top) and disturbance
modulus (bottom) with the number of iterations when K=150
Depth(m) Depth(m)
Temperature 0
0
10
100
100
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
5
200
200
Iteration
0 300
0 100 200 300 400 500 600 times 300
Disturbance
modulus x 10-3 400 400
5
500 500
0
600 600
-5 Iteration
0 100 200 300 400 500 600 times 700 700
(e)The variation of temperature (top) and disturbance
modulus (bottom) with the number of iterations when K=250 800 800
Temperature Velocity
10 900 900 Velocity
1000 1500 2000 2500(m/s) 1000 1500 2000 2500(m/s)
(a)VFSA (b)SFSA
5
Figure 5: The velocity inversion results using different algorithms
Iteration on the condition of same initial temperature (10) and the same
0 markov chain length (20) (The red line is the true value, and the
0 100 200 300 400 500 600 times
Disturbance blue line is the result of inversion.)
modulus -3
x 10
5 The VFSA algorithm and SFSA algorithm are used to
inverse the P-wave velocity of VSP data under the situation
0 of other conditions being equal. The inversion results using
the initial temperature of 100 degree centigrade and markov
Iteration chain length of 200 is shown in Figure 4. And the iteration
-5
0 100 200 300 400 500 600 times
(f)The variation of temperature (top) and disturbance times of VFSA and SFSA are 798 and 604 respectively.
modulus (bottom) with the number of iterations when K=300 Figure 5 demonstrates the inversion results with a
Figure 3: The variation of temperature and disturbance temperature of 10 degree centigrade and markov chain
modulus with different K length of 20. And the iteration times of VFSA and SFSA
are 912 and 529 respectively. The results reveal that the
Depth(m) Depth(m)
0 0 SFSA algorithm can obtain good inversion effect under the
condition of lower temperature and less iteration times. The
10 100 SFSA algorithm can effectively improve the efficiency of
the algorithm.
20 200
30 300 Conclusions
40 400
The traditional VFSA algorithm has been applied in
50 500 seismic inversion, but the efficiency is still low. In order to
improve the efficiency of the simulated annealing
60 600 algorithm in the zero offset VSP velocity inversion with
FWI, we perform a segmented fast simulated annealing
700
70
algorithm. After the model test, the SFSA algorithm can
80 800 obviously improve the convergence speed, and can get
Velocity Velocity better inversion effect under the less iteration times than
90
(m/s) 900
1000 1500 2000
(m/s)
2500 VFSA. The results demonstrate the effectiveness of the
1000 1500 2000 2500
SFSA algorithm.
(a)VFSA (b)SFSA
Figure 4: The velocity inversion results using different algorithms
on the condition of same initial temperature (100) and the same
markov chain length (200) (The red line is the true value, and the
blue line is the result of inversion.)
REFERENCES
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Bohachevsky, I. O., M. E. Johnson, and M. L. Stein, 1986, Generalized simulated annealing for function
optimization: Technometrics, 28, 209–217, http://dx.doi.org/10.2307/1269076.
Ingber, L., 1989, Very fast simulated annealing: Mathematical and Computer Modeling, 12, 967–973,
http://dx.doi.org/10.1016/0895-7177(89)90202-1.
Liu, P.-C., and S. H. Hartzell, 1993, Nonlinear multiparameter inversion using a hybrid global search
algorithm: Examples from 1D acoustic waveform inversion: Geophysical Journal International,
submitted.
Longcong, J., and L. Jiangping, 2007, Simulated annealing algorithm and its improved: Chinese Journal
of Engineering Geophysics, 2, 135–140.
Mrinal, K. S., and P. L. Stoffa, 1995, Global optimization methods in geophysical inversion,
https://doi.org/10.1016/s0921-9366(06)x8001-5.
Štekl, I., and R. G. Pratt, 1998, Accurate viscoelastic modeling by frequency-domain finite differences
using rotated operators: Geophysics, 63, 1779–1794, http://dx.doi.org/10.1190/1.1444472.
picking the arrival times of the direct P- and S-waves in a vectors, with subscript q indicating the station position, E
borehole or at the surface (Rutledge and Scott, 2003). Also, - the diagonal matrix of weights defined by the data error,
microseismic events can be located without time picking, and symbols “H, T”- the Hermitian conjugate and transpose
by employing stacking (Anikiev et al., 2014) and operations respectively. The observed data vectors consist
migration-based methods (Zhang and Zhang, 2013). Most
of the Laplace-Fourier image of elastic displacement position of the source at point rs0 and define the best fit for
velocities, obtained from the measured time-domain
the moment tensor. Because of the known position of the
seismic wavefield data with a set of complex frequencies
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
velocity field and depend upon the velocity model and the 6 solutions of the forward problem (6), separately for
source model each moment tensor component. Specifically
B k = b1 , b 2 ,..., b 6 , b l = B k ⋅ el = QG k P ⋅ el , l = 1...6 (8)
,k = Qq vk
d qsim ˆ (3)
ˆ is the interpolation operator applied to the
where Q where el is a 6-component vector with the one in the l -
q
position and zeros in the others.
calculated velocity field in the vicinity of the station. For
At the second step we update the source position
the elastic medium with density ρ , bulk and shear moduli
using NLCG method (Polyak & Ribiere 1969), where the
κ , µ , the velocity components v k = ( vxk , v yk , vzk ) specified in difference between observed and predicted data
eq.(3), satisfy the equations of motion, which may be e0k ( d obs
= k − dk
sim ,0
) , where d ksim,0 = Bk ( rs0 ) m0k is obtained at
directly obtained taking the Laplace-Fourier transform of
the first for the assumed source position rs0 . Thus
the time-domain system for complex frequency sk . After
finite-difference implementation (Virieux 1986), v k can be (
rs1 =rs0 + α∇ rs φ , ∇rs φ =−2 Re ( e0k E ) E∇ rs d ksim ,0 =
H
) (9)
written via a Green function G k :
= ( Hˆ ∇ P (r ) m
−2 Re g ⋅ sk b Dτ rs
0
s
0
k )
( )
−1
=v k G=
kMk , Gk
ˆ kμ D
sk sk2I − b D τ
ˆ
v
ˆ ,
b D τ Here the vector g H is the solution of the adjoint equation
(4)
M k = ( M xx , M xy , M xz , M yy , M yz , M zz ) (symbol “*” is the conjugate operation)
T
K Tk g* QT EET ( d obs
k ( sk ) − d k )
*
difference operators, which explicit expression can be Implementation for Raft River elastic model
found in (Petrov, Newman 2012, 2014). When the source-
receiver distance is much larger than the source scale and For the algorithm validation we use an existing seismic
the wavelength, the seismic source can be considered as a monitoring survey with the 8 stations at the Raft River
point source at the position rs = ( xs , ys , zs ) and can be geothermal field (Figure 1). The Raft River geothermal
field is a Department of Energy EGS test site, located in
represented by its moment tensor (Aki and Richards, 2009) Cassia county Idaho roughly 100 miles northwest of Salt
Mk (r ) (m , m yzk , mzzk ) δ ( rs − r ) ,
T
= k
xx , mxyk , mxzk , m yy
k
Lake City on the Utah-Idaho border. The Elba quartzite,
(5) located in Precambrian rocks, is the primary geothermal
m k = ∫ M k ( r ) dV reservoir with an average resource temperature of 300°F
Then equation (4) and, accordingly, the vector of predicted (Ayling et al., 2011; Jones et al., 2011). Analysis of water
data d ksim = ( d1,simk , d 2,simk , d qsim
, k ...d Nq , k ) transform to
sim chemistry indicates that the field is bisected into two
regions separated by shear faulting, termed the Narrows
=v k G k=
Pm k , d ksim B=
kmk, B k QG k P, (6) Zone (Figure 1), as described by Ayling & Moore (2013).
The Narrows Zone strikes to the northeast through the
=
where operator P diag (1 1 1 1 1 1) ⊗ δ ( r − rs ) middle of the field. Microseismic activity attributed to plant
defines the source position for each component of moment activity suggests that while acting as a barrier between the
tensor and sign ⊗ denotes the Hadamard product. two regions, the Narrows zone allows for fluid movement
To carry out simultaneous inversion for rs and along its length. Tracking production and distribution of
geothermal fluids is realized with the microseismic
m k we employ an iterative scheme, where each iteration is monitoring, which includes definition of seismic source
divided into two steps. At first we suppose the known location and mechanisms. Essentially the knowledge of
seismic-source mechanisms can provide insights into the position of the source inside the convergence radius,
fracturing behavior of the reservoir and surrounding rocks roughly 1 km.
and the understanding of the evolution of the stress field.
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
= es V pexact
,s − V psmooth
,s V pexact
,s *100% (11) in Figure 8. Clearly moment tensor inversion results are
quite sensitive to the assumed velocity model and moment
tensor estimates can vary significantly with a poor choice
of the velocity model.
Conclusions
We present a FWI method for estimation of microseismic
sources parameters (location and moment tensor) in
Laplace-Fourier domain for 3D elastic heterogeneous-
isotropic media. The stability of the algorithm was tested
for the input data contaminated with Gaussian noise and
inexact velocity models. In noise-free tests the method
converges to the exact values of the source parameters,
Figure 6: The relative error of moment tensor components inverted provided initial source locations estimates are roughly 1 km
with the random noise in initial data. from the actual event location. In the variance of the noise
and inexact velocity model the inversion still provide very
Despite the difference between exact and used models the good source location estimates, but moment tensor
errors in the estimated source coordinates are relatively estimates are degraded corresponding to the noise
small (less than 20 m) in distinction from moment-tensor magnitude and velocity model deviation. The methodology
elements (Figure 8), which are more sensitive to the shows the importance of accurate background velocity
velocity model. model and is now a focus in our ongoing research efforts
on full waveform source moment inversion. Nevertheless,
the proposed algorithm can provide robust inversion of
microseismic parameters with the definition of a robust
background velocity model. We have demonstrated this
finding using an existing micro-seismic monitoring array at
the Raft River geothermal field.
a) b) c) Acknowledgments
Figure 7: Inexact velocity models with the different levels of
deviation smoothing) from the exact model
This material is based upon work supported by the U.S.
Department of Energy Office of Energy Efficiency and
a) es = 6% , b) es = 9% , c) es = 12% .
Renewable Energy (EERE) Geothermal Technologies
Program, under Award Number GT-480010-19823-10.
Increasing difference between used and exact models leads Computational resources were provided by the National
to the increasing of corresponding errors in the moment Energy Research Scientific Computing (NERSC) Center.
tensor estimates, although relations between different All simulations were performed on the CRAY XC30
components are still approximately conserved. If the supercomputers. The authors acknowledge Joseph Moore,
relative error for the moment tensor is defined as John Queen for providing the 3D elastic Raft River model.
REFERENCES
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Aki, K., and P. G. Richards, 2002, Quantitative seismology: University Science Books.
Anikiev, D., J. Valenta, F. Staněk, and L. Eisner, 2014, Joint location and source mechanism inversion of
microseismic events:
Ayling, B., and J. N. Moore, 2013, Fluid geochemistry at the Raft River geothermal field, Idaho, USA:
New data and hydrogeological implications: Geothermics, 47, 116–126,
https://doi.org/10.1016/j.geothermics.2013.02.004.
Ayling, B., P. Molling, R. Nye, and J. Moore, 2011, Fluid geochemistry at the Raft River geothermal
field, Idaho: new data and hydrogeological implications: 36thWorkshop on Geothermal Reservoir
Engineering, SGP-TR-191.
Benchmarking on seismicity induced by hydraulic fracturing: Geophysical Journal International, 198,
249–258, https://doi.org/10.1093/gji/ggu126.
Gajewski, D., and E. Tessmer, 2005, Reverse modeling for seismic event characterization: Geophysical
Journal International, 163, 276–284, https://doi.org/10.1111/j.1365-246X.2005.02732.x.
Jones C., J. Moore, W. Teplow, and S. Craig, 2011, Geology and hydrothermal alteration of the Raft
River geothermal system: 36th Workshop on Geothermal Reservoir Engineering, SGP-TR-191.
Michel Jarillo, O., and I. Tsvankin, 2014, Gradient calculation for waveform inversion of microseismic
data in VTI media: Journal of Seismic Exploration, 23, 201–217.
Nash, G. D., and J. N. Moore, 2012, Raft river EGS project: A GIS-centric review of geology: GRC
Transactions, 36, 951–958.
Petrov, P. V., and G. A. Newman, 2012, 3D finite-difference modeling of elastic wave propagation in the
Laplace-Fourier domain: Geophysics, 77, no. 4, T137–T155, https://doi.org/10.1190/geo2011-
0238.1.
Petrov, P. V., and G. A. Newman, 2014, Three-dimensional inverse modelling of damped elastic wave
propagation in the Fourier domain: Geophysical Journal International, 198, 1599–1617,
https://doi.org/10.1093/gji/ggu222.
Plessix, R. E., G. Baeten, J. W. de Maag, M. Klaasen, Z. Rujie, and T. Zhifei, 2010, Application of
acoustic full-waveform inversion to a low-frequency large-offset land data set: 80th Annual
International Meeting, SEG, Expanded Abstracts, 930–934, https://doi.org/10.1190/1.3513930.
Polyak, E., and G. Ribière, 1969, Note sur la convergence desméthods conjugées: Revue Francaise
d’Informatique et de Recherche Opérationnelle, 16, 35–43.
Rutledge, J. T., and P.W. Scott, 2003, Hydraulic stimulation of natural fractures as revealed by induced
microearthquakes, Carthage cotton valley gas field, east Texas: Geophysics, 68, 441–452.
Sirgue, L., O. I. Barkved, J. Dellinger, J. Etgen, U. Albertin, and J. H. Kommedal, 2010, Full waveform
inversion: the next leap forward in imaging at Valhall: First Break, 28, 65–70,
https://doi.org/10.3997/1365-2397.2010012.
Virieux, J., 1986, P-SV wave propagation in heterogeneous media: Velocity-stress finite-difference
method: Geophysics, 51, 889–901, https://doi.org/10.1190/1.1442147.
Zhang, W., and J. Zhang, 2013, Microseismic migration by semblance weighted stacking and
interferometry: 83rd Annual International Meeting, SEG, Expanded Abstracts, 397, 2045–2049,
https://doi.org/10.1190/segam2013-0970.1.
Kai Zhang, Javier Subia, Chevron; Chanjuan Sun, Hao Shen, Nuree Han, CGG
typical rugose seafloor in the area is presented in Figure 1, The top panel is modeled data with the initial model; the
where canyons carved about 300m into the continental middle panel is modeled data with the FWI updated model,
slope. and the bottom panel shows the recorded field data. The
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
2015 final models. The latter puts a smoother velocity Figure 1 Campos Basin Location Map and water bottom
inside canyon flanks (black circle) and a stronger velocity map of the study area.
variation around 1,500m. The oscillating velocities healed
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Acknowledgements
The authors thank Chevron and CGG for their support for
this project.
Additional acknowledgements:
Chris Manuel, Chevron
Vanessa Brown, Chevron
REFERENCES
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Arnaud, J., S. Hollingworth, A. Woodcock, L. Ben-Brahim, C. Tindle, and S. Varley, Water Bottom
channels’ impact on pre-stack depth migration on elgin field, North Sea: 70th EAGE Conference
and Exhibition incorporating SPE EUROPEC 2008, http://dx.doi.org/10.3997/2214-
4609.20147984.
Berryhill, J., 1986, Submarine canyons: Velocity replacement by wave-equation datuming before stack:
Geophysics, 51, 1572–1579, http://dx.doi.org/10.1190/1.1442207.
Birdus, S., 2009, Geomechanical modeling to resolve velocity anomalies and image distortions below
seafloor with complex topography: 71th Annual International Conference and Exhibition, EAGE,
Extended Abstracts, U014.
Chen, G., and H. Shen, 2012, High-resolution tomography for paleo-canyons: a case study in Para-
Maranhao Basin, Brazil: 82nd Annual International Meeting, SEG, Expanded Abstracts,
http://dx.doi.org/10.1190/segam2012-1498.1.
Chen, G., and L. Hu, 2014, Dip-constrained tomography with weighting flow for paleo-canyons: a case
study in Para-Maranhao Basin, Brazil: 84nd Annual International Meeting, SEG, Expanded
Abstracts, http://dx.doi.org/10.1190/segam2014-0513.1.
Debenham, H., and Westlake, S., 2014, Pre-stack depth migration for improved imaging under seafloor or
canyons: 2D case study of Browse Basin, Australia: Exploration Geophysics, 45, 216–222,
http://dx.doi.org/10.1071/EG12085.
Deeks, J., and D. Lumley, 2015, Prism waves in seafloor canyons and their effects on seismic imaging:
Geophysics, 80, S213–S222, http://dx.doi.org/10.1190/geo2015-0014.1.
Dent, B., 1983, Compensation of marine seismic data for the effects of highly variable water depth using
ray-trace modeling, a case history: Geophysics, 48, 910–933,
http://dx.doi.org/10.1190/1.1441519.
Fruehn, J., V. Valler, S. Greenwood, S. Phani, S. Sarkar, C. G. Rao, P. Kumar, and P. Routray, 2015,
Velocity model update via inversion of non-parametric RMO picks over canyon areas offshore
Sri Lanka: 14th International Congress of the Brazilian Geophysical Society and EXPOGEF,
1063–1066, http://dx.doi.org/10.1190/sbgf2015-211.
Graham, C., and L. Richard, 2009, Channel tomography: 71th Annual International Conference and
Exhibition, EAGE, Extended Abstracts.
Guillaume, P., M. Reinier, G. Lambaré, A. Cavalié, M. Adamsen, and B. Bruun, 2013, Dip-constrained
non-linear slope tomography: an application to shallow channel characterization: 75th Annual
International Conference and Exhibition, EAGE, Extended Abstracts, 1587–1591.
Sun, Y., Q. Guo, S. Carroll, J. Chen, and E. Liebes, 2011, A high-resolution velocity anisotropy case
study: 81th Annual International Meeting, SEG, Expanded Abstracts, 3918–3922,
http://dx.doi.org/10.1190/1.3628023.
David Dickinson* (Woodside Energy Ltd), Fabio Mancini (Woodside Energy Ltd), Xiang Li (CGG), Kai Zhao
(CGG) and Shuo Ji (CGG)
Summary
In this paper, we use a real 3D data example (with a 7km
The robustness of diving wave Full-Waveform Inversion cable) from Northern Carnarvon Basin (Western Australia)
(FWI) has been proven in industry, but the effectiveness is to evaluate FWI’s business impact: what value can high
limited by its penetration depth. To target deeper reservoirs resolution FWI bring us at the reservoir level?
, the application of FWI using reflection energy is
necessary. This paper presents a real data 25Hz VTI FWI Geology background
case study from North-West Shelf (NWS) Australia
utilizing the full wave-field. Starting from a high-quality The survey is located in an area with rugose water bottom
reflection tomography VTI model, a top-down approach and larger sea floor depth variations (200 ~1200m, Figure
has been adopted. Diving wave FWI updates the shallow, 1). The field lies beneath the continental slope and the
then reflection FWI is introduced to further update the target is a gas-bearing Triassic reservoir from 3 to 4km
deeper section. The updated FWI model demonstrates depth. Local geology challenges include a large tilted
significant uplifts in increasing resolution and conformance faulting system and complex shallow overburden.
with underlying geology. Two promising aspects can be Numerous seafloor canyons and escarpments make it quite
observed: (1) the fairly solid uplifts in mitigating the a challenge for both velocity model building and imaging at
imaging challenges: FWI reduces wave-field distortions, reservoir level. The distorted wave-fields in turn reduce the
leads to overall improved focusing, gather flatness, confidence of the underlying target interpretations due to
continuity, and better positioning in depth; and (2) structural uncertainty and amplitude distortions.
uncovers geological features beyond imaging: high-
50m North
resolution FWI delineates small shallow anomalies and
velocity boundaries across faults, and reveals the strong
acoustic impedance contrasts at reservoir level. It
demonstrates FWI can aid both in reducing the velocity
uncertainty as well as understanding underlying geological
formation.
Introduction
The starting model is of very high quality with good spatial indications of gas-water contacts. Additional sonic log
and vertical resolution. After more than 10 iterations, it comparisons at other well locations (Figure 6) within the
matches sonic logs low frequency components well (Figure survey further enhances confidence in FWI’s detailed
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
2, Figure 6), captures major velocity variations conforming delineation of stratigraphy at the reservoir level (Figure 5).
to geology (Figure 3), and overall the corresponding
migration gathers are fairly flat. Although it is a state-of-
the-art reflection tomography flow, the vertical resolution Discussion: Factors affecting FWI applications
declines with depth (due to the decreasing subsurface
In FWI applications, certain factors may affect the accuracy
reflection angle). At reservoir level, the tomography
of the output velocity model besides the well-known Signal
velocity has a vertical resolution ~500m, but well data
to Noise ratio: 1) Complex density and anisotropy
indicates the existence of important shorter wavelength
variations: In this case, density was via Gardener’s law and
velocity variations (Figure 2, Figure 5). The missing high
low frequency background anisotropy (VTI) models were
frequencies in the velocity model, in both shallow and deep
employed so as to only update the P velocity. When we
sections, lead to noticeable wave-field distortions in both
push toward higher frequencies for fine details, those
(Kirchhoff) migrated stacks and gathers (Figures 4(a) and
assumptions might no longer hold, due to the fact the
4(b)). Small perturbations in the common image gathers
localized density and anisotropy variations could have
(CIGs) further indicate velocity anomalies of small
impacts of about the same magnitude as the high frequency
horizontal dimensions (relative to max offset), and the
velocity components, so detailed density and anisotropic
resultant small pull-ups and push-downs in the stack at
models might be required. 2) Seismic attenuation (Q):
reservoir level can be observed. Knowing it would be
Similar to the discussion above, when we pursue higher
difficult to push ray-based tomography further, FWI is
resolution, detailed Q modeling might be needed. Even if
introduced. During FWI updates, the anisotropy parameters
the Q field is indeed smooth, Q-FWI is likely needed,
δ and ε are kept unchanged. Diving wave analysis shows
especially in the shallow water environment.
that refraction energy illumination is limited to a maximum
depth ~1.7km. Refraction energy is introduced first to
Conclusions
update the complex shallow area; Reflections are then
employed to further update the deeper section where
Starting from a good quality tomography VTI model, our
refraction could not reach.
25 Hz FWI, by utilizing both refraction and reflection
energies in a top-down layer-stripping manner, produces a
Final results
high-resolution model that conforms well with geology.
For depth imaging: 1) FWI resolves the gather distortions
By incorporating both refraction and reflection data, our
due to lateral velocity variations from shallow to deep; 2)
FWI yields fairly robust results. In Figure 2, the velocity
Events at reservoir level are overall geologically flatter. At
profiles indicate that 7Hz FWI already catches variations of
the same time, the resultant FWI model significantly
wavelengths ~150m, and the following 25Hz FWI yields
improves the velocity resolution at reservoir level, closely
even higher resolution, reaching wavelengths ~50m,
matches the sonic logs, and uncovers geological features
matching the wells nicely (Figure 2). The velocity
beyond imaging: High-resolution FWI delineates small
evolution is illustrated in figure 3. Starting from a top
shallow anomalies and velocity boundaries across faults,
quality tomography model, FWI gradually reveals more
and reveals the strong acoustic impedance contrasts at the
velocity details hidden in the recorded wave-fields. In
reservoir level. It demonstrates FWI can aid both in
Figure 5, the 3D FWI model follows the geologic structure
reducing the velocity uncertainty as well as providing a
closely in the complex overburden and delineates small
geological interpretation.
geo-bodies, channels and faults clearly. Figure 4(b) and
4(d) further demonstrate in the CIG domain how FWI
mitigates those imprints of missing high frequency Acknowledgments
components in the input model at reservoir level:
We thank our joint venture participants for permission to
perturbations in the gathers have been greatly reduced and
present this work. We thank Sergey Birdus and Alexey
focusing has been improved. FWI also helps to improve the
Artemov of CGG for the model building work, Todd
focusing in the gas reservoir beneath the unconformity
Mojesky of CGG for reviewing, and CGG R&D team for
(Figure 4(a) and 4(c), ~3.2km), as the updated model has
technical support .
better resolved the complex structures above the reservoir.
1500m/s 4km
1000m/s
(a) PSDM model
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
PSDM Model
4km
7Hz FWI Model
25Hz FWI Model
4000m/s
500m
Figure 2: Frequency-evolution FWI comparison Figure 3: Velocity model panels (a) final PSDM model. (b) and (c) are the
- Sonic log QC. 7Hz and 25Hz FWI results.
(a) Full angle stack using Tomography model (b) CDP gathers using Tomography model)
1.5km
0.5 km
(c) Full angle stack using 25Hz FWI final model (d) CDP gathers using 25Hz FWI final model)
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
1.5km
0.5 km
Figure 4: Kirchhoff PSDM stack and CDP gathers. (a) and (b) are migrated results using tomography model (FWI input).
(c) and (d) are migrated resutls using 25Hz FWI model.
(a) Initial tomography model (b) 25Hz FWI final model
4km 4km
1500m/s
4km
4km
4000m/s
Figure 5: 3D model view: (a) Initial tomography model. (b) 25Hz FWI model.
1000m/s
PSDM Model
FWI Model
500m
REFERENCES
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Lambare, G., P. Guillaume, and J. P. Montel, 2014, Recent advances in ray-based tomography: 76th
Annual International Conference and Exhibition, EAGE, Extended Abstracts,
http://doi.org/10.3997/2214-4609.20141151.
Lu, R., S. Lazaratos, S. Hughes, and D. Leslie, 2016, 78th Annual International Meeting, SEG, Expanded
Abstracts, 1242–1246.
Mancini, F., K. Prindle, T. Ridsdill-Smith, and J. Moss, 2016, Full-waveform inversion as a game
changer: Are we there yet?: The Leading Edge, 35, 445–451,
http://doi.org/10.1190/tle35050445.1.
Privitera, A., A. Ratcliffe, and N. Kotova, 2016, A full-waveform inversion case study from offshore
Gabon: 78th Annual International Conference and Exhibition, EAGE, Extended Abstracts,
http://doi.org/10.3997/2214-4609.201600830.
Gboyega Ayeni, Rishi Bansal, Jaewoo Park, Jacob Violet, Spyros Lazaratos, Rongrong Lu, Eric Wildermuth,
Neelamani Ramesh (ExxonMobil Upstream Research Company), Michael Gurch and Gary Lewis (ExxonMobil
Exploration Company (Div. of Exxon Mobil Corporation)
(b)
Figure 3: Inverse Q anomalies associated with gas
accumulations within the study area. (a) N-S cross-section
through the study area, and (b) depth slice highlighted with
dashed line in (a).
Conclusions
Acknowledgements
(a)
(b)
Figure 4: Migrated images derived from (a) Conventional PSDM and (b) FWI. Note that FWI provides improved imaging within
the graben and across the faults
(a) (b)
Figure 5: Kirchhoff image gathers derived from (a) Initial Model and (b) FWI Model. Note the improved gather flatness derived
from the high-resolution FWI model
Figure 6: Shot records from (a) forward simulation through initial model (b) forward simulation through FWI model and (c) field
data. Note that timing errors (indicated by the arrows) which are present in the simulated data through the initial model have been
corrected by FWI.
REFERENCES
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Aikulola, U. O., S. O. Olotu, and I. Yamusa, 2010. Investigating fault shadows in the Niger Delta: The
Leading Edge, 29, 16–22, http://doi.org/10.1190/1.3284048.
Bunks, C., F. M. Saleck, S. Zaleski, and G. Chavent, 1995, Multiscale seismic waveform inversion:
Geophysics, 60, 1457–1473, http://doi.org/10.1190/1.1443880.
Fagin, S., 1996, The fault shadow problem: Its nature and elimination: The Leading Edge, 15, 1005–
1013, http://doi.org/10.1190/1.1437403.
Lu, R., S. Lazaratos, K. Wang, Y. H. Cha, I. Chikichev, and R. Prosser, 2013, High-resolution elastic
FWI for reservoir characterization: 75th Annual International Conference and Exhibition, EAGE,
Extended Abstracts, Th 10–02, http://doi.org/10.3997/2214-4609.20130113.
Mora, P., 1987, Nonlinear two-dimensional elastic inversion of multioffset seismic data: Geophysics, 52,
1211–1228, http://doi.org/10.1190/1.1442384.
Pratt, R. G., C. Shin, and G. J. Hicks, 1998, Gauss-Newton and full Newton methods infrequency-space
seismic waveform inversion: Geophysical Journal International, 133, 341–362,
http://doi.org/10.1046/j.1365-246X.1998.00498.x.
Reilly, J., and C. Edoziem E., 1998, Seeing below gas seepage, in P. Shultz ed., The seismic velocity
model as an interpretation asset (No. 2): SEG.
Sirgue, L., O. I. Barkved, J. P. Van Gestel, O. J. Askim, and J. H. Kommedal, 2009, 3D waveform
inversion on Valhall wide-azimuth OBC: 71st 75th Annual International Conference and
Exhibition, EAGE, Extended Abstracts, U038, http://doi.org/10.3997/2214-4609.201400395.
Tarantola, A., 1984, Inversion of seismic-reflection data in the acoustic approximation: Geophysics, 49,
1259–1266, http://doi.org/10.1190/1.1441754.
Virieux, J., and S. Operto, 2009, An overview of full-waveform inversion in exploration geophysics:
Geophysics, 74, no. 6, WCC1–WCC26, http://doi.org/10.1190/1.3238367.
Warner, M., A. Ratcliffe, T. Nangoo, J. Morgan, A. Umpleby, N. Shah, V. Vinje,I. Štekl, L. Guasch, C.
Win, G. Conroy and A. Bertrand, 2013, Anisotropic 3D full-waveform inversion: Geophysics, 78,
no. 2, R59–R80, http://doi.org/10.1190/geo2012-0338.1.
reflectivity of the water bottom was refined by modeling and is dominated by high-frequency dipping artefacts in the
synthetic gathers at regular intervals. A VTI anisotropy shallow section. Similar noise is also present at a lower
model was derived using residual moveout on Kirchhoff spatial frequency in Figure 1c below 3 km, but is less
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
offset-domain common-image gathers (ODCIGs) migrated prevalent; the same noise does not obviously exist in
using the initial model, and the depth to a marker horizon in Figures 1a or 1b.
these ODCIGs relative to the nearby well.
In a second experiment, Figure 2, we compared full-
Results bandwidth AWI and FWI at 23 Hz with the results of
conventional multiscale FWI, beginning at 3 Hz and running
The first experiment was to compare the results of AWI, up to 23 Hz. For this test, shallow velocities in the starting
undertaken without using multiscale inversion, when using model, Figure 2a, were reduced by 1.5%. This is sufficient
different bandwidths of input data. This was done to assess to introduce cycle skipping into the data for full-bandwidth
the frequency at which full-bandwidth AWI breaks down, FWI. For AWI, in order to avoid the generation of sub-
and to determine whether different bandwidths produce vertical noise at depth, we smoothed the model partway
comparable results. Figure 1 shows the results of four AWI through the inversion before continuing. This smoothing
runs using input data that had been low-pass filtered at 5, 11, increased with depth, and was predominantly horizontal.
23 and 47 Hz, respectively. Each inversion used a starting
model similar to that shown in Figure 2a, identical The results of the smoothed full-bandwidth AWI are shown
parameterization and 80 iterations using the full bandwidth in Figure 2b; comparison with Figure 1c shows the effect of
of the input data. the smoothing in suppressing the noise. Figure 2c shows the
results of applying full-bandwidth FWI at 23 Hz. This
As would be expected, the main difference between each run model is now severely compromised by cycle skipping and
is the resolution captured by the inversion. Figures 1a-c there is significant spurious structure around and above the
show broadly the same features: a low-velocity, horizontal, reservoir, even in the shallow subsurface. This is
laterally-extensive reservoir at approximately 3 km depth, a unsurprising; the starting model is not especially accurate
number of shallow gas pockets in an otherwise benign and conventional FWI is not designed to begin at these high
overburden, and simple horizontal stratigraphy. The final frequencies from an imperfect starting model.
inversion, Figure 1d, however does not capture the reservoir
(a) (b)
(c) (d)
Figure 1: Full-bandwidth AWI results, using low-pass filters with corner frequencies of: (a) 5 Hz, (b) 11 Hz,
(c) 23 Hz, and (d) 47 Hz. Each inversion used a spatial filter of identical scale length applied to the AWI gradient.
Using multiscale FWI, Figure 2d, largely avoids the effects that these are likely to be spurious.
of cycle skipping, and produces a more geologically realistic
model, particularly in the first 1.5 km below mudline. The In an effort to assess which of Figures 2b and 2d is a more
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
reservoir in Figure 2d appears fragmented at its fringes, with accurate representation of the true subsurface, the data were
suggestions of gas migration into the overburden. Full- demultipled, deghosted and zero-phased, and migrated using
bandwidth AWI converges to a similar solution in the Kirchhoff prestack depth migration (PreSDM). The
shallow section, but it produces a flatter reservoir that agrees resultant full stack sections are displayed in Figure 3,
closely with the reflection image. overlain on the respective velocity models used to migrate
the data. Whilst the differences between the images
Full-bandwidth AWI and multiscale FWI also differ in the themselves are subtle, the AWI result, Figure 3b, appears to
impact of the limited data fold towards the margins of the correlate significantly better with its stack. As the image and
model. Spurious updates in this region are stronger in the model were generated using two quite distinct regimes of
FWI results than in AWI, most likely because of the wavefield propagation, this agreement suggests that the AWI
narrower aperture of the AWI impulse response. In the left- model provides a reliable account of the geology. The AWI
hand side of Figure 2d, in particular, these sweeping migration result is also generally flatter and more regular.
artefacts appear to merge into geology, placing uncertainty
on the model update here, and impacting migration quality. ODCIGs were generated as a byproduct of the PreSDM,
Figure 4. It can be seen that the multiscale FWI gathers
Another noticeable difference between the two results is the contain non-systematic errors throughout. It is especially
shape and position of the target reservoir. The reservoir noticeable that the gathers in the weaker lower-half of the
appears more central and flatter in the AWI result than the FWI section are under migrated, suggesting that the FWI
multiscale FWI. One concern with the AWI model is the model has not been able to recover from the perturbation that
presence of broad bowl-like features in the shallow was introduced deliberately into the starting model. The
subsurface, similar in appearance to migration aperture AWI gathers are flatter, brighter, and more continuous in
artefacts in imaging. Comparable structures are prominent form. This suggests that the AWI model is indeed more
in Figure 1d, where AWI has clearly gone awry, suggesting kinematically accurate than its multiscale FWI counterpart.
(a) (b)
(c) (d)
Figure 2: (a) Early-stage reflection tomography model, adjusted by 1.5%, used as a starting model; (b) full-bandwidth AWI
model at 23 Hz; (c) full-bandwidth FWI model at 23 Hz; and (d) multiscale FWI model increasing by stages from 3 to 23 Hz.
(b) AWI has the potential to be integrated with and lend its
benefits to other waveform inversion frameworks. An
example of such is the combined local and global inversion
for anisotropy parameters (Debens et al., 2015), where the
use of AWI provides resilience against the effects of local
minima related to cycle skipping.
Acknowledgements
REFERENCES
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Bunks, C., F. M. Saleck, S. Zaleski, and G. Chavent, 1995, Multiscale seismic waveform inversion:
Geophysics, 60, 1457–1473, https://doi.org/10.1190/1.1443880.
Debens, H. A., M. Warner, A. Umpleby, and N. V. da Silva, 2015, Global anisotropic 3D FWI: 85th
Annual International Meeting, SEG, Expanded Abstracts, 1193–1197,
https://doi.org/10.1190/segam2015-5921944.1.
Harrowfield, G., 2015, Mass transport complexes of the Rakhine Basin, Myanmar – Deepwater examples
from recent 3D seismic data: SEAPEX Exploration Conference.
Mancini, F., K. Prindle, T. Ridsdill-Smith, and J. Moss, 2016, Full-waveform inversion as a game
changer: Are we there yet? The Leading Edge, 35, 445–448, 450–451,
https://doi.org/10.1190/tle35050445.1.
Tarantola, A., 1984, Inversion of seismic reflection data in the acoustic approximation: Geophysics, 49,
1259–1266, https://doi.org/10.1190/1.1441754.
Warner, M., and L. Guasch, 2016, Adaptive waveform inversion: Theory: Geophysics, 81, no. 6, R429–
R445, https://doi.org/10.1190/geo2015-0387.1.
Summary
In practice, depending in part upon the acquisition
Three-dimensional anisotropic acoustic FWI has become a geometry, it is often possible to reduce the source density
relatively routine component of depth-model building for in 3D FWI below that required in pure 2D so that this
PSDM and shallow-hazard identification, and is increas- scaling is not quite as severe as suggested, but 3D FWI is
ingly being used for pore-pressure prediction and reservoir always significantly more expensive than 2D. In addition,
characterization. However, 3D FWI is relatively resource as the maximum frequency of the data increases, the
intensive, especially at higher frequencies. Consequently number of mesh points required to capture the wavefield
2D FWI can provide a low-cost option when extensive accurately also increases so that the difference in cost
initial testing is required for parameter selection and quality between 2D and 3D FWI becomes more marked.
control of starting models. In addition, during early
exploration, full 3D coverage may not be available Field data
everywhere, and long 2D lines are not uncommonly used to
tie new 3D surveys to more-distant wells and provide We have applied both 2D and 3D FWI to two datasets from
regional context. In these circumstances, 2D FWI may the Carnarvon basin on the NW Australian shelf. In the
have a role to play as part of a larger 3D FWI workflow. first dataset, an 80-km 2D line was acquired using a single
source and single streamer. Data acquisition was optimized
Here we apply both 2D and 3D FWI to the same field for FWI by employing a 10,000-m cable, towed at 25-m
datasets to explore the utility, accuracy and limitations of depth, and a large low-frequency source array towed at 10-
the former. We show that 2D FWI applied to long regional m; the shot interval was 50 m. For this survey, the shallow
lines has exploration benefit and that the additional benefit velocity model was reasonably benign for FWI but there
of applying full 3D FWI to this type of data is limited. We are thin high-velocity igneous intrusions in the deeper
also demonstrate that early testing may be rapidly and section. The water depth ranged from 600 to 1600 m.
usefully performed using 2D FWI ahead of full-3D produc- Figure 1a shows a typical shot record; there are strong
tion FWI with a consequent saving in both time and cost. water-bottom multiples, significant refracted energy at
Initial 2D testing can be especially relevant ahead of cost- longer offsets, and good signal-to-noise at low frequencies.
sensitive decisions to run FWI to higher frequencies where
the 2D results can provide a low-cost initial indication of The second dataset was taken from a conventional 3D
the potential benefits of increased bandwidth in 3D. narrow-azimuth towed-streamer survey designed princip-
ally to enhance spatial resolution rather than for improved
Introduction FWI. The survey used flip-flop sources and eight cables.
The cable length was 5500 m., towed at 6-m depth, and the
3D FWI allows the use of velocity models that vary in three sources were towed at 5-m depth with an 18.75 shot
dimensions, but it also allows sources and receivers to be interval. The underlying velocity model was more
properly distributed in space and it allows those sources complicated than for the first survey with several
and receivers to act as points rather than the lines that 2D generations of buried channels, mini-basins and velocity
wave propagation assumes. The latter changes both the inversions, and this velocity model is difficult to recover
amplitude and phase spectra of the predicted data and without assistance from extensive reflection tomography.
influences its temporal decay. Consequently, 3D The water depth is around 1400 m. Figure 1b shows a
simulation provides different results and is superior to 2D typical shot record; there is less refracted energy, less low-
modelling of the same data. Some of these differences can frequency signal, and more noise than in Figure 1a.
be mitigated during 2D FWI and some cannot.
We applied acoustic VTI anisotropic 2D and 3D FWI on
Most algorithms that are used to simulate the seismic both datasets with minimal preprocessing. Both refracted
wavefields required for FWI scale as n3 in 2D and as n4 in and reflected arrivals were used throughout, and surface
3D for a single source where n is a measure of the linear ghosts and multiples were retained within the field data.
dimensions of the model in mesh points. In addition, 3D We took no explicit account of attenuation or elastic
modelling requires additional source coverage in the third effects, but our FWI code is designed to be robust against
dimension so that in 3D the number of sources required systematic amplitude variations in the field data that do not
scales as n2 whereas in 2D it scales as n. Consequently the match our acoustic assumptions. We used regional values
computational cost of 3D FWI is around n2 times greater for anisotropy and held these values constant during FWI.
than is 2D FWI; typically n is a few hundred or more.
Results – dataset two Qualitatively the 2D and 3D results are similar, but the
velocity anomalies introduced by FWI are almost always
Unlike dataset one, here the data were acquired with full more intense in the 3D model. It is not clear why that
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
3D acquisition. Consequently the main role of 2D FWI in should be the case. Significantly more data is used to
this survey is in early parameter testing and in assessing the generate the 3D result, and this additional data will act to
commercial value of more-expensive algorithms, for reduce the effective signal-to-noise ratio during FWI. This
example extending FWI to higher frequencies. This can be dataset has high noise levels at low frequencies because of
especially relevant when production runs are performed on the shallow tow depth, and we speculate that the amplitudes
the cloud and early testing is performed using more-limited of the velocity updates during 2D FWI are suppressed by
local resources. these higher noise levels acting to compromise the data fit.
Figure 7 shows the start model and results of 2D and 3D For this survey, we used extensive testing with 2D FWI to
anisotropic FWI for this survey. FWI was run between 3.6 design the optimal parameterization for subsequent 3D
and 7.8 Hz; similar parameterization was used for both inversion. This is a difficult dataset to invert, and previous
inversions. In dataset one, 2D FWI used a single source attempts have relied heavily on reflection tomography. We
and single cable so that cable feathering becomes an issue. found the initial 2D testing to be invaluable in developing
In dataset two, we have full 3D acquisition, and so are able the full 3D workflow. Figure 8 demonstrates that the final
to select a subset of sources and receivers from multiple anisotropic 3D result matches check-shot data at the well,
cables that closely match a true 2D geometry; this match is and serves to validate this approach to 3D FWI.
not perfect because of the finite cross-line spacing of both
sources and receivers, but it is much closer than can
normally be obtained by using data from a single cable.
Figure 8: Comparison of
check shots, starting model,
and 2D and 3D anisotropic
FWI models. The location
of the well is shown in
Figure 7.
Conclusions
REFERENCES
No references.
SUMMARY compare the final velocity model with the available sonic log
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
WAVEFORM INVERSION
INTRODUCTION
We adopted the 2D frequency domain visco-acoustic wave-
Waveform tomography can provide subsurface images of high
form tomography approach as described in (Pratt et al., 1998;
resolution (velocity, density, attenuation) that can potentially
Pratt, 1999). The success of the inversion was determined by
be directly comparable to seismic migration. The usual diffi-
3 important factors: the preconditioning of the input data, the
culties in the implementation of waveform tomography are its
starting model, and the inversion strategy.
non linearity due to the abundance of information used from
the input data (amplitude and phase), the variable data qual- Data preconditioning
ity especially at low frequencies, the starting models and the The successful application of waveform tomography requires
starting frequency. Most of the time, it is necessary to de- a good preconditioning of the input data. Any aspect of the
velop a specific inversion strategy for a successful applica- input data that cannot be predicted by the 2D vico-acoustic
tion (e.g. Takougang and Calvert, 2013; Kamei et al., 2014). propagation scheme, e.g., shear waves, coherent noise, shot to
Several applications, ranging from crustal studies using wide shot energy variations, amplitude discrepancy and bad traces,
angle refraction data to reflection seismic data have been re- was removed or corrected. Figure 1 shows a receiver gather at
ported (e.g. Takougang and Calvert, 2013; Pratt et al., 1998; shallow depth; the presence of shear waves (indicated by the
Pratt, 1999; Pratt et al., 2004; Operto et al., 2006; Malinowski blue arrow in Figure 1a), probably due to mode conversion at
et al., 2011; Mothi and Kumar, 2014; Malinowski et al., 2011; the water-seabed interface, can only be predicted by the elastic
Kamei et al., 2013; Sirgue et al., 2010), some with crosshole wave-equation, and was removed from the input data. The
data (e.g. Wang and Rao, 2006; Zhou and Greenhalgh, 2003; following preconditioning steps were adopted:
Pratt et al., 2004), but only a few with walkaway VSP data
(e.g. Gao et al., 2007; Barnes and Charara, 2009; Yang et al., 1. A f-k filter was chosen to remove the shear waves from
2011; Takougang and Bouzidi, 2016). However, VSP data are the input data (see Figure 1). The data were further
generally of greater resolution with higher frequency contain muted with a maximum time window of 200 ms after
compare to surface seismic, due to less energy loss from propa- the first arrival to avoid the inclusion of late converted
gation effects. Therefore, the application of waveform tomog- waves as well as late multiples. A time-window Tw of
raphy to walkaway VSP data can reveal additional subsurface 2 s was used for the inversion.
information that could be used to complement interpretation
2. The amplitudes of the data were corrected from
√ 3D to
based on surface seismic data.
2D propagation by multiplying the data by t (t is the
This paper presents the application of visco-acoustic waveform propagation time), and traces contaminated by noise
tomography to walkaway VSP data from an oil field in a car- were removed. At this stage, the amplitude variation
bonate reservoir offshore Abu Dhabi in the United Arab Emi- with offset (AVO) of the data were scaled using a scal-
rates. We describe the waveform tomography preprocessing ing factor obtained upon comparison of the logarithm
steps as well as the inversion strategy that resulted in forming of the Root Mean Square (RMS) AVO of the field and
high resolution P-wave velocity and attenuation structures. We the modelled data (Brenders and Pratt, 2007), to force
the AVO of the input data to have a behavior similar to tifacts during the inversion at the receivers’ locations, large
that of the AVO predicted by the visco-acoustic mod- smoothness constraints were used. After 80 iterations, a veloc-
elling. ity model (Figure 2a) was obtained with an average root mean
square ( RMS) misfit of 11 ms. The velocity model was then
(a) tested to ensure that it predicts the first arrival traveltimes picks
to within half a cycle. A requirement necessary to ensure a
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
ner law are generally lower than the actual velocity values in
carbonate rock) (Liebermann and Sondergeld, 1994). For this
2 reason, the density model was estimated from empirical corre-
lation derived from the sonic and density log at the borehole
location. We derived the following relationship:
ρ = 293.2Vp0.25616 , (1)
300 m
300 m
300 m 300 m
4400 4400
Depth
Depth
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
3400 3400
R
2400 2400
Distance Distance
(c) (d)
1/Qp 1/Qp
0.03 0.06
300 m
300 m
300 m 300 m
0.04
Depth
Depth
0.02
0.02
R
0.01 0
Distance Distance
Figure 2: Velocity and attenuation models with the receivers’ locations (gray dots) in the deviated borehole superimposed. (a)
Starting velocity model from traveltime tomography, (b) velocity model after 4-50 Hz waveform inversion, (c) starting attenuation
model from the frequency shift method and (d) attenuation model after 4-15 Hz waveform inversion. There is a significant improve-
ment of the velocity and attenuation models after waveform inversion. There is a decrease in velocity associated with increase in
attenuation at a reservoir location (R).
After Shin and Cha (2009), the Laplace transform of a time- 0.4 s, 0.8 s, 1.2 s and 1.6 s. The lower values of τ (0.5 s)
domain wavefield u(x;t) can be expressed as : suppresses waveforms arriving after 0.7 s of our selected 2 s
Z +∞ time window, while τ(s) = 0.8 s, τ(s) = 1.2 s and τ(s) = 1.6
u(x; s) = u(x;t)e−st dt, (4) s progressively include late arrivals. The frequencies 4-25 Hz
0 were used with τ = 0.4 s,0.8 s and 1.2 s while the frequen-
where s is the complex-valued Laplace parameter. By intro- cies 25-50 Hz were used with τ = 1.6 s. Only the velocity
ducing τ, a time decay constant and ω a real-valued angular model was recovered during stage 1, while both velocity and
frequency, we can write s as: attenuation models were recovered during stage 2. This was
done to reduce coupling between velocity and attenuation pa-
s = 1/τ + iω. (5) rameters during models updates (see Pratt et al., 2004; Kamei
and Pratt, 2008; Kamei et al., 2013). A more reliable attenu-
Considering u(x;t < 0) = 0, equation 4 can then be expressed ation model is generally obtained when a high resolution ve-
as (Kamei et al., 2013): locity model is used (see Kamei and Pratt, 2008; Takougang
Z +∞ and Calvert, 2013; Kamei et al., 2013). We started the inver-
u(x; τ, ω) = u(x;t)e−t/τ e−iωt dt. (6) sion at the lowest frequency (4 Hz) and gradually progress to
−∞
the highest frequencies, with a frequency interval of 0.5 Hz.
equation 6 can be seen as the real Fourier transform of a time The frequencies were grouped in 5, spaced every 0.5 Hz, and
domain wavefield multiply by a time damping term, e−t/τ . The were inverted at a time. 5 iterations were used for each group
damping term τ serves to suppress or attenuates late arrivals. of frequencies. The updated models obtained after the inver-
The smaller the decay constant τ, the larger the suppression sion of each group of frequencies was used as starting models
of late arrivals. We realized that a judicious selection of the for the inversion of the next group of frequencies. The source
decay constant τ contributed significantly to the success of the signature was obtained following the method of Pratt (1999)
inversion. A larger selection of τ at an early stage of the inver- and was re-estimated after the inversion of each group of fre-
sion resulted in the failure of the inversion with convergence quencies. During stage 1 of the inversion procedure, we sweep
to local minima, while smaller values of τ focus on early ar- through the frequencies with our selected values of τ (0.5 s, 0.8
rival waveforms which are less prone to cycle skipping. The s, 1.2 s and 1.6 s) and repeat the same procedure during stage
inversion was performed in 2 stages, with frequencies rang- 2. The logarithmic data residual (Bednar et al., 2007; Shin and
ing from 4 Hz to 50 Hz, and 4 values of τ(s) were selected:
Ming, 2006) was used during the inversion. The logarithmic seismic attenuation in carbonate rocks from Abu Dhabi oil
data residual defined by the difference between the logarithmic field (Bouchaala et al., 2016). Deeper, A low velocity zones
of the observed and the logarithm of the predicted wavefield (Vp = 2500-3200 m/s) associated with increase in attenuation
had proven to provide a more uniform and deeper illumination (Q p = 0.05-0.06) correlate with a known location of hydro-
than the conventional data residual defined by the difference carbon reservoirs (R). A comparison of 1D velocity profiles
of the predicted and observed wavefield (Kamei et al., 2014). at the borehole location and the sonic log shows an overall
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
The gradient was preconditioned by wavenumber filtering to good match (Figure 3). The decrease in velocity from wave-
reduce high wavenumber artifacts (Sirgue and Pratt, 2004). form tomography (4-50 Hz model) correlates with decrease in
velocity from the sonic log at the reservoir location (R). How-
ever, the waveform tomography velocity model appears to be
Sonic slightly underestimated in few places, for example at 790 m
Starting
depth (see arrow in Figure 3). The inaccuracy of the veloc-
4-50 Hz
ity model at 790 m depth can be explained by the absence of
receivers around that depth.
CONCLUSIONS
The starting models and the result of the inversion are dis- We are grateful to the oil-subcommittee of the Abu-Dhabi Na-
played in Figure 2. The waveform tomography velocity model tional Oil Company (ADNOC) and its operating companies
was recovered for the frequency range 4-50 Hz, while the in- (OPCOs) for sponsoring this project and providing the data.
version for attenuation was stopped at 15 Hz due to increas- Special thanks to Marc Michel Durandeau and to all the com-
ing artifacts. The velocity (Figure 2a) and attenuation (Fig- mittee members for their inputs and encouragements to the
ure 2c) models from waveform tomography have much greater project. The waveform tomography code was provided by Ger-
resolution compare to the starting velocity from traveltime to- hard Pratt.
mography and starting attenuation from the frequency shift
method. The attenuation values are generally consistent with
REFERENCES
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Aldridge, D. F., and D. Oldenburg, 1993, Two-dimensional tomographic inversion with finite-difference
traveltimes: Journal of Seismic Exploration, 2, 257–274.
Barnes, C., and M. Charara, 2009, Viscoelastic full waveform inversion of north sea offset vsp data: 79th
Annual International Meeting, SEG, Expanded Abstracts, 2278–2282,
http://dx.doi.org/abs/10.1190/1.3255315.
Bednar, J. B., C. Shin, and S. Pyun, 2007, Comparison of waveform inversion, part 2: Phase approach:
Geophysical Prospecting, 55, 465–475, http://dx.doi/10.1111/j.1365-2478.2007.00618.x.
Bouchaala, F., M. Ali, and J. Matsushima, 2016, Estimation of seismic attenuation in carbonate rocks
using three different methods: Application on VSP data from Abu Dhabi oilfield: Journal of
Applied Geophysics, 129, 79–91, https://doi.org/10.1016/j.jappgeo.2016.03.014
Brenders, A. J., and R. G. Pratt, 2007, Full waveform inversion tomography for lithospheric imaging:
results from a blind test in a realistic crustal model: Geophysical Journal International, 168, 133–
151, https://doi/10.1111/j.1365-246X.2006.03156.x.
Bunks, C., F. M. Saleck, S. Zaleski, and G. Chavent, 1995, Multiscale seismic waveform inversion:
Geophysics, 60, 1457–1473. https://doi.org/10.1190/1.1443880
Gao, F., A. Levander, R. Pratt, C. Zelt, and G.-L. Fradelizio, 2007, Waveform tomography at a
groundwater contamination site: Surface reflection data: Geophyics, 72, no. 5, G45–G55,
https://org/doi/abs/10.1190/1.2752744.
Gardner, G. H. F., L. W. Gardner, and A. R. Gregory, 1974, Formation velocity and densitythe diagnostic
basics for stratigraphic traps: Geophysics, 39, 770–780, https://org/doi/abs/10.1190/1.1440465.
Kamei, R., and R. Pratt, 2008, Waveform tomography strategies for imaging attenuation structure for
cross-hole data: 70th Annual International Conference and Exhibition, EAGE, Extended
Abstracts, Extended Abstracts, F019, https://org/doi.10.3997/2214-4609.20147680.
Kamei, R., R. G. Pratt, and T. Tsuji, 2013, On acoustic waveform tomography of wide-angle obs data –
strategies for pre-conditioning and inversion: Geophysical Journal International, 194, 1250–1280,
https://doi.org/10.1093/gji/ggt165.
Kamei, R., R. G. Pratt, and T. Tsuji, 2014, Misfit functionals in laplace-fourier domain waveform
inversion, with application to wide-angle ocean bottom seismograph data: Geophysical
Prospecting, 62, 1054–1074, https://doi/10.1111/1365-2478.12127.
Liebermann, R. C., and C. H. Sondergeld, 1994, Experimental techniques in mineral and rock physics:
The schreiber volume: Birkhauser Verlag AG.
Malinowski, M., S. Operto, and A. Ribodetti, 2011, High-resolution seismic attenuation imaging from
wide-aperture onshore data by visco-acoustic frequency-domain full-waveform inversion:
Geophysical Journal International, 186, 1179–1204. https://doi/10.1111/j.1365-
246X.2011.05098.x.
Mothi, S., and R. Kumar, 2014, Detecting and estimating anisotropy errors using full waveform inversion
and ray-based tomography: A case study using long-offset acquisition in the gulf of mexico: 84th
Annual International Meeting, SEG, Expanded Abstracts, 1066–1071,
http://doi/abs/10.1190/segam2014-0324.1.
Operto, S., J. Virieus, J. X. Dessa, and G. Pascal, 2006, Crustal seismic imaging from multifold ocean
bottom seismometer data by frequency domain full waveform tomography: Application to the
Wei Wang*, Po Chen, Ian S. Keifer, Ken G. Dueker, Department of Geology and Geophysics, University of Wyoming
Summary A variety of GITs have been applied to image the DCZ and
each has its own strengths and limitations (Parsekian et al.,
We successfully applied the Full-3-D tomography (F3DT) 2015). In practice, by combining results from multiple
based on adjoint-wavefield method to the passive source types of GITs, we can potentially obtain a more robust
ambient-noise data over a critical zone site in Blair Wallis interpretation. It has been shown that images of subsurface
Watershed, Wyoming. The aim of this study is to image the seismic velocities (i.e., physical quantities related to density
Critical Zone in three-dimensional in order to better and elastic moduli) obtained from seismic refraction
understand its architecture and associated processes. In the surveys (SRS) using ray-theoretic travel-time tomography
forward modeling, we utilize the 3-D elastic wave equation (RTT) correlate well with the degree of weathering in the
discontinuous Galerkin (DG) solver to simulate wave DCZ and can be used to delineate the weathering interfaces
propagation within a model volume, which is constructed (WI) (or reaction zone, depending upon the observation
by tetrahedral elements with variable size determined by scale) separating soil, weathered bedrock and fresh
starting model velocity as well as the desired resolution. crystalline bedrock (e.g., Hunter et al., 1984; Dasios et al.,
We applied the adjoint-wavefield method to calculate 1999; Befus et al., 2011; Holbrook et al., 2014). The
sensitivity (Fréchet) kernels using frequency-dependent architecture of the WI is an important indicator of the
group delay misfit between model-predicted waveforms balance between the rates of weathering and erosion and
and field data. often controls the flow and storage of subsurface fluids
(e.g., Drake et al., 2009; Brantley et al., 2011; Holbrook et
Introduction al., 2014). It is therefore critical to be able to image not
only the overall geometry but also the internal structure of
The critical zone is the outermost layer of the solid earth the WI. The resolution of the images obtained from SRS
that extends from the deepest reach of the groundwater using first-arrival RTT is typically on the order of tens of
chemical reactions to the top of the vegetation canopy meters, which is comparable to or even larger than the
(Anderson et al., 2007; Brantley et al., 2007). It is a highly thicknesses of some WI (Brantley et al., 2011). Our
dynamic layer that hosts a wide variety of physical, preliminary results (Want et al., 2016) strongly indicate that
chemical, hydrological and biological processes. These the application of F3DT is capable of improving the
processes and the interactions among them create and resolution of seismic velocity images to (sub)meter-scale.
transform the environment that sustains agriculture and
most terrestrial life. To understand these processes and to By applying F3DT to image the DCZ with the passive-
build predictive quantitative models that can accurately source ambient-noise seismic waveform data, we will
describe them require us to be able to characterize the provide high-resolution, three-dimensional seismic velocity
structure of the critical zone across its full depth range, images that will lead to a better understanding of the DCZ
from the soil at the surface and shallow depths to the weathering processes.
saprolite, regolith and fractured bedrock, which may extend
to depths of tens to hundreds of meters and are often Field site and data acquisition
difficult to access directly. Drilling and coring is expensive
and only provides direct spot measurements of the deep The field site is located in the Laramie Range with
critical zone (DCZ), which may not be representative of the approximately 21 km south east of Laramie (Fig. 1). The
whole area, especially in areas with strong lateral three arrays (Fig. 1) were deployed in the Blair-Wallis
heterogeneities, where geostatistical interpolation may fail Watershed during the summer of 2015. Green, red, and
to provide an adequate representation. Geophysical yellow dots show locations of the Blair Wallis 1, 2, and 3
imaging techniques (GITs) are minimally invasive, nodal arrays, respectively. Blair Wallis 1 array is the data
relatively inexpensive and provide indirect estimates of we used presented in this work. Stars indicate the locations
physical properties over large areas quickly. Thus, GITs are of borehole wells, of which the three arrays were centered
highly useful complements to drilling/coring, especially for ground truthing. Each array is roughly 200 m by 200 m.
when they can be calibrated with direct measurements, and Each receiver recorded the ambient-noise field
they can provide crucial guidance for selecting the most continuously for 3-4 days. The ambient-noise Green’s
useful sites for excavation and sampling efforts (Riebe & Function (ANGF) is extracted from raw ambient-noise data
Chorover, 2013). followed by seismic interferometry processes (e.g. Bensen
et al., 2007). A scalable parallel algorithm for seismic
interferometry (pSIN) (Chen et al., 2016) is employed to more corrections can be made.order accuracy on fully
efficiently speed up the workflow. unstructured tetrahedral meshes, which is suitable to deal
with geological structure models with complex geometries
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
equations.
200
The DG method is particularly suitable for solving
elastodynamic equations in a complex geological model,
such as the highly heterogeneous critical zone. The
0 advantage of the DG scheme over the spectral-element
200 400 600 scheme is that the DG solution is allowed to be
meters discontinuous across element boundaries. For the Blair
Figure 1: A map of the study area and the ambient-noise data Wallis 1 data array, we constructed a 3D tetrahedral mesh
array with element adapting to both spatial variations of seismic
velocities and desired resolution (Fig. 2). The 1D starting
Methods
model (Fig. 3b) extracted by average the velocity model
from ambient-noise data analysis was extended by
F3DT is in essence an iterative numerical optimization
interpolation onto out 3D tetrahedral mesh. An ANGF
algorithm, in which we search for an optimal subsurface
envelope data example from one station is shown in Fig 3a.
seismic structure model that can minimize the waveform
As the starting model is generally increased with depth that
discrepancies between model-predicted (i.e., synthetic)
indicates wavelengths at shallower depths are much shorter.
seismograms and observed seismograms. The seismic
Hence, we used tetrahedral elements with smaller size (4m
structure model is usually represented using (visco)elastic
at the ground surface) than those at larger depth (10m at he
parameters, such as elastic moduli, density and seismic
about 90m depth). As the seismic velocity model was
velocities (Vp and Vs). These parameters are 3D continuous
updated from iteration to iteration, the tetrahedral mesh can
functions of the space coordinate and can in general be
be updated accordingly to adapt to the changing velocity
discretized using a 3D spatial mesh. During optimization
model.
we search for the optimal discrete representation of the
structural parameters.
4
Measurement count
0
−0.5 −0.4 −0.3 −0.2 −0.1 0 0.1 0.2 0.3 0.4 0.5
Group Delay(s)
REFERENCES
Anderson, S. P., F. von Blanckenburg, and A. F. White, 2007, Physical and chemical controls on the
critical zone: Elements, 3, 315–319, http://dx.doi.org/10.2113/gselements.3.5.315.
Befus, K. M., A. F. Sheehan, M. Leopold, S. P. Anderson, and R. S. Anderson, 2011, Seismic constraints
on critical zone architecture, Boulder Creek watershed, Front Range, Colorado: Vadose Zone
Journal, 10, 1342, http://dx.doi.org/10.2136/vzj2010.0108er.
Bensen, G. D., M. H. Ritzwoller, M. P. Barmin, A. L. Levshin, F. Lin, M. P. Moschetti, N. M. Shapiro,
and Y. Yang, 2007, Processing seismic ambient noise data to obtain reliable broad-band surface
wave dispersion measurements: Geophysical Journal International, 169, 1239–1260,
http://dx.doi.org/10.1111/j.1365-246X.2007.03374.x.
Brantley, S. L., M. B. Goldhaber, and K. V. Ragnarsdottir, 2007, Crossing disciplines and scales to
understand the critical zone: Elements, 3, 307–314, http://dx.doi.org/10.2113/gselements.3.5.307.
Brantley, S. L., H. Buss, M. Lebedeva, R. C. Fletcher, and L. Ma, 2011, Investigating the complex
interface where bedrock transforms to regolith: Applied Geochemistry, 26, S12–S15,
http://dx.doi.org/10.1016/j. apgeochem.2011.03.017.
Chen P., N. J. Taylor, K. G. Dueker, I. S. Keifer, A. K. Wilson, C. L. McGuffy, C. G. Novitsky, A. J.
Spears, and W. S. Holbrook, 2016, pSIN: a scalable, Parallel algorithm for Seismic
INterferometry of large-N ambient-noise data: Computers & Geosciences, 93, 88–95,
http://dx.doi.org/10.1016/j.cageo.2016.05.003.
Dasios, A., C. McCann, T. R. Astin, D. M. McCann, and P. Fenning, 1999, Seismic imaging of the
shallow subsurface: shear-wave case histories: Geophysical Prospecting, 47, 565–591,
http://dx.doi.org/10.1046/j.1365-2478.1999.00138.x.
Drake, H., E. Tullborg, and A. B. MacKenzie, 2009, Detecting the near-surface redox front in crystalline
bedrock using fracture mineral distribution, geochemistry and U-series disequilibrium: Applied
Geochemistry, 24, 1023–1039, http://dx.doi.org/10.1016/j.apgeochem.2009.03.004.
Holbrook, W. S., C. S. Riebe, M. Elwaseif, J. L. Hayes, K. Basler-Reeder, D. L. Harry, A. Malazian, A.
Dosseto, P. C. Hartsough, and J. W. Hopmans, 2014, Geophysical constraints on deep weathering
and water storage potential in the Southern Sierra Critical Zone Observatory: Earth Surface
Processes and Landforms, 39, 366–380, http://dx.doi.org/10.1002/esp.3502.
Hunter, J. A., S. E. Pullan, R. A. Burns, R. M. Gagne, and R. L. Good, 1984, Shallow seismic reflection
mapping of the overburden–bedrock interface with the engineering seismograph; some simple
techniques: Geophysics, 49, 1381–1385, http://dx.doi.org/10.1190/1.1441766.
Lee, E. J., P. Chen, T. H. Jordan, P. B. Maechling, M. A. Denolle, and G. C. Beroza, 2014, Full-3-D
tomography for crustal structure in southern California based on the scattering-integral and the
adjoint-wavefield methods: Journal of Geophysical Research: Solid Earth, 119, 6421–6451 .
Parsekian, A. D., K. Singha, B. J. Minsley, W. S. Holbrook, and L. Slater, 2015, Multiscale geophysical
imaging of the critical zone: Reviews of Geophysics, 53, 1–26,
http://dx.doi.org/10.1002/2014RG000465.
Riebe, C. S. and J. Chorover, 2013, Report on drilling, sampling, and imaging the depths of the critical
zone, an NSF workshop.
Lei Fu
King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Kingdom of Saudi Arabia.
We test the feasibility of applying multiscale phase inver- F [p(g,t; s)cal ] = A(g, ω; s)cal eiφ (g,ω;s)cal , (1)
iφ (g,ω;s)obs
sion (MPI) to seismic marine data. To avoid cycle-skipping, F [p(g,t; s)obs ] = A(g, ω; s)obs e . (2)
the multiscale strategy temporally integrates the traces several
times , i.e. high-order integration, to produce low-boost seis- Here, s is the location of the source, and g is the location of
mograms that are used as input data for the initial iterations the geophone for a monochromatic source at frequency ω. The
of MPI. As the iterations proceed, higher frequencies in the modified traces p̄(g,t; s)cal are obtained by replacing A(g, ω; s)cal
data are boosted by using integrated traces of lower order as with A(g, ω; s)obs and performing the inverse Fourier trans-
the input data. Results with synthetic data and field data from form,
the Gulf of Mexico produce robust and accurate results if the n o
p̄(g,t; s)cal = F −1 L(ω)A(g, ω; s)new eiφ (g,ω;s)cal , (3)
model does not contain strong velocity contrasts such as salt-
n o
sediment interfaces. p̄(g,t; s)obs = F −1 L(ω)A(g, ω; s)obs eiφ (g,ω;s)obs , (4)
One of the most significant problems with full waveform inver- Misfit function. The modified predicted and observed traces
sion (FWI) is the cycle-skipping problem (Virieux and Operto, are time-integrated and their residual are used for the MPI mis-
2009; Warner et al., 2013; Warner and Guasch, 2014), where fit function,
an iterative solution gets stuck in a local minimum. Another XZ
problem is that the amplitudes of the predicted traces do not ε mpi = dt [I n p̄(g,t; s)cal − I n p̄(g,t; s)obs ]2 , (5)
fully match those of the observed data because all of the actual s,g
physics is not used in computing the predicted traces. To miti- where I n is an integration operator I ≡ dt performed n times,
R
gate both problems, Sun and Schuster (1993) proposed a multi- and p̄(g,t; s)cal and p̄(g,t; s)obs are the modified traces in equa-
scale phase inversion (MPI) method. To avoid cycle-skipping, tions (3) and (4). If we set A(g, ω; s)new = A(g, ω; s)cal in
the multiscale strategy temporally integrates the traces several equation (3), then the MPI misfit function becomes that for
times to produce low-boost seismograms that are used as input full wave inversion, except that the traces have been modified
data for the initial iterations of MPI. To avoid the necessity of by the filter L(ω) and integration operator.
exactly predicting amplitudes, only the phase of the seismic
data is predicted and the amplitude information is largely ig- Gradient. The gradient of MPI misfit function ε mpi w.r.t. the
nored. The penalty in not matching amplitudes is a moderate velocity field c(x)
loss in resolution in the velocity tomogram. Sun and Schuster
∂ ε mpi
(1993) demonstrated the feasibility of this method by applying γ mpi (x) = (6)
it to synthetic crosswell data. We now test the MPI strategy on ∂ c(x)
Z
both synthetic data and field data recorded in a marine seismic 1 X
dt [I n p̄˙(x,t; s)] I n p̄˙0 (x,t; s) ,
=
sruvey. The next section describes the theory of MPI, which c(x)3 s
is then followed by the numerical results section. Both syn-
thetic data and field data are inverted with the MPI strategy for where dot means time differentiation, p̄(x,t; s) is the pressure
sediments with moderate velocity contrasts. The final section wavefield by the source at s, and p̄0 (x,t; s) is the wavefield
presents the summary. computed by backprojecting the seismogram residual (Luo and
Schuster, 1991) δ p̄,
p̄0 (x,t; s) =
X
THEORY g(x, −t; g, 0) ∗ δ p̄, (7)
r
dient is
Z Distance (km)
4
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
1
∂ ε f wi
Z
1 X
γ f wi (x) = dt [ ṗ(x,t; s)] ṗ0 (x,t; s) ,
= 3
∂ c(x) c(x)3 s 2
2
(9)
0 1 2 3 4
Q Model log10Q
where the FWI gradient is the dot product between the source 0 4
Z Distance (km)
with the data residual δ p = p(g,t; s)obs − p(g,t; s)cal . 1 3
2 2 2 0
plex conjugate, ω is the angular frequency, and ε is a damping
factor to prevent numerical instability. One formula for choos-
ing optimal frequency bands proposed by Sirgue and Pratt (2004) 3 3 3
is
fn
fn+1 = , (11) −0.01
αmin 0 2 4 0 2 4 0 2 4
Offset (km) Offset (km) Offset (km)
where fn is the current frequency,
√ fn+1 is the next frequency
Figure 2: Synthetic seismic data, a) constant density acoustic data, b) visco-acoustic data, and
to be chosen, and αmin = z/ h2 + z2 is the parameter that de- c) elastic data, for the shot location at x = 0 km. All subplots have the same scale.
pends on the maximum half offset h and the maximum depth z
to be imaged. a) Initial Model km/s b) Initial Model km/s
0 0
Z Distance (km)
4 4
1 1
3 3
2 2
NUMERICAL RESULTS 2 2
0 1 2 3 4 0 1 2 3 4
To demonstrate the effectiveness of MPI and its advantages, c) FWI Tomogram km/s d) FWI Tomogram km/s
0 0
Z Distance (km)
ing the constant-density acoustic wave equation (Alford et al., e) MPI Tomogram km/s f) MPI Tomogram km/s
0 0
Z Distance (km)
Acoustic Data Figure 3: Inversion results for acoustic data. a) The smoothed initial velocity model with
an average velocity error of 12%, c) FWI and e) MPI tomograms based on the initial model
The original acoustic data are generated by solving the con- a), g) velocity profile comparison for true (black), initial (blue), FWI (green) and MPI (red)
stant density acoustic equation with a 15-Hz Ricker wavelet. A tomograms; b) the v(z) initial velocity model with an average velocity error of 22%, d) FWI
and f) MPI tomograms based on the initial model b), h) velocity profile comparison for true
common shot gather for the source at (z, x) = (0, 0)m is shown (black), initial (blue), FWI (green) and MPI (red) tomograms.
in Figure 2a. Different bandpass filters are applied to the origi-
nal data, and the frequency multiscale strategy is used for both
the FWI and MPI methods. Figure 3a is the smoothed initial velocity model with an aver-
age velocity error of 12%, and Figure 3b is the v(z) initial ve-
© 2017 SEG Page 1401
SEG International Exposition and 87th Annual Meeting
locity model with an average velocity error of 22%. The FWI the source wavelet and acquisition geometry are the same as
and MPI tomograms with the smoothed initial model (Fig- in the acoustic case. The pressure is injected in the water and
ure 3a) are shown in Figures 3c and 3e, respectively. The FWI the pressure field is recorded as the negative of the average of
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
and MPI tomograms with the v(z) initial model (Figure 3b) the normal stresses. The true v p model is shown in√Figure 1a,
are shown in Figures 3d and 3f, respectively. Figures 3g and the density is given by ρ = 0.31v0.25
p , and vs = v p / 3, except
3h shows the velocity profile comparison for the true (black), the shear velocity of the ocean water is set to 0 m/s. The elas-
initial (blue), FWI (green) and MPI (red) velocity models at tic data at shot location (z, x) = (0, 0)m is shown in Figure 2c.
different offsets. We can see that both the FWI and MPI to- We can see some converted waves which do not appear in the
mograms have a good agreement with the true model when the acoustic data. The FWI and MPI tomograms are shown in
initial model is not far away from the true model. However, Figure 4b and 4c, respectively, and the corresponding velocity
when the initial model is far away from the true model, tradi- profiles are shown in Figure 4c. We can see that the MPI to-
tional FWI gets stuck in a local minima, while MPI provides an mogram is moderately more accurate than the FWI tomogram.
accurate tomogram. Thus, the MPI method has a more robust
convergence than FWI for this model. Gulf of Mexico Data
The MPI method is applied to a streamer data set recorded in
Visco-acoustic Data the Gulf of Mexico using 515 shots with a shot interval of 37.5
We now use visco-acoustic data as input traces to the acous- m, a time-sampling interval of 2 ms, a recording time of 10
tic FWI and MPI algorithms. The goal is to test the sen- s, and 480 hydrophones per shot. The hydrophone interval is
sitivity of each method to the unmodeled attenuation effects 12.5 m, with the minimum and maximum source-receiver off-
in the data. The visco-acoustic data are generated by solving sets of 198 m and 6 km, respectively. The v(z) velocity model
visco-acoustic equations, where the source wavelet and acqui- shown in Figure 5a is used as the initial model for multiscale
sition geometry are the same as in the acoustic case. A pres- FWI and MPI. The initial velocity model is discretized into
sure source is injected in the water, and the pressure field is 402 × 3008 grids with a grid spacing of 6.25 m in both direc-
recorded. The true v p model is shown in Figure 1a and the tions.
Q model is shown in Figure 1b, where the minimum Q is 5.
The visco-acoustic data for the shot location at (z, x) = (0, 0)m a) Initial Model km/s
is shown in Figure 2b, where we can see that reflections are 0 2.5
4 4 2
1 1
3 3
2 2
2
2 2
1.5
0 1 2 3 4 0 1 2 3 4 0 5 10 15
c) MPI Tomogram km/s d) MPI Tomogram km/s c) MPI Tomogram km/s
0 0
Z Distance (km)
4 4 0 2.5
1 1
3 3
Depth (km)
2 2 1
2 2
2
0 1 2 3 4 0 1 2 3 4
e) Profile Comparison f) Profile Comparison 2
0 0
Z Distance (km)
1.5
1 1 0 5 10 15
X Distance (km)
2 2
Figure 5: The a) v(z) initial model, b) multiscale FWI, and c) MPI tomograms for the Gulf of
0 1 2 3 4 0 1 2 3 4 Mexico data.
X Distance (km) X Distance (km)
Figure 4: Inversion results for visco-acoustic and elastic data. a) FWI and, c) MPI tomograms,
e) velocity profile comparison for true (black), initial (blue), FWI (green) and MPI (red) tomo-
grams at different offsets for visco-acoustic data; b) FWI and d) MPI tomograms , f) velocity Figures 5b and 5c depict the FWI tomogram after 26 iterations
profile comparison for true (black), initial (blue), FWI (green) and MPI (red) tomograms at
different offsets for elastic data. and MPI tomogram after 52 iterations, respectively. Both the
FWI and MPI tomograms have a higher resolution compared
Elastic Data with the initial velocity model. In addition, the resolution of
We now use elastic data as input traces to the acoustic FWI the MPI tomogram is slightly higher than that seen in the FWI
and MPI algorithms. The goal is to test the sensitivity of each tomogram. In order to verify the reconstructed FWI and MPI
method to the unmodeled elastic effects in the data. The elastic tomograms, we compare the migration images and angle do-
data are generated by solving the elastic wave equation, where main common image gathers (ADCIGs).
© 2017 SEG Page 1402
SEG International Exposition and 87th Annual Meeting
a) RTM Image Computed with the Initial Velocity Model Average Correlation for Each Shot
1
0 0.1 Initial
FWI
Depth (km)
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
MPI
1 0.8
0
2 0.6
Correlation
−0.1
0 5 10 15
0.4
b) RTM Image Computed with FWI Tomogram
0 0.1
0.2
Depth (km)
1
0
0
2
−0.1 2 4 6 8 10 12 14 16
0 5 10 15 X Distance (km)
c) RTM Image Computed with MPI Tomogram Figure 8: The average correlation, which is calculated by the correlations between the ob-
served and predicted data, for each shot.
0 0.1
Depth (km)
1
0
that the ADCIGs associated with the FWI and MPI tomograms
2 are flatter than those from the initial velocity model. And the
−0.1 ADCIGs (in the red box) from MPI are slightly flatter then
0 5 10 15 FWI. Figure 8 shows the data similarity between the observed
X Distance (km)
and predicted data. We can find that the flattened MPI traces
Figure 6: RTM migration images computed from a) initial velocity model, b) FWI tomogram,
and c) MPI tomogram. have a better similarity to one another than the traces obtained
from the FWI tomogram.
a) ADCIGs Computed with the Initial Velocity Model
0.2
SUMMARY AND CONCLUSIONS
0.1
Depth (km)
1
0 The multiscale strategy temporally integrates the traces several
2 −0.1 times to produce low-boost seismograms that are used as input
−0.2 data for the initial iterations of MPI. Synthetic examples show
4 6 8 10 12 14 that both the MIP and FWI methods can obtain similar tomo-
b) ADCIGs Computed with FWI Tomogram grams when the initial velocity model is not far away from
0.2
the true model. However, limited tests suggest that the MPI
0.1 method gives a more accurate tomogram than FWI when the
Depth (km)
1
0 initial model is far from the true model. In addition, tests sug-
−0.1
gest that MPI can provide a more accurate tomogram than FWI
2
when inverting elastic data. These examples show that MPI is
−0.2
4 6 8 10 12 14 more robust than FWI for inverting seismic marine data.
c) ADCIGs Computed with MPI Tomogram
0.2
In the GOM marine data case, both the FWI and MPI methods
successfully inverted the marine data to obtain tomograms that
0.1
Depth (km)
REFERENCES
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Alford, R., K. Kelly, and D. M. Boore, 1974, Accuracy of finite-difference modeling of the acoustic wave
equation: Geophysics, 39, 834–842, http://doi.org/10.1190/1.1440470.
Boonyasiriwat, C., P. Valasek, P. Routh, W. Cao, G. T. Schuster, and B. Macy, 2009, An efficient
multiscale method for time-domain waveform tomography: Geophysics, 74, no. 6, WCC59–
WCC68, http://doi.org/10.1190/1.3151869.
Levander, A. R., 1988, Fourth-order finite-difference P-SV seismograms: Geophysics, 53, 1425–1436,
http://doi.org/10.1190/1.1442422.
Luo, Y., and G. T. Schuster, 1991, Wave-equation traveltime inversion: Geophysics, 56, 645–653,
http://doi.org/10.1190/1.1443081.
Operto, S., J. Virieux, P. Amestoy, J.-Y. LExcellent, L. Giraud, and H. B. H. Ali, 2007, 3D finite-
difference frequency domain modeling of visco-acoustic wave propagation using a massively
parallel direct solver: A feasibility study: Geophysics, 72, no. 5, SM195–SM211,
http://doi.org/10.1190/1.2759835.
Sirgue, L., and R. G. Pratt, 2004, Efficient waveform inversion and imaging: A strategy for selecting
temporal frequencies: Geophysics, 69, 231–248, http://doi.org/10.1190/1.1649391.
Sun, Y., and G. T. Schuster, 1993, Time-domain phase inversion: 63rd Annual International Meeting,
SEG, Expanded Abstracts, 684–687, http://doi.org/10.1190/1.1822588.
Virieux, J., and S. Operto, 2009, An overview of full waveform inversion in exploration geophysics:
Geophysics, 74, no. 6, WCC1–WCC26, http://doi.org/10.1190/1.3238367.
Warner, M., and L. Guasch, 2014, Adaptive waveform inversion — FWI without cycle skipping-theory:
76th Annual International Conference and Exhibition, EAGE, Extended Abstracts,
http://doi.org/10.3997/2214-4609.20141092.
Warner, M., T. Nangoo, N. Shah, A. Umpleby, and J. Morgan, 2013, Full-waveform inversion of cycle-
skipped seismic data by frequency down-shifting: 83rd Annual International Meeting, SEG,
Expanded Abstracts, 903–907, http://doi.org/10.1190/segam2013-1067.1
where is the angular frequency, vz, vh and vnmo are the W1 ( x, t )S ( x, t ) R ( x, T t )
vertical, horizontal and NMO velocities, ki are the 1 dt (6)
2 Avz ( x ) t W2 ( x, t ) 2
Gv z ( x ) 1 S ( x , t ) R ( x , T t )
wavenumber vector components along the space
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
v z ( x ) t t
coordinates x. After transforming equation (1) to space-
time domain, and solving the time derivative using a
second-order finite-difference approximation, we obtain the 2 1 f h ( k x , k y ) S ( k , t )
A ( x )
G (x ) FT R(x, T t )dt (7)
following time-stepping scheme: ( f n ( k x , k y , k z ) S ( k , t )
Conclusions
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Acknowledments
REFERENCES
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Alkhalifah, T., 2015, Conditioning the full-waveform inversion gradient to welcome anisotropy:
Geophysics, 80, no. 3, R11–R122, https://doi.org/10.1190/geo2014-0390.1.
Alkhalifah, T., and R. E. Plessix, 2014, A recipe for practical full-waveform inversion in anisotropic
media: An analytical parameter resolution study: Geophysics, 79, no. 3, R91–R101,
https://doi.org/10.1190/geo2013-0366.1.
Cheng, X., K. Jiao, D. Su, and D. Vigh, 2016, Multiparameter estimation with acoustic vertical transverse
isotropic full-waveform inversion of surface seismic data: Interpretation, 4, SU1–SU16,
https://doi.org/10.1190/INT-2016-0029.1.
Chiu, C., and P. L. Stoffa, 2011, Application of normalized pseudo-Laplacian to elastic wave modeling
on staggered grids: Geophysics, 76, no. 11, T113–T121, http://dx.doi.org/10.1190/geo2011-
0069.1.
Debens, H., M. Warner, A. Umpleby, and N. da Silva, 2015, Global anisotropic 3D FWI: 85th Annual
International Meeting, SEG, Expanded Abstracts, https://doi.org/10.1190/segam2015-5921944.1.
Etgen, J. T., and S. Brandsberg-Dahl, 2009, The pseudo-analytical method: application of pseudo-
Laplacians to acoustic and acoustic anisotropic wave propagation: 79th Annual International
Meeting, SEG, Expanded Abstracts, 2552–2555, https://doi.org/10.1190/1.3255375.
Goldstein, T., and O. Stanley, 2009, The split Bregman method for l1-regularized problems: SIAM
Journal on Imaging Sciences, 2, 323–343, https://doi.org/10.1137/080725891.
Guo, Z., and M. de Hoop, 2013, Shape optimization and level set method in full waveform inversion with
3D body reconstruction: 83th Annual International Meeting, SEG, Expanded Abstracts,
https://doi.org/10.1190/segam2013-1057.1.
Plessix, R. E., and Q. Cao, 2011, A parameterization study for surface seismic full waveform inversion in
an acoustic vertical transversely isotropic medium: Geophysical Journal International, 185, 539–
556, https://doi.org/10.1111/j.1365-246X.2011.04957.x.
Ramos-Martinez, J., S. Crawley, Z. Zou, A. A. Valenciano, L. Qiu, and N. Chemingui, 2016, A robust
gradient for long wavelength FWI updates: 76th Annual International Conference and Exhibition,
EAGE, Extended Abstracts, https://doi.org/10.3997/2214-4609.201601536.
The core of this non-linear tomography is the use of the so- solving the Eikonal equation, which provides a more stable
called kinematic invariants, which represent the kinematic solution than ray tracing. However, solving the anisotropic
characteristics of locally coherent events in the un-migrated Eikonal equation can be computer intensive. Here we split
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
domain. They are generally obtained through a kinematic first break modeling into two steps. First, we solve the
demigration of dip and RMO picks in the pre-stack depth isotropic Eikonal equation |∇𝑇|2 = 1/𝑉ℎ2 for each source
migration domain. They are then used to feed a non-linear using the horizontal velocity 𝑉ℎ = 𝑉𝑣 √1 + 2𝜖. This gives
iterative algorithm involving kinematic migrations and an approximated traveltime map. A trajectory between
updates of the velocity model in order to minimize the source and receiver is computed following the traveltime
slope of RMO (Montel et al., 2009). gradients. Then we perturb this trajectory using a ray
bending algorithm to get rays that satisfy Fermat’s principle
We propose to combine this with diving ray tomography, in the full anisotropic model.
which can be seen as a ray-based version of diving wave
FWI, taking advantage of the non-linear capabilities of the Which parameters should we invert for? Several studies
two methods. The associated joint cost function can be deal with sensitivity analysis. Djebbi et al. (2017) compute
expressed as the traveltime sensitivity kernels in a VTI context for
𝑁𝑅𝑀𝑂 𝑁𝐹𝐵
𝑙
several parameterizations. They conclude that diving waves
𝐶(𝑚) = ∑ 𝑎𝑖 |𝑑𝑅𝑀𝑂𝑖 |𝑙 + 𝑤 ∑ 𝑏𝑗 |Δ𝑡𝑗 | + 𝑅(𝑚) are mostly sensitive to the horizontal velocity 𝑉ℎ whereas
𝑖=1 𝑗=1 reflections are mostly sensitive to the NMO
where 𝑁𝑅𝑀𝑂 and 𝑁𝐹𝐵 are the number of picked reflected velocity 𝑉𝑁𝑀𝑂 = 𝑉𝑣 √1 + 2𝛿. When diving waves and
events and the number of picked first breaks, respectively; reflections are used simultaneously, they suggest to
𝑑𝑅𝑀𝑂 is the slope of the reflected event in the CIG use (𝑉𝑁𝑀𝑂 , 𝜂, 𝛿), where the anellipticity parameter is
(derivative of the depth position with respect to the CIG 𝜖−𝛿
defined by 𝜂 = . It was demonstrated that we cannot
parameter, namely offset or angle); Δ𝑡 is the traveltime 1+2𝛿
misfit (difference between computed time in the current recover the three parameters using surface P-wave seismics
model and picked traveltime on the data); 𝑎𝑖 and 𝑏𝑗 are only in layered models (Alkhalifah and Tsvankin, 1995).
Usually, 𝛿 is derived from, or constrained by, well data or
weights on each data (they can be offset-based or include a
data quality factor, for example); w is a global weight regional geological knowledge, while 𝑉𝑣 and 𝜖 are
applied to diving ray misfit term; l describes the chosen estimated keeping 𝛿 fixed (or obeying a priori rock
norm, and finally 𝑅(𝑚) stands for additional constraint and physics-based relationship between 𝜖 and 𝛿). In the
regularization terms applied to the model (including example below, we choose to keep 𝛿 fixed during the
Tikhonov type, or Laplacian, in particular), that are tomography. In this case, solving for (𝑉𝑣 , 𝜖) or (𝑉𝑁𝑀𝑂 , 𝜂)
required in every ill-posed inverse problem. does not make much difference, and we choose the former
for simplicity. We allow long wavelength spatial variations
The cost function is minimized through a non-linear only for in order to mitigate the tradeoff between
iterative multi-scale procedure. The first step involves a anisotropy and velocity (as Stopin et al., 2014).
kinematic migration which allows re-localizing the
invariants in the updated velocity model. Then Fréchet Additionally, the computation of the Fréchet derivatives
derivatives, with respect to the model parameters, are gives access to the Gauss-Newton approximation of the
computed and the model perturbation is found by solving Hessian, which is used to precondition the gradient and
the normal equations of the least squares problem using an limit the crosstalk between parameters.
iterative linear solver.
Land 3D field example
Dealing with anisotropy
We tested our method on a 3D land broadband wide
Our approach aims at estimating accurately the anisotropy azimuth vibroseis dataset from the Sultanate of Oman
parameters for the later FWI update. In a tilted transverse (Mahrooqi et al., 2012), with a 9 s sweep from 1.5 to 86
isotropic (TTI) medium with known tilt angles, there are Hz. The acquisition design is 50 m by 50 m interval for the
three unknowns: the velocity along the principal symmetry shots, and 250 m by 25 m interval for the receivers. A full
axis 𝑉𝑣 and, the anisotropy parameters 𝜖 and 𝛿, following time and depth processing project has been completed on
Thomsen (1986). In practice, we commonly assume that the this dataset, without FWI, using multi-layer TTI reflection
tilt angles follow the structures of the migrated image. tomography. In the example shown here we work on a
subset area of 800 km2 on which an FWI study has been
Kinematic migration of reflected events in an anisotropic conducted recently (Sedova et al., 2017).
medium does not bring much complexity with respect to First breaks were picked up to 10 km offset, while RMO
the isotropic case. First breaks are usually modeled by picks were available from the recently completed depth
processing project. Two wells are located in the center of derived depth model, and using the aforementioned
the study area. We computed an initial velocity model by constant anisotropy, we perform a first pass of joint
smoothing the pre-stack time migration (PreSTM) RMS tomography to update 𝑉𝑣 and 𝜖. Then we update 𝑉𝑣 and 𝛿
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
velocities in the time domain and converting them to using well data extrapolated along two horizons, and we
interval velocities in the depth domain. The initial perform a second pass of joint tomography for both 𝑉𝑣 and
anisotropy parameters were constant: 𝜖 = 12% and 𝛿 = 𝜖. The and models after joint tomography are shown in
5%, coming from known regional values. Figure 3, overlaid on a migrated stack section. The output
1 a) b) c) model is used as an input to FWI inverting both diving
waves and reflections from 3 Hz up to 13 Hz, for updating
𝑉𝑣 only. The preprocessing applied to the data prior to FWI
is the same as in Sedova et al. (2017). We observe on
2 Figure 4 that the convergence of diving wave FWI is
improved when we start from the joint tomography model
rather than the simpler initial model, especially at near
offsets, due to the better estimation of anisotropy.
3
To QC our final result after the FWI at 13 Hz, we compare
in Figure 5 a CIG computed in the legacy multi-layer TTI
Figure 1: Common image point snail gathers migrated in depth for: reflection tomography model with the same CIG computed
a) initial model, b) reflection tomography model, and c) joint in FWI model. We clearly see a reduction of wobbling
reflection-diving ray tomography model. The vertical scale is in
km and the offsets are from 0 to 4 km.
across offsets and an overall satisfactory flatness of events.
2 We also observe good agreement between the velocity
a) b) c) model and the well log. Synthetics overlaid on the real
seismics are shown in Figure 6 for the same two models.
Unsurprisingly, the match between synthetics and real data
is better after FWI than after reflection tomography, and the
modeling using the FWI model is able to reproduce more
3 events. Hence described approach allows honoring both the
reflections and the diving wave kinematics, and matching
well data.
We ran two tomography tests updating jointly 𝑉𝑣 and 𝜖 We presented a joint reflection-diving ray tomography that
starting from the same initial model: the first time using allows the recovered anisotropy to be used in FWI. We
reflections only, and the second time using reflections and applied it to a real land dataset from the Sultanate of Oman.
first breaks jointly. CIGs and wave equation modeled Computed FWI model gives an excellent match between
diving wave synthetics are shown in Figure 1 and Figure 2, synthetic and real data, especially at long offsets. It flattens
respectively. Reflection tomography and joint reflection- the CIGs and the vertical velocity profile compares nicely
diving ray tomography both achieve a good flattening of with the well log, demonstrating the capability of the
the CIGs, but only the latter also correctly models the method.
diving wave kinematics. This gives us good confidence in
the method: flat CIGs and honored first break traveltimes Acknowledgements
mean that the estimated 𝑉𝑣 and 𝜖 are reliable.
We acknowledge PDO and the Ministry for Oil and Gas of
Using joint tomography as a starting model for FWI the Sultanate of Oman for permission to use the data. We
thank our colleagues Gilles Lambaré and Patrice Guillaume
We now use the joint reflection-diving ray tomography to for many helpful discussions, and CGG for permission to
build a starting model for FWI. Starting from the PreSTM- publish this work.
From From
1 initial joint
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
tomo
a) b) c) 1 Legacy FWI
1
2 2
3 3
2000 6000 Figure 6: Common receiver gathers displayed back to back. Real
Figure 5: a) Snail CIG in legacy tomography model, b) snail CIG data (black wiggles) is overlaid on synthetics (blue / red) computed
in FWI model, and c) comparison to well log: cyan is FWI, blue is using: left) legacy multi-layer tomography model, right) 13 Hz
the well log. The vertical scale is depth in km. Offset is from 0 to 4 FWI model. The vertical scale is in seconds and the offsets are
km; velocity is in m/s. from 2 to 8 km. Black wiggles should overlap with red.
2 km
840 1200 1405 2 km
4 km
0.5 km
REFERENCES
Alkhalifah, T. and I. Tsvankin, 1995, Velocity analysis for transversely isotropic media: Geophysics, 60,
1550–1566, http://dx.doi.org/10.1190/1.1443888.
Cheng, X., K. Jiao, D. Sun, and D. Vigh, 2014, Anisotropic parameter estimation with full-waveform
inversion of surface seismic data: 84th Annual International Meeting, SEG, Expanded Abstracts,
1072–1077, https://doi.org/10.1190/segam2014-0821.1.
Debens, H. A., M. Warner, A. Umpleby, and N. V. da Silva, 2015, Global anisotropic 3D FWI: 85th
Annual International Meeting, SEG, Expanded Abstracts, 1193–1197,
http://dx.doi.org/10.1190/segam2015-5921944.1.
Djebbi, R., R.-É. Plessix and T. Alkhalifah, 2017, Analysis of the traveltime sensitivity kernels for an
acoustic transversely isotropic medium with a vertical axis of symmetry: Geophysical
prospecting, 65, 22–34, http://dx.doi.org/10.1111/1365-2478.12361.
Guillaume, P., G. Lambaré, O. Leblanc, P. Mitouard, J. Le Moigne, J.-P. Montel, T. Prescott, R. Siliqi, N.
Vidal, X. Zhang, and S. Zimine, 2008, Kinematic invariants: an efficient and flexible approach
for velocity model building: 78th Annual International Meeting, SEG, Expanded Abstracts,
3687–3692, http://dx.doi.org/10.1190/1.3064100.
Mahrooqi, S., S. Rawahi, S. Yarubi, S. Abri, A. Yahyai, M. Jahdhami, K. Hunt, and J. Shorter, 2012,
Land seismic low frequencies: Acquisition, processing and full wave inversion of 1.5-86 Hz:
82nd Annual International Meeting, SEG, Expanded Abstracts, 1–5,
http://dx.doi.org/10.1190/segam2012-0961.1.
Montel, J.-P., N. Deladerriere, P. Guillaume, G. Lambaré, T. Prescott, J.-P. Touré, Y. Traonmilin, and X.
Zhang, 2009, Kinematic invariants describing locally coherent events: an efficient and flexible
approach to non linear tomography: 71st Annual International Conference and Exhibition, EAGE,
Extended Abstracts, Workshop WS 1: Locally Coherent Events–A New Perspective for Seismic
Imaging.
Mothi, S., and R. Kumar, 2014, Detecting and estimating anisotropy errors using full waveform inversion
and ray-based tomography: A case study using long-offset acquisition in the Gulf of Mexico: 84th
Annual International Meeting, SEG, Expanded Abstract, 1066–1071,
http://dx.doi.org/10.1190/segam2014-0324.1.
Prieux, V., G. Lambaré, S. Operto, and J. Virieux, 2013, Building starting model for full waveform
inversion from wide-aperture data by stereotomography: Geophysical Prospecting, 61, 109–137,
http://dx.doi.org/10.1111/j.1365-2478.2012.01099.x.
Qin, B., V. Prieux, H. Bi, A. Ratcliffe, J.-P. Montel, D. Carotti, and G. Lambaré, 2014, Towards high-
frequency full waveform inversion-A Case Study: 76th Conference and Exhibition, EAGE,
Expanded Abstracts, We E106 07, http://dx.doi.org/10.3997/2214-4609.20141086.
Sedova, A., G. Royle, O. Hermant, M. Retailleau, and G. Lambaré, 2017, High-resolution land full-
waveform inversion: a case study on a data set from the Sultanate of Oman: 79th Conference and
Exhibition, EAGE, Expanded Abstracts, http://dx.doi.org/10.3997/2214-4609.201701163.
Stopin, A., R.-É. Plessix, and S. Al Abri, 2014, Multiparameter waveform inversion of a large wide-
azimuth low-frequency land data set in Oman: Geophysics, 79, WA69—WA77,
http://dx.doi.org/10.1190/geo2013-0323.1.
Virieux, J., and S. Operto, 2009, An overview of full-waveform inversion in exploration geophysics:
Geophysics, 74, no. 6, WCC1–WCC26, http://dx.doi.org/10.1190/1.3238367.
Woodward, M. J., D. Nichols, O. Zdraveva, P. Whitfield, and T. Johns, 2008, A decade of tomography:
Geophysics, 73, no. 5, VE5–VE11, http://dx.doi.org/10.1190/1.2969907.
Xie Y., B. Zhou, J. Zhou, J. Hu, L. Xu, X. Wu, N. Lin, F. C. Loh, L. Liu, and Z. Wang, 2017,
Orthorhombic full-waveform inversion for imaging the Luda field using wide-azimuth ocean-
bottom-cable data: The Leading Edge, 36, 75–80, http://dx.doi.org/10.1190/tle36010075.1
SUMMARY P-wave velocity and Q it is, in contrast, the differences in the frequency-
dependence of scattering amplitudes which permits them to be distin-
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Simultaneous use of data within relatively broad frequency bands is es- guished (Innanen and Weglein, 2007; Hak and Mulder, 2011). This
sential to discriminating between velocity and Q errors in the construc- fact ties together, in an unusually close manner, issues of multiscale
tion of an-acoustic full waveform inversion (QFWI) updates. Individ- FWI, in which iterations or groups of iterations involve different fre-
ual frequencies or narrow bands in isolation cannot provide sufficient quency bands, parameter cross-talk, and the degree of approximation
information to resolve cross-talk issues in a surface seismic acquisition with which off-diagonal elements of the inverse Hessian are incorpo-
geometry. Truncated Newton (TN) optimization methods offer the po- rated through TN iterations. In this paper we analyze this relationship
tential for reducing computational cost while incorporating approxi- in the context of synthetic an-acoustic frequency domain FWI. Be-
mate versions of the Newton update to reduce these cross-talk issues, cause the exact manner in which dispersion is modelled determines
with the trade-off being mediated by the chosen number of inner TN the character of the cross-talk, the attenuation model type, which must
iterations. In fact, in TN-QFWI we are able to choose between two be selected prior to formulating a detailed FWI algorithm, plays a key
qualitatively distinct “modes” of an-acoustic inversion: one in which role. This issue is discussed in a companion paper (Keating and Inna-
the estimation of a velocity model uncorrupted by the influence of Q is nen, 2017).
the desired outcome, and another in which both a velocity model and
a Q model are the desired outcomes. Both can in principle be accom-
plished in the context of TN-QFWI, with the former at significantly THEORY
reduced computational expense.
Cross-talk
INTRODUCTION
The FWI problem considered here has an objective function given by
Full waveform inversion (FWI) is a technique which attempts to re-
cover the true subsurface parameters by iteratively minimizing the 1
φ (m) = ||dobs − dmod ||22 , (1)
difference between measured data and modeled data generated from 2
the current estimated subsurface parameters (Lailly, 1983; Tarantola,
1984; Virieux and Operto, 2009). While multiparameter versions of where φ , a function of the subsurface model m, measures the discrep-
FWI have been formulated and studied, the majority of research on ancy between the measured data dobs and the modelled data dmod . To
FWI is focused on a single parameter problem, specifically that in recover the true properties of the subsurface, this objective is mini-
which acoustic wave propagation is assumed and density is treated mized in FWI through gradient-based, or Newton type updates.
as constant. In this problem, only P-wave velocity varies in the model. Cross-talk is the phenomenon where data residuals introduced by an
However, in the effort to make FWI effective in the determination of error in one model parameter are attributed to errors in the estimate of
larger numbers of smaller scale (e.g., reservoir) properties, the multi- another parameter. For example, cross-talk is present if an estimate of
parameter FWI problem must be brought to bear. In multiparameter density is modified due to data residuals introduced by errors in a ve-
FWI (e.g., Operto et al., 2013; Plessix et al., 2013; Pan et al., 2016), locity estimate. Cross-talk is a major concern in FWI, as it can severely
allowance is made in the gradient/Hessian quantities for simultaneous harm the accuracy of the recovered model and the convergence of the
and independent variations of several parameters, either to support ve- scheme (e.g., Plessix et al., 2013; Innanen, 2014; Pan et al., 2016).
locity model building, or to push towards elastic characterization of Gradient updates are particularly vulnerable to cross-talk. This is due
the subsurface (Tarantola, 1986; Choi et al., 2008). to the fact that the gradient considers only the derivative of the objec-
tive function with respect to each variable parameterizing the model.
Attenuation and dispersion play important roles in both of these ap- If changes in several different variables can reduce the same part of
plications of multiparameter FWI. It can be a powerful nuisance to the data residual, all will be changed in a gradient update.
acoustic and elastic FWI, strongly influencing the amplitude and phase
of the waveforms we would like to interrogate for acoustic/elastic in- Newton optimization employs both the first order (gradient) and sec-
formation, but it can also be a rich source of information by which ond order (Hessian) derivatives of the objective function. In Newton
fluids and viscosities can be determined or discriminated. So, we can optimization, the update p is given by
also distinguish between whether we wish to specifically determine Q
in FWI, or merely “protect” the recovery of other parameters from its p = −H−1 g , (2)
influence. Either motivation requires that the physics of attenuation be
included in an FWI scheme. An-acoustic FWI (QFWI for short), in where g is the gradient of the objective function, and H is the Hessian
which attenuation and dispersion parameters are determined simulta- matrix. The Hessian provides information about how the derivative
neously alongside their elastic counterparts, has been carefully investi- with respect to one variable will change as another variable changes.
gated (e.g., Hak and Mulder, 2011; Hicks and Pratt, 2001; Malinowski This helps to prevent several variables from being used in reducing the
et al., 2011; Kamei and Pratt, 2013; Métivier et al., 2015). In much data residual introduced by an error in one, mitigating cross-talk. Un-
of this existing research, however, incorporating attenuation is treated fortunately, in realistic FWI applications, Newton optimization tends
as a small addition to the classical acoustic/elastic FWI problem, with not to be a viable approach, because of the excessive cost for the stor-
relatively little focus on how the nature of the problem changes. Pa- age and inversion of the Hessian.
rameter cross-talk, in which one parameter is mistakenly updated to
account for data residuals caused by another, affects an-acoustic FWI Optimization
significantly and in a unique manner requiring special study.
Two approaches which attempt to approximate exact Newton opti-
Simultaneous variations in acoustic and/or elastic properties can be mization but at reduced cost are quasi-Newton methods and truncated
separately estimated in FWI primarily because of differences in the Newton methods. Quasi-Newton methods obtain an exact solution
angle-dependence of scattering from one parameter to another. With to an equation approximating equation 2, whereas truncated Newton
(TN) methods are those which obtain an approximate solution to equa- where c is the acoustic wave velocity at the reference frequency ω0 ,
tion 2. Both attempt to provide an efficient alternative to exact Newton and Q is the quality factor. For a chosen ω0 , the an-acoustic FWI prob-
optimization while still retaining important information about the Hes- lem is to determine the unknown spatial distributions of two parame-
sian, which helps to mitigate cross-talk. In this report we focus on the ters, c and Q. Inspection of equation 9 identifies a specific challenge
TN method. that the QFWI problem faces. The size of the frequency dependent
term in s, which models dispersion, is determined by Q. In effect, both
TN optimization is similar to exact Newton optimization, but rather c and Q co-determine the wave velocity at a given frequency. This
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
than directly solving 2, an approximate solution is obtained by itera- opens the possibility of considerable cross-talk, and is suggestive that
tively minimizing (Nocedal and Wright, 2006) variations from one frequency to another will be instrumental in miti-
gating it.
1 T
θ (p) = p Hp + gT p. (3)
2 Predicting cross-talk with an-acoustic scattering potentials
At a minimum of this objective function, the gradient of θ is zero, so The radiation patterns, or scattering potentials, of point perturbations
in active FWI parameters, plotted as functions of experimental vari-
Hp + g = 0, (4) ables (e.g., angle between incoming and outgoing rays, frequency,
etc.), are often used to determine the degree of expected parameter
satisfying equation 2. In this research, the truncated Gauss-Newton cross talk in multi-parameter FWI. Parameters which generate poten-
method is used, where H is replaced with HGN , the residual indepen- tials with proportional amplitude variations over a given range of these
dent part of the Hessian. Following Metivier et al. (2013), FWI updates experimental variables are easily confused with one another. The scat-
are iteratively constructed in what will be called the outer loop, and the tering potential V for position x and frequency ω associated with our
minimization of equation 3, which occurs once for each FWI update, chosen an-acoustic wave equation is
but is itself iterative, involves what will be called the inner loop. Pro-
ω2
vided a suitable optimization approach is employed in the inner loop, V (x, ω) ≈ − [VQ (x, ω) +Vc (x, ω)] , (10)
this method does not require the storage or inversion of the Hessian c0 (x)2
matrix HGN , only the product of the Hessian with an arbitrary p. This where
Hessian-vector product can be efficiently calculated using the adjoint
F(ω) F(ω)
state method, as described in Metivier et al. (2013). VQ (x, ω) = ∆Q(x), Vc (x, ω) = 1 + ∆c(x) (11)
Q0 (x) Q0 (x)
We implement the inner loop of the TN FWI algorithm with a BFGS
and where F(ω) = i − (2/π) log(ω/ω0 ); the ∆· quantities,
inner solver, wherein p is determined by iteratively solving
Q0 (x) c0 (x)2
pk = pk−1 + αk ∆p, where ∆p = −Q∇pk−1 , (5) ∆Q(x) = 1 − , ∆c(x) = 1 − , (12)
Q(x) c(x)2
Q is the BFGS approximation of the inverse Hessian of θ (which is represent localized jumps in their corresponding model parameters Q
the same as the inverse Hessian of φ ), and and c.
NUMERICAL EXAMPLES
REFERENCES
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Choi, Y., D. Min, and C. Shin, 2008, Two-dimensional waveform inversion of multi-component data in
acoustic-elastic coupled media: Geophysical Prospecting, 56, 863–881,
http://doi.org/10.1111/j.1365-2478.2008.00735.x.
Hak, B., and W. Mulder, 2011, Seismic attenuation imaging with causality: Geophysical Journal
International, 184, 439–451, https://doi.org/10.1111/j.1365-246X.2010.04848.x.
Hicks, G., and R. Pratt, 2001, Reflection waveform inversion using local descent methods: Estimating
attenuation and velocity over a gas-sand deposit: Geophysics, 66, 598–612,
https://doi.org/10.1190/1.1444951.
Innanen, K. A., 2014, Seismic AVO and the inverse Hessian in precritical reflection full waveform
inversion: Geophysical Journal International, 199, 717–734, https://doi.org/10.1093/gji/ggu291.
Innanen, K. A., and A. B. Weglein, 2007, On the construction of an absorptive-dispersive medium model
via direct linear inversion of reflected seismic primaries: Inverse Problems, 23, 2289–2310,
https://doi.org/10.1088/0266-5611/23/6/001.
Kamei, R., and R. Pratt, 2013, Inversion strategies for visco-acoustic waveform inversion: Geophysical
Journal International, 194, 859–884, https://doi.org/10.1093/gji/ggt109
Keating, S., and K. A. Innanen, 2017, Characterizing and mitigating uncertainty in the physics of
attenuation in an-acoustic full waveform inversion: 87th Annual International Meeting, SEG,
Expanded Abstracts (submitted).
Lailly, P., 1983, The seismic inverse problem as a sequence of before stack migrations: Conference on
Inverse Scattering, Theory and Application, Society for Industrial and Applied Mathematics,
Expanded Abstracts, 206–220.
Malinowski, M., S. Operto, and A. Ribodetti, 2011, High-resolution seismic attenuation imaging from
wide-aperture onshore data by visco-acoustic frequency-domain full-waveform inversion:
Geophysical Journal International, 186, 1179–1204, https://doi.org/10.1111/j.1365-
246X.2011.05098.x.
Metivier, L., R. Brossier, J. Virieux, and S. Operto, 2013, Full waveform inversion and the truncated
newton method: Siam Journal on Scientific Computing, 35, B401–B437,
https://doi.org/10.1137/120877854.
Metivier, L., R. Brossier, S. Operto, and J. Virieux, 2015, Acoustic multi-parameter FWI for the
reconstruction of P-wave velocity, density and attenuation: preconditioned truncated Newton
approach: 85th Annual International Meeting, SEG, Expanded Abstracts, 1198–1203,
https://doi.org/10.1190/segam2015-5875643.1.
Nocedal, J., and P. S. Wright, 2006, Numerical optimization (2nd ed.): Springer.
Operto, S., Y. Gholami, V. Prieux, A. Ribodetti, R. Brossier, L. Metivier, and J. Virieux, 2013, A guided
tour of multiparameter full-waveform inversion with multicomponent data: From theory to
practice: The Leading Edge, 32, 1040–1054, https://doi.org/10.1190/tle32091040.1.
Pan, W., K. A. Innanen, G. F. Margrave, M. Fehler, X. Fang, and J. Li, 2016, Estimation of elastic
constants for HTI media using Gaussnewton and full Newton multi-parameter full waveform
inversion: Geophysics, 81, no. 5, E323–E339, https://doi.org/10.1190/geo2015-0594.1.
Plessix, R.-E., P. Milcik, H. Rynia, A. Stopin, K. Matson, and S. Abri, 2013, Multiparameter full-
waveform inversion: Marine and land examples: The Leading Edge, 32, 1030–1038,
https://doi.org/10.1190/tle32091030.1.
z (km)
initial value, such that near-offset reflections are still matched Caprock
3 3
during Step [2]. a) b)
4 4
(km/s)
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
z (km)
3 3
We apply JFWI to an OBC line acquired in the North Sea. c) d)
4 4
A velocity model has been built by 3D reflection traveltime to-
mography (Fig. 1a), which shows a low-velocity gas area (blue 60 80 100 120 140 160 180 200
5 7 9 11 13 15 5 7 9 11 13 15
area) embedded in the soft sediments above the caprock at 2.4 1 1
km depth (Barkved et al., 2010; Haller et al., 2016). Using 2 2
z (km)
the tomographic model as the initial model, various applica- 3 3
tions of 3D FWI (e.g., Sirgue et al., 2010; Operto et al., 2015) 4
e)
4
f)
Joint FWI 2 2
z (km)
We shall use two initial models of increasing accuracy to test 3 3
a) b)
the sensitivity of JFWI to the initial model. The first one is 4 4
1 1
to remove the reflectivity component followed by a lateral av-
2 2
eraging (Fig. 3a). Hence, the purpose of using this 1D model
z (km)
3 3
is to assess whether JFWI can recover the lateral variations c) d)
4 4
generated by the gas cloud. We assess the velocity model by
computing migrated images or IP perturbations by IpWI. The
flatness of the sediment-caprock interface, revealed by former
studies, is used to assess the faithfulness of the velocity in the
gas cloud (Fig. 1e,f). The migrated image computed in the 1D
initial model shows the deepening of the sediment-caprock in-
terface below the gas cloud (Fig. 3c), which results from over-
estimated velocities in the overburden.
Fig. 3b shows the inversion result. Not having imaged the gas
cloud, JFWI creates a pair of high-velocity anomalies along re-
flection paths above the caprock together with a low-velocity
blob in the middle of the gas cloud. This velocity artifacts
in the overburden leads to an inaccurate migrated image high-
lighted by the non-flatness of the caprock reflector (Fig. 3d). Figure 3: (a) 1D initial model. (b) Final JFWI velocity model. (c,d)
Fig. 3c,d shows the data fit achieved by the 1D initial and the Corresponding migrated images. (e,f) Data fit at 5.1 Hz. Note the
synclinal-shape caprock image resulting from overestimated velocities
resulting JFWI VP models, in which the IP perturbations are
in the overburden.
added to generate reflected waves. Both VP models provide
good data match at short offsets. The postcritical reflections Desired shift
are missing in the synthetic data; these phases may either not
Before JFWI
is shown after JFWI; however, this is a cycle-skipped fit. To Env. of recorded trace
Modeled trace Cycle skipped shift
verify this statement, we show the seismograms at the x = 5 Env. of modeled trace
3 3
Fig. 5b shows the JFWI result. No cycle skipping is witnessed. a) b)
4 4
Sufficiently low velocities are recovered in the gas cloud pre-
5 7 9 11 13 15 5 7 9 11 13 15
serving flat structures in the migrated image (Fig. 5d). To 1 1
further assess this JFWI model as an initial model for FWI, 2 2
z (km)
2 2
z (km)
we have a pity cycle-skipping phenomenon for near-offset di- access to the HPC resources of CIMENT infrastructure and CINES/IDRIS
rect waves at the border of the near-surface (gray arrow). A 3D under the allocation 046091 made by GENCI. The authors thank BP
Norge AS and Hess Norge AS for providing the data set and the per-
inversion combined with more prudent frequency continuation
mission to publish this work. The first author appreciates fruitful dis-
schemes may help solve this problem (Operto et al., 2015). cussions with F. Audebert (TOTAL), H. Chauris (MinesParisTech),
Standard FWI A. Górszczyk (PAN), G. Lambaré (CGG), L. Métivier (LJK-CNRS,
UGA), and R.-E. Plessix (Shell).
To assess the effectiveness of JFWI in initial macro model
building for FWI, we perform FWI using the 2D smooth model
(Fig. 5a) as initial model. The result is shown in Fig. 5f.
We also show in Fig. 7 the direct comparison between the two
FWI results along an horizontal profile at 2.3 km depth cross-
ing the gas cloud. Both suggest that the FWI result starting
from the JFWI model has a higher lateral resolution in the gas
cloud with a richer low wavenumber content than the other
FWI result. This is because JFWI has succeeded to update
the low horizontal wavenumbers along sub-vertical wavepaths
connecting the reflectors to the surface, whereas FWI is more
suitable to update the low vertical wavenumbers along the wave-
paths associated with wide-aperture arrivals (diving waves and
postcritical reflections).
Such deficit of low wavenumbers in the FWI model of Fig. 5f
cannot be easily detected when we assess the accuracy of the
model through data fit (Fig. 6d). The improved match of div-
ing wave by the standard FWI model is undesirable as these
waves have underwent out-of-plane propagation. In order to
reduce the misfit function, standard FWI may have shifted the
synthetic postcritical reflections to earlier traveltimes, leading
to overestimated velocities in the gas cloud as shown in Fig. 7,
black arrow. In contrast, it seems that the higher sensitivity
of JWI to lateral subsurface variations has prevented such ar-
tifacts and helped recover the low velocities in the gas cloud
although the misfit value is slightly higher.
CONCLUSIONS Figure 6: Data fit at 5.1 Hz (a,b) and 7.1 Hz (c,d). Synthetics are
computed in (a) 2D initial model (Fig. 5a), (b) JFWI model (Fig. 5b),
(c) FWI result using JFWI model (Fig. 5e), and (d) FWI result using
We have combined the sensitivity kernels associated to early
2D initial model (Fig. 5f). Compared with (c), the FWI data fit of
arrivals and reflections for velocity macromodel building that (d) shows a over-fitting of the postcritical reflections (black arrows)
is suitable for standard FWI implementation. The approach despite a better match of shallow transmitted waves (gray arrows).
has been applied to a 2D OBC data set collected across a gas
cloud. We have considered attenuation in the wave simulation 2.2
part of the inversion, the significance of which has been illus-
Vp (km/s)
REFERENCES
Barkved, O., U. Albertin, P. Heavey, J. Kommedal, J. van Gestel, R. Synnove, H. Pettersen, and C. Kent,
2010, Business impact of full waveform inversion at Valhall: 91st SEG Annual International
Meeting, https://doi.org/10.1190/1.3513929.
Brossier, R., S. Operto, and J. Virieux, 2015, Velocity model building from seismic reflection data by full
waveform inversion: Geophysical Prospecting, 63, 354–367, https://doi.org/10.1111/1365-
2478.12190.
Bunks, C., F. M. Salek, S. Zaleski, and G. Chavent, 1995, Multiscale seismic waveform inversion:
Geophysics, 60, no. 5, 1457–1473, https://doi.org/10.1190/1.1443880.
Carcione, J., D. Kosloff, and R. Kosloff, 1988, Wave propagation simulation in a linear viscoacoustic
medium: Geophysical Journal International, 93, 393–401, https://doi.org/10.1111/j.1365-
246x.1988.tb02010.x.
Emmerich, H., and M. Korn, 1987, Incorporation of attenuation into time-domain computation
: Geophysics, 52, 1252–1264, https://doi.org/10.1190/1.1442386.
Haller, N., R. Flateboe, C. Twallin, V. Dahl-Eriksen, P. Heavey, E. Kjos, R. Milne, and W. Rietveld,
2016, Valhall case study — value of seismic technology for reducing risks in a reactive
overburden: Presented at the 78th EAGE Annual Meeting, https://doi.org/10.3997/2214-
4609.201600817.
Jannane, M., W. Beydoun, E. Crase, D. Cao, Z. Koren, E. Landa, M. Mendes, A. Pica, M. Noble, G.
Roeth, S. Singh, R. Snieder, A. Tarantola, and D. Trezeguet, 1989, Wavelengths of Earth
structures that can be resolved from seismic reflection data: Geophysics, 54, 906–910,
https://doi.org/10.1190/1.1442719.
Kurzmann, A., A. Przebindowska, D. Kohn, and T. Bohlen, 2013, Acoustic full waveform tomography in
the presence of attenuation: a sensitivity analysis: Geophysical Journal International, 195, 985–
1000, https://doi.org/10.1093/gji/ggt305.
Luo, Y., Y. Ma, Y. Wu, H. Liu, and L. Cao, 2016, Full-traveltime inversion: Geophysics, 81, R261–
R274, https://doi.org/10.1190/geo2015-0353.1.
Metivier, L., R. Brossier, Q. Merigot, E. Oudet, and J. Virieux, 2016, Measuring the misfit between
seismograms using an optimal transport distance: Application to full waveform inversion:
Geophysical Journal International, 205, 345–377, https://doi.org/10.1093/gji/ggw014.
Moczo, P., and J. Kristek, 2005, On the rheological models used for time-domain methods of seismic
wave propagation: Geophysical Research Letters, 32, https://doi.org/10.1029/2004gl021598.
Operto, S., R. Brossier, Y. Gholami, L. Metivier, V. Prieux, A. Ribodetti, and J. Virieux, 2013, A guided
tour of multiparameter full waveform inversion for multicomponent data: from theory to practice:
The Leading Edge, 32, 1040–1054, https://doi.org/10.1190/tle32091040.1.
Operto, S., A. Miniussi, R. Brossier, L. Combe, L. Metivier, V. Monteiller, A. Ribodetti, and J. Virieux,
2015, Efficient 3-D frequency-domain mono-parameter full-waveform inversion of ocean-bottom
cable data: application to Valhall in the visco-acoustic vertical transverse isotropic
approximation: Geophysical Journal International, 202, 1362–1391,
https://doi.org/10.1093/gji/ggv226.
Plessix, R., 2016, Visco-acoustic full waveform inversion: Presented at the 78th EAGE Annual Meeting,
https://doi.org/10.3997/2214-4609.201600827.
Sirgue, L., O. I. Barkved, J. Dellinger, J. Etgen, U. Albertin, and J. H. Kommedal, 2010, Full waveform
inversion: the next leap forward in imaging at Valhall: First Break, 28, 65–70.
Virieux, J., and S. Operto, 2009, An overview of full waveform inversion in exploration geophysics:
Geophysics, 74, WCC1-WCC26.
Wang, S., F. Chen, H. Zhang, and Y. Shen, 2013, Reflection-based full waveform inversion (RFWI) in
the frequency domain: SEG Technical Program Expanded Abstracts, SEG, 877–881,
https://doi.org/10.1190/segam2013-0671.1.
Warner, M., and L. Guasch, 2014, Adaptative waveform inversion — FWI without cycle skipping —
theory: 76th EAGE Conference and Exhibition 2014, We E106 13.
Xu, S., D. Wang, F. Chen, G. Lambare, and Y. Zhang, 2012, Inversion on reflected seismic wave: SEG
Technical Program Expanded Abstracts 2012, 1–7, https://doi.org/10.1190/segam2012-1473.1.
Yang, P., R. Brossier, L. Metivier, and J. Virieux, 2016, A review on the systematic formulation of 3D
multiparameter full waveform inversion in viscoelastic medium: Geophysical Journal
International, 207, 129–149, https://doi.org/10.1093/gji/ggw262.
Zhou, W., R. Brossier, S. Operto, and J. Virieux, 2015, Full waveform inversion of diving and reflected
waves for velocity model building with impedance inversion based on scale separation:
Geophysical Journal International, 202, 1535–1554, https://doi.org/10.1093/gji/ggv228.
Zhou, W., R. Brossier, S. Operto, and J. Virieux, 2016, Joint full waveform inversion of early arrivals and
reflections: A real obc case study with gas cloud: SEG Technical Program Expanded Abstracts
2016, SEG, 1247–1251, https://doi.org/10.1190/segam2016-13954099.1.
EXAMPLES km. The two surveys are shifted by 100 meters with respect to
each other. Our starting velocity model was obtained from the
We generated synthetic data using acoustic modeling with den- true baseline model using a 400 m smoothing filter. We use
sity. The true baseline velocity model used in our experiments frequency continuation from 10 to 30 Hz, with the maximum
is shown in Figure 1, the difference between the monitor and frequency for each experiment determined using the method
baseline is shown in the zoomed-in Figure 2. The true density of Sirgue and Pratt (2004). In each experiment we conduct a
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
model and model difference are obtained from the true velocity broadband (from 0 to the maximum frequency) time-domain
model and model difference by dividing them by 1500 (setting full-waveform inversion to convergence, using the objective
water density to 1). functions of Equations 1 and 2 at the first stage of the inver-
sion, and Equations 3 and 4 at the second stage. We intention-
The true velocity difference of Figure 2 was designed to imi- ally avoid density inversion in order to demonstrate the effect
tate three large 30 m-thick reservoir compartments, and three of amplitude errors on our 4D FWI.
smaller compartments located up-dip from a partially perme-
able fault. Velocity changes of −300, −200 and 150 m/s are
prescribed in the large compartments to model the effect of gas
coming out of the solution (Johnston, 2013) and water substi-
tution (in the lowest compartment). The three small compart-
ments have only negative velocity changes of −200, −100, and
−50 m/s to model the effect of gas migrating up-dip through
the partially conductive fault.
CONCLUSIONS
ACKNOWLEDGMENTS
REFERENCES
Alemie, W., and M. Sacchi, 2016, Joint reparametrized time-lapse full-waveform inversion: 86th Annual
International Meeting, SEG, Expanded Abstracts, 1309–1314,
http://dx.doi.org/10.1190/segam2016-13879371.1.
Asnaashari, A., R. Brossier, S. Garambois, F. Audebert, P. Thore, and J. Virieux, 2012, Time-lapse
imaging using regularized FWI: A robustness study: 82nd Annual International Meeting, SEG,
Expanded Abstracts, 1–5, http://dx.doi.org/10.1190/segam2012-0699.1.
Ayeni, G., 2011, Time-lapse seismic imaging by linearized joint inversion: Ph.D. thesis, Stanford
University.
Ayeni, G., and B. Biondi, 2012, Time-lapse seismic imaging by linearized joint inversion — A Valhall
Field case study: 82nd Annual International Meeting, SEG, Expanded Abstracts, 1–6,
http://dx.doi.org/10.1190/segam2012-0903.1.
Girard, A., and I. Vasconcelos, 2010, Image-domain time-lapse inversion with extended images: 80th
Annual International Meeting, SEG, Expanded Abstracts, 4200–4204,
http://dx.doi.org/10.1190/1.3513744.
Johnston, D., 2013, Practical applications of time-lapse seismic data: SEG.
Maharramov, M., and U. Albertin, 2007, Localized image-difference wave-equation tomography: 77th
Annual International Meeting, SEG, Expanded Abstracts, 3009–3013,
http://dx.doi.org/10.1190/1.2793096.
Maharramov, M., and B. Biondi, 2014, Joint full-waveform inversion of time-lapse seismic data sets:
84th Annual International Meeting, SEG, Expanded Abstracts, 954–959,
http://dx.doi.org/10.1190/segam2014-0962.1.
Maharramov, M., and B. L. Biondi, 2017, Full waveform inversion for reservoir monitoring — Pushing
the limits of subsurface resolution: 79th Annual International Conference and Exhibition, EAGE,
Extended Abstracts, We PRM 16, http://dx.doi.org/10.3997/2214-4609.201700025.
Maharramov, M., B. L. Biondi, and M. A. Meadows, 2016, Time-lapse inverse theory with applications:
Geophysics, 81, no. 6, R485–R501, http://dx.doi.org/10.1190/geo2016-0131.1.
Maharramov, M., B. Biondi, and S. Ronen, 2015, Robust simultaneous time-lapse full-waveform
inversion with total-variation regularization of model difference: 77th Annual International
Conference and Exhibition, EAGE, Extended Abstracts, We P3 09,
http://dx.doi.org/10.3997/2214-4609.201413085.
Qu, S., and D. Verschuur, 2016, Getting accurate time-lapse information using geology-constrained
simultaneous joint migration-inversion: 86th Annual International Meeting, SEG, Expanded
Abstracts, 5451–5456, http://dx.doi.org/10.1190/segam2016-13964374.1.
Raknes, E., W. Weibull, and B. Arntsen, 2013, Time-lapse full waveform inversion: Synthetic and real
data examples: 83rd Annual International Meeting, SEG, Expanded Abstracts, 944–948,
http://dx.doi.org/10.1190/segam2013-0540.1.
Routh, P. S., J. R. Krebs, S. Lazaratos, A. I. Baumstein, I. Chikichev, N. Downey, D. Hinkley, and J. E.
Anderson, 2011, Full-wavefield inversion of marine streamer data with the encoded simultaneous
source method: 73rd Annual International Conference and Exhibition, EAGE, Extended
Abstracts, F032, http://dx.doi.org/10.3997/2214-4609.20149730.
http://dx.doi.org/10.1071/ASEG2012ab197.
Sirgue, L., and R. Pratt, 2004, Efficient waveform inversion and imaging: A strategy for selecting
temporal frequencies: Geophysics, 69, 231–248, http://dx.doi.org/10.1190/1.1649391.
Willemsen, B., and A. Malcolm, 2015, Regularizing velocity differences in time-lapse FWI using
gradient mismatch information: 85th Annual International Meeting, SEG, Expanded Abstracts,
5384–5388, http://dx.doi.org/10.1190/segam2015-5908610.1.
Yang, D., A. E. Malcolm, and M. C. Fehler, 2014, Time-lapse full waveform inversion and uncertainty
analysis with different survey geometries: 76th A Annual International Conference and
Exhibition, EAGE, Extended Abstracts, We ELI1 10, http://dx.doi.org/10.3997/2214-
4609.20141120.
Zheng, Y., P. Barton, and S. Singh, 2011, Strategies for elastic full waveform inversion of timelapse
ocean bottom cable (OBC) seismic data: 81st Annual International Meeting, SEG, Expanded
Abstracts, 4195–4200, http://dx.doi.org/10.1190/1.3628083.
Multiple seismic data sets are often recorded to monitor terizing the uncertainty pixel-by-pixel in a large model is not
changes in Earth properties. To image these changes, sev- computationally feasible and it is not clear that doing so would
eral different 4D full waveform inversion (FWI) schemes have help in the interpretation due to the volume of information gen-
been successfully applied over the past decade. We compare erated. We focus instead on the idea of characterizing the un-
three different 4D FWI schemes on two simple numerical ex- certainty of key elements of the image. As a first step towards
amples to quantify how each method performs. To do this, we this goal, we estimate some parameters of the image which
create correlated gaussian noise realizations and add them to we refer to as spatial characteristics, and we explore how are
our models to determine how errors in the models are trans- recovered by different FWI methods.
lated to errors in the final images. We computed spatial char-
acteristics of the recovered models and compare the perfor-
mance of the different 4D FWI schemes. Our results indicate THEORY
that while there are minor differences between the different
proposed methods all perform reasonably well for this type of The most commonly used objective function in FWI is a least
noise in these simple models. The methods that specifically squares measure:
target 4D changes do result in fewer artifacts outside the re-
1
gion of true change, but all methods recover the true change E(m) = kF(m) − dk2 , (1)
with similar accuracy. 2
1 1
E(m0 , m1 ) = kF(m0 ) − d0 k2 + kF(m1 ) − d1 k2
2 2
1 m0 − m1 2
+ k k , (3)
2 β
Figure 1: Horizontal reflectors example: True time-lapse
where m0 and d0 are the baseline model and data, m1 and d1
change together with the recovered from all three FWI
are the monitor model and data, and β is calculated via:
schemes time-lapse changes for the same model realization.
X
β= (1 − sgn[(mi−1 − mi )(mi+1 − mi )])|mi+1 − mi |. (4)
i
image stacking procedure as follows. For each horizontal po-
sition, we compute the distance ∆z between the recovered re-
In the next section, we describe the two numerical examples
flectors by taking the maximum value of the image in a win-
and the spatial characteristics calculation for the respective ex-
dow. We then average the recovered distances over the entire
amples. We then compare the recovered spatial characteristics
image to obtain an average ∆z for a particular image. Fig-
for the three methods on two simple models.
ure 2 shows the histograms of all the recovered ∆z for each of
the FWI schemes together with their calculated standard devi-
NUMERICAL EXAMPLES ations.
also consider methods such as that proposed by Hale (Hale, when adding noise to the velocity, rather than to the squared
2013) that warp one image or data set onto another to more slowness, we get significantly poorer recovered images. This
quickly compute the changes in a data set caused by small is likely because the wave equation depends linearly on the
perturbations in the model. This would open up the possibil- squared slowness but non linearly on the velocity, resulting
ity of using more robust statistical inversion techniques.Since in a deterioration in the recovered models in the latter case.
we are comparing different methods, its good to compare their This deterioration of results primarily flattens the associated
computational cost. AFWI takes almost 5 times as long as histograms and is particularly noticeable for the parallel FWI
DDFWI and Parallel FWI, which may become important in case. Our results also indicate that both AFWI and DDFWI
more complex models. Last but not least, the surveys consid- are successful at attenuating artifacts outside of the region of
ered were perfectly repeatable, which is clearly a big assump- true change.
tion. Changing acquisitions is a situation in which the extra
cost of AFWI may be justified as it is equipped to handle such
changes where DDFWI is not.
Figure 6: Case
1 2 noise results from all three methods. Left:
Histograms of the area deviations in depth dz. Right: His-
tograms of area deviation in the total amount of depth.
Figure 5: Case 1 noise results from all three FWI
schemes.Left: Histograms of the area deviations in depth dz.
Right: Histograms of area deviation in the total depth.
ACKNOWLEDGMENTS
REFERENCES
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Asnaashari, R., R. Brossier, S. Garambois, F. Audebert, P. Thore, and J. Virieux, 2015, Time-lapse
seismic imaging using regularized full-waveform inversion with a prior model: Which strategy?:
Geophysical Prospecting, 63, http://dx.doi.org/10.1111/1365-2478.12176.
Yang, D., M. Meadows, P. Inderwiesen, J. Landa, A. Malcolm, and M. Fehler, 2015, Double Difference
Waveform Inversion: Feasibility and Robustness Study with Pressure Data: Geophysics, 80,
M129–M141, http://dx.doi.org/10.1190/geo2014-0489.1.
Denli, H., and L. Huang, 2009, Double difference elastic waveform inversion tomography in the time
domain: 79th Annual International Meeting, SEG, Expanded Abstracts, 28, 2302–2306,
https://doi.org/10.1190/1.3255320.
Hale, D., 2013, Dynamic warping of seismic images: Geophysics, 78, S105–S115,
http://dx.doi.org/10.1190/geo2012-0327.1.
Hewett, R., and L. Demanet, 2013, The pysit team, pysit: Python seismic imaging toolbox v0.5: Release
0.6.
Kotsi, M., and A. Malcolm, 2017, Estimating the Error Distribution of Recovered Changes in Earth
Properties with Full – Waveform Inversion: 13th International Conference on Mathematical and
Numerical Aspects of Wave Propagation, Extended Abstract.
Lumley, D. E., 2001, Time-lapse seismic reservoir monitoring: Geophysics, 66, 50–53,
http://dx.doi.org/10.1190/1.1444921.
Maharramov, M., and B. Biondi, 2014, Joint full waveform inversion of time-lapse seismic data sets: 84th
Annual International Meeting, SEG, Expanded Abstracts, 1–5,
http://dx.doi.org/10.1190/segam2014-0962.1.
Orange, A., K. Key, and S. Constable, 2009, The feasibility of reservoir monitoring using time-lapse
marine CSEM: Geophysics, 74, F21–F29, http://dx.doi.org/10.1190/1.3059600.
Tarantola, A., 1984, Inversion of seismic reflection data in the acoustic approximation: Geophysics, 49,
1259–1266, http://dx.doi.org/10.1190/1.1441754.
Virieux, J. and S. Operto, 2009, An overview of full-waveform inversion in exploration geophysics:
Geophysics, 74, WCC1–WCC26, http://dx.doi.org/10.1190/1.3238367.
Waldhauser, F., and W. L. Ellsworth, 2000, A double difference earthquake location algorithm: method
and application to the Northern Hayward Fault, California: Bulletin of the seismological society
of America, 90, 1353–1368, http://dx.doi.org/10.1785/0120000006.
Watanabe, T., S. Shimizu, E. Asakawa, T. Matsuoka, 2005, Differential waveform tomography for time-
lapse crosswell seismic data with application to gas hydrate production monitoring: 75th Annual
International Meeting, SEG, Expanded Abstracts, http://dx.doi.org/10.1190/1.1845221.
Zheng, Y., P. Barton, and S. Singh, 2011, Strategies for elastic full waveforms inversion of time-lapse
ocean bottom cable (OBC) seismic data: 81st Annual International Meeting, SEG, Expanded
Abstracts, 4195–4200, http://dx.doi.org/10.1190/1.3628083.
Yang, D., M. Fehler, A. Malcolm, F. Liu, and S. Morton, 2013, Double difference waveform inversion of
4D ocean bottom cable data: Application to Valhall, North Sea: 83rd Annual International
Meeting, SEG, Expanded Abstracts, 4966–4970, http://dx.doi.org/10.1190/segam2013-1318.1.
University of Texas at Dallas, Takuji Mouri, Masashi Nakatsukasa, Mamoru Takanashi, Ayato Kato, JOGMEC
Summary
Introduction
Seismic monitoring provides valuable information regarding Figure 1: (top) Microbubble water injection schedule, (bottom)
time-varying changes in subsurface physical properties schematic diagram of acquisition geometry. Green shading
during hydrocarbon production, hydraulic fracturing, CO2 indicates the source depths, orange shading depths of receivers
deployed during both baseline and monitoring surveys, and
geosequestration, groundwater injection/withdrawal and yellow shading the depths of receivers deployed only during the
others. However, the resulting changes in subsurface baseline survey.
properties are often small both in terms of magnitude and
spatial extent, leading to seismic data differences that are
field data applications by estimating the P-wave velocity
difficult to detect at typical non-repeatable noise levels. In
changes that occur during the injection of microbubble water
order to better extract information from the time-lapse data,
into shallow sediments. Microbubble is a gas bubble of a
exploiting the full seismic waveform information can be
diameter less than 1 mm, and used for CO2 EOR (Klusman,
critical, since amplitude or traveltime changes may not be
2003), geosequestration (Koide and Xue, 2009), and ground
realiably detected. liquefaction mitigation (Kobayashi et al., 2010). We use a
cross-well geometry that can be useful when the near-surface
Full waveform inversion (FWI) represents a set of methods
condition is sub-optimal (e.g., due to strong weathering and
to find a subsurface model that fits waveforms based on the
scattering).
numerical solution of the wave equation, and is becoming
established as a new technique to estimate high-resolution
Data
velocity models (Virieux and Operto, 2009). Time-lapse
FWI is a favourable approach to fully utilise waveform
We injected total of 8000 m3 of water infused with air
information and to estimate small time-lapse velocity microbubbles into the unconsolidated Quaternary sediments
changes. However time-lapse inversion strongly depends on
in the Kanto basin of Japan over 74 hours at depths between
the knowledge of a velocity model, and the repeatability of
22 and 25 m from the surface (Figure 1). The fluvial
acquisition parameters and data noise (Kamei and Lumley,
sediments mostly consist of sands and clay, and the water
2017). Thus, developing a robust time-lapse FWI method
table is at an approximate depth of 13 m.
has been challenging. A variety of inversion strategies have
been proposed to mitigate the issues (e.g., Routh et al. 2012,
The injection was monitored by repeating cross-well seismic
Asnaashari et al., 2014, Kamei and Lumley 2017). In this
surveys using the geometry depicted in Figure 1b. A pre-
study, we demonstrate the potential of time-lapse FWI in
(a)
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
(b)
(c)
(d) (e)
Figure 2: (a) Observed baseline shot gathers shown for every other shot. (b) Predicted and (c) residual baseline shot gather after the baseline
FWI. (e-f) Baseline and monitoring observed same-level traces at the (c) 35-meter and (d) 24- meter depth. Monitoring 0 indicates the baseline
survey.
injection baseline cross-well survey was conducted 10 days and 4 kHz at depths between 13 and 50 m at 1-m intervals.
before the injection, and monitoring cross-well surveys were Note that 1 and 4 kHz sources were employed during the
conducted 14 times during and shortly after the injection baseline survey only. A source encoding technique was used
(Figure 1a). The distance between source and receiver wells to reduce the acquisition time; an array of six piezoelectric
is 64.4 m, and the injection well is located at a horizontal transducers simultaneously excited six different
distance of 24 m from the source well. Hydrophones were pseudorandom source wavelets with amplitude modulation.
placed in 1-m intervals at depths between 13 and 50 m
during the baseline survey, and at depths between 13 and 36 The acquired baseline and monitoring data exhibit high
m during the monitoring surveys. Pseudo-random sources signal-to-noise ratio. We display the recorded baseline shot
(Takanashi et al., 2016) were used to achieve good source gathers for the 1-kHz sources in Figure 2a after application
repeatability, and were excited at three frequencies at 1, 2 of a bandpass filter between 100 and 1050 Hz. We can
Figure 5: FWI-estimated velocity change at (a) 26, (b) 28, and (c) Figure 6: Predicted and observed shot gathers at 28 hour after the
74 hours after the start of the injection. start of the injection for a source located at a depth of 26 m.
Predicted waveforms are computed from (top) the starting model
Asnaashari et al., 2014, Kamei and Lumley 2017). After for time-lapse FWI, and (bottom) the final time-lapse FWI velocity
applying similar data preconditioning, the 2 kHz baseline model.
and monitoring data are inverted for P-wave velocity,
starting from the FWI-estimated baseline velocity model. disagreement in both traveltime and amplitudes (Figure 6a).
The inversion runs are conducted in parallel for each data After time-lapse FWI, we observe clear improvement in
set, and the time-lapse velocity changes are estimated by waveform fitting (Figure 6b): The predicted and observed
subtracting the updated baseline FWI model from the waveforms nearly perfectly fit each other, indicating the
monitoring FWI models. high reliability of the inversion results.
REFERENCES
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Asnaashari, A., R. Brossier, S. Garambois, F. Audebert, P. Thore, and J. Virieux, 2014, Time-lapse
seismic imaging using regularized full-waveform inversion with a prior model: Which strategy?:
Geophysical Prospecting, 63, 78–98, http://doi.org/10.1111/1365-2478.12176.
Jang, U. J., and D. Lumley, 2015, Full waveform inversion comparison of conventional and broadband
marine seismic streamer data, NW Shelf Australia: 24th International Geophysical Conference
and Exhibition, Perth, Australia, ASEG-PESA.
Kamei, R., and D. Lumley, 2017, Full waveform inversion of repeating seismic events to estimate time-
lapse velocity changes: Geophysical Journal International, Accepted.
Kamei, R., R. G. Pratt, and T. Tsuji, 2014, On misfit functions for Laplace-Fourier waveform inversion,
with applications to wide-angle OBS data: Geophysical Prospecting, 62, 1054–1074,
http://doi.org/10.1111/1365-2478.12127.
Kobayashi, M., N. Suemasa, T. Katada, and K. Nagano, 2010, Feasibility study on countermeasure
against liquefaction using micro bubble: Proceedings of the 12th International Offshore and Polar
Engineering Conference, Beijing, China.
Koide, H., and Z. Xue, 2009, Carbon microbubbles sequestration: A novel technology for stable
underground emplacement of greenhouse gases into wide variety of saline aquifers, fractured
rocks and tight reservoirs: Energy Procedia, 1, 3655–3662,
http://doi.org/10.1016/j.egypro.2009.02.162.
Klusman, R. W., 2003, Evaluation of leakage potential from a carbon dioxide EOR/sequestration project,
Energy Conversation and Management, 44, 1921–1940, http://doi.org/10.1016/S0196-
8904(02)00226-1.
Routh, P., G. Palacharla, I. Chikichev, and S. Lazaratos, 2012, Full wavefield inversion of time-lapse data
for improving imaging and reservoir characterization: 82nd Annual International Meeting, SEG,
Expanded Abstracts, http://doi.org/10.1190/segam2012-1043.1.
Shragge, J., and D. Lumley, 2013, Time-lapse wave-equation migration velocity analysis: Geophysics,
78, no. 2, S69–S79, http://doi.org/10.1190/geo2012-0182.1.
Takanashi, M., Y. Nakamura, M. Nakatsukasa, and J. Sakakibara, 2016, Crosstalk-free simultaneous
acquisition by arbitrary sweeps with amplitude modulation: 78th Annual International Conference
and Exhibition, EAGE, Extended Abstracts, http://doi.org/10.3997/2214-4609.201601412.
Virieux, J., and S. Operto, 2009, An overview of full-waveform inversion in exploration geophysics:
Geophysics, 74, no. 6, WCC1–WCC26, http://doi.org/10.1190/1.3238367
Princeton University
Introduction
Despite its high resolution (Virieux et al., 2009), full Figure 1: An schematic illustration of the proposed localized
waveform inversion requires repeated full wavefield inversion. Black star denotes the physical source (air-gun). Grey
modeling and it is thus computationally expensive to invert and yellow triangles denote physical and virtual receivers,
for a large area. Computational cost can be particularly respectively. The Dashed square represents the absorbing boundary
and the solid square represents the source injection boundary.
more severe for time-lapse surveys in exploration
seismology, which require real-time model estimations on a
regular basis (daily or weekly). Since the oil/gas production Method – Wavefield injection and extrapolation
does not change the overall substructures, there should be
much redundancy in conventional full-model waveform For the source side, wavefield injection method
inversions. In this study, we propose a methodology that (Robertsson et al., 2000; Borisov et al., 2015; Masson et al.,
allows to locally perform waveform inversions in the 2017; Yuan et al., 2017) reconstructs a source expression
region disconnected from both source and receiver arrays. on the surface of target regions, which serves as a localized
forward modeling scheme and allows efficient calculation
Fig. 1 shows how our methodology works. We have a of synthetic seismograms after model alterations within a
physical source (array) and a receiver array on the Earth's certain area. Wavefield injection is essentially a hybrid
surface or ocean bottom. Instead of performing inversion approach, where we impose an artificial boundary
inside the whole model region, we would like to focus on a enclosing the target regions and divide the whole
subvolume in V+, where the model changes are expected to computational domain into two subdomains V+ and V- (Fig.
occur. All forward and backward simulations will be 1). Given a baseline (background) model of the full region,
conducted within the reduced absorbing boundary (the we can calculate wavefield along the injection boundary
dashed square shown in Fig. 1). Such a strategy of localized (Fig. 1, solid square) for each physical source and store
waveform inversion requires two levels of wavefield them (both traction and velocity components). The stored
reconstructions: i) representation of seismic sources in the wavefield will serve as new source distributions (boundary
subdomain (wavefield injection); and ii) extrapolation of conditions) for subsequent localized forward simulations.
residual wavefield between baseline and monitor surveys The only missing part of the reconstructed wavefield using
injection method is the higher-order interactions between model inversion, the localized inversion retrieves the
the scattered wavefield caused by the altered model within velocity anomalies with a higher accuracy after the same
target area and the unaltered baseline model outside the 100 iterations. The corresponding 1D velocity profiles are
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
reduced absorbing boundary. To maintaining all shown in Fig. 2e-f. Fig. 3 shows the vertical velocity
interactions, the so-called exact boundary condition should components of initial and final residuals from one shot
be applied (van Manen et al., 2007; Malcolm et al., 2016; gather. The residuals of the major events are better fitted by
Masson et al., 2017). However, it is more computationally the localized inversion (Fig. 3d) in comparison to the final
demanding and requires more accuracy of the baseline full-model inversion residuals (Fig. 3b). We specially plot
model since the higher-order scattered wavefield are more the theoretical initial residuals (Fig. 3e) in case that we
sensitive to the model error. Besides, as we show in directly record wavefield at the same positions as the
following numerical examples, those higher-order, long- extrapolated virtual receivers. The residual difference
range interactions may impose limited effect on the final between Fig. 3c and 3e is shown in Fig. 3f. This difference
inversion results. Considering the facts above, we stick to essentially represents the spurious events resulting from
the wavefield injection without implementing exact wavefield extrapolation. We notice that the localized
boundary condition. inversion does not ``see'' those spurious events when
comparing it with final residuals of the localized inversion
For the receiver side, classical wavefield extrapolation (Fig. 3d and 3f). We also calculated the inverted model
(redatuming) from physical to virtual receivers in the errors as a function of iterations (Fig. 4), showing that
subsurface is needed in our methods to perform localized localized inversions converge faster. Hence, the localized
waveform inversion. We extrapolate residual wavefield inversions require fewer iterations to obtain models with
between baseline and monitor in frequency domain using the same accuracy as full-model inversions. This will
correlation-type representation theorem (Wapenaar et al., further enhance the speed-up and efficiency.
2006; Ravasi et al., 2014), which is similar to reverse time
migration. The Green's functions between physical and Marmousi model - Vp
virtual receivers, serving as back-propagators, are 1
calculated from two-way wave equations and stored just full−model
localised
once using the baseline model. Both truncated integral due
0.8
Normalised model error
profile of the true time-lapse velocity perturbations is proposed strategy reveals that putting virtual sources and
shown in Fig. 5b. receivers in the vicinity of target region contribute to
improving the accuracy and robustness of waveform
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
The 3D Vp perturbations obtained from full-model and inversion. Due to the reduced modeling size in localized
localized inversions under true baseline model are shown in inversions, it not only takes much less time for a single
Fig. 5c-d, respectively. The corresponding 2D profiles are iteration, but also converges much faster. The severe
shown in Fig. 5e-f. This is the preliminary 3D result and we computational cost of 3D elastic full waveform inversion
may notice some inversion errors (high velocity anomalies during practical time-lapse surveys can be reduced to a
in Fig. 5f), which are probably due to the inaccurate more acceptable and economic level. Furthermore, it also
extrapolated wavefield. However, compared to the indicates the potentiality in determining high-resolution
conventional full-model inversion, the localized inversion elastic imaging of reservoir by inverting higher frequencies
still retrieves the velocity anomalies with a higher accuracy (above 30~Hz) at relatively low computational cost.
after the same 30 iterations.
Acknowledgments
Conclusions
The study was carried out as a part of the Paris Exploration
We combine wavefield injection and extrapolation to Geophysics Group project (GPX) funded by the French
perform the localized waveform inversion for time-lapse National Research Agency (ANR), CGG, TOTAL and
seismic surveys. It shows that the localized inversion can Schlumberger. A significant part of the calculations in this
enhance the image quality of local time-lapse variation work was performed on S-CAPAD at Institut de Physique
while reducing the computational cost to a large extent. The du Globe de Paris.
Depth (km)
Depth (km)
Depth (km)
1 1
3500 0 1.2 1.2
1.5
1.4 −50 1.4 1.4
2500
2
Local perturbations −100 1.6 1.6
1500 1.8 Iteration: 100
2.5 −150
1.25 3.75 6.25 (m/s) 3 4 5 6 (m/s) −200 0 200 −200 0 200
X direction (km) X direction (km) P−wave velocity (m/s) P−wave velocity (m/s)
True time−lapse perturbation − Vp Localized inversion − Vp (error:16.3%) 1D perturbation − #1 1D perturbation − #2
(b) 150 (d) 150 (f) (h)
0.8 true 0.8 true
0.6 100 0.6 100 inv inv
#1 #2
1 1
50 50
Depth (km)
Depth (km)
Depth (km)
Depth (km)
1 1
0 0 1.2 1.2
Figure 2: Time-lapse Vp model and inverted results. (a) The baseline Vp model selected from the true Marmousi model and the acquisition
geometry. Red star and yellow triangles denote physical source and receivers, respectively. White triangles denote the extrapolated virtual
receivers. Two arrows point to the locations where time-lapse perturbation occurs. (b) The zooming true time-lapse Vp perturbations within the
dashed square in (a). (c) Full-model inversion result. (d) Localized inversion result. The 1D true and inverted velocity perturbations (#1 and #2)
in (c) and (d) are shown in (e, g) and (f, h), respectively.
Time (ms)
Time (ms)
2000 0 2000 0 2000 0
Time (ms)
Time (ms)
Time (ms)
Figure 3: Vertical component of inversion residuals from one shot example under true baseline Marmousi model. (a-b) Initial and final (after 100
iterations) residuals using full-model inversion. (c-d) Initial and final (after 100 iterations) residuals using localised inversion. (e) Initial residuals
of localised inversion using the true wavefield directly observed at virtual receiver positions without extrapolation. (f) The residual difference
between (c) and (e).
Iteration: 30
Iteration: 30
Figure 5: 3D SEG/EAGE overthrust Vp model and inverted results. (a) The baseline Vp model. (b) The 2D profile of true time-lapse Vp
perturbations. (c) Full-model inversion result. (d) Localized inversion result. 2D profiles of inverted velocity perturbations in (c) and (d) are
shown in (e) and (f), respectively.
REFERENCES
Borisov, D., S. C. Singh, and N. Fuji, 2015, An efficient method of 3-D elastic full waveform inversion
using a finite-difference injection method for time-lapse imaging: Geophysical Journal
International, 202, 1908–1922, https://doi.org/10.1093/gji/ggv268.
Malcolm, A., and B. Willemsen, 2016, Rapid 4D FWI using a local wave solver: The Leading Edge, 35,
1053–1059, https://doi.org/10.1190/tle35121053.1.
Masson, Y., and B. Romanowicz, 2017, Fast computation of synthetic seismograms within a medium
containing remote localized perturbations: A numerical solution to the scattering problem:
Geophysical Journal International, 208, 674–692, https://doi.org/10.1093/gji/ggw412.
Ravasi, M., and A. Curtis, 2013, Elastic imaging with exact wavefield extrapolation for application to
ocean-bottom 4C seismic data: Geophysics, 78, no. 6, S265–S284,
https://doi.org/10.1190/geo2013-0152.1.
Robertsson, J. O., and C. H. Chapman, 2000, An efficient method for calculating finite-difference
seismograms after model alterations: Geophysics, 65, 907–918,
https://doi.org/10.1190/1.1444787.
van Manen, D. J., J. O. Robertsson, and A. Curtis, 2007, Exact wave field simulation for finite-volume
scattering problems: The Journal of the Acoustical Society of America, 122, EL115–EL121,
https://doi.org/10.1121/1.2771371.
Virieux, J., and S. Operto, 2009, An overview of full-waveform inversion in exploration geophysics:
Geophysics, 74, no. 6, WCC1–WCC26, https://doi.org/10.1190/1.3238367.
Wapenaar, K., and J. Fokkema, 2006, Green’s function representations for seismic interferometry:
Geophysics, 71, no. 4, SI33–SI46, https://doi.org/10.1190/1.2213955.
Yuan, S., N. Fuji, S. Singh, and D. Borisov, 2017, Localised time-lapse elastic waveform inversion using
wavefield injection and extrapolation: 2D parametric studies: Geophysical Journal International,
209, 1699–1717, https://doi.org/10.1093/gji/ggx118.
Conventional full waveform inversion (FWI) has been exten- fer to this method as reconstructed full waveform inversion
sively applied to real seismic data and has successfully gener- with the extended source (RFWI). Having simulated the for-
ated high-fidelity earth models for better seismic imaging and ward modeled data and source wavefield, conventional FWI
structural interpretation. Considering the nonlinearity and ill- searches for earth models such that the synthetic data have the
posedness of the problem for conventional FWI, the success best match to the field data in the least-squares sense. RFWI
in providing reliable updated models heavily relies on the ac- optimizes over earth models and the source wavefield jointly to
curacy of the initial models and low frequency contents in the minimize the data misfit subject to the wavefield being consis-
acquired data. tent with the wave equation in an `2 sense. By reconstructing
a better source wavefield from the extended source instead of
To relax the requirements of good initial models and adequate the original source signature, RFWI relaxes the severe require-
low frequencies, we propose a novel approach to time do- ments for FWI and provides more reliable inverted models.
main reconstructed full waveform inversion with the extended
source (RFWI). RFWI relaxes the constraint that the forward By including the source wavefield as an additional parameter
modeled data exactly solve the wave equation as in conven- to the search space, RFWI adds the wave equation error as a
tional FWI, and instead uses an `2 approximate solution. RFWI penalty term to the original data misfit in conventional FWI
estimates earth models and jointly reconstructs an extended and formulates a joint minimization problem. We reconstruct
source by minimizing an objective function that penalizes the the source wavefield and estimate the earth models in an al-
wave equation error while fitting the data. RFWI extends the ternating fashion. We first reconstruct the extended source by
solution space and therefore overcomes some of the problems minimizing the wave equation error together with the data mis-
with local minima that prevent conventional FWI from obtain- fit. It is estimated from solving the normal equation which is
ing a reliable solution with an inaccurate starting model and/or equivalent to the least-squares solution. The source wavefield
insufficient low-frequency data . is then reconstructed from forward propagation of the extended
source and the receiver wavefield is reconstructed from another
backward propagation. With the reconstructed wavefields and
INTRODUCTION the extended source, models are updated with a gradient based
optimization method and inverted models are used for recon-
Conventional FWI has been an essential tool to build high- structing another extended source and wavefields at next it-
fidelity earth models by minimizing the misfit between the eration. Time-domain derivation and implementation differ-
acquired and forward modeled data (Lailly, 1983; Tarantola, entiate RFWI from other previous works that are related to
1984; Virieux and Operto, 2009). This least-squares problem wavefield reconstruction inversion or source extension in the
has been implemented in both the time and frequency domain frequency domain (van Leeuwen and Hermann, 2013; Huang
(Sirgue and Pratt, 2004; Wang et al., 2013). However, it is a et al., 2016) and provides a more suitable solver for processing
highly nonlinear, ill-posed inversion problem and mitigating 3D large-scale production data sets.
convergence to local minima is a severe challenge. The great-
est limitation of FWI that affects the success in generating re- By expanding the search space, RFWI reconstructs the for-
liable solutions for large-scale production jobs is the critical ward modeled data to better fit the field data and avoid cycle
requirement of low-frequency data coupled with good starting skips. Therefore it mitigates some of the problems with lo-
models. cal minima with inaccurate initial models and/or inadequate
low-frequency contents that limit the success of FWI. While
Over the last decade, various effort has been made to miti- FWI usually relies on diving waves, RFWI takes advantages
gate the problems of local minima and many alternative meth- of reflected waves from wavefield reconstruction and produces
ods have been proposed (Shen and Symes, 2008; van Leeuwen deeper model updates. It also compensates the wavefield er-
and Hermann, 2013; Biondi and Almomin, 2014; Warner and rors that relates to the acoustic assumptions and approxima-
Guasch, 2016; Wang et al., 2016; Huang et al., 2016). All these tions during the wave propagation. From the observations of
previous works and our proposed method in this paper aim to both synthetic and field examples, RFWI demonstrates more
avoid convergence to local minima by adding additional pa- advantages in areas with sharp velocity contrasts, especially
rameters to the models and expanding the solution space. with the presence of the salt.
We now focus on the time-domain method and implementation This paper first presents the theory and methodology for 3D
using finite difference scheme. To compute the misfit function time domain RFWI. It also discusses the differences and simi-
for time-domain FWI, the conventional forward modeled data larities between conventional FWI and RFWI. The benefits of
are extracted from the source wavefield generated by solving RFWI over FWI are demonstrated using a 2D synthetic exam-
the forward wave equation exactly with the given source sig- ple. Finally, the applicability of RFWI on field data is illus-
nature. Our reconstructed forward modeled data are obtained trated on both 2D and 3D streamer data from offshore Mexico.
(6)
the wave operator or D’Alembert operator, u is the forward
propagated or source wavefield, and f is the source signature. where
Let S[m] denote the solution operator of the forward propa- δ˜f = S∗ P∗ (d0 − PS f )/λ 2 . (7)
gated wave equation (1). At each iteration, conventional FWI
solves the wave equation exactly with the given source and Our next step is to minimize the above objective function (4)
the current model to obtain the source wavefield u = S[m] f . w.r.t. m using the reconstructed g̃. Note that we can now re-
The objective function for conventional FWI, which uses the construct the forward source wavefield ũ = Sg̃. Then we need
`2 norm of the difference between the acquired field data and an explicit form of the wave operator and let’s consider the the
simulated forward modeled data and depends on the model m following operator for a VTI system of two coupled second-
only, is defined as order partial differential equations in terms of P-wave vertical
velocity v, Thomsen parameters, ε and δ , assuming a constant
1 density and zero shear velocity
J[m] = k PS[m] f − d0 k22 . (2)
2 2
1 1 + 2ε 1 ∂x + ∂y2 0
Here P is the restriction operator (a projection) that records the 2[v] , 2 ∂t 2 − 2 , (8)
source wavefield u at the receiver locations and d0 is the field v 1 + 2δ 1 0 ∂z
data. where v is the velocity model that will be inverted for, while
Unlike conventional FWI, the idea of RFWI is to relax the con- anisotropy parameters are fixed. The velocity model v then can
straint that u be an exact solution of the wave equation to an `2 be updated using a conjugate gradient method and the gradient
approximation, by adding the wave equation error as a penalty for the objective function (4) w.r.t. v can be calculated using
term. Thus a new penalized objective function depending on ∇v Jλ [g(v), v] ≈ ∇v Jλ [g, v]
both the source wavefield u and the model m is introduced as
2
1 λ2 = − 3 ∂t2 Sg, S∗ P∗ d0 − S∗ P∗ PSg
J¯λ [u, m] = k Pu − d0 k22 + k 2[m]u − f k22 , (3) v
2 2
2
where λ is a penalty factor that requires to be updated during ≈ − 3 ∂t2 Sg̃, S∗ P∗ d0 − S∗ P∗ PSg̃
v
the inversion. Source wavefield u should be forward going 2
2λ 2 ˜f
and therefore in the range of S, i.e. u = S[m]g for some g. We ≈ − ∂ ũ, δ
refer to g as the extended or reconstructed source since it varies v3 t
with space as well as with time for each shot and it is used to 2
+ 3 ∂t2 S f , S∗ P∗ PSδ˜f .
reconstruct the source wavefield u. Note then that 2[m]u = g v
and so the objective function with the extended source g and
This gradient computation can be easily extended to more gen-
the model m can be redefined as
eral wave equations and used to perform multi-parameter in-
1 λ2 version for velocity and other earth models, such as anisotropy
Jλ [g, m] = k PS[m]g − d0 k22 + k g − f k22 . (4)
2 2 parameters, attenuation quality factor, and/or density, either si-
To make this joint minimization problem computationally fea- multaneously or sequentially. When the penalty factor λ is
sible, we solve g and m in an alternating fashion. We first large enough, RFWI and conventional FWI converge to very
minimize the above objective function w.r.t. g using the cur- similar results. Thus the penalty factor has to be chosen care-
rent model. Since the data mismatch will be built into the fully to make RFWI produce favorable results and it varies
extended source, the reconstructed wavefield better matches with iterations. The second term in the gradient computation
the true wavefield for both reflections and refractions, which may be ignored for certain λ to reduce the computation cost.
mitigates the issues of cycle skipping associated with conven-
tional FWI. The least-squares problem for reconstructing the
extended source g is equivalent to solving the following nor- SYNTHETIC EXAMPLE
mal equation
We first demonstrate the advantage of RFWI by applying it to a
S∗ P∗ PSg + λ 2 g = S∗ P∗ d0 + λ 2 f . 2D synthetic data set and draw a comparison with conventional
Now assume that the extended source g is the solution of the FWI. The true model is a modified SMAART Pluto synthetic
normal equation and write g as a perturbation of f salt model as shown in Figure 1(a), which was used to generate
the field data set that has 250 shot gathers with a shot spacing
g = f +δ f, of 20 m. Each shot gather contains 600 receivers with an inter-
val of 20 m. The lowest frequency used for inversion was 4 Hz
where
and maximum offset was 6000 m. The initial velocity model
δ f = S∗ P∗ d0 /λ 2 − S∗ P∗ PS f /λ 2 + O(1/λ 4 ), (5) is displayed in Figure 1(b), which is a smoothed version of
(a) Initial
(c) Inverted model from conventional FWI (d) Inverted model from RFWI
the true model. If we compare the inverted models from 87 (b) RFWI inverted
iterations of conventional FWI in Figure 1(c) and 54 iterations
from RFWI in Figure 1(d), we notice that RFWI recovers a Figure 2: Velocity models
more detailed results with faster convergence, especially for
the salt body and sub-salt area.
FIELD EXAMPLES
(a) Initial
ACKNOWLEDGEMENTS
CONCLUSION
We would like to thank SMAART for providing the pluto syn-
We presented the methods and applications of our proposed thetic model. The Campeche reimaging program data was re-
novel inversion method - time domain RFWI. RFWI helps processed and reimaged by ION in partnership with Schlum-
avoid cycle skipping issues to overcome some of the prob- berger, who holds data licensing rights. We also thank ION for
lems with local minima and relaxes the requirements for con- permission to publish the results and our colleagues for pro-
ventional FWI. It demonstrates more advantages in areas with viding valuable discussion and support.
REFERENCES
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Biondi, B., and A. Almomin, 2014, Simultaneous inversion of full data bandwidth by tomographic full
waveform inversion: Geophysics, 79, no. 3, WA129–WA140, https://doi.org/10.1190/geo2013-
0340.1.
Huang, H., W. Symes, and R. Nammour, 2016, Matched source waveform inversion: Volume extension:
86th Annual International Meeting, SEG, Expanded Abstracts, 1364–1368,
https://doi.org/10.1190/segam2016-13954250.1.
Lailly, P., 1983, The seismic inverse problem as a sequence of before-stack migrations, In: Bednar, J., ed.,
Conference on Inverse Scattering: Theory and Applications: Society for Industrial and Applied
Mathematics, 206–220.
van Leeuwen, T., and F. Hermann, 2013, Mitigating local minima in full-waveform inversion by
expanding the search space: Geophysical Journal International, 195, 661–667,
https://doi.org/10.1093/gji/ggt258.
Shen, P., and W. Symes, 2008, Automatic velocity analysis via shot profile migration: Geophysics, 73,
no. 5, VE49–VE59, https://doi.org/10.1190/1.2972021.
Sirgue, L., and R. Pratt, 2004, Efficient waveform inversion and imaging: A strategy for selecting
temporal frequencies: Geophysics, 69, 231–248, https://doi.org/10.1190/1.1649391.
Tarantola, A., 1984, Inversion of seismic reflection data in the acoustic approximation: Geophysics, 49,
1259–1266, https://doi.org/10.1190/1.1441754.
Virieux, J., and S. Operto, 2009, An overview of full-waveform inversion in exploration geophysics:
Geophysics, 74, 127–152, https://doi.org/10.1190/1.3238367.
Wang, C., D. Yingst, J. Bai, J. Leveille, P. Farmer, and J. Brittan, 2013, Waveform inversion including
well constraints, anisotropy, and attenuation: The Leading Edge, 32, 1056–1062,
https://doi.org/10.1190/tle32091056.1.
Wang, C., D. Yingst, P. Farmer, and J. Leveille, 2016, Full-waveform inversion with the reconstructed
wavefield method: 86th Annual International Meeting, SEG, Expanded Abstracts,
https://doi.org/10.1190/segam2016-13870317.1.
Warner, M. and L. Guasch, 2016, Adaptive waveform inversion: Theory: Geophysics, 81, no. 6, R429–
R445, https://doi.org/10.1190/segam2014-0371.1.
s(x, t )r (x, t ) H
1
al., 2014). However, the high vertical-wavenumber g t ( x) z ( s ( x, t )) H z ( r ( x, t )) dt
2
component that dominates the FWI gradient of reflection 0 , (1)
data has limited impact on the model kinematics. t max
s(x, t )r (x, t ) H
1
g m ( x) z ( s ( x, t )) H z ( r ( x, t )) dt
In the last few years, several methods have been proposed 2
0
to increase the significance of reflection data in the FWI
workflow, e.g., Xu et al. (2012), Tang et al. (2013), where s is the source wavefield, r is the back-propagated
Brossier et al. (2014), Alkhalifah et al. (2014), Irabor and residual wavefield, Hz represents the Hilbert transform in kz
Warner (2016), Vigh et al. (2016), and Ramos-Martinez et direction, and gt and gm are the tomographic and migration
al. (2016), among others. A common feature in all these terms, respectively.
methods is the extraction and/or enhancement of the low-
wavenumber component of the FWI gradient of reflection In order to produce the back-scattered energy necessary to
data. As shown by Mora (1989), reflection data produce generate the rabbit ears, a bootstrapping approach is used to
two different components in the FWI gradient: the high- estimate the location of the reflectors. More specifically, in
wavenumber component, also known as the migration term, the first iteration, the high-wavenumber component gm of
the gradient is used to estimate a density model that will susceptible to cycle-skipping, as the timing error normally
contain the necessary sharp contrasts. This is followed by a increases as we go further from the reference offset.
velocity update iteration, this time using the low-
wavenumber component gt of the gradient. These iterations It is clear that despite having the potential to extend the
are then alternated, meaning that both the background maximum update depth beyond that of diving-wave FWI,
velocity and reflector locations are sequentially updated, RFWI is also subject to its own restrictions, due to the
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
until a convergence criterion is reached. “tomographic” nature of the problem. In the following two
sections, some limitations of this approach are revisited,
Although the assumption that all reflection data are focusing on the contribution provided by deeper layers.
generated by density contrasts is not precise, the placement
of the reflectors is consistent with the current velocity
model; therefore, the traveltime information obtained with
the estimated density model can be used to infer kinematic
errors in the background velocity model.
the RFWI gradient (in red) provides good coverage of the strategies such as top-down inversion and regularization
horizontal wavenumbers (kx), but it is concentrated around can be considered.
the low vertical wavenumbers (kz).
Reflector depth uncertainty
In addition to the RFWI gradient, the dominant
wavenumbers (larger than -30 dB) of each anomaly in Another challenge faced by RFWI is the uncertainty
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Figure 2 are calculated and plotted in Figure 3. Comparing regarding the true reflector depth. Unlike conventional
the spectra, it is clear that Anomaly #1 is well aligned with diving-wave FWI, which only requires a smooth velocity,
the RFWI gradient, while the other two have many RFWI needs sharp contrasts in the model in order to
wavenumbers that are not sampled by the deep reflector. generate the backscattered energy that forms the rabbit
ears. Since the traveltimes depend on both velocity and
Figure 4 shows the RFWI result, starting from a constant reflector position, the non-linearity of the problem is
velocity of 2500 m/s, after a total of 35 iterations. As increased, i.e., RFWI can converge to incorrect velocities
expected from the wavenumber analysis, while horizontal and reflector depths that still match the traveltimes, just as
wavenumbers are well resolved, only the small vertical ray-based reflection tomography can.
wavenumbers are recovered by RFWI. This is sufficient for
Anomaly #1 but insufficient for Anomalies #2 and #3, This problem is illustrated in Figure 5, in which RFWI
which contain higher vertical wavenumbers. As a result, using only deep reflectors is performed with (Figure 5c)
Anomaly #1 is well resolved and the other two are smeared and without (Figure 5d) a priori information about the
vertically from the reflector location to the surface. reflector depths. Since the initial velocity error is large (up
However, a migration QC indicates that, for all three to 30%) and there are not enough events to fully constrain
anomalies, the kinematics of the velocity model are well the inversion, RFWI without a priori information converges
recovered at the reflector depth. to an alternative model that does not give the correct
stacked image, although it improves the flatness of the
migrated gathers (Figure 5h). On the other hand, with a
priori information about the reflector depth, RFWI
correctly recovers the wavenumbers sampled by the deep
reflector and is able to match the true image (Figure 5b) at
that depth.
In practice, the spectrum of the RFWI gradient can be Figure 5: Migrated image and velocity perturbation: a) Initial, b)
extended by the presence of additional reflectors at true model, c) RFWI with a priori information, d) RFWI without a
different depths and with varying dips (Alkhalifah, 2016). priori information. e), f), g), and h) are SOGs corresponding to
However, unlike tomographic methods based on residual models a), b), c), and d) respectively. The location of the gathers is
moveout, in which each sensitivity kernel — i.e., the indicated by the arrows.
sensitivity of the data residual to the model parameters —
is computed individually, the contribution of many kernels Although this velocity-depth ambiguity is well known in
is calculated simultaneously in RFWI. As a result, the migration velocity analysis (MVA) methods (Stork, 1992),
sensitivity kernel is more susceptible to the effects of imposing constraints in RFWI is less straightforward since
amplitude imbalance, such as poor illumination or low the contribution from many events is combined together.
reflectivity events. Ultimately, this can lead to an initial Therefore, for the moment, we recommend applying RFWI
dominance by stronger events, which can introduce a bias starting from a reasonably good initial model, in which the
towards certain wavenumbers. To alleviate this problem, location and focusing of the reflectors are not too damaged.
Real data example reflectivity shales between the shallow folds and the deep
events. However, the recovered wavenumbers are still able
Finally, we applied RFWI to a deep-water survey on the to significantly improve the kinematics throughout the
Mexican side of the Gulf of Mexico (GoM). The area of section, most notably at the Wilcox and Cretaceous but also
interest is located on the prolific Perdido fold belt. The in the shales, and the final velocity model has good
water bottom depth ranges from 200 m to 3500 m. The consistency with the geology.
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Figure 6: Vertical section with the velocity model overlaid on an RTM stack for: a) initial model, b) RFWI updated model, and c) RFWI velocity
perturbation. RTM surface offset gathers over the same line from: d) initial model, and e) RFWI updated model.
REFERENCES
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Alkhalifah, T., 2014, Scattering-angle based filtering of the waveform inversion gradients:
Geophysical Journal International, 200, 363–373, http://doi.org/10.1093/gji/ggu379.
Alkhalifah, T., 2016, Full-model wavenumber inversion: An emphasis on the appropriate
wavenumber continuation: Geophysics, 81, no. 3, R89-R98,
https://doi.org/10.1190/geo2015-0537.1.
Brossier, R., S. Operto, and J. Virieux, 2014, Velocity model building from seismic reflection
data by full-waveform inversion: Geophysical Prospecting, 63, 354-367,
https://doi.org/10.1111/1365-2478.12190.
Chazalnoel, N., A. Gomes, W. Zhao, and B. Wray, 2017, Revealing shallow and deep complex
geological features with FWI: Lessons learned: 79th Annual International Conference
and Exhibition, EAGE, Extended Abstracts, We-A3-02.
Dellinger, J., A.J. Brenders, J.R. Sandschaper, C. Regone, J. Etgen, I. Ahmed, and K.J. Lee,
2017, The Garden Banks model experience: The Leading Edge, 36, 151–158,
https://doi.org/10.1190/segam2015-5830959.1.
Guitton, A., 2014, On the velocity-density ambiguity in acoustic full-waveform inversion: 76th
Annual International Conference and Exhibition, EAGE, Extended Abstracts, We-E106-
03, https://doi.org/10.3997/2214-4609.20141082.
Irabor, K., M. and Warner, 2016, Reflection FWI: 86th Annual International Meeting, SEG,
Expanded Abstracts, 1136–1140, https://doi.org/10.1190/segam2016-13944219.1.
Liu, F., G. Zhang, S. Morton, and J. Leveille, 2011, An effective imaging condition for reverse-
time migration using wavefield decomposition: Geophysics, 76, no. 1, S29-S39,
https://doi.org/10.1190/1.3533914.
Mora, P., 1989, Inversion = migration + tomography: Geophysics, 54, 1575–1586,
https://doi.org/10.1190/1.1442625.
Qin, B., V. Prieux, H. Bi, A. Ratcliffe, J.P. Montel, D. Carotti, and G. Lambaré, 2014, Towards
high-frequency full waveform inversion - A case study: 76th Annual International
Conference and Exhibition, EAGE, Extended Abstracts, We-E106-07,
https://doi.org/10.3997/2214-4609.20141086.
Ramos-Martinez, J., N. Chemingui, S. Crawley, Z. Zou, A. Valenciano, and E. Klochikhina,
2016, A robust FWI gradient for high-resolution velocity model building: 86th Annual
International Meeting, SEG, Expanded Abstracts, 1258–1262,
https://doi.org/10.1190/segam2016-13872681.1.
Sirgue, L., and G. Pratt, 2004, Efficient waveform inversion and imaging: A strategy for
selecting temporal frequencies: Geophysics, 69, 231–248,
https://doi.org/10.1190/1.1649391.
Stork, C., 1992, Reflection tomography in the postmigrated domain: Geophysics, 57, 680–692,
https://doi.org/10.1190/1.1443282.
Vigh, D., K. Jiao, X. Cheng, D. Sun, W. Lewis, 2016, Earth-model building from shallow to
deep with full-waveform inversion: The Leading Edge, 35, 1025–1030,
https://doi.org/10.1190/tle35121025.1.
Virieux, J., and S. Operto, 2009, An overview of full-waveform inversion in exploration
geophysics: Geophysics, 74, no. 6, WCC1–WCC26, https://doi.org/10.1190/1.3238367.
Xu, S., D. Wang, F. Chen, Y. Zhang, and G. Lambaré, 2012, Full waveform inversion for
reflected seismic data: 74th Annual International Conference and Exhibition, EAGE,
Extended Abstracts, W024, https://doi.org/10.3997/2214-4609.20148725.
Zhou, W., 2016, Velocity model building by full waveform inversion of early arrivals &
reflections and case study with gas cloud effect: Ph.D. thesis, Grenoble Alpes University.
One of the most challenging tasks for full waveform inver- 1989). For data with sufficient transmissions (such as long-
sion (FWI) is to construct background velocity models with offset data with rich diving and refracted energy), the low-
reflections, especially in deep regions. To accomplish this wavenumber components dominate this gradient and, thus, con-
task, we describe a reflection-based FWI algorithm to robustly ventional LS-FWI or inversion with other alternative objec-
build kinematically correct velocity models with reflected en- tives can effectively build kinematically correct low-wavenumber
ergy. This approach decomposes a subsurface model into a models, especially for shallow regions well illuminated by trans-
smooth background that is updated by means of minimiz- mitted energy (Luo and Schuster, 1991; Ravaut et al., 2004;
ing a kinematics-oriented objective function, and a rough re- Vigh and Starr, 2008; Sirgue et al., 2009; Vigh et al., 2011;
flectivity that is computed through a migration at the cur- Jiao et al., 2015). However, for data dominated by reflections,
rent background. Based on this model decomposition strat- the high-wavenumber components dominate, and conventional
egy and the Born modeling, we can explicitly compute the data-fitting-based inversion is not amenable to providing low-
low-wavenumber gradient components based on reflections, wavenumber updates, especially in deep regions.
which cannot be achieved with conventional approaches. To
guarantee that these low-wavenumber components contribute Among various strategies to promote low-wavenumber updates
to updating the background in proper directions, an adjustive during inversion, we adopt the model decomposition strategy
objective function is employed to robustly identify the kine- seen in some data domain approaches (Zhang et al., 2011; Xu
matic discrepancies between the predicted and observed re- et al., 2012; Ma and Hale, 2013; Brossier et al., 2014) and in
flections. More specifically, our approach measures the move- various migration velocity analysis variants (Symes and Caraz-
out differences between the predicted and observed reflections zone, 1991; Mulder and ten Kroode, 2002; Sava and Biondi,
in terms of local traveltime shifts, and then updates the back- 2004; Symes, 2008; Biondi and Almomin, 2014) that decom-
ground model to reduce such moveout discrepancies. Numeri- pose the subsurface model into a smooth background and a
cal experiments with both synthetic and real data demonstrate rough reflectivity. With this model decomposition and the Born
the success of the proposed algorithm for robustly constructing approximation, the low-wavenumber background gradient can
kinematically correct models in complex geological settings. be explicitly computed based on reflections. The remaining
question is how to ensure that this low-wavenumber gradient
will contribute to updating the background model in correct
directions (or, in other words, to avoid cycle skips).
Introduction
To achieve this goal, a kinematics-driven objective function is
Conventional full waveform inversion (FWI) makes inferences desired. Because the moveouts of reflection events are mainly
about subsurface models from recorded seismograms by solv- driven by model kinematics, we adopt the adjustive objective
ing a nonlinear least-squares (LS) optimization problem: function discussed in our previous work (Jiao et al., 2015).
This objective measures the local traveltime differences be-
min 12 ∥p[m] − d∥2 , (1) tween predicted and observed reflections, and forms them as
m
a function of space and time. Such a local traveltime func-
where m stands for subsurface models, d denotes observed tion well quantifies the moveout discrepancies between pre-
seismograms, and p[m] is the prediction from wavefield sim- dicted and observed reflections. Hence, the proposed RFWI
ulations. Although FWI is capable of reconstructing high- algorithm updates the background model to minimize moveout
resolution models , the LS objective is very ill-conditioned and discrepancies dominated by model kinematic errors. Applica-
has many spurious local minima for typical seismic data that tions of the new approach to both synthetic and real data sets
has limited offset and lack usable very-low-frequency energy. demonstrate that the proposed algorithm can effectively recon-
To be successful, conventional FWI usually requires a kine- struct kinematically correct background models with reflected
matically appropriate initial model, or that the recorded data energy in complex geophysical settings.
contains sufficient transmitted energy that illuminates the tar-
get regions (usually in the shallow part of a model). In a pre-
vious study (Sun et al., 2016), we proposed a reflection-based Method
full waveform inversion (RFWI) algorithm that used a data-
domain differential semblance optimization to build kinemati- This section introduces the proposed RFWI strategy and its
cally correct models using reflected energy, especially for deep main components. Here, we consider that the observed data d
regions that cannot be illuminated by transmitted energy. This mainly consists of reflections, and assume that the model m is
work further improves the robustness of our RFWI process by composed of a smooth low-wavenumber component v (back-
adopting the adjustive objective (Jiao et al., 2015) instead of ground model) and a rough high-wavenumber component r
the data-domain differential semblance objective.
Figure 1: Results for model with 20% lower background velocity: (a) predicted and observed reflections; (b) computed local travel-
time shifts; (c) RFWI gradient (indicating positive updating direction). Results for model with 20% higher background velocity: (d)
predicted and observed reflections; (e) computed local traveltime shifts; (f) RFWI gradient (indicating negative updating direction).
(reflectivity), such that Among them, the adjoint of the derivative operator Dδ p ∆ϕ can
be written as:
m = v(1 + r). { [ ]}
( )T H [δ p] qδ p
D δ p ∆ϕ q = q + H ,
Then, the predicted reflection δ p is computed through the Born E(x,t)2 E(x,t)2
modeling (or de-migration) procedure defined as:
where H [·] stands for Hilbert transform, E(x,t) denotes the
δ p[v, r] := F[v]r, envelope of δ p(x,t), and q indicates any vector in the range of
( )T
Dδ p ∆ϕ . To compute gradient (4), we take
where F[v] denotes the Born operator at background model v.
The inverse problem we want to solve is: ( )T
q = D∆ϕ ∆T ∆T.
Given observed reflections d,
(2)
find v, r so that F[v]r is close to d. Note that, in gradient (4), the operator (Dv δ p)T explicitly gen-
erates the reflection-based low-wavenumber components, and,
( )T
Note that, in this work, for the sake of computational effi- D∆ϕ ∆T ∆T dictates the updating direction.
ciency, we compute r[v] through standard reverse time migra-
tion (RTM) instead of LS-RTM, i.e., r[v] := F[v]T d.
Numerical examples
The proposed RFWI updates v through minimizing the trav-
eltime shift between the Born prediction and observed reflec- The first example is based on a simple 8 by 15 km velocity
tions. In time domain, we view this traveltime shift as a local model with a constant background velocity at 2.28 km/s and
attribute and, hence, a function of both space and time that can one horizontal reflector at a depth of 4 km. There are 188 shots
be translated into the corresponding unwrapped instantaneous and 751 receivers evenly distributed at a depth of 10 m. We
phase error indicating the local phase misalignment. This way, computed the true reflection d, the Born predictions δ p, and
the new optimization problem can be formulated as: the corresponding local traveltime shifts ∆T for two incorrect
models whose velocities are 20% lower than and 20% higher
min J[v] := 21 ∥∆T (x,t)∥2 = 12 ∥∆T [∆ϕ ](x,t)∥2 , (3) than the true background velocity, respectively. Figures 1(a)
v
and (d) demonstrate that the Born predictions δ p and d share
where ∆T and ∆ϕ are the local traveltime shift and instanta- the same arrival time at offset 0 and start to diverge along dif-
neous phase difference, respectively, between δ p and d. ferent moveouts as offset increases, which is driven by differ-
ent model kinematics. Such moveout discrepancies are quan-
Using the chain rule and standard adjoint state derivation, the
tified by local traveltime shifts ∆T between δ p and d (Figures
gradient of J[v] with respect to v can be computed through:
1(b) and (e)). Figures 1(c) and (f) show the RFWI gradients for
( )T ( )T the two incorrect models. Apparently, those gradients based on
∇J = (Dv δ p)T Dδ p ∆ϕ D∆ϕ ∆T ∆T, (4)
the proposed objective indicate desired updating directions.
where Dv δ p, Dδ p ∆ϕ , and D∆ϕ ∆T are the derivative opera-
( )T The second experiment is based on the Marmousi model (Fig-
tors of δ p, ∆ϕ , and ∆T , respectively. (Dv δ p)T , Dδ p ∆ϕ , ure 2(f)) with a fixed-spread acquisition geometry consisting
( )T
and D∆ϕ ∆T stand for the corresponding adjoint operators. of 151 shots and 301 receivers evenly distributed at a depth
of 10 m. The target synthetic data is generated with a Ricker
Figure 2: (a) initial model; (b) RFWI inverted model; (c) RFWI + LS-FWI inverted model; (d) reflectivity at initial model; (e)
reflectivity at RFWI inverted model; (f) true model; (g) Born prediction at initial model; (h) target shot; (i) Born prediction at RFWI
inverted model; (j) LS-FWI only inverted model.
wavelet with 30 Hz maximum frequency and an isotropic finite- data; the reconstructed model ensures a successful application
difference simulator that is different from the one used by our of conventional FWI to further improve model quality.
inversions. All of the following inversion tests run in a 3 to
18 Hz frequency band with 7 Hz as the dominant frequency. The third example is based on a wide-azimuth (WAZ) field
We first run the proposed RFWI starting from a 1D model (Fig- dataset with approximately 9.6 km maximum offset acquired
ure 2(a)). The target reflection-dominant data is extracted from in the Gulf of Mexico. The geology environment has exten-
the full synthetic data by muting out diving waves and refrac- sive salt/shale sheets with intervening deep-water sediment-
tions beyond water bottom reflections. Figure 2(h) plots one of filled mini-basins. This reflection-based inversion starts from
the target shot gathers and highlights the moveout of a major a model computed from an early-arrival adjustive FWI (Jiao
reflection with a green curve. After 20 RFWI iterations, the in- et al., 2015) that employed a simple 1D initial model and per-
verted background model (Figure 2(b)) already infers the cor- formed in a multi-scale manner with two frequency bands,
rect velocity trend. Figures 2(d) and (e) show the reflectivities i.e., 10 iterations at 4 Hz and 7 iterations at 6 Hz. After the
for the initial and RFWI inverted models, which demonstrate first stage inversion, the model updates were limited to a depth
a clear uplift of the reflectivity image due to the background around 2.7 km. To further build velocity in the deep parts of the
improvement. Figure 2(g) plots one of the shot gathers for the basins and beneath the salts, we exercised the proposed RFWI
initial model, delineates the moveout of a major reflection with with dominant frequency at 4.5 Hz. After 12 iterations, the
a red curve, and draws the target moveout curve in green; Fig- RFWI leads to promising mobile shale and subsalt improve-
ure 2(i) shows the shot gather for RFWI inverted model in the ments as demonstrated by comparing Figures 3(a) and (b) that
similar way as Figure 2(g) does. Clearly, RFWI greatly re- plot one inline of RTM images for the RFWI input and in-
duced the moveout discrepancies between the predicted and verted models. The overlaid velocities show deep updates in
target reflections. Starting from the RFWI inverted model, regions where transmitted energy cannot reach. Due to those
conventional LS-FWI further improves the accuracy and res- deep RFWI updates, the RTM image below the salt presents
olution of the reconstructed model; the inverted model after clear uplift in terms of reflector continuity and focusing, and
20 iterations of LS-FWI is plotted in Figure 2(c). On the con- thus the sub-salt structures become more prominent. What’s
trary, starting directly from the initial background model (Fig- more, Figures 4(a) and (b) present the Kirchhoff gathers for the
ure 2(a)), conventional LS-FWI stalls at an incorrect model RFWI input and inverted models near the location indicated by
(Figure 2(j)). As shown, the proposed algorithm effectively the yellow shaded zone in Figures 3(a) and (b). Clearly, RFWI
corrects the erroneous background using reflection-dominant improves the model kinematics.
Figure 3: RTM image and velocity overlay for: (a) RFWI input model; (b) RFWI inverted model (after 12 iterations)
Conclusions
Acknowledgements
REFERENCES
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Biondi, B., and A. Almomin, 2014, Simultaneous inversion of full data bandwidth by tomographic full-
waveform inversion: Geophysics, 79, no. 3, WA129–WA140, https://doi.org/10.1190/geo2013-
0340.1.
Brossier, R., S. Operto, and J. Virieux, 2014, Velocity model building from seismic reflection data by
full-waveform inversion: Geophysical Prospecting, 63, 354–367, https://doi.org/10.1111/1365-
2478.12190.
Jiao, K., D. Sun, X. Cheng, and D. Vigh, 2015, Adjustive full waveform inversion: 85th Annual
International Meeting, SEG, Expanded Abstracts, 1091–1095,
https://doi.org/10.1190/segam2015-5901541.1.
Luo, Y., and G. T. Schuster, 1991, Wave-equation traveltime inversion: Geophysics, 56, 645–653,
https://doi.org/10.1190/1.1443081.
Ma, Y., and D. Hale, 2013, Wave-equation reflection traveltime inversion with dynamic warping and full-
waveform inversion: Geophysics, 78, no. 6, R223–R233, https://doi.org/10.1190/geo2013-
0004.1.
Mora, P., 1989, Inversion = migration + tomography: Geophysics, 54, 1575–1586,
https://doi.org/10.1190/1.1442625.
Mulder, W. A., and A. P. E. ten Kroode, 2002, Automatic velocity analysis by differential semblance
optimization: Geophysics, 67, 1184–1191, https://doi.org/10.1190/1.1500380.
Ravaut, C., S. Operto, L. Improta, J. Virieux, A. Herrero, and P. Dell’Aversana, 2004, Multiscale imaging
of complex structures from multifold wide-aperture seismic data by frequency-domain full-
waveform tomography: Application to a thrust belt: Geophysical Journal International, 159,
1032–1056, https://doi.org/10.1111/j.1365-246X.2004.02442.x.
Sava, P., and B. Biondi, 2004, Wave-equation migration velocity analysis. I. theory: Geophysical
Prospecting, 52, 593–606, https://doi.org/10.1111/j.1365-2478.2004.00447.x.
Sirgue, L., O. I. Barkved, J. P. Gestel, O. J. Askim, and J. H. Kommedal, 2009, 3D waveform inversion
on Valhall wide-azimuth OBC: 71st Annual International Conference and Exhibition, EAGE,
Extended Abstracts, https://doi.org/10.3997/2214-4609.201400395.
Sun, D., K. Jiao, X. Cheng, and D. Vigh, 2016, Reflection based waveform inversion: 86th Annual
International Meeting, SEG, Expanded Abstracts, 1151–1156,
https://doi.org/10.1190/segam2016-13966097.1.
Symes, W. W., 2008, Migration velocity analysis and waveform inversion: Geophysical Prospecting, 56,
765–790, https://doi.org/10.1111/j.1365-2478.2008.00698.x.
Symes, W. W., and J. J. Carazzone, 1991, Velocity inversion by differential semblance optimization:
Geophysics, 56, 654–663, https://doi.org/10.1190/1.1443082.
Vigh, D. and E. W. Starr, 2008, 3D plane-wave full-waveform inversion: Geophysics, 73, no. 5, VE135–
VE144, https://doi.org/10.1190/1.2952623.
Vigh, D., J. Kapoor, N. Moldoveanu, and H. Li, 2011, Breakthrough acquisition and technologies for
subsalt imaging: Geophysics, 76, no. 5, WB41–WB51, https://doi.org/10.1190/geo2010-0399.1.
Xu, S., D. Wang, F. Chen, G. Lambare, Y. Zhang, 2012, Inversion on reflected seismic wave: 82nd
Annual International Meeting, SEG, Expanded Abstracts, 1–7,
https://doi.org/10.1190/segam2012-1473.1.
Denes Vigh*, Kun Jiao, Xin Cheng, Dong Sun and Lu Xin Zhang, Schlumberger WesternGeco
upgoing reflected receiver wavefield S2 R3 and the starting and the updated model (Figure 2b and Figure 2d)
correlation of the upgoing source-side reflected wavefield the first obvious conclusion is that FWI picked up the small
and medium scale high velocity carbonate carapaces and
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Figure 2: (a) Initial 1D velocity model vertical section, (b) Initial 1D velocity model depth slice at 2 km, (c) FWI updated velocity after 2
frequency band update in vertical section, (d) FWI updated velocity after 2 frequency band update depth slice at 2 km.
Figure 3: (a) Input model to reflection FWI with image overlay in the shale area, (b) Reflection FWI updated model with image overlay in the
shale area.
Figure 4: (a) Input model to reflection FWI with image overlay in the subsalt area, (b) Reflection FWI updated model with image overlay in the
subsalt area.
REFERENCES
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Brossier, R., S. Operto, and J. Virieux, 2014, Velocity model building from seismic reflection data by
full-waveform inversion: Geophysical Prospecting, 63, 354–367, http://doi.org/10.1111/1365-
2478.12190.
Jiao, K., D. Sun, X. Cheng, and D. Vigh, 2015, Adjustive full waveform inversion: 85th Annual
International Meeting, SEG, Expanded Abstracts, 1091–1095, http://doi.org/10.1190/segam2015-
5901541.1.
Xu, S., F. Chen, G. Lambaré, Y. Zhang, and D. Wang, 2012, Inversion on reflected seismic wave: 82nd
Annual International Meeting, SEG, Expanded Abstracts, 1–7, http://doi.org/10.1190/segam2012-
1473.1.
T
Gmig Gdu Gdu 0
u s , d ur , u u s ,u ur , d dt , (2a) Gmig/tomo
1
V
2u †
T 2u
u H z 2 H z u † dt , (7)
0 t
t
Ns , Nr 3 2
Ns , Nr
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
T
Gtomo Gdd Guu 0
us , d ur , d u s , u ur ,u dt , (2b) Where 1 for the migration-like kernel, and 1 for
Ns , Nr
the tomography-like kernel. If we set 0 , the kernel
where the subscripts s and r represent the source and becomes the conventional FWI kernel. The above kernels
receiver, respectively, and the subscripts u and d represent could also be obtained using other possible approaches such
the up- and down-going components. In the framework of as that based on the Fourier transform (Liu et al., 2011). The
adjoint-state method, the receiver wavefield is a result of the advantage of using Equation (7) is that its computational
misfit signals back-propagating from the receiver location as efficiency is higher than the others.
the adjoint source. Equation (2) shows that the migration
kernel is the cross-correlation result of the source and Figure 1 shows the difference between the migration-like
receiver wavefields propagating in different spatial and tomography-like kernels for a simple two-layer model.
directions, while the tomographic kernel results from those It illustrates that the high-wavenumber migration isochrone
wavefields propagating in the same spatial directions. (Figure 1a) and the low-wavenumber wavepath associated
with the direct waves and backscattering reflections (Figure
Separating Gdu Gdu from Gdd Guu is not a trivial task. 1b) can be separated from the full mixed kernel (Figure 1c)
We employ the approach of Fei et al. (2015). This approach accurately using Equation (7).
was originally developed for isolating only the cross-
correlation between the down-going source wavefield and
the up-going receiver wavefield for RTM:
T
I du 0
usur H z us H z ur
Ns , Nr (3)
us H z H t ur H z us H t ur dt ,
where H z and H t are Hilbert transforms in the depth and
(a) Migration-like kernel
time domains, respectively. The above imaging condition is
derived based on the so-called extended (or analytic)
wavefields in the depth and time domains. Analogously, the
cross-correlation between the up-going source wavefield
and the down-going receiver wavefield can be expressed as
T
I ud 0 usur H z us H z ur
Ns , Nr (4)
us H z H t ur H z us H t ur dt.
(b) Tomography-like kernel
Combining Equations (3) and (4) results in
T
I du I ud 2 us ur H z us H z ur dt. (5)
0
Ns , Nr
Similarly, we obtain
T
I dd I uu 2 us ur H z us H z ur dt. (6)
0
Ns , Nr
wavenumber model perturbation V . After a few number Figure 3a is the migration-like gradient, resulting in high-
of LSRTM iterations, we turn to the tomography stage using wavenumber model perturbation updates. The tomography-
Equation (7) with 1 to update the low-wavenumber like gradient of our LSRTM-guided FWI in Figure 3b
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
background velocity V0 above the reflectors of LSRTM. provides smooth velocity updates above the reflectors.
Consequently, the model perturbation and the background
Next, we test our LSRTM-guided FWI method on the
velocity are updated alternatively.
Marmousi model (Figure 4a). The grid interval for both the
horizontal and vertical directions is 12.5 m. We position 50
We use synthetic seismic data for a layered model and the
shots with a spatial interval of 100 m at the depth of 12.5 m
Marmousi model to validate our new FWI method.
and 400 receivers at all grid points from a distance of 0 to
5 km also at the depth of 12.5 m. A Ricker wavelet with a
Numerical Examples
center frequency of 15 Hz is used for the modeling and
inversion. The initial model is a Gaussian smoothed model
We first use a simple 2D synthetic example to show
(Figure 4b). To illustrate the effectiveness of our LSRTM-
migration-like and tomography-like gradients of our
guided FWI method, the inversion experiment is performed
LRSTM-guided FWI. Figure 2 depicts a four-layer model
without using any multi-scale strategy.
containing a 5% negative anomaly and a 5% positive
anomaly (125 m/s) in the second and third layer relative to
the first layer, respectively. The initial velocity model for
FWI is a homogeneous model with the velocity of the first
layer. The source wavelet is a Ricker time function with a
center frequency of 15 Hz.
(a) Migration-like gradient Because the initial model deviates substantially from the true
model and no multi-scale strategy is employed, the
conventional FWI produces the reasonable updates mainly
in the shallow region of the Marmousi model. The target area
bellow 1 km, or the anticline structures are not recovered, as
indicated by the inverted model shown in Figure 5a.
REFERENCES
Alkhalifah, T., and Z. Wu, 2016, The natural combination of full and image-based waveform inversion:
Geophysical Prospecting, 64, 19–30, http://doi.org/10.1111/1365-2478.12264.
Chen, T., and L. Huang, 2014, Elastic reverse-time migration with an excitation amplitude imaging
condition: 84th Annual International Meeting, SEG, Expanded Abstracts, 1868–1872,
https://doi.org/10.1190/segam2014-1548.1.
Chi, B., L. Dong, and Y. Liu, 2015, Correlation-based reflection full-waveform inversion: Geophysics,
80, no. 4, R189–R202, http://doi.org/10.1190/geo2014-0345.1.
Fei, T. W., Y. Luo, J. Yang, H. Liu, and F. Qin, 2015, Removing false images in reverse time migration:
The concept of de-primary: Geophysics, 80, no. 6, S237–S244, http://doi.org/10.1190/geo2015-
0289.1.
Liu, F., G. Zhang, S. Morton, and J. Leveille, 2011, An effective imaging condition for reverse-time
migration using wavefield decomposition: Geophysics, 76, no. 1, S29–S39,
http://doi.org/10.1190/1.3533914.
Mora, P., 1989, Inversion = migration + tomography: Geophysics, 54, 1575–1586,
http://doi.org/10.1190/1.1442625.
Shen, P., and U. Albertin, 2015, Up-down separation using Hilbert transformed source for causal imaging
condition: 85th Annual International Meeting, SEG, Expanded Abstracts, 4175–4179,
https://doi.org/10.1190/segam2015-5862960.1.
Tan, S., and L. Huang, 2014, Least-squares reverse-time migration with a wavefield-separation imaging
condition and updated source wavefields: Geophysics, 79, no. 5, S195–S205,
http://doi.org/10.1190/geo2014-0020.1.
Tang, Y., and S. Lee, 2013, Tomographically enhanced full wavefield inversion: 82nd Annual
International Meeting, SEG, Expanded Abstracts, 1037–1041,
https://doi.org/10.1190/segam2013-1145.1.
Tarantola, A., 1984, Inversion of seismic reflection data in the acoustic approximation: Geophysics, 49,
1259–1266, http://doi.org/10.1190/1.1441754.
Virieux, J., and S. Operto, 2009, An overview of full-waveform inversion in exploration geophysics:
Geophysics, 74, no. 6, WCC1–WCC26, http://doi.org/10.1190/1.3238367.
Wang, F., D. Donno, H. Chauris, H. Calandra, and F. Audebert, 2016, Waveform inversion based on
wavefield decomposition: Geophyscis, 81, no. 6, R457–R470, http://doi.org/10.1190/geo2015-
0340.1.
Xu, S., D. Wang, F. Chen, G. Lambare, and Y. Zhang, 2012, Inversion on reflected seismic wave: 82nd
Annual International Meeting, SEG, Expanded Abstracts, 1–7,
https://doi.org/10.1190/segam2012-1473.1.
Yoon, K., and K. Marfurt, 2006, Reverse-time migration using the Poynting vector: Exploration
Geophysics, 37, 102–107, http://doi.org/10.1071/EG06102.
where obs and cal are the travel time difference between
The travel time difference is measured by DIW method
different traces within the observed and de-migrated data,
which is suitable for the rapidly varying time shifts (Hale
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
respectively.
2013). However instability can be observed for the noisy
data. In this paper, to further improve the performance of
As shown in Figure (1), we explain the physical meaning of
traditional DIW method, we add a regularization term to
the proposed misfit function as following. The black solid
the conventional DIW misfit function, which can be
line and dashed line represent the real reflector and the
expressed as:
migration image, respectively. The migration image differs
from the real reflector position because of the error of 1
Dcal (l ) dxs dxr dt
background model. t11 and t 21 are the travel time of the 2
observed data d11 , d 21 , respectively. t10 and t20 are the [( pc ( xr xr , t l ( xr , t ; xs ); xs ) pc ( xr , t ; xs )) 2 + l 2 ]
1
travel time of the de-migrated data d10 , d 20 , respectively. In Dobs (l ) dxs dxr dt
2
zero offset, the travel time of observed data t01 equals the
[( pobs ( xr xr , t l ( xr , t ; xs ); xs ) pobs ( xr , t ; xs )) 2 + l 2 ]
travel time of de-migrated data t00 , while for the far offset (2)
data, cycle skipping can be observed because of the where pobs and pc represent the observed and modeled
inaccurate background model. The conventional WERTI
data, respectively, l represents the travel time difference
aims at minimizing the difference between t11 and t10 and
for the data recorded at receiver position xr and source
the travel time difference is measured between the observed
and de-migrated data. Because DIW relies on the amplitude position xs . xr represents the trace interval for travel
information of seismic traces and the waveforms of de- time difference estimation within the common shot gather.
migrated and observed data are usually different, the travel is a weight factor which is determined based on human
time difference picked by DIW is not accurate enough for experience. The travel time difference is obtained by
inversion. However, the misfit function proposed in this minimizing the misfit function defined by equation (2). In
paper aims at minimizing the difference between this paper, the optimization problem is solved following the
t21 t11 and t20 t10 . Because the waveforms of different strategy proposed by Hale (2013).
traces within the modelled or de-migrated common shot
gather are similar, the travel time difference can be Figure (2) shows the travel time difference measured by
obtained accurately. Besides, the travel time difference DIW with and without the regularization term for the noisy
within the common shot gathers is measured between the data. The dashed black line represents the real travel time
near traces and a summation is performed from the near difference and the red line represents the estimated travel
offset to the far offset, which can further avoid the time difference. It is clear that, because the panel of the
waveform difference between the near offset trace and far misfit function is very irregular for the noisy data, the
offset trace. When the global minimum of the misfit travel time difference measured by conventional DIW
function is reached, using the zero offset trace as reference, method is different from the real travel time difference. We
we can claim that the travel time information of the also note that for where the amplitude of the signal is zero,
inverted model is accurate. the travel time difference is dominated by the noise, which
may leads to error for inversion. On the contrary, with the
help of regularization term, the measured travel time
difference fits the real travel time difference very well.
(a)
In this paper, we only invert for the background model Then we use the DWERTI method to update the
using DWERTI method. Using the Lagrange multiplier background model. Firstly the reflectivity model is
technique (Ma and Hale, 2013), the gradient of misfit generated by migration using the near offset data. The
function with respect to background model parameters can imaging result is high passed in wavenumber domain to
be expressed as: eliminate the low wavenumber noises. The travel time
qb x, t; xs pc x, t; xs (3)
difference between different traces within the de-migrated
( x) dxs dt and observed data is measured by regularized DIW method.
q x, t; x p x , t ; x
c s b s We use the gradient of misfit function (Eq.3) to update the
where: background model. With the updated background model,
xr , t ; xs ( cal xr , t ; xs obs xr , t ; xs ) the reflectivity is gradually corrected to the real position.
The update of low-wavenumber background model and the
p x xr , t cal xr , t ; xs ; xs reflectivity take place in the same iteration but in an
c r alternating fashion (Ma and Hale, 2013).
p x x , t x , t ; x ; x p x , t ; x
c r r cal r s s c r s
2 Figure (3.c) shows the updated low wavenumber
p c xr xr , t cal xr , t ; xs ; xs background model. Because the background model is
accurate for the kinematic information, conventional full
2
m ( x ) mr ( x ) 2
2 qb ( x, t ) waveform inversion is capable of providing a high
t resolution inversion result using the inverted model of
DWERTI as initial model. As shown in Figure (3.d), the
p c xr xr , t cal xr , t ; xs ; xs recovered model is almost identical to the real model for
the shallow to middle depth (<1.5km). Some artificial is
2 2 2
m( x) 2 qc ( x, t ) mr ( x) 2 qb ( x, t ) presented in the deep model for the poor illumination.
t t Overall, the inversion result of DWERTI is capable of
where m and mr represent the smooth background model providing accurate initial model for conventional FWI.
and the reflectivity model obtained by near offset migration To further investigate the reliability of the inversion result
at position x , respectively. of DWERTI, portions of the RTM results using the real
model, the initial model and the inversion result of
We iteratively update the background model using equation DWERTI are compared in Figure (4). Because the constant
(3) and the reflectivity by near offset migration. The gradient model is significantly far from the real model,
inverted background model, which provides accurate travel almost all of the reflectors are located in incorrect position
time information, can be further used as initial model for and the energy is not focused in the RTM result (Figure
conventional FWI. 4.b), compared with the RTM result using the real model.
With the accurate background model provided by DWERTI,
Example the RTM image is substantially improved. Two major
faults are better resolved and reflectors are now close to the
The synthetic data test involves a portion of SigsBee 2A true position.
model which is shown in Figure (3.a). The initial model is
set as linear gradient model except for the water layer, Conclusions
which is the same as the true model. 79 shots and 399
receivers are evenly distributed on the surface of the model In this paper, we propose a double difference wave
at 80m and 16m intervals, respectively. A Ricker wavelet equation travel time inversion with a regularized DIW
method. Based on the theoretical analysis and numerical Figure 3 (a) The real model; (b) The inversion result of the
test, we can draw the conclusions that include: (1) conventional FWI using the linear increasing model as
conventional FWI relies heavily on the accuracy of initial initial model; (c) The inversion result of DWERTI; (d) The
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
model and fails to produce satisfying inversion result inversion result of conventional FWI using (c) as initial
except for the shallow model illuminated by early arrivals model
when the initial model is not accurate enough. (2) Making
use of the travel time information from the seismic data is
the key point for building the accurate initial model for
conventional FWI. The method proposed in this paper can
make use of the travel time accurately and provide a
reasonable initial model for conventional FWI.
Acknowledgment
(b)
(a)
(c)
(b)
Figure 4 Part of the migration result using (a) real model (b)
the initial model and (c) the inversion result of DWERTI
(c)
(d)
REFERENCES
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Chi, B., L. Dong, and Y. Liu, 2015, Correlation-based reflection full-waveform inversion: Geophysics,
80, no. 4, R189–R202, http://doi.org/10.1190/geo2014-0345.1.
Hale, D., 2013, Dynamic warping of seismic images: Geophysics, 78, no. 2, S105–S115,
http://doi.org/10.1190/geo2012-0327.1.
Ma, Y., and D. Hale, 2013, Wave-equation reflection traveltime inversion with dynamic warping and full-
waveform inversion: Geophysics, 78, no. 6, R223–R233, http://doi.org/10.1190/geo2013-0004.1.
Venstad, J. M., 2014, Dynamic time warping — An improved method for 4D and tomography time shift
estimation?: Geophysics, 79, no. 5, R209–R220, http://doi.org/10.1190/geo2013-0239.1.
Wang, H., S. C. Singh, F. Audebert, and H. Calandra, 2015, Inversion of seismic refraction and reflection
data for building long-wavelength velocity models: Geophysics, 80, no. 2, R81–R93,
http://doi.org/10.1190/geo2014-0174.1.
Xu, S., D. Wang, F. Chen, G. Lambaré, and Y. Zhang, 2012, Inversion on reflected seismic wave: 82nd
Annual International Meeting, SEG, Expanded Abstracts, 1–7, http://doi.org/10.1190/segam2012-
1473.1.
Bingbing Sun and Tariq Alkhalifah, King Abdullah University of Science and Technology
SUMMARY deal with up-going and down-going waves and can not distin-
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Z where for the far field approximation, we assume that 1/R <<
w2 (3) w/v.
= w(w)G(x, xs , w)G(x, xr , w)I(x)
v(x)2
Substitution of the derivatives of equation ( 10 ) and ( 11 ) into
5 · [5G(x, xs , w)I(x)] G(x, xr , w)w(w)dx
Z equation (8), we obtain
w2 ZZ " iwR/v ⇣ #
= w(w)G(x, xs , w)I(x)G(xr , x, w) e w z0 ⌘
v(x)2 p(x0 , y0 , z0 , w) = u 2
kz kz dxdy.
R v R
5 · [5G(x, xs , w)I(x)] G(xr , x, w)w(w)dx z=0
(12)
where ⇤ denotes the adjoint operation, and over line represents It is an oscillatory integral and can be approximated using the
the complex conjugate. We re-express equation 3 using a par- stationary phase method. The phase of this oscillatory integral
tial differential equation (PDE) in the time domain as: is expressed as :
1 ∂2p 1 ∂ 2u R kx x ky y
= Dp + 2 2 d v 5 · (d v 5 u), (4) F(x, y) =+ + . (13)
v ∂t
2 2 v ∂t v w w
where, we replaced the image I(x) with the velocity perturba- The stationary phase approximation gives the result in the form
tion d v here, u is the incident (background) wavefield and p is of
the Born scattered wavefield. " #
eiwR/v ⇣ 2 w z0 ⌘
p= c u kz kz , (14)
high frequency asymptotic analysis R v R ⇤ ⇤
(x,y)=(x ,y ),z=0
In RWI, the velocity perturbation d v is a band limited Dirac where c is the weighting term at the stationary point. (x⇤ , y⇤ )
delta function. In the following, we will demonstrate mathe- is the stationary point; it is calculated by setting the gradient
matically that when we consider the velocity perturbation to of the phase of equation (13) to zero, i.e., 5F = 0:
be a Dirac function, ENBS given by equation (4) will produce
∂ F x⇤ x0 kx
pure reflections. = + = 0; (15)
∂x Rv w
As the background wavefield u satisfies the wave equation ∂ F y⇤ y0 ky
= + = 0. (16)
1 ∂ 2u ∂y Rv w
= 5 · 5u. (5) Thus, at the stationary point, we have
v2 ∂t 2
The source term for ENBS of equation (4 ) can be simplified w 2 (x⇤ x0 )2 (y⇤ y0 )2
as kx2 + ky2 = 2 +
v R2 R2
1 ∂ 2u (17)
f = 2 2 d v 5 · (d v 5 u) = 5 d v · 5u. (6) w 2 w 2 z20
v ∂t = 2 .
In the frequency domain, the scattering wavefield can also be v v2 R2
expressed as a 3D integration using the Green function: Using the dispersion relation for plane waves in equation (10),
ZZZ it can be simplified as
eiwR
p(x0 , y0 , z0 , w) = 5 d v · 5u dV (7) w2 2 2 w
R kz2 = z /R or R = |z0 |. (18)
p v2 0 v|kz |
where R = (x x0 )2 + (y y0 )2 + (z z0 )2 .
Substitute this equation into equation (14), we obtain the Born
Suppose the velocity perturbation is a Dirac delta function d v = wavefield as
d (z). Equation (7) can be simplified as " #
ZZ ✓ ◆ eiwR/v z0
∂ ∂ u eiwR p= c u kz2 sign(kz )kz2 .
p(x0 , y0 , z0 , w) = dxdy. (8) R ⇤ ⇤
|z 0|
(x,y)=(x ,y ),z=0
∂z ∂z R z=0 (19)
Here we use the shifting properties for the Dirac delta function: Considering the term in the second brace, It is clear that if the
Z incident wavefield is down going with kz > 0, the Born wave-
∂ d (z) ∂ q(z)
q(z)dz = |z=0 . (9) field using ENBS would be zero for z0 > 0 and it corresponds
∂z ∂z
to the zone for transmission waves in this setup. In the deriva-
Considering the incident wavefield as a plane wave, we have: tions, we assumed the velocity perturbation or the image as a
w2 Dirac delta function. As we pointed out before, in RWI the im-
u =ei(kx x+ky y+kz z) , kx2 + ky2 + kz2 = , age would actually be a band limited Dirac function, we show
v2
(10) that in this slightly degenerated situation, the proposed ENBS
∂u ∂ 2u
= ikz u, = kz2 u. can still perform well in attenuation of transmission waves.
∂z ∂ z2
depth [km]
2
Figures 1b and 1c are snap shots of the Born scattered wave-
field at t = 2.8s for the conventional Born and for ENBS, re-
spectively. From this result, it is obvious that ENBS can atten- 3
uate transmission waves in the scattered wavefield effectively
for both vertical and horizontal propagated incident wavefields.
4
The same features holds in the frequency domain by solving
for the scattered wavefield using the Helmholtz wave equation
5
with the proper source function. This time the source is placed
on the surface in the middle (yellow star in Figure 2a, and the
background velocity is 2 km/s. Figure 2a shows the veloc- (a)
(b)
CONCLUSIONS
distance [km]
We proposed an effective and efficient method to attenuate 0 2 4
transmissions in Born modeling, which we refer to as the En- 0
ergy Norm Born Scattering (ENBS). We showed analytically
that given a velocity perturbation represented by a Dirac delta
function, ENBS produces pure reflections, i.e., transmission 1
energy is attenuated. Numerically, similar observations are
realized with band limited Delta functions. Specifically, the
depth [km]
2
numerical examples show that the proposed method produces
pure reflections in both the time and frequency domains. Using
ENBS, we can produce a smooth gradient in RWI free of high 3
wavenumber artifacts, which is essential for good convergence
of the inversion process.
4
(c)
1
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
depth [km]
2
distance [km]
3 0 2 4
0
4
1
5
depth [km]
2
(a)
3
distance [km]
0 2 4 4
0
5
1
(a)
depth [km]
2
distance [km]
3 0 2 4
0
4
1
5
depth [km]
2
(b)
3
distance [km]
0 2 4 4
0
5
1
(b)
depth [km]
2
Figure 3: Application of ENBS in reflection waveform inver-
sion: wavepath-based gradients computed using (a) the con-
3 ventional born modeling, and using (b) ENBS.
(c)
REFERENCES
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Irabor, K., and M. Warner, 2016, Reflection FWI: 81st Annual International Meeting, SEG, Expanded
Abstracts, 1136–1140, https://doi.org/10.1190/segam2016-13944219.1.
Liu, F., G. Zhang, S. A. Morton, and J. P. Leveille, 2011, An effective imaging condition for reverse-time
migration using wavefield decomposition: Geophysics, 76, S29–S39,
https://doi.org/10.1190/1.3533914.
Rocha, D., N. Tanushev, and P. Sava, 2016, Acoustic wavefield imaging using the energy norm:
Geophysics, 81, no. 4, S151–S163, https://doi.org/10.1190/geo2015-0486.1.
Virieux, J., and S. Operto, 2009, An overview of full-waveform inversion in exploration geophysics:
Geophysics, 74, no. 6, WCC1–WCC26, https://doi.org/10.1190/1.3238367.
Elastic reflection waveform inversion (ERWI) utilize the re- approach requires huge computational cost, especially in 3D
flections to update the low and intermediate wavenumbers in case.
the deeper part of model. However, ERWI suffers from the
cycle-skipping problem due to the objective function of wave- Traveltime information are more sensitive and linearly related
form residual. Since traveltime information relates to the back- to the low-wavenumber components of the model. Therefore,
ground model more linearly, we use the traveltime residuals traveltime inversion will be more robust and helpful to build
as objective function to update background velocity model us- initial models containing long-wavelength components for con-
ing wave equation reflected traveltime inversion (WERTI). The ventional FWI (Chi et al., 2015; Luo et al., 2016). Ma and Hale
reflection kernel analysis shows that mode decomposition can (2013) introduced a wave equation reflected traveltime inver-
suppress the artifacts in gradient calculation. We design a sion (WERTI) method to build the low wavenumber of the
two-step inversion strategy, in which PP reflections are firstly model. Unfortunately, in elastic case, traveltimes of a particular
used to invert P wave velocity (Vp ), followed by S wave ve- wave mode are difficult to extract due to the complicated mode-
locity (Vs ) inversion with PS reflections. P/S separation of conversions. The multi-parameter trade-offs will increase the
multi-component seismograms and spatial wave mode decom- nonlinearity as well. Wang and Cheng (2017) obtained good
position can reduce the nonlinearity of inversion effectively results by utilizing the wave mode decomposition to precon-
by selecting suitable P or S wave subsets for hierarchical in- dition the gradients in EFWI. The mode decomposition has
version. Numerical example of Sigsbee2A model validates the been proved an efficient tool to provide decomposed data for
effectiveness of the algorithms and strategies for elastic WERTI hierarchical strategies.
(E-WERTI). We calculate the reflection wavepath and its components of
different wave modes in elastic media. The investigation of
reflection wavepath (kernel) shows the effectiveness of mode
INTRODUCTION decomposition to suppress the artifacts in the gradient cal-
culation. P/S separation of multicomponent seismograms is
With the development of high-performance computational abil- applied to the observed and predicted reflection data to extract
ities, people are paying more attention to the elastic full wave- the individual traveltime residuals through DIW (Hale, 2013).
form inversion (EFWI) to recover the elastic properties of the Based on the analysis of elastic reflection kernels and the sep-
subsurface (Vigh et al., 2014). EFWI provides high-resolution arated traveltime residuals, we design a two-stage workflow to
model estimation but notoriously surfers from the same non- implement the E-WERTI method, in which the traveltime of
linearities or cycle-skipping problems as in acoustic case and PP is firstly used to recover the background Vp model followed
also the trade-offs of multi-parameter inversion (Operto et al., by inverting the background Vs model through the traveltime
2013). In the absence of low frequency data and/or good initial of PS reflections. During the Vs inversion, we precondition
models, EFWI falls into local minimal easily because of its the Vs gradient through spatial wave mode decomposition. Fi-
incapability to recover the low and intermediate wavenumber nally, the numerical example of Sigsee2A model proves the
components of the model. robustness and validity of our E-WERTI method.
Xu et al. (2012) suggested using a reflection waveform inver-
sion (RWI) method to suppress the nonlinearity in FWI, which
aim to invert the long-wavelength components of the model by METHOD
using the reflections predicted by migration/demigration pro-
cess. RWI highly relies on the accurate reflectivity to generate Objective function and gradients of elastic WERTI
the reflections that can match the observed data. However, it is Assume that there is a perturbation δci j k l in the background
very challenging and also expensive to obtain a good reflectiv- elastic media ci j k l , the background wavefileds ui and perturbed
ity model through least-square migration when initial model is wavefields δui satisfy:
far away from the true value. Through minimizing misfit func- ∂u2 ∂ ∂u
tion of waveform in data domain, the RWI method is developed ρ 2i − ci j k l k = f i , (1)
∂t ∂xj ∂ xl
by several works (Wu and Alkhalifah, 2015; Zhou et al., 2015),
and recently extended to elastic case by Guo and Alkhalifah and
(2016). The misfit function also can be built in image domain
∂δu2i ∂ ∂δuk ∂ ∂uk
in the manner of wave equation migration velocity analysis ρ 2 − c = δci j k l , (2)
(WEMVA), which tries to maximize energy at zero offset loca- ∂t ∂ x j i j k l ∂ xl ∂xj ∂ xl
tion in the extended image space (Biondi and Almomin, 2012; where δui can be seen as the demigrated reflection data using
Sun and Symes, 2012). Raknes and Weibull (2016) developed the image perturbation δci j k l obtained from RTM or other
the image-domain method in the 3D elastic media. Wang et al. imaging method. In WERTI, we aim to minimize the traveltime
differences between observed data do and calculated data dc , where m0 ∈ {Vp , Vs } and M, N ∈ {P, S}. K m M N represents the
0
then the objective function is: cross correlation between the M mode forward wavefields and
the N mode adjoint wavefields. Note, it does not denote the
c o 2
τ(xr, t; xs ) = argτmin k d (xr , t; x s ) − d (xr , t + τ; x s ) k
kernel of M N mode data. For example, the reflection kernel of
Z PS data should be (u P · δψ P + δuS · ψ S ) but not K PS .
E = 1
τ 2 (xr, t; xs )dtdxr dxs,
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Depth(km)
Vp (km/s)
Vs (km/s)
ond term indicate the source and receiver part of the reflection 0.75 3.2 0.75 1.6
= 2ρVp δi j δk l ,
∂Vp ∂ci j k l (a) (b)
(7)
∂E ∂E
= 2ρVs (−2δ i j δ k l + δ ik δ jl + δ il δ j k ). Position(km) Position(km)
∂Vs ∂ci j k l 0
0 1.5 3
0
0 1.5 3
Depth(km)
flection kernel. RWI and WERTI utilize different objective 0.75 0.75
functions which only induce different types of adjoint sources,
but share similar reflection kernels. Due to the complex mode
conversions in elastic wavefields, wavepath of elastic reflec-
tions will be far more complicated than that in acoustic case.
1.5 1.5
Here, we decompose the origin kernel into four components
which represent cross-correlation of different wave modes. (c) (d)
For simplicity, we rewrite (4) as follow: Figure 1: Kernels with single reflector in Vp model. (a) Vp
Z model, (b) Vs model, (c) KVp , (d) KVs .
∇E(m0 ) = − (u · δψ + δu · ψ) (8)
with u and ψ are the forward and adjoint background wave- In the second model, we use the Vs reflector (Fig 2a and b) to
fields, δu and δψ are the forward and adjoint perturbed wave- generate both PP and PS reflection. The Vp kernel excludes
fields. The operator · denotes the cross correlation between two S-wavefield automatically because of the divergence operator
wavefields. Note that, equation (8) just schematically shows the implied in the term δ i j δ k l (equation (7)). However, δψ con-
manner of cross correlation. The detailed formulas should be tains the non-physical converted SP wavefields generated by
derived according to the parameter m0 through chain rule, just the back-propagated ψ S at the location of reflector. These SP
as equation (7). Considering mode decomposition, the above wavefields make the Vp kernel slightly different from that in
formula can be decomposed into four types with: Figure 1c. If we only back-propagate the PP data, Vp kernel
Z will be the same as Figure 1c.
MN
Km 0
= − (u M · δψ N + δu M · ψ N ), (9) For Vs kernel (Figure 2d), due to mode conversions, multi-
Depth(km)
Depth(km)
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
to KVPPp
but with an opposite sign. KVPS s
and KVSsP mainly 0.75 0.75
Position(km) Position(km)
0 1.5 3 0 1.5 3
0 3.4 0 1.7
✸ ▼ ✸ ▼
Depth(km)
Depth(km)
✸ ▼ ✸ ▼
0.75 0.75
Depth(km)
Depth(km)
Vp (km/s)
Vs (km/s)
0.75 3.2 0.75 1.6
1.5 1.5
1.5 3.0 1.5 1.5
(c) (d)
(a) (b)
✸ ▼ ✸ ▼
Since only the traveltime is considered in WERTI, just ERTM
Depth(km)
Depth(km)
2.5
Depth (km)
Depth (km)
2.5
Vp (km/s)
Vs (km/s)
Depth (km)
1 1
Vp (km/s)
1.5 1
2.0
2 2
2.0
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
1.5 1
(a)
2.5
Depth (km)
Depth (km)
Vp (km/s)
Vs (km/s)
1 1
1.5
Position (km)
2.0
0 1 2 3 4 5
0 2
2 2
1.5 1
Depth (km)
Figure 4: Sigbee2A model example. On the top are true models
Vs (km/s)
1
1.5
of Vp (a) and Vs (b). On the bottom are initial models of Vp
(c) and Vs (d) linearly increasing with depth.
2
(b)
NUMERICAL EXAMPLE
Position (km)
We select a part of the Sigsbee2A model (Figure 4a and 4b) 0
0 1 2 3 4 5
3.0
to test the inversion algorithm and strategy. The Vs model is
generated using fixed Poisson’s ratio. The initial model for E-
2.5
Depth (km)
WERTI are shown in Figure 4c and 4d. The linearly increasing
Vp (km/s)
1
Figure 5a and 5b show the inverted results of E-WERTI. After Position (km)
0 1 2 3 4 5
40 iterations for each stage, WERTI provides a good recovery of 0 2
Vs (km/s)
1
tion is insufficient for WERTI to rebuild the long-wavelength 1.5
Figure 5: Inverted results of WERTI and EFWI. (a) and (b) are
CONCLUSIONS inverted Vp and Vs model through two-stage elastic WERTI
with the linearly increased models as initial models. (c) and
Reflection traveltime inversion only minimizes traveltime mis- (d) are inverted Vp and Vs through EFWI using (a) and (b) as
fits which are more sensitive and linearly related to the low- starting models.
wavenumber model perturbation. The kernel analysis of differ-
ent wave modes show that mode decomposition can suppress
artifacts and recover the correct reflection wavepath in gradient ACKNOWLEDGEMENT
calculation. With the aid of DIW and P/S separation of 3C seis-
mograms, we can obtain the travel time differences of PP and PS This work is supported by the National Natural Science Foun-
reflections, respectively. To build the long-wavelength compo- dation of China (NO.41474099, 41674117 & 41630964). This
nent of the model, we introduce a two-stage WERTI workflow paper is also based upon the work supported by the King Abdul-
by firstly using PP then PS reflections, through which the non- lah University of Science and Technology (KAUST) Office of
linearity of reflection inversion is reduced effectively. In the Sponsored Research (OSR) under award NO. 2230. We appre-
second stage, the wave mode decomposition is introduced to ciate the open-source package of DENISE from https://github.com/daniel-
calculate the gradient of Vs to mitigate the trade-off between koehn/ and Mines Java Toolkit from https://github.com/dhale.
Vp and Vs . The Sigsbee2A model example shows that even We thank the useful advice from Tariq Alkhalifah (KAUST),
starting with a bad initial model, the two-stage E-WERTI can Qiang Guo (KAUST), Zedong Wu (KAUST), Chenlong Wang
provide reliable starting model for conventional EFWI. (Tongji University) and Benxin Chi (Los Alamos).
REFERENCES
Downloaded 09/26/17 to 80.82.77.83. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/
Biondi, B., and A. Almomin, 2012, Tomographic full waveform inversion (TFWI) by combining full
waveform inversion with wave-equation migration velocity analysis: SEG.
Technical Program Expanded Abstracts 2012: Society of Exploration Geophysicists.
Chi, B., L. Dong, and Y. Liu, 2015, Correlation-based reflection full-waveform inversion: Geophysics,
80, no. 4, R189–R202, http://dx.doi.org/10.1190/geo2014-0345.1.
Guo, Q., and T. Alkhalifah, 2016, A nonlinear approach of elastic reflection waveform inversion: SEG,
1421–1425.
Hale, D., 2013, Dynamic warping of seismic images: Geophysics, 78, no. 2, S105–S115,
http://dx.doi.org/10.1190/geo2012-0327.1.
Li, Z., X. Ma, C. Fu, B. Gu, and G. Liang, 2016, Frequency wavenumber implementation for P- and S-
wave separation from multi-component seismic data: Exploration Geophysics, 47, 32,
http://dx.doi.org/10.1071/EG14047.
Luo, Y., Y. Ma, Y. Wu, H. Liu, and L. Cao, 2016, Full-traveltime inversion: Geophysics, 81, no. 5,
R261–R274, http://dx.doi.org/10.1190/geo2015-0353.1.
Ma, Y., and D. Hale, 2013, Wave-equation reflection traveltime inversion with dynamic warping and full-
waveform inversion: Geophysics, 78, no. 6, R223–R233, http://dx.doi.org/10.1190/geo2013-
0004.1.
Operto, S., Y. Gholami, V. Prieux, A. Ribodetti, R. Brossier, L. Metivier, and J. Virieux, 2013, A guided
tour of multiparameter full-waveform inversion with multicomponent data: From theory to
practice: The Leading Edge, 32, 1040–1054, http://dx.doi.org/10.1190/tle32091040.1.
Raknes, E. B., and W. Weibull, 2016, Combining wave-equation migration velocity analysis and full-
waveform inversion for improved 3D elastic parameter estimation: 86th Annual International
Meeting, SEG, Expanded Abstracts, 1320–1324, http://dx.doi.org/10.1190/segam2016-
13858670.1.
Sun, D., and W. W. Symes, 2012, Waveform inversion via nonlinear differential semblance optimization:
82nd Annual International Meeting, SEG, Expanded Abstracts, 1–7,
http://dx.doi.org/10.1190/segam2012-1190.1.
Vigh, D., K. Jiao, D. Watts, and D. Sun, 2014, Elastic full waveform inversion application using
multicomponent measurements of seismic data collection: Geophysics, 79, no. 2, R63–R77,
http://dx.doi.org/10.1190/geo2013-0055.1.
Wang, C., W. Weibull, J. Cheng, and B. Arntsen, 2017, Automatic shear-wave velocity analysis with
elastic reverse time migration: 79th EAGE Conference and Exhibition 2017, Expanded Abstracts.
Wang, T., and J. Cheng, 2017, Elastic full waveform inversion based on mode decomposition: the
approach and mechanism: Geophysical Journal International, 209, 606–622,
http://dx.doi.org/10.1093/gji/ggx038.
Wu, Z., and T. Alkhalifah, 2015, Simultaneous inversion of the background velocity and the perturbation
in full-waveform inversion: Geophysics, 80, no. 6, R317–R329,
http://dx.doi.org/10.1190/geo2014-0365.1.
Xu, S., D. Wang, F. Chen, G. Lambare, and Y. Zhang, 2012, Inversion on reflected seismic wave: 82nd
Annual International Meeting, SEG, Expanded Abstracts, 1–7,
http://dx.doi.org/10.1190/segam2012-1473.1.