You are on page 1of 10

GEOPHYSICS, VOL. 71, NO. 3 共MAY-JUNE 2006兲; P. J41–J50, 8 FIGS.

10.1190/1.2197491
Downloaded 07/01/16 to 141.218.1.105. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/

Structural inversion of gravity data using linear programming

Tim van Zon1 and Kabir Roy-Chowdhury1

ratory or in situ in boreholes. These properties can be used then to


ABSTRACT characterize and/or differentiate lithologies. These properties may,
however, vary within the same lithology at different sites or at dif-
Structural inversion of gravity data — deriving robust ferent depths. Different rock types can, moreover, have similar
images of the subsurface by delineating lithotype bound- physical properties 共partially overlapping range of values兲, e.g.,
aries using density anomalies — is an important goal in a density or susceptibility, thus making difficult the discrimination of
range of exploration settings 共e.g., ore bodies, salt flanks兲. the lithology based on these physical properties. For lithologies
Application of conventional inversion techniques in such with distinctly different physical properties, appropriate geophysi-
cases, using L2-norms and regularization, produces smooth cal techniques can be used to delineate the boundaries among
results and is thus suboptimal. We investigate an L1-norm- them. Some examples of such properties are density contrast 共salt
based approach which yields structural images without the structures, heavy ore bodies兲, susceptibility contrast 共magnetic-ore
need for explicit regularization. The density distribution of bodies and dikes兲 and resistivity contrast 共aquifers, ore bodies兲.
the subsurface is modeled with a uniform grid of cells. The Determining geologic boundaries from geophysical measurements
density of each cell is inverted by minimizing the L1-norm is a type of geophysical inversion.
of the data misfit using linear programming 共LP兲 while sat- A classic approach for L2-norm-based geophysical inversion,
isfying a priori density constraints. The estimate of the henceforth called the tomographic inversion, parameterizes the re-
noise level in a given data set is used to qualitatively de- gion of interest with many homogeneous 2D/3D prisms/cells and
termine an appropriate parameterization. The 2.5D and 3D estimates the physical property 共density, acoustic velocity, conduc-
synthetic tests adequately reconstruct the structure of the
tivity, etc.兲 of each cell. The solution is obtained using the maxi-
test models. The quality of the inversion depends upon a
mum likelihood 共ML兲 method 共Menke, 1984; Press et al., 1992兲,
good prior estimation of the minimum depth of the anoma-
which finds the model for which the data, given a noise distribu-
lous body. A comparison of our results with one using trun-
tion, are most likely.
cated singular value decomposition 共TSVD兲 on a noisy syn-
Sometimes the estimates of the model parameters, given by the
thetic data set favors the LP-based method.
ML method, can have large variances; small perturbations in the
There are two advantages in using LP for structural inver-
data will cause large changes in the estimates. To reduce these vari-
sion of gravity data. First, it offers a natural way to incorpo-
ances, additional constraints are imposed on the model parameters.
rate a priori information regarding the model parameters.
For example, regularization, often used in the tomographic inver-
Second, it produces subsurface images with sharp bound-
sion, is the rule that the parameters should be as small as possible
aries 共structure兲.
with respect to some background 共starting兲 model or that some spa-
tial derivative of the parameters should be small, i.e., the model
should have a certain smoothness 共Tikhonov regularization, Tik-
INTRODUCTION honov and Arsenin, 1977兲. An example of tomographic inversion
of gravity data is given by Bear et al. 共1995兲.
An important aspect of geophysical exploration is the delinea- Jackson 共1972兲 took another approach, using the singular value
tion of the lithologies present in the subsurface. Contrasts in spe- decomposition 共SVD兲 of the forward operator, which maps the val-
cific physical properties are used as discriminants for this purpose. ues of the model parameters into data points. SVD shows which
Physical properties of rocks, e.g., density, magnetic susceptibility, combinations of the parameters have a relatively large influence on
acoustic velocities, and conductivity can be measured in the labo- the data 关large singular value 共SV兲兴, a relatively small influence on

Presented, in part, at the 74th Annual Meeting, Society of Exploration Geophysicists. Manuscript received by the Editor April 30, 2004; revised manuscript
received October 6, 2005; published online May 24, 2006.
1
Utrecht University, Department of Earth Sciences, Budapestlaan 4, 3584 CD Utrecht, the Netherlands. E-mail: zon@geo.uu.nl; kabir@geo.uu.nl.
© 2006 Society of Exploration Geophysicists. All rights reserved.

J41
J42 van Zon and Roy-Chowdhury

the data 共small SV兲, or none at all 共zero SV兲. A solution 共model兲 density 共minimum-gradient support兲, while limiting the influence
can then be obtained by ignoring the relatively small SVs below a of large density contrasts to a constant value. Additionally, they set
chosen threshold. The eigenvectors corresponding to the neglected a limit on the maximum and minimum density values to prevent
SVs are linear combinations of the model parameters, which hard- geologically unrealistic models. They were able to regularize a
ly influence the predicted data and can, in turn, be poorly deter- gravity inversion and obtain structured images. Li and Oldenburg
mined from the measurements. The extreme case of zero-valued 共1998兲 devised an objective function in which an a priori model
Downloaded 07/01/16 to 141.218.1.105. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/

SVs corresponds to eigenvectors spanning the so-called null space. could be specified that the result should resemble. Furthermore, the
Neglecting the relatively small SVs results in a robust solution level of smoothness of the result could be specified in all three di-
with small variances for the parameters. mensions. With the aid of a properly chosen weighting function,
An advantage of the tomographic approach is the ease of solving they obtained good results.
the problem. A gridded medium 共consisting of many homogeneous Use of a non-L2-norm for SI also has been investigated. Olden-
cells兲 is used, and the physical property of each cell is inverted. burg and Ellis 共1991兲 presented a scheme for an explicit SI using
With the maximum likelihood method, a quadratic cost function is linear programming 共LP兲. They applied LP to a 1D magnetotelluric
used, which is a function of the model parameters. The set of pa- inverse problem, utilizing approximate inverse mapping. Within an
rameter values corresponding to the minimum of the cost function iterative scheme, they used LP to calculate a smooth update for
is the solution to the problem. The cost function is usually the sum their layered conductivity model. Their objective function con-
of two terms, a term for the data misfit 共say, the L2-norm兲, and a tained the absolute value of the data residuals in trade-off with the
regularization term. The minimum is efficiently found by 共sparse兲 absolute value of the differences in conductivity between all neigh-
conjugate gradient solvers. Furthermore, gridded models allow boring layers. Obviously, the trade-off parameter must be chosen
easy manipulation and representation of arbitrary complexities. to correctly balance the data fit with the size of the model update.
The main problem in using the tomographic approach with Linear programming can be used also to perform an implicit SI.
smoothing for structural inversion is the smoothing, often forced Esparza and Gómez-Treviño 共1997兲 gave an example to interpret
on the model by the regularization. Sharp boundaries in the real electrical sounding data. They applied LP to invert for the resistivi-
medium get smeared out in the reconstructions. Many economi- ties of a large number of thin layers 共a mathematical model兲. The
cally interesting geologic structures 共ore bodies, salt flanks, etc.兲 aim was to obtain a nonsmooth result with only a few different val-
are not smooth spatially; they have relatively sharp boundaries. ues for the resistivity, i.e., a few layers 共a geologic model兲.
Hence, results of tomographic inversions may be difficult to inter- Here, we propose a method for an implicit structural inversion
pret in terms of distinct boundaries between lithologies. of geophysical data in a two-lithology setting. We illustrate the
In contrast to the tomographic approach, explicit structural in- technique using synthetic gravity data 共with noise兲, but the ap-
version 共SI兲 aims to explain the observed data in terms of a limited proach can be used for other types of geophysical data as well. We
number of anomalous bodies and to find their shapes. Sometimes, invert for densities associated with the homogeneous cells of a sub-
physical rock properties are solved for as well. The outcome of an surface gridded model using LP to minimize the L1-norm of the
explicit SI is a clear spatial distribution of the rock properties, data residuals, assuming a maximum density contrast.
which can easily be interpreted as a lithological image of the sub-
surface 共Jupp and Vozoff, 1975; Ditmar, 2002兲. LP FOR GEOPHYSICAL INVERSION
The explicit SI techniques have certain drawbacks, too. First,
they require rather detailed a priori information about the medium; LP is a method to perform constrained optimization; see, for in-
at least, the number of anomalies and their approximate locations stance, Brown and Brown 共1992兲 or Press et al. 共1992兲. Here, the
should be known. Second, manipulating a model with structural in- constraints are linear inequalities in the variables, and the function
formation explicitly included may be difficult, especially in a 3D to optimize is also a linear function of the variables.
situation. Last, computing the model response 共the forward prob- In general, a set of inequalities has no solution, one solution, or
lem兲 and the Jacobian for such models may also be difficult, espe- more than one solution. If the constraints on some variable x are
cially if the computation is based on solving differential equations mutually inconsistent, e.g., x ⬎ 5 and x ⬍ 3, then, obviously, no
numerically, e.g., for the inversion of electrical conductivity data. solution exists. However, usually there are many combinations of
The drawbacks of the two inversion approaches 共tomographic the variables that satisfy all the constraints. The set of all these
and explicit兲 could be overcome by combining them. In this ap- combinations is called the feasible set.
proach, the subsurface is represented by a rectangular grid. Struc- A graphic example of a feasible set in two variables, x and y, is
tural constraints can be implemented in this type of model in an given in Figure 1. This figure also contains contours of an objective
implicit form — by encouraging a high contrast in properties be- function ␾ = y − x/2. Nonnegativity 共x,y ⱖ 0兲 restricts the set to
tween neighboring cells. This should make interpreting the result, the positive quadrant. The two solid lines represent some addi-
in terms of delineation of lithologies, easier than in the conven- tional constraints given by y ⱕ −0.5x + 5 and y ⱕ −3x + 15. The
tional tomographic approach. feasible set is the shaded area, the model-space for x,y for which
Structural inversion also can be carried out implicitly. Several all constraints are satisfied. The dashed lines are contours of the
authors 共e.g., Last and Kubik, 1983; Guillen and Menichetti, 1984; objective function. The minimum of this objective function, satis-
Acar and Vogen, 1994; Li and Oldenburg, 1998; Zhdanov and Hur- fying all the constraints, is at 共5,0兲, at a corner of the feasible set,
san, 2000; Portniaguine and Zhdanov, 2002; Farquharson and Old- with ␾ = −2.5.
enburg, 2003兲 have used this approach for obtaining nonsmooth The location of the extremum 共maximum or minimum兲 of the
models from different types of geophysical data. Portniaguine and linear objective function is normally on a corner of the 共convex兲
Zhdanov 共1999兲 used this approach for the inversion of gravity feasible set. In the special case that a constraint is parallel to the
data. They minimized the area with a nonzero spatial gradient in objective function, the extremum of the objective function can be a
Structural inversion using LP J43

line segment or a 共hyper兲 plane, which is then a model null space. Gm ⱕ d + ⑀ , 共4兲
A LP problem can be solved with the Simplex method. The
implementation used here is the one given by Press et al. 共1992兲. where the vector ⑀ has positive components that need to be prop-
erly chosen, e.g., on the basis of the noise characteristics. The left-
hand side and the right-hand side here are both column vectors of
Application of LP to gravity inversion
the same length. The notation with the ⱕ sign in this equation 共and
Downloaded 07/01/16 to 141.218.1.105. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/

To apply linear programming to exploration geophysics, an ob- in the following兲, should be read as an inequality equation for each
jective function needs to be defined, and constraints need to be set. row.
In the case of gravity anomalies, the model vector m contains the Similarly, the greater-than constraint equations are given by:
density contrasts of the prisms with respect to some background
value. The first set of constraints needed on the unknowns m j, the Gm ⱖ d − ⑀ and
components of the model vector m are the nonnegativity con-
m ⱖ 0. 共5兲
straints: m j ⱖ 0. The use of additional constraints and the choice
of the function to optimize have been investigated by such authors
Safon et al. 共1977兲 used LP to optimize various moments of the
as Safon et al. 共1977兲 and Cuer and Bayer 共1980兲. The former have
densities of 2D prisms, to get lower and upper bounds for the total
used the constraints that the difference between the predicted data
mass of the anomalous body, an estimate for the location of its cen-
from a trial density distribution and the actual measurements must
ter of mass and limits for its horizontal extent. This can be consid-
be smaller than a certain error ⑀, specified individually for each da-
ered an explicit structural inversion.
tum. Below, we take a closer look at their approach.
The drawback of the approach of Safon et al. 共1977兲 and of Es-
The subsurface is parameterized with prisms for which the den-
parza and Gómez-Treviño 共1997兲 is that in equations 4 and 5 they
sity contrasts are sought. Nagy 共1966兲 derived a formula 共by per-
do not allow deviations larger than ⑀ between the forward-mod-
forming the volume integration兲 for the force of gravity of a buried
eling results and the measured data. The maximum deviation 共posi-
prism of unit density at any measuring station:
tive and negative兲 is fixed. Some of our tests showed that in the

冏冏冏
case of Gaussian noise, or worse, 共noise distributions with many
outliers兲 this can cause incompatible constraints, i.e., there may be
f=G x ln共y + r兲 + y ln共x + r兲 no feasible set and thus no solution.
Furthermore, the inner workings of the algorithm will cause

− z arcsin 冉 z2 + y 2 + yr
共y + r兲冑y 2 + z2
冊冏 冏 冏z = z2 y = y 2 x = x2

z = z1 y = y
1 x = x1
,
some of the data-imposed greater-than type constraints to be satis-
fied as equalities. The data will be fitted improperly as datum plus
共or minus兲 the allowed deviation.
The objective functions used by Oldenburg and Ellis 共1991兲 and
共1兲 Esparza and Gómez-Treviño 共1997兲 were L1-norm objective func-
tions, which are not linear. These absolute-value functions can be
G being the universal gravitational constant and r = 冑x2 + y 2 + z2. handled within the LP formalism by introducing some extra vari-
We follow the notation used by Nagy on the right hand side, i.e., ables and constraints, as will be shown in the next section.
兩 f共t兲兩t=t
t=t2
1
should be read as 共 f共t2兲 − f共t1兲兲. The eight corners of the
prism, relative to the measuring position, are given by 共xu,y v,zw兲, Formulating L1-norm objective functions for LP
with u, v, w = 1, 2. The formula is valid for points outside or on Like Oldenburg and Ellis 共1991兲, we will minimize the absolute
the border of the prism. See Nagy 共1966兲 for details. value of the data misfit, minimizing the L1-norm of the difference
Using equation 1 and writing Gij as the contribution of the jth
prism of unit density at the ith measurement station, the total force
of gravity di can be written as

M
di = 兺
j=1
Gijm j , 共2兲

m j being the densities of the prisms. This can be written in a matrix


formalism as

Gm = d. 共3兲

The column vector m, of length M, contains the densities of the


prisms. The data are contained in the column vector d, of length N.
The 共N ⫻ M兲 matrix G has components Gij. Figure 1. A 2D example of a feasible set in two variables x and y,
both of which are expected to be positive. The solid lines y ⱕ
The constraints of Safon et al. 共1977兲 limit the residuals r. The −0.5x + 5 and y ⱕ −3x + 15 represent some further constraints.
residual for datum i is defined as ri = Gijm j − di 共summation over The feasible set is the shaded area. The dashed lines are contours of
j兲. Writing all the constraints ri ⱕ ⑀i in matrix form gives the objective function ␾ = y − x/2.
J44 van Zon and Roy-Chowdhury

between the data and the forward modeling response. Furthermore, is best to work with as few unknowns as possible. Care should also
we set a maximum for the density difference between the back- be taken to avoid fitting noise present in the real data during inver-
ground and the anomalous body 共lithological contrast兲. This is im- sion. The aim should, instead, be to fit the data to the expected
plicit SI; the desired structural characteristics result from the fact level of noise. We propose to start to parameterize with large cells
that the solution of an LP-problem is always at a corner of the fea- and decrease the cell size until a satisfactory data fit is achieved.
sible set.
Downloaded 07/01/16 to 141.218.1.105. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/

For the parameterization test, we used an earth model containing


The L1-norm objective function to minimize for the gravity a dipping dike-shaped intrusion with a known minimum depth-to-
problem in matrix notation is the-top. There are no anomalies in this top layer so that this layer
N
does not need to be parameterized and its gravity effect can be cor-


rected for in a preinversion step.
␾= 兩Gijm j − di兩, 共6兲 The width of the 2.5D model is 2 km and the depth is 1 km. The
i=1
length of the prismatic cells along strike 共into the plane of the fig-
with implied summation over j. Gij are the coefficients of the LP ure兲 is 1 km. The true earth is made up of 40 ⫻ 20 prisms, 共each
operator as given in equation 1. The di represents the values of the with a 30 ⫻ 30-m section. 共Note that the parameterization of the
N measurements, each divided by its assumed measurement error. earth model does not fill the area for which the density distribution
The data vector d thus contains N dimensionless numbers. The is sought.兲 The data consist of 200 measurements, 2 m above the
vector m is the model vector with M unknown components. The surface, with 10-m separation between locations. The leftmost sta-
unknowns are the density contrast of each homogeneous prism tion is located 10 m to the left of the earth model. The data have
relative to the background density, assumed to be known a priori. been scaled with the assumed measurement error ␪ = 0.04 mGal to
The constraints are: obtain dimensionless numbers. These scaled data were then cor-
rupted with unit-variance, zero-mean Gaussian noise. The maxi-
0 ⱕ m ⱕ ␳max , 共7兲 mum permissible density contrast 共 ␳max兲 was set at the correct
value of 0.4 g/cm3. The earth model and the data are plotted in Fig-
with ␳max the column vector with the positive upper limits for the ure 2a.
density contrasts, assumed to be known a priori. In this study, all This noisy data set was inverted several times with different pa-
the components of ␳max are the same, i.e., 0 ⱕ m j ⱕ ␳max 共scalar兲.
The LP formalism can, in most cases, be used to optimize an
L1-norm objective function. This is done by introducing some new
variables and by adding additional constraints. For this, we intro-
duce two column vectors, y and z, each of length N, as additional
dummy variables; the LP problem is then to minimize the objective
function ␾ = 兺i=1
N
共y i + zi兲 with the constraints:

y ⱖ Gm − d,
z ⱖ d − Gm,
m,y,z ⱖ 0,
m ⱕ ␳max . 共8兲

This LP problem can be solved with the simplex method as given


by Press et al. 共1992兲.
In the following, we will first investigate the problem of choos-
ing a proper parameterization 共cell size兲. Then, we test the unregu-
larized L1-norm-based inversion of gravity data for robustness
against noise. Next, we compare the results of the inversions of the
underdetermined system of equation 3 using LP with those ob-
tained using TSVD, which is well known to be robust against noise
in the data. We will then test the robustness of the algorithm further
by perturbing the data set with several noise simulations and per-
forming the inversion of all perturbed data sets. The averages and
standard deviations of the retrieved model parameters give some
insight into the well-resolved features of the model. Finally, we
will apply our approach to a synthetic 3D situation.
Figure 2. Effect of parameterization on inversion. 共a兲 The true-
RESULTS earth model used and the dimensionless noisy data. The scale bar
denotes the density contrast in g/cm3. 共b兲–共f兲 Typical inversion re-
Choice of parameterization sults, and the respective residuals, for different cell sizes used. The
number of cells in the grid are: 共b兲 10 ⫻ 5, 共c兲 20 ⫻ 10, 共d兲 60
In real applications, the parameterization cannot describe all the ⫻ 20, 共e兲 40 ⫻ 10, and 共f兲 40 ⫻ 20. Only 共d兲 shows no apparent
details of the true 共unknown兲 earth. To make the method robust, it spatial correlation in the residuals.
Structural inversion using LP J45

rameterizations of the subsurface using prismatic cells. Some of surface. As expected, the data misfit decreases as the number of
the results are shown in Figures 2b–2f, along with the residuals. cells in the models increases.
There is a clear spatial correlation of the residuals if the parameter- The model with 400 cells 共40 ⫻ 10兲, each 50 ⫻ 100 m 共Figure
ization is too coarse. This effect is very pronounced in the case 2e兲, gives an L1-data misfit of 164, very close to the expected mis-
with only 50 共10 ⫻ 5兲 prisms each of 200- ⫻ 200-mcross section fit, while the L2-data misfit is still rather high at 256. The param-
共Figure 2b兲, but also visible in the 400 共40 ⫻ 10兲 cell model with eterization with 800 cells 共40 ⫻ 20兲, each 50 ⫻ 50 m 共Figure 2f兲,
Downloaded 07/01/16 to 141.218.1.105. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/

50- ⫻ 100-m cross sections, 共Figure 2e兲 and even in the 800 共40 produces an L1-norm data misfit of 155 indicating a slight over-
⫻ 20兲 cell model with 50- ⫻ 50-m cross sections 共Figure 2f兲. This fit 共expected optimum value: 160兲; the corresponding L2-norm data
means that the data have features that cannot be modeled with misfit is 237, indicating an underfit 共expected optimum value:
these parameterizations. We also tested three parameterizations 200兲.
with 800, 1000, and 1200 prisms, each with a 30- ⫻ 30-m section, Combining the knowledge of the expected data misfit, the ab-
respectively. These were special, because the parameterization was sence of spatial correlation of the residuals together with the crite-
perfect. The inversion of the noisy data for the first two of these rion to use as few parameters as possible, we prefer the 400-prism
共not shown兲 showed edge-effects on the right, whereas no such ef- parameterization of the subsurface, using prisms with a cross sec-
fect is visible for the case with 1200 prisms 共Figure 2d兲. The ques- tion of 50 ⫻ 100 m. The corresponding inversion result is shown
tion now arises as to which of the parameterizations reported here in Figure 2e.
is preferable. Our approach is not guided solely by the goodness of
the data fit, but also by taking the noise level into account.
For this, we start with the noiseless synthetic data, given by: Comparison with inversion using TSVD
dtheor = Gmtrue, where mtrue is the true causative body. If zero-mean, To test the robustness of inversion of noisy gravity data using
unit variance Gaussian noise n is added to this to obtain the noisy the above approach, we use the earth model shown in Figure 4d,
data d, the L2-norm of the data residuals, corresponding to a model which schematically represents two diapirs. For the inversion, we
m is use the same parameterization as the earth model, i.e., the param-
eterization fits the model perfectly. There are 200 cells in the model
N N
and only 100 measurements 共N = 100兲; the system of equations is

i=1
共di − Gijm j兲 2
= 兺
i=1
共dtheor,i + ni − Gijm j兲2 . 共9兲 thus underdetermined.
A remarkable result of our LP-based inversion scheme is that, if
the correct ␳max is used and the data are noise free, then the true
Hence, even if the true-earth model, mtrue, could be exactly recov- earth model is perfectly retrieved 共see Discussion兲. In the case of a
ered, the expected value of the L2-norm of the data fit for N data well-posed system of equations with as many unknowns as mea-
points would be surements, this could also have been achieved with the singular
value decomposition 共SVD兲 by including all singular values. 关See
N
Jackson 共1972兲 or Press et al. 共1992兲 for further discussion of
具 兺 共dtheor,i + ni − Gijmtrue,j兲2典 SVD.兴
i=1 It is well known, however, that constructing a model using sin-
N gular values that are much smaller than the largest one will lead to
= 具 兺 n2i 典 = N具n2i 典 = N共=200 in this case兲. 共10兲 unstable inversion results for real 共noisy兲 data, the variances of the
i=1 estimated parameters being proportional to the variance of the data

This is because the expected value of n2i is the variance, which is


one. In other words, this is the best one should aim for. If the data
fit for an earth model obtained is less than N, it is obvious that
noise characteristics have partly been fitted. Similarly, the expected
value of the L1-norm of the data fit is

N
具 兺 兩di − Gijm j兩典 = N具兩ni兩典. 共11兲
i=1

With the definition of the expected value 具x典 = 兰xP共x兲dx, with


P共x兲 the probability density function, we obtain

N具兩ni兩典 = N 冕 兩ni兩
1
冑2␲ e
−0.5共ni兲2
dni =
2N
冑2␲ ⬇ 160. Figure 3. Comparing L1-norm data misfit 共crosses兲 and L2-norm
data misfit 共diamonds兲 resulting from inverting a noisy data set
with 200 measurements as a function of the number of cells in the
共12兲 models. There were two models with 800 cells in this test, one with
40 ⫻ 20 cells of 30 ⫻ 30 m 共not shown兲 representing a 1200-
⫻600-m subsurface 共lower diamond兲 and the other with 40 ⫻ 20
The L2- and the L1-norms of the data misfits of several tests, in- cells of 50 ⫻ 50 m 共Figure 2f兲, representing a 2000-⫻1000-m
cluding all those shown in Figure 2, are summarized in Figure 3 as subsurface 共upper diamond兲. Note: The former parameterization
a function of the number of prisms used to parameterize the sub- fits the true model perfectly.
J46 van Zon and Roy-Chowdhury

and inversely proportional to the square of the smallest singular


value. Hence, if very small SVs are included, a little noise can dra-
matically change the outcome of the inversion. To make the inver-
sion more robust, only the larger singular values should be used to
S/N = 冑 兺i d2i
N

兺i n2i
N , 共13兲

calculate the inverse. An appropriate cutoff needs, therefore, to be


where di is the scaled ith theoretical measurement and ni is the ith
Downloaded 07/01/16 to 141.218.1.105. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/

set — usually after some experimentation.


Gaussian noise sample. For generating a noisy data set with a
Below, we will compare the robustness of the LP-method with
higher level of noise, the noise is drawn from the same distribution,
one using TSVD for an underdetermined inversion problem, but but its amplitude is multiplied.
first, some definitions and explanations. We generate synthetic The L1 model misfit 共as a percentage兲 is defined as:
Gaussian noise with zero-mean and unit variance and add this to
the 共previously scaled兲 measurement values. The signal-to-noise M
100%
ratio S/N is defined as model misfit = 兺 兩mi − mtrue,i兩,
M ␳anom i
共14兲

where mtrue is the reference 共true earth兲 model, m is the inversion


result, M is the number of cells, and ␳anom is the expected density
difference between the lithologies. This represents the average er-
ror 共as a percentage兲 associated with the inverted densities for the
cells. With this definition, if the density of most cells is either equal
to zero or ␳anom, then the model misfit is approximately equal to the
percentage of cells with the wrong density.
The cutoff value for the truncation of the singular values was
chosen as a percentage of the largest one, and all eigenvectors with
eigenvalues smaller than the cutoff value were excluded from the
inversion. For the test, this cutoff value was chosen by inverting
several synthetic data sets, corrupted with zero-mean, unit variance
Gaussian noise, each with a different cutoff for the singular values.
We selected a cutoff of 5%, which yielded the best model misfit.
The linear programming assumes that all variables are positive.
It is therefore not possible to use negative values for the measure-
ments. For this reason, we chose to change the negative gravity
anomalies, that may have resulted after adding simulated noise, to
small positive values.
The earth model as shown in Figure 4d was used in this test. In
Figure 4a, the noise-free data are shown, together with the cor-
rupted version 共noise added兲. The S/N ratio for this realization is
6.3 共or approximately 16% noise兲. Figure 4e shows the inversion
result with the linear programming method. Figures 4b and 4c
show the residuals corresponding to the inversion results with LP
and TSVD. As expected, the LP approach results in a structural
model, the visual fit to the true earth model 共Figure 4d兲 is to be put
in the context of the rather large noise.
The inversion results of both methods are quite different. The re-
sult using LP shows two bodies with roughly the correct-depth ex-
tent and a small, deep anomaly at the lower-left border of the grid.
The result using TSVD 共not shown兲 could also be interpreted as
having two anomalous bodies, but the depth extent is very poorly
resolved. TSVD concentrates the density anomaly at shallower
levels. The data residual for the LP-based inversion is much
smoother than that for the TSVD result, although the maximum
difference with the noise-free data is almost the same.
Figure 4. LP-based inversion results for a double diapir model 共d兲.
共a兲 Shows 100 noise-free data 共solid line兲 and their perturbed ver- The above test is done with one realization of a noise sequence.
sion 共crosses兲, with 16% noise added, which is actually used for A different realization would almost surely have given a different
the inversions. The 2.5D medium is parameterized with 20 ⫻ 10 result. For a statistical comparison of the two approaches, the test
cells of 100 ⫻ 100 m 共d兲 that are 2-km long in the y-direction. 共e兲 needs to be repeated with many realizations of the simulated noise.
The result of an LP-based inversion with the dimensionless residu- For this, we first generate 100 noise sequences from a normal dis-
als shown in 共b兲. 共c兲 The dimensionless residuals from an inversion
共not shown兲 using TSVD. 共f兲 The standard deviation 共SD兲 of the tribution with zero-mean and unit variance 共each sequence contain-
results of 100 LP-based inversions of the data each time perturbed ing 100 samples to fit the length of our data兲. Five noise data sets
with about 3% noise. 共with different levels of noise兲 are then obtained by multiplying
Structural inversion using LP J47

each sample of these 100 sequences, respectively, by the factors 1, from the results: the mean density of each cell and the standard de-
3, 5, 7, or 10. These five sets of noise are used to corrupt the theo- viation of the densities per cell. The results are summarized in Fig-
retically computed data. This results in five groups of data sets cor- ure 6.
rupted by different levels of noise, each containing 100 data sets to Figure 6a and 6b show the average models recovered for the two
be inverted. These groups are each categorized 共labeled兲 with their parameterizations and their theoretical responses. Figures 6c and
respective average S/N, 共S/N兲, and the corresponding standard de- 6d show the standard deviations associated with the cells of the av-
Downloaded 07/01/16 to 141.218.1.105. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/

共 冑 兲
1
viation = P 兺iP兵共S/N兲i − S/N其2 , with P being the number of erage model. In general, the result is robust against small perturba-
tions of the data; only a small number of cells have nonzero stan-
data sets used兲. Note, that the S/N are computed after the statistical
dard deviations, which means that most cells end up with the same
noise generation, directly from the actual realizations.
density after each inversion.
Each noisy data set is inverted separately, using truncated singu-
To be more precise, the interior of the body seems to be well de-
lar values and LP. Figure 5 summarizes the results from these 1000
termined from the data, except for the deepest part on the right,
inversions 共two methods, each with 100 data sets corrupted with
which is not imaged at all. All large nonzero values of standard de-
five different noise levels兲. The model misfits corresponding to the viation are found near the edges of the body and/or near the upper
inversion results are plotted against the noise level of its data set. part of the subsurface. The 800-cell model has more ambiguous
This test shows that inversion, using truncated singular values, is cells near the upper part of the model. This is a further confirma-
relatively robust against noise; the standard deviation in model tion that the choice of the 400-cell parameterization is preferable
misfit is small. However, even the models resulting from data with and validates our approach to determine the optimum cell size for
little or no noise have a poor model misfit. A noise-free data set is inversion as outlined above. Note that the two inversions from the
also inverted, with all the nonzero singular values included. This parameterization test 共Figure 2e and f兲 resemble these averaged
gives a better result, namely, with a model misfit of 16.2%. The models very well.
true model cannot be retrieved. Because the system is underdeter-
mined 共there will be SVs equal to zero兲, there is a null space. When
a model is constructed, truncation is used to minimize the amount 3D example
of null space in the solution. This has a smoothing effect on the re- Encouraged by the results reported above, we test our approach
sulting image. for a more realistic 3D situation. We use a dikelike 3D body em-
The LP performs better in this noise test. The model misfits are bedded for this purpose in a 2000 cell 共20 ⫻ 10 ⫻ 10兲 subsurface,
much better for all noise levels. The standard deviations are larger consisting of homogeneous cubic cells with sides equal to 50 m.
though. The noise-free inversion yields, as mentioned above, the The anomalous density with respect to the background is 0.4 g/
true-earth model. cm3. The results are summarized in Figure 7. The shape of the ac-
Instead of showing the results of each different noise realization, tual anomaly is outlined in all the slices in white. The intrusion
we summarize them with the average density and the associated ranges from y = 150, to y = 350, corresponding to the slices f–i in
standard deviation, SD, per cell 共SD = 冑兺iP共mi − m̄兲2 /P兲. We cal- Figure 7. Note that there is no obvious strike direction. This is evi-
culate the average density per cell for the models with the noise dent also from the dimensionless noisy data shown in Figure 7a.
level around 3% 共not shown兲. The standard deviation of these re-
sulting models is shown in Figure 4f. The LP inversions show
variations around the edges of the bodies and some at the border of
the grid. The good spatial resolution and the poor depth resolution,
inherently present for the gravitational problem, are reflected in
this figure.

Robustness against noise


Geophysical data are always contaminated with noise. One nor-
mally tries to remove 共minimize兲 coherent noise by some type of
preprocessing, but spatially or temporally incoherent noise gets in-
verted along with the true-earth response. An inversion scheme
has, therefore, to be robust with respect to noise.
We use the dike model as in the parameterization test 共Figure
2a兲. The corresponding contaminated raw data 共at 200 surface lo-
cations兲, also shown in the same figure, are used for the inversion. Figure 5. Statistical comparison of inversion results based on L1-
In order to study which parts of the inverted model are robust, we norm 共using LP兲, and L2-norm 共using SVD兲. The vertical axis
investigate if and where the inversion result is affected if the raw shows the noise level, 100%/共S/N兲, in the perturbed data to be in-
data are perturbed. Also, we investigate the effect of parameteriza- verted, the horizontal axis the model misfit 共equation 14兲. The 500
tion 共cell size兲 on the inversion of the perturbed data. LP inversions are marked with a ⫹ sign 共left clusters兲; the 500 us-
ing truncated singular values 共with the 5% cutoff兲 are marked with
In ten separate experiments we add zero-mean, unit variance a triangle. The open square 共0,0兲 is the perfect result with LP for
Gaussian noise to the raw data. These data sets are inverted for the noise-free data; the diamond 共16.2,0兲 is the SV inversion result
two best parameterizations of the subsurface, obtained from the with all nonzero SVs included in the inversion, and the filled star
earlier test 共Figure 2e and 2f兲. Finally, we derive some statistics 共30.4, 0兲 is the truncated SV result, with the cutoff at 5%.
J48 van Zon and Roy-Chowdhury

A synthetic 2D data set is generated at 800 locations on a regular Depth to the top
horizontal 40 ⫻ 20 grid. After scaling and addition of noise, the
Tests show that the top of the source body does need to be
dimensionless noisy data set is shown by means of contours in Fig-
known a priori. To illustrate this point clearly, two inversion results
ure 7a. The maximum 共scaled兲 data value is 42, and the value of the
for a buried spherical body are shown in Figure 8. The extent and
S/N turns out to be 14.1. Again, note the absence of any obvious
the parameterization of the subsurface inverted for is shown with
Downloaded 07/01/16 to 141.218.1.105. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/

symmetry of the density distribution with respect to the parameter-


the cells. In one test, the anomalous body is actually buried in the
ization of the subsurface.
middle of the parameterized area at an unknown depth. The inver-
The problem of inverting for 2000 elements — given 800 data
sion result, as shown in Figure 8b, is completely wrong. Figure 8c
points — is clearly an underdetermined one. The LP-based inver- shows the result when the top of the anomalous body coincides
sion is carried out with the maximum density contrast set at the
correct value of 0.4 g/cm3. The contoured data corresponding to
the inversion result is shown in Figure 7b. The data fit is 524,
whereas, theoretically, we would expect 684 共see earlier discussion
regarding the expected value of data fit for noisy data兲. The data
are thus slightly overfitted. This inversion takes about 14 minutes
on a Pentium 4 processor.
The inversion has placed the anomalous mass at the correct lat-
eral position in x and y, even though there is no favorable symme-
try in the geometry of the acquisition of the synthetic data. Very
little mass is placed outside the slices containing the intrusion. The
deeper part of the intrusion is, as before, not recovered properly.
Some small density perturbations can be seen in the cells near the
surface, above the lower part of the intrusion.

Figure 6. Robustness of LP-based scheme against noise using dif-


ferent parameterizations. The true model 共also shown in Figure 2a兲
is outlined in all the panels. 共a兲 The average density of each cell,
obtained from 10 inversions using perturbed data sets and the pa-
rameterization of Figure 2e; 共b兲 the same with the parameterization Figure 7. An example of a 3D inversion. The subsurface grid used
used in Figure 2f. The upper part of 共a兲 and 共b兲 shows the response for inversion consists of 2000 cubical cells, 50 ⫻ 50 ⫻ 50 m
of the average models. The gravity data, responses, and residuals each. The realistic 共dimensionless兲 data, computed at 800 stations
are dimensionless. The gray scale for the average density is shown in the x-y plane and corrupted with Gaussian noise 共S/N = 14.1兲,
on the lower left. 共c兲 and 共d兲 The standard deviations 共SD兲 of the is shown in 共a兲. The contours are drawn every three dimensionless
cell densities retrieved in the 10 inversions averaged in 共a兲 and 共b兲. units. The contoured 共dimensionless兲 data corresponding to the in-
The corresponding gray scale is plotted in the lower right. The up- version result is shown in 共b兲; 共c兲 through 共l兲 show vertical slices of
per parts of 共c兲 and 共d兲 show the dimensionless residuals of the av- the gridded model, obtained after LP-inversion, at y = 25, y
erage model responses with respect to the unknown noise-free data = 75, . . . , y = 475 m, respectively. The asymmetrical dikelike
共top兲 and the known noisy data 共bottom兲 that has been measured. source body is outlined in white in all slices.
Structural inversion using LP J49

The only tunable parameter needed in the method presented here


is the maximum density contrast, but even that is not a hard con-
straint. A possible alternative approach would require an extra pa-
rameter to be added to the cost function. This extra parameter, to be
minimized, would be smaller than the absolute boundary set by the
maximum density contrast, while larger than all other densities. In
Downloaded 07/01/16 to 141.218.1.105. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/

effect, this upper-limit parameter 共 ␳ul兲 enforces a flexible upper


bound, which is to be chosen as small as possible. In this case, the
objective function would become ␾ = ␤␳ul + 兺i=1 N
共y i + zi兲, where
␤ is a weight factor needed to balance the need to make ␳ul small,
while still fitting the data adequately. Of course, the constraints
␳ul ⱖ 0 and ␳ul ⱕ ␳max need to be added. However, in this case, a
trade-off value ␤ between data fit and importance of a small, flex-
ible upper limit must also be chosen. We have not investigated this
Figure 8. Importance of top depth in LP-based inversion. 共a兲 The further.
anomalous body and the corresponding dimensionless data 共scaled
and perturbed with noise兲. 共b兲 The inversion result when the top of Other additions to the objective function could be a term to
the body is unknown and is located in the middle of the parameter- minimize or maximize the mass of the system in a trade-off with
ized area. The outline of anomaly is plotted in white. 共c兲 The inver- the data fit. The noise test, however, shows that this type of extra
sion result with the parameterization chosen so that the top of the information is not necessary to obtain robust results.
anomalous body coincides with the top of the parameterized area.
The residuals in 共b兲 and 共c兲 are calculated with respect to the di-
mensionless noise-free data.
CONCLUSIONS
The tests indicate that our L1-norm-based inversion scheme us-
with the top of the parameterization. The nearer the top of the
ing LP yields robust structural 2.5D models from a limited amount
source body is to the top of the parameterized area, the better the
of 1D data, even in the presence of substantial noise. One 2D data
inversion result. This is to be expected, given the nature of the field
set contaminated with noise also is inverted successfully for a 3D
共gravity兲 being inverted.
model. This can be of interest in exploration settings with sharp
density contrasts. Such as salt flanks, ore bodies, dikes, etc.
DISCUSSION The noise test indicates that the LP method is less robust against
noise than the one with the truncated SVs, but its performance in
Perfect inversion results can be achieved with noise-free data, terms of model misfit is much better than that of the latter, which is
even for an 共originally兲 underdetermined set of equations, probably based on L2-norm.
for three reasons: 共1兲 the constraints on the density differences Advantages of our LP-based scheme are the absence of a
limit the model space and add enough information to make this an weighting function or other tunable parameters. Only the maxi-
overdetermined problem, 共2兲 the parameterization used could per- mum density contrast is needed a priori. The results are, however,
fectly represent the true earth, and 共3兲 the minimum of a linear cost sensitive to a proper estimation of the minimum depth to the caus-
function is always situated on a corner 共or, in a degenerate case, a ative anomalous density.
side, or a face兲 of the convex set of the allowed models in the A suitable parameterization can be found by trial and error. The
model space. The way the algorithm works prevents the search selection criterion is the expected data fit, given the noise distribu-
ending up somewhere inside the convex set. An implementation of tion and the absence of large, spatially correlated differences be-
the conjugate gradient algorithm could, on the other hand, get tween the observed and the computed data. Furthermore, the ro-
stuck in a local minimum while minimizing a quadratic cost func- bustness of the result can be assessed by perturbing the data set
tion. several times and comparing the inversion results. A suitable pa-
The need to know the top of the anomalous body does, at first, rameterization should yield robust inversion results.
appear to be a serious shortcoming of our approach. However, our
focus is on delineating the structure laterally 共sides of dikes, flanks
of salt-structures, etc.兲. Information regarding the tops of such ACKNOWLEDGMENTS
structures may often be available from other geophysical tech-
The manuscript has benefited from the comments and sugges-
niques or from geologic considerations. We conclude that the bet-
tions of the reviewers. This research was made possible by the
ter the top of the source body is known, the better the inversion re-
award of a Ph. D. position to van Zon by the Geodynamic Research
sults are.
Institute, Utrecht University, the Netherlands.
Advantages of our L1-norm-based inversion using LP include
the absence of a weighting function and many other tunable param-
eters. A weighting function is often needed to balance the influence
REFERENCES
of the individual parameters on a regularization term. This function
only depends upon the model parameters and not on the data and is Acar, R., and C. Vogel, 1994, Analysis of total variation penalty methods:
needed to account for the differences between the influences of the Inverse Problems, 10, 1217–1229.
model in the forward calculation, and thus on the data fit. If the Bear, G., H. Al-Shukri, and A. Rudman, 1995, Linear inversion of gravity
data for 3-D density distributions: Geophysics, 60, 1354–1364.
maximum density contrast is set at 共or close to兲 the true density Brown, R. F., and B. W. Brown, 1992, Finite mathematics: Ardsley House
contrast, we do not need a smoothing term. Publishers, Inc.
J50 van Zon and Roy-Chowdhury

Cuer, M., and R. Bayer, 1980, Fortran routines for linear inverse problems: Li, Y., and D. Oldenburg, 1998, 3-D inversion of gravity data: Geophysics,
Geophysics, 45, 1706–1719. 63, 109–119.
Ditmar, P., 2002, Finding the shape of a local heterogeneity by means of a Menke, M., 1984, Geophysical data analysis: Discrete inverse theory: Aca-
structural inversion with constraints: Geophysical Prospecting, 50, 209– demic Press Inc.
233. Nagy, D., 1966, The gravitational attraction of a right rectangular prism:
Esparza, F. J., and E. Gómez-Treviño, 1997, 1-D inversion of resistivity Geophysics, 31, 362–371.
and induced polarization data for the least number of layers: Geophysics, Oldenburg, D. W., and R. Ellis, 1991, Inversion of geophysical data using
62, 1724–1729. an approximate inverse mapping: Geophysical Journal International,
Downloaded 07/01/16 to 141.218.1.105. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/

Farquharson, C., and D. Oldenburg, 2003, Constructing piece-wise- 105, 325–353.


constant models using general measures in non-linear, minimum- Portniaguine, O., and M. S. Zhdanov, 1999, Focusing geophysical inver-
structure inversion algorithms: 6th International Symposium, Society of sion images: Geophysics, 64, 874–887.
Exploration Geophysicists of Japan, Expanded Abstracts, 240–243. ——–, 2002, 3-D magnetic inversion with data compression and image fo-
Guillen, A., and V. Menichetti, 1984, Gravity and magnetic inversion with cusing: Geophysics, 67, 1532–1541.
minimization of a specific functional: Geophysics, 49, 1354–1360. Press, W., S. A. Teukolsky, W. T. Vetterling, and B. Flannery, 1992, Nu-
Jackson, D. D., 1972, Interpretation of inaccurate, insufficient and inconsis- merical recipes in C. The art of scientific computing, 2nd ed.: Cambridge
tant data: Geophysical Journal of the Royal Astronomical Society, 28, University Press.
97–109. Safon, C., G. Vasseur, and M. Cuer, 1977, Some applications of linear pro-
Jupp, D., and K. Vozoff, 1975, Stable iterative methods for the inversion of gramming to the inverse gravity problem: Geophysics, 42, 1215–1229.
geophysical data: Geophysical Journal of the Royal Astronomical Soci- Tikhonov, A. N., and V. Y. Arsenin, 1977, Solutions of ill-posed problems:
ety, 42, 957–976. W. H. Winston and Sons.
Last, B., and K. Kubik, 1983, Compact gravity inversion: Geophysics, 48, Zhdanov, M., and G. Hursan, 2000, 3-D electromagnetic inversion based
713–721. on quasi-analytical approximation: Inverse Problems, 16, 1297–1322.

You might also like