
Accepted Manuscript
Solving geosteering inverse problems by stochastic Hybrid Monte Carlo method
Qiuyang Shen, Xuqing Wu, Jiefu Chen, Zhu Han, Yueqin Huang
PII: S0920-4105(17)30921-X
DOI: 10.1016/j.petrol.2017.11.031
Reference: PETROL 4447
To appear in: Journal of Petroleum Science and Engineering
Received Date: 4 August 2017

Revised Date: 2 November 2017
Accepted Date: 13 November 2017
Please cite this article as: Shen, Q., Wu, X., Chen, J., Han, Z., Huang, Y., Solving geosteering inverse
problems by stochastic Hybrid Monte Carlo method, Journal of Petroleum Science and Engineering
(2017), doi: 10.1016/j.petrol.2017.11.031.
This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to
our customers we are providing this early version of the manuscript. The manuscript will undergo
copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please
note that during the production process errors may be discovered which could affect the content, and all
legal disclaimers that apply to the journal pertain.
[Graphical abstract. Panel titles: "Faster converging speed by HMC"; "Inverse earth model by HMC with quantified uncertainty".]
Solving Geosteering Inverse Problems by Stochastic Hybrid Monte Carlo Method
Qiuyang Shen (a), Xuqing Wu (b), Jiefu Chen (a,*), Zhu Han (a), Yueqin Huang (c)
(a) Department of Electrical and Computer Engineering, University of Houston
(b) Department of Information and Logistics Technology, University of Houston
(c) Cyentech Consulting LLC
Abstract
Inverse problems arise in almost all fields of science where real-world
parameters are extracted from a set of measured data. Geosteering inversion
plays an essential role in the accurate prediction of oncoming strata and in
providing reliable guidance for adjusting the borehole position on the fly to
reach one or more geological targets. This mathematical problem is not easy to
solve: it requires finding an optimum solution in a large solution space,
especially when the problem is non-linear and non-convex. A new generation of
logging-while-drilling (LWD) tools has recently emerged on the market. These
so-called azimuthal resistivity LWD tools have azimuthal sensitivity and a
large depth of investigation. Hence, the associated inverse problems become
much more difficult, since the earth model to be inverted has more detailed
structures. Conventional deterministic methods are incapable of solving such
complicated inverse problems because they suffer from the local minimum trap.
Alternatively, stochastic optimizations are in general better at finding
globally optimal solutions and handling uncertainty quantification. In this
paper, we investigate the Hybrid Monte Carlo (HMC) based statistical inversion
approach and suggest that HMC-based inference is more efficient in dealing with
the increased complexity and uncertainty faced by geosteering problems.
Keywords: Statistical Inversion, Hybrid Monte Carlo, Geosteering, Logging While Drilling, Well Logging
* Corresponding author
Preprint submitted to Journal of Petroleum Science and Engineering November 15, 2017
1. Introduction
Obtaining reliable and detailed information about the earth's subsurface is a
great challenge. The task of inferring the interior structure is fundamental to
geological exploitation and economic evaluation [1]. Geosteering is a technique
to actively adjust the direction of drilling, often in horizontal wells, based
on real-time formation evaluation using logging-while-drilling (LWD)
measurements [2]. This process enables drillers to efficiently reach the target
zone and actively respond while drilling to geological changes in the formation
so that they can maintain maximal reservoir contact [3]. The real-time
adjustment of a geosteering process is determined by the current position and
depth of the tool to minimize gas or oil breakthrough and maximize economic
production [4]. Among the several technologies that can be used for
geosteering, such as nuclear, acoustic, gamma ray, or electromagnetic
measurements, azimuthal resistivity LWD tools are widely used worldwide due to
their relatively large depth of detection and azimuthal directional sensitivity
[5]. Compared to conventional LWD propagation resistivity tools, the azimuthal
directional LWD tool can obtain more information related to the formation
anisotropy in addition to the resistivity measurements [6, 7]. Hence, it
provides a more sensitive detection of formation boundaries. In the
while-drilling environment, this capability can be very useful for making
directional drilling decisions.

[Figure 1 schematic: antennas arranged along the tool axis (Z) in the order T5, T3, T1, R3, R1, R2, R4, T2, T4, T6; X is the transverse direction.]
Figure 1: The structure and schematic of an azimuthal resistivity LWD tool. T1, T2, T3,
and T4 are transmitters whose moments are aligned with the tool axis (Z direction), while T5
and T6 are transverse antennas that are perpendicular to the tool axis (X direction). Similarly,
R1 and R2 are receivers directed along the tool axis. R3 and R4 are receiver antennas
with azimuthal sensitivity.
An azimuthal resistivity logging tool consists of a set of antennas with
different polarizations and working frequencies. They act as transmitters and
receivers, often in pairs. Each pair of antennas provides a group of
electromagnetic signals while the tool is drilling [8]. A schematic diagram of
an azimuthal resistivity LWD tool is shown in Figure 1. It has transmitters and
receivers both along the tool axis (Z direction) and perpendicular to it (X
direction). While drilling, the transmitters are energized and the voltage
changes are recorded by the receivers. Processing algorithms in the electronic
hardware reduce the measured signals from the receiver antennas to the full
mutual inductance tensor, which is related to the resistivity tensor of the
geological formation occupied by the tool [9]. These inductance tensors, with
working frequencies ranging from kilohertz to megahertz, are used to generate
different types of measurement curves. The curve responses are collected as
measurement data and serve as input to the geosteering inversion process.
The geosteering process relies on real-time reconstruction of the subsurface
earth model. Since oil and gas companies are dedicated to developing and
providing services that combine in-field drilling with real-time data
processing [10], geosteering inversion plays a pivotal role in evaluating the
measurements from the deep-reading tool and presenting the reconstructed earth
model in real time. Hence, a robust and efficient inversion algorithm is
necessary to meet the job requirements. In practice, modeling and inversion are
applied to a 1D model, in which the interfaces between layers are assumed to be
infinitely extended and parallel to each other. A group of inverse results
consisting of the distances to boundaries and the resistivity of each layer is
collected to represent the earth model with respect to the current tool
position. Along the drilling trajectory, inversion is conducted at fixed or
varying distance intervals. Thus, a complete subsurface profile can be drawn by
grouping the parameters of the inverse models together.
As the new generation of deep electromagnetic LWD tools extends the depth of
investigation to around 100 ft or more from the wellbore [11], the earth model
becomes more complicated and requires more descriptive parameters. As a result,
the corresponding inverse problem is more challenging due to the sparsity and
uncertainty of the measurements as well as incomplete knowledge of the
operating circumstances [12]. Gradient-based deterministic inversion, which
minimizes the distance between the observation and the forward response,
suffers from the local minimum problem due to non-convexity. From the Bayesian
point of view, the optimal solution for the inverse parameters can be extracted
from the statistical characteristics of the posterior distribution [13]. When
an analytical solution for the posterior is not available, the Markov Chain
Monte Carlo (MCMC) method is one of the most popular approaches for drawing
samples from an unknown distribution [14]. The MCMC method guarantees an
asymptotically exact solution for recovering the posterior distribution, though
the computational cost is inevitably high and most MCMC algorithms suffer from
a low acceptance rate and slow convergence with long burn-in periods [15].
Solving geophysical inverse problems by statistical methods is not new:
Bayesian inference has been applied to the interpretation of sonic logging
measurements [16]. However, the poor acceptance rate and long burn-in time are
unacceptable when applying this scheme to geosteering inversion. In this
article, Hybrid Monte Carlo (HMC) is used to solve the geosteering inverse
problem. HMC [17] unites MCMC and molecular dynamics and provides a way of
sampling from the canonical density by simulating the dynamics of a physical
system. Many experiments show that HMC improves sampling performance by
avoiding the random walk behavior of traditional MCMC [18].
In the following sections, we first give a brief review of the background of
the geosteering problem and its statistical formulation. Then we discuss the
HMC method and its implementation for the geosteering inverse problem in
detail. Next, a discussion of large-scale inverse problems using multiple
chains is presented. Finally, we verify the performance of the HMC method with
a synthetic model and demonstrate its advantages in solving statistical
geosteering inverse problems.
2. Geosteering Inversions
2.1. Deterministic Inversions
The most commonly used strategy in deterministic optimization is to fit the
model function to the measured data by minimizing an error term between the
forward model responses and the observations. Assume that N measurements are
collected, denoted by $m \in \mathbb{R}^N$, and let $x \in \mathbb{R}^M$
represent the M earth model parameters to be inverted from the measured data.
A computational model function, or so-called forward function,
$S: \mathbb{R}^M \to \mathbb{R}^N$ is designed to synthesize N responses from M
model parameters. The forward transformation from the model parameters to the
responses is calculated based on 1D electromagnetic modeling, which essentially
solves Maxwell's equations with magnetic dipole sources in layered media
[19, 20]. The objective of the inverse problem is to infer the model parameters
from the observed measurements. Good agreement between the response of the
forward model and the measured data will be reached if the inverted parameters
of the physical model are accurate. The difference between the forward response
and the measurements is defined as the data misfit $F(x)$:

$$F(x) = S(x) - m \tag{1}$$

Since both forward responses and measurements are vectors, a cost function is
defined as the squared L2 norm of the misfit function $F(x)$:

$$f(x) = \sum_{i=1}^{N} F_i^2(x) = \|F(x)\|_2^2 \tag{2}$$

where $f(x)$ is the cost function representing the magnitude of the data
misfit. Hence, the inverse problem is to find the optimal model parameters x
that minimize the cost function given a forward model function and
measurements. Mathematically, this problem is written as:

$$\min_{x \in \mathbb{R}^M} f(x) \tag{3}$$

This is an unconstrained nonlinear least-squares minimization problem. Many
iterative numerical algorithms, such as the gradient descent method, the
Gauss-Newton method, and the Levenberg-Marquardt algorithm (LMA) [21, 22], are
well established for solving this least-squares problem.
Although most of the aforementioned optimization algorithms are robust and have
been applied to other geophysical inverse problems [23], solving a non-convex
optimization problem requires prior knowledge to make a feasible initial guess.
Otherwise, the algorithm may only find a local optimum, especially when the
inverse problem is nonlinear and ill-posed.
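The dependence on the initial guess can be illustrated with a minimal sketch of equations (1) to (3). The forward model below is a hypothetical one-parameter toy, not the 1D electromagnetic solver used in the paper, and the names `forward`, `cost` and `gradient_descent` are illustrative assumptions. Plain gradient descent with a finite-difference gradient ends at a different solution depending on where it starts.

```python
import numpy as np

def forward(x):
    # Hypothetical toy forward model S: R^1 -> R^1 (NOT an electromagnetic solver).
    return np.array([x[0] ** 2])

def data_misfit(x, m):
    # Equation (1): F(x) = S(x) - m
    return forward(x) - m

def cost(x, m):
    # Equation (2): f(x) = ||F(x)||_2^2
    F = data_misfit(x, m)
    return float(F @ F)

def numerical_gradient(x, m, h=1e-6):
    # Central finite differences; an assumption made to keep the sketch short.
    grad = np.zeros_like(x)
    for i in range(x.size):
        step = np.zeros_like(x)
        step[i] = h
        grad[i] = (cost(x + step, m) - cost(x - step, m)) / (2 * h)
    return grad

def gradient_descent(x0, m, lr=0.01, n_iter=500):
    # Equation (3) attacked with fixed-step gradient descent.
    x = np.array(x0, dtype=float)
    for _ in range(n_iter):
        x = x - lr * numerical_gradient(x, m)
    return x

m = np.array([1.0])                    # synthetic measurement
x_pos = gradient_descent([2.0], m)     # starts on the positive side
x_neg = gradient_descent([-2.0], m)    # starts on the negative side
```

Both runs drive the data misfit to zero, yet they return different model parameters, which is the kind of ambiguity a deterministic method cannot resolve without prior knowledge.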
2.2. Statistical Inversions

Statistical inversion arises as an alternative approach for dealing with many
ill-posed scientific inverse problems. Built upon Bayes' theorem, it can be
summarized as a method that obtains the posterior distribution, from which the
solution is deduced, by combining the likelihood and the prior:

$$p(x|y) \sim p(y|x) \cdot p(x) \tag{4}$$

The assumptions made by the forward model $y = f(x)$ (y is the data and x
denotes the earth model parameters) may not include all factors that affect the
measurements. Supposing the noise is additive and comes from external sources,
the relationship between the observed outputs $\tilde{y}$ and the corresponding
model parameters can be represented as:

$$\tilde{y} = f(x) + \varepsilon \tag{5}$$

where $\varepsilon$ denotes the additive noise. Experiments empirically suggest
that the additive noise usually follows a zero-mean Gaussian distribution:
$\varepsilon \sim \mathcal{N}(0, \sigma^2 I)$. Given model parameters x and
observed data $\tilde{y}$, the likelihood can be deduced as

$$p(\tilde{y}|x) = p(\varepsilon) \sim \mathcal{N}(0, \sigma^2 I) \tag{6}$$

Suppose the prior distribution of x is a zero-mean isotropic Gaussian,
$p(x) \sim \mathcal{N}(0, \beta^2 I)$. By virtue of Bayes' equation (4), the
posterior distribution of x is given by

$$p(x|\tilde{y}) \sim \mathcal{N}(0, \sigma^2 I)\,\mathcal{N}(0, \beta^2 I) \tag{7}$$
It suggests that the posterior distribution of the model parameters x given
observations $\tilde{y}$ can be obtained by calculating the product of two
Gaussian distributions. The solution for x can be sampled and estimated
according to the probability density function $p(x|\tilde{y})$. This is an
effective way to overcome the shortcomings of deterministic inversion,
especially when the problem is underdetermined (ill-posed) because of the large
parameter space and the sparsity of the measurements. Compared to deterministic
methods, which only look for a local optimal solution, the statistical approach
removes this limitation by exploring the entire distribution of the earth model
parameters. These parameters are determined by sampling from the posterior
distribution $p(x|\tilde{y})$ once the measurements $\tilde{y}$ have been
acquired.
Drawing samples from the posterior distribution $p(x|\tilde{y})$ is never easy
when f is a non-linear mapping between x and y. The MCMC method draws samples
by constructing a Markov chain that has the desired distribution as its
equilibrium distribution. Statistical properties of the samples, such as the
mean and variance, serve as the final inverse result and the corresponding
uncertainty of the inverse parameters.
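As a rough illustration of equations (4) to (7), the unnormalized log posterior can be written as the sum of a Gaussian log likelihood on the residual and a Gaussian log prior. The sketch below assumes a hypothetical two-parameter toy forward model; `forward`, `sigma` and `beta` values are illustrative choices, not settings reported in the paper.

```python
import numpy as np

def forward(x):
    # Hypothetical toy forward model f(x); stands in for the EM solver.
    return np.array([x[0] + x[1], x[0] * x[1]])

def log_likelihood(x, y_obs, sigma):
    # Gaussian noise model: residual y_obs - f(x) ~ N(0, sigma^2 I), constants dropped.
    r = y_obs - forward(x)
    return -0.5 * float(r @ r) / sigma**2

def log_prior(x, beta):
    # Zero-mean isotropic Gaussian prior N(0, beta^2 I), constants dropped.
    return -0.5 * float(x @ x) / beta**2

def log_posterior(x, y_obs, sigma, beta):
    # Log of equation (4): likelihood times prior, up to an additive constant.
    return log_likelihood(x, y_obs, sigma) + log_prior(x, beta)

x_true = np.array([1.0, 2.0])
y_obs = forward(x_true)   # noise-free synthetic data, for illustration only
lp_true = log_posterior(x_true, y_obs, sigma=0.1, beta=10.0)
lp_off = log_posterior(np.array([2.0, 2.0]), y_obs, sigma=0.1, beta=10.0)
```

Working in log space avoids underflow when the residual is large relative to sigma, which is why samplers are usually written against the log density rather than the density itself.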
2.3. Random Walk MCMC

The Metropolis-Hastings (MH) algorithm, first developed by Metropolis and then
generalized by Hastings [24], is the most popular MCMC method. In brief, an MH
step with invariant distribution $p(x)$ and proposal distribution (or jumping
function) $q(x^*|x)$ involves sampling a candidate value $x^*$ given the
current value x according to $q(x^*|x)$. The Markov chain then moves to $x^*$
with acceptance probability

$$A(x, x^*) = \min\left\{1, \frac{p(x^*)\,q(x|x^*)}{p(x)\,q(x^*|x)}\right\}, \tag{8}$$

otherwise it remains at x. In the random walk MH algorithm, a zero-mean normal
distribution is a popular choice for $q(x^*|x)$ as a symmetric
candidate-generating function, which reduces the acceptance ratio to
$p(x^*)/p(x)$. Algorithm 1 presents the MH algorithm for sampling from the
posterior distribution:
Algorithm 1 The Random Walk MCMC Method
Input: initial $x^{(0)}$, jumping function $q(x^{(i)}|x^{(j)})$, max chain length K
Output: $x^{(k)}$, where $k < K$
Initialize with arbitrary value $x^{(0)}$
while $k \le K$ do
    Generate $x^{(k)}$ from $q(x^{(k)}|x^{(k-1)})$
    $A(x^{(k)}, x^{(k-1)}) = \min\left\{1, \frac{p(x^{(k)}|\tilde{y})}{p(x^{(k-1)}|\tilde{y})}\right\}$
    Generate $A_0$ from uniform distribution $\mathcal{U}(0, 1)$
    if $A_0 < A(x^{(k)}, x^{(k-1)})$ then
        keep $x^{(k)}$
    else
        $x^{(k)} = x^{(k-1)}$
    end if
    save $x^{(k)}$ in the chain
end while
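The loop of Algorithm 1 can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions: the target is a toy standard normal rather than a geosteering posterior, the proposal is an isotropic Gaussian step of hypothetical width `step`, and the acceptance test is done in log space.

```python
import numpy as np

def random_walk_metropolis(log_target, x0, n_samples, step=0.5, seed=0):
    # log_target returns the log of the (unnormalized) target density.
    rng = np.random.default_rng(seed)
    x = np.array(x0, dtype=float)
    logp = log_target(x)
    chain = np.empty((n_samples, x.size))
    n_accept = 0
    for k in range(n_samples):
        # Symmetric Gaussian proposal centered on the current state.
        x_new = x + step * rng.standard_normal(x.size)
        logp_new = log_target(x_new)
        # Accept with probability min(1, p(x_new) / p(x)).
        if np.log(rng.uniform()) < logp_new - logp:
            x, logp = x_new, logp_new
            n_accept += 1
        chain[k] = x   # a rejected proposal repeats the current state
    return chain, n_accept / n_samples

def log_target(x):
    # Toy target: standard normal in each dimension (an assumption).
    return -0.5 * float(x @ x)

chain, acc = random_walk_metropolis(log_target, x0=[3.0, -3.0],
                                    n_samples=5000, step=1.0)
```

Only density ratios enter the acceptance test, so the normalizing constant of the posterior never has to be computed.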
The MH algorithm is attractive since the random exploration of the posterior
distribution requires very little parameter tuning. However, although the
random walk MCMC method guarantees asymptotically exact recovery of the
posterior distribution as the number of posterior samples grows, it may suffer
from slow convergence and an extremely long burn-in period before reaching
equilibrium. The time cost may be prohibitively high for the inverse problem
because the forward model must be recomputed for every sample drawn by MH,
which may take at least O(N) operations per sample [25]. Even worse, as the
dimension of the model parameters increases, the burn-in period grows
exponentially.
3. Hybrid Monte Carlo
3.1. Hamiltonian Dynamics
The HMC, or so-called Hamiltonian Monte Carlo, replaces the proposal function
of the random walk with the simulation of a dynamic process. The idea
originated from observing the motion of molecules and was well described using
Hamiltonian dynamics in the landmark paper by Duane et al. [17]. Generalized by
Neal in 1993 [26], HMC converges more quickly to the absolute probability
distribution.
To interpret the HMC algorithm, we first introduce Hamiltonian dynamics with
two sets of real variables q and p. As a physical interpretation, the vector
$q = \{q_1, q_2, \ldots, q_n\}$ stands for the positions of n molecules on a
frictionless surface of varying height. The vector
$p = \{p_1, p_2, \ldots, p_n\}$ represents their current momenta. The
corresponding energy functions, $U(q)$ and $K(p)$, have physical meaning as the
potential energy and the kinetic energy, respectively. Suppose we want to
sample from the canonical distribution for the states of the molecules; our
interest is then to find the probability distribution of the molecules'
positions. According to the Boltzmann distribution, this canonical distribution
$P(q)$ is related to the potential energy $U(q)$ by

$$P(q) = \frac{1}{\tilde{Z}} \exp(-U(q)) \tag{9}$$

where $\tilde{Z}$ is the normalizing constant. Equation (9) bridges the views
of energy and statistical probability for a given state of the object. In a
real-world problem, the vector q is the set of unknown parameters whose
distribution we want to find. For each real variable $q_i$, there exists an
associated variable $p_i$. Leaving aside its physical meaning, the variable
$p_i$ is introduced independently so that the problem can be posed as a
dynamical problem. p is usually chosen to follow a Gaussian distribution with
zero mean and unit variance. The kinetic energy function is thereby defined as

$$K(p) = \frac{1}{2} \sum_{i=1}^{n} p_i^2 \tag{10}$$
Then the Hamiltonian function is defined as the total energy function of the
variables q and p:

$$H(q, p) = U(q) + K(p) \tag{11}$$

It describes the dynamical system from the perspective of total energy, whose
partial derivatives determine how q and p change over continuous time t
according to Hamilton's equations:

$$\frac{dq_i}{dt} = \frac{\partial H}{\partial p_i}, \qquad \frac{dp_i}{dt} = -\frac{\partial H}{\partial q_i} \tag{12}$$
The equations above form the foundation of the HMC process. From the definition
of the Hamiltonian function, several properties follow that make it suitable
for use within MCMC. First, Hamiltonian dynamics is reversible: the mapping
from the state at time t, $(q(t), p(t))$, to the state at time t + s,
$(q(t+s), p(t+s))$, is one-to-one and hence has an inverse. The reversibility
of the dynamics guarantees that the desired distribution remains invariant
during sampling. The second property is conservation: the Hamiltonian function
remains invariant over time, which can be easily shown as

$$\frac{dH}{dt} = \sum_{i=1}^{n} \left[ \frac{dq_i}{dt} \frac{\partial H}{\partial q_i} + \frac{dp_i}{dt} \frac{\partial H}{\partial p_i} \right] = 0 \tag{13}$$
The invariance of the Hamiltonian reveals an equilibrium state of the Markov
chain, or certain acceptance for Metropolis updates that use a proposal found
by Hamiltonian dynamics. It also ensures that HMC samples from an invariant
distribution. Another property of Hamiltonian dynamics is that it preserves
volume in the (q, p) space, a result known as Liouville's theorem. A brief
explanation is that if we apply the time mapping T to the points in some region
R of the (q, p) space with volume V, the image of R under T will also have
volume V [27]. This property guarantees that the acceptance probability for
Metropolis updates is not affected by any change in volume.
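The conservation property in equation (13) follows by substituting Hamilton's equations (12) into the chain rule. The cancellation, written out as a short worked step that is not spelled out in the original text, reads:

```latex
\frac{dH}{dt}
  = \sum_{i=1}^{n}\left[\frac{dq_i}{dt}\frac{\partial H}{\partial q_i}
      + \frac{dp_i}{dt}\frac{\partial H}{\partial p_i}\right]
  = \sum_{i=1}^{n}\left[\frac{\partial H}{\partial p_i}\frac{\partial H}{\partial q_i}
      - \frac{\partial H}{\partial q_i}\frac{\partial H}{\partial p_i}\right]
  = 0
```

Each term in the sum cancels with its partner, so the total energy is constant along any exact trajectory. A discretized trajectory only conserves it approximately, which is why a Metropolis accept/reject step is still needed in practice.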
In order to sample the state variables $q_i$ and build a distribution that
reflects their statistical properties, the continuous-time Hamilton's equations
must be approximated by discretizing time with step $\varepsilon$. A commonly
used scheme, the leapfrog method, simulates the state after a period of time dT
by alternately updating p and q in half steps of $\varepsilon$, which reduces
the error introduced by the discretization. A single step that updates the
momentum and position, p and q, is

$$
\begin{aligned}
p^{(k)}(t + \varepsilon/2) &= p^{(k)}(t) - \frac{\varepsilon}{2} \cdot \frac{\partial}{\partial q^{(k)}} U(q^{(k)}(t)) \\
q^{(k)}(t + \varepsilon) &= q^{(k)}(t) + \varepsilon \cdot p^{(k)}(t + \varepsilon/2) \\
p^{(k)}(t + \varepsilon) &= p^{(k)}(t + \varepsilon/2) - \frac{\varepsilon}{2} \cdot \frac{\partial}{\partial q^{(k)}} U(q^{(k)}(t + \varepsilon))
\end{aligned}
\tag{14}
$$

It starts with a half-step update of the momentum variable p, followed by a
full-step update of the target variable q, and finally another half step for p.
After $dT/\varepsilon$ such updates, the system moves to a new state. The
discretized leapfrog process realizes a full update of a sample q, governed by
Hamiltonian dynamics. The updated value of the auxiliary variable p is
discarded, since p is drawn randomly at the beginning of each leapfrog process.
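A minimal sketch of the leapfrog update in equation (14) is given below, assuming a unit-mass kinetic energy as in equation (10). The quadratic potential is a toy chosen so that the energy error and the reversibility of the scheme are easy to check; the function and variable names are illustrative.

```python
import numpy as np

def leapfrog(q, p, grad_U, eps, n_steps):
    # Simulates n_steps leapfrog steps of size eps, following equation (14).
    q = np.array(q, dtype=float)
    p = np.array(p, dtype=float)
    p = p - 0.5 * eps * grad_U(q)        # initial half step for the momentum
    for i in range(n_steps):
        q = q + eps * p                  # full step for the position
        if i < n_steps - 1:
            p = p - eps * grad_U(q)      # two adjacent half steps merged into one
    p = p - 0.5 * eps * grad_U(q)        # final half step for the momentum
    return q, p

def U(q):
    return 0.5 * float(q @ q)            # toy potential (assumption)

def grad_U(q):
    return q                             # gradient of the toy potential

def H(q, p):
    return U(q) + 0.5 * float(p @ p)     # equation (11) with K from equation (10)

q0, p0 = np.array([1.0]), np.array([0.0])
q1, p1 = leapfrog(q0, p0, grad_U, eps=0.1, n_steps=20)
# Reversibility check: negate the momentum and integrate again.
q_back, p_back = leapfrog(q1, -p1, grad_U, eps=0.1, n_steps=20)
```

The total energy is conserved only approximately, with an error that shrinks as the stepsize is reduced, while negating the momentum and integrating again returns to the starting point up to rounding.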
3.2. HMC method

We now present a complete Hamiltonian Monte Carlo algorithm in Algorithm 2,
which is used to sample from continuous distributions. Each iteration of the
HMC algorithm has two main parts. In the first part, new values of the momentum
variables p are drawn randomly from their Gaussian distribution. Starting from
the current state (q, p), the simulation of Hamiltonian dynamics by the
leapfrog method moves (q, p) to a proposed state $(q^*, p^*)$. In the second
part, a Metropolis update is executed: the proposed state is accepted as the
next state of the Markov chain with a probability of the same form as equation
(8). If the proposed state is rejected, the current q is kept for the next
iteration.
Solving a geosteering inverse problem with HMC is similar to the random walk
MCMC approach. Recalling the statistical inversion introduced previously, we
denote the model parameters and observations by q and d. The posterior
distribution of the earth model parameters can then be represented by the prior
knowledge and likelihood function as:

$$p(q|d) \sim \mathcal{N}(d - f(q), \sigma^2 I)\,\mathcal{N}(0, \beta^2 I) \tag{15}$$
Algorithm 2 Hybrid Monte Carlo Method
Input: initial $q^{(0)}$, max chain length K
Output: $q^{(k)}$, where $k < K$
Initialize with arbitrary value $q^{(0)}$, stepsize $\varepsilon$, and leapfrog steps l
while $k \le K$ do
    Randomly generate $p^{(k)}$ from $\mathcal{N}(0, 1)$
    $q_0^{(k+1)} = q^{(k)}$
    $p_0^{(k+1)} = p^{(k)} - \frac{\varepsilon}{2} \cdot \frac{\partial}{\partial q^{(k)}} U(q^{(k)})$
    for i = 1 to l do
        $q_i^{(k+1)} = q_{i-1}^{(k+1)} + \varepsilon \cdot p_{i-1}^{(k+1)}$
        $p_i^{(k+1)} = p_{i-1}^{(k+1)} - \varepsilon \cdot \frac{\partial}{\partial q_i^{(k+1)}} U(q_i^{(k+1)})$
    end for
    $A(q^{(k+1)}, q^{(k)}) = \min\left\{1, \exp\left[U(q^{(k)}) - U(q^{(k+1)}) + K(p^{(k)}) - K(p^{(k+1)})\right]\right\}$
    Generate $A_0$ from uniform distribution $\mathcal{U}(0, 1)$
    if $A_0 < A(q^{(k+1)}, q^{(k)})$ then
        keep $q^{(k+1)}$
    else
        $q^{(k+1)} = q^{(k)}$
    end if
    save $q^{(k+1)}$ in the chain
end while
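The structure of Algorithm 2 can be sketched as follows. This is an illustrative implementation under stated assumptions, not the authors' code: the target is a toy standard normal, the parameter names are hypothetical, and, unlike the loop as printed in Algorithm 2, each trajectory closes with the final half-step momentum update of equation (14).

```python
import numpy as np

def hmc_sample(U, grad_U, q0, n_samples, eps=0.1, n_leapfrog=20, seed=0):
    rng = np.random.default_rng(seed)
    q = np.array(q0, dtype=float)
    chain = np.empty((n_samples, q.size))
    n_accept = 0
    for k in range(n_samples):
        p = rng.standard_normal(q.size)          # fresh momentum from N(0, I)
        q_new, p_new = q.copy(), p.copy()
        p_new -= 0.5 * eps * grad_U(q_new)       # initial half step
        for i in range(n_leapfrog):
            q_new += eps * p_new                 # full position step
            if i < n_leapfrog - 1:
                p_new -= eps * grad_U(q_new)     # full momentum step
        p_new -= 0.5 * eps * grad_U(q_new)       # final half step
        h_old = U(q) + 0.5 * p @ p               # H at the current state
        h_new = U(q_new) + 0.5 * p_new @ p_new   # H at the proposed state
        # Metropolis test: accept with probability min(1, exp(h_old - h_new)).
        if np.log(rng.uniform()) < h_old - h_new:
            q = q_new
            n_accept += 1
        chain[k] = q
    return chain, n_accept / n_samples

def U(q):
    return 0.5 * float(q @ q)    # toy potential: standard normal target

def grad_U(q):
    return q

chain, acc = hmc_sample(U, grad_U, q0=[3.0, -3.0], n_samples=2000)
```

For a geosteering posterior, `U` and `grad_U` would be replaced by the potential of equation (17) and its gradient, which requires the forward model and its Jacobian.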
According to equation (9), the corresponding potential energy U(q) is

$$U(q) = -\log(\tilde{Z} \cdot p(q|d)) \tag{16}$$

where the constant $\tilde{Z}$ is set to 1. If we rewrite the posterior
distribution as the product of the prior and likelihood functions, then

$$U(q) = -\log\left(\mathcal{N}(d - f(q), \sigma^2 I)\,\mathcal{N}(0, \beta^2 I)\right). \tag{17}$$

Following Algorithm 2, a Markov chain is launched to draw samples of the model
parameters q until the chain reaches equilibrium.
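For reference, equation (17) can be expanded explicitly. The block below assumes that $\mathcal{N}(d - f(q), \sigma^2 I)$ denotes a zero-mean Gaussian density evaluated at the residual $d - f(q)$, and introduces $J(q) = \partial f / \partial q$ for the Jacobian of the forward function; both are notational assumptions rather than definitions taken from the text.

```latex
U(q) = \frac{1}{2\sigma^2}\,\|d - f(q)\|_2^2 + \frac{1}{2\beta^2}\,\|q\|_2^2 + \text{const},
\qquad
\nabla U(q) = -\frac{1}{\sigma^2}\,J(q)^{\top}\bigl(d - f(q)\bigr) + \frac{1}{\beta^2}\,q
```

The additive constant drops out of both the leapfrog updates and the acceptance ratio, so only the data misfit term, the prior term, and the Jacobian are needed at each step.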
3.3. HMC Implementation

There are a few concerns about the implementation of HMC for geosteering
inversion. The first is the gradient of the potential function U(q). The
computational cost of the Jacobian matrix of the forward function f(q) is
inevitably high compared to random walk samplers. However, thanks to the
gradient-drift property, by which the proposal is drifted toward regions with a
higher probability of acceptance, HMC has a much faster convergence rate than
random walk exploration. Also, the average acceptance rate of HMC is around
0.65, while that of the random walk is around 0.23 [27]. The higher acceptance
rate leads to faster exploration of the canonical distribution and its
statistical properties. Meanwhile, the random mechanism inside the Hamiltonian
dynamics differentiates HMC from deterministic inversion and helps avoid local
optima. As with other MCMC methods, choosing a suitable stepsize and number of
leapfrog steps is crucial when implementing HMC. Together, the stepsize
$\varepsilon$ and the number of leapfrog steps L (denoted l in Algorithm 2)
determine the length of the trajectory in fictitious time, $\varepsilon L$. A
large stepsize may result in unstable Hamiltonian dynamics due to errors
introduced into the system by the leapfrog discretization. Too small a stepsize
will produce little change in the state, waste computation time, and cause slow
exploration of the target distribution [27]. Selecting the number of leapfrog
steps is also subtle, since the distance to move within a simulation of the
Hamiltonian dynamics may differ from one state to another. In practice, a
preliminary run of HMC is often required to determine a suitable choice of
stepsize and trajectory length. An effective strategy is to select a range for
each of these two parameters based on the preliminary run, and to randomly
choose a pair of values within these ranges for each simulation of the
Hamiltonian dynamics. The randomness helps increase the overall chance of fast
exploration of the target distribution.
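The randomized selection described above can be sketched as a small helper that draws a fresh stepsize and number of leapfrog steps before each trajectory. The ranges below are placeholders, not values reported in the paper; in practice they would come from the preliminary run.

```python
import numpy as np

def draw_hmc_settings(rng, eps_range=(0.05, 0.15), steps_range=(10, 30)):
    # Stepsize drawn uniformly from [eps_min, eps_max).
    eps = rng.uniform(eps_range[0], eps_range[1])
    # Number of leapfrog steps drawn uniformly from the inclusive integer range.
    n_steps = int(rng.integers(steps_range[0], steps_range[1] + 1))
    return eps, n_steps

rng = np.random.default_rng(0)
settings = [draw_hmc_settings(rng) for _ in range(1000)]
```

Each pair would be passed to one leapfrog simulation, so consecutive trajectories have different lengths in fictitious time.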
Samples from a single chain tend to be unduly influenced by slow-moving
realizations of the iterative simulation, whereas multiple starting points can
weaken this strong correlation [28]. As the dimension of the sampling space
increases, the multiple-chain sampling strategy has proven helpful in weakening
the correlation between samples and thus improving the chance of convergence
[29]. Multiple-chain HMC is implemented in this paper to solve the geosteering
inverse problem. The target distribution and model parameters are obtained by
taking between-sequence as well as within-sequence samples. The multiple-chain
sampling scheme is well suited to parallel computing [30]. A simple distributed
MCMC strategy is built on the parallelization of multiple chains, which
distributes the data and tasks of HMC sampling to multiple processing units. A
multiple-point earth model inversion can be deployed to multiple cluster nodes.
Within each task, multiple-chain sampling for one point is launched and run in
parallel on multiple processors or cores. Parallel implementation of HMC
sampling can have a large positive impact on developing computationally
efficient methods for solving real-time statistical inverse problems.
4. Convergence and Model Inversions
In this section, we demonstrate the application of the HMC method to the
geosteering inverse problem and present the advantages of HMC over traditional
inversion methods. The examples are based on a synthetic dataset generated by a
simulated azimuthal resistivity LWD tool. In its configuration, multiple
working frequencies are used, from 1 kHz at the lowest to 2 MHz at the highest.
The corresponding transmitter-receiver spacing ranges from 800 inches to 30
inches. The assumed depth of investigation (DOI) is up to 100 ft from the
wellbore using curves with a working frequency of 1 kHz. In recent years, many
new-generation LWD tools with such deep-reading ability have emerged on the oil
and gas market. The simulated tool is designed and verified according to the
most recent industrial developments [10]. A synthetic earth structure is
constructed from models with three or more layers. The performance of the HMC
method is evaluated through several aspects, including the convergence plot,
uncertainty analysis, and full logging model inversion.

The first example examines convergence. The metric used to evaluate the
performance of geosteering inversion is usually defined as the misfit between
the true value and the inverted value. There are two possible comparisons,
defined as data misfit and model misfit. In practical geosteering work, the
true earth model parameters are never available, so comparing the data misfit
is the only way to evaluate inversion performance. However, a local minimum
caused by non-convexity can yield a very small data misfit even when the
inverted model is tremendously different from the real one. Hence in our
example, we use the model misfit to represent the performance of each method,
since the true model parameters are synthetically constructed by ourselves. To
compare the random walk sampler with the sampler governed by Hamiltonian
dynamics, two instances are launched separately: the first uses the random walk
MCMC sampling method, and the second uses the HMC sampling method. The goal is
to sample from a three-layer model with five parameters: the resistivity of
each layer and the distances from the tool to the two layer boundaries. We use
the model misfit, the L2 norm of the difference between the true solution and
the inverse result, as an indicator of performance and chain convergence.
[Figure 2 plot: model misfit (vertical axis, 0 to 150) versus number of samples (horizontal axis, logarithmic, 10^0 to 10^4) for the HMC sampling chain and the MCMC sampling chain.]
Figure 2: Convergence of model misfit with the increasing number of samples. The upper
MCMC chain converges after 2000 samples, while the lower HMC chain converges faster,
within 30 samples, with a higher acceptance rate.
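The model misfit and the summary statistics used as the inverse result can be computed with a few lines of NumPy. The sketch below uses a stand-in chain of Gaussian samples around a hypothetical five-parameter model (three resistivities and two distances to boundary); the distance values, the noise level and the burn-in length are illustrative assumptions, not results from the paper.

```python
import numpy as np

def model_misfit(x_true, x_est):
    # L2 norm of the difference between true and inverted parameters.
    return float(np.linalg.norm(np.asarray(x_est) - np.asarray(x_true)))

def summarize_chain(chain, burn_in):
    # Discard burn-in samples, then report per-parameter mean and standard deviation.
    kept = np.asarray(chain)[burn_in:]
    return kept.mean(axis=0), kept.std(axis=0)

# Hypothetical model: resistivities (ohm-m) and distances to boundaries (ft).
x_true = np.array([10.0, 50.0, 1.0, 5.0, 8.0])
rng = np.random.default_rng(0)
# Stand-in for sampler output; a real chain would come from MCMC or HMC.
chain = x_true + 0.1 * rng.standard_normal((1000, 5))
mean, std = summarize_chain(chain, burn_in=100)
misfit = model_misfit(x_true, mean)
```

The mean plays the role of the inverted model and the standard deviation that of its uncertainty; tracking `model_misfit` along the chain gives a convergence curve of the kind shown in Figure 2.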
In Figure 2, the chain governed by HMC enters an equilibrium state after a
burn-in period of fewer than 30 iterations, and the model misfit is close
to zero, which means the samples are drawn from the target distribution
successfully. In contrast, the samples drawn by random walk MCMC are still
outside the canonical distribution and have a much higher model misfit. The
misfit of the MCMC samples is higher than that of the HMC samples along the
entire chain length. In other words, random moves of the parameters explore
the parameter space very inefficiently given a limited chain length. Our
experiment indicates that the model misfit of the random walk chain
converges to around 0 only after 2000 samples. This test confirms that
statistical inversion by the HMC sampling method is more efficient than by
the random walk MCMC method. HMC is able to draw samples precisely from the
canonical distributions of the model parameters. The other prominent
improvement of HMC is its much higher acceptance rate, which makes sampling
of the canonical distributions effective. Random walk MCMC suffers from a
low acceptance rate, as shown in Figure 2, where the upper chain holds the
same sample value for many iterations. This behavior indicates that
randomness is the only force that changes the parameters, whereas in the
HMC method the gradient-driven drift helps to explore the canonical
distribution faster. This explains why random walk MCMC needs a much longer
chain to reach the precision that HMC achieves.

Figure 3: The uncertainty evaluation of the inverse model. (a) The
synthetic three-layer earth model, where the logging tool is drilling
horizontally in the middle high-resistive layer. (b) The inverse model with
the uncertainty of the two boundaries. Inverse results for larger distances
to boundary have higher uncertainty.
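The difference in acceptance rate between random-walk and gradient-guided
proposals can be illustrated with a small sketch. It is not the
implementation used in the experiments: the target is a five-dimensional
standard normal distribution standing in for the geosteering posterior, and
the step sizes, chain length and function names are all assumptions chosen
for illustration.

```python
import math
import random


def potential(q):
    """U(q) = -log p(q) up to a constant, for a standard normal toy target."""
    return 0.5 * sum(x * x for x in q)


def grad_potential(q):
    """Gradient of U(q); for the standard normal it is q itself."""
    return list(q)


def accept(log_ratio, rng):
    """Metropolis test: accept with probability min(1, exp(log_ratio))."""
    return rng.random() < math.exp(min(0.0, log_ratio))


def random_walk_step(q, step, rng):
    """Propose a random Gaussian perturbation of the current state."""
    proposal = [x + step * rng.gauss(0.0, 1.0) for x in q]
    if accept(potential(q) - potential(proposal), rng):
        return proposal, True
    return q, False


def hmc_step(q, eps, n_leapfrog, rng):
    """Propose by simulating Hamiltonian dynamics with the leapfrog scheme."""
    p = [rng.gauss(0.0, 1.0) for _ in q]
    h_old = potential(q) + 0.5 * sum(x * x for x in p)
    q_new = list(q)
    # Half step for momentum, then alternate full position/momentum steps.
    p = [pi - 0.5 * eps * gi for pi, gi in zip(p, grad_potential(q_new))]
    for i in range(n_leapfrog):
        q_new = [qi + eps * pi for qi, pi in zip(q_new, p)]
        scale = eps if i < n_leapfrog - 1 else 0.5 * eps  # final half step
        p = [pi - scale * gi for pi, gi in zip(p, grad_potential(q_new))]
    h_new = potential(q_new) + 0.5 * sum(x * x for x in p)
    if accept(h_old - h_new, rng):
        return q_new, True
    return q, False


def acceptance_rate(step_fn, q0, n_samples):
    """Run a chain and return the fraction of accepted proposals."""
    q, accepted = list(q0), 0
    for _ in range(n_samples):
        q, ok = step_fn(q)
        accepted += ok
    return accepted / n_samples


rng = random.Random(0)
q0 = [3.0] * 5  # five parameters, started away from the mode
rw_rate = acceptance_rate(lambda q: random_walk_step(q, 1.0, rng), q0, 2000)
hmc_rate = acceptance_rate(lambda q: hmc_step(q, 0.2, 10, rng), q0, 2000)
print(f"random walk acceptance: {rw_rate:.2f}, HMC acceptance: {hmc_rate:.2f}")
```

With these settings the leapfrog trajectory nearly conserves the
Hamiltonian, so almost every HMC proposal is accepted, while the random
walk rejects a large share of its proposals and repeats the previous
sample.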
In the second example, a synthetic earth model is reconstructed by applying
the HMC statistical inversion to 40 logging points in a 2000 ft horizontal
well. The synthetic earth model, shown in Figure 3(a), is a three-layer
model whose resistivity is 10 ohm-m, 50 ohm-m and 1 ohm-m from top to
bottom, respectively. The central dashed line indicates the tool's
navigation trajectory. In this case, we assume the tool's relative dip
angle is fixed at 90 degrees and the drilling trajectory is horizontal. The
distance to the upper and lower boundaries varies with position: the
farthest boundary is up to 70 ft away and the closest one is 3 ft away.
In Figure 3(b), we show the inversion result of the HMC method with the
uncertainty of the depth of the layer boundaries. The model inverted by HMC
agrees well with the true earth model. Beyond this, the error bars given by
the HMC method on the two boundaries denote the uncertainty of the
inversion results. They represent the variance of the samples collected
from the sampled distribution. In this case, the uncertainty becomes larger
as the tool moves farther from the boundary, which is consistent with
drilling engineers' experience in practice. Overall, the statistical HMC
inversion is able to reconstruct the earth model and provides information
about the inversion uncertainty, which is meaningful for real-time
geosteering operations and helps keep the drilling tool from breaking out.
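A minimal sketch of how such error bars could be derived from a sample
chain is given here. It assumes that the bar is the sample standard
deviation of each parameter after burn-in (the text refers to the variance
of the collected samples). The function name `boundary_uncertainty` and the
synthetic chain are hypothetical.

```python
import statistics


def boundary_uncertainty(samples, burn_in):
    """Return per-parameter mean and standard deviation after burn-in."""
    kept = samples[burn_in:]
    if len(kept) < 2:
        raise ValueError("need at least two samples after burn-in")
    columns = list(zip(*kept))
    means = [statistics.mean(c) for c in columns]
    stds = [statistics.stdev(c) for c in columns]
    return means, stds


# Hypothetical chain for one logging point: distance to a near boundary (ft)
# and to a far boundary (ft). The far boundary is given a wider spread.
chain = [
    [5.0 + 0.1 * ((k % 5) - 2), 40.0 + 2.0 * ((k % 5) - 2)]
    for k in range(200)
]
means, stds = boundary_uncertainty(chain, burn_in=100)
print(means, stds)
```

Repeating this at every logging point would give one mean and one error bar
per boundary along the well path.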
In the last example, we use a more complicated five-layer model to verify
the capability of HMC inversion, compared with deterministic inversion.
Figure 4(a) presents the true model, where the tool drills from the top
layer to the bottom layer along the dashed line. The formation resistivity
alternates between low and high. Figure 4(b) presents the inversion results
of a deterministic method based on the Levenberg-Marquardt algorithm. In
general, deterministic inversions rely heavily on the initial guess. The
outcome is unsatisfactory, and many results are recovered incorrectly
because the problem is non-convex. We obtain a hybrid result, shown in
Figure 4(c). The HMC method is launched on a multiple-point inverse model,
and we collect the drawn samples after 150 iterations. We take the mean of
100 samples after this burn-in period as the intermediate result. The final
inverse result is then refined by the Levenberg-Marquardt algorithm. The
final result shows that the HMC method is able to find the distribution of
the model parameters, and that the deterministic refinement helps recover
the model precisely. In some parts of the region, however, the farther
boundaries are not detected because of the limited sensitivity of our
current synthetic tool, so the five-layer model is replaced by a
three-layer one, which assumes that the farther layers across two beds have
the same resistivity as the farthest detectable one. Overall, the
performance of HMC inversion is very promising.

Figure 4: Convergence of model misfit. (a) The true five-layer model.
(b) The inverse model by Levenberg-Marquardt algorithm. (c) The inverse
model by HMC method, refined by LMA.
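The hybrid workflow (burn-in, averaging of 100 samples, then
Levenberg-Marquardt refinement) can be sketched on a toy problem. Everything
specific here is an assumption for illustration: the one-parameter forward
model exp(-x t) replaces the electromagnetic tool response, the chain is a
deterministic stand-in for HMC output, and the damping update is one common
variant of the Levenberg-Marquardt scheme.

```python
import math


def forward(x, t):
    """Toy nonlinear forward model (an assumption, not the EM tool response)."""
    return math.exp(-x * t)


def cost(x, t_obs, d_obs):
    """Sum of squared residuals between predicted and observed data."""
    return sum((forward(x, t) - d) ** 2 for t, d in zip(t_obs, d_obs))


def lm_refine(x, t_obs, d_obs, n_iter=50, lam=1e-2):
    """Scalar Levenberg-Marquardt iteration with adaptive damping."""
    for _ in range(n_iter):
        residual = [forward(x, t) - d for t, d in zip(t_obs, d_obs)]
        jac = [-t * forward(x, t) for t in t_obs]
        jtj = sum(j * j for j in jac)
        jtr = sum(j * r for j, r in zip(jac, residual))
        trial = x - jtr / (jtj + lam)
        if cost(trial, t_obs, d_obs) < cost(x, t_obs, d_obs):
            x, lam = trial, lam * 0.5  # accept step, relax damping
        else:
            lam *= 10.0  # reject step, increase damping
    return x


x_true = 0.7
t_obs = [0.5, 1.0, 1.5, 2.0, 2.5, 3.0]
d_obs = [forward(x_true, t) for t in t_obs]

# Stand-in for an HMC chain: drifts from 2.0 to about 0.7 during burn-in.
chain = [2.0 - 1.3 * min(1.0, k / 150) + 0.05 * math.sin(k) for k in range(250)]
burn_in, n_keep = 150, 100
intermediate = sum(chain[burn_in:burn_in + n_keep]) / n_keep
refined = lm_refine(intermediate, t_obs, d_obs)
print(intermediate, refined)
```

Because the sample mean already lies close to the solution, the
deterministic refinement starts from a good initial guess, which is the
situation in which Levenberg-Marquardt iterations tend to converge
reliably.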
5. Conclusion
Experimental results with our synthetic earth models show that statistical
inversion by the HMC method can invert the model parameters and reconstruct
the earth structure accurately despite high uncertainty. Since more
advanced measuring technologies have been commercialized and brought to
market in recent years, the increasing complexity of the measured data will
undoubtedly require a more effective and accurate solution to these inverse
problems. Meanwhile, more complicated earth models with five, seven or more
layers pose great challenges to the conventional deterministic inversion
scheme. Stochastic strategies will undoubtedly become promising players on
the stage of inversion. The proposed HMC method is innovative and more
accurate than traditional MCMC sampling methods, and can meet the
requirements of this fast-growing industry.
6. Acknowledgement
This material is based upon work supported by the U.S. Department of
Energy, Office of Science, and Advanced Scientific Computing Research,
under Award Number DE-SC0017033.
References
[1] M. S. Zhdanov, Inverse theory and applications in geophysics, Vol. 36,
Elsevier, 2015.
[2] Q. Li, D. Omeragic, L. Chou, L. Yang, K. Duong, New directional
electromagnetic tool for proactive geosteering and accurate formation
evaluation while drilling, in: SPWLA 46th Annual Logging Symposium,
Society of Petrophysicists and Well-Log Analysts, 2005.
[3] M. Moorkamp, N. Linde, A. Khan, Integrated imaging of the earth:
Theory and applications, Vol. 218, John Wiley & Sons, 2016.
[4] W. Lesso Jr, S. Kashikar, The principles and procedures of geosteering,
in: SPE/IADC Drilling Conference, Society of Petroleum Engineers, 1996.
[5] M. S. Bittar, Electromagnetic wave resistivity tool having a tilted
antenna for determining the horizontal and vertical resistivities and
relative dip angle in anisotropic earth formations, Google Patents, U.S.
Patent 6,163,155 (Dec. 19 2000).
[6] M. S. Bittar, J. D. Klein, B. Randy, G. Hu, M. Wu, J. L. Pitcher,
C. Golla, G. D. Althoff, M. Sitka, V. Minosyan, A new azimuthal
deep-reading resistivity tool for geosteering and advanced formation
evaluation, SPE Reservoir Evaluation & Engineering 12 (02) (2009) 270–279.
[7] S. Li, J. Chen, T. L. Binford Jr, Using new LWD measurements to
evaluate formation resistivity anisotropy at any dip angle, in: SPWLA 55th
Annual Logging Symposium, Society of Petrophysicists and Well-Log Analysts,
2014.
[8] J. Chen, Y. Yu, An improved complex image theory for fast 3D
resistivity modeling and its application to geosteering in unparallel
layers, in: SPE Annual Technical Conference and Exhibition, Society of
Petroleum Engineers, 2014.
[9] J. Chen, Y. Huang, T. L. Binford Jr, X. Wu, Managing uncertainty in
large-scale inversions for the oil and gas industry with big data, in:
Guide to Big Data Applications, Springer, 2018, pp. 149–173.
[10] Ø. Bø, J.-M. Denichou, U. Ezioba, E. Mirto, J. Donley, J. Telford,
C. Dupuis, L. Pontarelli, G. Skinner, M. Viandante, Reservoir mapping while
drilling, Oilfield Review 27 (1) (2015) 38–47.
[11] R. Beer, L. C. T. Dias, A. M. V. da Cunha, M. R. Coutinho, G. H.
Schmitt, J. Seydoux, C. Morriss, E. Legendre, J. Yang, Q. Li, Geosteering
and/or reservoir characterization the prowess of new-generation LWD tools,
in: SPWLA 51st Annual Logging Symposium, Society of Petrophysicists and
Well-Log Analysts, 2010.
[12] K. Key, 1D inversion of multicomponent, multifrequency marine CSEM
data: Methodology and synthetic studies for resolving thin resistive
layers, Geophysics 74 (2) (2009) F9–F20.
[13] A. M. Stuart, Inverse problems: a Bayesian perspective, Acta Numerica
19 (2010) 451–559.
[14] J. Sanz-Serna, Markov chain Monte Carlo and numerical differential
equations, in: Current challenges in stability issues for numerical
differential equations, Springer, 2014, pp. 39–88.
[15] W. Neiswanger, C. Wang, E. Xing, Asymptotically exact, embarrassingly
parallel MCMC, arXiv:1311.4780 (2013).
[16] S. Huang, Q. Yang, P. J. Matuszyk, C. Torres-Verdín, High-resolution
interpretation of sonic logging measurements using stochastic inversion
with spatial slowness sensitivity functions, in: SEG Technical Program
Expanded Abstracts 2013, Society of Exploration Geophysicists, 2013,
pp. 524–528.
[17] S. Duane, A. D. Kennedy, B. J. Pendleton, D. Roweth, Hybrid Monte
Carlo, Physics Letters B 195 (2) (1987) 216–222.
[18] Q. Shen, H. Lu, X. Wu, X. Fu, J. Chen, Z. Han, Y. Huang, Statistical
geosteering inversion by Hamiltonian dynamics Monte Carlo method, in: SEG
Technical Program Expanded Abstracts 2017, Society of Exploration
Geophysicists, 2017, pp. 900–904.
[19] M. Huang, L. C. Shen, Computation of induction logs in multiple-layer
dipping formation, IEEE Transactions on Geoscience and Remote Sensing
27 (3) (1989) 259–267.
[20] L. Zhong, J. Li, A. Bhardwaj, L. C. Shen, R. C. Liu, Computation of
triaxial induction logging tools in layered anisotropic dipping formations,
IEEE Transactions on Geoscience and Remote Sensing 46 (4) (2008) 1148–1163.
[21] K. Levenberg, A method for the solution of certain non-linear problems
in least squares, Quarterly of Applied Mathematics 2 (2) (1944) 164–168.
[22] D. W. Marquardt, An algorithm for least-squares estimation of
nonlinear parameters, Journal of the Society for Industrial and Applied
Mathematics 11 (2) (1963) 431–441.
[23] S. Huang, C. Torres-Verdín, Inversion-based interpretation of borehole
sonic measurements using semianalytical spatial sensitivity functions,
Geophysics 81 (2) (2016) D111–D124.
[24] W. K. Hastings, Monte Carlo sampling methods using Markov chains and
their applications, Biometrika 57 (1) (1970) 97–109.
[25] T. Bui-Thanh, O. Ghattas, A scaled stochastic Newton algorithm for
Markov chain Monte Carlo simulations, SIAM Journal on Uncertainty
Quantification (2012) 1–25.
[26] R. M. Neal, Probabilistic inference using Markov chain Monte Carlo
methods, Tech. rep., Department of Computer Science, University of Toronto,
Toronto, Ontario, Canada (1993).
[27] R. M. Neal, MCMC using Hamiltonian dynamics, Vol. 2, CRC Press
New York, NY, 2011.
[28] A. Gelman, D. B. Rubin, Inference from iterative simulation using
multiple sequences, Statistical Science (1992) 457–472.
[29] Q. Shen, X. Wu, J. Chen, Z. Han, Distributed Markov chain Monte Carlo
method on big-data platform for large-scale geosteering inversion using
directional electromagnetic well logging measurements, Applied
Computational Electromagnetics Society Journal 32 (5) (2017) 405–412.
[30] L. Murray, Distributed Markov chain Monte Carlo, in: Proceedings of
Neural Information Processing Systems workshop on learning on cores,
clusters and clouds, Vol. 11, 2010.
Highlights
• A stochastic hybrid Monte Carlo method is proposed for geosteering inverse problems.
• The stochastic HMC increases the sampling efficiency compared to other statistical methods.
• The HMC method is capable of searching for the global solution of earth model parameters.