Professional Documents
Culture Documents
www.elsevier.com/locate/amc
Abstract
In this paper, we investigate an one-dimensional inverse problem in diffusion based
optical tomography using iteratively regularized Gauss–Newton (IRGN) algorithm for
ill-posed nonlinear problems. We devise a stable reconstruction algorithm for the in-
verse problem using iterative regularization with Armijo–Goldstein–Wolf (AGW) type
line search strategy. We demonstrate the efficacy of the IRGN combined with AGW by
reconstructing the scattering parameter relevant to the inverse problem in optical
tomography.
Ó 2003 Elsevier Inc. All rights reserved.
1. Introduction
In the last decade research in the area of biomedical optics has flourished. In
particular there has been considerable new development in biomedical imaging
q
This work is supported by NSF grant (award number DMS-0207050).
*
Corresponding author.
E-mail addresses: khan@clemson.edu (T. Khan), matabs@suez.cs.gsu.edu (A. Smirnova).
0096-3003/$ - see front matter Ó 2003 Elsevier Inc. All rights reserved.
doi:10.1016/j.amc.2003.12.019
150 T. Khan, A. Smirnova / Appl. Math. Comput. 161 (2005) 149–170
S n1 ½0; T
Þ and f 2 L1 ðX
Let a, b 2 L1 ðXÞ with a > b > 0, u 2 H 1 ðX
n1
S ½0; T
Þ. Then the RTE is given by
1 ou
ðx; s; tÞ þ s ruðx; s; tÞ þ aðxÞuðx; s; tÞ
c ot Z
bðxÞ Hðs s0 Þuðx; s0 ; tÞ ds0 ¼ f ðx; s; tÞ ð1Þ
S n1
uðx; s; tÞ ¼ 0 on C ; ð3Þ
Now, we can define the forward problem as: given sources fj in X and q in Q,
a vector of model parameters, for example the coefficient of diffusion D and the
T
coefficient of absorption la (i.e. q ¼ ðD; la Þ ) that belongs to a parameter set Q,
find the data u on oX and the inverse problem as: given data z on oX find q. We
can recast the forward problem in an abstract setting as the following
parameter dependent equation:
r DðxÞruðx; qÞ þ la ðxÞuðx; qÞ ¼ f ðxÞ; ð14Þ
where x is in X, uðqÞ is in an appropriate abstract space H , and f represents a
source or a forcing distribution. In general, measurement of uðqÞ may not be
possible, only some observable part CuðqÞ of the actual state uðqÞ may be
measured. In this abstract setting, the objective of the inverse or parameter
estimation problem is to choose a parameter q in Q, that minimizes an error
criterion or cost functional J ðuðqÞ; CuðqÞ; qÞ over all possible q in Q e Q
subject to uðqÞ satisfying the diffusion approximation. A typical observation
operator is
m m
ou 1
Cf uðqÞ ¼ D ðxi ; q; f Þ ¼ uðxi ; q; f Þ ; ð15Þ
om i¼1 2 i¼1
–6
x 10
7
0
0.15 0.2 0.25 0.3 0.35 0.4
parameter value q
3. System approximation
The general analytic method for solving ODEs or PDEs containing a delta
distribution source is the GreenÕs function method. However for complex
geometries, the analytic solution is intractable. Therefore one requires
numerical solutions. The FEM is more versatile than other methods including
the finite difference method because of its ease in complex geometries and
modelling boundary effects. The FEM is a variational method used to
approximate the solution by a family of finite dimensional basis functions.
Then the forward problem is reduced to one of linear algebra and one com-
putes the approximate solution using FEM codes.
156 T. Khan, A. Smirnova / Appl. Math. Comput. 161 (2005) 149–170
If we let X ¼ ½0; L
be the 1D domain under consideration with surface
oX ¼ f0; Lg, the weak formulation of the problem corresponding to Eq. (11) is
to find uðxÞ 2 H 1 ðXÞ such that for all vðxÞ 2 H 1 ðXÞ, the following variational
equation is satisfied
Z Z
1 1
rv Dru dX þ la vu dX þ vð0Þuð0Þ þ vðLÞuðLÞ
X X 2 2
Z
¼ vf dX: ð21Þ
X
The FEM is derived by projecting the weak form of (21) onto a finite
dimensional function space. The finite dimensional function space is the set of
continuous and twice differentiable, piecewise cubic polynomials on subinter-
vals of the domain X. We approximate u with the functions
X
N p þ1
where
x x
i
Bi ðxÞ ¼ B
h
with partitions
x1 < x0 ¼ 0 < x1 < < xNp 1 < xNp ¼ L < xNp þ1 ;
and stepsize
h ¼ xi xi1 for 0 6 i < Np þ 1;
and BðxÞ is the cubic B-spline. By using the test functions v ¼ Bj on the weak
form of the PDE in (21), we obtain the Galerkin approximation for all
1 6 j 6 Np þ 1
Z Z
1 1
rBj DruNp dX þ la Bj uNp dX þ Bj ð0ÞuNp ð0Þ þ Bj ðLÞuNp ðLÞ
X X 2 2
Z
¼ Bj f dX: ð23Þ
X
From the above we obtain a system of equations for cfi where the coefficients
of the system equations are given by
ij þ Mij þ Pij ;
Aij ¼ D ð24Þ
where the diffusion inner product
Z
ij ¼
D rBi DrBj dX; ð25Þ
X
T. Khan, A. Smirnova / Appl. Math. Comput. 161 (2005) 149–170 157
This linear system then can be solved using any one of the number of
techniques.
3
computed solution
2.5
1.5
0.5
0
0 10 20 30 40 50
3
exact solution
2.5
1.5
0.5
0
0 10 20 30 40 50
0.6
0.4
0.2
0
0 10 20 30 40 50
–3
x 10
1
0.5
absolute error
–0.5
1
0 10 20 30 40 50
2.5
1.5
0.5
0
0 10 20 30 40 50
jxxs j2
Fig. 6. Constant values of D and la , f ¼ pffiffiffiffiffiffi
A ffi
2p2
e 22 .
A jxx j2
s
f ¼ pffiffiffiffiffiffiffiffiffi e 22 ; ð30Þ
2p2
with A ¼ 1 and ¼ 0:1 where xs ¼ 42:5 is the source location (see Fig. 8). Here
this particular f is an approximation to the dðx xs Þ source for which the
GreenÕs function solution is available (see Fig. 6 for approximate solution and
compare it to analytic solution in Fig. 7. The absolute error is given in Fig. 9)
Fig. 13 illustrates numerical simulation with nonconstant D and la (see Figs. 10
and 11), and f as in (30) with A ¼ 1, ¼ 0:1, and xs ¼ 42:5 (Fig. 12).
In the last section, we outlined the numerical method for the forward
problem. In this section, we will describe the computational method for the
2.5
1.5
0.5
0
0 10 20 30 40 50
jxxs j2
Fig. 7. Constant values of D and la , f ¼ pffiffiffiffiffiffi
A ffi
2p2
e 22 .
160 T. Khan, A. Smirnova / Appl. Math. Comput. 161 (2005) 149–170
2.5
1.5
0.5
0
0 10 20 30 40 50
jxxs j2
Fig. 8. Constant values of D and la , f ¼ pffiffiffiffiffiffi
A ffi
e 22 .
2p2
Absolute error
0.05
0.05
0.1
0.15
0 10 20 30 40 50
jxxs j2
Fig. 9. Constant values of D and la , f ¼ pffiffiffiffiffiffi
A ffi
2p2
e 22 .
11
10
6
0 10 20 30 40 50
jxxs j2
A ffi
Fig. 10. Nonconstant values of D and la , f ¼ pffiffiffiffiffiffi
2p2
e 22 .
T. Khan, A. Smirnova / Appl. Math. Comput. 161 (2005) 149–170 161
0.5
0.4
0.3
0 10 20 30 40 50
jxxs j2
A ffi
Fig. 11. Nonconstant values of D and la , f ¼ pffiffiffiffiffiffi
2p2
e 22 .
2.5
1.5
0.5
0
0 10 20 30 40 50
jxxs j2
A ffi
Fig. 12. Nonconstant values of D and la , f ¼ pffiffiffiffiffiffi
2p2
e 22 .
1.5
0.5
0
0 10 20 30 40 50
jxxs j2
A ffi
Fig. 13. Nonconstant values of D and la , f ¼ pffiffiffiffiffiffi
2p2
e 22 .
162 T. Khan, A. Smirnova / Appl. Math. Comput. 161 (2005) 149–170
X
N
la ðxÞ ¼ ^ k ðxÞ;
ek B ð32Þ
k¼1
where B^ k are the basis functions may be on a different set of grid points than
the grid points for the basis functions Bk . The model parameter vector q is
approximated by
T
qN ¼ ðd1 ; . . . ; dN ; e1 ; . . . ; eN Þ ; ð33Þ
where qN is to be identified from the minimization of the cost functional Jk
m
2
1X ms X
fj Np N f
2
Jk qN ¼
Ci u q zi j
þ kkqN ^qN k ; ð34Þ
2 j¼1 i¼1
where T ½qN
¼ CuNp ðqN Þ and k is the regularization parameter.
Let u ¼ v0 þ v where u solves Eqs. (11) and (12) and subtracting Eqs. (37)
and (38) from Eqs. (11) and (12), we obtain ignoring second-order terms in h1 ,
h2 , and v
where v is the solution to Eq. (39). The adjoint of this linearized equation (39)
can be derived using the GreenÕs formula
Z
ð r Drv þ la vÞw vðr Drw þ la w Þ dX ð42Þ
X
Z !
ov
ow
¼ D v
w ds ð43Þ
oX om om
Now we will derive the discrete sensitivity relations which will be used for
the Jacobian calculation for our numerical scheme below. If we plug in the
expression for DðxÞ and lðxÞ Eqs. (31) and (32) into the equations for the
system matrices D and M Eqs. (25) and (26) respectively, we get
X
N
ij ¼
D k;ij ;
dk D ð47Þ
k¼1
164 T. Khan, A. Smirnova / Appl. Math. Comput. 161 (2005) 149–170
where
Z
k;ij ¼
D ^ k rBi rBj dX;
B ð48Þ
X
and similarly
X
N
Mij ¼ ek Mk;ij ; ð49Þ
k¼1
where
Z
Mk;ij ¼ ^ k Bi Bj dX:
B ð50Þ
X
Now we describe our numerical scheme for the inverse problem. The inverse
problem in Eq. (34) can be casted as a finite dimensional nonlinear minimi-
zation problem mainly
m
2 1
1X ms X
fj Np N f
2 1Xmtot
min J qN ¼
Ci u q zi j
¼ kR qN k ¼ ri2 qN
N
q 2R 2N 2 j¼1 i¼1 2 2 i¼1
T
with zero or nonzero residual, where, RðqN Þ ¼ ðr1 ðqN Þ; r2 ðqN Þ; . . . ; rmtot ðqN ÞÞ
f f
with mtot ¼ m ms , rn ðqN Þ ¼ Ci j uNp ðqN Þ zi j with n ¼ i þ ðj 1Þm, and the
N
norm kRk is called the residual at q . Let H ðqN Þ ¼ K ðqN ÞK ðqN Þ, where K ðqN Þ
N
is the Jacobian matrix of Rðq Þ,
orn ðqN Þ
K qN ¼ ; n ¼ 1; . . . ; mtot ; k ¼ 1; . . . ; N : ð54Þ
oqk
T. Khan, A. Smirnova / Appl. Math. Comput. 161 (2005) 149–170 165
where
¼ 1 B1 ðxi Þ; B0 ðxi Þ; . . . ; BN ðxi Þ; BN þ1 ðxi Þ T ;
B ð59Þ
2 p p
and Cfj is given by Eq. (29). Therefore using the sensitivity equations (52) and
(53) for the expression oCfj =oqk in Eq. (58) we can compute the Jacobian
matrix KqN for our inverse calculation. One can also use the adjoint formu-
lation to derive this Jacobian matrix which can be beneficial in two and three
dimensions. However in one dimension speed of the forward sensitivity rela-
tions is adequate to compute the Jacobian of the inverse problem.
Given the Jacobian of the inverse problem, we can define the IRGN method
as follows:
qNkþ1 ¼ qNk ck ½K qNk K qNk þ kk I
1 K qNk R qNk þ kk ðqNk ^qN Þ ;
The reader may consult [1,4,16] for the convergence analysis of IRGN under
different types of conditions on the initial guess q0 . The best convergence rate
was achieved when we used the iteratively regularized analog of a line search
procedure suggested in [22] for the well-posed case. Namely, a backtracking
strategy with ck ¼ 1; 1=2; 1=4, until the following two requirements were
simultaneously fulfilled:
Jkk ðqkþ1 Þ 6 Jkk ðqk Þ þ dchrJkk ðqk Þ; sk i ð60Þ
166 T. Khan, A. Smirnova / Appl. Math. Comput. 161 (2005) 149–170
which is the Wolfe type strategy. Here sk is the solution to ðK ðqk ÞKðqk Þ þ
kk IÞsk ¼ ðK ðqk ÞRðqk Þ þ kk qk Þ. Our code uses the parameters d ¼ 0:0001 and
q ¼ 0:99 as in [13].
The optimization of the functional Jk was performed using the IRGN iter-
ative method described above. After 20 iterations using equally spaced grid
points with Np ¼ 40 and N ¼ 20 with k0 ¼ 1:0E 6 with the choice of cubic B-
splines satisfying Rubin boundary condition for the solution and Neumann
boundary condition for the parameter, we reconstructed the D profile for a set
of 10 sources. We modelled the sources as in Eq. (30) with xs ¼ 0:1, 0.45, 0.8,
1.15, 1.5, 41.0, 41.3750, 41.75, 41.1250, 42.5. The reconstructed profile is de-
picted in Fig. 14. The line corresponds to true D, the circles correspond to the
optimal profile for the space of N -dimensional cubic B-spline basis functions
for D, and the stars correspond to the constructed D using IRGN.
Since the problem is exponentially ill-posed the proper choice of the initial
value of kk was extremely important for the numerical simulations. In Figs. 15–
18 the dependence of the accuracy of the reconstructed solutions on the choice
of k0 is illustrated. For a posteriori stopping of IRGN iterations the generalized
discrepancy principle (see [4]) was implemented, i.e. the iterations were stopped
at the first index ^k, for which the residual kRðq^Nk Þk is less than or equal to sr,
s > 1:
1.2
1.1
1
0.9
0.8
0.7
0.6
0.5 exaact D
0.4 ****** computed D
oooo optimal D
initial D
0.3
0.2
0 10 20 30 40 50
1.2
0.8
D
0.6
exact D
0.4 computed D
initial D
0.2
0 10 20 30 40 50
1.2
0.8
D
0.6
exact D
0.4 computed D
initial D
0.2
0 10 20 30 40 50
1.2
0.8
D
0.6
exact D
0.4 computed D
initial D
0.2
0 10 20 30 40 50
1.2
0.8
D
0.6
exact D
0.4 computed D
initial D
0.2
0 10 20 30 40 50
Table 1
Effect of regularization parameter
k0 IRGN iteration Inner iterations Relative error Residual
1.0E)1 10 56 0.12677 7.5060E)4
1.0E)2 10 56 0.11652 4.2413E)4
1.0E)3 10 56 0.09710 2.3323E)4
1.0E)4 10 36 0.08179 1.10069E)4
1.0E)5 4 11 0.07483 3.5921E)5
1.0E)6 6 28 0.07024 4.5446E)5
1.0E)7 1 1 div div
In Table 1 one can see the summary of measured accuracy and work done
for values of k0 in the range from 1.0E)4 to 1.0E)7. For every k0 IRGN
approximations were calculated until either (62) was satisfied with sr ¼ 1:0E)4
or the number of iterations k became greater than 10. One can notice that the
required accuracy of the residual was attained for two values of k0 :1.0E)5 and
1.0E)6. For k0 ¼ 1:0E)4 the residual is very close to 1.0E)4 but still greater.
For other regularizing sequences kk , for example kk ¼ k0 =k n , 0:5 6 n, and
kk ¼ k0 = expðkÞ the optimal values of the parameters can also be found, how-
ever when kk ¼ k0 =k the accuracy of reconstruction is higher.
5. Conclusion
1.2 1.2
1.1 1.1
1 1
0.9 0.9
0.8 0.8
0.7 0.7
0.6 0.6
0.5 exact D 0.5 exact D
0.4 ****** computed D 0.4 ****** computed D
oooo optimal D oooo optimal D
0.3 initial D 0.3 initial D
0.2 0.2
(a) 0 10 20 30 40 50 (b) 0 10 20 30 40 50
Fig. 19. D reconstructed using (a) IRGN with AGW algorithm, (b) Levenberg–Marquardt trust
region algorithm.
References
[1] A.B. Bakushinsky, Iterative methods for nonlinear operator equations without regularity, new
approach, Dokl. Russian Acad. Sci. 330 (1993) 282–284.
[2] S.R. Arridge, Optical tomography in medical imaging: topical review, Inverse Problems 15
(1999) R41–R93.
[3] S.R. Arridge, J.C. Hebden, Optical imaging in medicine: 2. Modelling and reconstruction,
Phys. Med. Biol. 42 (1997) 841–853.
[4] B. Blaschke, A. Neubauer, O. Scherzer, On convergence rates for the iteratively regularized
Gauss–Newton method, IMA J. Num. Anal. 17 (1997) 421–436.
[5] H. Bremmer, Random volume scattering, Radiat. Sci. J. Res. 680 (1964) 967–981.
[6] B. Chance, R.R. Alfano (Eds.), Photon migration and imaging in random media, and tissues,
vol. 1888, SPIE, 1993.
[7] B. Chance, R.R. Alfano (Eds.), Optical Tomography, Photon migration, and spectroscopy of
tissue and model media: theory, human studies, and instrumentations, Part 1 and 2, vol. 2389,
SPIE, 1995.
[8] B. Chance, C.E. Cooper, D.T. Delpy, E.O.R. Reynolds, Near-infrared spectroscopy and
imaging of living systems, Phil. Trans. R. Soc. Lond. B 352 (1997) 643–648.
[9] R. Chandrasekhar, Radiation Transfer, Oxford, Clarendon, 1950.
[10] National Research Council, Mathematics and Physics of Emerging Biomedical Imaging,
National Academy Press, Washington DC, 1996.
170 T. Khan, A. Smirnova / Appl. Math. Comput. 161 (2005) 149–170
[11] D.T. Delphy, M. Cope, Quantification in tissue near-infrared spectroscopy, Phil. Trans. R.
Soc. Lond. B 352 (1997) 649–659.
[12] J.E. Dennis, R.B. Schnabel, Numerical Methods for Unconstrained Optimization and
Nonlinear Equation, Prentice-Hall, Englewood Cliffs, NJ, 1983.
[13] M. Hanke, J. Nagy, C. Vogel, Quasi-Newton approach to nonnegative image restorations,
Linear Algebra Appl. 316 (2000) 223–236.
[14] J. Hebden, S.R. Arridge, D.T. Delphy, Optical imaging in medicine: 1. Experimental
techniques, Phys. Med. Biol. 42 (1997) 825–840.
[15] A. Hielscher, R. Alcouffe, R. Barbour, Comparison of finite-difference transport and diffusion
calculations for photon migration in homogenous and heterogenous tissues, Phys. Med. Biol.
42 (1998) 1285–1302.
[16] T. Hohage, Logarithmic convergence rates of the iteratively regularized Gauss–Newton
method for an inverse potential and inverse scattering problem, Inverse Prob. 13 (1997) 1279–
1299.
[17] A. Ishimaru, Single Scattering and Transport, Theory (Wave Propagation and Scattering in
Random Media I), Academic, New York, 1978.
[18] M. Klibanov, T.R. Lucas, Numerical solution of a parabolic inverse problem in optical
tomography using experimental data, SIAM J. Appl. Math. 59 (5) (1999) 1763–1789.
[19] K. Levenberg, A method for the solution of certain problems in least squares, Quart. Appl.
Math. 2 (2) (1944) 164–168.
[20] H.W. Lewis, Multiple scattering in an infinite medium, Phys. Rev. 78 (1950) 526–529.
[21] D. Marquardt, An algorithm for least, squares estimation of nonlinear parameters, SIAM J.
Appl. Math. 11 (2) (1963) 431–441.
[22] S.G. Nash, A. Sofer, Linear and Nonlinear Programming, McGraw-Hill, New York, 1996.
[23] F. Natterer, F. W€ ubbeling, Mathematical methods in image reconstruction, SIAM Mono-
graphs on Mathematical Modeling and Computation, 2001.