Professional Documents
Culture Documents
Donald Estep
Department of Mathematics Department of Statistics Colorado State University
Research supported by Department of Energy: DE-FG02-04ER25620 and DE-FG02-05ER25699; Sandia Corporation: PO299784; National Aeronautics and Space Administration: NNG04GH63G; National Science Foundation: DMS-0107832, DGE-0221595003, MSPA-CSE-0434354
Motivation
Contact Contact
Contact
Conductor
Conductor 250 m
Vswitch
Vswitch
Vswitch
Multiple physical components, multiple scales = complicated analytic and computational issues
results to experimental observations Applications requiring results for a range of data and parameter values raise a critical need for quantication and control of uncertainty
Donald Estep, Colorado State University p. 8/196
computed information The context is important: It is often difcult or impossible to obtain uniformly accurate solutions of multiscale, multiphysics problems throughout space and time
Fundamental Tools
We employ two fundamental mathematical tools
Duality and adjoint operators Variational analysis
These tools have a long history of application in analysis of model sensitivity and optimization More recently, they have been applied to a posteriori error estimation for differential equations Currently, they are being applied to the analysis and application of multiphysics problems
(f ) =
a
f (x) dx
and
(f ) = f (y ) for a y b
Sampling a Vector
A linear functional is a one dimensional sample of a vector Example The linear functional on Rn given by the inner product with the basis vector ei gives the ith component of a vector Example Statistical moments like the expected value E (X ) of a random variable X are linear functionals Example The Fourier coefcients of a continuous function f on [0, 2 ],
2
cj =
0
f (x) eijx dx
are functionals of f
Sampling a Vector
Using linear functionals of a solution means settling for a set of samples rather than the entire solution Presumably, it is easier to compute accurate samples than solutions that are accurate everywhere In many situations, we settle for an incomplete set of samples Example We are often happy with a small set of moments of a random variable Example In practical applications of Fourier series, we truncate the innite series to a nite number of terms,
J
cj eijx
j = j =J
cj eijx
x=0
It is easy to
C ([a,b])
sup
f C ([a,b]) max |f |=1 a
f (x) dx
by looking at a picture.
possible functions a
-1
Computing the dual norm of the integration functional The maximum value for I (f ) is clearly given by f = 1 or f = 1, and I C ([a,b]) = b a
Donald Estep, Colorado State University p. 20/196
Lp ()
Lq ()
g (x)f (x) dx
We can identify (Lp ) with Lq when 1 < p, q < The cases p = 1, q = and p = , q = 1 are trickier
The Riesz Representation theorem says this is the only kind of linear functional on a Hilbert space We can identify the dual space of a Hilbert space with itself Linear functionals are commonly represented as inner products
Riesz Representors
Some useful choices of Riesz representors for functions f in a Hilbert space include:
= /| | gives the error in the average value of f over a
of f on a curve c in Rn , n = 2, 3, and = s gives the average value of f over a plane surface s in R3 ( denotes the corresponding delta function) similarly
c f (s) ds
We can obtain average values of derivatives using dipoles = f / f gives the L2 norm of f
X ,
x X, y X
Adjoint Operators
some x X
involved?
Given a sample value, what can we say about x? Given a collection of sample values, what can we say about
L?
Donald Estep, Colorado State University p. 26/196
Adjoint of a Matrix
Example Let X = Rm and Y = Rn with the standard inner product and norm
L L(Rm , Rn ) is associated with a n m matrix A: a11 . A= . . a1m . . . ,
m
a n1
anm
y1 . y= . . , yn 1in
x1 . x= . . xm
and
yi =
j =1
aij xj ,
Adjoint of a Matrix
The bilinear identity reads
(Lx, y ) = (x, L y ), x Rm , y Rn .
=
j =1 m
y1 a1j xj + n yi aij xj
. . . m j =1 anj xj
yn anj xj
m j =1 a1j xj
j =1
=
j =1 i=1
Adjoint of a Matrix
L (y ) is given by the inner product with y = ( y1 , , y m ) where
n
y j =
i=1
yi aij .
anm
a n1 . . . =A .
L 1 = L1 0 = 0
(L1 + L2 ) = L + L 1 2
(L1 ) = L 1,
all R
is obtained by a succession of integration by parts Boundary terms involving functions and derivatives arise from each integration by parts We use a two step process 1. We rst compute the formal adjoint by assuming that all functions have compact support and ignoring boundary terms 2. We then compute the adjoint boundary and data conditions to make the bilinear identity hold
Donald Estep, Colorado State University p. 33/196
Formal Adjoints
Example Consider
d Lu(x) = dx d d a(x) u(x) + (b(x)u(x)) dx dx
d dx
=
0
1 0 1 0
=
0
d u(x) dx
d a(x) v (x) dx
d dx + u(x)a(x) v (x) dx
Formal Adjoints
1 0 1 0 1 d u(x)b(x) v (x) dx+b(x)u(x)v (x) , dx 0
d (b(x)u(x))v (x) dx = dx
d = L v = dx
Formal Adjoints
In higher space dimensions, we use the divergence theorem Example A general linear second order differential operator L in Rn can be written
n n
L(u) =
i=1 j =1
2u + aij xi xj
n i=1
u bi + cu, xi
L (u) =
i=1 j =1
2 (aij v ) xi xj
n i=1
(bi v ) + cv xi
(s v sv ) dx = (vs sv )
1 0
v u u v n n
ds,
the Dirichlet and Neumann boundary value problems for the Laplacian are their own adjoints
Since
(uv v u) dx =
v v n s
u u + n s
ds,
du v dt = u(t)v (t) dt
T 0
dv u dt dt
The boundary term at 0 vanishes because u(0) = 0 The adjoint is a nal-value problem with initial condition v (T ) = 0 The adjoint problem has dv dt and time runs backwards
Condition of an Operator
There is an intimate connection between the adjoint problem and the stability properties of the original problem Theorem The singular values of a matrix L are the square roots of the eigenvalues of the square, symmetric transformations L L or LL This connects the condition number of a matrix L to L
Theorem A necessary condition that b is in the range of L is y (b) = 0 for all y in the null space of L This is a sufcient condition if the range of L is closed in Y Example If A is an n m matrix, a necessary and sufcient condition for the solvability of Ax = b is b is orthogonal to all linearly independent solutions of A y = 0
right away
This yields a natural denition of a solution in the
over-determined case
This gives conditions for a solution to exist in the
under-determined case The more over-determined the original system, the more under-determined the adjoint system, and so forth
or
The adjoint to div is -grad modulo boundary conditions If F = grad u, where u is subject to the boundary condition u() = 0, then we obtain the square, well-determined problem div grad u = u = , which has a unique solution because of the boundary condition
Greens Functions
Suppose we wish to compute a functional (x) of the solution x Rn of a n n system
Lx = b
We can compute many solutions by computing one adjoint solution and taking inner products
Greens Functions
This is the method of Greens functions in differential equations Example For
u = f, u = 0, x , x
This yields
u(x) = (u, x ) = (f, )
The generalized Greens function solves the adjoint problem with general functional data, rather than just x The imposition of the adjoint boundary conditions is crucial
Donald Estep, Colorado State University p. 55/196
Nonlinear Operators
Nonlinear Operators
There is no natural adjoint for a general nonlinear operator We assume that the Banach spaces X and Y are Sobolev spaces and use ( , ) for the L2 inner product, and so forth We dene the adjoint for a specic kind of nonlinear operator We assume f is a nonlinear map from X into Y , where the domain of f is a convex set
A Perturbation Operator
We choose u in the domain of f and dene
F (e) = f (u + e) f (u),
We assume that the domain of F is independent of e and dense in X Note that 0 is in the domain of F and F (0) = 0
A Perturbation Operator
Two reasons to work with functions of this form:
This is the kind of nonlinearity that arises when estimating
Denition 1
The rst denition is based on the bilinear identity Denition An operator A (e) is an adjoint operator corresponding to F if
(F (e), w) = (e, A (e)w)
Denition 1
Example Suppose that F can be represented as F (e) = A(e)e, where A(e) is a linear operator with the domain of F contained in the domain of A For a xed e in the domain of F , dene the adjoint of A satisfying
(A(e)w, v ) = (w, A (e)v )
for all w domain of A, v domain of A Substituting w = e shows this denes an adjoint of F as well If there are several such linear operators A, then there will be several different possible adjoints.
where a > 0 is a constant and the domain of F is the set of continuously differentiable functions.
v e + a+ A2 (e)v = t x
Denition 2
If the nonlinearity is Frechet differentiable, we base the second denition of an adjoint on the integral mean value theorem The integral mean value theorem states
1
f (U ) = f (u) +
0
f (u + se) ds e
Denition 2
We rewrite this as
F (e) = f (U ) f (u) = A(e)e
with
1
A(e) =
0
f (u + se) ds.
A(e) =
0
F (se) ds.
Denition 2
Denition For a xed e, the adjoint operator A (e), dened in the usual way for the linear operator A(e), is said to be an adjoint for F Example Continuing the previous example,
v e v F (e)v = +e + a+ t x x
v.
After some technical analysis of the domains of the operators involved, w e w A (e)w = + aw. t 2 x This coincides with the third adjoint computed above
Ce
Lt
dp+1 y dtp+1
L (0,t)
It is typical for an a priori convergence bound to be orders of magnitude larger than the error
Donald Estep, Colorado State University p. 71/196
where
L(D, x)u = a(x)u + b(x) u + c(x)u(x),
Rn , n = 1, 2, 3, is a convex polygonal domain a = (aij ), where ai,j are continuous and there is a a0 > 0
This says that the solution solves the average form of the problem for a large number of weights v
Th
hK
U=0
Denition The nite element method is: Compute U Vh such that A(U, v ) = (f, v ) for all v Vh This says that the nite element approximation solves the average form of the problem for a nite number of weights v
Donald Estep, Colorado State University p. 77/196
1 u0 -1 x y
error/estimate
0 0.0
0.1
1.0
10.0
100.0
percent error
(e, ) dt =
0 0
+
0
u(0) = u0
component 1 component 2
2 Time
0 < t,
50 40 30
U3
U2
U1
Component Error
U1 U2 U3 Time
Componentwise Errors Computed using Solutions with Tolerances .001 and .0001
Donald Estep, Colorado State University p. 93/196
U1
U2
U3
U1
U2
U3
space)
use of quadrature to compute integrals in a variational
equations
model error data and parameter error operator decomposition
The stability factor is a weak condition number for the quantity of interest We can obtain from by taking the sup over all = 1
Donald Estep, Colorado State University p. 100/196
0 < t,
-10 U1 30 -20 U2
30
-10 U1 30 -20 U2
30
10 t
20
10 t
20
-6 -6
Adaptive Computation
Adaptive Computation
The possibility of accurate error estimation suggests the possibility of optimizing discretizations Unfortunately, cancellation of errors signicantly complicates the optimization problem In fact, there is no good theory for adaptive control of error There is good theory for adaptive control of error bounds The standard approach is based on optimal control theory The stability information in adjoint-based a posteriori error estimates is useful for this
where ( , )K is the inner product on K The element contributions in the bound do not cancel
Donald Estep, Colorado State University p. 108/196
^
Donald Estep, Colorado State University p. 115/196
6 4 2 0
6 4 2 8 x 10 0
2 y 1 0 0 2
2 y1 0 0 2
6 4 x
10
= = E+ E
elt. contrib.
We apportion the number of elements N to be rened between the positive and negative contributions as
N
+
E+ , =N + E +E
E =N + E + E
log(Y)
7.61
2.54
12.7
17.7
22.8
27.9
43.1
Time
48.2
33
38
Contribution
2.3 6.9 11.5 16.1 20.7 25.3 29.9 34.5 39.1 43.7 48.3
Contribution
2.5E-10
Y1 Y2 Y3
Time
Element contributions to the error: nal time (left) and average error (right)
We see that the element contributions are largest in the brief transient periods.
Length
0.01
9.82
19.6
32.8
35.5
24.6
38.1
40.6
44.5
47.9
between components
We account for the fact that the adjoints to the original
problem and an operator decomposition discretization are not the same Additional work is required to obtain computable estimates
Donald Estep, Colorado State University p. 126/196
x x , x ,
2 b=
25 sin(4x) sin(x)
and the average value Estimating the error requires auxiliary estimates of the error in the information that is passed between components
2 1.5 1 0.5 0 1 0.5 0 0 0.2 0.4 0.6 0.8 1 0.5 0 0 0.2 0.4 0.6 0.2 0.15 0.1 0.05 0 0.05 0.1 1 0.8 1
0 < t,
The diffusion component u induces stability and change over long time scales The reaction component F induces instability and change over short time scales
...
tn1 < t tn ,
tn1 < t tn ,
Un
Un-1
D Un-1
UD n
Un
t1 s
t2 s
t3 s
t4 s
t5
s Reaction Integration: s
1,0
2,0
...
s
2
2,M
s 3
4,0
...
4,M
...
1,M
3,0
...
3,M
and Backward Euler with time step of .004 for the reaction
-1
-2
-3
-4
p slo
-3 -2
e1
t = 6.4 t = 16 t = 32 t = 64 t = 80 10 t
-1
-5
10
10
10
10
Spatial Location
component
Q2
E a computable estimate for the error in the adjoint arising from operator splitting
higher order
Error
Error
Component
Time
Error
Error
Component
Component
Space of outputs
Assume that the data and/or the parameters are unknown within a given range and/or subject to random or unknown variation Problem: determine the effect of the uncertainty or variation on the output of the operator We consider the input to be a random vector associated with a probability distribution The output of the model is random vector associated with a new distribution
A Finite-Dimensional Problem
We determine the distribution of a linear functional (x, ) given the random vector Rd , where x satises
f (x; ) = b
Mean
1 10.1 1 5 1 1
Perturbation
50% 50% 50% 50% 50% 50%
We use the L1 norm of the prey population at t = 10 as the quantity of interest ( = (t 10)(0, 1) ) We use a 12400 point Monte-Carlo computation as a reference
Donald Estep, Colorado State University p. 150/196
1 Cumulative Densities .8 .6 .4 .2 0 0 4 Monte-Carlo (12400) FAPS (19) FAPS (41) FAPS (1001) 8 12 16 20 q()
Donald Estep, Colorado State University p. 156/196
8 6 4 2 2 1.5 1
15 10
5 0
0 < t,
We use 1000 point Monte-Carlo sampling with both a xed time step computation and an adaptive computation with error smaller than 105
Error
0 0 0.01
0.03
0.05
0.07
The distribution for the quantity of interest using a xed time step
The distribution for the quantity of interest using adapted time steps
Donald Estep, Colorado State University p. 162/196
u(x; ) (x) dx
We use a conjugate gradient method using the Hestenes-Stiefel formula and the secant method for the line search
old
q(u()) q(u())
new
x , x .
Numerical Approximations
Standard nite element approximation: compute U Vh
(aU, v ) = (f (U ), v )
for all v Vh
We need to control the errors in the value and the gradient used for the search
This is the computable correction for the effects of numerical approximation This expression reects the change in stability arising from the change in parameter value If we control this error, we can prove that the search leads to a minimum
element contribution
The corresponding bound on the error in the gradient has the form
|error in q (U )|
elements
Optimization Example
We optimize (u, 1) where u solves
u = u2 + tanh2 20e1 (11 ) (x e2 (12 )1) cos2 ( 2 x), u(1) = u(1) = 0 [1, 1],
0 1 4 3 2 2 1 0 1 2 1 0 2 1 1 3 4
17 Search Iterations
1 1
0.5
0.5
Domain Decomposition
Denition The quantities {(U, pi )} corresponding to the data {i = pi } are called the localized information corresponding to the partition of unity We consider the problem of estimating the error in the localized information for 1 i N
Donald Estep, Colorado State University p. 175/196
We hope that this will require a locally rened mesh because the corresponding data has localized support
Vp =
i=1
Vi p i =
i=1
vi pi : vi Vi
Theorem The error of the partition of unity nite element solution satises
N
(u Up , ) =
i=1
(x, y ) , (x, y ) ,
u(x, y ) = 0,
Level Elements Estimate 1 80 .0005919 2 193 .001595 3 394 .0009039 4 828 .0003820 5 1809 .0001070 6 3849 .00004073 7 9380 .00001715 8 23989 .000007553
11 1
12 2
13 3
14 4
15 5
16 6
17 7
18 8
19 9
20 10
^
Donald Estep, Colorado State University p. 187/196
Work in Progress
domain decomposition
References
References
These references are a starting point for further investigation
Ainsworth, M. and J. T. Oden, A Posteriori Error Estimation In Finite Element Analysis, Pure and Applied Mathematics Series, 2000, Wiley-Interscience, 2000. Babuska, I. and T. Strouboulis, The Finite Element Method and its Reliability, 2001, Oxford University Press: New York. W. Bangerth and R. Rannacher, Adaptive Finite Element Methods for Differential Equations, 2003. Birkhauser Verlag: New York. Becker, R. and R. Rannacher, An Optimal Control Approach to A Posteriori Error Estimation in Finite Element Methods, Acta Numerica, 2001. p. 1-102. Braack, M. and A. Ern, A Posteriori Control Of Modeling Errors And Discretization Errors, SIAM J. Multiscale Modeling and Simulation, 2003. 1, p. 221-238. Cacuci, D.G., Sensitivity and Uncertainty Analysis I. Theory, 2003, Chapman and Hall: Boca Raton, Florida. Cacuci, D.G., M. Ionescu-Bujor, and I. M. Navon, Sensitivity and Uncertainty Analysis. II. Applications to Large Scale Systems, 2005, Chapman and Hall: Boca Raton, Florida.
References
Carey, V., D. Estep, and S. Tavener, A Posteriori Analysis and Adaptive Error Control for Operator Decomposition Methods for Elliptic Systems I: One Way Coupled Systems, submitted to SIAM Journal on Numerical Analysis, 2007. Carey, V., D. Estep, and S. Tavener, A Posteriori Analysis and Adaptive Error Control for Operator Decomposition Methods for Elliptic Systems II: Fully Coupled Systems, in preparation. Eriksson, K., D. Estep, P. Hansbo and C. Johnson, Introduction To Adaptive Methods For Differential Equations, Acta Numerica, 1995. p. 105-158. Eriksson, K., D. Estep, P. Hansbo and C. Johnson, Computational Differential Equations, 1996, Cambridge University Press: New York. Estep, D., A Short Course on Duality, Adjoint Operators, Greens Functions, and A Posteriori Error Analysis, Sandia National Laboratories, Albuquerque, New Mexico, 2004. Notes can be downloaded from http://math.colostate.edu/estep Estep, D., M. G. Larson, and R. D. Williams, Estimating The Error Of Numerical Solutions Of Systems Of Reaction-Diffusion Equations, Memoirs of the American Mathematical Society, 2000. 146, p. 1-109.
References
Estep, D. and D. Neckels, Fast And Reliable Methods For Determining The Evolution Of Uncertain Parameters In Differential Equations, Journal on Computational Physics, 2006. 213, p. 530-556. Estep, D. and D. Neckels, Fast Methods For Determining The Evolution Of Uncertain Parameters In Reaction-Diffusion Equations, to appear in Computer Methods in Applied Mechanics and Engineering, 2006. Estep, D., V. Ginting, D. Ropp, J. Shadid and S. Tavener, An A Posteriori Analysis of Operator Splitting, submitted to SIAM Journal on Numerical Analysis, 2007. Estep, D., M. Holst and M. Larson, Generalized Greens Functions And The Effective Domain Of Inuence, SIAM Journal on Scientic Computing, 2005. 26, p. 1314-1339. Estep, D., A. Malqvist, and S. Tavener, A Posteriori Analysis For Elliptic Problems With Randomly Perturbed Data And Coefcients, in preparation Estep, D., A. Malqvist, and S. Tavener, Adaptive Computation For Elliptic Problems With Randomly Perturbed Data And Coefcients, in preparation
References
Estep, D., S. Tavener and T. Wildey, A Posteriori Analysis and Improved Accuracy for an Operator Decomposition Solution of a Conjugate Heat Transfer Problem, submitted to SIAM Journal on Numerical Analysis, 2006. Giles, M. and E. Suli, Adjoint Methods for PDEs: A Posteriori Error Analysis And Postprocessing By Duality, 2002. Acta Numerica, p. 145-236. Hundsdorfer, W. and Verwer, J., Numerical solution of time-dependent advection-diffusion-reaction equations, Springer Series in Computational Mathematics, 2003. Springer-Verlag: New York. Lanczos, C., Linear Differential Operators, 1996. SIAM: Philadelphia. Marchuk, G.I., Splitting and Alternating Direction Methods, in Handbook of Numerical Analysis, 1990, P. G. Ciarlet and J. L. Lions, editors. North-Holland: New York, p. 197-462. Marchuk, G.I., Adjoint Equations and Analysis of Complex Systems, 1995. Kluwer: Boston. Marchuk, G.I., V. I. Agoshkov, and V. Shutyaev, Adjoint Equations And Perturbation Algorithms In Nonlinear Problems, 1996. CRC Press: Boca Raton, Florida.
References
Oden, J.T. and Prudhomee, S., Estimation of Modeling Error in Computational Mechanics, Journal on Computational Physics, 2002. 182, p. 496-515.