Professional Documents
Culture Documents
Introduction To Simulation - Lecture 21: Boundary Value Problems - Solving 3-D Finite-Difference Problems
Introduction To Simulation - Lecture 21: Boundary Value Problems - Solving 3-D Finite-Difference Problems
Outline
Reminder about FEM and F-D
1-D Example
Krylov Method
Communication Lower bound
Preconditioners based on improving communication
Heat Flow
1-D
Example
Normalized 1-D Equation
T ( x )
u ( x)
= hs
= f ( x)
2
x
x
x
2
u xx ( x ) = f ( x )
FD Matrix
properties
x0 x1 x2
2
1
1
0
2
x
M
0
SMA-HPC 2002 MIT
uxx = f
1-D Poisson
Finite Differences
xn xn +1
2 1
O O
O O
L 0
L
O
O
O
1
u j +1 2u j + u j
x
= f (x j )
u1
f ( x1 )
M
0 M
M
M M
0 M = M
M
1 M
2 M
M
u
f ( x )
n
n
Residual Equation
Using Basis
Functions
Partial Differential Equation form
u
2 = f
x
2
u (0) = 0 u (1) = 0
u ( x ) uh ( x ) = i i ( x )
{
i =1
Basis Functions
d i ( x )
R ( x ) = i
+ f ( x)
2
dx
i =1
n
Basis Weights
Galerkin Scheme
( x ) R ( x ) dt = 0
l
d i ( x )
i
+ f ( x ) dx = 0 l {1,..., n}
2
0 l ( x )
dx
i =1
Basis Weights
Using Basis
Functions
d l ( x )
0 dx
1
d i i ( x )
i =1
dx
dx i ( x ) f ( x ) dx = 0
0
l {1,..., n}
SMA-HPC 2002 MIT
Structural Analysis of
Automobiles
Equations
Force-displacement relationships for
mechanical elements (plates, beams, shells)
and sum of forces = 0.
Partial Differential Equations of Continuum
Mechanics
Equations
Navier-Stokes Partial Differential Equations.
Engine Thermal
Analysis
Equations
The Poisson Partial Differential Equation.
FD Matrix
properties
Discretized Poisson
x1
x2
x2m
xm +1
u j +1 2u j + u j 1
244
x
144
3
u
xx
2
xm
u j + m 2u j + u j m
y
1442443
u
yy
2
= f (x j )
FD Matrix
properties
3-D Discretization
FD Matrix
properties
Discretized Poisson
x j m
x j 1
x j m2
xj
x j +1
x j + m2
u j +1 2u j + u j 1
( x )
1442443
u xx
2
u j + m 2u j + u j m
( y )
1442443
u yy
2
x j +m
u j + m 2 2u j + u j m 2
( z )
144
42444
3
u zz
2
= f (xj )
FD Matrix
properties
3-D Discretization
Matrix nonzeros, m = 4 example
FD Matrix
properties
Summary
Numerical Properties
| Aii | | Aij |
j i
1
1
2 D 2 4, 3 D 2 6,
Summary
FD Matrix
properties
Structural Properties
2 D m 2 m 2 , 3 D m3 m3
Aij = 0
|i j | > 1
2D
Aij = 0
|i j| > m
3 D
Aij = 0
| i j | > m2
3,
2D
5, 3 D
Basics of GE
A11
A
0
21
A
031
041
A
SMA-HPC 2002 MIT
Triangularizing
Picture
A
A2323 A
A24
A22
22 A
24
A03232 A33
A
34
34
33
Triangularizing
GE Basics
Algorithm
For i = 1 to n-1 {
For each Row
For j = i+1 to n {
For each Row below pivot
For k = i+1 to n { For each element beyond Pivot
Ajk Ajk
Multiplie
r
}
}
Aji
Aii
Pivot
Aik
Form n-1 reciprocals (pivots)
Form
n 1
n2
(n i ) =
2
i =1
n 1
Perform
SMA-HPC 2002 MIT
multipliers
(n i)2
i =1
2 3
n
3
Multiply-adds
Complexity of GE
1 D
O ( n3 ) = O ( m3 )
2D
3 D
Triangularizing
Banded GE
Algorithm
b
b
For i = 1 to n-1 {
For j = i+1 to ni+b-1
{ {
{
For k = i+1 to n i+b-1
{
Ajk Ajk
NONZEROS
Aii
Aik
}
}
}
n 1
Perform
(min(b 1, n i))
i =1
Aji
O (b2n )
Multiply-adds
Complexity of
Banded GE
1 D
O (b 2 n) = O (m)
2D
3 D
Preconditioning
Preconditioning
Krylov Methods
Diagonal Preconditioners
Let A = D + And
(
Krylov Methods
Convergence Analysis
Optimality of GCR poly
Therefore
Any polynomial which satisfies the
constraints can be used to get an
upper bound on
SMA-HPC 2002 MIT
r k +1
r
xexact (1 digit)
b = r0
-.5
1.25 -1 -.5
D A
D 1 A
1-D Discretized
PDE
0.5
0
0 1
1
0.5
1
0
O
0
0.5
0
O
O 0.5 M
0
0
0.5
1 0
14444
4244444
3
SMA-HPC 2002 MIT
D1A
1
1.25
0.5
M0.5
0
0
xexact (1 digit)
b = r0
1
D A
1
D A
-.5
1.25 -1 -.5
0
k +1
j
= x + j r 0
Need at least m iterations for x
m
j =1
1
0
If r 0 =
M
0
Takes 2m = O ( m ) iters
m2
SMA-HPC 2002 MIT
for x k +1
)m
Eigenanalysis
0.5
0
0 1
1
0.5
1
0
O
0
0.5
0
O
O 0.5 M
0
0
0.5
1 0
14444
3
4244444
1
1.25
0.5
M0.5
0
0
D1A
k
Recall Eigenvalues of D A = 1 cos
m +1
1
Eigenanalysis
For D 1 A,
max
min
m
= 1 cos
1
cos
+
+
m
1
m
1
1
2
+1
k =
log
convergence
tolerance
1 m
log
+1
O (m)
O (m)
O (m)
O (m
O (m
)
O (m )
4
)
O (m )
3
GCR
O (m
)
O (m )
O (m )
Gauss-Seidel Preconditioning
Preconditioning
Approaches
Physical View
u0 u1
u2(old)
u1(new)
(new)
2
Gauss Seidel
u3(old)
un(new)
1
un(new) u
n +1
Gauss-Seidel Preconditioning
Preconditioning
Approaches
xexact (1 digit)
b =1 r 0
( D + L) A
( D + L)
b =1 r
( D + L) A
( D + L)
Krylov Vectors
1-D Discretized
PDE
X=nonzero
Gauss-Seidel Preconditioning
Preconditioning
Approaches
Symmetric Gauss-Seidel
(new)
u
u0 1
u2(old)
(new)
u
(new)
2
u1
u3(old)
un(new)
1
u
(newer)
u
u 1
0
(new)
n2
un(newer)
1
un(new) u
n +1
un( new)
u2(newer)
Gauss-Seidel Preconditioning
Preconditioning
Approaches
Symmetric Gauss-Seidel
k +1
= ( D + U ) L ( D + L ) Ux k
1
1
1
+ ( D +U ) b ( D +U ) L ( D + L) b
k +1
= x ( D + U ) D ( D + L ) Ax k
+ ( D +U ) D ( D + L) b
1
Block Diagonal
Preconditioners
Preconditioning
Approaches
Line Schemes
Grid
Matrix
m2
Block Diagonal
Preconditioners
Preconditioning
Approaches
Line Schemes
Grid
Problem
Lines preconditioners
communicate rapidly in
only one direction
Solution
m2
Do lines first in x,
then in y.
Block Diagonal
Preconditioners
Preconditioning
Approaches
Domain Decomposition
Approach
The trade-off
m2
Block Diagonal
Preconditioners
Preconditioning
Approaches
Domain Decomposition
m points
Block
index
m
l2
m
points
l
2
m
Block cost: factoring l l grids, O ( m 2l ) , sparse GE
l
m
GCR iters: Communication bound gives O iterations.
l3
Suggests insensitivity to l: Algorithm is O ( m ) .
Preconditioning
Approaches
Line Schemes
Grid
Matrix
m2
Overlapping Domain
Preconditioners
Preconditioning
Approaches
Grid
Matrix
m2
Preconditioning
i
Approaches
Incomplete Factorization
Schemes
Outline
Sparse Matrices
Fill-In
Example
X X X
X X 0
X 0 X
X X X
X X X
0 X
X X
X= Non zero
Fill-ins
SMA-HPC 2002 MIT
Fill-In
Sparse Matrices
Second Example
Fill-ins Propagate
X
X
X
0
X
0
X
0
X
0
X
0
Sparse Matrices
Fill-In
Very Sparse
Very Sparse
Dense
Sparse Matrices
Fill-In
Unfactored Random Matrix
Sparse Matrices
Fill-In
Factored Random Matrix
FD Matrix
properties
3-D Discretization
Matrix nonzeros, m = 4 example
Preconditioning
i
Approaches
Incomplete Factorization
Schemes
Key idea
Summary
3-D BVP Examples
Aerodynamics, Continuum Mechanics, Heat-Flow
Krylov Method
Communication Lower bound
Preconditioners based on improving communication