
Lecture 3

Numerical methods
System of linear equations

OUTLINE

1. System of linear equations (linear system)
2. Gaussian elimination
3. Pivoting
4. LU decomposition
5. Cholesky decomposition
6. Effect of round-off errors
7. Conditionality
8. System of homogeneous linear equations
System of linear equations (linear system)

Basic definitions
x1·a11 + ... + xn·a1n = b1
x1·a21 + ... + xn·a2n = b2
...                                     (1)
x1·am1 + ... + xn·amn = bm

We can assign two matrices to the system (1):


- coefficient matrix:

      ( a11  a12  ...  a1n )
  A = ( a21  a22  ...  a2n )
      (  ⋮    ⋮          ⋮ )
      ( am1  am2  ...  amn )

- augmented matrix:

       ( a11  a12  ...  a1n | b1 )
  A´ = ( a21  a22  ...  a2n | b2 )
       (  ⋮    ⋮          ⋮ |  ⋮ )
       ( am1  am2  ...  amn | bm )
System of linear equations (linear system)

Denote

      ( x1 )            ( b1 )
  x = ( x2 )        b = ( b2 )
      (  ⋮ )            (  ⋮ )
      ( xn )            ( bm )

Then the linear system (1) reads

( a11  ...  a1n ) ( x1 )   ( b1 )
( a21  ...  a2n ) ( x2 )   ( b2 )
(  ⋮          ⋮ ) (  ⋮ ) = (  ⋮ )
( am1  ...  amn ) ( xn )   ( bm )

Ax = b
System of linear equations (linear system)

If b = (0, 0, ..., 0)T, then the system (1) is called homogeneous;

if b ≠ (0, 0, ..., 0)T, then it is called nonhomogeneous.
System of linear equations (linear system)

Definition: We say that the n-tuple (r1, …, rn) is a solution of system (1)
if A·r = b, where

r = (r1, r2, …, rn)T.

Note

System (1) is consistent


if it has at least one solution;
otherwise it is inconsistent.
System of linear equations (linear system)

Direct methods

attempt to solve the problem


by a finite sequence of operations.
In the absence of rounding errors,
direct methods would deliver an exact solution.

Iterative methods

deliver only an approximate solution.


The number of steps depends
on the required precision.
Example: Find the solution of linear system in real numbers.

x-2y+4z+ t =-6
2x+3y- z+2t =13 | (2)-2*(1)
2x+5y+ z+ t = 8 | (3)-2*(1)
3x+ y+3z+ t = 1 | (4)-3*(1)

x-2y+4z+ t =-6
7y-9z =25
9y-7z- t =20 | (3)-9/7*(2)
7y-9z-2t =19 | (4)-7/7*(2)

x-2y+ 4z+ t = -6
7y -9z = 25
32/7z- t =-85/7
-2t = -6

x = 1, y = 1, z = - 2, t = 3
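A quick cross-check of this result with NumPy (illustration only; numpy.linalg.solve is a library routine that itself uses LU decomposition with partial pivoting):

import numpy as np

A = np.array([[1, -2,  4, 1],
              [2,  3, -1, 2],
              [2,  5,  1, 1],
              [3,  1,  3, 1]], dtype=float)
b = np.array([-6, 13, 8, 1], dtype=float)

x = np.linalg.solve(A, b)
print(x)          # [ 1.  1. -2.  3.]  ->  x = 1, y = 1, z = -2, t = 3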
Elementary row operations

Definition: Elementary row operations (ERO)


on a matrix are:

– row switching (ri ↔ rj)

– row multiplication by a nonzero constant (ri → c·ri, c ≠ 0)

– row addition (ri → ri + c·rj)
Row equivalence of matrices

Definition: We say that matrix A of type m x n is


row equivalent to the matrix B of type m x n
if it is possible to transform A into B
by a sequence of elementary row operations.
Upper triangular matrix

Definition: We say that a square matrix U is


upper triangular (or right triangular)
if all the entries below the main diagonal are zero.
Row equivalence of matrices

Theorem: Each matrix is row equivalent


to some upper triangular matrix.
Rank of a matrix

Definition: The rank of matrix A is


the number of nonzero rows
of an upper triangular matrix row equivalent
to the matrix A;
we denote it h(A).

or

The rank of A is the maximal number


of linearly independent rows of A.

Consequence: Row equivalent matrices have the same rank.


Example: Find the solution of linear system in real numbers.

x-2y+4z+ t =-6 1  2 4 1  6
 
2x+3y- z+2t =13 /(2)-2*(1)  2 3 1 2 13 
2x+5y+ z+ t = 8 /(3)-2*(1) 2 5 1 1 8
 
3x+ y+3z+ t = 1 /(4)-3*(1) 3 1 3 1 1 

æ1 - 2 4 1 -6 ö÷
x-2y+4z+ t = -6 çç ÷÷
7y-9z = 25 çç0 7 -9 0 25÷÷
çç ÷÷
9y-7z- t = 20 /(3)-9/7*(2) çç0 9 -7 -1 20÷÷
çç ÷÷
7y-9z-2t = 19 /(4)-7/7*(2) è 0 7 -9 - 2 19ø
æ1 - 2 4 1 -6 ö
x-2y+ 4z+ t = -6 çç ÷÷
7y- 9z = 25 ççç0 7 -9 0 25 ÷÷÷
32/7z- t =-85/7 çç0 0 32 / 7 -1 -85/7÷÷÷
çç ÷÷
-2t = -6 çè0 0 0 -2 -6 ÷ø

x = 1, y = 1, z = - 2, t = 3
Gaussian elimination

We showed a principle of
Gaussian elimination (GE):

Using a finite number of elementary row operations


on the coefficient matrix we get
the upper triangular matrix
and
using back substitution we calculate the vector of unknowns.
Gaussian elimination


Forward elimination:

Let
A(0) = A, b(0) = b
elements of matrix A(0) will be aij(0) = aij
elements of vector b(0) will be bi(0) = bi
Gaussian elimination

algorithm of GE

The multiplicator assures
that element (i,k) of matrix A(k)
will be zero.

The leading coefficient
(pivot)
has to be non-zero.
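Since the algorithm itself was shown as a figure, here is a minimal Python/NumPy sketch of the same idea (the helper name gauss_eliminate is illustrative): forward elimination with multiplicator m = a[i,k]/a[k,k], followed by back substitution, assuming all pivots are non-zero.

import numpy as np

def gauss_eliminate(A, b):
    A = A.astype(float).copy()
    b = b.astype(float).copy()
    n = len(b)
    # forward elimination: produce an upper triangular system
    for k in range(n - 1):
        for i in range(k + 1, n):
            m = A[i, k] / A[k, k]        # multiplicator; requires A[k, k] != 0
            A[i, k:] -= m * A[k, k:]     # element (i, k) becomes exactly zero
            b[i] -= m * b[k]
    # back substitution
    x = np.zeros(n)
    for i in range(n - 1, -1, -1):
        x[i] = (b[i] - A[i, i + 1:] @ x[i + 1:]) / A[i, i]
    return x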
Gaussian elimination

Example:
Solve the linear system

on the hypothetical computer,


which works in decimal system
with five digits mantissa.

The exact solution is


Gaussian elimination
Gaussian elimination – partial pivoting

Partial pivoting
is a modification of GE which ensures
that the absolute values of the multiplicators
are less than or equal to 1.

The algorithm selects as the pivot element


the entry with the largest absolute value
in the column of the matrix
that is currently being eliminated.
Gaussian elimination – partial pivoting
Gaussian elimination – complete (maximal) pivoting
Pivoting

Partial or complete pivoting?


usually partial pivoting is enough

Partial pivoting is generally sufficient to adequately reduce round-off error.

Complete pivoting is usually not necessary


to ensure numerical stability and,
due to the additional cost
of searching for the maximal element,
the improvement in numerical stability
is typically outweighed
by its reduced efficiency
for all but the smallest matrices.
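A minimal sketch of the partial-pivoting modification (illustrative helper name gauss_eliminate_pivot): before eliminating column k, the row with the largest absolute value in that column is swapped into the pivot position, so every multiplicator has absolute value at most 1.

import numpy as np

def gauss_eliminate_pivot(A, b):
    A = A.astype(float).copy()
    b = b.astype(float).copy()
    n = len(b)
    for k in range(n - 1):
        p = k + np.argmax(np.abs(A[k:, k]))   # row with largest |a[i,k]|, i >= k
        if p != k:                            # row switching (elementary row op.)
            A[[k, p]] = A[[p, k]]
            b[[k, p]] = b[[p, k]]
        for i in range(k + 1, n):
            m = A[i, k] / A[k, k]             # now |m| <= 1
            A[i, k:] -= m * A[k, k:]
            b[i] -= m * b[k]
    x = np.zeros(n)
    for i in range(n - 1, -1, -1):
        x[i] = (b[i] - A[i, i + 1:] @ x[i + 1:]) / A[i, i]
    return x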
Gaussian elimination

Computational efficiency

The number of arithmetic operations in forward elimination: ≈ (2/3)·n³

The number of arithmetic operations in back substitution: ≈ n²


LU decomposition (factorization)

x-2y+4z+ t =-6 1  2 4 1  6
 
2x+3y- z+2t =13 /(2)-2*(1)  2 3 1 2 13 
2x+5y+ z+ t = 8 /(3)-2*(1) 2 5 1 1 8
 
3x+ y+3z+ t = 1 /(4)-3*(1) 3 1 3 1 1 

æ1 - 2 4 1 -6 ö÷
x-2y+4z+ t = -6 çç ÷÷
7y-9z = 25 çç0 7 -9 0 25÷÷
çç ÷÷
9y-7z- t = 20 /(3)-9/7*(2) çç0 9 -7 -1 20÷÷
çç ÷÷
7y-9z-2t = 19 /(4)-7/7*(2) è 0 7 -9 - 2 19ø
æ1 - 2 4 1 -6 ö
x-2y+ 4z+ t = -6 çç ÷÷
7y- 9z = 25 ççç0 7 -9 0 25 ÷÷÷
32/7z- t =-85/7 çç0 0 32 / 7 -1 -85/7÷÷÷
çç ÷÷
-2t = -6 çè0 0 0 -2 -6 ÷ø

x = 1, y = 1, z = - 2, t = 3
LU decomposition (factorization)

x-2y+4z+ t =-6 1  2 4 1  6
 
2x+3y- z+2t =13 /(2)-2*(1)  2 3 1 2 13 
2x+5y+ z+ t = 8 /(3)-2*(1) 2 5 1 1 8
 
3x+ y+3z+ t = 1 /(4)-3*(1) 3 1 3 1 1 

æ 1 -2 4 1 -6 ö÷
x-2y+4z+ t = -6 çç ÷÷
7y-9z = 25 çç2 7 -9 0 25÷÷
çç ÷÷
9y-7z- t = 20 /(3)-9/7*(2) çç2 9 -7 -1 20÷÷
çç ÷÷
7y-9z-2t = 19 /(4)-7/7*(2) è 3 7 -9 - 2 19ø
æ 1 -2 4 1 -6 ö
x-2y+ 4z+ t = -6 çç ÷÷
7y- 9z = 25 çç2 7 -9 0 25 ÷÷÷
çç ÷÷
32/7z- t =-85/7 çç2 0 32 / 7 -1 -85/7÷÷
-2t = -6 çç ÷÷
è3 0 0 -2 -6 ø

x = 1, y = 1, z = - 2, t = 3
LU decomposition (factorization)

x-2y+4z+ t =-6 1  2 4 1  6
 
2x+3y- z+2t =13 /(2)-2*(1)  2 3 1 2 13 
2x+5y+ z+ t = 8 /(3)-2*(1) 2 5 1 1 8
 
3x+ y+3z+ t = 1 /(4)-3*(1) 3 1 3 1 1 

æ 1 -2 4 1 -6 ö÷
x-2y+4z+ t = -6 çç ÷÷
7y-9z = 25 çç2 7 -9 0 25÷÷
çç ÷÷
9y-7z- t = 20 /(3)-9/7*(2) çç2 9 -7 -1 20÷÷
çç ÷÷
7y-9z-2t = 19 /(4)-7/7*(2) è 3 7 -9 - 2 19ø
æ 1 -2 4 1 -6 ö
x-2y+ 4z+ t = -6 çç ÷÷
7y- 9z = 25 çç2 7 -9 0 25 ÷÷÷
çç ÷÷
32/7z- t =-85/7 çç2 9 / 7 32 / 7 -1 -85/7÷÷
-2t = -6 çç ÷÷
è3 1 0 -2 -6 ø

x = 1, y = 1, z = - 2, t = 3
LU decomposition (factorization)

The multiplicators mij from the forward elimination are placed


into the lower triangular matrix L.
LU decomposition (factorization)

We obtained the LU decomposition of matrix A


(the so-called Doolittle version – ones on the diagonal of matrix L)
because

A = L⋅ U
( 1 -2  4  1 )   ( 1  0    0  0 )   ( 1 -2  4     1 )
( 2  3 -1  2 ) = ( 2  1    0  0 ) ⋅ ( 0  7 -9     0 )
( 2  5  1  1 )   ( 2  9/7  1  0 )   ( 0  0  32/7 -1 )
( 3  1  3  1 )   ( 3  1    0  1 )   ( 0  0  0    -2 )

       A               L                   U

The Doolittle algorithm
computes the LU decomposition
using about half the number
of arithmetic operations:
≈ (1/3)·n³
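A short Python/NumPy sketch of the Doolittle factorization described above (the helper name doolittle_lu is illustrative): the multiplicators are stored below the diagonal of L, and the result reproduces the factorization A = L·U shown on this slide.

import numpy as np

def doolittle_lu(A):
    n = A.shape[0]
    U = A.astype(float).copy()
    L = np.eye(n)
    for k in range(n - 1):
        for i in range(k + 1, n):
            L[i, k] = U[i, k] / U[k, k]      # multiplicator goes into L
            U[i, k:] -= L[i, k] * U[k, k:]   # elimination step builds U
    return L, U

A = np.array([[1, -2,  4, 1],
              [2,  3, -1, 2],
              [2,  5,  1, 1],
              [3,  1,  3, 1]], dtype=float)
L, U = doolittle_lu(A)
print(np.allclose(L @ U, A))                 # True: A = L·U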
LU decomposition (factorization)

The LU decomposition of A
is useful for solving sequences of tasks A·xi = bi, i = 1, 2, ...,

if the new right-hand side bi can be determined only after

we have solved the previous linear systems A·xk = bk

for k < i.

To solve the system A·x = b then means to solve
L·y = b (forward substitution) and then U·x = y (back substitution).
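A minimal sketch of this two-stage solve, reusing the factors L and U from the Doolittle sketch above (the helper name solve_with_lu is illustrative); each new right-hand side then costs only about n² operations.

import numpy as np

def solve_with_lu(L, U, b):
    n = len(b)
    y = np.zeros(n)
    for i in range(n):                       # forward substitution: L y = b
        y[i] = b[i] - L[i, :i] @ y[:i]       # diagonal of L is 1
    x = np.zeros(n)
    for i in range(n - 1, -1, -1):           # back substitution: U x = y
        x[i] = (y[i] - U[i, i + 1:] @ x[i + 1:]) / U[i, i]
    return x

b = np.array([-6, 13, 8, 1], dtype=float)
print(solve_with_lu(L, U, b))                # [ 1.  1. -2.  3.]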


LU decomposition with partial pivoting

LU decomposition with partial pivoting


is a standard routine in every numerical library.

The input parameter is the matrix A.

The output consists of three matrices:


L, U and permutation matrix P
where
LU = PA.

The permutation matrix is obtained from the identity matrix


by the row exchanges.

The solution of Ax = b is given by the solution of

LUx = Pb   (show this).
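For comparison, a standard library routine (SciPy). Note the convention: scipy.linalg.lu returns P, L, U with A = P·L·U, i.e. Pᵀ·A = L·U, while lu_factor/lu_solve keep the pivoting information as a vector of row interchanges rather than a full matrix P.

import numpy as np
from scipy.linalg import lu, lu_factor, lu_solve

A = np.array([[1, -2,  4, 1],
              [2,  3, -1, 2],
              [2,  5,  1, 1],
              [3,  1,  3, 1]], dtype=float)
b = np.array([-6, 13, 8, 1], dtype=float)

P, L, U = lu(A)
print(np.allclose(P.T @ A, L @ U))    # True

lu_piv = lu_factor(A)                 # combined LU matrix + pivot index vector
print(lu_solve(lu_piv, b))            # [ 1.  1. -2.  3.]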
LU decomposition with partial pivoting

Instead of matrix P we work with


the vector of row permutations p.

At the beginning we put p = (1, 2, ..., n),

and during the elimination we just exchange the elements of vector p.


Gaussian elimination

Definition: We say that square matrix A


is diagonally dominant if

|aii| ≥ Σj≠i |aij|   for every row i.
Gaussian elimination

Definition: Symmetric matrix A is positive-definite if


for any non-zero vector x it holds
xT ⋅ A ⋅ x > 0

If A is regular then
AT ⋅ A
is positive-definite.
Gaussian elimination

Sylvester’s criterion: Symmetric matrix A


is positive-definite if and only if all its leading
principal minors are positive.
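A small illustrative check of Sylvester's criterion via the leading principal minors (a direct, not numerically robust, test; the helper name is illustrative).

import numpy as np

def is_positive_definite(A):
    n = A.shape[0]
    # leading principal minors: determinants of the top-left k×k submatrices
    return all(np.linalg.det(A[:k, :k]) > 0 for k in range(1, n + 1))

A = np.array([[4.0, 2.0],
              [2.0, 3.0]])          # minors: 4 > 0, 4*3 - 2*2 = 8 > 0
print(is_positive_definite(A))      # True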
Gaussian elimination

Gaussian elimination
is numerically stable for
diagonally dominant
or
positive-definite matrices.
Cholesky decomposition

Cholesky decomposition theorem: There is exactly one way to


decompose a positive-definite matrix A as

A = L ⋅ LT
where L is a lower triangular matrix with positive diagonal entries.
Cholesky algorithm:

The number of arithmetic operations: ≈ (1/3)·n³
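Since the algorithm itself appeared as a figure, here is a minimal Python sketch of the Cholesky factorization A = L·Lᵀ (the helper name cholesky is illustrative; numpy.linalg.cholesky computes the same factor).

import numpy as np

def cholesky(A):
    n = A.shape[0]
    L = np.zeros_like(A, dtype=float)
    for j in range(n):
        s = A[j, j] - L[j, :j] @ L[j, :j]
        L[j, j] = np.sqrt(s)                 # requires s > 0 (positive-definite A)
        for i in range(j + 1, n):
            L[i, j] = (A[i, j] - L[i, :j] @ L[j, :j]) / L[j, j]
    return L

A = np.array([[4.0, 2.0],
              [2.0, 3.0]])
L = cholesky(A)
print(np.allclose(L @ L.T, A))               # True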
The effect of round-off errors

Example:
On a hypothetical computer working in the decimal system
with a three-digit mantissa, solve the following system
The effect of round-off errors

GE with partial pivoting


ensures small residuals.

However, a small residual does not give


an estimate of the error of the solution.
Conditionality

In order to evaluate the conditionality of the problem


"to find the solution x of the system Ax = b"
we need to assess
the effect of changes of A and b
on the solution x.
Conditionality

Norm of a vector

The class of vector norms lp:

‖x‖p = ( |x1|^p + ... + |xn|^p )^(1/p)

Most often we use p = 1, p = 2, or p → ∞:

‖x‖1 = |x1| + ... + |xn|            (Manhattan)
‖x‖2 = ( x1² + ... + xn² )^(1/2)    (Euclidean)
‖x‖∞ = max |xi|                     (Chebyshev)
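The same three norms computed with NumPy:

import numpy as np

v = np.array([3.0, -4.0, 1.0])
print(np.linalg.norm(v, 1))        # 8.0    Manhattan:  sum of |v_i|
print(np.linalg.norm(v, 2))        # ~5.099 Euclidean:  sqrt of sum of v_i^2
print(np.linalg.norm(v, np.inf))   # 4.0    Chebyshev:  max of |v_i|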


Conditionality

Properties

1. ‖a·v‖ = |a|·‖v‖ (absolute homogeneity or absolute scalability)

2. ‖u + v‖ ≤ ‖u‖ + ‖v‖ (triangle inequality or subadditivity)

3. If ‖v‖ = 0 then v is the zero vector (separates points)


Conditionality

Condition number of a matrix

Let

M = max ‖Ax‖ / ‖x‖,   m = min ‖Ax‖ / ‖x‖,

where the maximum and minimum are considered over all nonzero x.


The ratio M/m is called the condition number of the matrix A:

cond(A) = M / m.
Conditionality

Let us consider the linear system


Ax = b
and another system obtained by changing the right-hand side:

A(x + δx) = b + δb,   or equivalently   A·δx = δb.

We can consider δb as an error of b


and
δx as the corresponding error of the solution x.
Conditionality

Because Ax = b and A·δx = δb,
from the definition of M and m
it follows that

‖b‖ ≤ M·‖x‖   and   ‖δb‖ ≥ m·‖δx‖,

so for m ≠ 0

‖δx‖ / ‖x‖ ≤ (M/m) · ‖δb‖ / ‖b‖ = cond(A) · ‖δb‖ / ‖b‖,

where δb corresponds to the residuum r = b − A·(x + δx) of the approximate solution x + δx.

The condition number of a matrix acts as


an amplifier of the relative error!

It is possible to show that changes in the matrix A


lead to similar effects on the result.
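A small NumPy/SciPy illustration of this amplification, using the notoriously ill-conditioned Hilbert matrix; the printed values are indicative only (they vary with the random perturbation).

import numpy as np
from scipy.linalg import hilbert

n = 10
A = hilbert(n)
x_exact = np.ones(n)
b = A @ x_exact

rng = np.random.default_rng(0)
db = rng.standard_normal(n)
db *= 1e-8 * np.linalg.norm(b) / np.linalg.norm(db)   # ||db|| / ||b|| = 1e-8

x = np.linalg.solve(A, b + db)

print(np.linalg.cond(A))                               # roughly 1e13
print(np.linalg.norm(x - x_exact) / np.linalg.norm(x_exact))
# the relative error of the solution is many orders of magnitude larger than 1e-8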
Conditionality

Norm of a matrix

The number M is also known as the norm of the matrix A:

‖A‖ = max ‖Ax‖ / ‖x‖   (over all nonzero x),

and it also holds that ‖Ax‖ ≤ ‖A‖·‖x‖.

Since m = 1 / ‖A⁻¹‖, we can equivalently write the condition number of the matrix as

cond(A) = ‖A‖ · ‖A⁻¹‖.
Conditionality

“Entrywise” norm of matrix


‖A‖p = ‖vec(A)‖p = ( Σi Σj |aij|^p )^(1/p)

‖A‖F = ( Σi Σj aij² )^(1/2)      Frobenius norm (consistent with l2)

‖A‖∞ = max over i,j of |aij|     max norm
Example: Find the solution of linear system in real numbers.

x+2y+ z- t = 1
2x+3y- z+2t = 3
4x+7y+ z = 5
5x+7y-4z+7t =10
Example: Find the solution of linear system in real numbers.

1 2 1 1 1 
x+2y+ z- t = 1  
2x+3y- z+2t = 3 2 3 1 2 3 
4x+7y+ z = 5 4 7 1 0 5
 
5x+7y-4z+7t =10 5 7 -4 7 10 

1 2 1 1 1 
 
0 1  3 4 1 
0 1  3 4 1 
 
0  3  9 12 5 

x+2y+ z- t = 1 1 2 1 1 1 
 
-y-3z+4t = 1 0 1  3 4 1 
0z+0t = 2 0 0 0 0 2
 
0t = 0 0 0 0 0 0 

The system has no solution.
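A quick NumPy check of this inconsistency via ranks (this anticipates the Frobenius theorem stated later in the lecture): the rank of the coefficient matrix differs from the rank of the augmented matrix.

import numpy as np

A = np.array([[1, 2,  1, -1],
              [2, 3, -1,  2],
              [4, 7,  1,  0],
              [5, 7, -4,  7]], dtype=float)
b = np.array([1, 3, 5, 10], dtype=float)

print(np.linalg.matrix_rank(A))                        # 2
print(np.linalg.matrix_rank(np.column_stack([A, b])))  # 3  ->  no solution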
Another example: Find the solution of linear system in rational numbers.

x-2y+4z+ t =-6
2x+3y- z+2t =13
3x+ y+3z+3t = 7
x+5y-5z+ t =19
Another example: Find the solution of linear system in rational numbers.

1  2 4 1 -6
x-2y+4z+ t =-6  
2x+3y- z+2t =13  2 3 1 2 13 
3x+ y+3z+3t = 7 3 1 3 3 7
 
x+5y-5z+ t =19 1 5 -5 1 19 

1  2 4 1 -6 
 
0 7  9 0 25 
0 7  9 0 25 
 
0 7  9 0 25 

x-2y+4z+ t =-6 1  2 4 1 - 6
 
7y-9z+0t =25 0 7  9 0 25 
0z+0t = 0 0 0 0 0 0
 
0t = 0 0 0 0 0 0 

Another example: Find the solution of linear system in rational numbers.

1  2 4 1 - 6
x-2y+4z+ t =-6  
7y-9z+0t =25 0 7  9 0 25 
0z+0t = 0 0 0 0 0 0
 
0t = 0 0 0 0 0 0 

System has infinite number of solutions

t = p, z = q, p,q  Q
25 9 8 10
y  q x   qp
7 7 7 7

 8 10 25 9  
S    q  p,  q, q, p , p, q  Q 
 7 7 7 7  
  2 34 
For example: if q = 1, p = 0, we get particular solution  , ,1,0 
 7 7 
System of linear equations

Frobenius theorem: A system of nonhomogeneous linear equations


has a solution
if and only if,
the rank of coefficient matrix
is equal to
the rank of augmented matrix.

Consequence 1: If h(A) = h(A´) = n (n is the number of unknowns),


then the system has just one solution.

Consequence 2: If h(A) = h(A´) < n (n is the number of unknowns),


then the system has an infinite number of solutions
and
n − h(A) unknowns can be chosen freely.

Consequence 3: If h(A) ≠ h(A´), then the system has no solution.


System of homogeneous linear equations

x1·a11 + ... + xn·a1n = 0
x1·a21 + ... + xn·a2n = 0
...
x1·am1 + ... + xn·amn = 0

The system always has a solution because h(A) = h(A´).

Theorem: A system of homogeneous equations has


a nontrivial solution
if and only if h(A) < n.
(The trivial solution is (0, 0, ..., 0).)
Example: Find the solution of linear system in rational numbers.

x-2y+4z+ t =0 1  2 4 1 0
 
2x+3y- z+2t =0  2 3 1 2 0
3x+ y+3z+3t =0 3 1 3 3 0
 
x+5y-5z+ t =0 1 5 -5 1 0 

1  2 4 1 0
 
0 7  9 0 0
0 7  9 0 0
 
0 7  9 0 0 

x-2y+4z+ t =0 1 2 4 1 0
 
7y-9z+0t =0 0 7 9 0 0
0z+0t =0 0 0 0 0 0
 
0 0 0 0 0 
0t =0 
Example: Find the solution of linear system in rational numbers.

x-2y+4z+ t =0
7y-9z+0t =0
0z+0t =0
0t =0

t = p, z = q,  p, q ∈ Q

y = (9/7)·q

x = −(10/7)·q − p

S = { ( −(10/7)·q − p,  (9/7)·q,  q,  p ) : p, q ∈ Q }
Summary of notions

• homogeneous and nonhomogeneous system of linear


equations
• coefficient matrix and augmented matrix
• elementary row operations
• row equivalent matrices
• rank of a matrix
• upper and lower triangular matrix
• Gaussian elimination (pivoting)
• LU decomposition (Doolittle method)
• Cholesky decomposition
• Frobenius theorem
• solution of homogeneous linear systems
