Professional Documents
Culture Documents
Department of Mathematics
IIT Guwahati
March-June 2023
1 / 37
Outline
Topics:
Matrix-vector multiplication
Matrix-matrix multiplication
Block matrices
Outer product of vectors
Elementary matrices
Gauss-Jordan method
2 / 37
Matrices
Definition: A matrix is an array of numbers. An m × n matrix A has m rows and n
columns and is of the form
a11 a12 · · · a1n
a21 a22 · · · a2n
A= . .. .
. ..
. . ··· .
am1 am2 · · · amn
3 / 37
Matrices
Definition: A matrix is an array of numbers. An m × n matrix A has m rows and n
columns and is of the form
a11 a12 · · · a1n
a21 a22 · · · a2n
A= . .. .
. ..
. . ··· .
am1 am2 · · · amn
a1j
..
The j-th column of A : aj := . for j = 1 : n.
amj
The i-th row of A: âi := ai1 ai2 · · · ain for i = 1 : m.
3 / 37
Matrices
Definition: A matrix is an array of numbers. An m × n matrix A has m rows and n
columns and is of the form
a11 a12 · · · a1n
a21 a22 · · · a2n
A= . .. .
. ..
. . ··· .
am1 am2 · · · amn
a1j
..
The j-th column of A : aj := . for j = 1 : n.
amj
The i-th row of A: âi := ai1 ai2 · · · ain for i = 1 : m. Then
a11 · · · a1n − â1 −
A = ... .. .. = a a · · · a = ..
.
. . 1 2 n .
am1 · · · amn − âm −
3 / 37
Matrices
An m × n matrix said to be a square matrix if m = n.
4 / 37
Matrices
An m × n matrix said to be a square matrix if m = n. An m × n matrix D := [dij ] is
said to be a diagonal matrix if dij = 0 for all i 6= j.
4 / 37
Matrices
An m × n matrix said to be a square matrix if m = n. An m × n matrix D := [dij ] is
said to be a diagonal matrix if dij = 0 for all i 6= j. An n × n diagonal matrix D with
diagonal entries d1 , . . . , dn is given by
d1
D = diag(d1 , . . . , dn ) =
.. .
.
dn
Identity matrix: An n × n diagonal matrix with all diagonal entries equal to 1 is called
the identity matrix and is denoted by In or I .
4 / 37
Matrices
An m × n matrix said to be a square matrix if m = n. An m × n matrix D := [dij ] is
said to be a diagonal matrix if dij = 0 for all i 6= j. An n × n diagonal matrix D with
diagonal entries d1 , . . . , dn is given by
d1
D = diag(d1 , . . . , dn ) =
.. .
.
dn
Identity matrix: An n × n diagonal matrix with all diagonal entries equal to 1 is called
the identity matrix and is denoted by In or I .
Zero matrix: An m × n matrix with all entries 0 is called the zero matrix and is
denoted by Om×n or simply by O.
4 / 37
Matrices
An m × n matrix said to be a square matrix if m = n. An m × n matrix D := [dij ] is
said to be a diagonal matrix if dij = 0 for all i 6= j. An n × n diagonal matrix D with
diagonal entries d1 , . . . , dn is given by
d1
D = diag(d1 , . . . , dn ) =
.. .
.
dn
Identity matrix: An n × n diagonal matrix with all diagonal entries equal to 1 is called
the identity matrix and is denoted by In or I .
Zero matrix: An m × n matrix with all entries 0 is called the zero matrix and is
denoted by Om×n or simply by O.
1 0 0 0 0 0 0
Example: I := 0 1 0 and O := 0 0 0 0 .
0 0 1 0 0 0 0
4 / 37
Matrix addition and scalar multiplication
Let Fm×n denote the set of all m × n matrices with entries in F by where F = R or
F = C.
5 / 37
Matrix addition and scalar multiplication
Let Fm×n denote the set of all m × n matrices with entries in F by where F = R or
F = C. Let A := [aij ] and B := [bij ] be matrices ∈ Fm×n and α ∈ F.
5 / 37
Matrix addition and scalar multiplication
Let Fm×n denote the set of all m × n matrices with entries in F by where F = R or
F = C. Let A := [aij ] and B := [bij ] be matrices ∈ Fm×n and α ∈ F.
5 / 37
Matrix addition and scalar multiplication
Let Fm×n denote the set of all m × n matrices with entries in F by where F = R or
F = C. Let A := [aij ] and B := [bij ] be matrices ∈ Fm×n and α ∈ F.
5 / 37
Matrix addition and scalar multiplication
Let Fm×n denote the set of all m × n matrices with entries in F by where F = R or
F = C. Let A := [aij ] and B := [bij ] be matrices ∈ Fm×n and α ∈ F.
5 / 37
Matrix addition and scalar multiplication
Let Fm×n denote the set of all m × n matrices with entries in F by where F = R or
F = C. Let A := [aij ] and B := [bij ] be matrices ∈ Fm×n and α ∈ F.
5 / 37
Properties of matrix addition and scalar multiplication
6 / 37
Properties of matrix addition and scalar multiplication
6 / 37
Properties of matrix addition and scalar multiplication
6 / 37
Properties of matrix addition and scalar multiplication
6 / 37
Properties of matrix addition and scalar multiplication
Proof: Exercise.
6 / 37
Transpose and Conjugate transpose
Transpose: The transpose of an m × n matrix A = [aij ]m×n is the n × m matrix
denoted by A> and is given by A> = [aji ]n×m .
7 / 37
Transpose and Conjugate transpose
Transpose: The transpose of an m × n matrix A = [aij ]m×n is the n × m matrix
denoted by A> and is given by A> = [aji ]n×m .
>
1 2 >
1 4 7 1 + i 2 1 + i 3
Example: 4 5 = and =
2 5 8 3 4 + 5i 2 4 + 5i
7 8
7 / 37
Transpose and Conjugate transpose
Transpose: The transpose of an m × n matrix A = [aij ]m×n is the n × m matrix
denoted by A> and is given by A> = [aji ]n×m .
>
1 2 >
1 4 7 1 + i 2 1 + i 3
Example: 4 5 = and =
2 5 8 3 4 + 5i 2 4 + 5i
7 8
7 / 37
Transpose and Conjugate transpose
Transpose: The transpose of an m × n matrix A = [aij ]m×n is the n × m matrix
denoted by A> and is given by A> = [aji ]n×m .
>
1 2 >
1 4 7 1 + i 2 1 + i 3
Example: 4 5 = and =
2 5 8 3 4 + 5i 2 4 + 5i
7 8
(a) (A + B)> = A> + B > (b) (αA)> = αA> and (αA)∗ = ᾱA∗ (c) (A> )> = A.
8 / 37
Transpose and conjugate transpose
(a) (A + B)> = A> + B > (b) (αA)> = αA> and (αA)∗ = ᾱA∗ (c) (A> )> = A.
8 / 37
Transpose and conjugate transpose
(a) (A + B)> = A> + B > (b) (αA)> = αA> and (αA)∗ = ᾱA∗ (c) (A> )> = A.
8 / 37
Transpose and conjugate transpose
(a) (A + B)> = A> + B > (b) (αA)> = αA> and (αA)∗ = ᾱA∗ (c) (A> )> = A.
8 / 37
Transpose and conjugate transpose
(a) (A + B)> = A> + B > (b) (αA)> = αA> and (αA)∗ = ᾱA∗ (c) (A> )> = A.
Remark: Let A := [aij ]n×n . If A> = −A then ajj = 0 for j = 1 : n. On the other hand,
if A∗ = −A then Re(ajj ) = 0 for j = 1 : n.
8 / 37
Linear combination
Let u and v be vectors in Fn . Let α and β be in F. Adding αu and βv gives the linear
combination αu + βv.
Example: Let u := [1, 1, −1]> , v := [2, 3, 4]> and w := [4, 5, 2]> . Then w = 2u + v.
Thus w is a linear combination of u and v.
9 / 37
Linear combination
Let u and v be vectors in Fn . Let α and β be in F. Adding αu and βv gives the linear
combination αu + βv.
Example: Let u := [1, 1, −1]> , v := [2, 3, 4]> and w := [4, 5, 2]> . Then w = 2u + v.
Thus w is a linear combination of u and v.
9 / 37
Linear combination
Let u and v be vectors in Fn . Let α and β be in F. Adding αu and βv gives the linear
combination αu + βv.
Example: Let u := [1, 1, −1]> , v := [2, 3, 4]> and w := [4, 5, 2]> . Then w = 2u + v.
Thus w is a linear combination of u and v.
9 / 37
Matrix-vector multiplication
Let A := a1 · · · an ∈ Fm×n and x := [x1 , . . . , xn ]> ∈ Fn . We define the
10 / 37
Matrix-vector multiplication
Let A := a1 · · · an ∈ Fm×n and x := [x1 , . . . , xn ]> ∈ Fn . We define the
xn
10 / 37
Matrix-vector multiplication
Let A := a1 · · · an ∈ Fm×n and x := [x1 , . . . , xn ]> ∈ Fn . We define the
xn
10 / 37
Matrix-vector multiplication
Let A := a1 · · · an ∈ Fm×n and x := [x1 , . . . , xn ]> ∈ Fn . We define the
xn
10 / 37
Matrix-vector multiplication
A row vector ai1 · · · ain is a 1 × n matrix. Therefore
x1
.
ai1 · · · ain .. = ai1 x1 + · · · + ain xn .
xn
11 / 37
Matrix-vector multiplication
A row vector ai1 · · · ain is a 1 × n matrix. Therefore
x1
.
ai1 · · · ain .. = ai1 x1 + · · · + ain xn .
xn
11 / 37
Matrix-vector multiplication
a11 · · · a1n x1 a11 a1n
.. .. .. . .
More generally . ··· . . = x1 .. + · · · + xn ..
am1 · · · amn xn a amn
m1
a11 x1 + · · · + a1n xn a11 · · · a1n x
= .. ..
= .
. .
am1 x1 + · · · + amn xn am1 · · · amn x
12 / 37
Matrix-vector multiplication
a11 · · · a1n x1 a11 a1n
.. .. .. . .
More generally . ··· . . = x1 .. + · · · + xn ..
am1 · · · amn xn a amn
m1
a11 x1 + · · · + a1n xn a11 · · · a1n x
= .. ..
= .
. .
am1 x1 + · · · + amn xn am1 · · · amn x
− â1 −
Writing A :=
a1 · · ·
an and A = ..
, we have
.
− âm −
a11 x1 + · · · + a1n xn â1 x
Ax = x1 a1 + · · · + xn an = .. ..
= . .
.
am1 x1 + · · · + amn xn âm x
12 / 37
Matrix-matrix multiplication
Let A ∈ Fm×n and B := ∈ Fn×p .
b1 · · · bp
Definition: Define the matrix-matrix multiplication AB by
AB := Ab1 · · · Abp .
13 / 37
Matrix-matrix multiplication
Let A ∈ Fm×n and B := ∈ Fn×p .
b1 · · · bp
Definition: Define the matrix-matrix multiplication AB by
AB := Ab1 · · · Abp .
Reason: Define AB to be the m × p matrix such that (AB)x = A(Bx) for all x ∈ Fp .
13 / 37
Matrix-matrix multiplication
Let A ∈ Fm×n and B := ∈ Fn×p .
b1 · · · bp
Definition: Define the matrix-matrix multiplication AB by
AB := Ab1 · · · Abp .
Reason: Define AB to be the m × p matrix such that (AB)x = A(Bx) for all x ∈ Fp .
Let C := AB be given by C = c1 · · · cp . Let ej ∈ Fp be the standard unit vector.
Then for j = 1 : p, we have Bej = bj and
13 / 37
Matrix-matrix multiplication
Let A ∈ Fm×n and B := ∈ Fn×p .
b1 · · · bp
Definition: Define the matrix-matrix multiplication AB by
AB := Ab1 · · · Abp .
Reason: Define AB to be the m × p matrix such that (AB)x = A(Bx) for all x ∈ Fp .
Let C := AB be given by C = c1 · · · cp . Let ej ∈ Fp be the standard unit vector.
Then for j = 1 : p, we have Bej = bj and
cj = C ej = (AB)ej = A(Bej ) = Abj =⇒ C = Ab1 · · · Abp .
13 / 37
Matrix-matrix multiplication
Let A ∈ Fm×n and B := ∈ Fn×p .
b1 · · · bp
Definition: Define the matrix-matrix multiplication AB by
AB := Ab1 · · · Abp .
Reason: Define AB to be the m × p matrix such that (AB)x = A(Bx) for all x ∈ Fp .
Let C := AB be given by C = c1 · · · cp . Let ej ∈ Fp be the standard unit vector.
Then for j = 1 : p, we have Bej = bj and
cj = C ej = (AB)ej = A(Bej ) = Abj =⇒ C = Ab1 · · · Abp .
13 / 37
Matrix-matrix
multiplication
−â1 −
..
∈ Fm×n , B := b1 · · · ∈ Fn×p . Then
Let A = bp
.
−âm −
â1 b1 · · · â1 bp â1 B
. .. = .. .
AB = Ab1 · · · Abp = .. ··· . .
âm b1 · · · â1 bp âm B
14 / 37
Matrix-matrix
multiplication
−â1 −
..
∈ Fm×n , B := b1 · · · ∈ Fn×p . Then
Let A = bp
.
−âm −
â1 b1 · · · â1 bp â1 B
. .. = .. .
AB = Ab1 · · · Abp = .. ··· . .
âm b1 · · · â1 bp âm B
14 / 37
Matrix-matrix
multiplication
−â1 −
..
∈ Fm×n , B := b1 · · · ∈ Fn×p . Then
Let A = bp
.
−âm −
â1 b1 · · · â1 bp â1 B
. .. = .. .
AB = Ab1 · · · Abp = .. ··· . .
âm b1 · · · â1 bp âm B
15 / 37
Example
4 −1
1 3 2
Let A = and B := 1 2 . Then
0 −1 1
3 0
4 −1
1 3 2 13 1 3 2 5
Ab1 = 1 = and Ab2 = 2 = .
0 −1 1 2 0 −1 1 −2
3 0
15 / 37
Example
4 −1
1 3 2
Let A = and B := 1 2 . Then
0 −1 1
3 0
4 −1
1 3 2 13 1 3 2 5
Ab1 = 1 = and Ab2 = 2 = .
0 −1 1 2 0 −1 1 −2
3 0
13 5
Therefore AB = Ab1 Ab2 = .
2 −2
15 / 37
Example
4 −1
1 3 2
Let A = and B := 1 2 . Then
0 −1 1
3 0
4 −1
1 3 2 13 1 3 2 5
Ab1 = 1 = and Ab2 = 2 = .
0 −1 1 2 0 −1 1 −2
3 0
13 5
Therefore AB = Ab1 Ab2 = . On the other hand
2 −2
4 −1 4 −1
a1 B = 1
b 3 2 1 2 = 13 5 a2 B = 0
and b −1 1 1 2 = 2 −2 .
3 0 3 0
15 / 37
Example
4 −1
1 3 2
Let A = and B := 1 2 . Then
0 −1 1
3 0
4 −1
1 3 2 13 1 3 2 5
Ab1 = 1 = and Ab2 = 2 = .
0 −1 1 2 0 −1 1 −2
3 0
13 5
Therefore AB = Ab1 Ab2 = . On the other hand
2 −2
4 −1 4 −1
a1 B = 1
b 3 2 1 2 = 13 5 a2 B = 0
and b −1 1 1 2 = 2 −2 .
3 0 3 0
a1 B 13 5
Therefore AB = = = Ab1 Ab2 .
b
a2 B
b 2 −2
15 / 37
Block matrices
Definition: An m × n block matrix (or a partitioned matrix) is a matrix of the form
A11 · · · A1n
A := ... ..
··· .
Am1 · · · Amn
16 / 37
Block matrices
Definition: An m × n block matrix (or a partitioned matrix) is a matrix of the form
A11 · · · A1n
A := ... ..
··· .
Am1 · · · Amn
Amj
column of A.
16 / 37
Block matrices
Definition: An m × n block matrix (or a partitioned matrix) is a matrix of the form
A11 · · · A1n
A := ... ..
··· .
Am1 · · · Amn
Amj
column of A.
1 2 2 0 1 4
3 4 1 2 3 5
Example:
5 7
has 2 block rows and 3 block columns.
2 7 8 8
3 4 1 9 2 2
16 / 37
Block matrix operations
Block matrix addition: Let A := [Aij ]m×n and B := [Bij ]m×n be block matrices such
that size of Aij = size of Bij for i = 1 : m and j = 1 : n. Then A + B := [Aij + Bij ]m×n .
17 / 37
Block matrix operations
Block matrix addition: Let A := [Aij ]m×n and B := [Bij ]m×n be block matrices such
that size of Aij = size of Bij for i = 1 : m and j = 1 : n. Then A + B := [Aij + Bij ]m×n .
17 / 37
Block matrix operations
Block matrix addition: Let A := [Aij ]m×n and B := [Bij ]m×n be block matrices such
that size of Aij = size of Bij for i = 1 : m and j = 1 : n. Then A + B := [Aij + Bij ]m×n .
17 / 37
Block matrix operations
Block matrix addition: Let A := [Aij ]m×n and B := [Bij ]m×n be block matrices such
that size of Aij = size of Bij for i = 1 : m and j = 1 : n. Then A + B := [Aij + Bij ]m×n .
17 / 37
Block matrix multiplication
Example:
1 1 1 1
1 1 1 1 1 8 6 4 5
2 2 2 1 1
= 10 9
1 1
3 6 7
1 1 1
3 3 2 2 18 15 10 12
3 2 1 2
1 1 1 1
1 1 1 1 1 8 6 4 5
2 2 2 1 1
1 1 3
= 10 9 6 7
1 1 1
3 3 2 2 18 15 10 12
3 2 1 2
18 / 37
Outer product
Given two vectors x and y in Rn , we have seen that y> x is the inner product given by
hx, yi = x1 y1 + · · · + xn yn = y> x.
19 / 37
Outer product
Given two vectors x and y in Rn , we have seen that y> x is the inner product given by
hx, yi = x1 y1 + · · · + xn yn = y> x.
19 / 37
Outer product
> >
Example: If x := 4 1 3 and y := 3 5 2 then
4 12 20 8
xy> = 1 3 5 2 = 3 5 2 .
3 9 15 6
20 / 37
Outer product
> >
Example: If x := 4 1 3 and y := 3 5 2 then
4 12 20 8
xy> = 1 3 5 2 = 3 5 2 .
3 9 15 6
20 / 37
Outer product
> >
Example: If x := 4 1 3 and y := 3 5 2 then
4 12 20 8
xy> = 1 3 5 2 = 3 5 2 .
3 9 15 6
20 / 37
Properties of matrix multiplication
Thoerm: Let A, B and C be matrices (whose sizes are such that the indicated
operations can be performed) and let α be a scalar. Then
21 / 37
Properties of matrix multiplication
Thoerm: Let A, B and C be matrices (whose sizes are such that the indicated
operations can be performed) and let α be a scalar. Then
1 Associative Law: (AB)C = A(BC )
2 Left Distributive Law: A(B + C ) = AB + AC
3 Right Distributive Law: (A + B)C = AC + BC
21 / 37
Properties of matrix multiplication
Thoerm: Let A, B and C be matrices (whose sizes are such that the indicated
operations can be performed) and let α be a scalar. Then
1 Associative Law: (AB)C = A(BC )
2 Left Distributive Law: A(B + C ) = AB + AC
3 Right Distributive Law: (A + B)C = AC + BC
4 Scalar multiplication: α(AB) = (αA)B = A(αB)
5 Multiplicative identity: If A is an m × n matrix then Im A = A = AIn .
21 / 37
Properties of matrix multiplication
Thoerm: Let A, B and C be matrices (whose sizes are such that the indicated
operations can be performed) and let α be a scalar. Then
1 Associative Law: (AB)C = A(BC )
2 Left Distributive Law: A(B + C ) = AB + AC
3 Right Distributive Law: (A + B)C = AC + BC
4 Scalar multiplication: α(AB) = (αA)B = A(αB)
5 Multiplicative identity: If A is an m × n matrix then Im A = A = AIn .
ai := e>
Proof: Let b i A and cj := C ej . Then for all i and j, we have
[(AB)C ]ij = (b
ai B)cj = b
ai Bcj and [A(BC )]ij = b
ai (Bcj ) = b
ai Bcj .
21 / 37
Properties of matrix multiplication
Thoerm: Let A, B and C be matrices (whose sizes are such that the indicated
operations can be performed) and let α be a scalar. Then
1 Associative Law: (AB)C = A(BC )
2 Left Distributive Law: A(B + C ) = AB + AC
3 Right Distributive Law: (A + B)C = AC + BC
4 Scalar multiplication: α(AB) = (αA)B = A(αB)
5 Multiplicative identity: If A is an m × n matrix then Im A = A = AIn .
ai := e>
Proof: Let b i A and cj := C ej . Then for all i and j, we have
[(AB)C ]ij = (b
ai B)cj = b
ai Bcj and [A(BC )]ij = b
ai (Bcj ) = b
ai Bcj .
21 / 37
The inverse of a matrix
Definition: An n × n matrix A is said to be invertible if there exists a matrix B such
that AB = In = BA. The matrix B is called an inverse of A.
22 / 37
The inverse of a matrix
Definition: An n × n matrix A is said to be invertible if there exists a matrix B such
that AB = In = BA. The matrix B is called an inverse of A.
Exercise: Show that inverse of an invertible matrix A is unique.
22 / 37
The inverse of a matrix
Definition: An n × n matrix A is said to be invertible if there exists a matrix B such
that AB = In = BA. The matrix B is called an inverse of A.
Exercise: Show that inverse of an invertible matrix A is unique.
22 / 37
The inverse of a matrix
Definition: An n × n matrix A is said to be invertible if there exists a matrix B such
that AB = In = BA. The matrix B is called an inverse of A.
Exercise: Show that inverse of an invertible matrix A is unique.
We denote the inverse of A by A−1 . Note that if A is invertible then the system
Ax = b has a unique solution x = A−1 b.
22 / 37
The inverse of a matrix
Definition: An n × n matrix A is said to be invertible if there exists a matrix B such
that AB = In = BA. The matrix B is called an inverse of A.
Exercise: Show that inverse of an invertible matrix A is unique.
We denote the inverse of A by A−1 . Note that if A is invertible then the system
Ax = b has a unique solution x = A−1 b.
2 5
The matrix A = is invertible since
1 3
2 5 3 −5 1 0 3 −5 2 5
= = .
1 3 −1 2 0 1 −1 2 1 3
22 / 37
The inverse of a matrix
Definition: An n × n matrix A is said to be invertible if there exists a matrix B such
that AB = In = BA. The matrix B is called an inverse of A.
Exercise: Show that inverse of an invertible matrix A is unique.
We denote the inverse of A by A−1 . Note that if A is invertible then the system
Ax = b has a unique solution x = A−1 b.
2 5
The matrix A = is invertible since
1 3
2 5 3 −5 1 0 3 −5 2 5
= = .
1 3 −1 2 0 1 −1 2 1 3
22 / 37
The inverse of a matrix
Definition: An n × n matrix A is said to be invertible if there exists a matrix B such
that AB = In = BA. The matrix B is called an inverse of A.
Exercise: Show that inverse of an invertible matrix A is unique.
We denote the inverse of A by A−1 . Note that if A is invertible then the system
Ax = b has a unique solution x = A−1 b.
2 5
The matrix A = is invertible since
1 3
2 5 3 −5 1 0 3 −5 2 5
= = .
1 3 −1 2 0 1 −1 2 1 3
23 / 37
Properties of invertible matrices
Theorem: Let A and B be n × n invertible matrices.
1 The matrix A−1 is also invertible, and (A−1 )−1 = A.
2 If c 6= 0 then cA is also invertible, and (cA)−1 = c1 A−1 .
3 The matrix AB is invertible, and (AB)−1 = B −1 A−1 .
4 The matrix A> is invertible, and (A> )−1 = (A−1 )> .
23 / 37
Properties of invertible matrices
Theorem: Let A and B be n × n invertible matrices.
1 The matrix A−1 is also invertible, and (A−1 )−1 = A.
2 If c 6= 0 then cA is also invertible, and (cA)−1 = c1 A−1 .
3 The matrix AB is invertible, and (AB)−1 = B −1 A−1 .
4 The matrix A> is invertible, and (A> )−1 = (A−1 )> .
5 For any non-negative integer k, the matrix Ak is invertible, and (Ak )−1 = (A−1 )k .
23 / 37
Properties of invertible matrices
Theorem: Let A and B be n × n invertible matrices.
1 The matrix A−1 is also invertible, and (A−1 )−1 = A.
2 If c 6= 0 then cA is also invertible, and (cA)−1 = c1 A−1 .
3 The matrix AB is invertible, and (AB)−1 = B −1 A−1 .
4 The matrix A> is invertible, and (A> )−1 = (A−1 )> .
5 For any non-negative integer k, the matrix Ak is invertible, and (Ak )−1 = (A−1 )k .
a b
Let A := . If ad − bc 6= 0 then A is invertible, and
c d
−1 1 d −b
A = .
ad − bc −c a
23 / 37
Elementary matrices
Type Operation Inverse operation
I Ri ←− α Ri Ri ←− α1 Ri
II Rj ←− cRi + Rj Rj ←− −cRi + Rj
III Ri ↔ Rj Ri ↔ Rj
24 / 37
Elementary matrices
Type Operation Inverse operation
I Ri ←− α Ri Ri ←− α1 Ri
II Rj ←− cRi + Rj Rj ←− −cRi + Rj
III Ri ↔ Rj Ri ↔ Rj
24 / 37
Elementary matrices
Type Operation Inverse operation
I Ri ←− α Ri Ri ←− α1 Ri
II Rj ←− cRi + Rj Rj ←− −cRi + Rj
III Ri ↔ Rj Ri ↔ Rj
24 / 37
Elementary matrices
Type Operation Inverse operation
I Ri ←− α Ri Ri ←− α1 Ri
II Rj ←− cRi + Rj Rj ←− −cRi + Rj
III Ri ↔ Rj Ri ↔ Rj
24 / 37
Elementary matrices
Type Operation Inverse operation
I Ri ←− α Ri Ri ←− α1 Ri
II Rj ←− cRi + Rj Rj ←− −cRi + Rj
III Ri ↔ Rj Ri ↔ Rj
25 / 37
Type II elementary matrices
Let E13 (2) be the matrix obtained by performing R3 ← 2R1 + R3 on I3 . Then
1 0 0
E13 (2) := 0 1 0
2 0 1
25 / 37
Type II elementary matrices
Let E13 (2) be the matrix obtained by performing R3 ← 2R1 + R3 on I3 . Then
1 0 0
E13 (2) := 0 1 0
2 0 1
25 / 37
Type III elementary matrices
Let Eij be the matrix obtained by performing Ri ↔ Rj on the identity matrix I . Then
1 0 0 x1 x1 Row1 (A)
E23 = 0 0 1 =⇒ E23 x2 = x3 and E23 A = Row3 (A) .
0 1 0 x3 x2 Row2 (A)
26 / 37
Type III elementary matrices
Let Eij be the matrix obtained by performing Ri ↔ Rj on the identity matrix I . Then
1 0 0 x1 x1 Row1 (A)
E23 = 0 0 1 =⇒ E23 x2 = x3 and E23 A = Row3 (A) .
0 1 0 x3 x2 Row2 (A)
The inverse of a Type III elementary matrix is a Type III elementary matrix
26 / 37
Type III elementary matrices
Let Eij be the matrix obtained by performing Ri ↔ Rj on the identity matrix I . Then
1 0 0 x1 x1 Row1 (A)
E23 = 0 0 1 =⇒ E23 x2 = x3 and E23 A = Row3 (A) .
0 1 0 x3 x2 Row2 (A)
The inverse of a Type III elementary matrix is a Type III elementary matrix
Theorem:
Type I: Multiplying Ei (c) to A giving Ei (c)A amounts to performing the row
operation Ri ←− cRi on A.
26 / 37
Type III elementary matrices
Let Eij be the matrix obtained by performing Ri ↔ Rj on the identity matrix I . Then
1 0 0 x1 x1 Row1 (A)
E23 = 0 0 1 =⇒ E23 x2 = x3 and E23 A = Row3 (A) .
0 1 0 x3 x2 Row2 (A)
The inverse of a Type III elementary matrix is a Type III elementary matrix
Theorem:
Type I: Multiplying Ei (c) to A giving Ei (c)A amounts to performing the row
operation Ri ←− cRi on A.
Type II: Multiplying Eij (c) to A giving Eij (c)A amounts to performing the row
operation Rj ←− cRi + Rj on A.
26 / 37
Type III elementary matrices
Let Eij be the matrix obtained by performing Ri ↔ Rj on the identity matrix I . Then
1 0 0 x1 x1 Row1 (A)
E23 = 0 0 1 =⇒ E23 x2 = x3 and E23 A = Row3 (A) .
0 1 0 x3 x2 Row2 (A)
The inverse of a Type III elementary matrix is a Type III elementary matrix
Theorem:
Type I: Multiplying Ei (c) to A giving Ei (c)A amounts to performing the row
operation Ri ←− cRi on A.
Type II: Multiplying Eij (c) to A giving Eij (c)A amounts to performing the row
operation Rj ←− cRi + Rj on A.
Type III: Multiplying Eij to A giving Eij A amounts to performing the row
operation Ri ←→ Rj on A.
26 / 37
Row echelon form and elementary matrices
Recall that two m × n matrices A and B are said to be row equivalent if A can be
transformed to B by elementary row operations.
27 / 37
Row echelon form and elementary matrices
Recall that two m × n matrices A and B are said to be row equivalent if A can be
transformed to B by elementary row operations.
Row echelon form: Let ref(A) be a row echelon form of A. Then there exist elementary
matrices E1 , . . . , E` such that ref(A) = E` · · · E2 E1 A.
27 / 37
Row echelon form and elementary matrices
Recall that two m × n matrices A and B are said to be row equivalent if A can be
transformed to B by elementary row operations.
Row echelon form: Let ref(A) be a row echelon form of A. Then there exist elementary
matrices E1 , . . . , E` such that ref(A) = E` · · · E2 E1 A.
Reduced row echelon form: Let rref(A) be the reduced row echelon form of A.
27 / 37
Row echelon form and elementary matrices
Recall that two m × n matrices A and B are said to be row equivalent if A can be
transformed to B by elementary row operations.
Row echelon form: Let ref(A) be a row echelon form of A. Then there exist elementary
matrices E1 , . . . , E` such that ref(A) = E` · · · E2 E1 A.
Reduced row echelon form: Let rref(A) be the reduced row echelon form of A. Then
there exist elementary matrices Ê1 , . . . , Êp such that
27 / 37
Row echelon form and elementary matrices
Recall that two m × n matrices A and B are said to be row equivalent if A can be
transformed to B by elementary row operations.
Row echelon form: Let ref(A) be a row echelon form of A. Then there exist elementary
matrices E1 , . . . , E` such that ref(A) = E` · · · E2 E1 A.
Reduced row echelon form: Let rref(A) be the reduced row echelon form of A. Then
there exist elementary matrices Ê1 , . . . , Êp such that
Fact: Let : [U | d] := ref([A | b]). Then there is an invertible matrix E such that
E [A | b] = [U | d]. Hence the system Ax = b and Ux = d are equivalent.
27 / 37
Gauss-Jordan method for computing inverse
Fact: Let A be an n × n matrix. If there exist elementary matrices E1 , . . . , Ep such that
Ep · · · E2 E1 A = In then A−1 = Ep · · · E2 E1 . Hence rref(A) = In ⇐⇒ A is invertible.
28 / 37
Gauss-Jordan method for computing inverse
Fact: Let A be an n × n matrix. If there exist elementary matrices E1 , . . . , Ep such that
Ep · · · E2 E1 A = In then A−1 = Ep · · · E2 E1 . Hence rref(A) = In ⇐⇒ A is invertible.
28 / 37
Gauss-Jordan method for computing inverse
Fact: Let A be an n × n matrix. If there exist elementary matrices E1 , . . . , Ep such that
Ep · · · E2 E1 A = In then A−1 = Ep · · · E2 E1 . Hence rref(A) = In ⇐⇒ A is invertible.
28 / 37
Gauss-Jordan method for computing inverse
Fact: Let A be an n × n matrix. If there exist elementary matrices E1 , . . . , Ep such that
Ep · · · E2 E1 A = In then A−1 = Ep · · · E2 E1 . Hence rref(A) = In ⇐⇒ A is invertible.
Gauss-Jordan method:
28 / 37
Gauss-Jordan
method
1 2 3
Let A := 2 5 7 . Then [A | I ] → [I | A−1 ] gives
3 7 9
29 / 37
Gauss-Jordan
method
1 2 3
Let A := 2 5 7 . Then [A | I ] → [I | A−1 ] gives
3 7 9
1 2 3 1 0 0 1 2 3 1 0 0
2 5 7 0 1 0 → 0 1 1 −2 1 0 →
3 7 9 0 0 1 0 1 0 −3 0 1
29 / 37
Gauss-Jordan
method
1 2 3
Let A := 2 5 7 . Then [A | I ] → [I | A−1 ] gives
3 7 9
1 2 3 1 0 0 1 2 3 1 0 0
2 5 7 0 1 0 → 0 1 1 −2 1 0 →
3 7 9 0 0 1 0 1 0 −3 0 1
1 0 1 5 −2 0 1 0 0 4 −3 1
0 1 1 −2 1 0 → 0 1 0 −3 0 1
0 0 −1 −1 −1 1 0 0 1 1 1 −1
29 / 37
Gauss-Jordan
method
1 2 3
Let A := 2 5 7 . Then [A | I ] → [I | A−1 ] gives
3 7 9
1 2 3 1 0 0 1 2 3 1 0 0
2 5 7 0 1 0 → 0 1 1 −2 1 0 →
3 7 9 0 0 1 0 1 0 −3 0 1
1 0 1 5 −2 0 1 0 0 4 −3 1
0 1 1 −2 1 0 → 0 1 0 −3 0 1
0 0 −1 −1 −1 1 0 0 1 1 1 −1
4 −3 1
=⇒ A−1 = −3 0 1 .
1 1 −1
29 / 37
Characterization of invertibility
30 / 37
Characterization of invertibility
30 / 37
Characterization of invertibility
30 / 37
Characterization of invertibility
Now if rank(A) = n then the reduced row echelon form of A has n pivot column
=⇒ rref(A) = In =⇒ A is invertible. Hence (6) ⇒ (1). This completes the proof.
30 / 37
LU Factorization
Definition: Let A be an n × n matrix. A decomposition of A of the form A = LU is
called an LU factorization of A, where U is an upper triangular matrix and L is a unit
lower triangular matrix (the diagonal entries of L are 1).
31 / 37
LU Factorization
Definition: Let A be an n × n matrix. A decomposition of A of the form A = LU is
called an LU factorization of A, where U is an upper triangular matrix and L is a unit
lower triangular matrix (the diagonal entries of L are 1).
31 / 37
LU Factorization
Definition: Let A be an n × n matrix. A decomposition of A of the form A = LU is
called an LU factorization of A, where U is an upper triangular matrix and L is a unit
lower triangular matrix (the diagonal entries of L are 1).
31 / 37
LU Factorization
Definition: Let A be an n × n matrix. A decomposition of A of the form A = LU is
called an LU factorization of A, where U is an upper triangular matrix and L is a unit
lower triangular matrix (the diagonal entries of L are 1).
31 / 37
LU Factorization
Definition: Let A be an n × n matrix. A decomposition of A of the form A = LU is
called an LU factorization of A, where U is an upper triangular matrix and L is a unit
lower triangular matrix (the diagonal entries of L are 1).
31 / 37
LU factorization
a11 a12 a13
Let A = a21 a22 a23 . Set m21 := a21 /a11 and m31 := a31 /a11
a31 a32 a33
32 / 37
LU factorization
a11 a12 a13
Let A = a21 a22 a23 . Set m21 := a21 /a11 and m31 := a31 /a11
a31 a32 a33
32 / 37
LU factorization
1 0 0
E3 := 0 1 0 . Then we have
0 −m32 1
a11 a12 a13 a11 a12 a13
(1) (1) (1) (1)
E3 E2 E1 A = E3 0 a22 a23 = 0 a22 a23 = U.
(1) (1) (2)
0 a32 a33 0 0 a33
33 / 37
LU factorization
1 0 0
E3 := 0 1 0 . Then we have
0 −m32 1
a11 a12 a13 a11 a12 a13
(1) (1) (1) (1)
E3 E2 E1 A = E3 0 a22 a23 = 0 a22 a23 = U.
(1) (1) (2)
0 a32 a33 0 0 a33
1 0 0
Hence A = LU, where L = E1−1 E2−1 E3−1 = m21 1 0 .
m31 m32 1
33 / 37
Examples: LU factorization
1 1 1
Let A := 1 2 2 . Then A = LU, where
1 2 3
1 0 0 1 1 1
L = 1 1 0 and U = 0 1 1 .
1 1 1 0 0 1
34 / 37
Examples: LU factorization
1 1 1
Let A := 1 2 2 . Then A = LU, where
1 2 3
1 0 0 1 1 1
L = 1 1 0 and U = 0 1 1 .
1 1 1 0 0 1
2 4 −1
Let A := −4 −5 3 . Then A = LU, where
2 −5 −4
1 0 0 2 4 −1
L = −2 1 0 and U = 0 3 1 .
1 −3 1 0 0 0
34 / 37
Three theorems with similar proof structure
35 / 37
Three theorems with similar proof structure
35 / 37
Three theorems with similar proof structure
35 / 37
Uniqueness of rref
Theorem: Let R1 and R2 be rref of A. Then R1 = R2 .
36 / 37
Uniqueness of rref
Theorem: Let R1 and R2 be rref of A. Then R1 = R2 .
Proof: Note that R1 and R2 are row equivalent. Hence
R1 x = 0 ⇐⇒ R2 x = 0. (1)
36 / 37
Uniqueness of rref
Theorem: Let R1 and R2 be rref of A. Then R1 = R2 .
Proof: Note that R1 and R2 are row equivalent. Hence
R1 x = 0 ⇐⇒ R2 x = 0. (1)
Step-1: Claim R1 e1 = R2 e1 .
36 / 37
Uniqueness of rref
Theorem: Let R1 and R2 be rref of A. Then R1 = R2 .
Proof: Note that R1 and R2 are row equivalent. Hence
R1 x = 0 ⇐⇒ R2 x = 0. (1)
Step-1: Claim R1 e1 = R2 e1 .
By (1), we have R1 e1 = 0 ⇐⇒ R2 e1 = 0.
36 / 37
Uniqueness of rref
Theorem: Let R1 and R2 be rref of A. Then R1 = R2 .
Proof: Note that R1 and R2 are row equivalent. Hence
R1 x = 0 ⇐⇒ R2 x = 0. (1)
Step-1: Claim R1 e1 = R2 e1 .
By (1), we have R1 e1 = 0 ⇐⇒ R2 e1 = 0.
If R1 e1 6= 0 then R1 e1 and R2 e1 are pivot columns. Hence
R1 e1 = e1 = R2 e1 .
36 / 37
Uniqueness of rref
Theorem: Let R1 and R2 be rref of A. Then R1 = R2 .
Proof: Note that R1 and R2 are row equivalent. Hence
R1 x = 0 ⇐⇒ R2 x = 0. (1)
Step-1: Claim R1 e1 = R2 e1 .
By (1), we have R1 e1 = 0 ⇐⇒ R2 e1 = 0.
If R1 e1 6= 0 then R1 e1 and R2 e1 are pivot columns. Hence
R1 e1 = e1 = R2 e1 .
36 / 37
Uniqueness of rref
Theorem: Let R1 and R2 be rref of A. Then R1 = R2 .
Proof: Note that R1 and R2 are row equivalent. Hence
R1 x = 0 ⇐⇒ R2 x = 0. (1)
Step-1: Claim R1 e1 = R2 e1 .
By (1), we have R1 e1 = 0 ⇐⇒ R2 e1 = 0.
If R1 e1 6= 0 then R1 e1 and R2 e1 are pivot columns. Hence
R1 e1 = e1 = R2 e1 .
36 / 37
Uniqueness of rref
Theorem: Let R1 and R2 be rref of A. Then R1 = R2 .
Proof: Note that R1 and R2 are row equivalent. Hence
R1 x = 0 ⇐⇒ R2 x = 0. (1)
Step-1: Claim R1 e1 = R2 e1 .
By (1), we have R1 e1 = 0 ⇐⇒ R2 e1 = 0.
If R1 e1 6= 0 then R1 e1 and R2 e1 are pivot columns. Hence
R1 e1 = e1 = R2 e1 .
R1 ek+1 = α1 R1 e1 + · · · + αk R1 ek =⇒ R1 x = 0, (2)
37 / 37
Uniqueness of rref
R2 x = 0 ⇒ R2 ek+1 = α1 R2 e1 + · · · + αk R2 ek = R1 ek+1 .
37 / 37
Uniqueness of rref
R2 x = 0 ⇒ R2 ek+1 = α1 R2 e1 + · · · + αk R2 ek = R1 ek+1 .
On the other hand, if R1 ek+1 is a pivot column then R2 ek+1 is also a pivot column.
37 / 37
Uniqueness of rref
R2 x = 0 ⇒ R2 ek+1 = α1 R2 e1 + · · · + αk R2 ek = R1 ek+1 .
On the other hand, if R1 ek+1 is a pivot column then R2 ek+1 is also a pivot column.
If there are p pivot columns in the first k columns of R1 then R1 ek+1 = ep+1 and
R2 ek+1 = ep+1 .
37 / 37
Uniqueness of rref
R2 x = 0 ⇒ R2 ek+1 = α1 R2 e1 + · · · + αk R2 ek = R1 ek+1 .
On the other hand, if R1 ek+1 is a pivot column then R2 ek+1 is also a pivot column.
If there are p pivot columns in the first k columns of R1 then R1 ek+1 = ep+1 and
R2 ek+1 = ep+1 . Hence R1 ek+1 = R2 ek+1 .
37 / 37
Uniqueness of rref
R2 x = 0 ⇒ R2 ek+1 = α1 R2 e1 + · · · + αk R2 ek = R1 ek+1 .
On the other hand, if R1 ek+1 is a pivot column then R2 ek+1 is also a pivot column.
If there are p pivot columns in the first k columns of R1 then R1 ek+1 = ep+1 and
R2 ek+1 = ep+1 . Hence R1 ek+1 = R2 ek+1 .
Hence we have R1 = R2 .
37 / 37