
On best approximations of matrix polynomials

Petr Tichý
joint work with Jörg Liesen

Institute of Computer Science,
Academy of Sciences of the Czech Republic

GAMM Workshop on Applied and Numerical Linear Algebra
Technische Universität Hamburg-Harburg, Germany
September 12, 2008
Ideal Arnoldi approximation problem

\min_{p \in \mathcal{M}_{m+1}} \| p(A) \| = \min_{p \in \mathcal{P}_m} \| A^{m+1} - p(A) \| ,

where M_{m+1} is the class of monic polynomials of degree m+1, and P_m is the class of polynomials of degree at most m.

The problem was introduced in [Greenbaum and Trefethen, 1994]; the paper contains a uniqueness result (→ story of the proof). The unique polynomial that solves the problem is called the (m+1)st ideal Arnoldi polynomial of A, or the (m+1)st Chebyshev polynomial of A. Some work on these polynomials appears in [Toh, PhD thesis, 1996], [Toh and Trefethen, 1998], [Trefethen and Embree, 2005].
Matrix function best approximation problem

We consider the matrix approximation problem

\min_{p \in \mathcal{P}_m} \| f(A) - p(A) \| ,

where ‖·‖ is the spectral norm (matrix 2-norm), A ∈ C^{n×n}, and f is analytic in a neighborhood of A's spectrum.

Well known: f(A) = p_f(A) for a polynomial p_f depending on the values and possibly the derivatives of f on A's spectrum. Without loss of generality we therefore assume that f is a given polynomial.

Does this problem have a unique solution p_* ∈ P_m?
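The identity f(A) = p_f(A) can be illustrated for a diagonalizable matrix with distinct eigenvalues, where p_f is the Lagrange interpolant of f on the spectrum. A minimal sketch (my illustration, not from the talk; f = exp and the test matrix V are arbitrary choices):

```python
import numpy as np

# A diagonalizable test matrix A = V diag(eigs) V^{-1} with distinct eigenvalues.
eigs = np.array([0.5, 1.0, 2.0])
V = np.array([[1.0, 2.0, 0.0],
              [0.0, 1.0, 1.0],
              [1.0, 0.0, 1.0]])
A = V @ np.diag(eigs) @ np.linalg.inv(V)

# f(A) for f = exp, computed via the diagonalization.
fA = V @ np.diag(np.exp(eigs)) @ np.linalg.inv(V)

# p_f: degree-2 polynomial interpolating exp at the 3 eigenvalues
# (coefficients ordered low -> high).
coeffs = np.polynomial.polynomial.polyfit(eigs, np.exp(eigs), 2)

# Evaluate p_f at the matrix A using matrix powers.
pA = sum(c * np.linalg.matrix_power(A, k) for k, c in enumerate(coeffs))
assert np.allclose(fA, pA)   # f(A) = p_f(A)
```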
Outline

1. General matrix approximation problems
2. Formulation of matrix polynomial approximation problems
3. Uniqueness results
4. Ideal Arnoldi versus ideal GMRES polynomials
General matrix approximation problems

Given:
m linearly independent matrices A_1, …, A_m ∈ C^{n×n},
A ≡ span{A_1, …, A_m},
B ∈ C^{n×n} \ A,
a matrix norm ‖·‖.

Consider the best approximation problem

\min_{M \in \mathcal{A}} \| B - M \| .

This problem has a unique solution if ‖·‖ is strictly convex [see, e.g., Sreedharan, 1973].
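For a strictly convex norm such as the Frobenius norm, the unique best approximation from A is the orthogonal projection of B, which can be computed by least squares on the vectorized basis matrices. A sketch (my illustration, not from the talk; the matrices are randomly generated):

```python
import numpy as np

rng = np.random.default_rng(2)
n, mdim = 4, 3
As = [rng.standard_normal((n, n)) for _ in range(mdim)]   # basis of A
B = rng.standard_normal((n, n))

# Stack vec(A_1), ..., vec(A_m) as columns and project vec(B) onto their span.
S = np.column_stack([Ai.ravel() for Ai in As])
coef, *_ = np.linalg.lstsq(S, B.ravel(), rcond=None)
M = sum(c * Ai for c, Ai in zip(coef, As))

# Optimality: the residual B - M is orthogonal (in the trace inner
# product) to every basis matrix of A.
for Ai in As:
    assert abs(np.sum((B - M) * Ai)) < 1e-8
```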
Strictly convex norms

The norm ‖·‖ is strictly convex if for all X, Y,

\| X \| = \| Y \| = 1 , \quad \| X + Y \| = 2 \quad \Rightarrow \quad X = Y .

Which matrix norms are strictly convex? Let σ_1 ≥ σ_2 ≥ … ≥ σ_n be the singular values of X and 1 ≤ p ≤ ∞. The c_p-norm is

\| X \|_p \equiv \left( \sum_{i=1}^{n} \sigma_i^p \right)^{1/p} .

p = 2 … Frobenius norm,
p = ∞ … spectral norm (matrix 2-norm), ‖X‖_∞ = σ_1,
p = 1 … trace (nuclear) norm.

Theorem. If 1 < p < ∞, then the c_p-norm is strictly convex. [see, e.g., Ziȩtak, 1988]
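The c_p-norms can be computed directly from the singular values. A minimal NumPy sketch (my illustration, not from the talk), checking the three special cases against NumPy's built-in norms:

```python
import numpy as np

def cp_norm(X, p):
    """Schatten c_p-norm: the l_p norm of the vector of singular values."""
    s = np.linalg.svd(X, compute_uv=False)
    if np.isinf(p):
        return s[0]                       # spectral norm: largest singular value
    return (s ** p).sum() ** (1.0 / p)

X = np.array([[3.0, 1.0],
              [0.0, 2.0]])
# p = 2 recovers the Frobenius norm, p = inf the matrix 2-norm,
# p = 1 the trace (nuclear) norm.
assert np.isclose(cp_norm(X, 2), np.linalg.norm(X, 'fro'))
assert np.isclose(cp_norm(X, np.inf), np.linalg.norm(X, 2))
assert np.isclose(cp_norm(X, 1), np.linalg.norm(X, 'nuc'))
```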
Spectral norm (matrix 2-norm)

A useful matrix norm in many applications: the spectral norm

\| X \| \equiv \sigma_1 .

This norm is not strictly convex. Consider

X = \begin{bmatrix} I & \\ & \varepsilon \end{bmatrix} , \qquad Y = \begin{bmatrix} I & \\ & \delta \end{bmatrix} , \qquad \varepsilon, \delta \in [0, 1] .

Then we have, for each ε, δ ∈ [0, 1],

\| X \| = \| Y \| = 1 \quad \text{and} \quad \| X + Y \| = 2 ,

but if ε ≠ δ then X ≠ Y.

Consequently: best approximation problems in the spectral norm are not guaranteed to have a unique solution.
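The counterexample is easy to verify numerically. A small sketch (illustration only; the 2×2 identity block and the scalars 0.3, 0.7 are my choices):

```python
import numpy as np

def blockdiag_eye_scalar(eps, k=2):
    """Build diag(I_k, eps): a k x k identity block padded with a scalar."""
    X = np.eye(k + 1)
    X[k, k] = eps
    return X

X = blockdiag_eye_scalar(0.3)
Y = blockdiag_eye_scalar(0.7)
norm = lambda M: np.linalg.norm(M, 2)    # spectral norm

assert np.isclose(norm(X), 1.0)
assert np.isclose(norm(Y), 1.0)
assert np.isclose(norm(X + Y), 2.0)      # equality case of the triangle inequality
assert not np.allclose(X, Y)             # ... yet X != Y: not strictly convex
```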
Matrix approximation problems in the spectral norm

\min_{M \in \mathcal{A}} \| B - M \| = \| B - A_* \|

A matrix A_* ∈ A achieving the minimum is called a spectral approximation of B from the subspace A.

Open question: When does this problem have a unique solution?

Ziȩtak's sufficient condition. Theorem [Ziȩtak, 1993]. If the residual matrix B − A_* has an n-fold maximal singular value, then the spectral approximation A_* of B from the subspace A is unique.

Is this sufficient condition satisfied, e.g., for the ideal Arnoldi approximation problem?
General characterization of spectral approximations

A general characterization was given by [Lau and Riha, 1981] and [Ziȩtak, 1993, 1996], based on Singer's theorem [Singer, 1970].

Define the trace norm (nuclear norm, c_1-norm) ||| · ||| and the inner product ⟨·,·⟩ by

||| X ||| = \sigma_1 + \cdots + \sigma_n , \qquad \langle Z, X \rangle \equiv \mathrm{tr}(Z^* X) .

Characterization [Ziȩtak, 1996]: A_* ∈ A is a spectral approximation of B from the subspace A iff there exists Z ∈ C^{n×n} such that

||| Z ||| = 1 , \qquad \langle Z, X \rangle = 0 \quad \forall X \in \mathcal{A} ,

and

\mathrm{Re} \langle Z, B - A_* \rangle = \| B - A_* \| .
Chebyshev polynomials of Jordan blocks

Theorem. Let J_λ be the n × n Jordan block. Consider the ideal Arnoldi approximation problem

\min_{p \in \mathcal{M}_m} \| p(J_\lambda) \| = \min_{M \in \mathcal{A}} \| B - M \| ,

where B = J_λ^m and A = span{I, J_λ, …, J_λ^{m−1}}. The minimum is attained by the polynomial p_*(z) = (z − λ)^m [Liesen and T., 2008].

Proof. For p_* = (z − λ)^m, the residual matrix B − M is given by

B - M = p_*(J_\lambda) = (J_\lambda - \lambda I)^m = J_0^m .

Define Z ≡ e_1 e_{m+1}^T. It holds that

||| Z ||| = 1 , \qquad \langle Z, J_\lambda^k \rangle = 0 , \quad k = 0, \ldots, m-1 ,

and

\langle Z, B - M \rangle = \langle Z, J_0^m \rangle = 1 = \| B - M \| \quad \Rightarrow \quad M = A_* .
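The dual certificate in this proof can be checked numerically against the characterization of spectral approximations. A sketch (my illustration; n = 8, m = 3, λ = 1.5 are arbitrary test values):

```python
import numpy as np

def jordan_block(lam, n):
    """n x n Jordan block with eigenvalue lam."""
    return lam * np.eye(n) + np.diag(np.ones(n - 1), 1)

n, m, lam = 8, 3, 1.5
J = jordan_block(lam, n)

# Residual of the candidate p_*(z) = (z - lam)^m: (J - lam I)^m = J_0^m.
R = np.linalg.matrix_power(J - lam * np.eye(n), m)
assert np.allclose(R, np.linalg.matrix_power(jordan_block(0.0, n), m))
assert np.isclose(np.linalg.norm(R, 2), 1.0)        # ||J_0^m|| = 1

# Dual certificate Z = e_1 e_{m+1}^T (0-based entry (0, m)).
Z = np.zeros((n, n))
Z[0, m] = 1.0
inner = lambda Zm, Xm: np.trace(Zm.conj().T @ Xm)   # <Z, X> = tr(Z^* X)

assert np.isclose(np.linalg.svd(Z, compute_uv=False).sum(), 1.0)  # |||Z||| = 1
for k in range(m):                                   # Z orthogonal to the subspace
    assert np.isclose(inner(Z, np.linalg.matrix_power(J, k)), 0.0)
assert np.isclose(inner(Z, R), 1.0)                  # <Z, B - M> = ||B - M||
```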
Ziȩtak's sufficient condition

Theorem [Ziȩtak, 1993]. If the residual matrix B − A_* has an n-fold maximal singular value, then the spectral approximation A_* of B from the subspace A is unique.

For the ideal Arnoldi approximation problem and the Jordan block J_λ, we have shown that

B - A_* = J_0^m .

One is an (n − m)-fold maximal singular value of B − A_*, and zero is an m-fold singular value of B − A_*.

The spectral approximation is unique [Greenbaum and Trefethen, 1994], but, apparently, Ziȩtak's sufficient condition is not satisfied!
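The singular value multiplicities of J_0^m are easy to confirm numerically (illustration only; n = 8, m = 3 are arbitrary):

```python
import numpy as np

n, m = 8, 3
J0 = np.diag(np.ones(n - 1), 1)            # n x n nilpotent Jordan block
R = np.linalg.matrix_power(J0, m)          # residual B - A_* = J_0^m

s = np.sort(np.linalg.svd(R, compute_uv=False))[::-1]
# One is an (n - m)-fold maximal singular value and zero is m-fold,
# so the maximal singular value is NOT n-fold: Zietak's sufficient
# condition does not apply, although the minimizer is unique.
assert np.allclose(s[:n - m], 1.0)
assert np.allclose(s[n - m:], 0.0)
```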
The problem and known results

\min_{p \in \mathcal{P}_m} \| f(A) - p(A) \| ,

where ‖·‖ is the spectral norm and f is a given polynomial.

Known results:

If A is normal, the problem reduces to the well-studied scalar approximation problem on the spectrum Λ of A,

\min_{p \in \mathcal{P}_m} \max_{\lambda \in \Lambda} | f(\lambda) - p(\lambda) | .

For general A, only the special case f(A) = A^{m+1} is known to have a unique solution [Greenbaum and Trefethen, 1994].
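The reduction for normal A can be illustrated with a diagonal matrix, for which the spectral norm of f(A) − p(A) equals the maximum of |f − p| over the eigenvalues. A sketch (my illustration; f(z) = z^3 and the polynomial p are arbitrary choices):

```python
import numpy as np

def polyval_matrix(coeffs, A):
    """Evaluate sum_k coeffs[k] * A^k by Horner's scheme with matrix products."""
    P = np.zeros_like(A)
    I = np.eye(A.shape[0])
    for c in reversed(coeffs):
        P = P @ A + c * I
    return P

# Diagonal (hence normal) A: the matrix problem collapses to a scalar one.
eigs = np.array([-0.9, -0.3, 0.1, 0.4, 0.8])
A = np.diag(eigs)
f = [0.0, 0.0, 0.0, 1.0]       # f(z) = z^3, coefficients low -> high
p = [0.0, -0.25, 0.0]          # a hypothetical p(z) = -0.25 z in P_2

matrix_err = np.linalg.norm(polyval_matrix(f, A) - polyval_matrix(p, A), 2)
scalar_err = np.max(np.abs(eigs**3 - (-0.25) * eigs))
assert np.isclose(matrix_err, scalar_err)   # ||f(A)-p(A)|| = max over spectrum
```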
Reformulation of the problem

Let f be a polynomial of degree m + ℓ + 1 (m ≥ 0, ℓ ≥ 0). Then

f(z) = z^{m+1} g(z) + f_m z^m + \cdots + f_1 z + f_0 ,

where g is a polynomial of degree at most ℓ. Approximate f by p ∈ P_m,

p(z) = p_m z^m + \cdots + p_1 z + p_0 .

It is easy to show that

\min_{p \in \mathcal{P}_m} \| f(A) - p(A) \| = \min_{h \in \mathcal{P}_m} \| A^{m+1} g(A) - h(A) \| .

Without loss of generality we can therefore consider the problem

\min_{h \in \mathcal{P}_m} \| A^{m+1} g(A) - h(A) \| ,

where g is a given polynomial of degree at most ℓ.
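The splitting f(z) = z^{m+1} g(z) + (lower-order terms) amounts to slicing the coefficient vector. A sketch (my illustration; the coefficients of f are arbitrary):

```python
import numpy as np

m = 2
fc = np.array([1.0, -2.0, 0.5, 3.0, 0.0, 4.0])  # f, low -> high; deg f = 5 = m + l + 1, l = 2
low = fc[:m + 1]     # f_0, ..., f_m
g = fc[m + 1:]       # coefficients of g, deg g <= l

# Check f(z) = z^{m+1} g(z) + low(z) at a few sample points.
z = np.linspace(-1.0, 1.0, 7)
f_z = np.polynomial.polynomial.polyval(z, fc)
rebuilt = z**(m + 1) * np.polynomial.polynomial.polyval(z, g) \
          + np.polynomial.polynomial.polyval(z, low)
assert np.allclose(f_z, rebuilt)
```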
Matrix polynomial approximation problems

We consider the problem

\min_{h \in \mathcal{P}_m} \| A^{m+1} g(A) - h(A) \| ,

where g is a given polynomial of degree at most ℓ. For g ≡ 1 we obtain the ideal Arnoldi approximation problem.

Related problem:

\min_{g \in \mathcal{P}_\ell} \| A^{m+1} g(A) - h(A) \| ,

where h is a given polynomial of degree at most m. For h ≡ 1 we obtain the ideal GMRES approximation problem.
Matrix polynomial approximation problems: general notation

We consider matrix approximation problems of the form

\min_{M \in \mathcal{A}} \| B - M \| ,

with either

B \equiv A^{m+1} g(A) , \quad g \in \mathcal{P}_\ell \text{ given} , \qquad \mathcal{A} \equiv \mathrm{span} \{ I, A, \ldots, A^m \} ,

or

B \equiv h(A) , \quad h \in \mathcal{P}_m \text{ given} , \qquad \mathcal{A} \equiv \mathrm{span} \{ A^{m+1}, A^{m+2}, \ldots, A^{m+\ell+1} \} .

Here B ∈ C^{n×n} \ A means that the minimum is positive.
Uniqueness results

Theorem [Liesen and T., 2008].

1. Given g ∈ P_ℓ, the problem

\min_{h \in \mathcal{P}_m} \| A^{m+1} g(A) - h(A) \| > 0

has a unique minimizer.

2. Let A be nonsingular and let h ∈ P_m be given. Then the problem

\min_{g \in \mathcal{P}_\ell} \| A^{m+1} g(A) - h(A) \| > 0

has a unique minimizer.

The nonsingularity assumption in (2) cannot be omitted in general.
Idea of the proof (by contradiction)

Based on the proof by [Greenbaum and Trefethen, 1994]. Consider the problem

\min_{p \in \mathcal{G}_{\ell,m}^{(g)}} \| p(A) \| ,

where

\mathcal{G}_{\ell,m}^{(g)} \equiv \{ z^{m+1} g + h \ : \ g \in \mathcal{P}_\ell \text{ is given}, \ h \in \mathcal{P}_m \} .

Let q_1 and q_2 be two different solutions, ‖q_1(A)‖ = ‖q_2(A)‖ = C. Use q_1 and q_2 to construct the polynomial

q_\epsilon = (1 - \epsilon) q + \epsilon \tilde{q} \in \mathcal{G}_{\ell,m}^{(g)}

and show that, for sufficiently small ε,

\| q_\epsilon (A) \| < C .
Ideal Arnoldi versus ideal GMRES polynomials

Ideal Arnoldi and ideal GMRES problems:

\min_{p \in \mathcal{M}_m} \| p(A) \| , \qquad \min_{p \in \pi_m} \| p(A) \| ,

where π_m denotes the polynomials of degree at most m with p(0) = 1.

For A = J_λ, the mth ideal Arnoldi polynomial is (z − λ)^m [Liesen and T., 2008]. For λ ≠ 0, we can write

(z - \lambda)^m = (-\lambda)^m (1 - \lambda^{-1} z)^m ,

so (1 − λ^{−1} z)^m is a candidate for solving the ideal GMRES problem.

Is the ideal GMRES polynomial a scaled version of the ideal Arnoldi polynomial (at least for J_λ)? No! Determining the ideal GMRES polynomials of J_λ is a very complicated and intriguing problem [T., Liesen and Faber, 2007].
Ideal Arnoldi and ideal GMRES polynomials for J_λ

Theorem [T., Liesen and Faber, 2007]. The mth ideal GMRES polynomial of J_λ is (1 − λ^{−1} z)^m iff 0 ≤ m < n/2 and |λ| ≥ \varrho_{m,n-m}^{-1}. Here \varrho_{k,n} is the radius of the polynomial numerical hull of degree k of an n × n Jordan block (independent of λ).

Example: Let n be even and consider m = n/2.

If |λ| ≤ 2^{−2/n}, the ideal GMRES polynomial is 1.
If |λ| ≥ 2^{−2/n}, the ideal GMRES polynomial is equal to

\frac{2}{4\lambda^n + 1} + \frac{4\lambda^n - 1}{4\lambda^n + 1} \, (1 - \lambda^{-1} z)^{n/2} .

Obviously, neither 1 nor the above polynomial is a scalar multiple of the corresponding ideal Arnoldi polynomial.
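Both regimes of the example can be checked numerically against the scaled ideal Arnoldi candidate (1 − λ^{−1}z)^{n/2}. A sketch (my illustration for n = 4, m = 2; λ = 0.5 and λ = 1 are arbitrary values below and above the threshold 2^{−2/n} ≈ 0.707):

```python
import numpy as np

def jordan_block(lam, n):
    """n x n Jordan block with eigenvalue lam."""
    return lam * np.eye(n) + np.diag(np.ones(n - 1), 1)

n, m = 4, 2                     # even n, m = n/2
I = np.eye(n)

# Regime 1: |lam| below the threshold -> the ideal GMRES polynomial is 1.
lam = 0.5
J = jordan_block(lam, n)
cand = np.linalg.matrix_power(I - J / lam, m)     # scaled Arnoldi candidate
assert np.linalg.norm(I, 2) < np.linalg.norm(cand, 2)   # p = 1 beats the candidate

# Regime 2: |lam| above the threshold -> the mixed polynomial from the slide.
lam2 = 1.0
J2 = jordan_block(lam2, n)
cand2 = np.linalg.matrix_power(I - J2 / lam2, m)
a = 2.0 / (4 * lam2**n + 1)
b = (4 * lam2**n - 1) / (4 * lam2**n + 1)         # note a + b = 1, so p(0) = 1
claimed = a * I + b * cand2
assert np.linalg.norm(claimed, 2) < np.linalg.norm(cand2, 2)
```

In both regimes the claimed polynomial achieves a strictly smaller norm than the scaled ideal Arnoldi polynomial, consistent with the theorem.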
Summary

We showed uniqueness of two matrix best approximation problems in the spectral norm,

\min_{p \in \mathcal{P}_m} \| f(A) - p(A) \|

and

\min_{p \in \mathcal{P}_\ell} \| h(A) - A^{m+1} p(A) \| .

These are generalizations of the ideal Arnoldi and ideal GMRES problems.

The problem is nontrivial for nonnormal A.

Ideal Arnoldi and ideal GMRES polynomials can differ.

Open question: When does the general problem

\min_{M \in \mathcal{A}} \| B - M \|

have a unique solution?
Related papers

J. Liesen and P. Tichý, On best approximations of polynomials in matrices in the matrix 2-norm, submitted, June 2008.

P. Tichý, J. Liesen and V. Faber, On worst-case GMRES, ideal GMRES, and the polynomial numerical hull of a Jordan block, Electronic Transactions on Numerical Analysis (ETNA), vol. 26, pp. 453-473, 2007.

More details can be found at

http://www.cs.cas.cz/tichy
http://www.math.tu-berlin.de/~liesen

Thank you for your attention!
