Professional Documents
Culture Documents
VICTOR Y. PAN†
Key words. Vandermonde matrices, condition number, Cauchy matrices, Cauchy inversion
formulae, Vandermonde matrix inversion
DOI. 10.1137/15M1030170
s = (si )n−1
i=0 denotes the vector of n distinct knots s0 , . . . , sn−1 , and ||M || = ||M ||2
denotes the spectral norm of a matrix M . First, one would like to know whether
this number is reasonably bounded or large, that is, whether the matrix is well- or
ill-conditioned. In the latter case, one must perform computations with extended
precision in order to obtain an accurate solution to a linear system of equations with
this matrix (cf. [GL13]) and to the equivalent problem of polynomial interpolation.
It has been proven in [GI88], [G90], and [T94] that the condition number κ(Vs )
is exponential in n if all knots s0 , s1 , . . . , sn−1 are real. This means that such a
matrix is badly ill-conditioned already for moderately large integers n, but empirically
almost all Vandermonde matrices of large, or even reasonably large, sizes are badly
∗ Received by the editors July 10, 2015; accepted for publication (in revised form) February 1,
2016 by J. L. Barlow; published electronically May 25, 2016. This work was supported by NSF grant
CCF–1116736 and PSC CUNY Award 67699-00 45.
http://www.siam.org/journals/simax/37-2/M103017.html
† Department of Mathematics and Computer Science, Lehman College of the City University of
New York, Bronx, NY 10468, and Ph.D. Programs in Mathematics and Computer Science, The Grad-
uate Center of the City University of New York, New York, NY 10036 (victor.pan@lehman.cuny.edu,
http://comet.lehman.cuny.edu/vpan/).
676
and
n−1
s(x) X
si (x) = 0
= si,j xj for m = n
(x − si )s (si ) i,j=0
(see other expressions for the inverse matrix Vs−1 in [MS58], [M03], [T66], [SEMC11],
and the references therein). However, the known lower and upper bounds on the
condition number of a Vandermonde matrix, as Gautschi wrote in [G90, section IV],
“unfortunately, are too far apart to give much useful information”, and so he “com-
puted the condition number . . . numerically”.
Our new lower bounds enable us to formally support the cited empirical observa-
tions for a very general class of Vandermonde matrices. Our results in sections 4 and
8 indicate that the condition number of an n × n Vandermonde matrix is exponential
in n unless its knots are more or less equally spaced on or about the unit circle C(0, 1).
Our study covers the matrices defined with the quasi-cyclic sequence of knots
as a special case (see section 5), and we further observe in section 6 that the k × k
blocks of the n × n matrix of the DFT for n = 2k are badly ill-conditioned as long as
the integers k and n are sufficiently large, even though DFT matrices themselves are
perfectly conditioned.
In order to prove our lower bounds on the condition number κ(Vs ), we shift from
Vandermonde matrices Vs of (1) to Cauchy matrices,
1 n−1
(5) Cs,t = ,
si − tj i,j=0
defined by 2n distinct knots s0 , . . . , sn−1 ; t0 , . . . , tn−1 . The matrices of these two
classes are much different, and it is not obvious that their studies can be linked
together. Unlike a Vandermonde matrix Vs , a Cauchy matrix Cs,t has rational en-
tries, every one of its submatrices is a Cauchy matrix as well, and the Cauchy struc-
ture is invariant in column interchange as well as in scaling and shifting all entries
by the same scalar, that is, aCas,at = Cs,t for a 6= 0 and Cs+ae,t+ae = Cs,t for
e = (1, . . . , 1)T . Moreover, some powerful techniques have been developed in [GR87],
Downloaded 11/01/16 to 121.49.106.94. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php
[CGR88], [MRT05], [R06], [B10], [XXG12], [XXCB14], [P15], and [Pa] for the ap-
proximation of reasonably large submatrices of Cauchy matrices of a large class by
lower-rank matrices.
Nevertheless there exists a simple Vandermonde–Cauchy link, and this very link
has become the basis for not only our progress in this paper but also the work in [P15]
and [Pa]. (The paper [P90] revealed a more general link among the matrix structures
of Vandermonde, Cauchy, Toeplitz, and Hankel types, based on their displacement
representation. The paper also pointed out potential algorithmic applications, and
this link has indeed become the springboard for devising the highly efficient algo-
rithms of the papers [GKO95], [HB97], [HB98], [G98], [MRT05], [R06], [AR10], [P10],
[XXG12], [XXCB14], [P15], and [Pa].)
Restatement of our original task in terms of Cauchy matrices has opened new
opportunities for our study of Vandermonde matrices, and we obtained the desired
progress by extending the known formula for the Cauchy determinant to the following
simple expression for all entries of the inverse of a Cauchy matrix Cs,t :
−1 (−1)n
(6) (Cs,t )i0 ,j 0 = s(tj 0 )t(si0 ) for all pairs (i0 , j 0 ).
tj 0 − si0
This expression involves the values of the polynomial s(x) of (3) and the polynomial
n−1
Y
(7) t(x) = tn (x) = (x − tj ).
j=0
We apply formula (6) in order to prove exponential lower bounds on the norms of the
inverses and the condition numbers of Cauchy matrices of a large class, and then we
extend these bounds to Vandermonde matrices. We also deduce some alternative ex-
ponential lower bounds by applying the known techniques for low-rank approximation
of certain large submatrices of Cauchy matrices of a large class.
In section 3, by applying the Ekkart–Young theorem, we more readily prove the
exponential growth of the condition numbers of Vandermonde matrices of a narrower
class, namely, those having at least a single knot outside the disc D(0, 1) = {x : |x| ≤
1} or having many knots lying inside this disc and not close to its boundary circle
C(0, 1). These simple results can be of independent interest, and they provide some
initial insight into the subject.
We organize our presentation as follows. In section 2 we recall some basic defini-
tions and auxiliary results. In section 4 we first deduce our basic lower bounds on the
condition number κ(Vs ) by applying a link between Vandermonde and Cauchy matri-
ces, and then we informally interpret these bounds in terms of some restrictions on the
knot set {s0 , s1 , . . . , sn−1 } of any well-conditioned Vandermonde matrix. We prove
exponential growth of the condition number of (a) Vandermonde matrices defined by
the quasi-cyclic sequence of knots in section 5 and (b) the half-size leading blocks of
the DFT matrices in section 6. In section 7 we deduce an exponential lower bound
on the condition numbers of some Cauchy matrices closely linked to Vandermonde
matrices. In section 8 we extend this bound to an alternative exponential lower bound
on the condition numbers κ(Vs ) of Vandermonde matrices Vs . This bound is weaker
than the bounds of section 4 for the matrix classes of sections 5 and 6, but its domain
of applications is shown more explicitly. We devote section 9 to our numerical tests.
n−1 n−1
(8) s+ = max |si |, |Vs | = max{1, s+ }.
i=0
Next, we deduce expressions (6) for the inverse of a nonsingular Cauchy matrix
Cs,t of (5) by combining (9) with the following well-known formula:
Y Y
(10) det(Cs,t ) = (sj − si )(ti − tj )/ (si − tj ).
i<j i,j
2 0 0
, !
(−1)n +i +j Y Y Y Y
(si − ti0 ) (sj 0 − tj ) (sj 0 − si )(ti − tj 0 ) (sj − si0 )(ti0 − tj )
si0 − tj 0 i j i<j 0 j>i0
for every pair (i0 , j 0 ). In particular, note that n2 + n − 1 is an odd number and obtain
−1 1 Y Y (−1)n
(12) (Cs,t )n−1,0 = (si − t0 ) (sn−1 − tj ) = s(t0 )t(sn−1 )
t0 − sn−1 i j
t0 − sn−1
for the polynomials s(x) of (3) and t(x) of (7). By re-enumerating the knots si and
tj , we can turn any pair of them into the pair of sn−1 and t0 and thus arrive at
expression (6).
We are going to combine this expression with the following equations (cf. [P15,
eq. (5)]), which link Vandermonde and Cauchy matrices:
Here
(14) D = diag(t(si ))n−1 and D1 = diag(t0 (tj ))n−1
Downloaded 11/01/16 to 121.49.106.94. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php
i=0 j=0
Cs,f = ( s 1
j )n−1
i,j=0 , which are Cauchy matrices linked to Vandermonde matrices.
i −f ωn
−1
where the matrix Cs,fis defined by (11) for tj = f ωnj
and j = 0, . . . , n − 1.
j
By combining (11), (15), and tj = f ωn for |f | = 1 and j = 0, . . . , n − 1, we obtain
inversion formulae for a nonsingular Vandermonde matrix Vs depending only on the
single nonzero parameter f .
Furthermore, we can substitute t(x) = xn − f n into (12) and express any entry
−1 −1
(Cs,t )i0 ,j 0 of the matrix Cs,f as follows:
−1
(16) (Cs,f )i0 ,j 0 = (−1)n s(tj 0 )(sni0 − f n )/(si0 − tj 0 ).
Equations (15) and (16), combined for all pairs i0 and j 0 , provide a simplified
representation of the inverses of CV and Vandermonde matrices. Next, we apply this
representation in order to estimate the condition number of a Vandermonde matrix.
Theorem 4.1. Let Vs be a nonsingular n × n Vandermonde matrix of (1). Fix
a scalar f such that |f | = 1 and si 6= f ωnj for all pairs of i and j, thus implying that
the CV matrix Cs,f is well-defined and nonsingular. Then
√ −1 n−1
κ(Vs ) ≥ n ||Cs,f || max |sni − f n |.
i=0
Proof. Apply (15) and note that the matrix diag(f n−1−j )n−1 H −j n−1
j=0 Ω diag(ωn )j=0
−1 n−1
is unitary. Therefore, ||Vs−1 || = ||Cs,f 1
diag( sn −f n )j=0 ||, and consequently ||Vs
−1
|| ≥
−1 n−1 n
i
√
||Cs,f ||/ maxi=0 |si − f n |. Combine this estimate with the bound κ(Vs ) ≥ ||Vs−1 || n
of Theorem 3.1.
Corollary 4.1.1. Let Vs be a nonsingular n × n Vandermonde matrix of (1)
with s+ ≤ 1 for s+ of (8). Then
√
κ(Vs ) ≥ 0.5 n max |s(f )| for s(x) of (3).
|f |=1
Proof. Suppose that the matrix Cs,f is well-defined and nonsingular and apply
Theorem 4.1.
−1 −1
)i0 ,j 0 | for all pairs i0 and j 0 . Substitute this inequality
Downloaded 11/01/16 to 121.49.106.94. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php
√
Proof. Note that max|f |=1 |s(f )| ≥ ||v||∞ ≥ ||v||/ n.
Part (i) of Corollary 3.2.1 implies that a Vandermonde matrix Vs of a large size
is badly ill-conditioned if s+ ≥ ν for a constant ν exceeding 1. Corollary 4.1.2 implies
that even if ν+ ≤ 1, this property of the matrix Vs still holds as long as the knot
set {s0 , . . . , sn−1 } is sparse in some disc that covers a reasonably thick neighborhood
of a reasonably long arc of the circle C(0, 1). Thus, informally, the knots of a well-
conditioned Vandermonde matrix must be more or less equally spaced on or about
the circle C(0, 1).
In sections 7 and 8, by applying Theorem 4.1 and low-rank approximation of CV
matrices, we obtain an alternative proof of this property for the inputs of large sizes.
In sections 5 and 6, we estimate exponentially growing condition numbers κ(Vs ) in
two specific cases when the knots si deviate from equally spaced distributions on the
unit circle C(0, 1).
We conclude this section by estimating the condition number κ(Vs ) in terms of
the coefficients of the polynomial s(x) rather than its values.
can make the matrix ill-conditioned, as [G90] has shown empirically, and we formally
deduce this from Theorem 4.1.
Downloaded 11/01/16 to 121.49.106.94. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php
i = 0, 1, . . . , n − 2k .
k
By including √ also all of the 2 th roots of 1, we arrive at a sequence of knots
si = exp(2πfi −1 ), i = 0, 1, . . . , n − 1, called quasi-cyclic in [G90, section IV] and
defined by the following sequence of fractions fi , i = 0, 1, . . . , n − 1:
1 1 3 1 3 5 7 1 3 5 7 9 11 13 15
0, , , , , , , , , , , , , , , , . . . .
2 4 4 8 8 8 8 16 16 16 16 16 16 16 16
For n = 2k as well as n = 2k+1 the first n knots of the sequence define the unitary
matrices of the DFT, but for n = 3·2k and k ≥ 4 the condition numbers of the resulting
matrices Vs grow large as the values k increase, so that the condition number κ(Vs )
exceeds 1,000,000 already for k = 4, n = 48 (cf. [G90, section IV]). This has remained
an empirical observation since 1990, but next we deduce from Corollary 4.1.2 that
log(κ(Vs )) for n = 3 · 2k−1 grows at least proportionally to 2k , and so κ(Vs ) grows
exponentially in n.
Theorem 5.1. Let Vq denote the n×n Vandermonde matrix defined by the quasi-
cyclic sequence of knots on the√unit circle C(0, 1) for n = 3q, q = 2k−1 , and a positive
integer k. Then κ(Vq ) ≥ 2q/2 n.
i
Proof. For n = 3q the sequence has the 2q knots si = ω2q , i = 0, 1, . . . , 2q − 1,
2i+1
equally spaced on the circle C(0, 1), and the q additional knots si = ω4q , i =
0, 1, . . . , q − 1, equally spaced on the upper half of that circle, such that s(x) =
(x2q − 1)pq (x) for
q−1
Y
2i+1
(17) pq (x) = (x − ω4q ).
i=0
√
Corollary 4.1.2 implies that κ(Vq ) ≥ |s(f )| n/2, where we can choose any complex
number f such that√ |f | = 1.
Choose f = − −1 ω4q and observe that f 2q − 1 = −2, and so in this case,
(18) |s(f )| = 2|pq (f )|.
2i+1
√
Furthermore, |f − ω4q ≥ 2 for i = 1, . . . , q (see Figure 1). Consequently |s(f )| ≥
|
q/2+1
2 , and the theorem follows because
√
(19) κ(Vq ) ≥ 0.5|s(f )| n.
2i+1
√
Fig. 1. The minimum length of the line intervals [f, ω4q ], i = 0, 1, . . . , q − 1, is 2, reached
where i = 1.
The theorem implies that the condition number κ(Vk ) grows exponentially in n
for n = 3q and q = 2k−1 , but we can strengthen the estimate a little further because
of the following observation.
2i+1
Proposition 5.1.1. |f − ω4q | ≥ 2 cos((0.5 − i/q)π/2) for q being a power of 2
and i in the range between 0 and q.
2i+1
√
This proposition implies that the distance |f − ω4q | grows from 2 to 2 as i
√
grows from 0 to q/2, but this distance decreases to 2 as i grows from q/2 to q (cf.
Figure 1). In the limit, as q → ∞, we obtain
Z q
(20) |pq (f )| ≥ exp(y) for y = ln(2 cos((0.5 − x/q)π/2)) dx.
0
For any q we obtain a lower bound on the value |pq (f )| by replacing the integral
with a partial sum. Let us do this for q divisible by 12. Replace the integral with√a
partial sum by applying Proposition 5.1.1 for i = q/6 and 2 cos((0.5 − i/q)π/2) = 3
and obtain
This lower estimate for the integral is quite tight in the case of large integers q,
but next we refine it further by applying Proposition 5.1.1 for i = q/3 and 2 cos((0.5 −
i/q)π/2) ≈ 1.9318517. In this way, we deduce that
√
(22) |pq (f )| ≥ (2 cos(π/12) 6 )q/3 .
√ √
For n = 48 the lower bound of Theorem 5.1 on κ(Vq ) is just 28 48 = 1024 3 >
1, 773.6, but (21) and (22) enable us to increase it to 15,417 and 27,598, respectively.
By engaging Proposition 5.1.1 at first for i = q/4 and 2 cos((0.5−i/q)π/2) ≈ 1.9615706
and then for i = q/12 and 2 cos((0.5 − i/q)π/2) ≈ 1.586707, we increase these lower
bounds at first to 38,453 and then to 71,174.
Downloaded 11/01/16 to 121.49.106.94. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php
j √
Fig. 2. The distances |f − ωn |≥ 2 for j = 0, 1, . . . , n/2 − 1.
In contrast, by virtue of our next theorem, a DFT matrix Ω of a large size cannot
be strongly well-conditioned, and thus such a matrix is numerically hard for GENP.
(Surely we do not need GENP in order to invert the DFT matrix, but we will show
some interesting implications of the theorem.)
Theorem 6.2. Assume for convenience that n is even, and let Ω(q) denote the
q×q leading
√ (that is, northwestern) block of the matrix Ω for q = n/2. Then κ(Ω(q) ) ≥
2n/4−1 n.
√
Proof. As follows from Corollary 4.1.2, κ(Ω(q) ) ≥ |s(f ωqj )| n/2 for j = 0, . . . , q −
Qq−1 √
1, where s(x) = j=0 (x − ωnj ) and where we can choose f = − −1. Under such a
√
choice, we get |f − ωnj | ≥ 2 for j = 0, . . . , q − 1 (see Figure 2), and so |s(f )| ≥ 2n/4 .
(q) √
Hence κ(Ωn ) ≥ 2n/4 n/2, and Theorem 6.2 follows.
Remark 6.3. We can strengthen the bound of this theorem by extending the es-
timates of (18)–(22): they still hold if we substitute pq (x) = s(x) and n = 2q (instead
of n = 3q) into these equations. Moreover, by extending the proof of Theorem 6.2,
we can show that all of the k × k blocks of the unitary matrix √1n Ω are badly ill-
conditioned as long as the integer k or n − k is large.
Corollary 6.3.1. GENP is numerically unsafe for (i) a DFT matrix Ω of a
large size as well as for its products, (ii) DΩ, and (iii) ΩZ with any diagonal matrix
D and any circulant matrix Z of the same size.1
Proof. Part (i) follows from Theorems 6.1 and 6.2 combined.
In order to prove part (ii), first note that κ(DΩ) = κ(D) because Ω is a unitary
matrix up to scaling. Therefore, GENP applied to the matrix DΩ fails numerically if
the matrix D is ill-conditioned.
1 See [D70], [GL13, p. 220], or [P01, p. 33] for the definition of circulant matrices.
Fig. 3. The figure shows the sets Sm− = {s10 , s11 } and Sm+ = {s0 , s1 , . . . , s9 } as well as the
distance r from the center c to the nearest points tj 0 and tj 00 of the set Tj 0 ,j 00 .
Two complex sets S and T are (η, c)-separated if every pair of points s ∈ S and
t ∈ T is (η, c)-separated.
Theorem 7.2 (cf. [CGS07, section 2.2], [P15, Thm. 29]). Let m and l denote a
pair of positive integers, and let Sm = {s0 , . . . , sm−1 } and Tl = {t0 , . . . , tl−1 } denote
two sets that are (η, c)-separated for a pair of real number η > 1 and complex number
c. Define the m × n Cauchy matrix C = Cs,t = ( si −t 1
)m−1,l−1 , and write
j i,j=0
m−1
δ = δc,S = min |si − c|.
i=0
Then
1/σρ (C) ≥ (η − 1)η ρ−1 δ for all positive integers ρ.
We will apply the theorem to an arbitrary knot set Sm and to a specific knot set
Tl . In the next section we comment on the implications of the resulting basic theorem.
Theorem 7.3 (cf. Figure 3). Let us be given a complex number f such that
|f | = 1 and two knot sets Sm = {s0 , . . . , sm−1 √ } and Tn = {t0 , . . . , tn−1 }, where
tj = f ω j for j = 0, . . . , n − 1 and ω = exp(2π −1/n). Define the m × n CV matrix
1
Cs,f = C(Sm , Tn ) = ( si −t j
)i∈Sm ,j∈Tn .
Fix two integers j and j 00 in the range [0, n−1] such that 0 < l = j 00 −j 0 +1 ≤ n/2,
0
00
and define the subset Tj 0 ,j 00 = {f ω j }jj=j 0 of the set Tn .
Let c = 0.5(tj 0 + tj 00 ) denote the midpoint of the line interval with the endpoints
tj and tj 00 , and let r = |c − tj 0 | = |c − tj 00 | = sin((j 00 − j 0 ) nπ ) denote the distance from
0
this midpoint to the points tj 0 and tj 00 . (Note that (j 00 − j 0 ) nπ is a quite tight upper
bound on r for large integers n and bounded differences j 00 − j 0 .)
Fix a constant η > 1, and partition the set Sm into the two subsets Sm,− and
Sm,+ of cardinalities m− and m+ , respectively (m− = 2 and m+ = 10 in Figure 3),
such that the knots of Sm,− lie in the open disc D(c, ηr) = {z : |z − c| < ηr}, while
the knots of Sm,+ lie in the exterior of that disc.
(i) Then the two sets Sm,+ and Tj 0 ,j 00 are (c, η)-separated and
(ii) 1/σρ+m− +n−l (Cs,f ) ≥ (η − 1)η ρ r for all positive integers ρ.
Downloaded 11/01/16 to 121.49.106.94. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php
Proof. Part (i) follows because distance(c, Tj 0 ,j 00) = r and because distance(c, Sm,+) ≥
ηr by virtue of the definition of the set Sm,+ .
In order to prove part (ii), first define the two following submatrices of the CV
matrix Cs,f = C(Sm , Tn ):
1 1
C(Sm , Tj 0 ,j 00 ) = and C(Sm+ , Tj 0 ,j 00 ) = .
si − tj i∈Sm ,j∈Tj0 ,j00 si − tj i∈Sm,+ ,j∈Tj0 ,j00
Apply Theorem 7.2 to the matrix C = C(Sm+ , Tj 0 ,j 00 ) and to δ = ηr, and obtain
that
1/σρ (C(Sm+ , Tj 0 ,j 00 )) ≥ (η − 1)η ρ r for all positive integers ρ.
Combine this bound with Corollary 2.1.1 applied for k = m− , and obtain that
Combine the latter bound with Corollary 2.1.1 applied for k = n − l, and obtain
part (ii) of the theorem.
Corollary 7.3.1. Under the assumptions of Theorem 7.3, let m = n and ρ =
ρ̄ = l − m− > 0. Then Cs,f = C(Sm , Tn )) is an n × n matrix and
−1
||Cs,f || = 1/σn (Cs,f ) ≥ (η − 1)η ρ̄ r.
knot sn−1 in the range from 1.14 to 10. For comparison, the table also displays the
lower estimates of part (i) of Corollary 3.2.1 in the case where m = n. By setting
sn−1 = ωnn−1 we would have had κ(Vs ) = 1, but with the growth of the value |sn−1 |—
and even with the growth of n for sn−1 fixed at 1.14—the matrix quickly became
ill-conditioned.
Table 2 displays the condition numbers of n × n Vandermonde matrices defined
i
by various pairs of integers k and n and the knots s0 , . . . , sn−1 such that si = ωn−k
i
are the (n − k)th roots of 1, while sn−k+i = ρωk are the kth roots of 1 scaled by ρ in
both cases for i = 0, 1, . . . , k and ρ = √
3/4, 1/2. For comparison, the table also displays
the lower estimates κ− = |Vs |ν k−1 /( k max{k, ν/(ν − 1)}) of part (ii) of Corollary
3.2.1 for ν = 1/ρ.
Table 1
Condition numbers of Vandermonde matrices with a single absolutely large knot.
n−1 √
n sn−1 κ(Vs ) s+ / n
64 1.14E+00 3.36E+03 4.98E+02
64 1.56E+00 6.88E+11 2.03E+11
64 3.25E+00 4.62E+32 2.22E+31
64 1.00E+01 5.47E+63 1.25E+62
128 1.14E+00 1.08E+07 1.60E+06
128 1.56E+00 3.24E+32 3.64E+23
128 3.25E+00 6.40E+76 9.03E+63
128 1.00E+01 1.30E+130 8.84E+125
256 1.14E+00 1.57E+14 2.33E+13
256 1.56E+00 6.61E+62 1.66E+48
256 3.25E+00 2.89E+132 2.12E+129
256 1.00E+01 1.18E+265 6.25E+253
Table 2
Condition numbers of Vandermonde matrices with k absolutely small knots.
ρ = 3/4 ρ = 1/2
n k κ(Vs ), κ− κ(Vs ), κ−
64 8 4.04e+01, 3.57e+00 6.90e+02, 6.10e+01
64 16 2.71e+02, 1.20e+01 1.19e+05, 5.24e+03
64 32 1.71e+04, 3.78e+02 4.91e+09, 1.09e+08
128 8 5.85e+01, 5.17e+00 1.00e+03, 8.84e+01
128 16 4.03e+02, 1.78e+01 1.77e+05, 7.80e+03
128 32 2.70e+04, 5.97e+02 7.77e+09, 1.72e+08
256 8 8.38e+01, 7.40e+00 1.43e+03, 1.26e+02
256 16 5.85e+02, 2.58e+01 2.56e+05, 1.13e+04
256 32 4.02e+04, 8.89e+02 1.16e+10, 2.56e+08
Table 3 displays the condition numbers κ(Vs ) of the 3q×3q Vandermonde matrices
defined by the quasi-cyclic sequence of knots of Example 5.0.1. The table also displays
the lower bounds κ and κ0 on these numbers, calculated based on (22) and (20),
respectively.
Table 4 displays the condition numbers κ(Ω(q) ) of the q × q leading blocks of the
n × n DFT matrices Ω = (ωnij )n−1
i,j=0 for n = 2q = 8, 16, 32, 64. The table also displays
the lower bounds κ− and κ0− on these numbers, calculated for s(x) = pq (x) and for
the same pairs of n and q based on Theorem 6.2 and (20), respectively.
n q κ(Vs ) κ κ0
12 4 2.16e+01 1.96e+01 1.03E+01
24 8 1.50e+03 2.22e+02 1.06E+02
48 16 1.16e+07 2.01e+04 1.13E+04
96 32 9.86e+14 2.16e+08 1.27E+08
Table 4
Condition numbers of the leading blocks of DFT matrices.
(k)
n q κ(Ωn ) κ− κ0−
8 4 1.53E+01 8.00E+00 1.03E+01
16 8 1.06E+03 4.53E+01 1.06E+02
32 16 8.18E+06 1.02E+03 1.13E+04
64 32 8.44E+14 3.71E+05 1.27E+08
According to our test results, the lower bounds based on (22) and (20)—and even
those based on Theorem 6.2—grow very fast, but the condition numbers grow even
faster.
Table 5 shows the mean and standard deviation (std) of the relative residual
norms rn = ||Ωn x̃ − b||/||b|| and relative error norms en = ||x̃ − x||/||x̃|| when GENP
was applied to the linear systems of n equations Ωn x = b for the n × n DFT matrix
Ωn and the vectors b = Ωn e + τ g of dimension n, and when 100 linear systems
Ωn x = b were solved for each n, n = 64, 128, 256, 512, 1024. Here eT = (1, 1, . . . , 1),
τ = 0.001, g is a vector filled with independently and identically distributed (i.i.d.)
standard Gaussian random variables, and x̃ denotes the computed approximation to
the solution vector x.
According to the data in this table and all other tables, our formal estimates and
test results are in quite good accordance with each other.
Table 5
GENP with DFT matrices: relative residual and error norms.
10. A brief summary of our results. First, we have estimated the condition
numbers κ(Vs ) of n × n Vandermonde matrices Vs = (sji )n−1 i,j=0 with complex knots
s0 , . . . , sn−1 by applying the Ekkart–Young theorem, and we have readily deduced
that these numbers grow exponentially in n unless all knots s0 , . . . , sn−1 lie in or near
the unit disc {x : |x| ≤ 1} and, furthermore, unless they lie mostly on or near the
unit circle C(0, 1) = {x : |x = 1|}.
Then, by applying more involved techniques based on the Vandermonde–Cauchy
transformation of matrix structure, our formulae for the inversion of a Cauchy matrix,
their extension to Vandermonde matrix inversion, and the techniques of low-rank
d
X d
X
σn (V ) = σn (V T ) ≤ σn (Vs ) ||Cj || ||Dj || and σn (C) ≤ σn (Cs,t ) ||Dj0 || ||Dj00 ||
j=1 j=1
Qn−1 n−1
Here t(x) = j=0 (x − tj ) is the polynomial of (7) and u = (uj )j=0 denotes the
n
coefficient vector of the polynomial t(x) − x . The decisive step could be the selection
of a proper set of knots T = {tj }n−1
j=0 , which would enable us to reduce the estimation of
the condition number of a polynomial Vandermonde matrix VP,s to the same task for
a Cauchy matrix Cs,t . Sections V and VI of [G90] might give us some initial guidance
for the selection of the knot set that would extend the set of equally spaced knots on
the unit circle used in the case of Vandermonde matrices. Then it would remain to
extend our estimates for CV matrices to the new subclass of Cauchy matrices.
Acknowledgments. I am grateful to the reviewers for their valuable and
detailed comments.
REFERENCES
[AR10] A. Aricò and G. Rodriguez, A fast solver for linear systems with displacement struc-
Downloaded 11/01/16 to 121.49.106.94. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php
[Pa] V.Y. Pan, Fast Approximate Computations with Cauchy Matrices and Polynomials,
preprint, 2015. arXiv:1506.02285 [math.NA].
[PQY15] V.Y. Pan, G. Qian, and X. Yan, Random multipliers numerically stabilize Gaussian
Downloaded 11/01/16 to 121.49.106.94. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php