Professional Documents
Culture Documents
MTL101:::Lecture 17 and 18
(Inner product, Cauchy-Schwarz Ineq, Gram-Schmidt orthogonalization,best approximation)
(54) Let V be a vector space over F (where F = R or C). A map V ×V → F denoted by (u, v) 7→ hu|vi
is called an inner product on V if the following properties hold:
(a) hu|ui ∈ R and ≥ 0 for each u ∈ V ;
(b) hu|ui = 0 if and only if u = 0;
(c) hau + bv|wi = ahu|wi + bhv|wi;
(d) hu|vi = hv|ui (the complex conjugate).
A vector space together with an inner product is called an inner product space. Observe
that hu|av + bwi = hav + bw|ui = ahv|ui + bhw|ui = āhv|ui + b̄hw|ui = āhu|vi + b̄hu|wi.
Example: 1. Standard inner product on Rn and Cn : The dot product on R3 is an
inner products. Recall
(a1 , a2 , a3 ) · (b1 , b2 , b3 ) = a1 b1 + a2 b2 + a3 b3 .
n
P
In fact for every n ∈ N, the map h(x1 , x2 , . . . , xn )|(y1 , y2 , . . . , yn )i = xi yi is an inner product
i=1
n
on Rn . On Cn , h(x1 , x2 , . . . , xn )|(y1 , y2 , . . . , yn )i =
P
xi yi is an inner product. We leave the
i=1
verification to the students.
Example: 2. Suppose CC [0, 1] is the collection of complex valued continuous mapsR 1 on [0, 1]
which is a complex vector space (i.e. a vector space over C). The map hf |gi := 0 f (s)g(s)ds
is an inner product on C[0, 1]. Guess, how to get an inner product on the vector space CR [0, 1]
of real valued continuous functions on [0, 1].
Lemma: Suppose a > 0. Then at2 + bt + c > 0 for every t ∈ R if and only if b2 − 4ac < 0.
Hint: The quadratic expression aX 2 + bX + c has a real zero if and only if the discriminant
b2 − 4ac ≥ 0. Therefore this expression has no real zero if and only b2 − 4ac < 0. Since
s 7→ as2 + bs + c is a continuous map on R, it does not change sign on R if aX 2 + bX + c has
no real zero. Since a > 0, as2 + bs + c > 0 if aX 2 + bX + c has no real zero.
Excercise: Suppose a > 0. Show that at2 +bt+c ≥ 0 for every t ∈ R if and only if b2 −4ac ≤ 0.
2
3. What are all inner products on R :
a b x2
Define h(x1 , y1 )|(x2 , y2 )i := x1 y1 = ax1 x2 + bx1 y2 + cx2 y1 + dy1 y2 (observe
c d y2
that term containing x1 y1 , x2 y2 , x2i or yi2 is not acceptable!).
Assertion:h(x1 , y1 )|(x2 , y2 )i = ax1 x2 + bx1 y2 + cx2 y1 + dy1 y2 defines an inner product on R2 if
and only if a > 0, b = c and ad − bc > 0.
To prove this if and only if statement, first assume that the expression defines an inner product.
Then h(1, 0)|(1, 0)i > 0 ⇒ a > 0 (by the first property). By symmetry (the last property) we
get b = c (to get this explicitly write down the property.). For any (x, y) ∈ R2 \ {(0, 0)}, we
have ax2 +2bxy +dy 2 = h(x, y)|(x, y)i > 0. Hence we must have (2by)2 −4a·(dy 2 ) < 0 (∀ y 6= 0)
i.e., ad − b2 > 0. We leave the “if” part for the students as an exercise.
Remark: All the inner products on Rn can be described. But we skip this part since it is
beyond the scope of this course.
Example: 3. Suppose Pn be the real Rvector space of polynomials in X with real coefficients
b
of degree < n. Then the map hf |gi := a f (s)g(s)ds (for f, g ∈ Pn ) is an inner product on Pn
for any a < b ∈ R.
A map V → F denoted by v 7→ ||v|| is called a norm on V if the following conditions are
satisfied.
(a) ||v|| ≥ 0;
(b) ||v|| = 0 if and only if v = 0;
(c) ||av|| = |a|||v|| for any a ∈ F;
(d) ||u + v|| ≤ ||u|| + |v||.
A vector space with a norm is called a normed space.
22
Theorem: (Cauchy-Schwarz inequality) Suppose V is an inner product space. Then for any
u, v ∈ V we have
|hu|vi| ≤ ||u||||v||.
Proof: Suppose F = R. Then 0 ≤ hu + tv|u + tvi = ||u||2 + 2thu|vi + t2 ||v||2 . Using the Lemma
above we have (2hu|vi)2 − 4||u||2 ||v||2 ≤ 0 or |hu|vi| ≤ ||u||||v||.
If F = C, the let α = hu|vi. 0 ≤ hu + tαv|u + tαvi = ||u||2 + t(ᾱhu|vi + αhu|vi) + t2 |α|2 ||v||2 =
||u||2 +2t(|hu|vi|2 )+t2 |hu|vi|2 |2 ||v||2 . Hence, by Lemma above (2|hu|vi|2 )2 −4||u||2 ||v||2 |hu|vi|2 ≤
0 which implies |hu|vi| ≤ ||u||||v||.
Every inner p product space is a normed space. Suppose V is an inner product space, then the
map ||v|| = hv|vi is a norm. Verification of the first three properties is straight forward. We
verify the last property ||u + v|| ≤ ||u|| + ||v||.
||u+v||2 = ||u||2 +(hu|vi+hv|ui)+||v||2 = ||u||2 +2<(hu|vi)+||v||2 ≤ ||u||2 +2|hu|vi|+||v||2 (since
the real part of z= <(z) ≤ |z|). Use Cauchy-Schwarz inequality and get ||u+v||2 ≤ (||u||+||v||)2 .
Remark: If we know the concept of “metric space” we may observe the following: A norm on
a vector spaces defines the following metric: d(u, v) = ||u − v||. Observe that the properties of
a metric follows from the properties of a norm.
(55) A set {u1 , u2 , . . . , un } of an inner product space is called an orthogonal set of vectors if
hui |uj i = 0 if i 6= j and hui |ui i =6 0 for each i and each j. It is not difficult to show that a
orthogonal set of vectors is linearly independent (exercise for the students). Although not every
linearly independet set of vectors is orthogonal, we have a very interesting and usual theorem:
Theorem: (Gram-Schmidt Orthogonalization) Suppose {u1 , u2 , . . . , un } is a linearly inde-
pendent set of vectors in an inner product space. Then there is an orthogonal set {v1 , v2 , . . . , vn }
such that for each i,
span{u1 , u2 , . . . , ui } = span{v1 , v2 , . . . , vi }.
Indeed, v1 = u1 and for 2 ≤ i ≤ n,
i−1
X hui , vj i
vi = ui − vj .
j=1
||vj ||2
Application: Every finite dimensional inner product space has an orthogonal basis.
Proof: Suppose {u1 , u2 , . . . , un } be a basis of the inner product space V (so that the dimension
is n). Apply Gram-Schmidt process on this set to get the orthogonal set {v1 , v2 , . . . , vn } which
also spans V .
Example: Find an orthogonal basis of R2 with the inner product given by h(x1 , y1 )|(x2 , y2 )i =
x1 x2 + 2x1 y2 + 2x2 y1 + 5y1 y2 . We know that {e1 , e2 } is a basis of R2 . Since he1 , e2 i = 2 6= 0, the
standard basis is not an orthogonal basis. To get an orthogonal basis we use Gram-Schmidt
process. v1 = e1 and v2 = e2 − he||e21|e||12i e1 = e2 − 2e1 . Thus {e1 , e2 − 2e1 } is an orthogonal basis.
23
(56) Suppose V is an inner product space and W is a proper subspace of V . Given v ∈ V , a vector
w0 ∈ W is said to be a best approximation of v by a vector belonging to W if for each
w ∈ W we have
||v − w0 || ≤ ||v − w||.
Theorem: A vector w0 ∈ W is a best approximation of v ∈ V by a vector belonging to W if
and only if v − w0 is orthogonal to every vector belong to W .
Proof: Let w ∈ W . Then ||v − w||2 = ||(v − w0 ) + (w − w0 )||2 = ||v − w0 ||2 + ||w0 − w||2 +
2<(hv − w0 |w0 − wi). Suppose v − w0 is orthogonal to every vector belong to W . Then
||v − w||2 = ||v − w0 ||2 + ||w0 − w||2 Hence ||v − w|| ≥ ||v − w0 ||.
Conversely suppose w0 is a best approximation of v by a vector belonging to W . Since
||v − w|| ≥ ||v − w0 ||, we have ||w0 − w||2 + 2<(hv − w0 |w0 − wi) ≥ 0. Since any vector u in W
can be expressed in the form w − w0 for some w ∈ W , we have ||u||2 + 2<(hv − w0 |ui) ≥ 0 for
every u ∈ W . Choose u = − hv−w 0 |w0 −wi
||w0 −w||2
(w0 − w). Then you would get