Professional Documents
Culture Documents
,
I
MATHEMATICAL ANALYSIS
I I
MATHEMATICAL ANALYSIS
SECOND EDmON
ADDISON· WESLEY PUBLISHING COMPANY Reading, Massachusetts Amsterdam· London· Manila • Singapore • Sydney. Tokyo
•;
,9ii;-~
11,00'"
To my parents
A complete and unabridged reprint of the original American textbook, this World Student Series edition may be sold only in those countries to which it is oonldgned by Addison-Wesley or its authorized trade distrlbutOfS.1t may not be re-exported from the country to which it has been consigned, and it may not be sold in the United States of America or its possessions.
Consulting Editor: Lynn Loomis
191I Copyright © 1974 by Addison-Wesley Publishing Company, right 1974 by Addison-Wesley PublUhing Company, Inc.
AD rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical. photocopying, recording or otherwise. without the prior written permission of the publisher. Original edition prlnted in the United States of America. Published simultaneously in Canada. Librat'Y of Congress Catalog Card No. 72-11473.
,
.
-'.
/.~~I
'.~. "'j
PREFACE
A glance at the table of contents will reveal that this textbook treats topics in analysis at the "Advanced Calculus" level. The aim has been to provide a development of the subject which is honest. rigorous, up to date, and, at the same time, not too pedantic. The book provides a transition from elementary calculus to advanced courses in real and complex function theory, and it introduces the reader to some of the abstract thinking that pervades modem analysis. The second edition differs from the first in many respects. Point set topology is developed in the setting of general metric spaces as well as in Euclidean n-space, and two new chapters have been added on Lebesgue integration. The material on line integrals, vector analysis, and surface integrals has been deleted. The order of some chapters has been rearranged. many sections ha-ve been completely rewritten. and several new exercises have been added. The development of Lebesgue integration foDows the Riesz-Nagy approach which focuses dicectly on functions and their integrals and does not depend on measure theory. The treatment here is simplified. spread out, and somewhat rearranged for presentation at the undergraduate level. The first edition has been used in mathematics courses at a variety of levels, from first-year undergraduate to first-year graduate. both as a text and as supplementary reference. The second edition preserves this llexibility. For example, Chapters 1 throngh S, 12, and 13 provide a course in differential calculus of functions of one or more variables. Chapters 6 through 11, 14, and 15 provide a course in integration theory. Many other combinations are possible; individual instructors can choose topics to suit their needs by consulting the diagram on the next page, which displays the logical interdependence of the chapters. I would like to express my gratitude to the many people who have taken the trouble to write me about the first edition. Their comments and suggestions influenced the preparation of the second edition. Special thanks are due Dr. Charalambos Aliprantis who carefully read the entire manuscript and made numerous helpful suggestions. He also provided some of the new exercises. Finally, I would like to acknowledge my debt to the undergraduate students of CaItech whose enthusiasm for mathematics provided the original inc.entive for this work. pastJdena September 1973 T.M.A.
LOGICAl,
INTERDEPENDENCE
OF THE
CHAPTERS
CONTENTS
<lIapter
I The Real aJld Complex Number Systems 1.1 1.2 1.3 1.4 1.5 1.6 Introduction . The field axioms . The order axioms Geometric representation of real numbers Intervals . Integers The unique factorization theorem for integers Rational numbers Irrational numbers Upper bounds" maximum elemenr. least upper bound (supremum) . The completeness axiom Some properties of the supremum Properties of the integers deduced from the completeness axiom The Archimedean property or the real-number system . Rational numbers with finite decimal representlltion Finite decimal approximations to real numbers . Infinite decimal representation of real numbers Absolute values and the triangle inequality The Cauchy-Schwarz inequality Plus and minus infinity and the extended real number system R* Complex numbers Geometric representation of compleJ:: numbers The imaginaI)' unit . Absolute value of a complex number. Impossibility Qf ordering the complex numbers Complex exponentials Further properties of complex exponentials . The argument of a complex number . Integral powers and roots of complex numbers . Complex logarithms Complex powers Complex sines and cosines . Infinity and tbe extended complex plane c+ . Exercises .
is. 1
1 2
3 3
4 4 6
1.7
1.8 1.9 1.l0
12
7 8
9 9 JO 10 11 11 J2
12
tIl 1.12
1.13
1.14 1.15 1.16
IMPLICIT FUNCTIONS
AND EXTREMUM PROBLEMS
1.17
1.18 1.19 1.20 1.21 1.22 1.23 1.24 1.25 1.26 1.27 1.28 1.29 1.30 1.31 1.32
13 14 15 17 18
18
19 19 20 20 21 22 23
16
24
24 25
1.33
't·
:'~~
',\
II
eoat.ts Z Some BaRe: Notions of Set 'l1teOIry Introduction . Notations Ordered pairs Cartesian product of two sets Relations and functions Further terminology concerning functions One-to-one functions and inverses Composite functions. Sequences . Similar (equinumerous) sets Finite and infinite sets Countable and uncountable sets Uocouotability of the real-number system Set algebra Countable collections of countable sets Exercises .
0( Pomt
ea.taJIs 4.7 4.8 4.9 4.10 4.11 4.12 4.13 4.14 4.15 4.16 4.11 4.18 4.19 4.20 4.21 4.22 4.23 Limits of vector-valued fuoctions . Continuous functions Continuity of composite functions. Continuous complex-valued and vector-valued functions Examples of continuous functions Continuity and inverse ima.ges of open or dosed sets Functions continuous on compact sets Topological mappings (homeomorphisms) Bolzano's theorem Coacectedness Components of a metric space . Arcwise connectedness . Uniform continuity. Unifonn continuity and compact sets Fixed-point theorem for contractions Discontinuities of real-valued functions Monotonic functions Exercises •
Ii
Chapter
2.1 2.2 2.3 2.4 2.5 2.6 2.7 2.8 2.9 2.10
32 32 33
33
34
z.u
35 36 31 31 38
77 18 19 80 80 81 82 84 84 86 81
38
39 39 40 42 43
88
90 91 92 92 94 95
CltQteF 3 EJemenU
3.1 3.2 3.3
Set Topology 47 47 49 SO 52 52
53 54
ClJapter
5 DeriYatJves 5,1 Introduction . 5.2 Definition of derivative . 5.3 Derivatives and continuity . 5.4 Algebra of derivatives 5.S TIle chain rule 5.6 One-sided derivatives and infinite derivatives 5.1 Functions with nonzero derivative 5.8 Zero derivatives and local extrema 5.9 Rolle's theorem . 5.10 The Mean-Value Theorem for dmivatives 5.11 Intermediate-value theorem for derivatives 5.12 Taylor's formula with remainder 5.13 Derivatives of vector-valued functions 5.14 Partial derivatives S.Is Differentiation of functions of a complex variable 5.16 TIle Cauchy-Riemann equati~ Exercises .
Introduction . Euclidean space R- . Open balls and open sets in R" 3.• The structure of open sets in Rl 3.5 Closed sets 3.6 Adherent points. Accumwation points . 3.1 Closed sets and a.dbenint points 3.8 The Bobaao-Weierstrass theorem 3.9 The Cantor intersection theorem . 3.10 The Lindelof covering tbeorem 3.11 The Heine-Borel covering theorem 3.12 Compactness in R" 3~13 Metric spaces 3.14 Point set topology in metric spaces s 3.15 C-0111pact ubsets of a metric space 3.16 Boundary of a set Execcises . Cllaptcr
104104105
106 106
107
108 109
56 56 58 59 60
llO
61
63 64
6S
4
4.1 4.2 4.3
4.4
4.5 4.6
Introduction . Convergent seqUClJCeS in a metric: space Cauchy sequences Complete metric spaces . Limit of a function . Limits of complex-valued functions
10
1t}
CIlapler
F!lllctions of Bounded Variatioo and Rectifiable Curves Introduction . Properties of monotonic functions Functions of bounded variation Total variation Additive property of total variation 127 127 128 129
130
72
74 74 76
TotaJ variation on [a. xl as a function of x Functions of bounded variation expressed as the diffe£enCe of increa$ing functions . Continuous functions of bounded variation Curves and paths :Rectifiable paths and arc length Additive and continuity properties of arc length Equivalence of paths. Change of parameter Exercises .
Jntegnll
CbapUlr
7 The Riemaan-Stlelijes 7.1 7.2 7.3 7.4 7.5 1.6 7.7 7.8 7.9
1.10 7.11
7.12 7.13 7.14 7.15 7. [6 7.17 7.18 7.19 7.20 7.21 7.22 7.23 7.24 7.2S 7.26 7.27
Introduction. Notation. The definition of the Riemann-Stieltjes integral. Linear properties Integration by parts . Change of variable in a Riernann-Stieltjes integral Reduction to a Riemann integral. Step functions as integrators Reduction of a Riemaon-Stieltjes integral to a finite sum Euler's summation formula MonotonicallyiDcreasingintegrators. Upper andlowec integrals . Additive and linearity propertie, of upper and lower integrals. Riemann's condition Comparison theorems Integrators of bounded variation Sufficient conditions for existence of Riemann-Stieltjes integrals Necessary conditions for existence of Riemann-Stieltjes integrals. Mean Value Theorems for Riemann-Stieltjes integrals. The integral as a function of the interval. Second fundamental theorem of integral calculus Change of variable in a Riemann integra] Second Mean-Value Theorem for Riemann integrals Riemann-Stieltjes integrals depending on a parameter. Dilferentiation under the integral sign Interchanging the order of integration Lebesgue's criterion for existence of Riemann integrals Complex-valued Riemann-Stieltjes integrals . Exercises . Infinite Series and JllfInite Produets Introduction. Convergent and divergent sequences of complex numbers . Limit superior and limit inferior of a real-valued sequence Monotonic sequences of real numbers Infinite series.
140 141 141 142 144 144 145 147 148 149" ISO 1S3 IS3 155 156 159 160 160 161 162 163 165 167 167 169 173 174
8.6 Inserting and removing parentheses 8.1 Alternating series 8.8 Absolute and conditional convergence 8.9 Real and imaginary pam of a complex series 8.10 Tests for convergence of series with positive terms 8.11 The geometric series 8.12 The integral test. 8.13 The big oh and little oh notation 8.14 The ratio test and the root test 8.15 Dirichlet's test and Abel's test . 8.16 Partial sums of the geometric series E zIt on the unit circle 8.17 Rearrangements of series 8.18 Riemann's theorem on oonditionaUy convergent series . 8.19 Subseries. 8.20 Double sequences 8.21 Double series 8.22 Rearrangement theorem for double series 8.23 A sufficient condition for equality of iterated series. 8.24 Multiplication of series . 8.25 CesAro summabiJity . 8.26 Infinite products . 8.27 Euler's product for the Riemann zeta function Exercises .
Izl
=t
187 188 189 189 190 190 191 192 193 193 195 196 191 191 199 200 201 202 203 205 206
2(J9
210
9.1
.66
Chapter
9.17: 9.13 9.14 9.15 9.16 9.11 9.18 9.19 9.20 9.21
Pointwise convergence of sequences of functions Examples of sequences of real-valued functions. Definition of uniform convergence Uniform convergence and continuity. The cauchy condition for uniform convergence Uniform convergence of infinite series of functions. A space-filling curve Uniform convergence and Riemann-Stieltjes integration Nonuniformly convergent sequences that can be integrated term by term Uniform convergence and differentiation Sufficient conditions for uniform convergence of a series Uniform convergence and double Sequences. Mean convergence Power series . Multiplication of power series. The substitution theorem Reciprocal of a power series Real power series The Taylor's series generated by a function Bernstein's theorem . The binomial series .
218 219 220 221 222 213 224 225 226 228 230 231 232 234 231 238 239 240 241 242 244
:':. -:
m
9.22 9.23 Abers limit tbc:on:m Tauber's theorem
,\
,Of
~."..
Con __ :1"
Exercises •
Chlpter"tO
244 246
247
The ..........
IDtegra]
10.7
10.8
10.9 10.10 10.11 10.12 10.1l 10.14 10.lS 10.16 10.17 10.18 10.19 10.20 10.21 10.22 10.23 10.24
Introduction . The integral of a step function Monotonic sequences of step fUJlCtions Upper functioBli and their integrals Riemann-integrable functions as examples of upper functions The class of Lebesgue-integrable functions on a general ioterval Basic properties of the Lebesgue integral. Lebesgue integration and sets of measure zero The Levi monotone convergence theomns The Lebesgue dominated converpna: theorem Applications of Lebesgue's dominated convergence theorem Lebesgue inlf:grals on unbounded intervals as limits of integrals on bounded intervals Improper Riemann intcgrali Measurable functions Continuity of functions defined by Lebesgue integrals Differentiation under the integral sign Interchanging the order of integration Measurable sets OD the real line The Lebesgue integral over arbitrary subsets of R Lebesgue integrals of complex-valued functions . Inner products and norms . The set L2(1) of square-integrable functions • The set L2(1) as a semimctric space A wnvcrgcncc theorem for series of functions in L2(1) Exercises .
252 253
2S4
Sufficient conditions for conversence of II Fourier series at a particular point Cesaro summability of Fourier series • Consequences of Fejer's theorem . The Weierstrass approximation theorem Other forms of Fourier series . The Fourier integral theorem The exponential form of the Fourier integral theorem Integral transforms . Convolutions The convolution theorem for Fourier transforms The Poisson summation formula Exercises
MAltivariable
325
326
327 329
332 335
Chaptet" 12
12.1
DifferentiaJ Calculus
3M
279
281
283 287
289 291 292 293
294295
J2.13
12.14
295
297
298
Introduction . The directional derivative Directional derivatives and continuity The total derivative. The total derivative expressed in terms of partial derivatives An application to complex-valued functions. The matrix of a linear function The Jacobian matrix The chain rule Matrix form of the chain rule. The Mean-Value Theorem for differentiable functions A sufficient condition for differentiability A sufficient condition for equality of mixed partial derivatives Taylor's formula for functions from R" to Rl Exercises.
344
345 346 347 348 349
11.1 Introduction
11.4
115
ILIO
lUI
11.9
. Orthogonal systems of functions . TIle theorem on best approximation The Fourier series of a function relative to an orthonormal system . Properties or the Fourier coefficients . The Riesz-Fiscber theorem The convergence and representation problems for trigonometric series The Riemann-Lebesgue lemma The Dirichlet integrals . An integral representation for the partial sums of a Fourier series Riemann's localization theorem
306 306
307
13.1 Introduction . 13.2 Functions with nonzero Jacobian determinant 13.:1 The inverse function theorem 13.4 The implicit function theorem . 13.5 Extrema of real-valued functions of one variable 13.6 Extrema of real-valued functions of several variables 13.7 El\trcmum problems with side conditions Exercises .
Cbapter 14
367 368
372
373 375 376
380
384
llyii
The RiemaDn integral of a bounded function defuIed on a compact interval in R- . ... .. 14.4 Sets of measure zero and Lebesgue's criterion for existence of a multiple Riemann integral .. 14.5 Evaluation of a multiple integral by iterated integration 14.6 Jordan-measurable sets in R- . 14.7 Multiple integration over Jordan-measurable sets 14.8 Jordan content cx.J)I'es&ed 118 a Riemann integnd 14.9 Additive property of the Riemann integral . 14.10 Mean-Value Theorem for multiple integrals • Exercises .
14.3
389
391 391
The identity theorem for analytic functions . The maximum and minimum modulus of an analytic function The open mapping theorem Laurent expansions for functions analytic in an annulus
455
457 459
4(i()
singular
point
Chapter 15 Multiple J..ebesgae bte&mIs 15.1 Introduction. J 5.2 Step functiollll and their integrals . 15.3 Upper functioos and Lebesgue-intesmblc fUDctiOJlS 15.4 Measwablc functions and IJ:Jea$IJA.ble sets in R••... 15;5 Fubw's reduction theorem for the double integral ofa step function 1S.6 Some properties of sets of mc:asurc zero. •• 15.7 Fubini's reduction tbeoRIIl for double integrals 15.8 The TODClli-Hobson test for integrability 15.9 Coordinate transforma.t.ions. . 1s.t0 TIle transformation formula for multiple integnds. •..•. 15.11 Proor of the transformation formula for linear coordinate tramlfonna-
Counting zeros and poles in a region Evaluation of real-valued integrals by means of residues 16.24 16.25 Evaluation of Gauss's sum by residue calculus . 16.26 Application of the residue theorem to the inversion fonnula transforms 16.27 Conformal mappings
Exercises . IDdex of Special Symbols
468
410 412 481 485
405
406 406
407 409 411 413 415
Index .
CJ
416
421 421
tions
..
15.12 15.13
Exercises .
<hIpter 16 16.1 16.2 16.3 16.4
16.5
too
423 429
430
CaudlTs
CakuIoI
434
J6.1S
Contour integrals • . . •. The integral along a cin:ular path as a fum::tion of tile radilJ8 • atucby"s integral theorem for a circle Homotopic curves • Invarianee of contour integrals under homotopy GtDera1 form of atuchy's integral theorem • Cauchy's integral formula.. •••... The winding number of a circuit with mspcct to a point . .. The unboundedness of the set of points with winding number zero Analytic functions defined by contour integrals. Power-series expansiOIlll for analytic functions Cauchy's inequalities. LiouviUc's theorem . Isolation of the zeros of an analytic function
451
o CHAPTER 1
".
"
Several methods are used to introduce real numbers. One method starts with the positive integers I, 2, 3. . .. as undefined concepts and uses them to build a larger system, the positive rational numbers (quotients of positive integers), their negatives. and zero. The rational numbers, in tum. are then used to construct the irratiorwi numbers. real numbers like.,fi and ~"""hichan: not rational. The rational and irrational. numbers together constitute the real-number system. Although these matters are an important part of the foundations of mathematics, they will not be described in detail here. As a matter of fact, in most phases of analysis it is only the properties of real numbers that concern us, rather than the methods used to construct them. Therefore, we lhall talce the real numbers themselves as undefined objects satisfying certain axioms from which further properties will be derived. Since the reader is probably familiar with most of the properties of real numbers discussed in the next few pages, the presentation will be rather brief. Its purpose is to review the important features and persuade the reader that, if it were necessary to do so, all the properties could be traced back to the axioms. More detailed treatments can be found in the references at the end of this chapter. For convenience we usc some elementary set notation and terminology. Let S denote a set (a collection of objects). The notation XES means that the object x is in the set S, and we write x ¢ S to indicate that x is not in S. A set S is said to be a subset of T, and we write S !;; T, if every object in S is . also in T. A set is called 1I01U!mpty if it contains at least one object. We assume there exists a nonempty set R of objects, caned real numbers. which satisfy the ten axioms listed below. The axioms fall in a natural way into three groups which we refer to as the field axioms. the order axioms. and the completeness axiom (also cailCd the least-upper-bownd axiom or the axiom of continuity). 1.1 THE FIElD AXIOMS
'.
Along with the set R of real numbers we assume the existence of two operations. called addition and multiplication, such that for every pair of real numbers x and y
1
I
;
!~
As. 1
11L 1.1
the sum x + y and the product xy are real numbers uniquely determined by x and y satisfying the following axioms. (In the axioms that appear below, x, y, z represent arbitrary real numbers unless something is said to the oontrary.) A:ri(!m 1. x
(commutative laws).
z; x(yz) = (xy)z iassociatioe laws).
x < y < z,
Thus we have 2 s: 3 since 2 < 3; and 2 s 2 since 2 2. The symbol ~ is similarly used. A real number x is called nonnegative if x ~ O. A pair of ;imultaneous inequalities such as x < y, y < Z is usually written more brir fiy as The following theorem, which is a simple consequence is often used in proofs in analysis. Theorem 1.1. Gioen real numbers a and b such that of the foregoing axioms,
(distributive law).
Axiom 4. Given any two real numbers x and y, there exists a real number z such that x + z = y. This z is denoted by y -'x; the number x - x is denoted by O. (11 can be proved that 0 is independe1tt of x.) We write =x for 0 - x and call -x the
Then a
.s
for every t h.
> O.
(1)
negative 0/ x,
(a - b)/2 because
=
Q.
There exists at least one real number x #- O. If x and y are two real numbers with x #c 0, then there exists a real number z such that xz = y. This z is denoted by yix : the number xlx is denoted by 1 and can be shown. to be independent of x. We write X-I for l/x if x of. 0 and call x-1 the reciprocal of x.
A:ri(!m 5. From these axioms all the usual laws of arithmetic (-x) x, (X-t)-l x, -(x - y) = y x, x a more detailed explanation, see Reference 1.1.) 1.3 THE ORDER AXIOMS We also assume the existence of a relation < which establishes the real numbers and which satisfies the following axioms: A.xiom 6. Exactly
NOTE.
+z
+a- b
22.2
b<
+a
by Axiom 6 we must have a ,:s:; h. 10, the completeness axiom, will be described in Section 1.11.
1.4 GEOMETRIC
REPRESENTATION
OF REAL NUMBERS
an ordering among
< y,
>
y holds.
x > y means the same as y < x. 7. If x < y. then for every z we hove x
The real numbers are often represented geometrically as points on a line (called the real line or the real axis). A point is selected to represent 0 and another to represent I, as shown in Fig. 1.1. This choice determines the scale. Under an appropriate set of axioms for Euclidean geometry, each point on the real line corresponds to one and only one real number and, conversely, each real number is represented by one and only one point on the line. It is customary to refer to the point x rather than the point representing the real number x.
Axiom
+z <y +
z,
ex
Figure 1.1
II
Axiom 8. A.xiom 9.
If X >
0 and y and y
> O. >
Z,
then xy then x
>
O. The order relation has a simple geometric interpretation. If x < y, the point x lies to the left of the point y. as shown in Fig. 1.1. Positive numbers lie to the right of 0, and negative numbers to the left of O. If a < b, a point x satisfies the inequalities a < x < b if and only if x is between a and b. 1.5 INTERVALS The set of all points between a and b is called an interval. Sometimes it is important to distinguish between intervals which include their endpoints and intervals which do not.
NOTATION. jz,"
If x > y
> z,
NOTE. A real number x is called positive if x > 0, and negative if x < O. We denote by R + the set of all positive real numbers, and by R - the set of all negative real numbers.
From these axioms we can derive the usual rules for operating with inequalities. For example, if we have x < y, then xt < yz if z is positive, whereas xz > yz if z is negative. Also, if x > y and z > w where both y and w are positive, then xz > yw. (For a complete discussion of these rules see Reference L I.)
NOTE.
The symbolism
The notation {x: x satisfies all real numbers x which satisfy property
PJ
P.
the set of
..
(a, b) = {x: a
Def.l.2
n.1.7
a prime if n > 1 and if the only positive divisors of n are 1 and n. If n > 1 and n is not prime, then n is called composite. The integer 1 is neither prime nor composite. This section derives some elementary results on factorization of integers, culminating in the unique factorization theorem, also called the fundamental theorem of arithmetic. The fundamental theorem states that (I) every integer n > 1 can be represented as a product of prime factors, and (2) this factorization can be done in only one way. apart from the order of the factors. It is easy to prove part (I). Tlu!onm 105. Every integer n > I is either a prime or a product of primes. Proof We use induction on n. The theorem holds trivially for n = 2. Assume it is true for every integer k with 1 < k < n. If n is not prime it has a positive divisor d with 1 < d < n. Hence n = cd, where 1 < e < n. Since both c and dare «n, each is a prime or a product of primes; hence n is a product of primes. Before proving part (2). uniqueness of the factorization, we introduce some further concepts, If dla and dlb we say d is a common divisor of Q and b. The next theorem shows that every pair of integers a and b has a common divisor which is a tinear combination of a and b. 1'Iu!or_ 1.6. Every pair of integers a and b has a common dipuor d of the form d=ax+by
this d.
lk./illitio" 1.2. Assume a < b. The open interval (a, b) is d4ined to be the set
<
<
b}.
The closed interval [a, b] is the set {x: Q.':-:;; x ;S; b}. The half-open intervals (a. b] and [a, b) are similarly tkjined, using the inequalities a < x ;S; b and a :$ x < b, respectively. Infinite intervals are tkfined as follows: (Q,+oo) (-GO, a)
=
{x:x>a}, {x:x
{x:x
= {x:x:5:
~ a}, a}.
< a},
The real tine R is. sometimes referred to as the open interval (-00, +00). A single point is. considered as a "degenerate" dosed interval.
NOTE.
The symbols + GO and - GO are used here purely for convenience in notation and are Dot to be considered as being real numbers. Later we shall extend the real-number system to include these two symbols, but until this is done, the reader should understand that all real numbers are "finite."
1.6 INTEGEllS
This section describes the integers, a special subset of R. Before we define the integers it is convenient to introduce first the notion of an inductioe set.
De./illititm 1.3.
two properties.-
if it ho.J
the following
a) The number 1 is in the set. b) For every x in the set, the number x
+ 1 is also in
the set.
For example, R is an inductive set. So is the set R.... Now we shall define the positive integers to be those real numbers which belong to every inductive set.
De}i1litilnll.4. A real number is called a positive integer inductive set. The set of positive integers is tknoted by Z +.
if
it belongs to enery
The set Z + is itself an inductive set. It contains the number I. the number 1 + 1 (denoted by 2), the number 2 + I (denoted by 3), and so on. Since Z+ is a subset of every inductive set, we refer to Z+ as the smalJest inductive set. This. property of Z+ is sometimes called the principle of inductwn. We assume the reader is familiar with proofs by induction which are based on this principle, (See Reference 1.1.) Examples of such proofs are given in the next section. The negatives of the positive integers are called the negative integers. The positive integers, together with the negative integers and 0 (zero), form a set Z which we call simply the set of integers.
1.7 THE UNIQUE FACTORIZATION THEOREM FOR IN1EGERS
Proof First assume that a ~ O. b :2-. 0 and use induction on n a + h. If n = 0 then a = b = 0, and we can take d = 0 with x = y = O. Assume, then, that the theorem has been proved for 0, 1,2, ...• n 1. By symmetry, we can assume a ~ h. If b = 0 take d = a, x = I. Y = O. If b ~ 1 we can apply the induction hypothesis to a - b and b, since their sum is a n - b ~ PI - 1. Hence there is a common divisor d of a - band b of the form d = (a - h)x + by. This d also divides (a - b) + b = a, so d is a common divisor of a and band we have d = ax + (y - x)b, a linear combination a and b. To complete the proof we need to' show that every common divisor divides d. Since a common divisor divides a and b, it also divides the linear combination ax + (y - x)h = d. This completes the proof if a ~ 0 and b ~ O. If one or both of a and h is negative, apply the result just proved to lal and Ibl.
of
If nand d are integers and if n = cd for some integer c, we say d is a divisor of n; or n is a multiple of d, and we write din (read: d divides n). An integer n is called
NOTE. If d is a common divisor of a and b of the form d = ax + by, then -d IS also a divisor of the same form, - d = a( - x) + b( - y). Of these two common divisors, the nonnegative one is called the greatest common divisor of a and b; and is denoted by gcd(a, b) or, simply by (a, b). If (0. b) = l then a and b are said to be relatively prime.
I, then
ale.
Ii
Real ...
Th.U
1b. 1.11
Proof, Since (a, b) = I we can write 1 But alacx and albcy, so ale.
ax
+ by.
Therefore c
acx
bey.
p
Theorem 1.B. If a prime p divides ab, then pia or plb. More generally, divides a product a 1 ••• Ok. then p divides at least one of the fa(,;tors.
if a prime
;0;;
J"j, e,
Proof, Assume plab and that p does not divide a. If we prove that (p, a) 1, then Euclid's Lemma implies plb. Let d = (P. a). Then dip so d = 1 or d '" p. We cannot have d = p because dla but p does not divide 0. Hence d J. To prove the more general statement we use induction on k, the number of factors. Details are left to the reader.
Real numbers that are not rational are called irrational. For example, the numbers 'It" and e'" are irrational. Ordinarily it is not too easy to prove that some particular number is irrational. There is no simple proof, for example, of the irrationality of e". However, the irrationality of certain numbers such as../2 and";3 is not too difficult to establish and, in fact, we easily prove the following; Tlu!im!m 1.10. 1/!'I is a positive integer which is not a perfect square, then";;'
irrational.
is
TAeorellf 1.9(U";qfle/tldOTiUltio" theorem). Every integer n > I can be represented us If product of prime factors in only one way, apart from the order of the factors. Proof We use induction on n. The theorem is true for n "" 2. Assume, then, that it is true for all integers greater than I and less than n. If n is prime there is nothing more to prove. Therefore assume that n is composite and that n has two factorizations into prime factors, say
(2)
We wish to show that s = t and that each p equals some q, Since PI divides the product q!qz ... qf' it divides at least one factor. Relabel the q's if necessary so that pdq" Then PI = ql since both PI and qt are primes. In (2) we cancel PI on both sides to obtain n
PI
Proof. Suppose first that n contains no square factor> l. We assume that ..;;, is rational and obtain a contradiction. Let";; = alb, where a and b are integers having no factor in common. Then nb2 = 0,2 and, since the left side of this equation is a multiple of n, so too is a2• However. if cl is a multiple of n, a itself must be a multiple of n, since n has no square factors. > I. (This is easily seen by examining the factorization of a into its prime factors.) This means that a = en, where c is some integer. Then the equation nbz = 0,2 becomes nb2 = c2n2• or b2 nez. The same argument shows that b must also be a multiple of n. Thus a and b are both multiples of n, which contradicts the fact that they have no factor in common. This completes the proof if n has no square factor > I. If n has a square factor, we can write 11 = m2k. where k > I and k has no square factor> I. Then";; = mil.; and jf"/; were rational, the number ../k would also be rational, contradicting that which was just proved. A different type of argument is needed to prove that the number e is irrational. (We assume familiarity with the exponential e" from elementary calculus and its representation as an infinite serie~.) Tieorem 1.11. If e" number e is irrational. I
Since n is composite, I < nJpI < n; so by the induction hypothesis the two factorizations of nIp! are identical, apart from the order of the factors. Therefore the same is true in (2) and the proof is complete.
U RATIONAL NUMBERS
+ x + x2f2! +
xlJ3!
Quotients of integers afb (where b "'" 0) are called rational numbers. For example, 1/2, -1/5, and 6 are rational numbers. The set of rational numbers, which we denote by Q. contains Z as a subset. The reader should note that all the field axioms and the order axioms are satisfied by Q. We assume that the reader is familiar with certain elementary properties of rational numbers. For example, if a and b are rational, their average (a + b)/2 is also rational"and lies between a and b. Therefore between any two rational numbers there are infinitely many rational numbers, which implies that if we are given a certain rational number we Cannot speak of the "next largest" rational number.
Proof We shall prove that e-I is irrational. The series for e-1 is an alternating series with terms which decrease steadily in absolute value. In such an alternating series the error made by stopping at the nth term has the algebraic sign of the first neglected term and is less in absolute value than the first neglected term. Hence. if Sit = Ll=o (_l)c/k!. we have the inequality
O < e -[ from which we obtain O«2~-1)!(e-l for any integer k ;::::I. Now (2k rational, then we could-choose k
Slk-l
1 <--,
(2k)!
-Slk-I)<2~:::;;~' f)!
SQ S2i-l
(3)
is always an integer. If e-1 were large that (2k - I)! e-1 would also be an
•
NOrE.
Dfl.l.1l
1'11. 1.14
integer. Because of (3) the dift"erence of these two integers would be a number between 0 and i. which is impossible. Thus e-1 cannot be rational, and hence e cannot be rational.
For sets like the one in Example 3. which are bounded above but have no maximum element. there is a concept which takes the place of the maximum ele-ment. It is called the least upper bound or supremum of the set and is defined as
'It:
follows: De./iIdtion 1.13. Let S be a set of real numbers bounded aboDe. A real number b ir called a least upper bound for $ if it has the following two properties; a) b is an upper hound for S. b) No number less than b is an upper boundfor S.
&•..,...,.. S If [0, 1] the maximum element I ill also a least upper bound for S. If S = [0,1) the number 1 is a least upper bound for S, even though S has no maximum element. It is an easy exercise to prove that a set cannot haw two different least upper bounds. Therefore. if there is a least upper bound for S. there is only one and we can speak of the least upper bound. It is common practice to refer to the least upper bound of a set by the more concise term supremum, abbreviated sup. We shall adopt this convention and write
The ancient Greeks were aware of the existence of irrational numbers as early as 500 D.C. However, a satisfactory theory of such numbers was not developed until late in the nineteenth century, at which time three different theories were introduced by Cantor. Dedekind. and Weierstrass. For an account of the theories of Dedekind and Cantor and their equivalence. see Reference 1.6. LtO UPPER. BOUNDS, MAXIMUM
(SVPREMlJM)
ELEMENT,
Irrational numbers arise in algebra when we try to solve certain quadratic equations. For example, it is desirable to have a real number x such that = 2. From the nine axioms listed above we cannot prove that such an x exists in R. because these nine axioms are also satisfied by Q and we have shown that there is no rational number whose square is 2. The completeness: axiom allows us to introduce irrational numbers in the real-number system, and it gives the real-number system a property of continuity that is fundamental to many theorems in analysis. Before we describe the completeness axiom, it is convenient to introduce additional terminology and notation.
b
to indicate
supS
that b is the supremum of S. If S has a maximum element. then sup S. The greatest lower bound, or i"./imum of S. denoted by inf S, is defined in an analogous fashion.
max: S
set of real1rUl11bers. If there is a real number b such that x :s: b for every x in S, the" b is called an upper bound for S and we say that S is bounded aboDe by b.
I.II
We say an upper bound because every number greater than b will also be an upper bound. H an upper bound h is also a member of S. then b is called the largest member or the maximum element of S. There can be at most one such h. If it exists. we write b = max S. A set with no upper bound is said to be unhouruled above. Definitions of the terms lower bound, bounded below. smalJest member (or minimum element) can be similarly Connulated. If S has a minimum clement we denote it by min S.
ED...,...,.
A.xiom 10. Every nonempty set S of real monbers which is bounded above IuJs a supremum; that is, there is a real number b such that b = sup S.
As. a consequence
numbers
of this axiom it follows that every nonempty which is bounded below has an infimum. OF THE SUPREMUM
set of real
1.12 SOME PROPERTlFS This section discusses some useful in this text. There is the reader should formulate The fint property shows trariJy close to its supremum
I. TIle set R + "" (0, + 00) is unbounded above. It has no upper bounds and no mu· imum element. It is bounded below by 0 but has no minimum elooxot.
.z.
The closed. interval S = [0, 1] is bounded above by I and is bounded below by O. In fact, max S = 1 and min S ... O.
fundamental properties of the supremum that will be a corresponding set of properties of the infimum that for himself. that a set with a supremum contains numbers arbi.
3. The balf-open interval S [0, 1) is bounded above by 1 but it has no maximum element. Its minimum element is O.
T_rem 1.U(Approximatiollpropel'ty). Let S be a nonempty set o/real numbers with a supremum, say b = sup S, Then for every a < b there is some x in S such that a < x s; h.
to
Tb. US
Tb.UO
11
Proof. First of all, x !5: h for all x in S. If we had x !5: a for every x in S, then a would be an upper bound for S smaller than the least upper bound. Therefore x > a for at least one x in S. Tllunm 1.15 (Additir;e properly).
Given nonempty subsets A and B of R, let C denote the set
covered by a finite number of line segments of a given positive length, no matter how small.
Theorem 1.19. If x > 0 and integer n such that nx > y.
if
y is an arbitrary
c=
{x
+y =
:x
A,
Y E B}.
wrra
FINITE DECIMAL
REPRESENTATION
sup C
sup A
+ sup
B.
Proof. Let a = sup A, b = sup B. If Z E C then z = x + y, where x E A, Y E B, so z = x + y :::;; + h. Hence a + b is an upper bound for C, so C has a a supremum, say c = sup C, and c :::;; + b. We show next that a + b ~ c. a Choose any 6 > O. By Theorem 1.14 there is an x in A and a y in B such that and Adding these inequalities we find a
Thus,
0
1=60+-+ a1
10
1()2
aa + ... +-
10" '
aj !5:
b-
s < y.
where a(} is a nonnegative integer and a1> ... is usually written more briefty as follows:
, a~
9,
+b£
2s
<x +
y :::;; . c
+b<c+
2s for every
+ b s; c.
-=-=0.5
1 2
5 10
'
50
-=-
lal
=002 .•
- = 7+- +-
29 4
2 10
5 = 7.25. 10~
The proof of the next theorem is left as an exercise for the reader. Theorem 1.16 (Compariwnproperly). that 8 :..; t for every s tn S and t in T.
and Given nonempty subsets Sand T of R such If T has a supremum then S has a supremum
Real numbers like these are necessarily rational and, in fact, they all have the form r = a/1ft, where Q is an integer. However. not all rational numbers can be expressed with finite decimal representations. For example. if 1- could be so expressed, then we would have i = a/10" or 30 = lO" for some integer a. But this is impossible since 3 does not divide any power of 10.
1.1' FINITE DECIMAL APPROXIMATIONS TO REAL NUMBERS
z+ of positive
If Z+ were bounded above then Z+ would have a supremum, say a = sup Z+. By Theorem 1.14 we would have a - I < n for some n'in Z+. Then n + 1 > (.l for this n. Since n + I € Z + this contradicts the fact that a = sup Z +. TMoum I.IB. For every real x there is
0 positive
This section uses the completeness axiom to show that real numbers can be approximated to any desired degree of accuracy by rational numbers with finite decimal representations.
Tltevrem 1.10. Assume x ~ O. Then for every integer n ~ I there is a jinite
decimal r.
such that
integer
II
Proof: If this were not true, some x would be an upper bound for Z ", contradicting Theorem 1.17. 1.14 THE ARCHlMEDEAN PROPERTY OF THE REAL NUMBER SYSTEM
r~ s x <
r; +
1 10'"
The next theorem describes the Archimedean property of the real number system. Geometrically, it tells us that any line segment, no matter how long, can be
Proof Let S be the set of all nonnegative integers Sx. Then Sis nonempty, since 0 E S, and S is bounded above by x, Therefore S has a supremum, say ao = sup S. It is easily verified that ao E S, so aD is a nonnegative integer. We call ao the greatest integer in x, and we write llo = [x]. Clearly, we have
ao ~ x <
00
I.
u
Now let "1 = [lOx· - l~]. 5 lOx - lOuo = 100x the greatest integer in lOx - 10ao. 10, we have 0 ~ al :S 9 and lOao
n. 1.2.1
Since
ao) <
o. In other
words,
Q1
:S lOx -
<
Q1
1.
A fundamental inequality
Ixl
={
-x,
x,
if x :s:; O.
values is given in the following:
concerning
absolute
tlo
to
1.
More generally. having chosen a1> •••• largest integer satisfying the inequalities Qo
a .._1 with 0 :S a,
:s: 9,
+
t
:s:
10
til
<
a
0
10
01
+ ., . +
aM
10" .
(4)
I) and we have
Proof From the definition of lxi, we have the inequality -Ixl s: x ::s; ixl, since x = Ixl or x -Ixl. If we assume that Ixl :s: a; then we can write - a :s:; -Ixl :s:; x :s:; Ixl :s:; a and thus half of the theorem is proved. Conversely, let us assume -a :s:; x :s:; a. Then if x ~ 0, we have Ixl = x :s:; a, whereas if x < 0, we have [x] = -x :s a. In either case we have [x] :s:; a and the theorem is proved.
We can use this theorem to prove the triangle inequality.
< "" _ x -c r,
+ -, 1
to"
It is easy to verify that x is
where rIO = ao .alaz .. , a... This completes the proof. actually the supremum of the set of rational numbers 1.17 INFINI1E DECIMAL REPRESENTATIONS
r" rz•....
Ix + yl
The integers Qo. at> a2• • •• obtained in the proof of Theorem define an infinite decimal representation of x. We write
Ix
x
+ yj ~ Ixl + Iyl·
Ixl and -lyl:S: y :s:; Iyl. Addition gives us Iyl. and from Theorem 1.21 we conclude that
the theorem. For example, if we take
This
x
00 = 0,
ao'"la~'"
The triangle inequality is often used in other forms. a - , and y = , - b in Theorem 1,22 we find la -
to mean that a,. is the largest integer satilifyi.t18 (4). For example, if x = 1 find we a, = 1. a2 2, a3 = 5. and a" 0 for all n ~ 4. Therefore we can write i = 0.125(XX}···
hi ~
la -
c]
[c -
hi.
Taking
Also, y=
1.22 we have
Ixl ~ Ix
+ yl - Iyl·
b,
If we interchange the inequality signs :S and < in (4), we obtain a slightly ditJerept definition of decimal expansions. The finite decimals T.. satisfy T. < x :S '" + 10-" although the digits ao, a1> Oz, ••• need not be the same as those in (4). For example, ifwe apply this second definition to x = i we find the infinite decimal
representation .
Interchanging hence
0.124999'':''
The fact a real number might have two different decimal representations is merely a reflection of the fact that two different sets of real numbers can have the same supremum. 1.18 ABSOLUTE VALUES AND THE TRIANGLE INEQUALITY
mat
Calculations with inequalities arise quite frequently in ana1ysis. They are of particular importance in dealing with the notion of absolute value. If x is any real
inequality
14
Th.l.13 and
Del. t.26
IS
If
aj,
•.••
Q"
bj,
•••
b"
are
+00,
we haxe
X(-oo)
00.
c) If
Moreover, QlX + hie Proof
x<
0, then
-00, + (+00) + (-00)
if
if
and only
if
x(+oo)
+00. +00,
d) (+00)
(-00)
= =
(-00)(-00) -00.
k~t
L (a,x + bli
II
<x <
+00.
~0
for every real x, with equality if and only if each term is zero. This inequality can be written in the form
Ax:.!
NOTATION. WedenoteRby 00, +00) andR* by [-00, +00]' ThepointsinR are called "finite" to distinguish them from the "infinite" points + 00 and - 00.
where
A
2Bx
C 2 0,
Loi,
l~l
IfA> If A
s;
The principal reason for introducing the symbols + 00 and - 00 is one of convenience. For example, if we define + 00 to be the sup of a set of real numbers which is not bounded above, then every nonempty subset of R has a supremum in R*. The sup is finite if the set is bounded above and infinite if it is not bounded above. Similarly, we define - 00 to be the inf of any set of real numbers which is not bounded below. Then every nonempty subset of R has an inf in R*. For some of the later work concerned with limits. it is also convenient to introduce the following terminology. Definitioll 1.25. Every open interval (a, + co) is called a neighborhood of + 00 or a ball with center + 00. Every open interval ( - 00, a) is called a neighborhood of - 00 or a ball with center 00.
I.II COMPLEX NUMBERS
NOTE.
(B'W
where a = (aJ> ... , a,,), b
(b l'
... ,
~ lIall'"Ifbll1.,
h.) are two n·dimensional vectors, .
k=1
• L alA,
Iiall
Next we extend the real number system by adjoining two "ideal points" denoted + 00 and - 00 ("plus infinity" and "minus infinity").
It foliows, from the axioms governing the relation < that the square of a real number is never negative. Thus, for example, the elementary quadratic equation x2 = - I has no solution among the real numbers. New types of numbers, called complex numbers, have been introduced to provide solutions to such equations. It turns out that the introduction of complex numbers provides, at the same time, solutions to general algebraic equations of the form
Q(}
0,
/kfiwitWn 1.U. By the extended rea/number system R* we shall mean the set of real nllmbers R together with tWQ symbols + 00 and 00 which satisfy the followinq
properties .a) If x x
E
where the coefficients 00' Ot •••• , a. are arbitrary real numbers. (This fact is known as the FundamentaI_Theorem of A (qebra.) We shall now define complex. numbers and discuss them in further detail. De./illition 1.26. Bya complex number we shaJ/ mean an ordered pair of rea/numbers which we denote by (xl> x2). The first member, Xl' is called the real part of the
complex number; the second member, Xl. is called the imaginary part. Two complex numbers x = (x,; Xl) and y = (Yl, Y2) are called equal, and we write x = y, if,
R, then we have
+ (+ (0) = + 00,
o.
x+ 00) x-(-oo)::::
00,
+00,
16
D.l.n
17 convenient to think oftbe real number system as being a special case of the complex number system. and we agree to identify the complex number (x, 0) and the real number x. Therefore, we write x = (x. 0). In particular, 0 = (0, 0) and 1 = (1,0). 1.2Z GEOMETRIC REPRESENTATION OF COMPLEX NUMBERS
Yl and Xl
by
the equations
NOTE.
Tluwum 1.17. The operations of addiliOf1 and multiplication just defined satisfy the commutative, associatioe, and distributive laws.
Proof We prove only the distributive law; proofs of the others are simpler. x = (x,; x2),! = (Yl> Y2), and z = (z,. 22). then we have x{y
If"
Just as real numbers are represented geometrically by points on a line, so complex numbers are represented by points in a plane. The complex number x = (XI> X2) can be thought of as the "point" with coordinates (x 1> x2). When this is done, the definition of addition amounts to addition by the parallelogram law. (See Fig. 1.2.)
z)
= =
(Xb
X2)(}'1
(XI}'I (XlYI
xy
+ XI.
+ ZI> Fz + Z2) + XIZI X2Yl - X2Z2, XI Y2 + XII2 + XlYI + XlII) - X2Y,., X1Yl + X2Y.) + (XlIt - XlZl, XII2 + X2I,) =
TltHum 1.18.
(X1o
Xl)
+ (0.0) =
0) =
(XI>
x2),
(x I> X2)(0, 0)
(XI> X2)
(0, 0),
-X2)
(Xl' X1)(I,
(Xl' Xl),
(-Xl>
= (0,0).
of
Figure 1.2
Proof The proofs here are immediate Theorems 1.29, 1.30, 1.32, and 1.33.
TIIeonm 1.19. Given two complex numbers x = (x I' x2) and Y = (y h Y2). there exists a complex number z such that X + I = y. Infact, Z = (YI Xh Y2 X2)' This Z is denoted by y - x. The complex number (-x., -x2) is denoted by -x. TllloulIIlJO. /JejiIIitio1II.Jl.
X-l
= -(xy)
{-I, OXxy}.
[xl/(~
+ xi),
If x
x2)
-X2/(X;
+ xin.
The idea of expressing complex numbers geometrically as points on a plane was formulated by Gauss in his dissertation in 1799 and, independently, by Argand in 1806. Gauss later coined the somewhat unfortunate phrase "complex number." Other geometric interpretations of complex numbers are possible. Instead of using points on a plane, we can use points on other surfaces. Riemann found the sphere particularly convenient for this purpose. Points ofthe sphere are projected from the North Pole onto the tangent plane at the South Pole and thus there corresponds to each point of the plane a definite point of the sphere, With the exception of the North Pole itself. each point of the sphere corresponds to exactly one point of the plane. This correspondence is called a stereographic projectwn. (See Fig. 1.3.)
TIteoUllt IJ1. If x and r are complex numbers with complex number z :ruch that xz y, namely, Z = yx-l•
Of special interest part is O. are operations with complex
numbers
TIteoum 1.33.
(XI' 0)
+ (Yh
0)
(Xl
+ Y,. 0),
a},
O)/(y,> 0) =
(xt/r"
NOTE. It is evident from Theorem J.33 that we can perform arithmetic operations on complex numbers with zero imaginary part by performing the u~iial real-number operations on the real parts alone. Hence the complex numbers of the form (x, 0) have the same arithmetic properties as the real numbers. For this reason it is
Figure 1.3
i.
1.23 THE IMAGINARY UNIT
Del. 1.34
Del. 1.40
.,
[x
Xl) as a two-dimensional vector with components XI and X2' Adding two complex numbers by means of Definition 1.26 is then the same as adding two vectors component by component. The complex number I = (1,0) plays the same role as a unit vector in the horizontal direction. The analog of a unit vector in the vertical direction will now be introduced.
+ yj
:::; Ixl
+ Iyl
(triangle inequality).
The proof is left as an exercise for the reader. 1.25 IMPOSSIBILiTY OF ORDERING THE COMPLEX NUMBERS
lkjbtititnll..3l.
inary Imit.
1'IIeonm 1..35. Every complex number x = (Xl' x,) can be represented in the form
= XI
Proof.
The next theorem te1ls us that the complex number t provides us with a solution to the equation xl = -1. TllMnm 1.36. i Z Proof.
L
= (0, 1)(0. I) = (-1.0) = -I. VALUE OF A COMPLEX. NUMBER value to the complex number system.
1.24 ABSOLUIE
IhforitU»t 1.37.. If x =
I
,
As yet we have not defined a relation of the form x < y if x and y are arbitrary complex numbers, for the reason that it is impossible to give a definition of -c for complex numbers which will have all the properties in Axioms 6 through 8. To illustrate, suppose we were able to define an order relation < satisfying Axioms 6. 7, and 8. Then, since j ¢ 0, we mUlit have either i > 0 or i < 0, by Axiom 6. Let us assume i > O. Then taking, x = y = i in Axiom 8, we get ;2 > 0, or - I > O. Adding 1 to both sides (Axiom 7), we get 0 > L On the other hand, applying Axiom 8 to -1 > 0 we find I > O. Thus we have both 0 > 1 .and I > 0, which. by Axiom 6, is impossible. Hence the assumption i > 0 leads us to a contradiction. [Why was the inequality - 1 > 0 not already a contradiction '/] A similar argument shows that we cannot have j < O. Hence the complex numbers cannot be ordered in such a way that Axioms 6. 7, and 8 will be satisfied. 1.26 COMPLEX EXPONENTIALS The exponential e" (x real) was mentioned earlier. We now wish to define e" when z is a complex number in such a way that the principal properties of the real exponential function wiD be preserved. The main properties of e" for real are the law of exponents, trll!"l = e"1+;r1,and the equation e{) = I. We shall give a definition of e" for complex z which preserves these properties and reduces to the ordinary exponential when z is real. If we write z = x + iy (x, y real). then for the law of exponents to hold we want.l!"+ il = e"eiy• It remains, therefore. to define what we shall mean by eb•
Ixl 1honm1.38.
i) 1(0,0)1 iii)
Jxi +.G.
ii)
+ O.
Ixyl = Ixllyl.
0)1= Ixd.
lx/yl
Ixi/lyl, if Y + O.
iv) I(xl>
e"
Proof. Statements (i) and (iv) are immediate. To prove (ii), we write x = XI + ixz• Y = YI + i)'z. so that xy = X1Y1 - XIYZ + i(x1yz + xZYI)' Statement (ii)
follows from the relation Ixyl2 = Equation
This definition" agrees with the real exponential function when is, Y = 0). We prove next that the law of exponents still bolds.
is real (that
= (xt
Geometrically, Ixl represents the length of the segment joining the origin to the point x. More generally, [x - yl is the distance between the points X and y. Using this geometric interpretation, the following theorem states that one side of a triangle is less than the sum of the other two sides.
• Several III'gUments can be gi\len to motivate the equation elF = cos y + j sin y. For example, let us write el~ = f(y) + ig(y) try to determine the real-valued functionsf and 9 so that the usual rules of operating with real exponentials will also apply to complex exponentials. Formal dlfferentiation yields e" = g'(y) - if'(y). if we assume that (et,y = te", Comparing the two expressions for .ly. we see that/and g must satisfy the equalions/(y) g'(y),f'(y) -g(y). Elimination of 9 yieldsf{y) -f'(y). Since we want eO I, we must havef(O) "" I and/,(O) O. It follows thatf(y) ... cos y and g(y) "" - f'(y) '" sin y. or course, this argument proves nothing, but it strongly suggests that the definition e'Y = COli Y + f-sin y is reasonable.
ana
'l'b. 1.41
Del. 1M
11
then we have
PrI)Qf,
of'
= e'"l(COS Y1 +
i sin Yl).
e'l
= e'"l{cos Y1
i sin Y2),
The two numbers r and I} uniquely determine z, Conversely, the positive number r is uniquely determined by z; in fact. r = [z]. However, z determines the angle 8 only up to multiples of 211:.There are infinitely many values of fJ which satisfy the equations x = [z ] cos 0, Y = [r] sin 0 but, of course, any two of them differ by some multiple of 211:.Each such (}is called an argument of z but one of these values is singled out and is called the principol argument of z, DejUritiQlf 1.46. Let z = x + iy be a nonzero complex number. number (} which satisfies the conditions
e'Jt!1.
e""c'[cos y, cos Y2
+
Now e"'e" and
;(oos Yl sin Y2
Xl
+
.1£'2
= [z ]
cos 9,
[z] sin 6,
-n <
(l ~ +1t
and
arg (z).
sin Y. cos Y2
sin (YI
+ h),
1.2.7 FURTHER PROPERTIES OF COMPLEX EXPONENTIALS In the following theorems, Tileorem 1.42.
I
."
immediately
reil, where r
NOTE.
This method of representing complex numbers is particularly connection with multiplication and division, since we have and
z,
Z,'
Z2
e is never
eO
zero. be zero.
Proof, e'e:"
=
x
Tlleorem 1.48. If
where
ZIZl
(%1)
+ 21tn(ZJ'
+n,
Z2)'
lei" I
I.
Proof,
!eIX~2 =
C(lS2
+ sin2
+ arg + arg
+ aeg (Z2)
S; -ll:.
2n.
= I if, and
If z
e'
cos (2nn)
+ i sin
Conversely, suppose that e' = I. This means that e" C(lS Y = I and e'" sin y = O. Since e" #- 0, we must have sin Y = 0, y hr., where k is an integer. But cos- (kn) = (-] 1. Hence e" = (-1)"', since e'" cos (kJI) = l. Since e" > 0, k must be even. Therefore e" 1 and hence x O. This proves the theorem.
Proof, Write Z1 = Izde'41" Z2 = IZ2Ie"" where 61 = arg (ZI) and (;}l = arg (%2)' Then ZlZ2 = IZ1z11/(f,'HI... Since -1[ < 91 :s;; +n and -71: < 82 ~ +)'1:. we have -21t < 81 + 61 :s: 2n. Hence there is an integer n such that -n < fJ, + 61 + 2mr ~ n. This n is the same as the integer nCz., Z2) given in the theorem, and for this n we have arg (ZIZ2) = fll + 92 + 21111. This proves the theorem.
1.29 INTEGRAL POWERS AND ROOTS OF COMPLEX NUMBERS Defolitiml 1.49. Given a complex number z and an integer of z as follows ..
/1,
c'
if;
Zl -
Zl ""
e'
-z, = I.
if n ~
o,
and n
if z #- 0
Theorem
> O.
If the point z
can
= (x, y) =
write x =r
iy is represented by polar coordinates rand 6, we cos 6 and y = r sin 6, so that z = r cos 6 + ir sin 8 = re'"
by mathematical
1.50, which states that the usual laws of exponents hold, can be proved induction. The proof is left as an exercise.
TiIetwem 1,s0.
1
II
[)ef..l.55
13
and
Tkorem 1.5/. Ilz"'_ 0, and if n is a positive integer. then there are exactly distinct complex numbers zo, zl> ... , %._1 (eo/Jed the nth roots olz), $lien. that
TIu!orem 1.52. If z is a complex number #0, then rhf're exist complex numbers such that e" = 2. One such w is the complex number
'III
log Izi
+ i arg(z}, + 2mri,
z, But if
z:
z,
for each k
0, I, 2, ...•
II -
l. where n is an integer.
Furthermore,
log [z]
Proof. Since el"'1"1+''''''(2)
i arg (z)
fbI<
NOTE.
arg (z)
II
+ 2nk
11
(k = 0, 1.2 •...•
11 -
l~
log [s] + i arg (z) is a solution of the equation eW solution, then e" = e"" and hence w - WI = 2J17ri.
e",I<Ie'"rj:(%.
The n nth roots of z are equally spaced on the circle ofradius R center at the origin.
Proof.
DejiaitiOIt
!zll/.,
1.53. Let z ::J:. 0 be a giLlen complex number. If w is a complex number = z, then w is called a logarithm of z, The partiCular !JOlueof w gjven
'III
S k :5: 11 -
Jog [z]
+ iarg
(z)
We must now show that there are no other nth roots of z, Suppose w = Ae'" is a complex number such that IV' z. Then Iwlft = Irl. and hence A" = [z], A = Izl"-. Therefore. w" = z can be written el"" = eila'1l«)]. which implies
1IiX -
Log z.
=
1. Since 1. Since
Iii =
1- il
arg (z)
21£k
1 andarg(-I)
=
Log (-I)
==
Hence If = [arg (z) + 2l1'k]/n. But when k runs through all integral values, w takes only the distinct values zo •...• ZR-l' (See Fig. 1.4.)
II + il
log x. since
Ixl
+ i)
+ i)
'"
log"/i + i1t/4.
22)'
Tlleorem lSI. If
21Z2
# 0, then
= Log z,
Log
Log
%2
+' 2nin(zl>
1.~8.
Log (zlzz)
= =
log log
IZlz21
+ t arg
(Z122)
(2t)
+ arg
(Z::)
2mr(zl' z~)].
Figure 1.4
•.3.
COMPLEX
POWERS
1.30 COMPLEX LOGARITHMS By Theorem 1.42. e" is never zero. It is natural to ask if there are other values that e" cannot assume. The next theorem shows that zero is the only exceptional value.
Using complex logarithms. we can now give a definition of complex powers of complex numbers.
Dejillition 1.55. If z '" 0 and ifw is any complex number, we define
111. 1.56
EumpJes 1. it = e1LofI =
etU'"/~) = e-'lt/2.
01"(1.)
2. (_1)1 = e'LotlI-1l =
= e-1t•
3. If /J is an integer. then z ·+! e"'+ ])LotI. = not conflict with Definition 1.49.
e"Lof'ros"
= :"z,
SO
The next two theorems give rules for calcu1ating with complex powers: 7'INNHrIm 1.56. z .. , z ..,
, I
I
Def.l.Cil
b) III. c) so
e C, but z
= a»,
00 = (ooXaJ)
00.
DejUritiotJ 1.61. Every set in C olthi!form hood of 00, or a ball with center at 00.
zlP, ",.. ,
if z oF
O.
.1
I .
",
X2)
(%1%2)'"
ewlA>tI(x,.,)
e...' ......Zl+IAIZ>+Z
.. I«(.".>U.
sin z
TIttoran 1.59, If
The reader may wonder wby two symbols, +C(l and co, are adjoined to R but only one symbol, oo, is adjoined to C. The answer lies in the fact that there is an ordering relation -c among the real numbers, but no such relation occurs among the complex. numbers. In order that certain properties of real numbers involving the relation < hold without exception, we need two symbols, + 00 and - 00, as defined above. We have already mentioned that in R* every nonempty set has a sup, for example. In C it turns out to be more convenient to have just one ideal point. By way of illustration, let us recall the stereographic projection which establishes a oneto-one correspondence between the points of the complex plane and those points on the surface of the sphere distinct from the North Pole. The apparent exception at the North Pole can be removed by regarding it as the geometric representative of the ideal point 00. We then get a one-to-one correspondence between the extended. complex plane C* and the total surface of the sphere. It is geometrically evident that if the South Pole is placed on the origin of the complex plane, the exterior of a "large" circle in the plane will correspond. by stereographic projection, to a "small" spherical cap about the North Pole. This illustrates vividly why we have defined a neighborhood of CQ by an inequality of the form [z 1 > r.
=
cos z
cosh)' y
Pr(H)/
2cosz
sin z
=
EXERCISES Irrtegers 1.1 Prove tbat there is no largest prime. (A proof was known to Euclid.) 1.2 If n is a positive integer, prove the algebraic identity
= e-'(oos x + i sin x) + eY(cos x - i sin x) = cos x(e' + e-J') - t sin x(eY - e-,) = 2 cos x cosh Y 21 sin x sinh y. is similar.
eiz
I.
properties
1.33 INFINITY AND TIlE EXTENDED COMPLEX PLANE C. Next we extend the complex number system by adjoining the symbol CQ. an ideal point denoted by
1.3 If 2" I is prime, prove that n is prime. A prime of the form 2" - I, where p is prime, is called a Mersenne prime. 1.4 If 2~ + 1 is prime, prove that n is a power of 2. A prime of the form 22m called a Fermat prime. Him. Use Exercise 1.2.
Xw 1
I is
IhjillititJII1.60. By the extended complex number system C· we slwll mean the complex plane C along with a zymbol 00 which satisfies the folloWing properties: a) If z
E
1.5 The Fibonacci "umbers I. I. 2. 3, 5, 8, 13, ... are defined by the recursion formula + = X" + X~-h with x] = Xl = 1. Prove that (xn' X~+l) = I and that x" = (a" - b-)/(a - b}, where a and b are the roots of the quadratic equation x2 - x-I = O. 1.6 Prove that every nonempty set of positive integers contains a smallest member. This is called the welf-orde,ing principle.
C, then we halJe Z
(J)
z-
<::IJ =
co, z/oo
O.
26 Ratioml and irrational IIlIDIben 1.7 Find the rational number whose decimal expansion is O.3~44444 ... 1.8 Prove that the decimal expansion of x will end in zeros (or in nines) if. and only if, x is a rational number whose denominator is of the form 2"5"', where m and n are nonnegative integers. 1.9 Prove that 1.22 Given x :> 0 and an integer k 2: 2. Let QI) denote the largest integer ~x and, assuming that alb a1, ... , 0._1 have been defined, let Q" denote the largest integer such that al 01 a Qo + - + -:; + ... + ....!! ,.; x. k k" k" a) Prove that 0 ~ Q, ~ k - 1 for each i = 1, 2, ... b) Let r,. = ao + alk-I + il2k-2 + ... + a,/(.-· and show that x is the sup of the set of rational numbers '1' '2.' •
NOTE.
1.10 If a. b. c, d are rational and if x is irrational, prove that (ax irrational. When do exceptions occur? 1.11 Given any real x 1.11 If alb < c:/d with b > 0, d andc/d.
:>
h)j(cx
d) is usually
«(I + c)/(h +
When k = 10 the integers a{h a,. a2 •... are the digits in a decimal representation of x. For general k they provide a representation in the scale of k .
1.13 Let a and b be positive integers. Prove that "./2 always lies between the two fractions alb and (a + 2b)/(a + b). Which fraction is closer to ·'/i.1 1.14 Prove that
.J n
-I
Jn +
2:
I. Note that this identity implies the Cauchy-Schwarz 1.24 Prove that for arbitrary real
01.,
1.tS Given a real x and an integer N:> I. prove that there exist integers h and k with o < k: ~ N such that Ih - hi < lIN. Hint. Consider the N + 1 numbers tx - [tx] for t 0, 1,2, ... , N and show that some pair differs by at most lIN. l.t6 If x is irrational prove that there are infinitely many rational numbers hlk with k :> 0 such thai Ix - h/k1 < 1/k2• Hint. Assume there are only a finIte number h,/kl' ... , h.lk. and obtain a contradiction by applying Exercise 1.15 with N:> Ifo, where Ii is the smallest of the numbers Ix - hdkll. 1.11 Let x be a positive rational number of the form x
ls.k<.J~JI.
(Utbj -
ajblt)".
inequality.
b., c~ we have
l=1
• :E~, k!
(t
•••
(Ok
bt}2)
This
II ~
is the
(a1o
•••
where each Uk is a nonnegative integer with a.. ~ k - I for k 2: 2 and a" > O. Let [x] denote the greatest integer in x. Prove that UI "" [x], that ll" = [k! x] - k[(k - J)! x] for k = 2, ... , n, and that n is the smallest integer such that n! x is an integer. Conversely, show that every positive rational number x can be expressed in this form in one and only one way.
Uppe£
lIall
1.26 If al
~
a~)
2:
112
02
;::
an and hi
:2:
b2
:2: •••
bound!>
1.18 Show that the sup and inf of a set are uniquely determined whenever they exist. 1.1' Find the sup and inf of each of the following sets of real numbers: a) All numbers of the form 2 -p + 3 - .. integer values. b) S e) S
=>
{x: 3x2
lOx
a)(x
3 < OJ.
c)(x d) < OJ,
Complex nlUl1bers 1.27 Express the following complex numbers in the form a
a) (1 c)
j5
{x : (x
b)(x
where
<
b<c
<
d.
+
+
i)J,
;16,
b) (2
d)
1.20 Prove the comparison property for suprema (Theorem 1.16). 1.21 Let A and B be two sets of positive numbers bounded above, and let a = sup A, b "" sup B. Let C be the set of all products of the form xy, where x E A and Y E B. Prove that ab = sup C.
1.28 In each case, determine all real x and y which satisfy the given relation. a) x
+ iy
Ix
iy[,
b) x
iy
(.y _ iy)',
l~O
:E i"
100
iy,
NIIIIIber Systems
ZI <:
%2,
d)
+ Z2 +z=
il
zz.
iliJ"
or
Which of Axioms 6, 1, 8, 9 are satisfied by this relation 1 1.37 Which of Axioms 6, 7, 8,9 are satisfied if the pseudo-ordering is defined as follows i We say (XI' Y1) < (x2, fz) if we have either
e) (x - i)f; = twice the imaginary part of z. 1.39 Describe geometrically the set of complex numbers z which satisfies each of the following: conditions:
a) tzl ~
d) z
I, I,
b)
e)
Ixl
z-
< 1,
c) Izl ~
f) I
1,
= Iz12,
or
1.38 State and prove a theorem analogous to Theorem 1.48, expressing arg (zl/I2) in terms of arg (z 1) and ars (Z2)' 1..39 State and prove II. theorem analogous to Theorem 1.54, expressing Log (ZI/Z2) in terms of Log (Il) and Log (Z2)' 1.40 Prove that the nth roots of 1 (also called the nth roots of unity) are given by «, 1lf2. ••••• fi', where Of = ,,2.1/., and show that the roots # 1 satisfy the equation
t
i""
= i,
1.31 Given three complex numbers II' Z2. Z;3 such that IZII = Izzl = IZ31 = I and ZI + Zz + Z;l, = O. Show that these numbers are vertices of an equilateral triangle inscribed in the unit circle with center at the origin. 1.32: If
II
a) la -
W s (I + lal2)(1 + IW)·
lal + Ibl if, and
111 - bl
=
b} If 11 # 0, then 111+ hi =
;(2
+ ... +
x"-1
o.
leas z I
..... UlJ.
II - 001
1.41 a) Prove that Izil -c e« for all complex z l' O. b) Prove that there is no constant M > 0 such tbat
lal =
I or
Ibl
la - bl < II -
libl
1.41 If w =
II
= e"1oI1o:1-"..'I(.lel[
• ..,.I.I+
1.34 If a and c are real constants, b complex, show that the equation
1.43 a} Prove that Log (ZW) "" w Log I b) Prove that (z",)", ~ z-e2" 1.44
i) If (J and
Q
mill, where
11
is an integer. that
j
azi
+ bZ + liz + c
=0
(a #- 0, Z ... x
iy)
.....
where n is an integer.
represents a circle in the .o-plane, 1.35 Recall the definition of the inverse tangent: given a real number t, tan-1 (t) is the unique real number 0 which satisfies the two conditions tan 0 = t, If I
"'" X
+ It. prove
+ i sin Of
-= cos (aU)
sin (00),
Show that, in general, the restriction -7t < ()~ +n i$ necessary in (i) by taking ()= -Jr, a = i. iii) If a is an integer, show thai the fonnula in (i) holds without any restriction on 0, In this case it is known as DeMoivre's theorem,
ji)
1.45 Use DeMoivre's theorem (Exercise 1.44) to derive the trigonometric identities sin 3(} ... 3 eas2 0 sin () - sin:! fJ, if x>
0,
a) arg (z)
cos 30
2:
e,
+
iy, we have
tan-
(~)
11",
if
x < 0, y
0,
c) arg (z)
tan-
(~)
Ir,
ifx<O,y<O,
d) arg (z)
= ~ if x =
2
=-
'J! if 2
x = 0, y < O.
z=x
1.47 Lei w be a given complex number. If w # ±], show that there exist two values of + iy satisfying the conditions cos z = wand - II: < X S + ft. Find these values when w = iand when w "" 2.
NIIIIIber
Systems
31
I~
i;;
(J -
].6 Hobson, E. W., The Theoryof Functions of a Real Variable and the Theory of Fourier's Series, Vol. I, 3rd ed, Cambr-idge University Press, 1927.
Ib,J2
l:<6tSD
Ia.Oj -
I'lJ~12.
1.7 Landau, E., FOlllfdiJtionsof Analysis, 2nd ed, Chelsea, New York, 1960. 1.8 Robinson, A.• N(}n-tllandard Analysis. North-HoUand. Amsterdam, 1966. 1.9 Thurston, H. A., The Number System. Blackie, London, 1956.
Use this to deduce a Cauchy-Schwarz inequality for complex numbers. ].49 a) By equating ilrulginary parts in DeMolvre's formula prove that sin nO b} If 0 <
(J =
sin~ 8 {(;)
cot"-1
(;)
COt'-3 8
+ (;)
cot"-5 8 -
+ ... } .
1.10 Wilder, R. L., Introduction to the Foundation;; of Mathematics, 2nd ed, Wiley, New York,I965.
em 1+ I)
x'" -
em 3+ 1)
7ft _ 2m
x"'-l
em 5+ 1)
x"'-3 -
+ ... +
I)}
Use this to show that P", has zeros at them distinct points x; = cot2 {n:k/(2m for k = I, 2, ... , m. c) Show that the sum of the zeros of Pm is given by
i: cor' _
+
1
knl
= m(2m - 1) , 3
m(2m -
1)(4m1
+
1l 2
10m -
9)
2m
45 j6andI::'
I
These identities can be used to prove thatL~l n-2 (See Exercises 8.46 and 8.47.)
n-4
...
1f4./90.
11;=1 (2
0-1
k~1
. SIU-
kt: n
n = -2"-1
for- n
2::
2.
SUGGESlED
REFERENCES
1.2 Birkhoff, G .• and MacLane, S., A Survey of Modern. Algebra, 3rd ed. Macmillan, New York, 1965. 1.3 Cohen, L., and Ehrlich, G., The Structure of the Real-Number System. trand,.Princeton, 1%3. 1.4 Gleason, A., Fundamentals of Abstract Analysis. Addison-Wesley, Van Nos1966,
Reading,
1.5 Hardy, G. H., A Course of Pure Math"malics, 10th ed. Cambridge University Press, 1952.
Dd'.l.l
CHAPTER 2
11 INTRODUcnON
In discussing any branch of mathematics it is helpful to use the notation and terminology of set theory. This subject, which was developed by Doole and Cantor in the latter part of the 19th century. has had a profound influence on the development of mathematics in the 20th century. It has unified many seemingly disconnected ideas and has helped reduce many mathematical concepts to their logical foundations in an elegant and systematic way. We shan not attempt a systematic treatment of the theory of sets but shall confine ourselves to a discussion of some of the more basic concepts. The reader who wishes to explore the subject further can consult the references at the end of this chapter. A collection of objects viewed as a single entity will be referred to as a set. The objects in the collection will be called elements or members of the set. and they will be said to belong to or to be ccmtai"ed in the set. The set. in tum, will be said to contain or to be composed of its elements. For the most part we shall be interested in sets of mathematical objects; that is, sets of numbers, points, functions, curves, etc. However, since much of the theory of sets does not depend on the nature of the individual objects in the collection, we gain a great economy of thought by discussing sets whose elements may be objects of any kind. It is because of this quality of generality that the theory of sets has had such a strong effect in furthering tbe development of mathematics.
l.2 NOTADQNS
I
I 4
by 4, namely. {4, 8}, is-a subset of the set of even integen less than 10. In general, we say that a set A is a subset of B, and we write A !;;; B whenever every element of .A also belongs to B. The statement A. 6 B does not rule out the possibility that B E A. In fact, we have both A. £; Band B £: A if, and only if, A and B have the same elements. In this case we shall call the sets A. and B equal and we write A = B. If A and B are not equal, we write A ,p B. H A £: B but A. "" B. then we say that A is a proper subset of B. It is convenient to consider the possibility of a set which contains no elements whatever; this set is called the empty set and we agree to call it a subset of every set. The reader may find it helpful to picture a set as a box containing certain objects. its elements. The empty set is then an empty box. We denote tJiC empty set by the symbol 0.
Suppose we have a set consisting of two elements a and b; that is, the set {a. b}. By our definition of equality this set is the same as the set o}, since no question of order is involved. However, it is also necessary to consider sets of two elements in which order is important. For example. tn analytic geometry of the plane, the coordinates (x, y) of a point represent an ordered pair of numbers. The point (3, 4) is different from the point (4. 1), whereas the set {3. 4} is the same as the set {4, 3}. When we wish to consider a set of two elements a and b as being ordered, we shall enclose the elements in parentheses: (a, b). Then a is called the :firstelement and b the second. It is possible to give a purely set-theoretic definition of the concept of an ordered pair of objects (a, b). One such definition is tbe following:
rh.
lkforititM2.1.
(a, h)
This definition states that (a, b) is a set containing two elements, {a} and {a, b}. Using this definition, we can prove the following theorem: Tlteore",2.2. (a, b)
This theorem shows that Definition 2.1 is a "reasonable" definition of an ordered pair, in the sense that the object a has been distinguished from the object h. The proof of Theorem 2.2 will be an instructive exercise for the reader. (See Exercise 2.1.)
and elements by lower-case letters: 0, b, c•... , x, y, z, We write XES to mean "x is an element of S," or "x belongs to S." If x does not belong to S, we write x f S. We sometimes designate sets by displaying the elements in braces; for example, the set of positive even integers less than 10 is denoted by {2, 4, 6, 8}. If S is the collection of all x which satisfy a property P, we indicate this briefly by writing S = {x: x satisfies Pl. From a given set we can form new sets, called subsets of the given set. For example, the set consisting of all positive integers less than 10 which are divisible
32
2.4 CARTESIAN PRODUCT OF TWO SETS Ihjinitioll 13. Given two sets A and B. the set of all ordered pairs (a. b) such that o E .A and b e B is cQlled the cartesian product of A and B. and is denoted by A x B. &le. If R denotes the set of aU real numbers, then R x R is the set of aU complex
numbers.
Th.2.6
if
andG
every
=
numbers, Such a set of ordered.
1,
CONCERNING
FUNCTIONS
expressions defines a certain set of ordered the set of all J') for which the
('"~nnh>,, number
COlJ'1J'Jl,?X variable. is a subset of a cartesian A B, then F is called two variables. In this case we denote the function values
is a subset of
If S is a subset of we say that F is on S. such that ES caned t.he under F and T is any set which contains then F is also called a rnajt>tJil~(J This is often denoted F:51-+ T. of If The oorlcel~ of relation can be formulated need not be numbers hut may be is called a relation. IfS 1s a the set of all elements x that occur as first members of y) in S is caned the domain denoted The set of second members y is called. the ral1(jeof S, denoted The first shown in kind of relation know.n as a Dejinititm 2.1. A ,un", m", the member. That is, The definition of function that for every x in the dormdn of one y such that Y)E F. ltis to call y the value to write y= x ?md
If two functions F and G the inclusion relat.ion G <;;;; F; we say that G is a restriction of F or that F is an extension of G. In particular, jf S is a subset of and if G is defined the
of any kind.
Consider, for example, the function of a variable defined the eqaa= zZ. This function maps every sector S of the form 0 ::;;. rg a ::;; of the onto a sector described the ::;; ::;;arg
instead E F to indicate that the As an alternative to a function it coetams, to describe the domain of F, and for each x in the """en,",,,, to describe how the function value ill obtained. In this COlrme)cIIOIJ, we have the theorem whose is left as an exercise for the reader,
for all x in S,
then we call G the restriction
such that x
of F to 51. The function G consist" of those S. Its domain is S and its range is
l6
Def.1..7
Def.1.U
F is also afunction.
2.7. Let F he a function tkfored on S. We say F is one-to-one and only if. for every x and Y in S,
Jhjiltitiln; F(x)
S if.
Proof. To show that lisa function, we must show that if {x, J) E Fand (x, z) e F, then y = z: But (x, y) E i means that (y, x) E F; that is. x = F(y). Similarly,
ex, z) e f means that x = F(z). Thus F(y) = F(z) and, since we are assuming that F is one-to-one, this implies y = z: Hence, is a function.
F(y)
implies
x = y.
This is the same as saying that It function which is one-to-one on S assigns distinct function values to distinct members of S. Such functions are also called injective. They are important because. as we shall presently see. they possess inverses. However, before stating the definition of the inverse of a function, it is convenient to introduce a more general notion, that of the converse of a relation. lhjUJitimr 2.8. Given a rewtion S, t~ new relation
NOTE.
one-to-one
on a subset S of 5j{F),
S defined
by
S=
is called the converse of S.
{(a, b) : (b, a) E S}
lkfotitio. 2.11. Givm two functions F and G such that !1/(F) S;; ~(O). we CWl form a n.ew junction, the camposite G " F of G and F, defined as follows: fOT every x in the domain of F. (G" F)(x) = G[F(x)].
Since ut(F) S;; !i(O), tbe element F(x) is in the domain of G. and therefore it makes sense to consider G[F(x)]. In general, it is not true that G" F = F., G. In fact. FoG may be meaningless unless the range of G is contained in the domain of F. However. the associative law,
Thus an ordered pair (a, b) belongs to if•.and only if, the pair (b, a), with elements interchanged, belongs to S. When S is a plane relatlan, this simply means that the graph of S is the reflection of the graph of S with respect to the line y x. In the relation defined by x < J. the converse relation is defined by y < x.
JhjUrJtkm 2.9. Suppose that the relation F is a function. Consider the conuerse relation i, which maJ or may not be a function. If F is also a function. then F is called the inverle of F and II tknoted by F-1•
always holds whenever each side of the equation has a meaning. be an interesting exercise for the reader, See Exercise 2.4.)
(Verification
will
Figure 2.3(a) illustrates an example ofa function Fforwhlch iis not a function. In Fig. 2.3(b) both F and its converse are functions. The next theorem tells us that a function which is one-to-one on its domain always has an inverse.
2.9 SEQUENCES
, . ,
I
examples
Ihjillition 2.12. By a finite sequence of" terms we shall under;stand a function whose domain is the set of numbers {I. 2, ... , n}.
The range of F is the set {F(l), F(2), F(3), ...• F(n)}. customarily written {FI' F2• F3 •••• ,FH}· The elements of the range are called terms of the sequence and, of course, they may be arbitrary objects of any kind. Definition 2.13.
By an infinite sequence we shall mean a function F whose dOfHllin is the set {I, 2, 3, } of all positive integers. The range of F. that is, the set {F(I), F(2), F(3), }, is also written {FlO FI, F3, ••• }. and the function value F.
is called the nth term of the sequence.
(al
(b)
For brevity, we &baH occasionally use the notation {F,,} to denote the infinite sequence whose nth term is F Let s = {s,,} be an infinite sequence, and let k be a function whose domain is the set of positive integers and whose range is a subset of the positive integers.
II •
Del.. 2.l4
I1L. 117.
< k(I'I),
Then the composite function so k is defined for all integers n ~ I, and for every such n we have
(s
0
A set S is said to be countably infinite if it is equinumerous with the set of all positive integers; that is, if S'"" {I. 2. 3, ... }. In this case there is a function f which establishes a one-to-one correspondence between the positive integers and the elements of S; hence the set S can be displayed as follows: S = (f(I)./(2)./(3), ... }. Often we use subscripts and denote f(k) by al (or by a similar notation) and we write S = {o., 02' OJ •••• }. The important thing here is that the correspondence enables us to use the positive integers as "labels" for the clements of S. A countably infinite set is said to have cardinal number ~(l (read; aleph nought). 1h./htitioII2.15. A set S is ca//ed countable if it is either finite or countably infinite. A set which is not countable is called uncountable. The words denumerable and nomlenumerable are sometimes used in place of countable and "ncountable. 1'Iworem 2.16. Every subset of a countable set is countable. Proof. Let S be the given countable set and assume A ~ S. If A is finite, there i~ nothing to prove, so we can assume that A is infinite (which means S is also infinite). Let s = {s,,} be an infinite sequence of distinct terms such that S
k)(n) = s.ll;(,,)'
Sw::h a composite function is said to be a subsequence of s. Again. for brevity, we often use the notation {s.(.)} or {jt~}to denote the subsequence of {s.} whose nth term is st(It).
ED.,.... Let s
= {lIn}
=;
{l/2"}.
SETS
Ikjiaitiorll.U. 1Wo sets A and Ii are called similar, or equinumerous, and we write A ~ B, if and only if there exist!J a one-tO-OlUl!unction F whose domain is the set A and whose range is the set B. We also say that F establishes a one-to-one correspondence between the sets A and B. Clearly, every set A is similar to itself (take Fto be the "identity" function for which F(x) = x for all x in A). Furthermore. if A ,.., B then B ,..,A, because if F is a one-to-one function which makes A similar to B. then F - 1 will make B similar to A. Also, if A -- B and if B ,.., C, then A ~ c. (The proof is left to the reader.)
{Sl>
3'1..' •. }.
A set S is called finite and is said to contain n elements if S - {I. 2, ... , n}. The integer n is called the cardinal number of S. It is an easy exercise to prove that if {I, 2, ... , n} - {l, 2, ... , m} then m = lJ. Therefore, the cardinal number of a finite set is well defined. The empty set is also considered finite. Its cardinal number is defined to be O. Sets which are not finite are called infinite sets. The chief difference between the two is that an infinite set must be similar to some proper subset of itself. whereas a finite set cannot be similar to any proper subset of itself. (See Exercise 2.13.) For example, the set Z+ of'all positive integers is simi1ar to the proper subset {2, 4, 8, 16, ... } consisting of powers of 2. The one-to-one function F which makes them similar is defined by F(x) = 2"' for each x in Z + •
Define a function on the positive integers as follows: Let k(l) be the smallest positive integer m such that s'" e A. Assuming that k(l), k(2), ... , ken - I) have been defined, let k(1'I) be the smallest positive integer m > ken - I) such that 8.. EA. Then k is order-preserving: m > n implies k(m) > k(I'l). Form the composite function II" k. The domain of II" k is the set of positive integers and the range of s e k is A. Furthermore. s k is oneto-one, since s[k(n)] = s[k(m)]. implies
0
The next theorem shows that there are infinite sets which are not countable. Theorem 1.17. The set of all real numbers is uncountable.
'I1I.1..1
Del. l.U
41
Proof It suffices to show that the set of x satisfying 0 < x < 1 is uncountable. If the real numbers in this interval were countable, there would be a sequence
s = {sIll whose terms would constitute the whole interval. We shall &howthat this is impossible by constructing, in the interval, a real number which is Dot a term of this sequence. Write each s. as an infinite decimal:
The definition of union can be extended to any finite or infinite collection of sets:
DejinitioIl2.20. IJ F is an arbitrary collection of sets, then the union of all the sets in F is defined to be the set of those elements which belong to at least one of the sets in F, and is denoted by
U..,I
is O. I, ' .. .or 9. Consider the real number y which has the decimal
~=
"
{1. 2,
A",
U A = U A. =
1-1
..
AI u A:a
{AI> Ai
u A•.
we write
ifll if It
oj:.
I,
•... },
I.
Then no term of the sequence {s.} can be equal to y, since y differs from 81 in the first decimal place, differs from 82 in the second decimal place, ... , from 6" in the nth decimal place. (A situation like s. = 0.1999 ... and y = 0.2000 ... cannot occur here because of the way the v. are chosen.) Since 0 < y < I, the
""I'
U A = U A" =
1:-1
A 1 U A.z v ...
theorem is proved.
product Z.. x Z + is countable.
Dejilfitioll 2.11. If F is an arbitrary collection of sets, the intersection of all sets in F is defined to be the set of those elements which belong to everyone of lhe sets in F, and is denoted by
The intersection of two sets A 1 and A 2 is denoted by At n A 2 and consists of those clements common to both sets. If A I and A 2 have no elements in common. then Al n Al is the empty set and At and Al are said to be disjoint. If F is a finite collection (as above), we write
f(m, n) = 2"3·,
Then f is one-to-one on Z'" x Z+ and the range off is a subset of Z ", and if F is a countable collection, we write
2.14 SET ALGEBRA
Given tWo sets Al and Az, we define a new set, called the union of Al and AI, denoted by Al U A2, as follows:
/JejiJritio1l2J9. The UIIUm Al either to A I or to A 2 or to both.
U
Il.fiF
n A = n A.
ee
l=1
At
f"'1
A2
f"'1 ...
which belo1l!J
If the sets in the collection have no elements in common. their intersection is the empty set. Our definitions of union and intersection apply. of course, even when F is not countable. Because of the way we have defined unions and intersections, the commutative and associative laws are automatically satisfied.
Dejildlitm 2.22. The complement to be the set of A relative to B, denoted by B - A, is defined
This is the same as sayiog that Al U A 1consists of those elements which belong to at least one ofthe sets AI' A:a. Since there is 00 question of order involved in this definition, the union Al u Ali is the same as A2 u A J; that is, set addition is commutative. The definition is also phrased in such a way that set addition is
{x:xeB,butxjA}.
associative :
Note that B - (D - A) = A whenever A s;;_ B. Also note that B - A = B if B n A is empty. The notions of union, intersection, and complement are illustrated in Fig. 2.4.
43
B-
a collection
have E:xamplel,
countable collection The set Q of aU rational numbers is a countable ret. numbers having A.. From this
U 11 then s: E but it not true that x to at least one in therefore x to no A In F. for every A in F% x E B A. But this x 1~ SCI that J1 - S s;;; T. the we obtain T s;;; J1 S, and this proves that J1 - S = To prove the second statement, use a similar argument.
1.15 COUNTABLE COLLECTIONS OF CnUNTABI,E SETS
let A. denote tile ret positive The set all positive rational equal to Q eeentable, since each A~ is countable.
, , ) denote the set of rational numbers and leI. A. be the set endpoint and whose right endpoint rational, Then A. IS countable and S =
I
Ak•
EXF..RCiSES :U Prove Theorem ':t2. appeal the definition 2.2 Let Sbe a let
d)
fa, S is said
be
A relation
am:~~_!.
Determine
reflexive, and which of these properties possessed numbers (x, such thai
.1'2
f,n"·· e s·""~
Theorem 1.26,
G=
collection
sets, let
F
0
y.
Then G is a collection
44
Find F(x) if G(x) and G[F(x)] are given as follows: d) G(x) == x e) G{x)
3 ,
G[F(x)]
3x -
I.
2.11 If
I(A - B)
= /(A)
- [(B).
+ x + x2,
G[F(x»
+ S.
2.4 Given three functions F, G. H. what restrictions must be placed on their domains so that the following four composite functions can be defined? HoG, Assuming that H
a
fl,
If.
H» (Go F),
(HoG)·
F.
2.11 If S is an infinite set, prove that S contains a eountably infinite subset. Hint. Choose an element (II in S and consider S - (til). 2.13 Prove that every infinite set S contains a proper subset similar to S. 2.14 If A is a countable set and B an uncountable set, prove that B - A is similar to B. 2.15 A real number is called algebraic If it is a root of an algebraic equation fex-) = 0, where [(x) = «0 + alx + ... + a,.;i" is a polynomial with integer coefficients. Prove that the set of all polynomials with integer coefficients is countable and deduce that the set of algebraic numbers is also countable. 2.16 Let S be a finite set consisting of n elements and let Tbe the collection of all subsets of S. Show that T is a finite set and find the number of clements in T. 2.17 Let It denote the set of real numbers and let S denote the set of all real-valued functions whose domain is It. Sbow that S and It are not equinumerous. Hint. Assume S - R and let / be a one-to-one function such that /(R) S. If« E R. let D. ,.. [(a) be the real-valued function in S which corresponds to the real number a. Now define h by the equation h(x) = I + nx(x) if x E R, and show that h f S. 2.18 Let S be the collection of all sequences whose terms arc the integers 0 and 1. Show that S is uncountable. 2.1!' Show that the following sets are countable: a} the set 'Of circles in the complex plane having rational radii and centers with rational coordinates, b} any oollection
(G
F) and (H
2.5 Prove the following set-theoretic identities for union and intersection: a) A
b) A
V (B
u C)
=>
(A u B) u C. (A
A rv (B n C)
(A n.B) n C.
(B V C)
n n
B) V (A
C).
c) (A
V B) "
(A u C) = A v (B n C). (C v A) "" (A (A n B)
n!1)
C.
V (A
C)
(B
C).
e) A n (B - C) ... (A n B) - (A n C).
C) g) (A - B) u B
I:
S.... T be a function,
f(A
V B) = [(A) V [(B)
Generalize to arbitrary unions and intersections. 2.7 Let [ : S .... Tbe a function. If Y ~ T, we denote by f-I(Y) the largest subset of S which / maps into Y. That is, .
f-l(y)
= {x: XES
ot disjoint
The set I-I (Y) is called the inverse inwge of Y under /. Prove the following for arbitrary subsets X of Sand Yof T.
a)X~rl[J(X)],
2.20 Let [be a real-valued (unction defined for every x in the interval 0 :;;; x s L Suppose there is a positive number M having the following property: for every choice of a finite number of points XI' xl •.••• .:rftin the interval 0 :5 x :5 I. the sum
I/(x!)
b)J[r'(Y)]~ v /-'(Y;z),
Y2) [-l(l"l) n[-I(Y2)'
Y,
:5 M.
n
-
X-
s 1 fOT which/(x)
Y) = S - rl(y).
2.8 Refer to Exercise 2.7. Prove that[[t-J(y)] only if, T = liS). 2.9 Let / : S
-+
Yforevery
T be a function.
Prove that the following statements are equivalent. for all subsets A, B of S.
2.21 Find the fallacy in the following "proof" that the set of all intervals of positive length is countable. Let {Xl> X2 •••• } denote the countable set of rational numbers ami let I be any interval of positive length. Then I contains infinitely many rational points x.., but among these tbere will be one with smallest index n. Define a function Fby means of the equation F(I) = n, if x. is the rational number with smallest index in the interval I. This function establishes a one-to-one correspondence between the set of all intervals and a subset of the positive integers. Hence the set of all intervals is countable.
c)
t:'
2.n
[/(A)]
d) For all disjoint subsets A and B of S, the images f(A) and J (B) are disjoint.
Let S denote the collection of all subsets of a given set T. Let/: S .... R be a realy.uued function defined on S. The functionfis called oddttioe iff(A v B) [(A) + [(B) whenever A and B are disjoint subsets of T. If I is additive, prove that for any two subsets
= 1(.11.) + I(B
+ feB)
- I(A
f\
B).
1.2.3 Refer to Exercise 2.22. Assume I is additive and assume also that the following
relations bold for two particular subsct$ A and B of T: I(A v B} = I(A.,) !(A " B)
=
+ RB,)
- {(A1/(B') 1(.14)
f(A)/(B),
+ feB)
[(n.
and com3.1 INTRODUCTION
where A'
= T - A. B' = T' -.8. Prove that these relations detennine!(l1. pute the value of [(11.
REFERENCES FOR FURTHER FUllCtions. STUDY
SUGGESTED
01 Real
2.3 Gleason, A., FlUIlhmenlah oj AbstrBt.:t A1Illl"sis. Addison-Wesley, Reading, 1966. 2.4 HaImos, P. R.., Naive Set 7Mory. Van Nostrand, New York, 1960. 2.5 Kamke, E., TIteory of Sets. F. Bagemihl. translator. Dover. New York, 19.50. 2.6 Kaplanst:y, I., Set Theory and Metric Spat:es. Allyn and Bacon, Boston, 1972. 2.7 Ro~an, B., and Kneebone, G. T., The Theory of Sets and Transfinite Numbers. ElscvICr, New Yruk, 1968.
A large part of the previous chapter dealt with "abstract" sets, that is, sets of arbitrary objects, In this chapter we specialize our sets to be sets of real numbers, sets of complex numbers, and more generally, sets in higher-dimensional spaces. In this area of study it is convenient and helpful 10 use geometric terminology, Thus, we speak about sets of points on the real line, sets of points in the plane. Or sets of points in some higher-dimensional space. Later in this book we will study functions defined on point sets, and it is desirable to become acquainted with certain fundamental types of point sets, such as open sets, closed sets, and compact sets, before beginning the study of functions, The study of these sets is called
or
The number Xi is called the klh coordinate of the point x or the kth component of the vector x, The set of all n-dimeasiona! points is called n-dimenstona! Euclidean space or simply n-space, and is denoted by R~. The reader may wonder whether there is any advantage in discussing spaces of dimension greater than three. Actually, the language of a-space makes many complicated situations much easier to comprehend. The reader is probably familiar enough with three-dimensional vector analysis to realize the advantage of writing the equations of motion of a system having three degrees of freedom as a single vector equation rather than as three scalar equations. There is a similar advantage if the system has n degrees of freedom.
47
DeC. 3.2
Def.3.6
49
Another advantage in studying n-space for a general n is that we are able to deal in one stroke with many properties common to I-spaee, 2-space, 3-space, etc .• that is, properties independent of the dimensionality of the space. Higher-dimensional spaces arise quite naturally in such fields as relativity, and statistical and quantum mechanics. In fact, even infinite-dimensional spaces are quite common in quantum mechanics. . Algebraic operations on n-dimensionai points are defined as follows: Dejilritiml 1.2. a) Equality: b) Sum: c) Multiplication d) Difference: x y x y if, and only if, XI x
1Ix. + yl12
k=1
(x~
Yk)2
l~l
t (xi +
2xkY~
+ Y!)
NOTE.
Sometimes
is written
in the form
Let x
j>,,) be in R".
We define:
Ilx - yll
+ Ill' - zll.
y and y by y - z. We also have
X~
Y•.
+Y
= (XI
+ y"
... ,
+ y.).
(1,0, .. , ,0),
U2
+ (-I)y.
If x
x.' Ul'
(Xl>""
••• , X"
x.J then x = XlOl + ... + x"u.. and XI X'OI' x' u". The vectors u., ... , u" are also called basis vectors.
==
L xlY ..·
"
=
g) Norm or lenglh:
Let a be a given point in R" and let r be a given positive number. points x in R" such that
The set of
an
I[xll
The norm "x NOTE.
= (X'X)1/2
" L xi )111 .
1~1
IIx
all < r,
y.
In the terminology
is called an open n-bal! of radius r and center a. We denote this set by B{a) or by B(a; r). The ball 8(a; r) consists of all points whose distance from a is less than r. In R1 this is simply an open i-nterval with center at a. In X2 it is a circular disk, and in R3 it is a spherical solid with center at a and radius r.
Theorem 3.3. LeI x andy denote points in R". Then we haoe: a) [x]
;'<>:
0, Dnd [x]
O.
b) Ilax.
c) [x d)
3.5 DejUritiOIl of WI illlerior poillt. Let S be a subset o/R·, and assume that:ll E S. Then II is called an interior point of S if there is an open n-bal! with center at a. all of whose points belong to S.
In other words, every interior point :II of 8 can be surrounded by an n-ball 9(11) ~ S. The set of all interior points of S is called the interior of S and is denoted by jnt S. Any set containing a ball with center a is sometimes called a neighborhood of a.
IX'yl ~ IlxlillYll
e) IIx +
+ IIYII
(triangle-inequality).
Proof, Statements (a), (b) and (c) are immediate from the definition, and the Cauchy-Schwarz inequality was proved in Theorem 1.23. Statement (e) follows
WI
open set. A set Sin R" is called open if all its points are interior
lb.3.7
'lb. 3.11
ne
&nJctuR
or Opea Sets In R 1
51
EDmpJes. In R I the simplest type of nonempty open set is an open interval. The union of two or more open intervals is also open. A closed interval [a, h] is not an open set because the endpoints a and b are not interior points of the interval. Examples of open sets in the plane are: the interior of a disk; the cartesian product of two one-dimensional open intervals. The reader should be cautioned that an open interval in RI Is no longer an open set when it is considered as a. subset of the plane. In fact, no subset of R' (except the empty set) can be open in R2, because such a set cannot contain a 2"0011.
3.9 DejillitiDlIDfcomponentwerMl. Let S be an open subset of RI. An open internal I (which may be finite or infinite) is called a componem fnterwU of S if J s;; S and if there is no open internal J #- 1 such that I f:::: J <;;;; S. In other words, a component open interval contained in S. interval of S is not a proper subset of any other open set S belongs to one and only one
In R" the empty set is open (Why?) as is the whole space R". Every open It-ball is an Open set in R~. The cartesian product
(OJ>
bl)
)( • ..
x (a", bit)
Proof. Assume XES. Then x is contained in some open interval I with I <;;;; S. There are many such intervals but the "largest" of these will be the desired component interval. We leave it to the reader to verify that this largest interval is Ix = (o(x), b(x», where a(x)
ofn one-dimensional open intervals (al> htl, ... , (a", b,,) is an open set in R8 called an n-dimensiona! open interval. We denote it by (a, b), where a = (al, ... , aJ and b = (blo ... , b,,). The next two theorems show how additional open sets in R" can be constructed from given open sets. Tluorem
b(x)
3.7.
Set.
Proof. Let F be a collection of open sets and let S denote Assume XES. Then lit must belong to at least one of Since A is open, there exists an open n-ball B(x) f:::: A. and hence lit is an interior point of S. Since every point S is open.
their union, S = A~" A. the sets in F, say lit EA. But A f:::: S. so B(x) s,: S of S is an interior point,
Here a(x) might be - 00 and b(x) might be + 00. Clearly, there is no open interval J such that Ix 0;; J f:::: S, so Ix is a component interval of S containing x. If J x is another component interval of S containing x, then the union Ix u J" is an open interval contained in S and containing both Iex and J x' Hence, by the definition of component interval, it follows that I~ u J" = I~ and I~ u J" = J", so I", = J
I•
Tlu!orem 3.11 (RepreJelflatioll theQrem/or open lets ON tile rem liM). Every nonempty open set Sin Rt is the union of a countable collection of disjoint component intervals of S.
TheDrem 3.8. The intersection of a finite collection of open sets is open. Proof. Let S = A~ where each Ak is open. Assume XES. (If S is empty, there is nothing to prove.) Then lit E Ak for every k I, 2, ... , m, and hence there is an open n-ball B(x; l'k) s,: At. Let r be the smallest of the positive numbers rl. r2, •.. ,1",. Then lit E B(x; r) f:::: S. That is, x is an interior point, so S is open. Thus we see that from given open sets, new open sets can be formed by taking arbitrary unions or finite intersections. Arbitrary intersections, on the other band, will not always lead to open sets. For example, the intersection of all open intervals of the form (-lIn, I In), where n = 1,2, 3•.. , , is the set consisting of 0 alone. 3A THE STRUCTURE
1
Proof. If XES, let i" denote the component interval of S containing x. The union of all such intervals Ix is clearly S. If two of them. 1;,<and I" have a point in
common. then their union Ix U I; is an open interval contained in S and containing both (~and i,. Hence I" u I, I; and I,. u l, = so 1" = Iy• Therefore the intervals (. form a disjoint collection. It remains to show that they form a countable collection. For this purpose, let {XI' Xl' X3' ... } denote the countable set of rational numbers. In each component interval 1" there will be infinitely many x •• but among these there will be exactly one with smallest index 11. We then define a function F by means of the equation F((,,) = 11, if x. is the rational number in I~ with smallest index II. This function F is one-to-one since F(l,,) = F(l,) = 11 implies that I~ and Iy have x. in common and this implies Ix = I,. Therefore F establishes a one-to-one correspondence between the intervals (" and a subset of the positive integers. This completes the proof.
n:=1
r,
OF OPEN SETS IN R I
NOTE.
In R the union of a countable collection of disjoint open intervals is an open set and, remarkably enough, every nonempty open set in R_l can be obtained in this way. This section is devoted to a proof of this statement. First we introduce the concept of a component interval.
This representation of S is unique. In fact; if S is a union of disjoint open intervals, then these intervals must be the component intervals of S. This is an immediate consequence of Theorem 3.10. If S is an open interval, then the representation contains only one component interval, namely S itself. Therefore an open interval in RI cannot be expressed as
Def.3.12
DeI'.3.19
53
the union of two nonempty disjoint open sets. This property is also described by saying that an open interval is connected. The concept of connectedness for sets in RIO will be discussed further in Section 4.16. 3.5 CLOSED SETS J.12 Defillitioll 0/ II cleNd ret. R~ - S is open. Examples. A closed interval A set S in R" is called closed
In other words, x is an accumulation point of S if. and only if, x adheres to {x}. If xeS but x is not an accumulation point of S. then x is called an isolated point of S.
Examples
I, The
set of numbers
2. 3, ...•
if
its complement
2.. The set of rational numbers has every real number as an accumulation point. 3. Every point of the closed interval [a, b 1 is an accumulation point of the set of numbers in the open interval (a, b). Theorem 1.17_ If x is an accumulation point of S, then ellery n-ball B(x) contains infinitely many [Win's of S. Assume the contrary; that is, suppose an n-ball B(x) exists which contains only a finite number of points of S distinct from x, say .1' .2' ... ,am' If r denotes Proof the smallest of the positive numbers
la, b J in R
of n one-dimensional closed intervals is a closed set in R" called an n;;/imensiona/ closed interval [a, b],
The next theorem, a consequence of Theorems construct further closed sets from given ones.
Ilx
then B(x; ,/2) will be an n-ball about" from x. This is a contradiction. which contains
.",Ii.
TAwrem J.13. The union of a finite collection of closed sets is dosed, intersection of an arbitrary collection of closed sets is closed. A further relation theorem. Theorem 1.14. closed. between open and closed sets is described
no points of S distinct
by the following
B is open and B - A is
Proof We simply note that A - B = A n (Rn - B), the intersection of .two open sets, and that B - A = B n (R" - A), the intersection of two dosed sets. 3.6 ADHERENT POINTS. ACCUMULATION POINTS points and accumulation
This theorem implies, in particular, that a set cannot have an aC(:umulation point unless it contains infinitely many points to begin with. The converse, however, is not true in general. For example, the set of integers {I, 2, 3, ... } is an infinite set with no accumulation points. In a later section we will show that infinite sets contained in some n-ball always have an accumulation point. This is an important result known as the Bolzano-Weierstrass theorem. 3.7 CLOSED SETS AND ADHERENT POINTS of an open set. The next theorem
in terms of adherent
A closed set was defined to be the complement describes closed sets in another way.
3.15 Dejinitio" of - adIIereflt poillt. Let S be a subse: of R", an.l x a point in R", x not necessarily in S, Then x is said to be adherent to S if euery n-ball B(x) contains at least one point of S.
Theorem 3.1B. A set S in R" is closed if, and only if, it contains all its adherent points. Proof Assume S is closed and let x be adherent to S. We wish to prove that lit € S. We assume x f S and obtain a contradiction. If x f S then x E R~ - S and, since R· - S is open, some n-ball B(x) lies in R" - S. Thus B(x) contains no points of S, contradicting the fact that x adheres to S. To prove the converse, we assume S contains all its adherent points and show that S is closed. Assume x € R" - S. Then x ¢ s, so X does not adhere to S. Hence some ball B(:x) does not intersect S, so B(x) S RIO - S. Therefore RIO S is open, and hence S is closed. 3.19 Defotitio" of CIeSllU. The set of all adherent points of a set S is called the closure of S and is denoted by S.
ExamJ*s
1. If XES, then x adheres to S for the trivial reason that every n-ball B(x) contains x, 2. If S is a subset of R which is bounded above, then sup S is adherent to S. Some points adhere to S because every ball B(x) contains points of S distinct from x. These are called accumulation points. 3.16 Definition of - IlccumuliIlioll poillt. If S S Rn and x E R", then x is called I1n accumulation point of S if every n-ball B(x) contains at least one point of S distinct from x,
54
EIemeots
or Point Set
Topofogy
Tb. 3.20
n..3.24
For any set we have S £; S since every point of S adheres to S. Theorem 3.18 shows that the opposite inclusion S £ S holds if and only if S is closed. Therefore we have:
Theorem 3.20. A set S is closed
----- -------j--
if and
only
if S = S.
I.U DejinitiOIi of derived set. The set of all accumulation points of a set S is called the derived set of S and is denoted by S'. Clearly, we have S = SuS' for any set S. Hence Theorem 3.20 implies that S is dosed if and only if S' £ S. In other words, we have:
Tlreorem 3.12.
---6:
Figure 3.1
A set S in R" is closed if, and only if, it contains all its accumulation
points.
3.8 THE BOLZANO-WEJERSTRASS 3.23 DejittitiDlI 0/ a bounded set. THEOREM
A set S in R" is said to be bounded if it lies entirely within an n-ball B(a; r) for some r > 0 and some. in R".
If a bounded set S in R" contains infinitely points, then there is at least one point in R which is an accumulation point of S.
W
2.t>
x ... x I (~)
" ...n'
(a)
Proof. To help fix the ideas we give the proof first for RI. Since S is bounded, it lies in some interval [-D, aJ. At least one of the subintervals [-a, 0] or [0, a] contains an infinite subset of S. Call one such subinterval [at> bI]. Bisect [al• b.] and obtain a subinterval [a2> bl] containing an infinite subset of S, and continue this process. In this way a countable collection of intervals is obtained. the nth interval [a., b,,] being of length h. - a~ = aI2·-1• Clearly, the sup of the left endpoints a. and the inf of the right endpoints b" must be equal, say to x, [Why are they equalf] The point x will be an accumulation point of S because, if ,. is any positive number, the interval [a.., b~] will be contained in B(x; r) as soon as n is large enough so that bw a. < r/2. The interval B(x; r) contains a point of S distinct from x and hence x is an accumulation point of S. This proves the theorem for Rt. (Observe that the accumulation' point x mayor may not belong to S.) Next we give a proof for R", n > I, by an extension of the ideas used in treating RI. (The reader may find it helpful to visualize the proof in'R2 by referring to Fig. 3.1.)
where each k i = I or 2.. There are exactly 2" such products and, of course, each such product is an n-dimensional interval. The union of these 2~ intervals is the original interval II' which contains S; and hence at least one of the 2" intervals, in (a) must contain infinitely many points of S. One of these we denote by 12• which can then be expressed as 11
IF)
11
2)
x ...
1!2).
where each 1~2) is one of the subintervals of Tfi of length Q. We now proceed with J2 as we did with II' bisecting each interval /pl and arriving at an n-dimensional interval 13 containing an infinite subset of S. If we continue the process, we obtain a countable collection of n-dimensional intervals It, 12, I'j, ... , where the mth interval 1m has the property that it contains an infinite subset of Sand can be expressed in the form
Since S is bounded, S lies in some n-baU B(O; a), a > 0, and therefore within the n-dimensional interval II defined by the inequalities
-Q ~
Writing we have
(k
Xl
Sa
(k
1, 2•... , n).
Ifl)
,
Xk
x 1£1) x ...
that is, the set of points (Xl> ... one-dimensional interval -a :s;;
x,,), where x" E Itl) and where each IP' is a ~ a. Each interval JP) can be bisected to
For each fixed k, the sup of all left endpoints a1"'\ (m = I, 2, ... ), must therefore be equal to the inf of all right endpoints h~"i,(m = 1, 2, ... ), and their common value we denote by fl' We now assert that the point t = t2; •••• In) is an
o..
n. 3.28
accumulation point of S. To course, belongs to each of the m is such that aI2 .. -2 < rl2, contains infinitely many points an accumulation point of S. see this, take any n-ball .8(t; r), The point t, of intervals JI• Ji ••.• constructed above. and when this neighborhood will include J.... But since J. of S, SO will B(t; r), which proves tbat t is indeed 3. Let S = {(x, y) : x > O. y > o}. The collection F of all circular ~isks wi~ ~ters at (x, x) and with radius x. where x > O. is a covering of S. This covermg IS ,not countable. However, it contains a countable covering of S, namely, all those disks in which x is rational. (Sec.Exercise 3.18.) The Lindelbf covering theorem contains a countable subcolleetion the following preliminary 3.9 THE CANTOR INTERSECIlON THEOREM theorem we prove the Cantor As an application of the Balzano-Weierstrass intersection theorem. result: states that every open covering of a set S in R~ which also covers S. The proof makes use of
TIIuNtn 1.15. LeI {Q .. Q2' ... } be a countable collection of nonemply sets in Rsuch that:
(k ii) Each set
Tlwiwelfl 3.17 Let G {A It A.2" .} denote the countable collection of all nballs having rational radii and centers at points with rational coordinates. Assume x E R8 and let S be an open set in R· which contains x. Then at least one of the n-balls in G contains x and is contained in S. That is, we have x
E
Ak
!,;;
for some At in G. of Theorem 2.27, If x E R" and if S n-ball B(x; r} !,;; S., We s_haU~nd a is '~nea~" x ~n~, using this pomt. as which lies wlthm .8(x; r) and which x,,), that
1, 2, 3, ,'_. ).
Proof. Let S = ():'=l Qk' Then S is closed because of Theorem 3.13. To show that S is nonempty, we exhibit a point x in S. We can assume ,bat each Q" contains infinitely many points; otherwise the proof is trivial, Now form a collection of distinct points A = {lei' :1:2. ..}. where Xi e Q.. Since A is an infinite set contained in the bounded set Ql' it bas an accumulation point. say lI:. We'shall show tbat xeS by verifying that x e Qi for each k, It will suffice to Show that 11:
is an accumulation point of each Qb since they are all closed sets. But every neighborhood of:x contains infinitely many points of A. and since all except (possibly) a finite number the points of A belong to Qk> this neighborhood also contains infinitely many points of QI' Therefore x is an accumulation point of Q. and the theorem is proved.
Proof. The collection G is countable because is an open set containing x, then there is an point y in S with rational coordinates. that center will then find a neighborhood In G
-
contains and k:
x. Write
be
1'1.
x
a rational Then
(Xt. Xl •••••
let Yl
number
such
Ir.. -
XlI
<
r/(4n)
for
each
1,2, ... ,
l1y - x]
S;
IYl
XII
+ .. , + Iy" -
x.1 <
4.'
or
Next, let q be a rational number such that rl4 < q < r/2, Then x E ~Y; q) and B(y; q) !;; B(:r;:; r) ~ S. But B(y; q) e G and hence the theorem IS proved. (See Fig. 3.2 for the situatiOfl in R 2,)
COVERING TIIEORJ.!:M
In this section we introduce the concept of a covering of a set and prove the Linde/Of coveriTIg theorem. The usefulness of this concept wilt become apparent in some of the later work.
3.26 DejbliriM •• l:fIfJt/r;.g. A collection F 0/ sets is said to be a cOlJerillg of a given set S if S !;; A.-I" A. The. collectiem F is also said to cover S. If F is a collection of open sets, then F is called an open covering of S.
EDmples I. The collection of all intervals of the form lin < x < 2/n, (n = 2, 3,4, ' . ,), is an open covering of the interval 0 < x < 1. This is an example of a countable covering. 2. The real line gl is covered by the collection of al1open intervals (0. b). This covering is not countable. However, it contains a countable covering of Rl, namely. all intervals of the form (n, n + 2), where n runs through the integers.
CODnilfg
TII~o,em 1..21 (Lillilelif ctneriltg tlleorelll). Assume A !;; R· and let F be an open of A. Then there is a countable subco/leetion of F which also ('OVt'7'S A.
Proof. Let G
= {A" A2,.,,} denote the countab~ collection of all n-balls having rational centers and rational radii. This set G Will be used to help us extract a countable subcollection of F which covers A.
TIL 3.19
Th. 3.31
Co~inR'
59
Assume x E A. Then there is an open set Sin Fsuch that xeS. By Theorem 3.27 there is an n.ball Ai in G such that x E A.I;; !:;;; S. There are, of course, infinitely many such AI; corresponding to each S, but we choose only 6ne of these, for example, the one of smallest index, say m = m(x). Then we have x E A cS mC"'> - , The set of all n-balls A"'JI) obtained as x varies over all elements of A is a countable collection of open sets which covers A. To get a countable subcollectlon of F, which covers A. we simply correlate to each set At(,")one of the sets S of F which contained AI(,,)' This completes the proof.
3.11 COMPACfNESS IN a" We have just seen that if a set S in R" is closed and bounded, then any open covering of 8 can be reduced to a finite covering. It is natural to inquire whether there might be sets other than dosed and bounded sets which also have this property. Such sets will be called compact.
3.10 Ikftnitit", of IJ compact set; A set Sin R" is said to be compact if. and only if, every open covering of S contains 11 finite subcooer, that is, afinite subcollection which also covers S.
The Heine-Borel theorem states that every closed and bounded compact. Now we prove the converse result. set in R" is
COVERING
THEOREM
The Lindelof covering theorem states that from any open covering of an arbitrary set A in R~ we call extract a countable covering. The Heine-Borel theorem tells us that if, in addition, we know that A is closed and bounded, we can reduce the covering to a finite covering. 'The proof makes use of the Cantor intersection theorem. Theorem 1.19 (Hebre-Borel). Let F be an open covering of a closed and bounded set A in R". Then a finite subcollection of F also covers A.
Proof. A countable sabeollection of F, say {II, 12, 3.28. Consider, for m ~ I, the finite union
•••
},
covers A, by Theorem
This is open, since it is the union of open sets. We shall show that for some value of m the union S", COvers A.
For this purpose we consider the complement RIO - S"" which is closed. Define a countable collection of sets {Q .. Q1' ••. } as follows: Q1 = A, and for m :» I, Q..
= A n (R" -
Proof As noted above, (b) implies (a). If we prove that (a) implies (b), that (b) implies (e) and that (c) implies (b), this will establish the equivalence of all three statements. Assume (a) holds. We shall prove first that S is bounded. Choose a point p in S. The collection of ,,·balls B{p; k), k = 1,2, ... , is an open covering of S. By compactness a finite subcollection also COvers S and hence S is bounded. Next we prove that S is closed. Suppose S is not closed. Then there is an accumulation point y of S such that y If; S. If x E S, let r" = Ilx - Y11/2. Each r.. is positive since y ¢ S and the collection {B(x; I J :XES} is an open covering of S. By compactness, a finite number of these neighborhoods cover S, say S
£;
SOl)'
U B(xl; l= I
It
rl)'
That is, Q.. consists of those points of A which lie outside of S"" If we can show that for some ",'\IU(; of m the set Q...is empty, then we will have shown that for thism no point or" A lies outside S ...; in other words, we will have shown that some Sm covers A. Observe the following properties of the sets Qm: Each set Q.. is closed since of the closed set A and the closed set R" - SOl' The sets Q.. ar~ decreasing, since the SIll are increasing; that is, Q.. + I £; Q... The Sets Qm' being subsets of A, are all bounded, Therefore, if no set Q.. is empty, we can apply the Cantor intersection theorem to conclude that the intersection 1 Qk is also not empty. This means that there is some point in A which is in all the sets Q... or, what is the same thing, outside all the sets 8M, But this is impossible, since , A £; Uk'= 1 Sl' Therefore some Q... must be empty, and this completes the proof.
Let r denote the smallest of the radii rl, rh . , . , r p' Then it is easy to prove that the ball B(y; r) has no points in common with any of the balls 8(X1; rk)' In fact, if x € B(y; r), then [x - yll < r ::; Tk, and by the triangle inequality we have lIy - xkll ::; 111- x] + IIx - Xlii, so
it is the intersection
x.11 - [x
yll
2r.
[x - yll >
'k'
nk~
Hence x ¢ B(x.; r,.). Therefore B(y; r) n S is empty, contradicting the fact that y is an accumulation point of S. This contradiction shows that S is closed and hence (a) implies (b). Assume (b) holds. In this case the proof of (e) is immediate, because if Tis an infinite subset of S then T is bounded (since S is bounded), and hence by the Bolzano-Weierstrass theorem T has an accumulation point x, say. Now x is also
6}
an accumulation point of S and hence XES, since S is closed. Therefore (b) implies (c). Assume (e) holds. We shaJI prove (b). If S is unbounded, tben for every m > o there exists a point x... in Swith Ux...11 > m. The collection T"" {xJ. X2 .... } is an infinite subset of S and hence, by (c), T has an accumulation point y in S. But for m > I + IIyIl we have
We sometimes denote a metric space by (M, d) to emphasize that both the set M and the metric d playa role in the definition of a metric space. Examples 1. M = 11:"; d(I, Y) = Ilx YI. This is called tile Euclidean metric. Whenever we refer to Euclidean spate RIO, it will be understood that the metric is the Euclidean metric unless another metric is specifically IllCIltioned. 2. M = C. the complex plane; d(ZI' 2,,) = IZI ~ z~l. As a metric space, C is indistinguishable from Euclidean space R2 because it has the same points and the same metric, 3. Many nonempty set; d(x, y) = 0 if x = y, dtx, y} = I if x "" y. This is called the discrettl metric, and (M, d) is called a discrete metric space. 4. If (M. d) is a metrk space and if S is any nonempty subset of M, then (S, d) is also a metric space with the same metric or, more precisely, with the restriction of d to S x S WI metric. This is sometimes called tile relatioe metric: induced by don S, and S is called a metric sub$j1QCe f M. For example. the rational numbers Q with the o metric d(x, y) = Ix - yl form a metric subspace of R. R2; d(x, Y) = .J(XI - J'1)2-i- 4(Xl =-:;2)2, where x = (Xl> Xl) and y = The metric space (M, d) is not a metric subspace of Euclidean space R2 because the metric is different. M
()'1, )'2)'
>m-
~yU >
I,
contradicting the fact that y is an accumulation point of T. This proves that Sis bounded. To complete the proof we must show that S is closed. Let x be an accumulation point of S. Since every neighborhood of x contains infinitely many points of S, we can consider the neighborhoods B(x; Ilk), where k = 1,2 •...• and obtain a countable set of distinct points. say T = {Xl> Xl •••. }, contained in S, such that x.: EO' B(x; Ilk). The point x is also an accumulation point of T. Since T is an infinite subset of S, part (c) of the theorem tells us that T must have an accumulation point in S. The theorem will then be proved if we show that x is the only accumulation point of T. To do this, suppose that y ¢ x, Then by the triangle inequality we have
.s.
+ 11x.: - xII < IIY - xiii + Ilk, if~ E T. If to- is taken so large that Ifk < -lily - xii whenever k ;;:-:ko. the last inequality leads to -iIiY - xii < lr - xlii· This shows that Xl f. B(y; r) when k ~ ko• if r = llY - x], Hence y cannot be an accumulation point of T. This completes
lIy x] xLII
:s: lIy -
6.. M = {(Xl' XJ) : + I}, the unit circle in R"; d(x., y) smaller arc joining the two points I and y on the unit circle.
x~ xi "" xi
7. M = {(xl> X2, xl): + x~ + x~ = I}. the unit sphere in &3; d(x, y) of the smaller arc along the great circle joining the two points x and y. 8. M 9. M
the length
)'11
+ ... +
)'11 •...•
Ix. -
)'.1.
y.I}.
max {lxI
Ix" -
The proofs of some of the theorems of this chapter depend only on a few properties of the distance between points and not on the fact that the points are in R". When these properties of distance are studied abstractly they lead to the concept of a
metric space.
J.12 IhjiItitiM of .. _rk IJIIII". If metric space is a nonempty set M of objects (CQ//ed points) tOfP!ther with a functiorr d from M x M to R (cal/ed the metric of the space) satisfying the follOWing four properties for all points x, y, z ill M:
1. d(x. x) = 3. d(x, y)
The basic notions of point set topology can be extended to an arbitrary metric space (M, d). If a E M. the ball B(a; r) with center a and radius r > 0 is defined to be the set of all x in M such that
d(x, a)
<
r,
o.
d(y, x),
Sometimes we denote this ball by B1>I(a; r) to emphasize the fact that its points corne from M. If S is a metric subspace of M. the ball Bia; r) is the intersection of S with the ball BM(a~r).
~ In Euclidean space Rl the ball 8(0; I) is the open interval ( I. I). In the metric subspace S = [0, I] the ball .BJ.O~1) is the half-open interval [0, 1).
NOTE.
+ d(z, y).
Tbe nonnegative number d(x. y) is to be thought of as the distance from x to y. In these terms the intuitive meaning of properties 1 through 4 is dear. Property 4 is called the triangle inequality.
The geometric appearance of a ball in R~ need not be "spherical" if the (See Exercise 3.21,)
If S s; M, a point a in S is called an interior point of S if some ball BM(a; r) lies entirely in S. The interior, tnt S. is the set of interior peints of S. A set S is
FJeInaIIs
Th.l.33
Th.3.38
called open in M if all its points are interior points; it is called closed in M if M
is open in M.
.Examples.
The following theorems are valid in every metric space (M, d) and are proved exactly as they were for Euclidean space RW. In the proofs. the Euclidean distance IIx - yll need only be replaced by the metric d(x, y) . TlIeorem 3.35. a) The union of any collection of open sets is open, and the intersection of a finite collection of open sets is open. b) The union of a fin ite collection of closed sets is closed, and the intersection of any collection of closed sets is dosed. Tlleorl!m J.36. If A is open and B is closed, then A - B is open and B A is
In fact, if x e S, the ball R(x; ;) consists entirely of points of S (since it contains only x), so S is open. Therefore every subset of M is also closed t
l. In the metric subspace S ~ [0, 1] of Euclidean space Rl, evelY interval of the form [0, x) or (x, 1]. where 0 -c x <1, lS an open set in S. These sets are nol open in R '. Example 3 shows that if S is a metric subspace of M the open sets in S need not be open in M. The next theorem describes the relation between open sets in M and those in S. TIIeorem 3.33. Let (S, d) be a metric subspace of (M. d), and let X be a subset of S. Then X is open in S if, and only if,
closed. Theorem 337. For any subset S of M the following statements are equivalent,'
a) S is closed in M. b) S contains all its adherent points. c) S contains all its accumulation points.
X=A('\S
for some set A which is open in M. Proof. Assume A is open in M and let X = .A n S. If x E X, then x E A so BJI(x; r) <;; A for some r > O. Hence Bs(x; r) = B,H(x; t) ('\ S <;; A r, S = X so X is open in S. Conversely. assume X is open in S, We will show that X = A n S for some open set A in M. For every x in X there is a haJl Bs(x; r x) contained in X. Now Bs(x; 'x) = BJl(x; rJ n S, so if we let
d) S =S.
Example. Let M = Q. the set of rational numbers. with the Euclidean metric of RI. Let S consist of al! rational numbers in the open interval (0, b), where both a and b are irrational. Then S is a closed subset of Q.
»eX
U B",{x;
rJ,
then A is open in M and it is easy to verify that A ('\ S = X. T6eonm 3.34. Let (8, d) be a metric subspace of (M, d) and let Y be a subset of S. Then Y is closed in S if, aiu:/ only if. Y = B n S for some set B which is closed in M, Proof. If Y = B n S, where B is closed in M. then B= M - A where A is open in M so Y S II B = S 11 (M - A) = S - A; hence Y is closed in S. Conversely, if Y is closed in S, let X = S - Y. Then X is open in S so X = A ('\ S, where A is open in M and
Our proofs of the Bolzano-Weierstrass theorem, the Cantor intersection theorem, and the covering theorems of Lindelof and Heine-Bore! used not only the metric properties of Euclidean space R" but also special proper .ies of RR not generally valid in an arbitrary metric space (M, d). Further restrictions on M are required to extend these theorems to metric spaces. One of these extensions is outlined in Exercise 3.34. The next section describes compactness in an arbitrary metric space.
3.15 COMPACf SUBSETS OF A METRIC SPACE let (M, d) be a metric space and let S be a subset of M. A collection F of open subsets of M is said to be an open covering of S if S !;;;: A. A subset S of M is called compact if every open covering of S contains a finite subcover. S is called bounded if S ,; B(a; r) for some r > 0 and some a in M.
U~."
y
where B
= S=M-
S - (A n S)
S n (M
the proof.
A)
S ('\ B,
If S <;; M, a point x in M is called an adherent point of S if every ball B",(x; r) contains at least one point of S. If x adheres to S - {.t'} then x is called an accumulation point of S. The closure S of S is the set of all adherent points of S, and the derived set S' is the set of all accumulation points of S. Thus, S = SuS'.
Proof To prove (i) we refer to the proof of Theorem 3.31 and use that part of the argument which showed that (a) implies (b). The only change is that the Euclidean distance [x - rl is to be replaced throughout by the metric d(x, y).
Th.l.39 To prove (ii) we argue by contradiction. Let T be an infinite subset of Sand assume that no point of S is an accumulation point of T, Then for each point x in S there is a ball B(x) which contains no point of T (if x ; T) or exactly one point of T (x itself, if x E T). As x runs through S, the union of these balls B(x) is an open covering of S. Since S is compact, a finite subcolleetion covers S and hence also covers T. But this is a contradiction because T is an infinite set and each ball contains at most one point of T. EXERCISFS
Open and dosed sets in HI tIIIfl H2
3.1 Prove that an open interval in R I is an open set and tbat a closed interval is a closed set. 3.2 Determine all the ao::umuJa.tion points of the following the sets are open or closed (or neitber). a) All integers.
b) The interval (Q, b
Nom. In Euclidean space R", each of properties (i) and (ii) is equivalent to compactness (Theorem 3,31). In a general metric space, property (ii) is equivalent to compactness (for a proof see Reference 3;4), but property (i) is not. Exercise 3.42 gives an example of a metric space M in which certain closed and bounded subsets are not compact.
lhormJ 3.3'. Let X be a closed subset of compact.
Q
J.
lIn, +
5-",
(n
I, 2, 3.... ). ). ), ).
Then X is
Proof. Let F be an open covering of X, say X s;; UA"F A. We will show finite number of Ihe sets A cover X. Since X is closed its complement M open, so F u {(M - X)} is an open covering of M. But M is compact, covering contains a finite subcover which we can assume includes M - X.
fore
1,2,
(n = 1,2, ... ).
3.J The same as Exercise 3.2 for the foOowing sets in R2: a) All complex z such that Izi > 1.
b} All complex c)
M~
Al
U •••
u A" u (M - X).
z such that Izl ~ !. All complex numbers of the fonn (lIn) + (i/m).
(m, n
1.2. ... ).
dj All points (x, y) such that x2 ~ y2 < I. e) All POints (x, y) such that x > O.
f)
This subcover also covers X and. sinceM - X contains no points of X. we can delete the set M - X from the subcover and still cover X. Thus X s;; A I U' •• u A l' so X is compact.
All points
3.4 PrOVi: that eveIY nooempty open set S in RI contains both rational and irrational numbers. 35 Prove that the only sets in R 1 which are both open and dosed are the empty set and R.l itself. Is a similar statement true for R.2?
I.
3.16 BOUNDARY OF A SET lk./illitJoll 3.40. Let S be a subset of a metric space M, A point x in M is called a boundary point of S if every boll BAI(x; r) contains at least one point 'of S and at least one point of M - S. The set of all boundary points of S is called the boundary of S and is denoted by as.
3.6 Prove that every dosed set in Rl is the intersection of a countable collection of open sets. 3.7 Prove that a nonempty, bounded closed set S in R 1 is either a closed interval. or tbat S can be obtained from a closed interval by removing Ii rountable di~oint collection of open intervals whose endpoints belong to S. Open BDII closed sets in R" 3.8 Prove that open n-baJls and n-dimensional open intervals are open sets in R"'. 3.9 Prove thar the interior of a
set
I'l
M-
S.
EI:llIIIpIe In R·, the boundary ofa ball B(a; r) is the set of points 1 such that In R 1, the boundary of the set of rational numbers is all of R 1.'
Ix - _I
= r.
3.10 If S s;; R". prove that int S is the union of 311open subsets of R- which are contained in S. This is described by saying that int S is the largest open subset of S. 3.1t If S and T are subsets of RIt, prove that (iot S) n (int T) int (5" T), and (int S) u (int T) int (5 u T).
Further properties of metric spaces are developed in the Exercises and also in
Chapter 4.
..t
67 3.l3 Assume that S s Kft. A point x in K" is said to be a C()lfde~tion point of S if every n-bafl B(x) has the property that B(x) n S is not countable. Prove that if S is nc t countable, then there exists a point x in S such that x is a condensation point of S. 3.24 Assume that S S R- and assume that S is not countable. condensation points of S. Prove that; a) S - T is countable, c) T is a dosed set, Let T denote the set of
3.12 Let S' denote the derived set and S the closure of a set S in R". Prove that:
a) S' is dosed in R!'; that i$, (51'
S'.
S is the intersection of all closed subsets of R" containing S. That is, S is the
smallest closed set containing S.
and that S n T £ S () T
Note that Exercise 3.23 is a special case of (b). 3.25 A set in R" is called perfect if 5 = S', that is, if S is a closed set which contains no isolated points. Prove that every uncountable closed set Fin Rit can be expressed in the form F = A V B, where A is perfect and B is countable (Cantor-.Bendixon theorem). Hint. Metric Use Exercise 3.24. SIIlCES
The statements in Exercises 3.9 through 3.13 are true in any metric space.
3.14 A set Sin RR is called convex if, for every pair of points x and yin S and every real (J satisfying 0 < 0 < J, we have (Ix + (1 O)y EO S. Interpret this statement geometrically (in K2 and K3) and prove that:
a) Every n-ball in Kif is convex,
b) Every n-dimensional open interval is convex. c) The interior of a convex set is convex.
d) The closure of a convex set is convex.
J.IS Let F be a collection of sets in R", and let S ~ A and T A. For each of the following statements, either give It proof or exhibit a counterexample. a) If x is an accumulation point of AinF.
U~"F
n~~F
3.26 In any metric space (M, d), prove that the empty set 0 and the whole space M are both open and dosed. 3.27 Consider the following two metries in R":
d1(x, y) = max
l~t;!;;JI
r. then x is an accumulation
IXi - y,l,
d2(x, y) =
E IXI C:;;;;:I
YII·
b) If x is an accumulation point of S, then x is an accumulation point of at least one set A in F. 3.16 Prove that the set S of rational numbers in the interval (0, I) cannot be expressed as the intersection of a countable collection of open sets. Hint. Write S = {XI, X2' .•• }, assume S =1 Sk' where each Sk is open, and construct a sequence {Q.} of closed intervals such that Qu 1 £ Qn S S" and such that x" ¢ Q". Then use the Cantor intersection theorem to obtain a contradiction.
In each of the following metric spaces prove that the ball B(a; r) has the geometric appearance indicated: a) 10 (R", d1), a square with sides parallel to the coordinate axes. b) In (R2, d,), a square with diagonals parallel 10 Ihe axes.
c) A cube in (R3, d1). d) An octahedron
n:,~
in (R3, d2).
3.28 Let dJ and dz be the metrics of Exercise 3.27 and let [x - 111 denote the usual Euclidean metric, Prove the following inequalities for all x and y in R": dl(x, y) ~ [x -
3.17 If S S Rn, prove that the collection of isolated points of S is countable. 3.18 Prove that the set of open disks in the xy-plane with center at (x, x) and radius x > 0, x rational, is a countable covering of the ~ {(x, Y): x > 0,,, > .oj. 3.19 The collection F of open intervals of the form (lIn, 2In), where n = 2, 3, ... , is an open covering of the open interval (0, l). Prove (without using Theorem 3.31) that no finite subcollection of F covers (0, I).
3.%0 Give an example of a set S which is closed but not bounded and exhibit a countable open covering F such that no finite subset of F covers S.
yll ~
d2(x,
y)
and
d2(x, y)
~ ...1;llx - yll ~
nd.(x, Yl.
1 + d(x, y)
d{x, y)
Prove that d' is also a metric for M. Note that 0 ~ d'(x. y) < I for all x, y in M. 3.30 Prove that every finite subset of a metric space is closed. 3.31 In a metric space (M, d) the closed ball of radius r > 0 about a point a in M is the set B(a; r ) = {x: d(x. a) s r }, a) Prove that Bt«; r) is a closed set. b) Give an example of a metric space in which B(a; r) is not the closure of the open
ball B(a; r).
3.21 Given a set Sin R" with the property that for every x in S tbere is an n·ball B(x) such that B(x) n S is countable. Prove that S is countable.
3.n
Prove that a collection of disjoint open sets in R" is necessarily countable. example of a collection of disjoint closed sets which is not countable.
Give an
3.32 In a metric space M. if subsets satisfy A !i 5 5 A. where A is the closure of A. then A is said to be t:k1lS# in $. For example, the set Q of rational n umbers is dense in R. If A is dense in $ and if S is dense in T, prove that A is dense in T. 3.33 Refer to Exercise 3.32. A metric space M is said to be separable if there is a countubk subset A which is dense in M. For example, R is separable because the set Q of rational numbers ill a countable I.k!nsesubset. Prove that every Euclidean space Rt is separable. 3.34 Refer to Exercise 3.33. Prove that the Lindelof covering theorem (Theorem 3.28) is valid in any separable metric space. 3.35 Refer to Exercise 3.32. If A is dense in 5 and if B is open in S, prove that B 5 A n B. Hilft. Exercise 3.1 3: 3.36 Refer 10 Exercise 3.32. U each of A and B is dense in 5 and if B is open in S, prove that A n B is dense in 5. 3.37 Given two metric spaces (S1> d1) and (S2' d2). a metric p for the Cartesian product S, x 52 can be constructed from dl and d'l in many ways. For example, if x (x 1> Xl) and y = (Yr. y,) are in 5. )( S'h Ji:t p(x. y) = dl(Xh ytl + d2(X2, Y2)' Prove that pis a metric for $1 x 52 and construct further examples.
COIIIfII'CI; su.bsets of:ll metric space
3.49 If int A
int B
8: and
0.
int B
="
but int (A u B)
= M.
n 11 = 0, then
0(1l u B) = oA V oB.
SUGGESTED
REFERENCES
FOR FURTHF.It STUDY Carus Monograph No. 13. Wiley, New Addison-Wesley, Reading. 1966. New
Yod:.l960.
3.2 GIeuon., A., FIIIIGb'tt!1ftolJ of ~ 3.4 Simmons, O. F.. l~ YOI'k.I963.
AlfIQ/y.rU.
3.3 ICapJansky, I .• Set 'T'heory IIIfIl Metric Sjut-n. Allyn and Bacon, Boston, 1m. to Tupo/IIgy_ Modem Ana/yAr. McGraw-HiD,
Prove each of the roHowing statements concerning an arbitrary metric space (M, d) and subsets S, T of M. 3.38 AIISUffiC SsT SO M. Then S is compact in (M. d) if, and only if, S is compact in the metric subspace (T, J). 3.39 If S is closed and T is compact. then S nTis compact. 3.40 The intersection of an arbitnuy col1cction of compact subsets of M is aJmpact. 3.41 The union of a finite number of compact subsets of M is compact, 3.42 Consider tbe metric !ipMle Q of rational numbers with the Eudidcan metric of' R. Let 5 consist of all rational numben in the open interval (a. b). wIlcR= a and b an:: irrational, Then S is a dosed and bounded subset of Q which is not compact.. MiIIcdIueouIIi 3..43 int A pnIIICI1ies of Ole iDkriGr ... M - M - A. M - .:4.
= tile ~
b) s ,,1' (int A), if Eis an infinite coIli!ctioo of s.uhlieiS of M. c) Give an example where equality does not bokI in (b). lA' a)
n:-t
int
n...
U"'~F(int A)
SO
(U...er A).
b) Give an example of a finite collection Fin whdl «)U3.lity does not hold in (a). 3.41 a) int (ilA) = 0 if A is open or if A is- closed in M.
b) Give an example in which int (aA) "'" M.
n.o CHAPTER 4
71
We also say that {x.} converges to p mul we write x" .... p as n .... eo, or simpl, the sequence {x ..} is said to diverge. The definition of convergence x.
-+
NOTE.
if and only if
o.
The convergence
space Ri.
4.1 INIRODUCDON
The reader is aJready familiar with the limit concept as introduced in elementary calculus where, in fact, several kinds of limits are usually presented. For example, the limit of a sequence of real numbers {x.}. denoted symbolically by writing
lim .... "" x" = A.. means that for every number s
1. In Euclidean space ai, a sequence {x.l is called Uu:retlSing if x~ S; x.+t for all 11. If an increasing sequence is bounded above (that is, if xR :s; M for some M > 0 and all 11). them {x.} converges to the supremum of its range. sup {Xl> X2 •••• }. Similarly, {x.} is called det:reusing if x.+ I S; x. for all 11. Every decreasing sequence wbk:h is bounded below converges to the infimum of its range. For example. {lIn} converges toO. 2. If {a,.} and {b.} are real sequences converging to 0, then {a,. + b.} also converges: to O. If 0 ;0;; c. ,..;; ,. for all a and if {o.l conveJBeS to O. then {c..} also converges to O. a These e1ementary properties of sequences in RI can be used to simplif'y some of the
>
This limit process conveys the intuitive idea that x. can be made arbitrarily close to A provided that n is sufficiently large. There is also the limit of Q junction, indicated by notation such as lim !(x) = A, 11/;"',
which means that for every 8 If(x) - AI
proofs
conceming
limits in a
general
=
metric space.
+ 11-2 +
+
(2 - Iln)i.
2i)l
= Iz" - (I
> 0 there
Ii:
is another 0
number (j
> 0 such
that
so d(z", 1 TMormr4.2.
2i) .... O.
<
whenever
< Ix - pi <
a.
A sequence {x.} in a metric space (S, d) can converge to at most one We will prove that p
This conveys the idea that f(x) can be made arbitrarily close to A by taking x sufficiently close to p. Applications of calculus to geometrical and physical problems in 3-space and to functions of seveml variables make it necessary to extend these concepts to R·, It is just as easy to go one step further and introduce limits in the more genera1 setting of metric spaces. This achieves a simplification in the theory by stripping it of unnecessary restrictions and at the same time covers nearly all the important aspects needed in analysis. First we discuss limits of sequences of points in a metric space. then we diSCWIS limits of functions and the concept of continuity,
4.% CONVERGENT Dt!ji1titim14.I.
Q
point in S.
Proof Assume that x •.... p and x •.... q. triangle inequality we have
q.
By the
o s; d(p,
Since dept x.)
-+
q) ~ d(p, x
H)
+ d(x",
q).
0, so p
q;
If a sequence {x.} converges, the unique point to which it the limit of the sequence and is denoted by lim x .. or by lim._oo
converges
x".
is called
SEQUENCES
IN A METRIC
SPACE
Example. In Euclidean space Rl we have Iirn,."'<I) lin = O. The same sequence in the metric subspace T = (0, 1] does not converge because the only candidate for the limit is o and 0 j T. This example shows that the convergence or divergence a sequence depends on the Wlderlying space as weJI as on the metric.
or
if there is point
For etJery
8
A sequeru:e {x.} of points in a metric space (S, d) is said to converge p in S with the following properly:
11IetJrem 4.3. In a metric space (S, d), assume x~ .... p and let T = {Xl'
be the range of {xR}. Then:
X2.·
•• }
> 0 there
d(x., p) <
a) T is bounded.
b) P is an adherent point of T.
whenever n :2:. N.
70
73
hoof. a) Let N be tbe integer ~ndinl to II 1 in the definition of convergence. Then every z" with n :=: N lies in the ball SCp; I); so every point in T lies in the ball B(p; F), where .
F
4.7 ~ (1/11 CllllCIty SqllfUlCe. A sequence {x,,} in a metric space (S, d) is called a Cauchy sequence if it satisfies the following condition (called the Cauchy
condition): For every e > 0 there is an integer N such that d(x,., x...) <
E
d{p,
XN-1)}.
Therefore
T is bounded. b) Since every ba1l B(p; e) contains. a point of T. p is an adhcn:nt point of T. If T is ~te.
an aa:umuJation point of T.
whenever
11
2: Nand m ~ N.
~
p
IS
The next theorem provides a converse to part (b). J'IemuI 4.4. GWen0 metric IPfI" (S, d) ond a su.baet T s;;;;; S. II a point pinS is adherent point of T, ther. there MO sequence (xJ of points in T which amot!T(IeS top.
all
Theorem 4.6 states that every convergent sequence is a Cauchy sequence. The converse is not true in a general metric space. For example, the sequence {J/n} is a Cauchy sequence in the Euclidean subspace T = (0, I] ofR!, but this sequence does not converge in T. However, the converse of Theorem 4.6 is true in every Euclidean space a-I.. T1Ietwem
4.8.
Proof. For every integer n ~ 1 there is a point x. in T with d(P. x.) s; lIn. Hence d(p, xJ -+ 0, so z" -+ p.
if, eDeI')'
4.5. In Il metric space (S, d) a .requnrce {x.} C01IVf!rgesUJ p if. ond only
~
{x"(.)}
CUItDeI'fJD
to p.
Proof: Assume z" -+ p and consider any subsequence {xt(.,}. For every .. > 0 there ~ an N_ such that n ~ N implies d(x.. p) < 8. Since {x....)} is a subsequence, there IS an mteger M such that k(n) i!: N for n 2: M. Hence n 2: M implies d(x!{.}, p) < £, which proves that xl.(JI) -+ p. The converse statement bolds trivially since {x.} is itself a subsequencc.
4.3
CAUCHY SEQUENCES
Proof Let {x,,} be a Cauchy sequence in R' and let T {Xl' X2> ••• } be the range of the sequence. If Tis finite, then all except a finite number of the terms {x,,} are equal and hence {x.} converges to this common value. Now suppose T is infinite. We use the BoJzano-Weierstrass theorem to show that T has an accumulation point p, and then we show that {x.} converges to p. First we need to know that T is bounded. This rollows from the Cauchy condition. In fact, when 1) = 1 there is an N such that 11 :<! N implies 11x.. - xjIIlI < L This means that all points x, with n :<! N lie inside a ball of radius 1 about as center, so T lies inside a ball of radius I + M about 0, when: M is the largest of the numbers ~xlll •... ,II"NII. Therefore, since T is a bounded infinite set it has an accumulation point p in Ri; (by the Bolzano-Weierstrass theorem). We show next that {ll.} converges to p. Given .t > 0 there is an N such that Ilx" x..II < 1;/2 whenever n :<!: Nand m ~ N. The ball B(,; e/2) contains a paint x.. with m 2! N. Hence if n 2: N we have
"N
If a sequence {x.} converses to a limit P. its terms must uJtimately become close to p and hence close to each other. This property is stated more formally in the next
theorem. so lim
metric Sptlce
X ..
II + Ux... - pU < 2: +
[;
2=
I:
a,
x..
p. This completes
the proof.
1'iuIum 4.4. h.nune tItlIl {x..} converges in II > 0 there is all integer N $IIch that d(x". x..) <
8
Examples
whenerJer n ~ N (1lf(/ m ~ N.
1. Theorem 4.8 is oRen used for proving the convergence of a sequence when the limit is not known in advance, For example. consider the sequence in RI defined by
X
Proof. Let p = lim z". Given Il > 0, Jet N be such that d(x"" p) < 8/2 whenever " ~ N. Then d(x•• p) < 6/2 if m ~ N. If both II 2: N and m 2: N the triangle" inequality gives us
d(x ..
=1--+---+"'+--.
234
{_I)""-1
x,J
::s;: d(x .. p)
If m > n
;:0:
x,,1
If
+ ... +
I <1< I -m n - N'
74
so <: e as seea as N:> lIt. Therefore Ii CAirucny seqeenee itnd hena; it converges to some limit. It can be shown (see Exercise lUS) dud this limit is 2, a fact which is not immediately obviQus. 1. Given It real sequence {an} such that :s for all n ;e: L We can. prove that {a,,} converges. without knowing its limit Lei a" "" I~ ThenO:s :S so, by :S Heoceb" -> 1Ji, Also,lfm :> 17 wehave
I-
1S
Definition I,ll.
em accumulation
"" b,
and
T, the notation
is
to mean
for ellery
t :> ()
A,
~. p, and
as x tends to p, is indicate this
1:1,."
<:
or
as x - p. The definition conveys the intuitive idea that dose to h dose to p, We that p be of A to make certain that there will be x in A sufnci.,ently dose to p, with x ~ p. 17 need not he in the domain and b need not he in the range
EDmpre 1. Every Euclidean space R~ is complete (Tlloorem 4.8). In rnu'1ienl"r complete, but the T {O, 1] is not complete.
sequence in T and let A = Xz, .. } denote the range of then converges to one of the elements hence converges in T. If A is Tbeorem 3,38 tens us that A has an accumulation pm T since Till compact. We sbownext that x" - p. Given 8 :> 0, choose N so that Nand m ;;::; N < The bal! contains a m ;;;:N, Therefore if n > N the us
NOTE.
The definition can also be formulated in terms of baUs. Thus, and if, for every ball there is a bali such that empty and such that
whenever
XE
holds if,
A ill not
<
.2
+~=t
2
'
When formulated this way, the definition IS when p or b are in the extended rea! number R* or In the extended number system C*. However, In what it to be understood that p and b are finite unless it is stated that can be infinite, The next. theorem relates limits of functions to limits
in this section we consider two metric spaces and where denote the metries, Let A be a subset of S and let f :A function from A to T
and
T he a
0,
77
if.
..... ""
Proof
lim I(x.) = b,
{p} which converges to p. 0 there is a /j
(3)
a) lim,,_,
[J(x)
b.
b) lim" .. ,/(x)g(x)
c) lim, ....f(x)/g(x) ,
if
b "" O.
>
>
<e
wbenever x
eA
and 0
< ds(x,
Now take any sequence {x.} in A - {p} whlch converges to p. For the {J in (4), there is an integer N such that 11 ~ N implies dJ..x", p) < o. Therefore (4) implies drlf(xJ. b) < e for n ~ N. and hence {!(xR)} converges to b. Therefore (2) implies (3). To prove the converse we assume that (3) holds and that (2) is false and arrive at a contradiction. ]f (2) is false. then for some Il > 0 and every;; > 0 there is a point x in A (where x may depend on /j) such that
Proof Weprove(b),leavingtheotherpartusexercises. GivenllwithO < £ < I, let e' be a second number satisfying 0 < e' < 1, which will be made to depend on Bin a way to be described later. There is a b > 0 such that if x E A and d(x, p) < (j, then jf{x) - al < 1'.' lo(x) - bl < £'. and Then 1!(x)1 = la + (f(x) - a)1 < lal + £' < lal + I. Writing/(x)g(x) - ab
= I(x)g(x)
obi < (lal
- bf(x)
I/(x)g(x) -
S If(x)llo(x)
{J
but
drlf(x),
b) ~
8.
(5) sequence of
I}l:'
1).
Taking II = lin, n = 1,2, ... , this means there is a corresponding points {x.} in A - {pI such that
< b, and
elnal + Ibl +
o < ds(x.,
p)
<
lin
4.7 LIMITS OF .VECTOR-VALUED FUNCf(ONS Again, let (S, d) be a metric space and let A be a subset S. Consider two vectorvalued functions f and gdefined on A. each with values in gi,
Oearly. this sequence {x.} converges to p but the sequence {f(xJ} verge to b. contradicting (3).
NOTE.
of
Theorems
cannot
have two
f:A ..... \ R
Quotients of vector-valued functions arc not defined (if k > 2), but we can define the sum f + I, the product Ai (if l is rea1) and the inner product f·g by the respec-
different
limits as x .....p.
tive formulas
(f
Let (S, d) be a metric space, let A be a subset of S. and consider two complexvalued functions I and 9 defined on A, I:A ..... , C
g:A .....C. The sum f + 9 is defined to be the function whose value at each point x of A is the complex number I(x) + g(x). The difference I - g, the product I' s, and the quotientflg are similarly defined. It is understood that the quotient is defined only at those points x for which g(x) "" O. The usual rules for calculating with limits are given in the next theorem. TMtJre", 4.13. Let I turd 9 be complex-valued functions defined on a subset A 01 tJ metric apace (S, d). Let p be an accumulation point 01 A, and assume that Jim/(x) ;<-, = a,
x ....v
+ g)(x)
= ((x)
+ g(x).
(If){x)
1I(x),
(f'g)(x)
for each x in A. We then have the foUowing vector-valued functions. T6ttor~", 4.14.
rules
for calculating
"'
..,rex) = a,
lim g(x)
Z"'I'
b.
a) lim,,_
[f(x)
c) lim,,_p I(x)·g(x)
Hilt g{x)
h.
IIf(x) II
11.11·
18
79
and
The
(}
To prove
we write
tends to O as x To prove
NOTE.
-+
that
:s;
p, so
ll"
of 51 (a of S which is not an accumulation at p will be continuous at p because for ~llj'fic'i"'ntl\l < li, x = p, and eS.
of
A,
every sequence
, in
we have
Jim The of this theorem is similar to that. of Theorem 4.12 and is len as an exercise for the reader. (The result can also be deduced from 4.12 but there is a cOlrnpHCEttle,n in the argument due to the fact that some terms of the sequence could be to The theorem is often described that for continuous functions the UU;;n,;JI!<;UlgC'U with the function Some care is needed converges when
Example If x" _, x and.pM y in a metric space (S. d), then (Exercise 4.1). The reader can that d is continuous on the metric space where p is the metric of Exercise 3.37 with Sl "'" $2 = S. ..... t», y} d
x S, p),
These
for each r.
show that
=a
if.
and
if,
4•. CONTINUOUS fiUNCTlONS 8 The definition of in elementary functions from one metric space to another. Dejilfitiolf 4.1.5. Let S to T calculus can be extended to
o such
said
that
continuous
p) <
/'S-tTbea
a poinr P in S
the intuitive idea that dose to are It can also be stated in terms of balls; A function if, for every t > 0, there is a S >0 such that
a p is called a on the behavior in the immediate concerns the whole domain its domain is a property.
is a s) in 1'. is an accumulation
f)~
under
must be contained
-U
lim
continuous at p and
continuous
then h is continuous at
Let
Given
whenever
b) <:
and whenever
If S is a subset 011 C on which the not continuous on S. Therefore a rational function where g continuous at those C at which the denominator does not vru:llsn, The familiar al-valued functions such as the exand ''''l!;GH •• liJt.t,-, ttU1Cl:lODiS, U ele:rnelllUlry fllIlC:tiollS 3 , the common
vahle of the
<: II',
a metric space
of the OOlrrlP'IeX ""IJUU.f;W'"'' and tnaonomemc functions is a c01~rej'PO'!ld:mg real-valued fW1ctions and consequence of the ooIltirnlil:y Theorem. 4.. ,0. 2
4.12 CONTINUITY AND INVERSE IMAGES OF OPEN OR CLOSED SETS
is also continuous at p The result is trivial is an isolated of S. of S, we obtain the result from Theorem 4.1.3. There is, is
II:
cOlrrel>PlJmding
TlteorIt1l14,19. Let r wid II: be continuous at a p in a metric space and assume that f and g have Faiues inR". Then each continumts at p: the sum f + g, the every real A, the inner f·g, and the narm
TlutOl'l!m
4.21.
Let
It is a trivial exercise to show that the functions are oontinuollsoll C: fUn:cti()n$, defined
12
Note that the statements in Theorem 4.22 can also be cxprc:sscd as follows:
u B)
= /-I(A)
u /-I(B)
TIvort!llf 4.23. Let f :S - T be afimctionfrom one metric space (S, ds) to QlWther (T, dr). Then / is contilfUOll3 on S I/. and OIIly if, for every open set Y in T, the inrJe1'seimage /-1(y) is open in S.
Proof. Let Fbe an open covering off(X). so thatf(X) !;;; A. We will show that a finite number oftbe sets A coverf(X). Since/is continuous on the metric subspace (X, ds) we can apply Theorem 4.23 to conclude that each set ~ -l(A) ~s open in (X, ds). The sets I-I(A) form an open covering of X and, smce X IS compact, a finite number of them COVeT X. say X s;;; f-1(A I) u ••. v /-l(A ,,). Hence I(x) £ /U-1(A1) U'" uj-l(A,,)] = f(f-l(A1)] U'" v fU-!(A,)]
~ Al
U ···V
UAeF
A",
r I(Y).
Proof. Let/be continuous on S, let Y be open in T, and let p be any point of We will prove that p is an interior point of/-l(y}. Let y = /(p). Since Y is open we have B.,{y; e) £ Y for some! > O. Since I is continuous at 11. there is a.5 > 0 such that/(Bs(p; .5) £ BrtY; t}. Hence,
8M;~}
£;
sof(X}
if there is tl positive
/-l(f(Bs(p;
6»]
s;; /-I[8.,{y;
g)] £ /-I(y)'
so p is an interior point of/-I(y). Conversely. assume that f-l(Y) is open in S for every open subset Yin T_ Choose pinS and let y = /(P), We will prove thatfis continuous at p. For every 6 > 0, the ball Brty; &} is open in T, so rl(Brty; e) is open in S. Now, p ef-1(B.,.{y; e) so there is a ~ > 0 such tbat Bs(p; 8) £ 1-'(BrtY; Therefore,/(Bs(p; 0») £ 8T(y; 8) so f is continuous at p. ,
Since r is. bounded on S if and only if f(S) is a bounded subset of Rl, we have the following corollary of Theorem 4.25. Tlworem 4.27. Let f: S -+ Ri be a/unction from a metric space S to Euclidean space Rt. If r is continuous on a compact subset X of S, then f is bounded on X. This theorem has important implications for real-valued functions. If / is real-valued and bounded on X, then I(x) is a bounded subset of R, so it has a supremum, supf(X), and an infimum, inff(X). Moreover, inf I(X)
e».
TlIMnm 4.24. Let f :S -+ T be aJutretionfrom one metric space (S, ds') to another (T. dr). Then I is continuous on S if. and only if, for every closed set Y in T, the inverse image f-l( Y) is closed in S.
s fix)
:S sup /(X)
The next theorem shows that a continuous/actually and inf I(X) if X is compact.
- Y)
S - /-1(y).
Theorem 4.18. Let [': S -+ R be a rem-volued function from a metric space S to Euclidean space R. Assume that f is continuous on a compact subset X 0/ S. Then there exist points p and q in X such that
/(p)
NOTE.
inf/(X)
and
I(q)
sup f(X).
Since f(p) s /(x) s I(q) for all x in X, the numbers f(p) and /(q) are called, respectively, the absolute or global minimum and maximum values of /on x. Proof Theorem 4.25 shows that /(X) is a closed and bounded subset of R. Let m = inf/(X). Then m is adherent to /(X) and, sincef(X) is closed, m ef(X). Therefore m = /(p) for somep In X. SimilarJy,/(q) = supf(X) for some qin X. Theorem 4.29. Let ] : S .... T be o/unctionjrom one metric space (S, dJ to another (T. dT). Assume that I is one-to-one on S, so thas the inverse function exists. I/ S is compact and if I is continuous on S,. then /-1 is continuous on f(S).
The next theorem shows that the continuous image of a compact set is compact. This is another global property of continuous functions.
Tlu!orem 4.25. Let f: S -+ T be afunclion/rom one metric space (S, diJ to another (T, dr)_ If [is continuous on a compact subset X 01 S, then the image leX) is a compact subset 01 T: in particular, /(X) is closed and bounded in T.
r:
Proof By Theorem 4.24 (applied to f-I) we need only show that for every dosed set X in S the image f(X) is closed in T. (Note that I(X) is the inverse image of
Def.4.30 Since X is dosed and S is compact. X is compact (by Theorem sof(X) is compact (by Theorem 4.25) and bencef(X) is closed (by Theorem This completes the proof.
x under f-1.)
3.39), 3.38).
Ti;u1'Dll4.31. Let / be defined on an interval S in R, Assume that f is continuous at a paint c in S and that I(c) "'"O. Then there is D I-ball B(c; 5) such that /(x) has the same sign as f(c) in B(c~ ~ ('I s.
Rumple. This example shows that compactness or S is an essential part of Theorem 4.29. Let S = [0, l) with the usuaJ metric of R 1 and consider the complex-valued function jdefined by j(:x) 1r21<1", for 0 :!> x -e 1. This is D. OJlC.t<Hl1lC continuous mapping of the balf-opcn interval [0, I) onto the unit circle Izi = 1 in the complex plane. However.j-l is not continuous at the pointj(O). For example, if X8 = 1 - lin. the sequence {j(x.)} converges to 1(0) but {x,,} docs not CODverp in S. 4.14 TOPOLOGICAL MAPPINGS (HOMEOMORPIDSMS)
Proof
Assume f(c)
>
O. For every
>
0 there is a ~ whenever x
>
E
f(c) - s </(x)
Take the 8 corresponding:
<J(c)
+ I!
to s = /(c)/2
B(c; 6) n S,
so I(x) has the same sign, as ftc) in B(c; 6) except that we take e = -if(c).
< O.
f :S -+ T be Q Junction from one metric space (S, ds) to another (T, dT). Assume also that J is one-to-one on S, $0 that the inverse function /-1 exists. 1/1 is contilr1Jow on S and if is Continu0U3 on f(S}. thenf is called Q topaiogicm mapping or a homeonwrphi8m. and the metric spaces (8, ds) and (/(S), dT) are said to be hcmeomorphic.
lk./Utilio. 4.30. Let
r:'
TMorem 4.32 (Boi{lllllJ). Let f be real-valued and continuous on a compact interval b] in R, and suppose that /«(1) and /(b) have opposite sign.r; that is, assume f(a)f(b) < O. Then there is at least one poi", c in the open interval (0, b) such that f(c) = O.
[D,
Proof
For definiteness,
assumef(a)
=
>0
andf(b) and
< O. Let
J(x) ~ O}.
then so is 1-1. Theorem 4.23 shows that a homeomorphism maps open subsets of S onto open subsets of f(8). It also maps closed subsets of S onto closed subsets of f(S). A property of a set which remains invariant under every topological mapping is called a topologicm property. Thus the properties of being open, closed. Or compact are topological properties. An important example of a homeomorphism is an isometry. This is a function -+ T which is one-to-one on S and which preserves the metric; that is.
If f is a homeomorphism,
{x:xe[o,b]
t .s
Then A is nonempty since a E A, and A is bounded above by b. Let e = sup A. Then a < e < h. We will prove thatf(c) = O. Iff(c) "" 0, there is a l-ball B(c; iI) in whichfhas the same sign as/(c). If fCc) > O. there are points x > cat whichf(x) > O. contradicting the definition of c. If f(c) < 0, then e - iI/2 is an upper bound for A, again contradicting the definition of c. Therefore we must havef(cl = O. From Bolzano's theorem we can easily deduce the intermediate
dr{J(x).!(y))
d,(x, y)
value theorem
for all pojnts x and y in S. If there is an isometry from (S, ds) to (I(S), dT} the two metric spaces are called isometric. Topological mappings are particularly important in the theory of space curves. For example, a simple arc is tbe topological image of an interval, and a simple closed curve is the topological image of a circle. 4.15 BOLZANO'S THEOREM
This section is devoted to a famous theorem of Bolzano which concerns a global property of real-valued functions continuous on compact intervals [a, b] in R. If the graph off lies above the z-axis at Q and below the x-axis at b, Bolzano's theorem asserts that the graph must cross the axis somewhere in between. Our proof will be based on a local property of continuous functions. known as the sign-preserving properly.
The intermediate value theorem, together with Theorem 4.28, implies that the continuous image of a compact interval S under a real-valued function is another compact interval, namely,
[inf/(S), supf(S»). (If/is constant on S. this will be a degenerate interval.) The next section extends this property to the more general setting of metric spaces.
87
4.16 CONNECrEDmss This section describes the concept of connectedness and its relation to continuity. Since A and Bare nonempty.ftakes both values 0 and I, sofis not constant. Also.fis continuous on S because the inverse image of every open subset of {O. I} is open in S. Next we show that the continuous image of a connected
Dtjilfitioa 4.34. A metric space S is coiled tfisco,u,ecled if S = A V B wher .A. and B are disjoint nonempty open sets in S. We call S connected if it Lr not ~ connected.
NOlI!. A subset X of a metric space S is called connected metric subspace of S. it is a connected metric space.
set is connected.
if when regarded as a
Tllure", 4.37. Let I: S --+ M be a function from (J metric space S to another metric space M. Let X be a connected subset 0/ S. II f is continuous on X, then f{X) is a connected subset of M. Let g be a two-valued function on f(X). We willsbow that g is constant. Consider the composite function h defined on X by the equation h(x) = g(/(x»). Then h is continuous on X and can only take the values 0 and 1, so h is two-valued on X. Since X is connected, h is constant on X and this implies that g is constant onf(X). Thereforef(X) is connected.
Eumple. Since an interval X in R 1 is connected. every continuous image !(X) is connected. If f has real values, the image f(X) is another interval. JUhas values in R", the image f(X) is called a curve in R". Thus, every curve in a- is connected. As a corollary theorem. of Theorem
Rumples
1. The metric: space S = R - to} with the-usual Euclidean metric is discoonected. . it is ~ union of two disjoint DODempty open sets. the positive J\\'aJ numbers and. ncpti\'C n!aI numbers. 2. Ew;ry open interval. in R is 00dnCCted. ql.1CDl:e of Theorem 3.11. 3.
u::
Proof
This was
proved in
of rational numbers., regarded as a metric subspace of Euclidean space R I, tS disconoected. In fact, Q = A V B. where A consists of aU rational numbers < and B of an rational numbers Similarly. every ball in Q is disconnected.
:n-.set Q
J2
>"/i
4. Every metric space S contains nonempty ~ the set {P} is 00Iltleded. late coaneetedness To function.
n:
with continuity
we
{O, I}.
I which
Theorem 4.38 (IlfIt:rmediate-wdue 'Monm for ruJ ctIlftimtoIu I~). Let / be real-valued and continuous on a connected subset S oIR*. Iff takes on two different talues in S, say a and b, then for each real c between a and b there exists Q point x il'l S such that I(x) = c. Proof. The image /(S) is a connected subset of Ri. Hence, I(s) is an interval containing a and b (see Exercise 4.38). If some value c between a and h were not inf(S). then/(S) would be disconnected.
4.17 COMPONENTS OF A METRIC SPACE
I.D other words, a two-val~ function is a continuous function whose only possible ,,~ are 0 and I. This can be regarded as a continuous function from S to the metric space T = {Of 1}, where T has ~ discrete metric. We recall that every subset of a discrete metric space T is both open and closed in T.
T~em 4.36. A metric space S is connected Iun.ctwn on S rs constant. Proof
if.
and only
This section shows that every metric space S can be expressed in a unique way as a union of connected "pieces" caned components. First we prove the following:
Ass~ S is connected and letlbe a two-valued function on S. We must ~how that 11s constant. Let A = 1-1({O}) and B = I-I({I}) be tbe inverse l~ages of th~ subsets {OJ and {I}. Since {OJ and {Il are open subsets of the discrete metne space {O, I}, both A and B are open in S. Hence. S = .A. V B where A and B are dis~oint open sets. But since S is connected, either A is emptr and B = S, or else B IS empt~ and A = S. In either case. I is constant on S. Conv.e?:ly. assume that S IS disconnected, so that S = A u B, where A and B are disJomt nonempty open subsets of S. We will exhibit a two-valued fiun..... ' ",.. on on S w bih IS not constant. c Let
connected.
TMonm 4.39. Let F be a collection 0/ connected subsets 01 a metric space S such that the intersection T = A is not empty. Then the Wliol'l U = A is
n,..".
UA""
If x E U, then x IE' A for some A in F. Since A is connected, and. since t E A,/(x) = I(t).
Proof Since T #- 0, there is some t in T. Let f be a two-valued function on U. We will show that/is constant on U by showing thatf(x) = f(t) for all x in U.
/ is constant on A
f(x) = {~
ifxe.A.. ifxeB.
Every point x in a metric space S belongs to at least one connected subset of S, namely {x}. By Theorem 4.39, the union of all the connected subsets which contain x is also connected, We call this union a component of S, and we denote it by U(x). Thus. U(x) is the maximal connected subset of S wbich contains x.
•
1'IIeon", 4.40. Every point of a metric SJKI" S belonga 10 a uniquely determined component of S. in otlrer wor•• tire components of S form a coIlectkm 0/ diajoinl sets whole union is S. Proof. Two distinct components cannot contain a point 4.39) their union would be a larger connected set con~g
4.18 ARCWlSE CONNECl'EllNESS
X'
Oef.·US The next theorem Theorem 4.41. relates arewise connectedness with connectedness.
89
x.
Proof. Let 9 be two-valued on S. We will prove that 9 is constant on S. Choose a point a in S. If x € S, join a to x by an arc r lying in S. Since r is connected, g is constant on I' so g(x) = g(a). But since x is an arbitrary point of S, this shows that 9 is constant on S, so S is connected. We have already noted that there are connected sets which are not arcwise connected. However, the concepts are equivalent for open sets.
a special property. called arcwi.se CONtectedness, whicla is possessed by some (but not aU) eonneeted sets in Euclidean space R".
~ 1.11: set S i3 caIJed arcwi.se connecled if for any two points a and binS there IS a COIItinuous junCtiOll f : [0, 1] - S such tAat 1(0) = •
NOTE.
!".-
and
f(l)
b.
o ::;:
Such a ~unctiOft is ~ ~ P!'th from a to b. If 1(0) ¢ f(1l. the image of [0, I] ~ f IS called au arc JOIIlID8 • and b. ThUll, Sis arcwise CODIlCCted if every pur of distinct points in S can be joined by an arc lying in S.· An:wise connected sets are ~. ~Jed pathwtre co1lllected. If f(t) = tb + (J - t)a for t ::;:, the curve Jom:tng • and b is called a line segment. I
SIa II s
1. Efti)'
COIl'\'Q R" is an:wi!Je COI1JXded. sia:JI% tbc Iioe qrneot joining two points of sudl a set lies m the set. In parIicuIac, every n-baJJ is arcwiilC connected.
:a~
1.. The
set
or two tangent
IJ
Fjgure4.4 3. The set~n Fig. 4.5 C?nsists.of.those points on the curve described by y = sin (llx), < x _ 1, along With the points on the horizontal segment -I .s x s O. This set IS connected but not arcwise connected (Exercise 4.46).
TlleoreM 1.14. Every open set Sin R" can be expressed in one and (}Illy one way as a countable disjoint union of open connected sets.
Proof By Theorem 4.40, the components of S form a collection of disjoint sets whose union is S. Each component T of S is open, because if x € T then there is an n-baU H(x) contained in S. Since B{x) is connected, B{x) s T, so T is open. By the Lindelof theorem (Theorem 3.28), the components of S fonn a countable collection, and by Theorem 4.40 the decomposition into components is unique.
Fq:ure4.S
lkji.tJititnt 4.45. A set in R" Is called 0 region if it is the union of on open connected set with some, none, ot all its boundary points. If none of the boundary points are
Def.4M included, the reqion is called an open region. the region is called a closed region. Some authors complex plane.
NOT!':.
Th. 4.47
'.II
1/ all
in Example
2 is not uni-
use the term domain instead of open region, especially in the Uniform continuity on a set A implies continuity this.) The converse is also true if A is compact. on A. (The reader should verify
4.19 UNIFORM
CONTINUITY
Suppose/is defined on a metric space (S, ds), with values·in another metric space (T. dr), and assume thatfi~ continuous on a subset A of S. Then, given any point p in A and any e > 0, there is a c5 > 0 (depending on p and on e) such that, if x E A, then d"{f(x), f(p»)
Ti4!orem 4.17 (Heine), Let f :S ....T be a function from one metric space (S, ds) to another (T, dT). Let A be a compact subset oj Sand assume that f is continuous
an A. Proof Then
is uniformly continuous on A.
<
whenever dS<x, p)
< 0.
Let
BS<a; r),
with,
a,
In general we cannot expect that for a fixed e the same value of (i will serve equaUy well for every point p in A. This might happen, however. When it does, the function is called uniformly continuous on A. lJe./iltitinll 4.46. Let f : S -+ Theafunction/rom one metric space (S, ds) toano/her (T, dr)· Then/is said to be uniformly continuous on a subset A of S if the follOWing condition holds. For every e > 0 there exists and p E A then dr<f(x)./(p»)
Q~
<;
x E BS<a; r)
f"'l
A.
Consider the collection of balls Bsf"a; ,/2) each with radius r12. These cover A and, since A is compact, a finite number of them also cover A, say
>
if
<
<
0.
To emphasize the difference between continuity on A and uniform continuity on A we consider the following examples of real-valued functions.
Examples
1. Let lex) = llx for x ;:. 0 and take A ~ (0, 1 J. This function is continuous on A but not uniformly continuous on A. To prove this, let It = 10, and suppose we could find a 0, 0 < J < 1, to satisfy the condition of the definition. Taking x = 0, p = 0/11. we obtain Ix - pi < J and
<
2:'
+ '" 2
r, -i
I/(x) -/(p)1
= ~I -
olio
! = !Q ;;. 10.
I(p) I >
Hence, for these two points we would always have I/(x) the definition of uniform continuity. 2. Let I(x) = Xl if x E R 1 and take A = (0, 1] as above. continuous on A. To prove this. observe that
I/(x) - l(p)1 ~
to, contradicting
Hence, p
dsCp, a~)
€
dsCp, x)
+ 'k '2
< £/2.
6
rk•
Using the
Ix" - p21
I(x -
pXx
+ p)1 <
triangle inequality
I/(x) - f(P)1 < U. Hence. if £ is given, we need only take = 1l/2 to guarantee that If(x) - 1(P)1 < e for every pair x, p with Ix - pI < J. This shows that/is uniformly continuous on A.
If Ix
pi < 0, then
d,,{!(x).!{p»)
This completes
d.(f(x).!(a,,))
dr<f(asJ,f(p»
<
+2=
11
s,
the proof.
9Z
4.21 FIXElJ..POINT THEOREM FOR CONTRACTIONS
n......
Del. 4.49
93
a function from a metric space (8, d) into itself. A point p in 8 is called ajixed point of/if /(p) = p. The function/is called a contraction of 8 if there is a positive number IX < I (called a cantraction constant), such that Let / : d(/(x), /(y» ~ rJ.d(x, y)
s .....be S
Let / be defined 011 an interval (0, b). Assume C E [0, b). If J(x) -> A as x -+ c through values greater than e, we say that A is the righthand limit off at c and we indicate this by writing lim f(x)
A.
for all x, y in S.
(1)
The righthand limit A is also denoted by /(c+). In the s, <> terminology this means that for every .& > 0 there is a {) > 0 such that
/j
< h.
x, lex). /(/(x)),
Po x,
p,,+ I /(P,.),
...
n = 0, 1,2, ...
Note that f need not be defined at the point c itself. If/is defined at c and if f(c+) = /(c), we say that/is continuousfrom the right at c. .. . Lefthand limits and continuity from the left at e are similarly defined If c E (0, b]. . If a < e < b, thenfis continuous at c if, and only If,
/(c) the following conditions
/(c+)
J(c-}.
We say c is a discontinuity of / if / is not continuous at c. In this case one of is satisfied: a) Either /(c+) or f(c-) does not exist. b) Both/(c+) and f(c-) exist but have different values.
c) 8othf(c+)
We will prove that {pft} converges to a fixed point off First we show that {p.} is a Cauchy sequence. From (7) we have
d(Pw+l'P.)
d{J(p.).!(Pft-'».$
I1.d(P."Pn_I)'
andf{c-)
existandf(c+)
=/Cc-)
¥-/(c).
Po) =
crt,.,
a!"
where c = d(pl> po)· Using the triangle inequality we find, for m > n,
• -1
In case (c), the point c is called a removable discontinuity, since the discontinuity could be removed by redefiningf at c to have the value /(c +) .= /(c.-): In cases (a) and (bl, we call c an irremovable discontinuity because the discontinuity cannot be removed by redefining/at e.
d(p"" PII)
L d(PUh
~~.
~-1
Pt) ::::; e
t-8
rt' L a: c--I-IX
c < --ci'.
I-a:
~JUritWlf
both exist
at some
4.49. Let / be definNl on a closed internal [a. b]. 1/ f(c+) ami /(c-) interior point c, then: is called the lefthandjump o/fat
is called the jump off at c. three numbers is different from 0, then c is called a jump dis-
Since rt' -+ 0 as n -> co, this inequality shows that {p.} is a Cauchy sequence. But 8 is complete so there is a point P in S such that p. -+ p. By continuity of J.
f(p) = /(lim
a) /(c) - /(c-)
b) /(c+) c) /(c+)
c,
0.1
c,
"-'liD
p,,\ J
lim f(p,,)
.-+eJ;,lo
"-,110
lim P..+ 1
p,
If anyone
continuity
0/ these
off.
Many important existence theorems in analysis are easy consequences 'Of the fixed point theorem. Examples are given in Exercises 7.36 and 7.37. Reference 4.4 gives appflcations to numerical analysis.
4.22 DISCON11Nl.lmES OF REAL-VALUED FUNCIlONS
For the endpoints a and b, only one-sided jumps are considered, tbe righthand jump at 0, f(o +) /(a). and the lefthand jump at bJ(h) - f(b- ).
Eumpies 1. The function [defined by /(x) = x/lxl if x #; 0,[(0) = A, bas a jump discontinuity at 0, regardless ofthe value of A. Here [{O+ ) t andf(O-) I. (See Fig. 4.6.) 2. The functi(ln / defined by /(x) = i if x of- 0, /(0) continuity at O. In this case [(0+ ) = /(0-) = 1.
=
The rest of this chapter is devoted to special properties of real-valued functions defined on subintervals of R.
Def.4.50
Th. 4.53
Monotonic Functions
We shall prove that functions which are monotonic on compact intervals always have finite right- and lefthand limits. Hence their discontinuities (if any) must be jump discontinuities. Theorem 4.51. If[is increasinq on [a. b], then/(c+) c in (a, b) and we haoe [(c-) sf(c) S/(c+).
A t the endpoints we have
FJgUre4.7
and/(c-)
f(a) S /(a+)
and
/(b-)
s [{b).
l. The function I defined by /(x) = l/x if x of;. 0, [(0) = A, hal; an irremovable discontinuity at O. In this case neither /(0+) nor /(0-) exists. (See Fig. 4.7.)
4. Thefunclionfdefined by lex) =, sin (l/x) if x '# 0./(0) '" A, has an irremovable discontinuity at 0 since neither 1(0+) nor /(0-) exists. (See Fig, 4.8.)
Proof. Let A = {I(x): a < x < c}. Since I is increasing. this set is bounded above by f(c). Let 0: = sup A. Then a; S f(c) and we shall prove that /(c-) exists and equals a. To do this we must show that for every I; > 0 there is' a (j > 0 such that c-6<x<c implies
I/(x) a;1< e.
5. The function f defined by f(x) ... x sin (J/x) if x ,t. 0, /(0) = I, has a removable jump discontinuity at 0, sinre[(O+) /(0-) O. (See Fig. 4.9.)
But since Cl = sup A, there is an element/(xt) of A such that a - t < f(xd $; 0:. Since / is increasing, for every x in (Xl' c) we also have It - e < [(x) $; 0:, and hence I/(x) ~ al < e. Therefore the number {j = c - XI has the required property. (The proof that [(c+) exists and is ';;;!f(c) is similar, and only trivial modifications are needed for the endpoints.) There is, of course. a corresponding theorem for decreasing functions which the reader can formulate for himself, Theorem 4.52. Let [be strictly increasing on a set S in R. Then strictly increasing on/(S).
r: exists
and is
FiKIIRl 4.8
Figure 4.9
FUNCTIONS
Since I is strictly increasing it is one-to-one on S, so exists. To see is strictly increasing, let j-, < Y2 be two points in/(S) and let Xt = f-I(y,), X2 = [-I(Y2)' We cannot have XI ';;;! Xl' for then we would also have y, ~ Y2' The only alternative is
Proof
tbat[-t
r:
! said to be increasing is
in S,
I be
a real-valued function defined on a subset S 01 R. Then (or nondecreasinqs on S if /()r every pair of points x and y implies
f -1 is strictly increasing.
x<y
I{x) $;J(y).
Theorem 4.52, together with Theorem 4.29, now gives us: Theonm 4.53. Let I be strictly increasing and continuous On a compact interval [a. bJ. Then 1-1 is continuous and strictly increasing on the interval (/(a),f(b)].
NOTE.
II x :: y impli~s.r.(x) < J~ then]' is said to be strictly increasing on S. (Decreasing y), functions are similarly defined.) A function is called monotonic on S if it is increasing on S or decreasing on S.
. If [is an increasing function, then -f IS a decreasing function. Because of this simple fact, in many situations involving monotonic functions it sufficesto consider only the case of increasing functions.
Theorem 4.53 tells us that a continuous, strictly increasing function is a topological mapping. Conversely, every topological mapping of an interval [a, b] onto an interval [c, dJ must be a strictly monotonic function. The verification of this fact will be an instructive exercise for the reader (Exercise 4.62).
EXERCISES Limits
or sequences
->
y2 (Xy)l (x _
or the following
if
1=1
« 1; {zoO} diverges
1=1
> 1.
b)
lex,
.1') ..
(xy)2'+
X
y)2
= 0,
""
c) z~/lI! ..... 0
.../;? + 2 - n, then Ow ...... O. 4.2 If au (an I + Q.)/2 for all 11 ~ I, show a.+! = Ha" - a. + Il.
(x
'2a1,)/3. Hint.
4.3 If 0 -c Xl <: 1 and if X0+1 = 1 - ~~ for all n ;?: 1, prove that {x.J is a decreasing sequence with Iirnit O. Prove also that x~+tlx" .....1-. 4.4 Two sequences of positive integers {o"l and {b.} are defined recursively by taking til = b, = 1 and equating rational and irrationaJ parts in the equation a. Prove that Q~ and that 2b,./a" .....
e) f(x, y)
Sinx = tan x
cos3 x
u.;
'\:2
+
=
b•../2 =. (Q,,_ 1
+ b"_l
.../2)2
for n ~ 2;
.../2
In each of the pr«:eding examples, determine whether the following limits exist and evaluate those limits that do exist: lim f(x, y). lim [lim [ix, y)] ; lim pim/(x, y)] ;
x-rO r-()
y-+O K-O
.1',
if Ian x ~ tan y.
through values
>.../i,
(r,71-(O,O)
4.12 If x
[0, I) prove that the following limit exists, lim [lim cos~· (m! 1fX)] ,
m-l>!j;()
4.S A real sequence {x,,} satisfies 7x,,+1 = x! + 6 for n ?: 1. If Xl = t, prove that the sequence increases and find its limit. What happens if Xl = Of if Xl = i1
tt ...... ro
4.6 If
con verges.
la.l
<: 2 and
la..+2
an+ll
:$
all
n~
1,
prove
and that its value is 0 or I, according to whether x is irrational or rational. COIItinuity or real-valued. fUDCtions 4.13 Let f be continuous on [0, b] and let f{x) '" 0 when x is rational. Prove that f(x) =: 0 for every x in [a, b J. 4.14 Let f be continuous: at the point a (Ob 0l' ... , a.. in RO.. Keep 02' 03' ... , o. ) fixed and define a new function 9 of one real variable by the equation g(x)
that
{aN}
4.7 In a metric space (S, d), assume that x •......x and Y. _,. y, Prove that d(l(", Y.) ..... dix, y). 4.8 Prove that in a compact metric space (8, d), e'llety sequence in S has a subsequence which converges in S. This property also implies that S is compact but you arenot required to prove this. (For a proof see either Reference 4.2 or 4.3.) 4.9 Let A be a subset of a metric space S. If A is complete, prove that A is dosed. that the converse also holds if S is complete. Limits of functions
NOTE.
Prove
= f(x,
02·' .••
Q~).
Prove that 9 is continuous at the point x al' (This is ~metimes stated as follows: A continuous function of 11 variables is cominuous in each oariable separately.) 4.15 Show by an example that the converse of the statement in Exercise 4.14 is not true in general. 4.16 Let/, g, and h be defined on [0, I] as follows: f(x) g(x) h(x) 0, whenever x is irrational;
/(x) = 1 and g(x) = x,
and assume x
r;:
Prove that (a) always implies (b), and give an example in which (b) holds but (a) does not. 4.11 Let f be defined
Oil
"-0
If(x +
h) - f(x)t
'" 0;
b) lim tf(x
.-0
h) - I(x
- illt
o.
whenever x is rational;
h(x) ~ I/n, if x is the rational number min (in lowest terms); h(O} ... 1.
R z. If lim f(x, y) L
Prove that / is not continuous anywhere in [0, 1]. that 9 is continuous only at x that h is continuous only at the irrational points in [0, I J.
0, and
and if the one-dimensional limits lim.._./(x, y) and lim?_./(x, y) both exist, prove that
... _111
4.17 For each x in (0, I}, let I(x) = x if x is rational, and let f(x) = I - x if x is irrational. Prove that: a) f(f{x» = x for all x in [0, 1]. b) /(x) + /(l - x) = 1 for all x in [0, 1).
value between
° and
1.
I.
a) Prove that/cannot
[0, I].
I is continuous.
4.18 Let fbe defined on R. and assume that there exists at least one point Xo in R at which
y) = f(x)
+ fly).
=
c) Prove that any function with this property has infinitely many discontinuities on [0, 1). 4.28 In each case, give an example of a function J. continuous on S and such that I(S) = T, or else explain why there can be no such I;
a) S = (0, 1),
ax for all x.
T T T
= =
4.19 Letlbeoontinuous on [a. bland defineg as follows:g{a) = f(a)and. for a < x ~ V, let g(x) be the maximum value of fin the subinterval [a, x]. Show that g is continuous' on
b) S = (0, I),
c) S = R',
(a, b].
.. ,I,. bem real-valued functions defined on a set Sin R". Assume that each h is continuous at the point a s. Define a new function I as follows: For each x in S, /(x) is the largest of the m numbers Ii(x), ... ,/,..(x). Discuss the continuity of I at a.
4.20 LetJi,
or
e) S "" {O, t] x (0, I], f) S "" [0, 1] x [0, 1], g) S = (0, 1) x (0, I),
4.21 Let /: S --> R be continuous on an open set S in R", assume that , E S, and assume thatl(P) > O. Prove that there is an If-ball B(P; r) such that.f(x) > 0 for every x in the ball. 4.22 Let f be defined and continuous on a dosed set S in R. Let
A {x:xeS
and
/(x)
= OJ.
Continuity in PJetric spaces In Exercises 4.29 through 4.33, we assume that space (5, ds) to another (T, dT).
4.29 /-'l(int B) S int/-I(B) 4.30 Prove thatlis
r: S
-->
Prove that lis continuous on S if, and only if, for every subset B of T.
I:
A = {(x. y); y
< /(x)},
B = {(x, y):.v
> /(x)}.
Prove that / is continuous on R if, and only if, both A and B are open subsets of R 2. 4.24 Let
/be
4.31 Prove that .f is continuous on S if, and only if, f is ~ontinuous on every compact subset of S. Hint. If X" ..... pin S, the set {p, XI> X2'··· } IS compact.
4.32 A function.f:
S --> Tis called a ctosed mapping on S if the imagel(A) is.dosed in T for every closed subset A of S. Prove that/is continuous and dosed. on S If, and only
4.25 Let I be continuous on a compact interval (a, b). Suppose that / has a local maximum at XI and a local maximum at Xl' Show that there must be a third point between XI and X2 where/has a local minimum,
NOTe. To say that I has a local maximum at .1:"1 means that there is a l-ball B(Xl) such that I(x) -s f(x I) for all x in B(x I) rv [u, b J. local minimum is similarly defined.
if, /(it) = M for every subset A of S. . 4.33 Give an example of a continuous f and a Cauchy sequence {x~} in some metnc space S for which {f(x.)} is not a Cauchy sequence in T. 4.34 Prove that the interval (- 1, I) in R I is hOH_Ieomorphie to R I. This shows that neither boundedness nor completeness is a topological property.
I(
4.35 Section 9.7 contains an example of a function t. continuous on [0, I]. with [0, I]) = [0, 1] x [0, I]. Prove that no such f can be one-to-one on [0, I ]. CoonectedDeSS
4.3(; Prove that a metric space S is disconnected.if, and only if, there is a nonempty subset
4.26 Let /be a real-valued function, continuous on [0. I), with the following property: For every real r. either there is no x in [0, I] for which [(x) = y or there is exactly one such x. Prove that/is strictly monotonic on {O, I]. 4.27 Let I be a function defmed on [0, I] with the following property: For every real number y, either there is no x in [0, I] for whichf(x) '" y or there are exactly two values of x in [0, 1) for whichf(x) = y,
A of SA",
4.37 P~ove tlla; a metric space S is connected if, and o~\Y if. the only subsets of S which are both open and closed in S are the empty set and S Itself. .. 4.38 Prove that the only connected subsets of R are (a) the em.pty (b) sets consssung of a single point, and (e) intervals (open, closed, half-open, or infinite),
s:et,
100
4.39 Let X be a connected subset of a metric; space S. Let Y be a subset of S such that X <;; y <;; X. where X is the closure of X. Prove tha.t Y is also connected. In particular,
101
4.40 If x is a point in a metric space S, lei U{x) be the component of S containing x. Prove that U(x) is closed in S. 4.41 Let S be an open subset of R. By Theorem 3.11, S is the union of a countable disjoint collection of open intervals in R. Prove that each of these open intervals is a component of the metric subspace S. Explain why this does not contradict Exercise 4.40. 4.41 Given a compact set S in R"' with the following property: For every pair of points a and h in S and for every II > 0 there exists II. finite set of points {so. Xt. •.• , x,,} in S with Xo = a and xN = b such that
4.55 Let f :S -+ T be a function from a metric space S to another metric space T. Assume I is uniformly continuous on a subset A of S and that T is com~ete. prove that there is a unique extension of /to A which is uniformly rontinuous on A. 4.56 In a metric space (S, d). let A be a nonempty subset of S. Define a function
fA : S
I,b}
for each x in S. The number
b) Prove that
= inf {d(x.y):
yEA}
I.. {x)
XES
= {x;
I..(x)
= O}.
for k
Prove or disprove; S is connected.
= I, 2•...
II.
4.57 In a metric space (S, d). let A and B be disjoint dosed subsets of S. Prove that there exist dildoint open subsets U and Y of S such that A <;; U and B s V. Hint. Let g(x) = h,{x) - IJ.x), in the notation of Exercise 4.56, and consider 9- I{ - 00. 0) and
g~](O,
+ 00).
4.43 Prove that a metric space S is connected if, and only if, every nonempty proper subset of S has a nonempty boundary.
4.44 Prove that every convex subset of R" is connected.
DiscoIIliI'lllities
4.S8 Locate and classify the discontinuities of the functions! defined on R [ by the following equations: a) I(x) = (sin x)lx
b) I(x) = ell"
4A5 Given a function f: R't .... R"' which is one-to-one and continuous on R". IT A is open aDd disconoectl:d in R-, prove that f(A) is open and disconnected in f(R"). 4.46 LetA = {(x.y):O -c x s 1, y = sin I/x}. B = {(x,y):y = 0, -I s x sO}, and let S = A v B. Prove that S is connected but not arcwise ooonected. (See Fig. 4.5. Section 4.18,) 4.47 Let F = {Fl' Fl •... } be a countable colle<;tion of connected compact sets in R" such that FH 1 s Fk for each Ie 2:; 1. Prove that the intcnCction t FA:is connected and closed.
if x ". O./(O) if x
'f: 0.[(0)
c) f(x)
= ellx
+ sin
(l/x)
if x ". 0, /(0)
= o. = o. = o.
n.i..
d) f(x) = I/O ell"} if x ". O,{(O} = O. • 4.59 Locate the points in R:.l at wbidt each of the functions in Exercise 4.11 is not con-
tinuous. Moootooie flllPctioos 4.60 Let/be defined in the open interval (0, b) and assume that for each interior point x of (a. b) there exists a I-ball B(x) in which / is increasing. Prove that! is an increasing function throughout (0, b). 4.61 Let/be continuous on a compact interval {a. b] and assume that/does not hav~ a local maximum or II. local minimum at any interior point. (See the NOTE following Exercise 4.25.) Prove that I must be monotonic on [a. b]. 4.61 IfI is one-to-one and continuous on [a, b]. prove that! must be strictly monotonic on Ia. b}. That is, prove that every topological mapping of [a. b] onto an interval [e, d] must be strictly monotonic. 4.63 Let I be an increasing function defined on [a. bland the interior such that a -c XI < X2 < ... < x~ < b.
a} Show that
4.48 Let Sbe an open connected sci in R". Let Tbea component ofR« R- - T is connected. 4.4' Let (S, ti) be a connected metric space which is not bounded. a in S and every r :> 0, the set {z : di«, a) = r} is aonemply.
Uniform aJDtinuity
S. Prove that
R.
be bounded on S. 4.53 Let f be a function defined on a set S in R- and assume thar f(S) <;; R", Let g be defined on r(S) with value in Rl, and let .. denote the composite function defined by h(x) g[f(x)] ih E S. Ifris unifol"mly oonlmuous on Sand ifg is uniformlycontmuous on f{S). show that b is unirormly continuous 00 S. 454 Assume I : s -+ T is uniformly continuous on S, w~re Sand T are metric spaces. If {x.} is any Cauchy sequence in S. prove that {/{x,,)} is a Cauchy sequence in T. (Compare with Exercise 4.33.)
let x I,
...
Xw
be n points in
D=l
I has
(f(Xt+)
- I(Xl-)]
;;;f(b-)
- I(a+}.
b) Deduce from part (a) that the set of discontinuities of f is countable. c) Prove that that points of continuity in every open subinterval of [a, b]. defined and strictly increasing on a set S in R. such
4.64 Give an example of a functionf,
/-1 is not
continuous on!(S}.
103
10l
4.65 Let/be strictly inCtt.!lsing 00 a subset S of R. Assume that the image/(S) has one of the following properties: (a)f(S) is open; (b)/(S) is connected; (c)/(S) is dosed. Prove that/must be continuous 00 S. Metric spaces and hed JI(lOlts 4.66 Let B(S) denote the set of all real-valued functions which are defined and bounded on a nonempty set S. Ir /e B(S).let
the condition
in Exercise 4.71.
If x
IE S,
let Po = x,
a) Prove that {e.) is a decreasing sequence, and let c = lim "". b) Assume there is a subsequence {PM"») which converges to a point q in S. Prove that
d(q,/(q»
d(!(q),/[J(q»)).
II/II
The number
= sup 1/(x)J.
Ifl is called
suGGESTED
g~ defines a metric don B(S).
REFERENCES
FOR FURTHER
STUDY
Irf ~
4.1 Boas,. R. P., A Prime,. of Real FflflCliortS. Carns Monograph No. 13. Wiley, New York,I960. 4.2 Gleason, A .• FlUItiamentais 0/ Abstract Analysis. Addison-Wesley. Reading, 1966,
b) Prove that the 'metric space (B(S). d) is complete. Hint. If {f.} is a Cauchy sequence in B(S), show that {f.(x)} is a Cauchy sequence of real numbers for each x in S. 4.67 Refer to Exercise 4.66 and let C(S) denote the subset or B(S) consisting of all functions continuous and bounded on S, where now S is a metric space. a) Prove that C(S} is a closed subset of B{S). b) Prove that the metric subspace C(S) is complete. 4.68 Refer to the proof of the fixed-point theorem (Theorem 4.48) for notation. a) Prove that dip, p,,) -s d(x.J(x)}:t'/(1 - a), This inequality. whk:h is useful in numerical work. provides an estimate for the distance from P.. to the fixed point p. An example is given in (b). b) Take/(x) = t(x + 2jx). S = [I. + co). Prove that/is a contraction of S with contraction constant a = t and fixed point p =..12. Form the sequence {pft} starting with x = Po = 1 and show that Ip. - ../21 .s; 2-", 4.69 Show by counterexamples that the fixed-point theorem for contractions need not hold if either (a) the underlying metric space is not complete, or (b) the contraction constant II ;:::1. 4.70 Let f :S .... S be a function from a complete metric space (S, d) into itself. Assume there is a real sequence {a~} which converges to 0 such that dU"(x), P'(y» :$ «,p(x, y) for all 11 ~ 1 and all x, y in S, where/" is the nth iterate of /; that is,
P(x) = /(.1'). f"+1(x) = /(/"(.1'»
4.3 Simmons. G. F., Introduction to Topology and Modtrm AlUllysis. McGraw-Hili, New York, 1963. 4.4 Todd, J" Survey 0/ Numerkal A1Illlys;s, McGraw·HiIl, New York, 1962.
forn2:L
a uaiqae
fu!:oo
point.
4.71 Let j": S .... S be a function from a metric space(S, d) into irsell such that whenever x ~ y. d(J(x),
fey»~
< di», y)
a) Prove that [has at most one fixed point, and give an example of such an /with no fixed peim, b) If S is compact, prove that f has exactly one fixed point. g(.1') d(x. /(.1'» attains its minimum on S, Hint. Show tbat
Th.5.3
105
CHAPTER
derivative off Similarly, the nth derivative of J, denoted by P,.}, is defined to be the first derivative of for n = 2, 3, . . .. (By our definition, we do not consider p.) unless r:» is defined on an open intervaL) Other notations with
r:».
DERIVATIVES
S.1 INTRODUCfION This chapter treats the derivative, the central concept of differential calculus. Two different types of problem-the physical problem of finding the instantaneous ~elocity of a moving particle, and the geometrical problem of finding the tangent h,ne to a curve at a given point-both lead quite naturally to the notion of derivatrve ', Here. w~ shall not be concerned with applications to mechanics and geometry, but Instead wIll confine our study to general properties of derivatives, This chapter deals primarily with derivatives of functions of one real variable specifically, real-valued functions defined on intervals in R. It also discusses bri~fly .deriv~tives of vect~:}f-~a1ued functions of 0n.e real variable, and partial derivatives, srnee these tOpICS Involve no new ideas. Much of this material should be familiar to the reader from elementary calculus. A more detailed treatment of derivative theory for functions of several variables involves significant changes and is dealt with in Chapter 12. The last part of the chapter discusses derivatives of complex-valued functions of a complex variable. S.2 DEFINITION OF DERIVATIVE If f is defined on an open interval (a, b), then for two distinct points x and c in (a, b) we can form the difference quotient
/'(e)
= D/(e)
== ~f (c) = ~.~
dx dx
"me
or similar notations. The function I itself is sometimes which produces f' from I is called differentiation.
PO).
The process
The next theorem makes it possible to reduce some of the theorems on continuity.
Tlworem 5.2. If I is defined on (a, b) and differentiable at a point c in (a, b), then there is a function 1* (depending on! and on e) which is continuous at c and which satisjit'f the equation
I(x) - J(c)
(x - e)f*(x),
(I)
for all x in (0, b), with j*(c) = /'(c). Conversely, tinuous at e, which satisfies (I), then f is differentiable Proof If/,(c) exists, let/*
if
/*(x)
Thenj* is continuous
f(x} -:-J(c1
C
= /,(c).
Conversely, if (I) holds for somef* continuous at e, then by dividing by x and Jetting x ...... we see that/,(c) c exists and equalsf*(c).
f(x)
- fCc) x c
as x .... c.
CE
As an immediate consequence
Tlworem 5.3. Iff Proof Let x
-+ C
is differentiable in (I).
Definition 5.1. Let f be defined on an open interval (a, b), and assume that
f is said
(a, b).
NOTE.
IQ
be differentiable
01
x--+('
. f(x)j 1m X-
Equation (I) bas a geometric interpretation which helps us gain insight into its meaning. SinceJ* is continuous at cJ*(x) is nearly equal toft(c) = /'«(') if x is neal' c. Replacing/*(x) by /'(1:) in (1) we obtain the equation
I(x)
f(c)
+ !'(c)(x
- c),
exists.
This limit process defines a new function I', whose domain consists of those points in (a, b) at which I is differentiable. The function /' is called the first 104
which should be approximately correct when x c is small. In other words, if lis differentiable at c, then/is approximately a linear function near c. (See Fig, 5.1). Differential calculus continually exploits this geometric property of functions.
106
Deri'ratives
Th.S.4
Def.5.6
One-Sided Derivati_
107
Theorem 5.5 (CNUII ruk). Let f be defined on an open interval S, let 9 be defined on I(S), and consider the composite function 9 f defined on S by the equation
0
(g JXx)
0
g[j(x)).
Figure 5.1
Assume there is a point c in 5 such that f(c) is on interior point of 1(5). II f is di.fferenliable at c and if g is d{trerentiable at fCc) then go f is differentiable at c and we have (g fnc) = gTf(c)]J'(c).
Q
Proof
for all x in S,
The next theorem describes the usual formulas for differentiating the sum, difference, product and quotient of two functions.
/'(c). Similarly,
Lv -
f(c)]g*(y),
_r + gJ
a)
b) c) Proof
Theorem 5.4. Assume f and 9 are defined on (0, b) and differentiable - g, and]> gare alsodijferentiobJe at c. This is also trueolllg The derivatives at c are given by the following formulas."
at c. Then if g(c) #- O.
for all y in some open subinterval T of/(S) containingf(e). at/(e) and g*[j{c)] g'[f(c)]. Choosing x in S so that y = f(x)
g[j(x)] - g[f(e)]
= E
Here g* is continuous
U±
g)'(c)
= /'(e) ±
=
=
T, we then have
=
g'(c),
[J(x) - f(c)]g*[f(x}]
(x - c)f*{x)g*(f(x)].
(2)
(f'g),(c) U/g)'(c)
f(c)g'(e)
g(c)f'(c)
+ J'(c)g(c),
- f(e)g'(~) g(e)2 •
o.
g*lf(x)]
..... g*(f(c)]
== 9Tf(c)]
as x
-jo
c.
(x -
c)f*(x),
+ +
lim g[f(x)]
(x - c)g*(x).
Thus,
I(x)g(x) - I(c)g(c)
as required.
5.6 ONE-SIDED
>;-"
X -
g(J(c)]
C
= g'[I(c)]/,{{'),
== (x - c)(f(c)g*(x)
-+
+ f*(x)g(C)]
DERIVATIVES
AND INFINITE
DERIVATIVES
From the definition we see at once that iff is constant on (a, b) then f' = 0 on (a, b). Also, if f{x) x, then f'(x) I for all x. Repeated application of Theorem 5.4 tells us that jf f(x) "" x" (n a positive integer), then f'(x) "" nx"-1 for all x. Applying Theorem 5.4 again, we see that every polynomial has a derivative everywhere in R and every rational function has a derivative wherever it is defined. 5.5 THE CHAIN RULE
Up to this point, the statement that I has a derivative at c has meant that c was interior to an interval in which f was defined and that the limit definingj'(c) was finite. It is convenient to extend the scope of our ideas somewhat in order to discuss derivatives at endpoints of intervals. It is also desirable to introduce infinite derivatives, so that the usual geometric interpretation of a derivative as the slope of a tangent line will still be valid in case the tangent line happens to be vertical. In such a case we cannot prove that I is contin uous at c. Therefore, we explicitly require it to be so. Definiti6R 5.6. Let I be defined on a closed interval S and assume that f is continuous at the point c in S. Then I is said to have a righthand derioauoe at c if the righthand limit lim f(x)
x-,'+
A much deeper result is the so-called chain rule for differentiating composite functions.
- ftc)
XC
lOCI
lb. 5.7
n.5.9
lot
exists as a finite value. or if the limit is +00 or - 00. This limit will be deIWted by f~(c). Lefthand deril.llJtives, denoted by f:"'(e), are similariy defined. In addition, if c is an !nterior point of S, then we say that / has the derivative f'(c) = + OJ if both the rlgh/~ and /efthand derivatives at care + 00. (The derivative f'(c) = _ 00 is Similarly defined.)
If/,(e)
= + 0:).
>1
x-c
whenever x #- c.
It is clear thatfhas a derivative (finite or infinite) at an interior point c if and only if,f~(e) = f ~(c), in which casef~(c) = f'_(c) "= f'(e). •
In this ball the quotient is again positive and the conclusion follows as before. A result similar to Theorem 5.7 holds, of course, jf f'(e) at some interior point c of (0, b).
< 0 or if /'(c)
=-
0:)
IHjiJrititnI S.8. Let f be a real-fi4lued /unction defined on a subset S 0/ a metric space M, and assume a e S. Then. f is said to have a local maximum at a if there is a ball 8(a) such that f(x) If f(x)
NOTE.
~ /(D)
for aU
in 8(a) n S.
~ f(a) lor alJ x iii B(a) n S, then / is said to hove a local minimum at a.
A local maximum at a is the absolute maximum off on the subset B(a) n S. Iffhas an absolute maximum at a, then a is also a local maximum. However,/ can have local maxima at several points in S without having an absolute maximum on the whole set S.
YIgUU
S.Z
The next theorem shows a connection between zero derivatives and local extrema (maxima or minima) at interior points.
TItsoum j.!J. Let f be defined on an open interval (a. b) and assume that / has a local maximu.m or a local minimum at all interior point e of ia, h). If I has a deriuatite (finite or infinite) at e, then f'(e) must be O.
Figure 5.2 illustrates SOme of these concepts. At the point Xl we havef~(xl) = At Xl the lefthand derivative is 0 and the righthand derivative is 1. Also, f'(x3) ~ 00, f~(X4) = -1, .f~(X4) = +1, f'(x6) = +00, and /'-(x7) =2. There IS no derivative (one-sided or otherwise) at xs. since f is not continuous there.
00.
5.1 FUNCfIONS
~ In (~. b) 111 which
WITH NONZERO
DERIVATIVE
If f'(c) is positive or +00, then f cannot have a local extremum at c because of Theorem 5.1. Similady,f'(c) cannot be negative or -00. However, .because there is a derivative at c. the only other possibility is/,(c) O.
Proof.
T":orem 5.7. Let f be defined on an open interval (a, b) and assume that for SOme we have f'(e) > 0 or f'(c) = + co. Then there is a I-ball B(c) s (a, b) f(x)
> ftc)
if x > c,
f(x)
and
/(x)
< fee)
if
< c.
Proof
r(c) > O. By the sign preserving property B(e) r;;;; (a, b) in whichf*(x) has the same f(c) has the same sign as x-c.
The converse of Theorem 5.9 is not true. In general, knowing thatf'(c) = 0 is not enough to determine whether I has an extremum at c. In fact, it may have neither, as can be verified by the example f(x) = x3 and e = O. In this case, 1'(0) = 0 but/is increasing in every neighborhood ofO. Furthermore, it should be emphasized that f can have a local extremum at c without f'(c) being zero. The example I(x) = Ixl has a minimum at x = 0 but, of course, there is no derivative at O. Theorem 5.9 assumes that I has a derivative (finite or infinite) at c. The theorem also assumes that c is an interior point of (a, b). In the examplef(x) = x, where a !>: x !>: b,ftakes on its maximum and minimum at the endpoints butf'(x) is never zero in [a, b].
110
Th.5.10
Cor. 5.15
111
S.9 ROLLE'S
THEOREM
It is geometrically evident that a sufficiently "smooth" curve which crosses the x-axis at both endpoints of an interval [a, b] must have a "turning point" somewhere between a and b. The precise statement of this ract is known as Rolle's
NOTE. The reader should interpret Theorem 5.12 geometrically by referring to the curve in the xy-plane described by the parametric equations x = g(I), Y = I(t}, a:S;; t :s;; h.
theorem.
There is also an extension which does not require continuity at the endpoints. lhtJulfl 5.13. Let I and 9 be two lunctions, each having a derivative (finite or infinitf?)at each point 01 (a, h). At the endpoints assume that the limits I(a +), g(a+). I(b-) and g(b-) exist as finite values. Assume further that there is no interior point x at which both /'(x) and gt(x) are infinite. Then for some interior
point c we have
Tlteorem 5.10 (Rolle). Assume f IuJs (1 derivative (finite or infinite) at each point 01 an open interval (0, b), and assume that I is continuous at both endpoint» a olld b. 111(a) = f(b) there is at least one interior point c at whichf'(c) =
o.
Proof We assume f' is never 0 in (a. b) and obtain a contradiction. Since I is continuous on a compact set, it attains its maximum M and its minimum m somewhere in [a, bJ. Neither extreme value is attained at an interior point (otherwise I' would vanish there) so both are attained at the endpoints. Since I{a) = I(b), then m = M. and hencelis constant on [a, b]. This contradicts the assumption tharj" is never 0 on (a, b). Therefore /,(c) = 0 for some c in (a. b).
S.10 THE MEAN-VALUE THEOREM FOR DERIVATIVES
j'(c)[g(b-) Proof
- g(a+)] = g'(c)(f(b-)
f(!t)].
= I(x)
G(a)
and
G(x) = g(x}
if x
IE
(a, b);
1(a+).
G(b)
= g(b-).
TAeorem 5.11 (Mellllf-Yalue TfIeorem). Assume that I has a derivative (finite or infinite) at each point of an open interval (a, h). and assume also that I is continuous at both endpoints a and b. Then there is a point c in (a. b) such that
Then F and G are continuous on [a, b] and we can apply Theorem 5.12 to F and G to obtain the desired conclusion. The next result is an immediate consequence of the Mean-Value Theorem. Theorem 5.14. Assume I has a derivative (finite or infinite) at each point of an open interval (0. b) and 1M' I is continuous at the endpoints a and b. a) If f" takes only positive values (finite or infinite) in (a, b), then f is strictly Increasing on [a, h].
b) e)
I(b) - f(a)
= /'(c)(b
a).
Geometrically, this states that a sufficiently smooth curve joining two points We will deduce Theorem 5.11 from a more general version which involves two functions f and 9 in a symmetric fashion.
A and .B has a tangent line with the same slope as the chord AB. Tltevrem 5.12 (GeneraIivd Mea". Value Theorem), Let I and 9 be two functions, each having a derivative (finite or infinite) at each point 01 an open interval (a, b) and each continuous at theendpoints a and h. Assume also that there is no interior point x at which bOlhP(x) and g'(x) are infinite. Then lor some interior pcint c we have
II/' takes only negative values (finite or infinite) in (a, b), then decrt'tlJi1l!l on [0, b]. II/'
is zero f!Vf!rywhf're in (a, b) then
is strictly
I is
Proof Choose x < y and apply the Mean-Value Theorem to the subinterval [x, y] of [a. bJ to obtain I(y) - I(x)
= /,(c){y
- x)
/,(c}[g(b) - g(a)]
NOTE.
= g'(c)[J(b)
- 1(0)].
All the statements of the theorem follow at once from this equation.
By applying Theorem 5.14 (c) to the difference f Corolltuy 5.15. in (a, b), then f
When g(x)
9 we obtain:
Proof. Let hex) = f(x)[g(b) - g{a)] - g(x)[J(b) - 1(0.)]. Then h'(x) is finite if both!'(x) and g'(x) are finite, and h'(x) is infinite if exactly one of/,(x) or g'(x) is infinite. (The hypothesis excludes the case of both and g'(x) being infinite.) Also, " is continuous at the endpoints, and h(a) = h(b) = I(a)g(b) - g(a)f(b}. By Rolle's theorem we have h'(c) = 0 for some interior point, and this proves the assertion.
II I
and 9 are continuous on [0, b] and haoe equal finite derivatir,'es 9 is constant on [a, b]. TIJEOREM FOR DERIVATIVES
rex)
5.11 lNTERMEDIATE-VALUE
In Theorem 4.33 we proved that a function I which is continuous on a compact interval [a, b] assumes every value between its maximum and its "minimum on
III
Th.S-It;
Th. !UO
113
the interval. In particular,J assumes every value between f(a} and feb}. A similar result will now be proved for functions which are derivatives. v'orem S.16 (l"'e''''dmt~ tlt.uremfor tkr~N). Assume that f is defined on a compact interval [a, b] and that f bas a derilJiZtive(finite or infinite) (11 each interior point, Assume also that / has finite one-sided derivatives I~(a) and f'_(b) at the endJWints. with f+(a) .p. /'-(b}. Then, if c is a real number between /+(a) and f'-(b), there exists at least one interior point x !luch that /'(x) = c. Proof. Define a new function 9 as follows'
discontinuity (by Theorem 4.5 J). Hence f' omits some value between /'(Cl) and /'(fJ), contradicting the intermediate-value theorem.
5.12 TAYLOR'S FORMULA WITH REMAINDER
As noted earlier, iff is differentiable at c, thenj" is approximately a linear function near c. That is, the equation f(x)
=
/(c)
+ f'(cXx
- c),
if x
f; tl,
g(a)
= f';'(a).
Then 9 is continuous on the closed interval [a, h]. By the intermediate-value theorem for continuous functions, 9 takes on every value between /';'(a) and [feb) - l(o)]/(b - a) in the interior (a, b). By the Mean-Value Theorem, we have g(x) = J'(k) for some k in (u, x) whenever x E (a, b). Therefore /' takes on every value between 1+(0) and [/(b) - l(a)]/(b - a) in the interior (a, b). A similar argument applied to the function h, defined by
h(x)
is approximately correct when x c is small. Taylor's theorem tells us that. more generally,fcan be approximated by a polynomial of degree n I ir/has a derivative of order n. Moreover, Taylor's theorem gives a useful expression for the error made by this approximation. TllHrem 5.19 (Taylor). Let f be a !unction having finite nth der;lJiZtivep.) everywhere in an open interval (a, b) and assume that j<,,-l) is continuous on the dosed interval [a, b]. Assume that e E [a, b]. Then,for every x in [0. b]. x .p. C, there exists a point x 1 interior to the interval joining x and c such that
f(x)
= f(x)
- feb)
x~b
jf x '# b,
h(b)
f:(b),
= f(c) +
.-1 f(I:)( }
.=1
L __e (x ~ k!
e)"
+~
f(O), n!
(x - C)R.
shots that ]" takes on every value between [/(b) - f(a)]/(b a) andf~(b) in the interior (a, b). Combining these results. we see that/, takes on every value between I~(a) andf:(b) in the interior (a, b), and this proves the theorem. Theorem 5.16 is still valid if one or both of the one-sided derivatives I~(a), f~(b), is infinite. The proof in this case can be given by considering the auxiliary function g defined by the equation g(x) = /(x) - ex, jf x E [a, b]. Details are left to the reader.
NOTE.
Taylor's theorem will be obtained as a consequence of a more general result that is a direct extension of the generalized Mean-Value Theorem. Theorem 5.20. Let / and 9 be two functions having finite nth derivalilJes l and g('" in an open interoal (0,11) and continuous (n - l)st derivatives in the closed interval [a, b]. Assume that C E [a, b]. Then, for every x in [a, b], x #; C, there exists 0 point Xl interior to the interval joining.;,K and c such that
r'
The intermediate-value theorem shows that a derivative cannot change sign in an interval without taking the value O. Therefore, we have the following strengthening of Theorem 5. 14(a) and (b). Tlreorem 5.17. Assume f has a derivative (finite or infinite) on (a, b) and is continuous at the endpoints a and h. If /'(x) #: 0 lor all x in (fl, b) then f is strictly monotonic on [a, b J. The intermediate-value theorem also shows that monotonic derivatives are necessarily continuous.
/(x) -
.-If(l) )]
t~O
L~ k!
(x - e)l
g(·)(XI)
= P")(xt)
[0-1
g(x) -
l=O
(l}() L g~k! (x
- c)~ .
NOTE.
o~
For the special case in which g(x) = (x - e)", we have g(i)(C) = 0 for k s; n - I and g(lO'(X) = n!. This theorem then reduces to Taylor's theorem.
C
Proof.
For simplicity, assume that new functions F and G as follows: F{t) G(t)
>
= f(t) +
=
Theorem 5.18.
L: _t k!
ril( )
(x ~
tl,
9(t)
0-1
k~1
We assume j" has a discontinuity at some point c in (a, b) and arrive at a contradiction. Choose a closed subinterval [I)', P] of (a, b) which contains c in its interior. Since f' is monotonic on [a, If] the discontinuity at c must be a jump
L:-gk!' _
(l)(t)
(x - I)",
for each t in [c, x]. Then F and G are continuous on the closed interval [c, x] and have finite derivatives in the open interval (c, x), Therefore, Theorem 5.12 is
1:14
PartiIIl Derimi:,.
115
applicable
F'(xl)[G(x)
G'(x,)[F(x)
- F(c)] ,
where
XI E
(c, z).
The Mean-Value Theorem, as stated in Theorem 5.11, does not hold for vectorvalued functions. For example, if I(t) = (cos t, sin t) for all real t, then f(21t) - 1(0) O.
G'(x1)[f(x)
- F(c)],
(a)
since G(x) = g(x) and F(x) = f(x). If. now, we compute the derivative of the sum defining F(t), keeping in mind that each term of the sum is a product, we find that all terms cancel but one, and we are left with F'(t) SimiJarly. we obtain
but 1'(1) is never zero. In fact, Ilf'(t)U = 1 for all t, A modified version the Mean-Value Theorem for vector-valued functions is given in Chapter 12 (Theorem 12.8). 5.14 PARTIAL DERIVATIVES Let S be an open set in Euclidean space R", and let f: S -+ R be a real-valued function defined on S. If:l[ = (x, •... , xJ and c = (CI, , ..• cJ are two points of S having corresponding coordinates equal except for the kth, that is, if Xi = c. fOT i '" k and if xt '" c1> then we can consider the limit
or
G'(t)
If we put t =
Xl
= (x -
I· /(:11:) 1m
%~......
fCc)
X ... -
c"
5.13 DERJV ATIVES OF VECTOR-V ALUJID FUNCfIONS Let I: (a, b) _,. R" be a vector-valued function defined on an open interval (a, b) in R. Then f = (/1' ... .f.J where each eomponent j; is a real-valued function defined on (a, b). We say that f is differentiable at a point c in (a, b) if each component!. is differentiable at c and we define f/(e) = U;(c}. ' , . , f~(c)). In other words, the derivative f'(c) is obtained by differentiating each component of r at c. In view of this definition, it is not surprising to find that many of the theorems on differentiation are also valid for vector-valued functions. For example, if f and g are vector-valued functions differentiable at c and if A is a real-valued function differentiable at c, then the sum r + g, the produce U, and the dot product f· g are differentiable at c and we have (f
When this limit exists. it is called the partilll deril10tive of fwith coordinate and is denoted by DJ(c),
or by a similar expression. We shall adhere to the notation Dd(c}. This process produces n further functions Dt!, DzJ, ... , DJ defined at those points in S where the corresponding limits exist. . Partial differentiation is not really a new concept. We are merely trea tmg /(x!, .. , x,,) as a function of one variable at a time, holding the others fixed. That is, if we introduce a function g defined by
g(Xt)
= /(Ch
c.),
g)'(c)
=
=
r'(c)
g'(c},
(ll')'(c) (f·g),(c)
}.'(c)f(c) f'(c)'g(e}
+ J.(c)f'(c),
+
f(c)·g'(c).
The proofs follow easily by considering components. There is also a chain rule for differentiating composite functions which is proved in the same way. If f is vectorvalued and if 11 is real-valued, then the composite function g given by g(x) "'" f[ u(x)] is vector-valued. The chain rule states that g'(c)
then the partial derivative DJ(c) is exactly the same as the ordinary derivative g'(Cl)' This is usually described by saying that we differentiate/with respect to the kth variable, holding the others fixed. In generalizing a concept from RI to R", we seek to preserve the important properties in the one-dimensional case. For example, in the one-dimensional case, the existence of the derivative at c implies continuity at c. Therefore it seems desirable to have a concept of derivative for functions of several variables which will imply continuity. Partial derivatives do not do this. A function of n variables can have partial derivatives at a point with respect to each of the variables and yet not be continuous at the point. We illustrate with the following example of a function of two variables:
a neighborhood
fT u(e)J
both
f(x, y)
= {~, + Y.
if x
= 0 or
= O.
otherwise.
ner.5.11 The partial derivatives DJ(O, 0) and Dd(O, 0) both exist. In fact, 1,
117
become a field] which includes C. Since division is possible in C, ho~ever. we can form the fundamental difference quotient [/(z) - I(c)]/(z - c) which was used to define the derivative in R. and it now becomes clear how the derivative should be defined in C. DejinitWfJ 5.11. LeI I be a complex-valued [unction defin~d on Q~ ~pen set S in C, and assume c E S. Then/is said to be differentiable at c !f the limit lim
r-ec Pe;_;(;...-z
and, similarly, D·d(O. 0) = I. On the other hand. it is clear that this function is not continuous at (0,0). The existence of the partial derivatives with respect to each variable separately implies continuity in each variable separately; but, as we have just seen, this does not necessarily imply continuity in all the variables simultaneously. The difficulty with partial derivatives is that by their very definition we are forced to consider only one variable at a time. Partial derivatives give us the rate of change of a function in the direction of each coordinate axis, There is a more general concept of derivative which does not restrict our eonsiderations to the special directions of the coordinate axes. This will be studied in detail in Chapter 12. The purpose of this section is merely to introduce the notation for partial derivatives, since we shall use them occasionally before we reach Chapter 12. If f has partial derivatives Dd •... , DJ on an open set S, then we can also consider their partial derivatives. These are called second-order partial derivatives. We write Dr.d for the partial derivative of Dtf with respect to the rth variable. Thus, D•.d Higher-order partial derivatives D,(Dd). Other notations are
)'----'--1..;..,(
Z -
cc.-)
f'(c)
exists. By means of this limit process, a new complex-valued ~unc.tioll f~is _?etined at those points z of S where f'(z) exists. Higher-order derivatives f ,f , ... are, of course, similarly defined. . The following statements can now be proved for complex-valued functions defined on an open set S by exactly the same proofs used in the real case:
a) f is dijJerfntiable at c if. and only if, there is a function f"", continuous that
f(z) - J(c)
01
c, such
(z
c)[*(z),
= f'(c).
J(z)
where g(z)
--jo
f(c)
+ !'(c)(z
- c)
+ g(z)(z
0 as z
--jo
c.
f.
In this section we shall discuss briefly derivatives of complex-valued functions defined on subsets of the complex plane. Such functions are, of course, vectorvalued functions whose domain and range are subsets of R 2• An the conside ..atiens of Chapter 4 concerning limits and continuity of vector-valued functions apply, in particular, to functions of a complex variable. There is, however. one essential difference between the set of complex numbers C and the set of n-dimensional vectors R" (when n > 2) that plays an important role here. In the complex number system we have the four algebraic operations of addition. subtraction, multiplication, and division, and these operations satisfy most of the "usual" laws of algebra that hold for the real number system. In particular, they satisfy the first five axioms for real numbers listed in Chapter I. (Axioms 6 through 10 involve the ordering relation -c , which cannot exist among the complex numbers.) Any algebraic system which satisfies Axioms I through 5 is called a field. (For a thorough discussion of fields, see Reference 1.4.) Multiplication and division, it turns out, cannot be introduced in RIt (for n > 2) in such a way that R" will
b) If I is differentiable at c. then f is continuous at c. c) Iitwofunctionsfand 9 have derivatives at c, th~n their sum, difference, praduc:, and quotient also have derivatives at c and are gIVen by tire usualformulas (as In Theorem 5.4). In the case of JIg, we must assume gee) #- O. d) The chain rule is valid; thai is
(g
0
10 say.
11:0"
haue ]f'(c),
f)'(c)
= g'[/(e)
if the domain of g ('ontains a neighborhood of ftc) and if /,(e) and g'(f(c)] both exist.
When/(z) = z, we findf'(z) = I for all z. i~ C: Using (c) r:epeatedly, we find thatl'(z) = nz"-I when/(z) = z" (n is a positive integer), This also holds when For example, if it were possible to define multiplication in R:J so as to mak; R: a field 3 including C. we could argueas follows: For every x III R the veclor~ I, x; x ,x ,:,,",ouId be linearly dependent (see Reference SJ. p, SS8). Hence for each x In R • a relation of the form a + a x + a Xl + aJ,x3 = 4) would hold, where all> Qt. Ql. 03 are real hers pol;nomial of degree three with real coefficients is a product of a iynomial and a quadratic polynomial with real coefficients. The only roots such l~um PO. mear le be polynomials can have are either real numbers or romp x num rs.
i'U! ~
ttl
111. 5.12
Th. S.U
Caudly-R.IemamI EquatiOllS
119
n is a negative integer, provided z '" O. Therefore, we may compute derivatives of complex polynomials and complex rational functions by the same techniques used in elementary differential calculus.
5 .r 6 THE CAUCHY-RIEMANN EQUATIONS
at
andf*(c)
= /,(c).
ib, and
Write /*(z)
iy,
A(z)
iB(z).
where A(z) and 8(z} are real. Note that A(z) _,. A(e) and B(z) -> 8(c) as z _,. c. By considering only those z in S with y = band taking real and imaginary parts of (5), we find u(x, b) - Il(a, b) = (x - a)A(x
+ ib),
and
vex, b) -
!J(a, b) = (x -
a)B(x
+ lb).
~~.
. Similarly. whenf(z)
+ ;V(Z),
.
where II ~d v are real-valued functions of a complex variable. We can, of course, also consider II and v to be real-valued functions of two real variables and tben f(z)
= u(x. y)
A(e)
D11)(c)
= =
B(e).
iv(x, y),
if z
+ iy.
Since!'(c)
= A(c) + i8(e), this proves (3). Similarly, by considering only those z in S with x
a we find
In either case, we write f = ~ + ivand we refer to II and 11 as tbe real and imagrnary parts off. For example, 10 the case of the complex exponential function J, defined by f(z) the real and imaginary
and which proves (4). Applications of the Cauchy-Riemann equations are given in the next theorem.
~cosy
i~siny.
u(x, y)
Z2
Theorem 5.11. Let f = u + io be a function with a derivative everywhere in an open disk D centered at (a, b). If anyone 01 u, D, or If I is constanrt on D, then [ is constant on D. Also.f is constant iff'(z) = o jar all z in D. Proof Suppose u is constant on D. The Cauchy-Riemann equations show that D2t:i = D,» = 0 on D. Applying the one-dimensional Mean-Value Theorem twice
places a we find, for some y' between band J"
(y -
In the next theorem we shall see that the existence of the derivative/, rather severe restriction on the real and imaginary parts u and v.
v(x,
y) -
v(x, b) =
b)D2v(x,
y') =
0,
5.22. LeI J = u + iv he defined on an open set S in C. IJ /,(e) exists for so~e C In S, then the partial derivatives D1u(c), D2u(c), Dlv(e) and D2v(e) also exist and we have /'(c) D11I(c) + i D1v(c), (3) and j'(c) = D2v(e) - i D2u(e). (4) This implies, in particular, that and
These last two equations are known They are usually seen in the form
NOTE. {JIJ
Tu",em_
O.
Therefore v(x, y) v(a, b} for all (x, y) in D. so v is constant on D. A similar argument shows that if v is constant then u is constant. Now suppose If I is constant on D. Then 1/12 u2 + 1,1 is. constant on D. Taking partial derivatives. we find
as the Cauchy-Riemann
equations.
By the Cauchy-Riemann
equations
{'DIU
can be written as
= O.
DID
iJu
iJx
oy
defined on S such that
+ (;2 =
Ilf
Proof
Since/'(c)
is constant,
= O. If uf
(z - c)f*(z),
(5)
tHere
lZU Finally, jf f' = 0 on D. both partial derivatives DID and D1v are zero on D. Again. as in the first part of the proof, we find/is COnstant on D. Theorem 5.22 tells WI that a necessary condition for the functionj' = u + io to have a derivative at c is that the four partials DIu, D1.u, D1o, DIP, exist at c and satisfy the Cauchy-Riemann equations. This condition, however, is not sufficient. as we see by considering the following example. Enmple. Let
II
121
EXERCISES
ReaI.valued functions
In the following exercises assume, where necessary, .a knowledge. of ~he for~uJas for differentiating the elementary trigonometric, exponential, and logartthmic functions. 5.1 A function f is said to satisfy a Lipschitz condition of order a at c if there exists a positive number M (which may depend on c) and a I-ball B(c) such that
If(x} - /(c)1 < Mix - cl" whenever x E B(c), x "# c. a) Show that a function which satisfies a Lipschitz condition of order at c if It: > 0, and has a derivative at c if a ;. 1.
if (x, y)
if (x, y)
11(0, 0) 0(0. 0)
0, O.
It:
is continuous
(0, 0),
It is easily seen that DIu(O, 0) = D1v(O, O} I and that D1II(0, 0) = Dl!l{O,O) = ~1, that the Cauchy-Riemann equations hold at (0, 0). Nevertheless, the function I '" u + if) cannot have a derivative at z = O. In fact, for x = 0, the difference quotient becomes
80
b) Give an example of a function satisfying a Lipschitz condition of order I at c for which /,(c) does not exist. S.Z In eaCh of the following cases, determine the intervals in which the function I is increasing or decreasing and find the maxima and minima (if any) in the set where each I is defined. a) /(x) .. x3 + ax + b, x E R. b) [(x)
c) f(x)
ee
/(z)
- 1(0)
z-o
whereas for x ~ Y. it becomes
/(z) - 1(0)
-y
JjI
.+ iy = 1 + i,
I+I 2
log (x2
);213(X -
=
=
9), 1)4,
Ix1 >
0~
X
3.
::;;
l.
d)
lex)
xi
z-O
and hencel'(O) cannot exist.
x+ix
5.3 Find a
polynomial/of
/(Xl) ." a..
i2l>
[(Xl) = aJ,
/'(xl)
bl>
j'(X'l)
= b",
where In Chapter 12 we shall prove that the Cauchy-Riemann equations do suffice to establish existence of the derivative of J = u + if) at c if the partial derivatives of u and v are continuous in some neighborhood of e. To illustrate how this result is used in practice, we shall obtain the derivative of the exponential function, Let J(z) = e" = u + io. Then
x,
-#
x2
and
02'
5.4 Define
I as
folloW'l: f(x)
e-1/X'
if x "# 0,/(0)
O. Show that
a) f is continuous for all x, b} p.) is continuous for all x, and that /<0>(0) = 0, (n = 1,2, ... ).
g(x) = x sin (llx),
uix, y)
and hence
if" cos y,
v(x, y)
e" sin y,
O./(x)
sin (I/x),
a)
/'(x)
-1/x2
cos
b) g'(x)
sin (1Ix)
c) 1I'(x) == 2x sin (If:.:) - cos (1Ix), if x ¥ 0; Since these partial derivatives are continuous everywhere in il2 and satisfy the Cauchy-Riemann equations. the derivative f'(z) exists for all z, To compute it we lise Theorem 5.22 to obtain
5.6 Derive Leibnitz's formula for the nth derivative of the product II of two functions landg;
If·l(X) =
1:=0
f'(z)
Thus, the exponential
= if" cos y
ie" sin y
= fez).
(as in the real case). 57 Let
t (n) f
k
kl(X)u'"-1<J(x),
where
k! (II -
k)!
function
nd g be two functions defined and having finite third-order derivatives I-ex) x in R. If I(x)g(x) = 1 for all x, show that the relations in (a). (b), (c),
112
113
and (d) hold at those points where the denominators are not zero: a) I (x)lf(x)
b) I'(x)/f'(x)
Mean-Value 1'1IeoI'em 5.10 Given a function f defined and having a finite derivative in (a, b) and such that lim.. _I>_ I{x) + 00. Prove that limx-+b_ P(x) either fails to exist or is infinite.
+ g'(x)fg(x)
- 2/,(x)//(x}
o.
- g~{x)/g'(x) = 3r{x)
o.
c) /"'(x)
/'(x)
_ 31'(x)g"(x) /(x)g'(x)
d)
rex) _ ~ (r(X»)2
f'(x) 2 /'(x)
lex)
_ g"'_(x) = gT(X)
o.
s.n
Show that the formula in the Mean-Value Theorem can be written as follows:
(J
= x\
The expression which appears on the left side of (d) is called the Schwarzian ikrilltltWe of/at x.
NOTE.
lex)
log x ,
.• > O.
e) Show that/and
Keep x#.O
in each case.
g(x) = (a/(x)
+ b l!{cf(x) +
dl,
+
where ad - be "# O.
(be
Hint.
part (d).
If c "# 0, write
(€if + b)/(ef +
d) = (a/c)
ad)/[C(cf
5.11 Take lex) = 3x4 - 2x3 - x2 + 1 and g(x) = 4x3 - 3x2 - 2x in Theorem 5.20. Show that f'(x)/g'(x} is never equal to the quotient [(0) - I{O)]/[g(l) - u(O)] if o < x .s I. How do you reeoncije this with the equation
- /(a) _ f'(x,) g(b) -
5.8 Letfl'/2, 91' Uj;be four functions having derivatives in (0,0). Define Fby means of the determinant
F(x) =
yea) -
g'{Xl)'
It ..
1'?
11
! II(x)
lUI (z]
'/~(X)
91(X)
Il(X)!, Ul(X)!
if x
(a, b).
S.13 In each of the following special cases of Theorem 5.20, take and show that XI = (# + h)/2. a) I(x)
=
= I, c = a, x ,.. b,
sin x,
g(x) = cos x;
b) f(x)
e",
X,
li(X)j 4- ) /:(x)
gix)
Ui(X)
92(X)
i2(X)I.
Can you find a general class of such pairs of functions land U for which (0 + b)/2 and such that both examples (a) and (b) are in this class'!
will always be
b) State and prove a more general result for nth order determinants. 5.9 Given n functioIlllh ••..• /", each haviDg nth order derivatives in (a, b). A function W, called the Wronskian off, •...• I", is defined as follows: For each x in (0, b), W(••) is the value of the determinant of order n whose element in the kth row and mth column is r,:-o(x), where k ~ 1,2, .•.• IT and m "" I, 2, ...• n. [The expressionr.,°)(x) is written for/ .. (x).] a) Show that W'(x) can be obtained by replacing the last row of the determinant defining W(x) by the nih derivatives!t")(x), ,r.·}{x). b) Assuming
cdl(x) + x in (a, b).
5.14 Given a function I defined and having a finite derivative /' in the .half-open interval < x S I and such that IP(x) I < 1. Define Q. = f(lfn) for n = I, 2, 3, ...• and show that iim"_,,, o. exists. Hint. Cauchy condition.
5.15 Assume that/has a finite derivative at each point of the open interval (8, 0). Assume also that limr_cP(x) exists and is finite for some interior point c. Prove that the value of this limit must be I'(c). 5.16 Let /be continuous on (a, b) with a finite derivative P everywhere in (a, b). except possibly at c. If lirnx->.. /'(x) exists and has the value A. show that p(e) must also exist and have the value A. 5.17 if tion 5.18
the existence of n constants c" , c•• not all zero, such that ... + c../,,(x) ~ 0 for every x in (a, b}, show that W(..-} = 0 for each
Let I be continuous on [0, I], /(0) = 0, j'(x) finite for each x in (0, 1). Prove thai is an increasing function on (0, 1), then so too is the function g defined by the equag(x) = /(x)/x.
Assume f has a finite derivative in (a, h) aDd is continuous on [a, b] with f(a) =
= O. Prove that for every real 1 there is some c in (a, b) such that
NOTE.
A set of functions satisfying such a relation is said to be a linearly dependent set oo~~ . c) The vanishing of the Wronskian throughout (a, b) is necessary, but not sufficient, for linear dependence of It..... f". Show that in the case of two functions, if the Wronskian vanishes throughout (0, b) and if one of the functions does not vanish in (a, b), then they form a linearly dependent set in (a, b).
f(fJ)
/'(d
= If(e).
5.1' Assume/is continuous on (a, b] and has a finite second derivativ !' in the open interval (a, b). Assume that the line segment joining the points A = (a,/(a)) and B = (b,f(b» intersects the graph of lin a third point P different from A and B. Prove that!'(c) = 0 for some c in (a, b).
us
5.28 Prove the foUowing theorem: Let / and g be two functions having /inite nth derisatioes in (a, b). For some interior point e in (0, b), alsume that /(c) = /,(c) ••• = P~-I)(C) "" 0, and thar g(c) = g'(c) = ... = g(.-H(C) = 0, but that g(ltl(x) is neoer zero in (a, b). Show thar lim f(x} = r'(c) .
x-+~
= /(b) =
/,(b) = 0,
prove thatj'(c)
5.11 Assume I is nonnegative and has a finite third derivative I" in the open interval (0, I). IfI(x) = 0 for at least two values of x in (0, I), prove that /"(c) = 0 for some e in (0. I). S.l2 Assume /has a) If/(x) b) If /'(x)
1;:) If rex)
-+
g(x)
g"')(c)
NOTE.
1(·) and
g("}
as x
-+
+ 00,
+ 00. + 00,
I)!
as x
define G similarly, and apply Theorem 5.20 to tile functiOllll Fand G. 5.29 Show that the formula in Taylor's theorem can also be written as foUows: j(x) = ~~-(x ~ k!
K-l Flt.c)
5.l3 Let II be a fixed positive number. Show lhal there is no function / satisfying the foUowing three conditions: /,(x) exists for x ;;.:O./,(0) = 0, J'(x) ;;.:h for x > O. S.l4 If h > 0 and if rex) ellists (and is finite) for every x in (Il - h. a continuous on [0 - h;« + h), show that we have: a) f(a b) /(a
+ n). and
if / is
- cl
(x
c){x _ XI)"-l
(n _ I)!
rex!),
h) ~ I(a -
h)
= r(a +
h}
(lh)
+ r(a
_ fJh),
where Xl is interior to theintervaljojning x and c. Let 1 - () = (x XI)/(X - c). Show that 0 < 8 < I and deduce the following form of rhe remainder term (due to Caucny):
(1 (n -
h) -
2f~a)
+ f(a
= f'(a
(I
8)c).
= get)
+ It)
2j(a)
+ I(a -
Vector-valued
h).
fimctions
.~O
hZ
d) Give an example where the limit of the quotient in (c) exists but where /"(0) does not exist. S.l! Let / have a finite derivative in (a. b) and assume that c eo(a, b). Consider the following condition: For every e > 0 there exists a l-ball 1I(c; 0), whose radius" depends only on t and not on c, such that if x e B(c; §), and x ¢ c, then
j(x)
r'(c)
= lim! [f(c
~.... h o
h) -
f(c)].
Conversely. if this limit exists, prove that f is differentiable at c. 5.31 A vector-valued function f is differentiable at each point of (0, b) and has constant norm I~I!. rove that f(t)· I'(t) = 0 on (0, b). P 532 A vector-valued function f is never zero and has a derivative f" which exists and is continuous on R. If there is a. real function A such that f'(t) "" l(t)(/) for all t, prove that there is it. positive real function u and a. constant vector c such that f(t) = u(t)c for all t. Partial deri1'lltives 5.33 Consider the function / defined on HZ by the following formulas:
x-c
- /(c) _
{(C)i <
1:.
Show that/'
lex)
5.26 Assume f has a finite derivative in (a, b) and is continuous on [a, b], with a ::;; :$ b for aU x in [a, bland If'(x) I s ex < 1 for all x in (a, b). Prove that I has it unique fixed point in [a, b].
9 having finite derivatives in (0,1),
tt». y)
xy
X Z
"-1;1 g(x)
but such that limx_o!'(x)/g'(x)
limf(x)
0,
if (x,
y)
¢ (0,0)
/(0,0)
o.
Prove that the partial derivatives Dt/(x. y) and Dd(x., y) exist for every (x, y) in R2 and evaluate these derivatives explicitly in terms of x and y, Also, show that f is not continuous at (0,0).
~-r
CHAPTER
+ J'
if (x
Y) ¢ (0, 0),
1(0, 0) = O.
Compute the first- and second-order partial derivatives off at the origin, when they exist. C4Jmp1n:-valued fllllCtiCDl 5.35 Let S be an open set in C and let S· be the set of complex conjugates z, where Z E S. If/is defined on S, defineg on S· as follows: g(Z) = 1{Zj. the complex conjugate of /(z). If / is differentiable at c prove that 9 is differentiable at c and that g'(c) = !'(c).
5..36
6.1 INTRODUcnON Some of the basic properties of monotonic functions were derived in Chapter 4. This brief chapter discusses functions of bounded variation, a class of functions closely related to monotonic functions. We shall find that these functions are intimately connected with curves having finite arc length (rectifiable curves). They also playa role in the theory of Riemann-Stieltjes integration which is developed in the next chapter. 6.2 PROPERTIES OF MONOTONIC FUNCfIONS Theorem 6.1. Let f be an increasing function defined be II + I points such that
011
i) In each of the following eu.mples write / for u(x. y) and v(x, y); a) j{z) = sin z, c) /(z) = [z],
e) /(z) = arg z (z"# 0). g) /(z) = b) fez)
u
=
+ Iv and
cos z,
i, Log z
il
d) fez)
f) fez)
=
=
(z"# 0),
(These
h) fez) =.t (<< complex, z '" 0). functions are to be defined as indicated in Chapler L) ii) Show that the Caucby-Riemann equations for tile following values of z : AU z in (a). (b), (g); no z in (c), (d), (e); all z except real z :;;: in (f), (h). 0 (In part (b), the CallChy-R.icmann equations hold for all z if 1% is a nonnegative integer, and they hold for all z "# 0 if II is a negative integer.) the derivative/,(z) in (a), (b), (f), (g), (h), assuming
it exists.
e"',
u and v satisfy
x"
a=
Then we have the inequality
n-I k~l
Xo
<
XI
<
X2
< '"
< x"
b.
u + ivand assume that/has a derivative at each point of an open disk: D centered at (0, 0). If IlUZ + bv2 is Constant on D for some real a and b, not both 0, prove that / is constant on D.
L: [!(x +)
Q
- f(xc)]
s f(b)
f(a).
SUGGESTED
REFERENCES
FOR FURTHER
STUDY
5.1 Apostol, T. M., ell/elilus, Vol. 1, 2nd ed, Xerox, Waltham, J967.
Proof, AS5umethatYl€(xltXul)' Fori ~ksn l,wehavef(x1+)sf(yJ and J(Yt-)) s f(x_-). so that f(xf. +) - f(xlr.-) :-:;; - f(Ytf(Yl) If we add these inequalities, the sum on the right telescopes to ftV,,-l) - f(yo). Since f(Y"-I) - f(yo) :s; feb) - f(a), this completes the proof.
a·
The difference f(x. +) f(xk-) is. of course, the jump of fat Xt.. The foregoing theorem tells us that for every finite collection of points xt in (a, b), the sum of the jumps at these points is always bounded by f(b) - f(a). This result can be used to prove the following theorem. Theorem 6.2. Iff countable. is monotonic on [a, b]. then the set of discontinuities off is
Proof Assume thatfis increasing and Jet S .. be the set of points in (11. b) at which the jump off exceeds l [m, m > O. If x I < Xl < ... < x_ -I are in S.., Theorem 6.1 tells us that
-._-
feb) - f(a).
127
128
Dd'.U
TOOIIVariatJon
129
This means tbat Sill must be a finite set, But the set of discontinuities of/jn (0, b) is a subset of the union U:=l S.. and hence is countable. (Iffis decreasing, the argument can be applied to -J.)
tj.J FUNCTIONS
Proof
Assume that x e (a, b). Using the special partition ii(x) - f(a)l
S;
If(b) - l(x)1
S;
;$
M.
holds
OF BOUNDIID VARIATION
M, I/(x)1
1/(0)\
Ihfotititm satisfying
{Xlh
<
X~ =
b,
I. It is easy to construct a continuous function which is not of bounded variation. For example,letf(x) = x cos {></(2x)}ifx # 0.1(0) = O. Then f'is continuous on [O.l}. but if we consider the partition into 2n subintervals P
=
is called a partition of [a. b]. The iktelT!Dl [Xl-I> Xl] is called the kth sublntel'VQI of P and we write Ax. = x. - Xl_I' so that l:l~l hl = b - Q. The collection of all pvssible partitiOllS of [Q, b] will he denoted by 9'[0, b].
Defod;tio1l6.1. Let f be defined on [a, b]. If P = {xo, XI' •••• x~} ts a partition of [0, b], write Ali = f(x ..) I(Xi-I), for k = 1.2, ... , n. If there exists a
2~
+ --
+ ... + - + 22
lIt
=1
+ - + ... + - .
2 n
for all partitions of [a, b], then f is said to be of bounded ,'arialioll on [Q. b].
This is not bounded for al1 n, since the series l.::~l(lIn) diverges. In this example the derivative]" exists in (0. I) but/' is not bounded ontO, 1), However,f'is bounded on any compact interval not containing tbe origin and hence [will be of bounded variation on such an interval. 2. An example similar to the fint is given by f(x) x" cos O/z) if x ". 0, [(0) O. This [is of bounded variation on [0, I}. since I' is bounded on [0, 1]. In fact, ['(0) = 0 and, for x ". O,f'(x) sin (I/z) + 2x cos (Ilx), so that If'(x) 1 :::;; 3 for aU x in (O, 1]. 3. Boundcdness of l' is not necessary for [to be of bounded variation. For example, let f(x) = xll3. This function is monotonic (md hence of bounded variation} on every finite interval. However. [,(x) ...... 00 as x _,. O. +
ss.
W
Iff
Aft ;;,._ 0
- !(Xi-I)]
= !(b)
I(a).
is
s A for
continuous on [a. b] and iff' exists and is bounded in the all x in (a, b). thenf is of bounded variation on [a. Theorem, we have
bJ.
IAJ,.I
on [a, b], and felL (P)denote the sum P = {Xo, Xl' •• ,. x.} 0/ [€I. b]. The
AI". = f(xt)
This implies
/(xt-1)
= f'(tt)(x"
- Xi:-l),
number
VJa,b)
= sup
{E(P):
Pe9"[a,b]},
Theorem 6.7. Iff is 0/ hounded variation on [a, b], say titions 0/[0, b], then f is bounded on [a. b]. In/act, If(x)1
L IAJ,.I
Vr(a, b).
;$
s 1/(a)1 +
for all
in [a, b].
Sincefis of bounded. variation on [0, b], the number VI is finite. Also, VI ;;,._ , 0 since each sum L (P) ;;,._ Moreover, V,(o, b) = 0 if, and only if, I is constant O. on [a, b].
Th.6.9
n. 6.JZ
Toad VariatiGrl
ClII.
I.. xl as _ Flmction
of x
131
$0
Tlreoum 6.9. Assume thatf and 9 are each 0/ bounded variation on [0, bJ. are their sum, difference, and product. Also, we have tJnd
Theil
This shows that each sum 1: (PI) and 1: (Pl.) is bounded by V,(o, b) and this means that/is of bounded variation on [a. c] and on [c, b]. From (I) we also ob.aia the inequality because of Theorem 1.1 S. To obtain the reverse inequality, let P = {xo, XU"'' x,,} e @[a, b] and let Po = P u {e} be the (possibly new) partition obtained by adjoining the point c. If c e [Xi _ .. X,,]. then we have
I/(Xl) - f(Xt-I)1 :-:;;If(x') - f(c)1
where A Proof
sup {lg(x)1 : x
[0, b]}.
sup fI/(x)
I:x
[0, b]}.
FOT
IM.I
= =
l/(xJg(X,l:)
f(Xk-l)g(X._I)~
I[/(x.)g(x.)
- /(xt-1)g(Xl)]
- /(xl-1)g(xt_I)]1
;S;
If(c) - f(xa.-I)!,
[/(Xk-l)g(XI)
AIA/kl +
BIA9ll.
This implies that his of bounded variation and that VII ;S; AVj for the sum and difference are simpler and will be omitted.
NOTE.
+ BY,..
The proofs
and hence L (P) ;S; L (Po). Now the points of Po in [a, c] determine a partition PI of [a. c] and those in [e, b] determine a partition Pz of [c, b]. The corresponding sums for all these partitions are connected by the relation
Quotients were net included in the foregoing theorem because the reciprocal of a function of bounded variation need not be of hounded variation. For example, if/(x) -+ 0 as x -+ Xo. then If/will not be bounded on any interval containing XI) and (by Theorem 6.7) I/f cannot be of bounded variation on such an interval. To extend Theorem 6.9 to quotients, it suffices to exclude functions whose values become arbitrarily close to zero.
Theorun 6JO. Let/be 0/ bounded variation on [a, h] and assume that f is bounded away from zero; that is, suppose that there exists a posit;ve number m such that < m :-:;; /(x)!for all x in [a, b]. Then g = II!is also of bounded variation on I [D, h], tJnd V, ;S; VJiml. PrQOf.
L: (PI) + L
(Pz)
:-:;; Via. c)
+ VAc, b).
1:(Pl. Since this
Therefore, VAU. c) + V,,<c,b) is an upper bound for every sum cannot be smaller than the least upper bound. we must have
Via, b) .:::;; V,,(o, c)
V,,(c, b),
Now we keep the function f and the left endpoint of the interval fixed and study the total variation as a function of the right endpoint. The additive property implies important consequences for this function. TAet1rem 6.12. LeU he 0/ bounded variatitin on [a, h1 Lei V be defined on [a, h] asfollows: V(x) = Vjo, x) if a < x :::;; V(a) = O. Then: b,
In the last two theorems the interval [a, b] was kept fixed and Vf(a, b) was considered as a function off If we keep f fixed and study the total variation as a function of the interval [a, b], we can prove. the following additive property.
Theorem 6.11. Let/be of bounded variation on [a, b]. and asStime that c Then f is 0/ bounded rariation on (a, c] and on [c, b] and we have V
EO
(a, b).
Jo, b)
= Via, c)
D(y) - D(x)
- f(x)).
Vic, b).
Proof. We first prove that/is of bounded variation on [a, c] and on [c, b]. Let PI be a partition or [a, c] and let be a partition or [e, b]. Then Po = PI U is a partition of [a, bJ. If 1: (P) denotes the sum L IAli:! corresponding to the
r,
r,
····1 :.j.
fey) - f(x) :::;Vf(x, y). This means that DCy) - D(x) ~ 0, and (ii) holds.
NOTE. For some functions j, the total variation Via. x) can be expressed as an integral. (See Exercise 7.20.)
partition
(1)
111 6.7 FUNCTIONS OF BOUNDED VARIATION EXPRESSED DIFFERENCE OF INCREASING FUNcrIONS The following simple and elegant characterization tion is a consequence of Theorem 6.12. of functions AS THE of bounded
TIt. 6.13
Tb. 4i.l5
Adding more points to P can only increase the sum that 0 < Xl - Xu < O. This means that variaand the foregoing
l: IAftl and
IAft!
inequality
1'IIeorem 6.13. Let f be defined on [a, b]. Then f is afbounded variation on [a, b] if. and only if,f can be expressed as the difference of two increasing /unctions. Proof. Iff is of bounded variation on [a. b]. we can write f = V - D, where Vis the function of Theorem 6.12 and D = V - [. Both V and D are increasing functions on [a, bj. The converse follows at once from Theorems 6.5 and 6.9. The representation of a function of bounded variation as a difference of two increasing functions is by no means unique. Iff = fl - f2' where f. and 12 are increasing, we also have f = (i. + g) + g), where 9 is an arbitrary increasing function, and we get a new representation off If 9 is strictly increasing. thesame will be true off1 + 9 and/z + g. Therefore, Theorem 6.13 also holds if "increasing" is replaced by "strictly increasing."
now becomes
VIc, b) - ! < ~ +
2 2
since {Xl' But
X2"" ,
l~2
b),
have
.t,,} is a partition
of [Xl'
o s Vex,)
Vee) =
V,(a,
Xl) -
V~a. c)
u;,
0<
Xl -
C<
implies
FUNCTIONS
OF BOUNDED
VARlt\T10N
of bounded variation on [a, b]. If x E (a. b]. let V(x) = O. Then every point of continuity off is also a point of continuity of V. The converse is also true.
I be
This proves that V(c+) = VCr). A similar argument yields V(c-) = V(c). The theorem is therefore proved for all interior points of [a, b]. (Trivial modifications are needed for the endpoints.) Combining Theorem
Proof. Since V is monotonic, the right- and lefthand limits V(x+) and V(x-) exist for each point x in (a, b). Because of Theorem 6.13, the same is true of
TIIeorem 6.15. Let f be continuous on [a, h]. Then f is 01 bounded variation on [a, b] if, and only if,I can be expressed as the dif{l'rence ()f 111'0 increasing continuous
functions.
NOTE.
f(x+)
If
andf(x-).
Q
<
< Y s b, then os
of VAx, y)]
The theorem
is replaced by "strictly
increasing."
V(y) -
Vex).
Letting y ~ x, we find
Of course, discontinuities (if any) of a function of bounded variation must be jump discontinuities because of Theorem 6.13. Moreover, Theorem 6.2 tells us that they form a countable set. 6.9 CURVES AND PATHS
e, x.
such that
b,
[a, b] -+ RIO be a vector-valued function, continuous on a compact interval in R. As t runs through [D, b], the function values f(t) trace out a set of points in R" called the graph of f or the curve described by f. A curve is a compact and connected subset of R· since it is the continuous image of a compact interval. The function (itself is called a path. It is often helpful to imagine a curve as being traced out by a moving particle. The interval [a, b] is thought of as a time interval and the vector I(t) specifies the position of the particle at time t. In this interpretation, the function f itself is called a motion.
Let f:
[0, b)
Dd'.6.16
Th. 6.t8
135
Different paths can trace out the same curve, valued functions
For example.
defined by the equation Al...a, b) = sup {AI...P): P 1/ the set of numbers A/,.P) is unbounded,
It is an easy matter to characterize
E
each trace out the unit circle xl + I, but the points are visited in opposite directions. The same circle is traced out five times by the function h(t) = e10.",
o !:> t !:>
!P[a, b]}.
r is ClJllednonrectifiable.
O!:>t~1.
6.10 RECTIFIABLE PATIIS AND .u,C LENGTH
Theorem 6.17. Consider a path f: [a, b] -+ RIt with comp(ments , (flO'" .t~). Then f is rectifiable if, and only if. each component fa is of bounded variation 011 [a, b]. If r is rectifiable, we have the inequaiities
V.(a, b) :s; Al...a, b) ~ V1(a, b) where Vl(a, b) denotes the
10101
Next we introduce the concept of arc ICIl8th of a curve. The idea is to approximate the curve by inscribed polygons. a technique learned from ancient geometers. Our intuition tells us that the length of any inscribed polygon should not exceed that of the eurve (since a straight line is the shortest path between two points). so the length of a curve should be an upper bound to the lengths of all inscribed polygons. Therefore, it seems natural to define the length of a curve to be the least upper bound of the lengths of all possible inscribed polygons. For most curves that arise in practice, this gives a useful definition of arc length. However, as we will see presently, there are curves for which there is no upper bound to the lengths of tbe inscribed polygons. Therefore, it becomes necessary to classify curves into two categories: those which bave a length, and those which do not. The former are called rectifiable, the latter lWnrectifiable. We now tum to a formal description ofthesc ideas. Let f : [a. b] -+ R" be a path in R". For any partition of [0, b] given by
+ ... +
v,,(a, b),
(k
1,2•...• n).
(2)
variation 0/ ft on [a. b l
of [a, b] we have
Proof
J~I
i:
INti) - k(ti-1)!
AI...P) :;:;
I-I
i: t Ih{tt) - /j.,tj-1)1,
J~l
(3)
of the theorem
Examples 1. As noted earlier. the function given by f(x) x cos {n/(2x)} for x "" o. f{O) = 0, is oontinuous but not of bounded variation on [0, I]. Therefore its graph is a nonrectifiable curve. :t. It can be shown (Exercise 7.21) that if I' is continuous on [tl, b J. then f is rectifiable and its arc length can be expressed as an integral. AI-a, b)""
{to, t1,
••••
t..},
1"'(t)11 dt.
OF ARC LENGTH
the points '(to), C(t1)•••• , r(tm) are the vertices of an inscribed polygon. (An example is shown in Fig. 6.1.) The length of this polygon is denoted by MP) and is defined to be the sum A,(P) =
l~l
I:IIf(tt)
..
6.11 ADDITIVE AND CONTINUITY. PROPERTIES Let r = (fl' ... J.. is of bounded
- r(lt-l)lI·
DejWtio" 6.16. If th« set o/numbers Atf..P)is bounded/or all partitions P of [a, b],
then the path f is sajd to be rectifiable and its arc length, denoted by
,/.) be a rectifiable path defined on [a. bJ. Then each component variation on every subinterval [x. y] of [a, b]. In this section we keep f fixed and study the are length A,(x, y) as: a function of the. interval [x, y]. First we prove an additive property.
Ma, b). ;3
If c e (0, b) we have
Ma, b) =
[a, c]
A,(a, c)
+ At._c, b).
PI of
Adjoining the point c to a partition P of [a, b], we get a partition and a partition P 1. of [c, b] such that A/.P) ~ A/.P1)
+ A/.P2)
A1(a, c)
To obtain
+ At,(c.
b). let
+ A,.(c. b).
P
=
1b. 6.19
137
is a
partition
At<.P1)
+ AIPz) =
AlP)
s; A,(a, b).
is the sum A,{a. c) If x
E
Since the supremum of all sums A/..Pj) (see Theorem 1.15). the theorem follows.
+ A,{P2)
+ At(c. b)
(a, b], let
the compact set [a, b], Theorem 4.29 tells us that (-I exists and is continuous on its graph. Define U(/) = f-1[g(l)] if t e [e, d]. Then u is continuous on [c. d] and get) = f[u(t)]. The reader can verify that u is strictly monotonic, and hence f and g are equivalent paths. EXERCISES
FlIIICtious of boooded variation
TMorem 6.19. Consider a rectifiable path f defined on [a, b]. s(x) = Alp, x) and let .I'(a) = O. Then we have:
i) The function s $0 defined is increasing and continuous on [a, b]. Ii] If there is no subinteroal of [a, b] on which f is constant, then s is strictly increasing on [0, b].
6.1 Determine which of the following functions are of bounded variation on [0, 1].
Proof Has; x < y S; b, Theorem 6.1 g implies .~(y) - sex) = Al..x, y) ~ O. This proves that s is increasing on [a, b]. Furthermore, we have s(y) - s(x) > 0 unless AJ..x. y) = O. But, by inequality (2), A/..x, .v) = 0 implies V.(x, y) = 0 for each k and this, in turn, implies that f is constant on [x, yJ. Hence (ii] holds. To prove that s is continuous, we use inequality (2) again to write
if x l' 0,/(0) = O. if x #- 0./(0) = O. 6.2 A function /, defined on [a, b], is said to satisfy a uniform Lipschitz condition of order « :> 0 on [a, b 1 if there eMsts a constant M :> 0 such that I/(x) - f(Y)l < Mix - YI'"for all x and JI in [a, b 1. (Compare with Exercise 5.1.)
a) /(x)
x2 sin (llx)
b) I(x) =
a) If/is
IX "'" 1 impliesfis
sllch a function, show that Of :> 1 implies/isconstant of bounded variation [u, b].
o :S s(y)
- sex)
= Ab.
y) :::;
1~1
:E Vt(x, y).
Similarly.
It
b) Give an example of a function/satisfying II unlfonn Lipschitz condition of order IX < 1 on [a. b] such that f is not of bounded variation on [a, b]. c) Give an example of a function / which is of bounded variation on [a, b J but which satisfies no uniform Lipschitz condition on [a, b]. 6.3 Show that a polynomial / is of bounded variation on every compact interval [a, b J Describe a method for finding the total variation off on [0, b] if the zeros of the derivative f' are known. 6.4 A nonempty set S of real-valued functions defined on an interval [a, b] is called a linear space of functions ifit has the following two properties: a)
Hwe let y __". we find each term y;.(x, y) _. 0 and hence s(x) = s(x+). x, s(x) = s(x-) and the proof is complete. 6.12 EQUIVALENCE OF PATHS. CHANGE OF PARAMETER
This section describes a class of paths. having the same graph. Let f. [a, b] .... R" be a path in R". Let u: [c, d] .... [a, b] be a real-valued function, continuous and strictly monotonic on [t, d] with range [0, b]. Then the composite function g = fOil given by g(t) = f(Il(I») for
C ;$,
Iff E ~.
then cf
s:
d,
b) !ffE S andge
5, then/+
s « S.
is a path having the same graph as f. Two paths f and g so related are called equivalent. They are said to provide different parametric representations of the same curve. The function u is said to define a change of parameter. Let C denote the common graph of two equivalent paths f and g. If u is strictly increasing, we say that f and g trace out C in the same direction. If u is strictly decreasing, we say that rand g trace out C in opposite directions. In the first case, u is said to be orientation-preserving; in the second case, orientationreversing. Tlleorf!m 6.20, LeI f : [a, b] ......RIO and g : [c, d] ..... RK be tWQ paths in R", each of which is one-tv-one on its domain. Then f and g are equivalent ih and only if, they have the same graph.
Theorem 6.9 shows loot the set Y of a.I1 functions of bounded variation on [a, b) is a linear space. If S is any linear space which contains all monotonic functions on [a, b}, prove that V 0;; S. Thill can be described by saying that the functions of bounded variation form the smallest linear space containing aU monotonic functions. . 6.S Let / be a real-valued function defined on [0, 11 such that /(0) > 0, f(x) if;. x for all x, andf(x) ~ fey) whenever x ~ y. Let A = {x :/(x) > z}, Prove that sup A e A and that/<l) > 1. 6.6 If / is defined everywhere in Rt, then / is said to be of bounded variation on (- 00, + 0',) if/is of bounded variation on every finite interval and ifthere exists a positive number M such that Y _,(a, b) < M for all compact intervals {a, b]. The total variation of / on (- 0',), + 0',) is then defined to be the sup of aU numbers Y_,(u, b), - 00 < u < b < + 00, and is denoted by V_,(- ro, + 00). Similar definitions apply to half-open infinite intervals [a. + 00) and ( - co, b]. a) Stale and prove theorems for the infinite interval (- 00, + 00) analogous to Theorems 6.7, 6.9. 6.10, 6.11. and 6.12.
Proof
Equivalent paths necessarily have the same grapb. To prove the converse, assume that f and g have the same graph. Since f is one-to-one and continuous on
13.
b) Show that Theomn 6.S is true for (- 00, + (0) if "monotonic" is replaced by "bounded and monotonic." State and prove a similar modification of Theorem 6.13. 6.7 Assume that f is of bounded variation on [a, b) and let p ~ {xo. x" ... , x,,} e (11 [a, b]. As usual, write Aft. The numbers a) Show that h describes a rectifiable curve c) Show that the set of points S
139
r.
{(x,y):a
S x:s
b, f(x)::;
CI:lIVe
y:s:
g(x)}
= [(xJ
- f(xt-1),
1,2 •.•••
n. Define
r.
0] as roJlows: if a S t S b,
A(P) = {k!
< O}.
ti [g(t)
P E (11[0, b] }
H(t)
=t
+ ti [g(2b
if b
sts
2b ":'
IJ.
sup {
hB(P)
L IAlil: P
l}
ro which
.s
g(x)
2y :s g(x)
are called, respectively, the positive and negative variations off on [a, b]. For each x in (a, b]. let V{x) ee V,,(a. x), p(x) = pr<a. x), n(x} n.t<a. z), and let V(a) = p(a) = ma) = O. Show that we have: a) V(x)
b) 0 pCx)
e) Show that So has the x-axis as a line of symmetry. Syll'Utll!'l'ization of S with respect to the x-axis.}
f) Show that the length of Absolutely
ru does. not
r.
+ 1I(x).
Vex) and 0 S n(x) S V(x).
s: p(x) s
eootinuous funetions
+ p(x)
A real-valued function [defined on {a, b J is said to be absolutely continuous on [a, b] if for every I! > 0 there is a B > 0 such that
". V(x} - /(x)
e) 2P{x) = V(x)
+ lex) -
+ [(a).
Ic~l
L I/(b
R
k)
f(a~)j
<
Ii
g be
f(t)
[0, 1],
g{t) =
t'2~lt
if t
for every n disjOint open subintervals (Uk' bk) of [a, b], n = 1,2, ... , the sum of whose lengths 1l~1 (b,. - ail is Jess than ,j. AbsoJutely continuous functions occur in the Lebesgue theory of integration and differentiation. The following exercises give some of their elementary properties. 6.11 Prove that every absolutely continuous bounded variation on [a, b 1. function on [a, b 1 is continuous and of
[0,2].
a) Prove that f and g have the same graph but are not equivalent according to the definition in Section 6.12. ,b) Prove that the length of 9 is twice that of /. 6.9 Let f be a rectifiable path of Iength L defined on [0, b]. and assume that f is not constant on any subinterval of [a, b]. Let $ denote the arc-length function given by s(x} "" At(a. x) if a < x ::; b, s(a} O. a) Prove that
[J
NOTE. There exist functions which are coatlnuous and of bounded variation but not absolutely continuous.
6.12 Prove that f is absolutely continuous if it satisfies a uniform Lipschitz condition of order 1 on [a, b}. (See Exercise 6,2.) 6.13 Iff and g are absolutely continuous OD [a. b J. prove tba.t each of the following is also: III. cf(cconstant),f + e.I: g; a1so/tg if gill bounded away from zero.
b) Define g(t) = f[s-I(I)] if I E [0, L] and show that g is equivalent to f. Since f(/) = g[s(t)], the function g is said to provide a representation of the graph of f with arc length as parameter. 6.10 Let f and 9 be two real-valued continuous functions of bounded variation defined on [a, b 1. with 0 < f(x} < g(x) for each x in (0, b), f(a) = g(02),[(b) = n(b). Let h be the complex-valued function defined on the interval [a, 2b - oj as follows:
h(t} = t
SUGGESTED
REFERENCES
FOR. FURlHER
STUDY
6.1 Apostol, T. M., Ca/culll1J, Vol I, 2nd ed. Xerox, Waltham, 1967. 6.2 Natanson, 1. P., 17u:ory of Function» 0/ Q Real Boron, translator. Ungar, New York. 1961.
Variable,
if(t}, t
if 0 - t),
:S
_$
b,
h(t) = 2b -
+ ig(2b
ifb:St::;2b-02.
Del. 7.1
141
CHAPTER 7
THE RIEMANN-STIELTJES
lNlEGRAL
become special cases of this more general process. Problems in physics which involve mass distributions that are partly discrete and partly continuous can also be treated by using Stieltjes integrals. In the mathematical theory of probability this integral is a very useful tool that makes possible the simultaneous treatment of continuous and discrete random variables. In Chapter 10 we discuss another generalization of the Riemann integral known as the Lebesgue integral.
7.1. NOTATION
7.1 JNTRODUcnON
Calculus deals principally with two geometric problems: "finding the tangent line to a curve, and finding the area of a region under a curve. The first is studied by a limit process known as differt!1l.li4tkm; the second by another limit processintegration--to which we tum now. The reader will recall from elementary calculus that to find the area of the region under the graph of a positive function f defined on [a, b], we subdivide the interval [a, b] into a finite number of subintervals, say n, the kth subinterval having length LU.., and we consider sums of the fonn :E:;1 /(tkJ Ax... where I., is some point in the kth subinterval. Such a sum is an approximation to the area by means of rectangles. If / is sufficiently well behaved in [a. b}-continuous, for example-then there is some hope that these sums will tend to a limit as we let n -. 00, making the successive subdivisions finer and finer. This, roughly speaking, is what is involved in Riemann's definition of the definite integral J:f(x) dx. (A precise definition is given below.) The two concepts, derivative and integral, arise in entirely different ways and it is a remarkable fact indeed that the two are intimately connected. Hwe consider the definite integral of a continuous function f as a function of its upper limit, say we write F(x)
For brevity we make certain stipulations concerning notation and terminology to be used in this chapter. We shall be working with a compact interval (0, b] and. unless otherwise stated, all functions denoted by J, g, a, P. ete., will be assumed to be real-valued functions defined and bounded on [a, b]. Complex-valued functions are dealt with in Section 7.27, and extensions to unbounded functions and infinite intervals will be discussed in Chapter 10. As in Chapter 6, a partition P of [0, b] is a finite set of points, say
P such that a = Xo < Xl < X._I < X~= b. A partition P' of [a, b] is said to be finer than P (or a refinement of P) if P s;;; P', which we also write P' ;2 P. The symbol A.~ denotes the difference A.a. = ll(xJ - a(xl-1). so that
<.. ...
= lxo. Xl>
•••
x,,}.
1~1
""7 1%(0).
The set of all possible partitions of [a, b] is denoted by £'[0, b]. The norm of a partition P is the length of the largest subinterval of P and is denoted by IIPII. Note that
P';2 P
J." /(I) e.
implies
IIP'II ~ IIPII.
then F has a derivative and F'(x) = f(x). This important result shows that differentiation and integration are, in a sense. inverse operations. In this chapter we study the process of integration in some detail, Actually we consider a more general concept than that of Riemann: this is the RiemannStieltjes integral, which involves two functions f and «. The symbol for such an integral is J:f(x) da:(x), or something similar, and the usual Riemann integral occurs as the special case in which C!(x)= x. When a has a continuous derivative, the definition is such that the Stieltjes integral J:f(x) d«(x) becomes the Riemann integral J!f(x) a'(x) fix. However, the Stieltjes integral still makes sense when 1% is not differentiable or even when 1% is discontinuous. In ract, it is in dealing with discontinuous exthat tbe importance of the Stieltjes integral becomes apparent. By a suitable choice of a discontinuous a, any finite or infinite sum can be expressed as a Stieltjes integral, and summation and ordinary Riemann integration then 140
That is, refinement of a partition decreases its norm, but the converse does not necessarily hold.
7.3 THE DEfINITION OF THE RIEMANN-STIELTJES INTEGRAL
[6,
DejinititJlJ 7.1. Let P = {Xo. Xl' ... ,x ..} be a partition of point in the subinterval [Xk- t s Xl]' A sum of the form S(P.j. a) =
b] and let
11 be
l~ I
" :E j(IJ
AOCl
is called a Riemann-Stieftjfs sum off witll respect to ce. We say f is Riemannintegrable with respect to ex on [a, b], and we write "fE R(a) on [a, b]" if there exists a number A hailing the following property: For every I: > 0, there exists a partition p. of [0. b] such that for every partitwn P finer than r, and for every choice of the points tt in [Xt-l, xJ. we have IS(P.J, ex) - AJ < s,
141
n.7.1
Def,'U
141
When such a number A exists, it is uniquely determined and is denoted by r~f da.or by j!f(x) d«(x). We also say that the Riemann-Stiehjes integral da. existtJ. The functions f and IX are referred to as the integrand and the integmtor. respectively. In the special case when a(x) = x, we write S(P,/) instead of S(P,f, IX}, and Ie R instead ofle R(IX). The integral is then called a Riemann integral and is denoted by dx or by J:f(x) dx. The numerical value of J:f(x) da.(x) depends only onJ.«, ii, and b, and does Dot depend on the symbol x. The letter x is a "dummy variable" and may be replaced by any other convenient symbol.
$!I
TlluuIII7.4. Assume that e e (a, b). If two of the three integrals in (I) exist, then the third also exists and we have (I) Proof. If P is a partition
P' = P of
Ii
J!f
[a, h] [D, c]
and
P"
f"l
[c, b],
The Ric-
This is one of several accepted definitions of the Riemann-Stiel1jes An alternative (but not equivalent) definition is stated in Exercise 7.3.
NOTE.
integral.
denote the corresponding partitions of [a, c] and [c, b]. respectively. mann-Stieltjes sums for these partitions are connected by the equation
S(p./. IX)
S(P',f, IX)
+ S(P",f.
£
a).
7," LINEAR PROPER'I1ES It is an easy matter to prove that the integral operates in a linear fashion on both the integrand and the integrator. This is the context of the next two theorems. 17tetwt!1fI7.2. If Ie R(a.) and if 9 e R(a) 011 [0, b]. then [a, b (for any two COlUUlllttJ Cl and C2) and we have
(cd + e2g)
da.
Cl
f
II
f dCl +
Cz
cd + C29 e R(<<) on
9 da..
J~fch exist.
f
Then, given
dal
<~
whenever P'
is finer
than
p;.
J-
<
wbenever
r: is finer than
P~.
Proof. Let h
= cd + clg.
l-l
Given a partition
All}
so, h. IX)
h(t.t)
c1
tKl
E f(tJ
!X).
+ Cz
l=J
g(t,J
06.1%1;
Then P; = P; u P: is a partition of [a, b] such tbat P finer than p. implies P' ;i2 P; and P" ;;2 P~. Hence, if P is finer than p., we can combine the foregoing results to obtain the inequality
= cIS(P,J, a)
+ c2S(P, y,
IS(p,f, IX) -
f dlX -
s:
Given E; > O. choose P; so that P ~ P; implies IS(P, J, a) - J:f dill choose P: so that P 2 P: implies IS(P, g, a) g drxl < s, If P, = P~ v.p;, then, for P finer than P., we have
S(P, h, If) - c1
J:
< s, we
and take
I da. -
'2
This proves that I da. exists and equals f dCl + fda.. The reader can easily verify that a similar argument proves the theorem in the remaining cases. Using mathematical induction. we can prove a similar result for a decomposition of [a, b] into a finite number of subintervals.
J:
J;
J!
R(a) andfe
C2)
+ cd!)
f~fda
C2fI) on [0, b]
NOTE.
The preceding type of argument cannot be used to prove that the integral exists whenever J:l da. exists. The conclusion is correct, however. For integrators a of bounded variation, this fact will later be proved in Theorem 7.25.
Ida
c1
fdlJ.
J:I d« = - J!f
da: whenever
$:1da.
exists.
The proof is similar to that of Theorem 7.2 and is left as an exercise. A result somewhat analogous to the previous two theorems tells us that the integral is also additive with respect to the interval of integration.
s:
f d«
1'"fda
I d«
= O.
Th.7.6
Th.7.7
Reduction to a Ri_
lDtegraJ
145
b = g(d).
A remarkable connection exists between the integrand and the integrator in a Riemann-Stieltjes integral. The existence of f drr. implies the existence of a df, and the converse is also true. Moreover, a very simple relation holds between the two integrals.
J!
S:
= [[g(x)],
{J(x) = a[g(x)],
if
That is,
XES.
Then h
S!f
da.
=
c
S~h dp.
!(x) d«(x)
f.
fI<d)
J(t) dlX(t)
fd
f[g(x)]
d{fX[g(X)]}.
g{,,)
a(%) dl(x)
NOTE.
This equation. which provides a kind of reciprocity known as the formula for integration by parts.
Proof.
such that
Let IS> 0 be given. Since J:fdlX exists, there is a partition for every P' finer than P .. we have
Proof For definiteness, assume that 9 is strictly increasing on S. (This implies , c < d.) Then 9 is one-to-one and has a strictly increasing, continuous inverse g-l defined on [a. b]. Therefore, for every partition P = {Yo, ... ,Ya} of [c, d], there corresponds one and only one partition P' = {xo, ... , x,,} of [a, b] with xA = g(y.). In fact. we can write
P'
g(P)
and
J dal < E.
sum for the integral
(2)
J:
IX
dJ, say
SCPo a./)
" = :E
~=I
a(tJ 611t. =
k=l
" :E a.(tJJ(xJ -
J<s
" L a:(tJJ(Xk-l),
1
Furthermore, a refinement of P produces a corresponding refinement of P', and the converse also holds. If I: > 0 is given. there is a partition P~ of [a, b] such that P' finer than P; implies IS(P',J, a:) - J:fdal < 8. Let P; = g-I(P;) be the corresponding partition of [c. dJ, and let P = {Yo •...• Y.} be a partition of [c. dJ finer than p•. Form a Riemann-Stieltjes sum S(P, h, fJ) = where
Xi
= f(b)lX(b)
"
- f(ap(a),
A
Subtracting A-
:E!(x~xJ
t= 1
II
- L!(Xi-l)a(Xi-1).
k= 1
k=1
" :E
h(uJ AjJ",
S{P, a:,!)
LJ(xJ[a(xk)
1-1,
equations,
we find
Ul e [Yl-I> Yt] and AfJ~ = (J(y1t,) {1(Yl-I)' If we put tlo; g(Uk) and g(y.), then P' = {xo, ... , x.} is a partition of [a, b] finer than P;. Moreover,
- (X(li)]
~
LJ(Xl-l)[a:(tt)
l=l
we then have
- a(xt_1)].
S(P, h, (J)
The two sums on the right can be combined into a single sum of the form S(P' ,t. IX). where P' is that partition of [a, b] obtained by taking the points x. and lit. together. Then pi is finer than P and hence finer than p •. Therefore the inequality (2) is valid and this means that we have
" = EJ[g(Ut)]{a[g(y~)]
1=1
- a[g(Yl-l)]}
= S(P',J, IX),
=
since f" € proved.
NOTE.
(Xk-I,
k~l
" L J(lt){O:(Xl)
- a:(xt-1)}
S(P, IX,/)
xJ.
Therefore,!
S(P, h, fJ) -
J: f dal
<e
is
dill
<
E, that
s:r
J: a: df
exists
This theorem applies, in particular, to Riemann integrals, that is, when x, Another theorem of this type, in which g is not required to be monotonic, will later be proved for Riemann integrals. (See Theorem 7.36.)
I1(X)
INTEGRAL
7.7 REDUCTION
TO A RIEMANN
INTEGRAL
Theorem 7.7. Let f « R(a) on [a, b] and le: g be a strictly monotonic continuous [unction defined on an. interval S having endpoints c and d. Assume that a gee).
The next theorem tells us that we are permitted to replace the symbol dlX(x) by a:'{x) dx in the integral S:f(x) da(x) whenever IX has a continuous derivative «'.
146
11t. 7.8
111.7.9
147 AS INTEGRATORS
Theorem 7.8. Assume f e R(a) an [a, b) and assume that IX has a continUQUS derivative rT: on [a, b]. Then the Riemann integral f(x)a'(x) dx exists and we iuJve
J:
If
at
dx.
= f(x)a'(x)
S(P, g)
since each sum S(P,/. a) = O. However, if a is constant except for a jump discontinuity at one point, the integral da: need not exist and, if it does exist, its value need not be zero. The situation is described more fully in the following
J:l
1-1
" L g(/l)
Axk
" = L /(tJa'(tJ
t=1
< b. Define a on [a, b] as follows: The values a(a), if a :s; x < .C, if
c
The same partition P and the same choice of the Riemann-Stieitjes sum
S(P,J. a) =
til;
a(x) = «(a)
l-l
L f(tt)
"
<
x:s; h.
Aocll'
Let 1 defined on [a, b] in such a way that at least one of the functions / or IX is be continuous/ram the left at c and at least one is continuousfrom the right at c. Then / e R(a} on [a, b] and we have
k=!
" L /(tl)[a'(vl)
- a:'(tJ] ':uk'
NOTE. The result also holds if c = a, provided that we write aCe) for a(c-), and it holds for c = b if we write a(e) for a(c+). We win prove later (Theorem 7.29) that the integral does not exist if both f and 0: are discontinuous from the right or
Since f is bounded, we have I/(x)1 5: M for all x in [a, b], where M > O. Continuity of a.' on [a, b] implies uniform continuity on [a, bJ. Hence, if I> > 0 is given, there exists a ~ > 0 (depending only on e) such that
from the left at c. Proof. If c E P, every term in the sum S(P, /, a) is zero except the two terms arising from the subinterval separated by c, say
S(P,/. tx) !(tt-l)[CC(C) a(c-)]
os
[x -
yl <
implies
la'(x) - a'(Y)1
<
2M(b - a)
+ /(fk)[a(c+)
- «(c)].
If we take a partition P; with norm IIP;n < t;, then for any finer partition P we will have la'(v.} ~ ,,'(tt)I < 1l/[2M(b ~ a)] in the preceding equation. For such P we therefore have
IS(P,J, a) - S(P, g)!
where
tl;-1 S;
c s;
ti•
= [/(tll;-l)
a(c)],
- a(e-)J.
< 2"
IAI s 1/(tk-1)
- a(c)l.
On the other hand, since IE R(a) on [a, bJ, there exists a partition P; such that p finer than P:implies
I/(t"-I)
IS(p,/,
Il)
/dal < ;.
= P~ u P;,
a(c-)I
+ I>]a.(c+)
Combining the last two inequalities, we see that when P is finer than P, we will have IS(P, g) - J!I docl < s, and this proves the theorem.
NOT!'!.
But this inequality holds whether or not/is continuous at c. For example, if/is discontinuous both from the right and from the left at c, then «(c) = I%(c-) and o:(e) = a(c+) and we get A = O. On the other hand, if fis continuous from the left and discontinuous from the right at c, we must have aCe) = a:(c+) and we,get
148
net'.
7.10
1b.1.13
149
IAI ~ eIOl(c) - OI(C- )1. Similarly. if f is continuous from the right and discontinuous from the left at c, we have «(c) = I%(c-) and IAI s tIIX(c+) - lX(c)l. Hence the last displayed inequality holds in every case. This proves the theorem. Theorem 7.9 tens us that the value of a Riemann-Stie1tjes integral can be altered by changing the value of f at a siogle point. The following example shows that the existence of the integral can also be affected by such a change. Let
Example. «(x) = 0,
01'
$:1 da exists
and we
f(x) dct(x)
=~
!(X,clUl'
of integrals of the type
ih if
=
O. But if we re-define I so that /(0) = 2 and exist. In fact, when P is a par. tition which includes 0 as a point of subdivision, we find
1
J:
I d«
One of the simplest step functions is the greatest-integer junction. Its value at x is the greatest integer which is less than or equal to x and is denoted by [x]. Thus, [x] is the unique integer satisfying the inequalities [x] S X < [x] + L Tlreorem 7.12. Every finite sum can be written as a Riemann-Stieltjes fact, given 0 sum L:~J Q1. define/on [0, n] as follows: f(x) Then
=
integral. I"
S(P.J. II)
= J(r.J[«(x~)
ee
a~
ifk - 1 <
s; k
(k
= o.
f(tl)
JOt-I),
where x,,_ 2 ~ t 0 ~ tk ~ x.. The value of this sum is 0, 1, or - 1, depending on the choice of t" and 'k-l' Hence, l~l / th does not exist in this case. However. in a Riemann integral J! J(x) dx, the values of 1 can be changed at a finite number of points without affecting either the existence or the value of the integral. To prove this, it suffices to consider the case where I(x} 0 for aU x in [a. b] except for one point. say x c. But for such a function it is obvious that IS(P,.n1 :5 If(c)1~PII. Since IIP~can be made arbitrarily small, it rollows that i:J(x) dx = O.
'Ii_
t i1
a" :;
s; x.
f(k)
f:
f(x) d[x],
The greatest-integer function is. a step function, continuous from the right and having jump I at each integer. The function/ is continuous from the left at 1,2 •...• n. Now apply Theorem 7.1 L 7.to EULER'S SUMMATION FORMULA
7.9 REDUCTION
OF A RlEMANN-STIELTJES
The integrator in Theorem 7.9 is a special case of an important class of functions known as step functions. These are functions which are constant throughout an . interval except for a finite number of jump discontinuities.
II defined
We shall illustrate the use of Riemann-Stieltjes integrals by deriving a remarkable formula known as Euler's summation formula, which relates the integral of a function over an interval [a, h] with the sum of the function values at the integers in [a, b]. It can sometimes be used to approximate integrals by sums or, conversely, to estimate the values of certain sums by means. of integrals.
such that II is constant on each open subinterval (Xl-" Xl)' The number ,,(Xl +) rx(x,,-) is called the jump at Xl; if I < k < I'. The jump at XI is :x(x1 +) - :X(x1). and the jump at X. is cc(x,,) - o:(x. - ).
Step functions provide the connecting and finite sums: link between Riemann-Stieltjes integrals
fen)
f(x) dx
If/has
a continuous derioatioe
f' on
j'(x)«(x»
dx
+ f(o)(a»
- 1(b)(b»,
where «x»
x - [x).
T6eorem 7.I1 (Rehctioll of (J RiemaM-Stieitjes inteflrtll to ajitdte sum}, Let Cl be a step/unction defined on [a, b] with jump 7k al xt, where Xl' ... , are as described in Definition 7.10. Let f he defined on [a, b] in such a ~l'ay tluu not both f and ('(are
x.
];;1(n) ==
NOTE.
When
a and
f(x) dx
f'(x) (~ - [x] -.
dx
+ f(o) ; feb) .
L.,,~~~ means
[a]
1 to"
[b].
n,7.HI
151
we have
Since the
at the
ask: What the value of upper sums? This leads us to consider the inf of all upper sums, a number called the upper The defined be the sup of all lower sums, for functions) both these will be m'F"'''''~ wiU be different and it becomes an will ensure that the disCJJSSthis type of
b] and tet
If
combine thi£
INTEGRATORS, UPPER AND
,x
'XE
The
and
IX) =
k~ i
" l:
with respect
appear in the Riemanna vital role ill the fl"lJ>'II~"_ we shalluse the abbreviation "a? 011
.;;::-' and we if ? em from which it follows that the lower sums do not exceed.the upper sums, Furthermore, if I. E j) then
We
the upper and lower sums to the '·<'11 '''''! "" sums. do not which are used in the material that
l!l""C"~UI"
.,,,,,,,,!.\,,,,,,i
function,
The next theorem shows for (1., refinement of the increases the lower sums and decreases the upper sums, Theorem 7.15. Assume that
IX?
on
Then:
we have
151
Dd'.7.16
Th. 7.19
153
Proof. It suffices to prove (i) when P' contains exactly one more point than P, say the point c. If c is in the ith subinterval of P, we can write U(P',f. (X) =
Iom1
:E Mk_(f)
Then for every partition P of [0, 1], we have Mt(f) = 1 and m~(f) = 0, since every subinterval contains both rational and irrational numbers. Therefore, U(P, j) = 1 and L(P, 0 for all P. It follows that we have, for {a, b] [0, 1],
.da,l; + M'[o:(c)
«(Xi-I)]
M"[oc(xJ - o:(e)].
l""
and
f dx
o,
=
[X'-I'
M"
s M!f),
Observe that the same result holds if [(x) = 0 when x is rational, andf(x) irrational. 1.12 ADDITIVE AND LINEARITY LOWER INTEGRALS Upper and lower integrals ample. we have PROPERTIES OF UPPER AND
1 when x is
we have U(P',/. «) :::;:U(P.j, ct). (The inequality for lower sums is proved in a similar fashion.) To prove (ji), let P = PI U Pl' Then we have L(Pt.!.
NOTE.
a.)
s L(P,f,
0:)
a.).
of the integral.
For ex-
M[(X(h) - a(a)], if a < c < b, and the same equation holds for lower integrals. However. certain equations which hold for integrals must be replaced by inequalities when they are stated for upper and lower integrals. For example, we have
where M and m denote the sup and inf of/on Definition 7.16. Assume that «» on [a, b]' respect to 0: is defined as follows:
[a, b].
The upper Stieltjes integral
all
with
s: f
f
da = inf {U(P,J,
U + g}
+ g)
da
s: f
d«
J:
g da.
da.
sup {L(P,f,
a): P
.'P[a, b)}.
f (/
d« ~
S: f
da
g d«.
NOTE.
We sometimes write J(f. a.) and fU 0:) for the upper and lower integrals. In the special case where t.J:(x} = x, the upper and lower sums are denoted by U(P,f} and L(P.f} and are called upper and lower Riemann sums. The corresponding integrals. denoted by J:f(x) and by f(x) dx, are called upper and
ax
J:
lower Riemann
integrals,
by
J.
7.13 RIEMANN'S
CONDITION
G. Darboux «).
(1875).
s 1(f.
>0
PI such that
By Theorem 7.15. it follows that L(P, f. a). Hence, l(f. Of) ~ l(f, [(f. a) S J(f. «). EDmple. define/on
tu: a) + s is an upper
Of)
< J(f. a)
+ e,
If we are to expect equality of the upper and lower integrals, then we must also expect the upper sums to become arbitrarily close to the lower sums. Hence it seems reasonable to seek those funetions j for which the difference U(P,f, IX) L(P,J, a) can be made arbitrarily small.
a, and, since
1l
Definition 7.18. We say that f satisfies Riemann's eondttton with respect to rx on [a, b] if,Jor every Il > 0, there exists a partition P; such that P finer than P; implies
os
Theort!m 7.19. Assume thai
equivalent: i)
U(P,J, a)
LJ..p.f,
e) < s,
It is easy to give an example in which l(/' «) < [0, J} as follows: [(x) = 1, if x is rational, f(x)
=
us. «).
«»
0,
if
.r is irrational.
f e R(a)
on [0, b).
151
n.1.)'
7.14 COMPARlSON DIEOKEMS
ii) Isatisfies Riemann's condition with respect to a on [a, b]. = J(f, a).
Proof. We will prove that part (i) implies (ii), part (0) implies (iii), and part (ill) implies 0). Assume that (i) holds. H a(b) = «(a). then (ti) holds trivially, so we can assume that «(a) < a(b), Given Il > 0, choose P, so that for any finer P and all choices of and t~in [Xi-I> x,J, we have
TIteouIII7.1O. Assume that «» on (a. b]. lIfe R(<<) and g e R{rz) on [a. b) and flAx) s g{x)/or all x in [a, b]. then we hatJe
't
r
I;:r
f(x)
da:(x);o;;
r
•
P, the corresponding
Itf(tJ
1=1
Aa& ~
AI
< !!_ 3
• E/(tJ ~
1
;0;;
:where A
= J!I
.t-I
E g(tJ.!\al
follows
S(P,
e. a),
g(x) :<!: 0
da. Combining
these inequalities,
From
tbis the
theorem
< ~ s.
3
[Xt-h
implies that
r. g{x) da(x) 2 0
1!(x)1
da(x).
easily.
whenever
Since M.. (f) - m.(n = sup fl(x) - Ax'): x. x' in for every h > 0 we can choose t" and t~so that
TMot'e1ll 7.21.
Auume that
fl./'
on [a, b].
f{tJ -
Int.(/) - h.
Making a choice corresponding to h = te/[a(b) - I%(a)]. we can write U(P,I, (I) L(P,/. a)
If
I{x) d!Z{X)1
of Definition
5;
=
<
•-1
L [Min
"
- mk(f)] ~
f(tl)] At'll
Hence, (i) implies (ii). Next, assume that (ii) holds. He> 0 is given, there exists a partition P, such that P finer than p. implies U(P, J, IX) < L(P.J, a) + e. Hence, for such P we have
1i~1
L Aa4
..
M1(f) - m.(f)
< e.
we
1I/(x)1 - II(Y)11
MUltiplying
U(p.
J(f. It)
S;
+s
III. fl.)
L(P.I/I.
L(p.f. «).
S;
I(/' a.}
+ e.
That is, 1(f, a) ,s; l(f, a) + Il for every t > O. Therefore, I(f, a.) S; l(f. 12:). But, by Theorem 7.17. we also have the opposite inequality. Hence (il) implies (iii). Finally. assume that [(f, a) = l(f, tx) and let A denote their common value. We will prove tbat cia: exists and equals A. Given,fl > 0, choose P; so that U(P,j, Il) < l(f, a) + e for all P finer than P;. Also choose P~ such that
for every partition P of (0. b]. By applying Riemann's condition, we find that III € R(I2:)on [a, bJ. The inequality in the theorem follows by taking g = III in
Theorem 7.20.
NOTE.
J!I
> l(f, al - s
[a, b].
Proof. Using the notation
of Definition
€ R(<<)
on
P;.
I:
If P,
l(f. a) -
< L(P,/,
ct:)
S;
+t
Ml(f~)
Hence
[Ml(I/IW
m..(p) = [mA;(lfI)]2.
- mil/l))
for every P finer than P,. But, since l(f, a) = 1(f, a) = A, this means that IS(P, f, (.() A I < Ii: whenever P is finer than This proves that f tkt exists and equals A, and the proof of the theorem is now complete.
we
can write
r;
J:
+ mt<:lfl)][MA;(lflJ
::;;2M[M,,(I/I) - m.,(lfl)].
Th 7.2.3
n.7.25
157
By applying
Riemann's
condition,
we note that
-
Alk n
7.21.
Assume that
E
IX?
the productf-g
R(a:) on [a.
bJ.
IE
[a. b].
then
~~l
L: [Mol(/)
mt(f)](AVt
IAalD
S;
2M
1;=
:E (AY.
I
- IAeltl)
= 2M (V(b)
k~1
i: iAllkl)
< A},
<
t.
[I(x)
+ g(xW
[{(X)]2
- [g(x)r·
R(P)
{k: ~
7.1S INTEGRATORS
OF BOUNDED
VARIATION
In Theorem 6.13 we found that every function IX of bounded variation on [a, bJ can be expressed as the difference of two increasing functions. Jfr;x = 01:1 az is such a decomposition and if Ie R(al) andfE R(lXz) on [a, bJ. it follows by linearity thatfe R(a.) on [a, b]. However. the converse is not always true. Hfe R(a:) on [a. b], it is quite possible to choose increasing functions al and (X2 such that Cl ()!1 01::2, but such that neither integral drx" da.2 exists. The difficulty, of course, is due to the ncnuniqeeness of the decomposition a a} - 1%2' However, we can prove that Qi:ere is at least one decomposition for which the converse is true, namely, when a'l is the total variation of IX and 0:2 = a1 - a. (Recall Definition 6.8.)
and let h
= !J:./ Y(b).
Hk
t; so that
f(t,;) - I(t~) > M,,(/) - m.(f) - II; but, if k e B(P). choose Then
It and Ii so that
J:I
J:f
It.= I
L [M.(f) -
..
k,4(P)
:E
[f(t.)
f(t.OJ fAa..1
- 1(t.)]
+ IEB(I',)[J(tt) L
=
i~l
I~I + h
.-1
L IAall
TMonm 7.24. Assume that a is o/boundedvariiltion on [a, b]. Let V(x) denote the total variation of IX on [D, x] if a < x s; b, and let Yea) = O. Let I he defined and bounded 011 [a, bJ. lIfE R(a) all [a, bJ, then/E R(V) on [a, b]. Proof, If V(b) 0, then V is contant and the result is trivial. Suppose therefore, that V(b) > O. Suppose also that rf(x)1 s; M if x EO [a, b]. Since V is increasing, we need only verify that/satisfies Riemann's condition with respect to Von [a, b]. Given e > O. choose P~ so thit for any finer P and all choices of points til; and
t~ in
L [J(t
l)
f(t.t)J Ar.rl
t
+
8
11
.~l
L IAall
<
It follows thatfe
NOTE.
8 + hV(b) = -- +
442
= -.
[Xl ~ 1. Xt] we
I~
have
[f(tJ
- f(t0]
&x.j < ~
and
V(b) <
t~1
1: lAIlll
+ ....£.. •
4M
This theorem (together with Theorem 6.12) enables us to reduce the theory of Riemann-Stieltjes integration for integrators of bounded variation to the case of increasing integrators. Riemann's condition then becomes available and it turns out to be a particularly useful tool in this work. As a first application we shall obtain a result which is closely related to Theorem 7.4. Th«I'Dn 7.25. Let «. he 01 bounded variation on [a, b) and assume that [a, b]. Then I E R(r.r) on every subinterval [c, dJ 0/ [a, b].
t;; [MJ<U)
"
and
which, by addition,
f e R(Il)
on
- mk(f)](AJIi. -
lAllA!) < ;,
J.
V) < e.
Proof. Let V(x) denote total variation of a on [D, x], with Veal = O. Then a = V - (V - IX). where both V and V - 0% are increasing on [a, b (Theorem 6.12). By Theorem 7.24,/e R(V), and hencefe R(V - «) on [a, b]. Therefore, if the theorem is true for increasing integrators, it follows that f E R( V) on [c, d] aud/e R(V - cr) on [c, sofe R(r.r) on [c. dJ.
too
dJ,
Th.736
'lb. 7.211
15'1
Hence. it suffices to prove the theorem when ('f.~ on (a, b]. By Theorem 7.4 it suffices to prove that each integral J~fda and J:fdCl: exists, Assume that a < c < b. If P is a partition of [a, x], let A(P, x) denote the difference A(P. x) = U(P,f, a) - L(P,/. IX), of the upper and lower sums associated with the interval [0, x], Since f e R(ex) on [a, b], Riemann's condition holds. Hence, if e > 0 is given, there exists a partition P, of [a, b] such that A(P, b) < I; if P is finer than P•• We can assume that C E r; The points of r, in [a. c] form a partition r; of [0, c]. If P' is a partition of [a, e] finer than P:. then P = P' v p. is a partition of [0, b] composed of the points of P' along with those points of p. in [c, b]. Now the sum defining A(P', c) contains only part of the terms in the sum defining A(P, b). Since each term is ~O and since P is finer than p .. we have
A{P', c) 5; A(P, b)
Therefore, if M,
IS(p,j, G) -
5;
Mil
t. S:.
r
=
sup {lg(x)1 ; x
f·g dal
I~f:-,
=
t l~-l
da.(t)1
M,{U(P,f,
Since f E R«('(). for every I': > 0 there is a partition P, such that P finer than P, implies U(P./. fl.) L(P,/. a.) < 1;. This proves that je R(G) on [a. b] and that J:f' 9 da. = J:f dG. A similar argument shows that 9 e R(F) on [a, b] and that J:f'g da = dF.
S:g
NOTE.
(l
<
That is, P' finer than P; implies A(P', c) < e. Hence, f satisfies Riemann's condition on [a, e] and f~f da. exists. The same argument, of course, shows that f:f da. exists, and by Theorem 1.4 it follows that J~fda. exists. The next theorem is an application of Theorems 7.23, 7.21, and 7.25.
Tleonm Define 7.26. Assume fe R(o:) and 9 e R(o:) on [a, h], where (Xl"'
In most of the previous theorems we have assumed that certain integrals existed and then studied their properties. It is quite natural to ask: When does the integra] exist? Two useful sufficient conditions will be obtained.
1'Ieorem 7.27. Iff is continuous on [a, b] and
F(x) and
on [a, b].
if
II is
f(t) da.(t)
By Theorem 7.6, a second sufficient condition can be obtained by interchangingfand a in the hypothesis.
G(x)
Then
i""
Proof.
g(t)drx(t)
E
ifxe[a,b].
IE
R(G), 9
f(x)g(x)
da.(x)
=
=
Proof
[a, b] we have
S(P,j,
The integral
r r
n» dG(x)
g(x) dF(x).
It suffices to prove the theorem when 0: l' with a(a) < a(b). Continuity of f on [a, b] implies uniform continuity, so that if 1: > 0 is given, we can find Jj > 0 (depending only on 8) such that Ix - yl < ij implies If(x) - f{y)1 < etA, where A = 2[a:(b) - a(a)]. If P, is a partition with norm lIP.1I < a, then for P finer than p. we must have Mt(f) - fflk(f) 5; £'/A, since Mk(J) - m.(J) = sup (f(x) - f()-') ; x, y in [x._!, XI]}' inequality by.IJ.t'l.and summing, we find
U(P,f,
Multiplying the
a) - L(P.f,
ex)
i: A
£, k=1
A2:k
-<£ 2 '
and we see that Riemann's condition holds. Hence.j e R(~) on [a, bJ. For the special case in which a(x) corollary:
Theorem 7.28. Each of the following Riemann integral x, Theorems 7.27 and 7,6 give the following conditions is sufficient for the existence of the
G)
=
6
i"''
~-J
get) dt>:(t) =
and
f f(x)g(x)
Q
drx(x)
e~ b; 1,,-.
~
At J~k-l ex"
1
f(t~)g(t)
da(t),
f(l)g(l)
drx(t).
,,,:
J: fix)
lb. 7.19
Th.7.32
161
FOR EXISI'ENCE
OF RIEMANN-S'I1ELTJES
When !:I: is of bounded variation on [a, oJ, continuity of/is sufficient for the existence of J:/ da, Continuity of / throughout [a. h] is by no means necessary, however. For example. in Theorem 7.9 we found that when a: is a step function, then/can be defined quite arbitrarily in [a. bJ provided only thatfis continuous at the discontinuities of 11. The next theorem tells us that common discontinuities from the right or from the left must be avoided if the integritl is to exist. TMOum 7.19. Assume that al" on (0, bJ and let a < c < h. Assume further that both 11 and / are discontinuous from the right at x = e; that is, assume that there exists em a > 0 such that for every Ii > 0 there are values 0/ x and y in the interval (e, c + 5) lor which
I/(x) - /(c)! ;:::: £ / are discontinUOus from the left at c. Proof. lA:t P be a partition of [a, b] containing c as a point of subdivision and form the differ:mce U(P,f, a) L(P.J. a:) = and I(I(Y) -
I(x) drz(x) = c
In particular, if/is
continuous on [a,
bJ, then
= I(xo)/or
some
Xo
in [a,
bJ.
Proof If a(a) = a(b), the theorem holds trivially, both sides being O. Hence we .can assume that tx(a) < a(b). Since all upper and lower sums satisfy m[a:(b) - 1):(0)] S L(P,f, «) :>; U(P,/, a) :>; M[ll(b) - o:(a)],
da must lie between the same bounds, Therefore, the quotient (J!fda)/(J! da.) lies between m and M. When/is continuous on [0, bJ, the intermediate value theorem yields c = f{xo) for some Xo in [0, bJ.
the integral
J:/
a(e)1 ;::::II.
A second theorem of this type can be obtained from the first by using integra-
Then the integral J:/(x) th(x) cannot exist. The integral also/ai13 to exist
if a: and
tion by parts.
Tluorsm 7.31 (&cvml Muut-Yalue T1teorem for RiellJlllla-Stiehje.s integrflls). Assume that a; is continuous and that f /' on [a. b]. Then there exists a paint x(J
I !
1:=1
L [M.(f)
-
- mlU)]
,6l.1l·
s:
f{x) da(x)
= I(a)
fO
da:(x)
+ f(b)
fo
da(x).
proof
mil)][«(xJ - a(c)],
since each term of the sum is ;:::: If c is a common discontinuity from the right. O. we can assume that the point Xi is chosen so that a(XI) - «(c) 2!:. e, Furthermore. the hypothesis of the theorem implies M IJJ
U(P,
S:
f(x)
a(x) df(x).
mlJJ:l!: e. Hence,
J. a:)
- L(P,
J. «)
:l!:e2, where Xo
E
f(x)
drx(x) = f(a)[a(xo)
rx{a)J
+ J(b)[a(b)
- l.I(xo)],
and Riemann's condition cannot be satisfied. (If e is a common discontinwty from the left, the argument is similar.)
7.18 MEAN-VALUE THEOREMS FOR RIEMANN-STlELTJES INTEGRALS
7 .r 9 THE INTEGRAL
Although integrals occur in a wide variety of problems, there are relatively few eases in which the explicit vaJue of the integral can be obtained. However, it often suffices to have an estimate for the integral rather than its exact value. The M eon Value Theorems of this section are especially useful in making such estimates.
1'IIurem 7.JO(Fir. Metlll- V4Iae Tbttoremfar Rk~ie/tjtl, itIIegrllb). Assume that I%/' and let f « R{a) on [a, hJ. Let M and m denote, respectively, the sup and inf of the set {f(x): x e [a, bJ}. Then there exists a reall1umher e satisffing
Iff E R(a) on [a, b] and jfa: is of bounded variation, (hen (by Theorem 7.25) the integral J:/ da exists for each x in [a, b] and can be studied as a function of x. Some properties of this function will now be obtained.
[0, b].
Theorem 7.12. Let IX IN: 0/ bounded variation on [a, b] and assume that / Define F by the equation F(x) =
e R(a;) on
1""f d«,
if x e [a, b]'
· 161
Th.7.33
a.a..: of Variable
(a, b) and has the value
1()J
Then we have.'
i) F is of bounded variation on [a, Ii) Every point of continuity of
It
b].
Q
= f(x)
is also
iii) If «» on [a, b], the derivative F'(x) exists at each point x in (a, b) where a;'(x) exists and where f is continuous. For such x, we have F'{x) Proof
that F(J') - F(x) =
= f(x}«'{x).
g(a) - g(o+)
= g(b) -
Then we have
f
~
f(x)
dx
g'(x) dx
J d« =
Proof
c[a.(y) - a(x)].
where m ::; e ::s;; M (in the notation of Theorem 7.30). Statements (i) and (ii) follow at once from this equation; To prove (iii), we divide by y - x and observe that c ~ fix) as y ~ x. . When Theorem 7.32 is used in conjunction with Theorem 7.26, we obtain the following theorem which converts a Riemann integral of a product f" 9 into a Riemann-Stieltjes integral J:f riG with a continuous integrator of bounded variation. Tkorem 7.33. lffe F(x)
gCb)
l~ 1
L: [g(x.)
- g(Xl-1)]
1;= 1
L: 0'(/,,)
..
Axl
L:f(t,)
k= 1
Ax...
where tic is a point in (Xt-l' xi) determined by the Mean-Value Theorem of differential calculus. But, for II. given B > 0, the partition can be taken so fine that 19(b) - g(a) -
/(x) dxj
l~f(tJ
4xk -
n» dxl < s,
G(x)
The second fundamental theorem can be combined with Theorem 1.33 to give the following strengthening of Theorem 7.&,
get) dt
fe
Then F and G tire continuous fimcti()ns of bounded f1QJ'iation on [a, b]. R(G) and 9 E R(F) on [a, b], and we haoe
f(x)g(x)
ax
f(x)
dG(x)
Tlietlrem 7J5. Assume fER 011 [a, b]. Let a.be Q junctUm which is cominuous on [a, b] and whose derivative a.' is Riemann integrable ,oil [a, b]. Then the foll,owing integrals exist tmd are equal:
g(x) dF(x}.
s:
/(x) drz(x) =
s:
f(x}:l'{x)
dx.
Proof Parts (i) and (ii) of Theorem 7.32 show that F and G are continuous functions of bounded variation on [a, b]. The existence of the integrals and the two formulas for J:f(x)g(x) dx follow by taking a(x) x in Theorem 7.26.
NOTE.
Proof. By the second fundamental theorem we have, for each x in [a, b],
«(x) - t.(a) =
I%'(t) dt.
fundamental
When a:(x) = x, part (iii) of Theorem 7.32 is sometimes called the first theorem of inteqral calculus. It states that F'(x) = fix) at each point of continuity off A companion result, called the second fundamental theorem, is given in the next section.
UO SECOND FUNDAMENTAL TIIEOREM OF INTEGRAL CALCULUS
Taking 9
NOTE.
The formula $:/ drz == J~h d{J of Theorem 1.7 for changing the variable in an integral assumes the form
Tlu!orem 7.34 (S«OM folldtmtelltl.d 'h#Ore/ll of illtegr. ~). Assume tiuu fER defined on [a. b] such that the derivative g' exists in
0(1' f(x)
dx =
fd
<
f[g(t)Jg'{t)
dt,
f(~)
164
TIt.. 7.36
Th.7.37
165
when «(x) = x and when 9 is a strictly monotonic function with a eontinuous derivative g'. It is valid if J e R on [a. b]: When J is continuous, we can use Theorem 7.32 to remove the restriction that 9 be monotonic. In fact, we have the following theorem: 7.36 (C~ of lUfridle ill " R1emt:ura intq,./). Assume that 9 has a continuous derivative g' on QII interval [c, d]. Let f be continuous on g([e, and define F by the equation
Iheou",
dJ)
In particular, the result is valid if g(c) = g(d). This makes the theorem especially useful in the applications. (See Fig. 7.2 for a permissible g.) Actually, there is a more general version of Theorem 7.36 which does not require continuity of J or of g', but the proof is considerably more difficult. Assume that II E'R on [c, d] and, if x e [c, d], let g(x) = h(t) dt. where a is a fixed point in [c, d]. Then if Ron g([c, d]) the integral S~![g(t)] h(¥) dt exists and we have
r«
S;
F(x)
JI'I<:)
r'"
f{t) dt
if x
g([ c, d]).
dt exists QIId has the value
9(~)
f(x) dx
Id J[g(t)]h(t)
c
dt.
IIle)
Then, fOT each x in [c, d] the integral S~f[g(t)]g'(t) F[g(x)]. In particular; we have
l'ld)
l(x) dx
11l[g(t)]g'(t)
c
dt.
on [c, d]
This appears to be the most general theorem On change of variable in a Riemann integral. (For a proof, see the article by H. Kestelman. Mathematical Gazette, 45 (1961), pp, 17-23.) Theorem 7.36 is the special case in which" is continuous on [c, d] andJis continuous on g([c, d]). 7.22 SECOND MEAN·VALUE THEOREM FOR RIEMANN INTEGRAlS Theorem 7.37. Let 9 be continuous and assume that two real numbers salis/ring the inequalities
,(c)
Proof Since both g' and the composite function fog are continuous the integral in question exists. Define G on [c, d] as follows: G(x)
f'
f»
J[g(t)]g'(t)
By Theorem
A :sJCo+)
Then there exists a point
and
B ;;:f(b-).
G'(x) = f[g{x)]g'(x), and, by the chain rule, the derivative of F[g(x)] is also f[g(x}]g'(x), sinceF'(x) = j(x). Hence, G(x) - F[g(x)] is constant. But, when x c, we get G{c) = 0 and F[g(c)] = 0, so this constant must be O. Hence, G(x) = g(x)] for all x in [c, d]. In particular, when x = d, we get G(d) = F[g(d)] and this is the last equation in the theorem.
i)
r
I"
til
Xo
f(x)g(x) dx
1"" g(x) dx +
r g(x) dx, J;.:o
b
f:
g(x) dx,
Fr
In particular,
if /(x)
ii)
f(x)g(x)
dx = B
where
XI) €
[a,
bJ.
NOTE. Some texts prove the preceding theorem under the added hypothesis that g' is never zero on [c, d], which, of course, implies monotonicity of g. The above proof shows that this is not needed. It should be noted that g is continuous on [c, d]. so g([c. dJ) is an interval which contains the interval joining g{c) and g(d).
NOTE.
Proof
If oc(x)
J: get)
f(x)g(;x) dx
= f(a)
I'·
7.31 is applicable,
g(x) dx
+ feb)
f:
and we get
g(x) dx.
Figure'.:!
This proves (i) whenever A = f(a) and B = f(b). Now if A and B are any two real numbers satisfying A :s f(a+) and B ~ J(b-). we can redefine/at the endpoints a and b to have the "Values/(a) = A andJ(b) = D. ThemodifiedJis still increasing on [a. b] and, as we have remarked before, changing the value off at a finite number of points does not affect the value of a Riemann integral. (Of course. the point Xo in (1) will depend on the choice of A and D.) By taking A = O. part (ii) follows from part (i),
166
'l'L7.l1
167
I_rem
bounded variation
[a, b] and,for
Assume that a: is of each fixed y in [c. d], assume that the integral
f(x, y) : a ~
X~
h. c ~ y ~ d}..
.As$U11lethat IX is of bounded oariati(}1I tm [a. b) und let F be the fimction [c. d) by the equation F(y) Then F is continuous on (c, =
F(y) "'"
defined on
lex, y)
da.(x),
f(x, y) drx(x).
If the partial derivative D,./ is continuous on Q, the derivative F'(y) exists for each y in (e, d) and is given. by
exists.
F'(y)
NOTE.
d].
In other words.
if Jlo e
lim !(x.
'-+10
[c.
d]. we have
D:J(x, y) dll(x).
lim
'-10
f.b lex,
"
y)
dct{x)
=
=
r
J..
e"
y) d«(x)
In particular, when 9
F(y)
g(x)/(x. y) dx
Proof Assume that «I' on [a, b). Since Q is a compact set, I is uniformly continuous on Q. Hence. glven 8 > 0, there exists' a ;; > 0 (depending only on e) such that for every pair of points z = (x, y) and z' = (z', y') in Q with Iz - z' J < lJ. we have lAx, y) Ax. )"')1 < B. If Lv- )"'1 < lJ, we have
IF(y) F(y')1 s
5." I(x,
"
f.b
•
D:J(x.
Y) drx(x).
where j is between y and Yo. Since Dzf is continuous clusion by arguing as in the proof of Theorem 7.38. 7.25 INTERCHANGING
I/(x,
y) -
Tlleo"", 7.41. Let Q = {(x, y) : a :s: x S b. c ~ y s d}. Assume that oX is 01 bcunded variation on [a. b], P is 01 bounded variation on [c, d], and f is continuous on Q. If (x, y) € Q, define
F(y) Then
FE
I(x, y) da.(x),
G(x)
lex, y)
dP(Y).
.[a, b].
TMonm 7.39.
If I is
conf;nu01lS on the rectangle [a, b] x [c, d]. and F defined by lire equation .F(y)
if g e R
on
R(P) lm [c.
d]. G E R(a)
y(x)f(x.
)I) dx;
if Yo
r [f
F(y) dP(y)
G(x) dix(x) .
f U:
f(x,
y) da(X)] dP(y)·
Proof. If G(x)
= J~g(t)
J!f(x, y) dG(x).
Similarly.
Proof. By Theorem 7.38, Fis continuous on [c, d] and hence F E R(P) on [c. d]. G E R(a) on [a, b]. To prove the equality of the two integrals, it suffices to consider the case in which a » on [a, b] and P » on [e, d].
168 By uniform continuity, given e > 0 there is aD> points z = (x, y) and z' = (x'. y') in Q. with Iz -
7.41
Def.7.43
16
Tllurem 7A2. Let f be continuous on th« rectangle [a, b] x [c, d]. If 9 [a, b] and if heR on [c, d], then we have
lAx,
y) - f(x'.
i)1 < e.
[0. b] and [c, d]
Let us now subdivide Q into n1 equal rectangles by subdividing each into n equal parts. where n is chosen so that and Writing (d - c)
r U:
Ron
g(x)h(y)f(x,
y) dY] dx =
f U:
g(x)h(y)f(x,
_v)
dX]
dy. 7.26
J~g(u) du and
let P(y)
= H h(v)
-n-
< .../2"
7.26 LEBFSGUE'S
OF RIEMANN INTEGRAlS
x.
=a
+ -'----'n n. we have
k(b -a)
and
for k == 0, 1, 2, ...•
f \Jcr"
b/
II
l~O J
f(x,
y) dfJ(y}) dlX(x}.
We apply Theorem
.-1 .-1
i::;JiliO JAO
L L f(x;.
yj)[p(Yj+l)
'
- P(YJ)][<r(xu I) - G1(xJ].
Every continuous function is Riemann integrable. However, continuity is certainly not necessary, for we have seen thatfE R whenfis of bounded variation on [a, b]. In particular.j" can be a monotonic function with a countable set of discontinuities and yet the integral J!f(i) dx will exist. Actually, there are Riemann-integrable functions whose discontinuities form a noncountable set. (See Exercise 7.32.) Therefore. it is. natural to ask "how many" discontinuities a function can have and still be Riemann integrable. The definitive theorem on this question was discovered by Lebesgue and is proved in this section. The idea behind Lebesgue's theorem is revealed by examining Riemann's condition to see the kind of restriction it puts on the set of discontinuities off. The difference between the upper and lower Riemann sums is given by
Qt,j having
Y)
and
(X.l:+1.
Yj+l) as opposite
"~1
" L [Mi(f)
mi(f)}
Ax",
f (1" If
f(x,
y)
d~X»)
.. -I
dP(y)
,,-1
k=O
L L f(x;, yj)[,6(Yj+
j=O
I) - P(YJ)][G1(XU1)
E
- G1(xJ].
G(x) da(x) -
and hence
F(y) dP(y)1
.. -1
<s
=
JEO
L [fJ(YJ+I)
,,-1
P(y)]
l~O
L [IX{X.Hl) -
and, roughly speaking, / will be integrable if, and only if, this sum can be made arbitrarily small. Split this sum into two parts, say SI + S2' where 51 comes from subintervals containing only points of continuity of J, and 82 contains the remaining terms. In S1> each difference Mt{f) - m.(f) is small because of continuity and hence a large number of such terms can OCCurand still keep SI small. In 8:1.' however, the differencesM.(f) - mi.(/) need not be small; but because they are bounded (say by M), we have IS21 :;;; M so that 82 will be small if the sum of the lengths of the subintervals corresponding to 8z is small. Hence we may expect that the set of discontinuities of an integrable function can be covered by tervals whose total length is small, This is the central idea in Lebesgue's theorem. To formulate it more precisely we introduce sets of measure aero,
Lax"
IX{xJ]
Defotititm 7.43. A set S of real numbers is mid to halJemeasure zero if, for every ~> 0, there is a counlab1tt covering of S by open intervals, the sum 0/ w/wse lengths i~ less than e. If the intervals are denoted by (ak'
Since e is arbitrary.
this implies equality of the two iategrals. with Theorem 7.26 gives the following result for Rie-
btl,
the definition
U (a
t
i•
bJ
and
t'7G
Th. 7M
1Ia. 7.48
171
If the collection of intervals is finite, the index k in (3) runs over a finite set. If the collection is countably infinite, then k goes from I to co, and the sum of the lengths is the sum of an infinite series given by lim
N""oo
only on 8) such that for every closed subinterval T whenever the length of T is less than h. Proof. For each x in [a, b] there exists a l-ball B"
<;;;
nAT) < e:
1:-1
L (bl.
aJ.
Besides the definition, we need one more result about sets of measure zero. TMOTt!lft 7.44.
each of which has measure zero. Then their rmion S also has a measure zero.
=
~~I
U r;
..,
The set of all half size balls B(x; ~>J2) forms an open covering of [a, b]. By compactness, a finite number (say k) of these COVel' [a. b]. let their radii be 8,,2 •...• ~k{2 and let Jj be the smallest of these k numbers. When the interval T has length <0, then T is partly covered by at least one of these balls, say by B(x,,; 6,J2}. However, the ball B(x,,; h,,) completely covers T (since ij, 2: 20). Moreover. in B(x,.; lJ,,) n [a, b] the oscillation of lis less than e. This implies that !lAT) < £ and the theorem is proved .
f be defined and bounded on [a. b]. For each e > 0 define the
I. {x: x
E
Proof. Given 8 > 0, there is a countable covering of Fl by open intervals. the sum of whose lengths is less than 8/2t. The union of all these coverings is itself a countable covering of S by open intervals and the sum of the lengths of all the
intervals is less than .
[0,
bJ•.
Wf(X) 2: e},
Then J. is (J closed set. Proof. Let x be an accumulation point of Je. If x ¢ I.. we have w_,(x) Hence there is a l-ball B(x) such that
n_,(B(x) ('\ [a, b])
< e.
&=12'"
'" L _!_
= e.
<
e.
EumpIes. Since a set consisting of just one point has measure zero, it follows that ewry countable subset of R has measure zero. In particular, the set of rational numbers has measure zero. However, there are lIMountable sets which bave measure zero, (See Exercise 7,32.) Next we introduce
DefinitiOll
Thus
DO
accumulation
TMorem 7.48 (Lebesglle's uiteritJ1f for Rielfltfllll-iRtt!f/rability). Let f be defined and bounded on [a, b] and let D denote theset aldiscontinuities olfin [a. b]. Then feR on [a, b] if, and only if, D has measure zero.
7.45.
number
II T
S;
S, the
Proof (Necessity). First we assume that D does not have measure zero and show
thatfis not integrable. We can write D as a countable union of sets
is called the osc;llation of Ion T. The oscillation off at x is defined to be the number mAx)
NOTE.
This limit always exists, since OiB(x; h) n S) is a decreasing function of Jr. In fact. T, <;;; Tz implies nATI) s n.r<T2). Also, w1(x) = 0 if. and only if, lis continuous at x (Exercise 4.24),
If x e D, then wAx) > 0, so D is the union of the sets D, for r = l, 2, ... Now if D does not have measure zero, then some set Dr does not (by Theorem
The next theorem tells us that if wf(x) < II at each point of a compact I; for all sufficiently small subintervals T.
interval
7.44), Therefore, there is some £ > 0 such that every countable collection of open intervals covering D. has a sum of lengths ~e. For any partition P of [a, b] we
Tlrreor~1II7.46. Let I be defined and bounded on [~ b], and let e > 0 be given. Assume that wAx) < Bfor erJeryx in [0, b). Then there exists Q 6 > 0 (depending
l;1
" L [Mt(f)
- m.(f)) 4xt
Sl
+ S2 ~
S"
111
1'h.1M
Th.7..50
173 on
where SI contains those terms coming from subintervals containing points of D in their interior, and 82 contains the remaining terms. The open intervals from SI cover Dr except possibly for a finite subset of D .. which has measure 0, so the sum of their lengths is at least e. Moreover, in these intervals we have I 6 M.(f) - m..(f) ~ and hence SI ;;:0; - •
that a bounded functionfon a compact interval [a, b) is Riemann integrable [a, b] if, and only if./is continuous almost everywhere on [a, b]. The following statements
(some of which were proved earlier in the chapter) of Lebesgue's theorem. bounded variation on [a, b], then fER
E
I is of
on [D, b].
e U(P.J) - 4.P,J) ~ • r
P, so Riemann's condition cannot be satisfied. Therefore f is integrable. In other words, iff E R. then D has measure zero. (Sufficiency). Now we assume that D has measure zero and show that the Riemann condition is satisfied. Again we write D = I' Dr. where Dr is the set of points x at which wt<x) ~ I/r. Since Dr :!;;;: D. each D. has measure 0, so Dr can be covered by open intervals, the sum of whose lengths is < l/r. Since D"is compact (Theorem 7.47), a finite number of these intervals cover Dr' The union of these intervals is an open set whlch we denote by A•. The complement Br = [a, b] - A, is the union of a finite number of closed subintervals of [a. Let J be a typical subinterval of B,. If x E I. then wt<x) < llr so, by Theorem 7.46, there is a 6 > 0 (depending only on r) such that I can be further subdivided into a finite number of subintervals T of length <6 in which O.,(T) < l/r. The endpoints of all these subintervals determine a partition P, of [a, b]' If P is finer than P; we can write for
[a, b].
c) lffe
III E R and f2
R on [a. h). thenfe ROil [e, d]for E R on [D, b]. Also, f·g
R on [0,
bJ
not
every partition
011
[a,
U~
d) Iff and 9 are bounded functio1l8 having the same t/is(xmtinuities on [a. bJ. then fe R on [a, b] if, and only if. 9 e R on [0, b]. e) Let 9 e R on [a, b] and amune that m s g(x) ;5; M for all x in [a. b]. Iffis continuous on [m, M], the comp08;tefunction h defined by hex) = f[g(x)] is Riemann-integrable on [D, b].
NOTE.
from O.
b) whenever
oJ.
if
we assume
only that
fe R
on
Em, M].
RIEMANN-sTIELTJF.S
INTEGRALS
U(P,j)
- [{PJ)
t=1
.. :E [M,(J)
- m.(f)] Ax. = SI
S2.
of
where S, contains those terms coming from subintervals containing points Dr> and S2 contains the remaining terms. In the kth term of S2 we have and hence Si Since A. covers all the intervals contributing
to SI. we have
S 1:$
where M and m are the sup and inf U(P.f)
M
---
r m
off
Riemann-Stieltjes integrals of the form da.. in which / and ex: are complexvalued functions defined and bounded on an interval [a, b are of fundamental importance in the theory of functions of a complex variable. They can be introduced by exactly the same definition we have used in the real case. In fact. Definition 7.1 is meaningful when f and IX are complex-valued. The sums of the products f(tJ[ct(xJ - ct(Xt-I)] which are used to form Riemann-Stieltjes sums need only be in_terpreted as sums of products of complex numbers. Since complex numbers satisfy the commutative. associative, and distributive laws which bold for real numbers, it is not surprising that complex-valued integrals share many of the properties of real-valued integrals. In particular. Theorems 7.2, 7.3, 7.4. 7.6, and 1.1 (as well as their proofs) are all valid (word for word) whenfaod IX are complex-valued functions. (In Theorems 7.2 and 7.3, the constants Cl and C1 may now be complex numbers.) In addition, we have the following theorem which, in effect. reduces the tbeory of complex Stieltjes integrals to the real case.
S:f
J.
_ 4P,f)
<
M-
+
r
b-
a.
holds,
SOfE
on [a, b).
condition
NOfE. A property is said to hold a/most everywhere on a subset S of Rl if it holds everywhere on S except for a set of measure O. Thus, Lebesgue's theorem states
r a: r
bJ.
I
dll =
if2 and
C!l
+
j
i«2
be complex-valued
Then we have
-
11 dal
Iz
d(2)
(f 12
d~1
functions
II dal)
174 The proof of Theorem 7.50 is immediate from the definition and is left to the reader. The use of this theorem permits us to extend most of the important properties of real integrals to the complex case. For example, the connection between differentiation and integration established in Theorem 7.32 remains valid for complex integrals if we simply define such notions as continuity, differentiability and bounded variation by components, as with vector-valued functions. Thus, we say that the complex-valued function tX Q!1 + ia.2 is of bounded variation on [6, b] if each component (Xl and !Xl is of bounded variation on [0, b]. Similarly. the derivative (X'(t) is defined by the equation a'(t) = ai(/) + iiX2(t) whenever the derivatives a:i(t) and ajJt) exist. (One-sided derivatives are defined in the same way.) With this understanding, Theorems 7.32 and 7.34 (the fundamental theorems of integral calculus) both remain valid when f and II are complex-valaed, The proofs follow from the real case by using Theorem 7.50 in a straightforward
and hence U(P.f) < 1+ e. Similarly, L(P'f) Hence IS(P,f) -
175
II <
if
>1
iqPll
-e 1/
For x
=
2!:
O. define
r-l
where [x] is the greatest integer in x and empty sams are interpreted as zero. Let {have a continuous derivative in the interval I :s; x :s; a. Use Stieltjes integrals to derive the. following formula:
E a,.!(n) =
~~.
- [A(X)/'(X)
1
dx
A(a)/(a).
manner.
We shall return to complex-valued integrals in Chapter 16, when we study functions of a complex variable in more detail.
7.6 Use Euler's summation formula, or integration by parts in a Stieltjes intc:gra1, to derive the fonowing identities: I rt-1 b)
+ $ Jl
I
t'
[xl
x'+l
dx dx
if, '" 1.
1-1
• :E -k1 = log n -
i"x-
--2~
{x]
1.
I! dlr(x)
Il(b)
Il(a).directly from
Definition 7.1.
is monotonic on [a, b],
7.Z If fE R(a:) on [a, b] and if J!I tk ... 0 for every {which prove that III must be constant on [a. b 1-
2nJ
7.3 The following definition of a R.iemann-StieltjOi integral is often used in the literature: We say lis integrable with respect t.o Ill: irthere exises a real number A having the property that for every Il > 0, there exists a {} > 0 such that for every partition P of [a. b] with nonn IIPII < 0 and for every choice of t1 in [Xl-I> XlI. we have IS(P,/. a) - AI < fl.
6
211
(-lll(k)
J.
t:
/'(x)([x}
- 2[x/2]) dx,
a) Show that jf j! , d", exists according to this definition, then it also existsaccordiqg to Definition 7.1 and the two integrals are equal. 0 for a ~ x < t:'/{x) ~ a(x) ." I for c < x ~ b.J(c) = O. a:(t) == 1. Show that tk exists according to Definition 7.1 but does not exist by this second definition.
"" a:(x)
ee
7.8 Let fll(X) = x - [x] - ! if x ~ integer. and let 1I'1(X) = 0 if x integer. Also, let "1(X) Jij '1(1) dt. If I" is continuous on [1. nJ prove that Euler's summation formula implies that
b) Let/(x)
$:{
i-Ill
:E '(Ie)
"i
=
'(x)
dx -
i"
fP2(x)/"(x)
dx
+ 1(1) + fen) 2
7.9 Take I(x) = log x in.Exercise 7.8 and prove that log n! 7.10 If x
;":!
7.4 IffE R accordiog to Definition 7.1. prove that S:I(x) dx also exists according to the definition of Exercise 7.3. [Contrast with Exercise 7.3(b).] Hint. Let I ""I!/(x) dx, M = sup {1/(x)1 : x E [a, b J}. Given e > 0, choose p. SO tbat U(P., f) < I + 8/2 (notation of Section 7.11). Let N be the number of subdivision points in p. and let ii = 8/(2MN). If IPII < 0, write U(P,f) =
(n
1) log
If -
If
+ (-
J.
'2!t)
t
lit.
L MA(f)
Ax, = Sl
+ S2'
.E I,
JI",X
where S 1 is the sum of terms arising from those subintervals of P containing no points of p. and Sl is tbe sum of the remaining terms. ;Then SI -s U(P.,f) <J
:s
X
z/2
and
81. ~
NMllPl1
116
This is usuaJly proved by studying a related function 8 given by
171 7.14 Assume IE R(gr) on [a, b 1, where « is of bounded variation On [a, b]. Let Vex) denote the total variation of /l on [II. x] for each x in (a, b J. and let Veal = O. Show that
8(x) ~
where again the sum is extended over all primes p s x. Both functions If and 9 are step functions with jumps at the primes. This exercise shows bow the Riemann-Stieltjes integra] can be used to relate these two functions.
.""
:E logp,
Ifldals
fill
dV
s MV(b),
a) If x :<!: 2, prove that n(x) and 9(x) can be expressed as the following RiemannSticl\jcs
becomes
integrals:
8(x) =
NOTE.
Jm
(;It
log' dx(t),
B(x) =
i"
I
1
I
3/2108
d3(t).
If ul
I(x)
:50
M(b - a),
1be lower limit can be replaced by any number in the open interval (1. 2),
b) If x
~ 2, use integfation by
7.15 Let {.z,,} be a sequence of functions of bounded variation on [a, b]. Suppose ~ exists a function a defined on [a. 6] such that the total variation of gr - «" on [a, b] tends to 0 as n -+ 00. Assume also that Cl(a) = a,.(a) = 0 for each n = 1.2, ... If lis continuous on [a, b], prove that
12
:~~ f
/(x)
dx..{x)
[/(X)
i'Al(x).
x(x)
8(x)
log X
+ [lI:
J2
7.16 If Ie R(<<),/l t R(<<), 9 e R(<<), and 92 e R(<<) on [a, b], prove that
These equations can be used to prove that the prime number theorem is equivalent to the relation lim" .. ", 8(x)/x = I. 7.11 If tV on [a. b J, prove that we have
2..
g(y)
g(X)1
d:z(y)l
a)
b)
f '"
rr r. sri r
=
= (a < e <
(1" th(X»)(f
/(X)2
dz(x)
g(x/
th:(X») -
(1"
I(x)g(x) dx(X)t
I· +
I..
b).
inequality
if +
g) •
c) [
if +
g) tb.:<!:
1:
i
d«
9 ..
I th. + [
9 th:.
7.1l Give an example of a bounded function I and an increasing funaion c defined on [a. b] such that IIIEO R{«) but for which I! I. does not exist 7.13 Let 0; be a continuous function of bounded variation on [a, b). Assume g e R(o;)
on [a, b) and define P(x) =
i[ [f u:» -
I E R(gr), 9
E R(fl). and
Show that
I(x»)(g(y) - g(x»
th:(y)]
I!g(t)
I dfJ
= /(a)
..
r
..
9•
+ I(b)
i~
;;'0-
U:
/(x)
9 ...
b) If. in addition, lis continuous on [a, b]. we also have f(x)g(x) dx(x) = I(a)
9•
+ 1(6)
J~
(J:
/(x)
d«(X»)(f
g(x)
g ••.
when both land 9 are increasing (or both an: decreasing) on [a, b]. Show that the reverse inequality holds if I increases and 9 decreases on [a, b],
178
~bmgraIs 7.18 Assume fER
OD
I"
[a, b]. Use Exercise 7.4 to prove that the Umit lim b .... '" "
c) Usc (b) to prove tho folJowiq tIJcm:an of L. F..,: The number of ehanp in sign of 1m [0. a] is not Jess than the mmtber of c:hanaes in sisn itt the ordered set
1(0).
ai:J(a
:t~1
k b - "')
f(t)
dt'l
•
t/(l)
41.
• •• •
1"/(1) lit.
I!/(x)
(f/(x'!
dx)l/ll .. M.
if x is rational, if x is irrational.
7.1' Define
7..25 A function! of two real wri.a.bIes ia defined. for each point (x. y) in the unit square :s; x s: 1. 0 s )l :s; I as foRows:
a) Show that g'(x) + /,(x) "" 0 for all x and deduce that g(x) b) Use (a) to prove that
+ f(x)
= 71/4.
a) Compute
JJ
7.20 Assume 9
integral
I:
R on [a. b] and define I(x) = J! g(l) dt if x Ig(t)1 tit gives the total variation of I on [4, x].
E
e) Let F(x)
fA !(X, Y) dy.
.s
rOl' each
of y.
t s 1.
ro I(x, y) dy in tcnns
2-·, then/(x) "" 2-·,
Show that
fA F(x)
7.2.1 Let f = (11)' . .fJ be a w:ctor-valued function with a continuous derivative f' on [a, h). Prove that the curve described by f has length A,(a" b) =
7311 Let/be defined on [0, I] as foIlows:/(O) = 0; iU-·-1 for 11 = O. 1,2. ... a) Give two
I'C8!IOM why
<X
S;
ro I(t) dt.
.... 21
IA /(x) dx exists.
F(x)
s:
1 we have
xA(x) - tA(x?
73:1. If/(Hll is continuous on [a. z], define l.(x) = n! • (x - 1)"/0.+1)(1) dt. I a) Show that 1~_I(X)- ll(x) = .fkJ(Q~~ - a>", k = 1, 2, ...•
II.
r[-qJrI
7:r1 Assume I has a derivative whid1 is monotonic III > 0 for all x in [a, 0]. Prow that
decr'easinlI:
If
Hint. Multiply and divide
7.28 Given a dcacasing &eqUCllCCof Define &function/on [0. I] in terms I(x) = 0; if x is the rational mJ,. oscillation w_,.(x) at each x in [0. I]
cos/(x)
thel s ~.
b) Usc (a) to express the remainder in Taylor's formula (Theorem 5.19) as an integral. 7.23 Letlbe continuous on [0, aJ. If x
E
em
I. +1(x)
... -1
n!
IX
0
mil numbers {G{n}} such that G(II) ~ 0 as ,. .... eo, of (G(II)} as foIlows:/(O) = 1; if x is iIrational, tbc:n lowest teIms). thmf(m/n) = G(n). Compute the and show that Ie R on [0. I].
(x - t)"f(t) dt,
0, I, 2, ...
a) Show that the 11thderivative of I. exists and equals! b) Prove the following theorem of M. Fekete: The number of changes in sign of 1 in [0, 0] is not Jess than the number of changes in sign in the ordered set of
7.l9 Let/be defined as in EKrcise 7.28 with G(n) = lIn. Let g(x) == 1 if 0 -c x S; I, g(O) O. Show that the composite function" dc6ned by h(x) = g[f(x)] is not Riemannintegrable on [0, 1], although both I € R and g e R on [0. I],
7.30 Use Lebesgue's theorem to prove Theorem 7.41). 7.31 Use Lebesgue's theorem to pJ'O'iC that jf / e R and g m > 0 for all x in [D, b], then the function h dcOned by
/(x) = I(x~)
E
numbers
is Riemann-integrable
on [a, h].
..,
7.31 Let 1 = [0. I} and let AI = 1 1) be that subset of 1 obtained by removing those points whlch He in the open middle thinl of 1; that is. Al = [0. t) v 1J. Let A;I. be that subset of Al obtained by removing the open middle third of [0, tJ and of K. 1). Continue this process and define .43 .4... . .. The set C = n:'l A" is called the • Cantor set. Prove that:
lU
<t.
e.
Existenee tbeoRms for Integral_ dift'creidiaI .,....tionB The following exercises illustrate bow the fixed-point theorem for centractions (Theorem 4.48) is used to prow emte:nce theorems (or solutions of amain intcgIal and differential equations. We denote by C [a, b] the metric space of all real continuous fuoctioos on [a, b] with the rnetJ:i<:
a) C is a (:(Jrnpac:t set having measure zero. b) x e C if, and only if, JC = l:..";1 0.,3-", when each a,. is either 0 or 2. c) C is uncountable. d) Let {(x) = 1 if x
E C,/(x)
and recall that C [a. b] is a complete metric space (Exercise 4.67). Let /(x)
7.33 This exercise outliDcs a proof (due to lvan Niveo) that x"(l - x'fln!. Prove that: a) 0 < {(x) < l/nl Now assume that if 0 < x < 1. . and{oo(1) is an integer. b) Each ktb dcrivative{Ct)(O)
r is irrational.
7.36 Given a funetien g in C [a, b J. and a functioo K continuous 00 the rectangle Q = [a, b] x [a, b). consider the function. T defined on C[a, b J by the equation T(V)(x) = o(x)
wbeIe
+ 1[
F(x) =
..
Prove that: c) F(!) and F(I) are integers. d) ,rti'/(x) sin roc
b) If IK(x,y)1 s Mon Q. where M::> 0, and if P.I < M-l(b -:- a).-I. pro~ tbat T is a contraction of C [a, bJ and bencc has a fixed point , which IS a solution of the integral equation (l(x) = g(x) + 1 K(x, t)f'{t) dJ.
I:
= !.. {F(x)
dx
e) P(l) +
F(O) = 7fti'
f n»
It) x [b - k, b
k],
a) Let " be a. function, continuous on [a - It, a + It], such that (x, (l(x») e Q for all x in [a - h. a + II]. If 0 -c c ~ II., prove that Ip satisfies the differential equation y' y) on (a e, a + c) and the initial condition fl'{a) = b if, and only if. II satisfies the integral equation
n»;
to deduce that 0 -c F(l) + F(O) < 1 if n is sufficiently large. This contradicts (c) and shows that (and hence n) is irrational.
,(x) = b
f(t. ,,(t») dJ
on
(12 - c. a
c).
7.34 Given a real-valucd function a, continuous 00 the interval [a, b] and having a finite bounded derivative II: 00 (a, b). Let/be defined and bounded on [a, b] and asswne that both integrals
!(x) dx(x)
and
I(x) rr'(x) dx
/I.'
b) A&sume that. I/(x, )1)1s Moo Q. where M> O. and let c "" min {Ir. kiM}. Let S denote the metric subspace of C [a - e, a + e J consisting of all II such that If'{x} - bl ;s Me on [0 C, Q + c). Prove that S is a closed subspace of C [a - c, a + e] and hence that S is itself a complete metric space.
c) Prove that the function Tdefined on S by the equation T(9')(x) b
exist. Prove that these intcsrals are equal. (It is not assumed that
is eontiauous.)
7.35 Prove the following theorem" which implies that a function with a positive integral must itself be positive on someint.erval. Assume that/"" Ron [a, b] and that 0 :5; /(x) s M on la, b], where M ::> O. Let 1 = I:{(x) dx, let h = YI(M + b - a), and assume that 1 ::> O. Then the set T = {x :/(x) ~ h} contains a finite number of intervals, the sum of whose lengths is at least It. Hint. Let P be a _partition of [a, b) such that every Riemann sum S(P.f) = D_l1(tJ &xl satisfies S(P./) ::> 1/2.. Split S(P,/) into two parts, S(P,!) = k + :Et..a. where A = {k: [Xl_I' Xi]
LuAx,,::>
!; T}.
[(t.
,,(t»
dJ
maps S into itself. d) Now assume that 1satisfies a Lipschitz condition of the form
I/(x, y) - I(x,
%)1 ~
AIY -
zl
and
{k: k ¢ A}.
<
n.
Deduce that
for overy pair of points (x. y) and (x. z) in Q. where A > o. Prove that T is a contnldion of S if" < IIA. Deduce that for II < 1/.4 the differetltial equation y' = /(x.)') has exactly one solution )' = tp(x) on (a - c. Q + c) such that f(a) = b.