Professional Documents
Culture Documents
KpacHOB,
BhICWeH MaTeMaTHKH A. Kucenea,
~JUI HH)KeHepoB
r. MaKapeHKO,
E.IIIBKBH
Mir Publishers
Moscow
Translated from Russian
by Alexander Yastrebov
Ha aH2JlUUCKOM Jl3blKe
Preface 9
Chapter 1 An Introduction to Analytic Geometry 11
I.I Cartesian Coordinates 11
1.2 Elementary Problems of Analytic Geometry 14
1.3 Polar Coordinates 18
1.4 Second- and Third-Order Determinants 19
Chapter 2 Elements of Vector Algebra 24
2.1 Fixed Vectors and Free Vectors 24
2.2 Linear Operations on Vectors 26
2.3 Coordinates and Components of a Vector 30
2.4 Projection of a Vector onto an Axis 33
2.5 Scalar Product of Two Vectors 34
2.6 Vector Product of 1\vo Vectors 39
2. 7 Mixed Products of Three Vectors 43
Exercises 45
Answers 46
Chapter 3 The Line and the Plane 47
3.1 The Plane 47
3.2 Straight Line in a Plane 51
3.3 Straight Line in Three-Dimensional Space 55
Exercises 60
Answers 62
Chapter 4 Curves and Surfaces of the Second Order 63
4.1 Changing the Axes of Coordinates in a Plane 63
4.2 Curves of the Second Order 66
4.3 The Ellipse 67
4.4 The Hyperbola 71
4.5 The Parabola 77
4.6 Optical Properties of Curves of the Second Order 79
4.7 Classification of Curves of the Second Order 83
4.8 Surfaces of the Second Order 89
4.9 Classification of Surfaces 90
4.10 Standard Equations of Surfaces of the Second Order 95
Exercises 102
Answers 102
6 Contents
The Authors
Chapter 1
An Introduction to Analytic Geometry
a
0 L
Fig. 1.2
X
L
0 M 0 L1
Fig. 1.3 Fig. 1.4
Definition. The axis L with the reference point O and the unit distance
a given on it is called the coordinate axis; the number x as defined above
is said to be the coordinate of M.
In symbols, we write M(x) to designate a point M whose coordinate is x.
Cartesian coordinates in a plane.. Let O be a point in a plane. We draw
through O two mutually perpendicular lines L1 and L2. Let us choose a
direction for each line and a unit distance a which is the same for L1 and
L2. Then L1 and L2 become coordinate axes with a common reference point
0 (Fig. 1.4).
12 1. An Introducti_onJp Analytic Geometr
~-------------
We call one of the coordinate axes the x-axis or the axis of abscissas
and the other one the y-axis or the axis of ordinates (Fig. 1.5). The point
0 is called the origin of coordinates.
Let M be a point in a plane as shown in Fig. 1.6. We drop from M
two perpendiculars onto the coordinate axes, the points Mx and My being
the projections of Mon the x- and y-axes and associate with M an ordered
pair (x, y) of numbers so that x is the coordinate of the point Mx and
y is the coordinate of the point My.
y
---- -- ---,M
I
I
I
I
I
I
I
0 X
0 .x
Fig. 1.5 Fig. 1.6
!J
I II Ill IV
II I
.x + - - +
.x
Ill IV y + + - -
Fig. 1.7
z
a
r
Fig. 1.8 Fig. 1.9
My
/ y
/
.. ·... /
/
/
-------·~
Fig. 1.10
Y+
Y2
----- ,- -- - - - - M2
I
I
I
I
I
Ml
- - __ 1 M
iJ,
0 ..r, r2 X
Fig. 1.11
lj
!J
...
.....
p /\,/
.. ;At
/ \ ..... ..
/
/
\ ..
... Fi 0 F,, .. X
..... . .. ...
0 :r
Whence
✓ (x + c) 2 + y 2 + ✓ (x - c) 2 + y 2 = 2a.
Transpose the second radical to the right:
✓ (x + c)2 + y 2 = 2a - ✓ (x - c )2 + y 2 •
16 1. An Introduction to Analytic Geometry
fj
M2
M
I
I I I
I I I
I I I
I I I
0 X
Fig. 1.14
◄ Suppose that the segment is not parallel to the y-axis (Fig. 1.14). Then
IM1MI IM1xMxl
IMM2I IMxM2xl ·
Since IM1xMxl = lx1 - xi and IMxM2xl = Ix - x2I we have
lxt - xi At
Ix - x2I = A2 ·
____ 1.2 Elementary Problems of Analytic Geometry 17
The point M lies between M1 and M2. Hence, there holds either
x1 < x < x2 or x1 > x > x2. This implies that the differences xi - x and
x - x2 are always of the same sign. Thus we may write
X1 - X At
X - X2 - A.2 .
Hence
x=
When the segment M1M2 is parallel to the y-axis, we have x1 = x2 = x.
Notice that this result immediately follows from (*) if we set x1 = x2.
Similar reasoning yields
= A2Y1 + A1Y2 ►
y - "-1 + "-2 •
Fig. 1.15
◄ Recall that in any triangle the centre of gravity and the point of inter-
section of medians coincide so that M divides each median in the ratio
2: 1 reckoning from the corresponding vertex. Thus, the coordinates of M
are
- lX3 + 2x I d
. ly3 + 2y I
x - 2+ 1 an y = 2+ 1 '
where x' and y' are the coordinates of the point M' of the median M3M'.
Since M' is the mid-point of M1M2, we have
, lx1 + lx2 d , _ ly1 + ly2
x= 1+1 an Y - 1+1
Substituting these relations into the formulas for x and y, we arrive
at the desired result
X= and y = Yt + Y2 + YJ ►
3
Remark. Let M(x, y, z) divide a segment joining M1(X1, Y1, z1) and
M2(x2, Y2, Z2) in the ratio At : A2. Then
X=
0
Fig. 1.16 Fig. 1.17
The ordered pair (r, cp) of numbers represents the polar coordinates
of M. The numbers rand cp are called the polar radius and the polar angle,
respectively. ,.,
The point O is referred to as the pole and the axis L as the polar axis.
It is clear that r > 0 and O ~ cp < 21r.
When M coincides with the pole, r = 0. In this case the polar angle
is not defined.
Therefore we may set another coordinate system in a plane, namely the
polar coordinate system.
The Cartesian coordinate system is said to be compatible with a given
polar system if the origin O is the pole, the x-axis is the polar axis and
the y-axis makes an angle of +; with the x-axis (Fig. 1.18). Then the
relations between the Cartesian coordinates and the polar coordinates are
given by the formulas:
cos cp =x and sin cp =Y ,
or r r
x = rcos cp and y = rs1n cp,
where r = ✓ x2 + y 2 •
1.4 Second- and Third-Order Determinants 19
R
0
X X
In symbols, we write
+
' /.
Fig. 1.20
20 1. An Introduction to Analytic Geometry
- l 3 = - 4+6 = 2. ►
-2 4
We encounter the second-order determinants when dealing with systems
of linear equations in two unknowns
[
a11X + a12Y = bi
a21X + a22Y = bi
x - -- -bia12
= -b1a22 - - and y = -bia11
- -- -b1a21
--
aua22 - a12a21 a11a22 - a12a21
or, as quotients of determinants,
bi a12 au b1
bi a22 a21 b2
X=----- and y=----
a11 a12 a11 a12
Now we shall turn our attention to some properties of the second- and
the third-order determinants easily verified by applying formulas (1.1) and
(1.2).
Properties of determinants. (1) The value of the determinant is un-
changed if the rows and the columns are transposed, i.e.,
au a12 au a21
021 022 a12 a22
au 012 013 au a21 031
a21 a22 023 012 a22 032
031 032 033 013 023 033
(2) The determinant reverses its sign if any two rows (or any two
columns) are interchang~d.
~2- ._. __ 1: _A_n .!~!!Od11ction to Analytic_ Geometry-··- _________ _
(3) The factor common to all elements of some row (or some column)
may be taken outside the determinant, i.e.,
ka11 ka12 ka13 a11 ka12 a13 au a12 013
a21 a22 a23 a21 ka22 a23 =k 021 022 023
031 032 033 031 ka32 033 031 032 033
M 23 = a11 a12
•
a31 a32
hold.
By way of illustration we show that
A = a11A11 + a12A12 + a13A13.
1.4
.
Second- and Third-Order
.
Determinants 23
= a11
Formulas (1.3) and (1.4) represent the expansion of the determinant with
respect to the ith row and the expansion of the determinant with respect
to the jth column, respectively.
Example. Compute the value of
1 0 -1
d= 2 4 3.
3 -1 6
◄ Expanding the determinant with respect to the first row, we get
d=l 4 3 _ 0 2 3 _ 1 2 4
-1 6 3 6 3 -1
= (24 + 3) + 0 + (2 + 12) = 41. ►
Chapter 2
Elements of Vector Algebra
D
I
I
I
I
I
I
---+---
B
----+- B ~
B
AB BA
A
A A
Fig. 2.1 Fig. 2.2
--+ -
Definition. The fixed vectors AB and CD are said to be equivalent if
the mid-points of the segments AD and BC coincide (Fig. 2.2).
In symbols, we write AB= CD to signify the fact that AB and CD
are two equivalent fixed vectors.
---+ ---+
Notice that when AB and CD do not lie in the same line, we may in-
troduce a definition equivalent to the given above, namely AB and CD are ·
2.1 Fixed Vectors and Free Vectors 25
1 ,
A B
Fig. 2.4
Fig. 2.3
Fig. 2.5
CD= AB.
Hence, to any point we may apply a fixed vector equivalent to the given one.
Now we shall turn our attention to free vectors, i.e, to vectors which
may be made to start at an arbitrary point. In other words, a free vector
is a vector which may be moved rigidly parallel to itself.
We clearly see that a given fixed vector AB uniquely defines a free vector
which starts at A and ends at B.
Free vectors whose initial points lie on the line determined by a given
(nonzero) fixed vector are thought of as sliding vectors (Fig. 2.5).
Fixed vectors and sliding vectors are widely used in theoretical
mechanics.
26 2.··-Elements
- . --·
of Vector
- ~- . --
Algebra
Fig. 2.6
B
I
I
I
I
I
I
I
I C
0 a A
Fig. 2.13
a3 a,
\
\
\
\
a
a"' a,
\
\
\
as a6 a4
· • - - ~.~----1
. ►~ ◄•~-- ■8►
....•------..
0 A 8 A 0
z z
y y
~ ~
z z
R
----------71 M
// I
-------- A I
I I
I I
I I
I IQ
y
I /
___________j/
a= b iff [;: ;: :
Zl = Z2.
The position vector of a point M(x, y, z) is the vector r = xi + yj + zk
which starts at the origin O of coordinates and ends at M(x, y, z)
(Fig. 2.21 ).
Linear operations on vectors specified by their coordinates. Let there
be given two vectors a = {x1, Y1, z1) and b = {x2, Y2, z2) so that
a = xii + Yd + z1 k and b = X2i + y2j + z2k. Then, by laws of addition we
get
a + b = (xii + Yd + z1k) + (x2i + Y2i + z2k)
= (x1 + X2)i + (Y1 + Y2)j + (z1 + z2)k
~~------)-_. Elements _Q..:f..:Ve_ctor AlgeQ_@_ ----·--------
or
a +b = (x1 + X2, Yt + Y2, Zt + Z2}.
Hence, addition of vectors becomes addition of their corresponding coor-
dinates.
Similarly, we have
a - b = (X1 - X2, Yt - Y2, Zt - Z2}.
Also
X.a = AX1i + X.yij + AZ1k
or
~ = (AX1, A.Yi , AZ1 }.
Hence, multiplication of a vector by a scalar reduces to multiplication of
all the coordinates of the vector by the scalar.
Let a = (X1, Y1 , z1 } and b = (x2, Y2, z2 } be collinear vectors, and b '# 0.
Then a = µ,b, i.e.,
X1 = µ,x2, Yi = µ,y2, z1 = µ,z2
lj
Fig. 2.22
or
X1 YI Z1
-=--=- (2.3)
X2 Y2 Z2
◄ From Fig. 2.22 it follows that M1M2 r1, where r1 and r2 are = r2 -
position vectors of M1 and M2, respectively. Thus
M1M2 = {X2 - Xi, Y2 - Y1, Z2 - Zl }.
Hence, we conclude that the coordinates of M1M2 are equal to the differ-
ences of the respective coordinates of M2 and M1. ►
B
\
\
\
\
\
\
\
\
\
D
B
\
. . . . . . >,
...... \
\
A \
\
Fig. 2.25
3-9505
34 2. Elements of Vector Algebra
Fig. 2.26
(2) The projection of the sum of vectors onto an axis I is equal to the
sum of the projections of the vectors onto the axis.
For example, from Fig. 2.26 if follows that
proj/(a + b + c) = proj, a + proj, b + proj, c.
Properties of the scalar product. (1) The scalar product of a and b van-
ishes if and only if a and/or b are zero vectors, or if a and b are perpendicu-
lar (written a .1. b).
◄ This immediately follows from (2.4). ►
Since the direction of the zero vector is not defined we may set it perpen-
dicular to any vector. So we may write the property in question as
a .1. b iff a•b = 0.
Fig. 2.27
since the angles (a,b) and (Aa,b) are equal (Fig. 2.28).
We proceed similarly when A< 0.
The case of A = 0 is trivial. ►
Remark. In general
(a•b)•c ~ a•(b•c).
Fig. 2.28
(2.10)
Similarly,
YI a•j
cos {3 = ✓ 2 2 2 = Tai' (2.12)
XI + YI + ZI
ZI a•k
= ✓ 2 (2.13)
cos 'Y
XI + YI
2
+ ZI
2 lal .
Formulas (2.11)-(2.13) define the direction cosines of the vector a, i.e.,
the cosines of angles between a and the coordinate axes (Fig. 2.29).
y
z
y 0 X
Fig. 2.31
a•b
Fig. 2.32
40 2. Elements of Vector Algebra
Fig. 2.33
a•b
Fig. 2.34
Fig. 2.35
i j k
a X b = 1 1 - 1 = i ·0 - j · 1 + k · ( -1) = - j + k.
2 1 -1
Whence
So = ✓1 + 1 = "2. ►
(2) Compute the ai;-ea of the triangle OAB shown in Fig. 2.36.
Fig. 2.36
◄ We clearly see that the area SA of OAB is half the area So of the
parallelogram OACB. Computing the vector product of a = 6A and
b = OB, we obtain
i j k
a xb= X1 Y1 0 = (X1.Y2 - X2Y1)lk.
X1 Y2 0
Whence
2. 7 Mixed Products of Three Vectors
Fig. 2.37
A
(b)
Fig. 2.38
44 2. Elements of Vector Algebra
where S is the area of the parallelogram OADB, e is the unit vector perpen-
dicular to both a and b so that the ordered triple of vectors (a, b, e) is
right-handed, i.e., the thumb, the index finger and the middle finger of
the right hand point along the directions of a, b and e, respectively.
Computing the scalar product of a ,c b and c, we obtain
(a ,c b)•c = (Se)•c = S(e•c) = Sproje c. )
The number S proje c is equal to the positive height h of the parallele-
piped if the angle 'P between e and c is acute, i.e., if the ordered triple of
vectors (a, b, c) is right-handed, and to the negative height if the angle
'Pis obtuse, i.e., if (a, b, c) is the left-handed ordered triple of vectors, so that
(a ,c b) · c = ± Sh = ± V.
Therefore the scalar triple product of a,. b and c is numerically equal
to the volume V of the parallelepiped with edges a, b, c when the triple
(a, b, c) is right-handed, and to minus V when the triple (a, b, c) is left-
handed.
The geometry reveals that multiplying the three vectors in any other
order, we obtain either + V or - V.
Notice that when the triple (a, b, c) is right-handed, both (b, c, a) and
(c, a, b) are also right-handed, while all the triples (b, a, c), (a, c, b) and
(c, b, a) are left-handed. Thus
(a ,c b)•c = (b ,c c)•a = (c ,c a)·b
= - (b ,c a)· C = - (a ,c C) • b = - (C ,c b) •a.
Once again we shall emphasize the following: the scalar triple product
of three vectors a, b, c is equal to zero if and only if a, b and c are coplanar.
In symbols,
a, b, c are coplanar iff (a ,c b)•c = 0.
i j k
a ,c b = x1 Y1 z1
X2 Y2 Z2
Whence
(axb)·c
Xi Y1 Zi
Yi Zi Xi Zi X1 Y1
= X3 - Y3 + Z3 X2 Y2 Z2
Y2 Z2 X2 Z2 X2 Y2 Z3
X3 Y3
Thus the scalar triple product of three vectors specified by their coor-
dinates with respect to the orthonormal basis (i, j, k) is equal to the third-
order determinant whose rows are coordinates of the vectors being mul-
tiplied.
The necessary and sufficient condition for three vectors a =
{x1, Yi, zi), b = {x2, Y2, z2} and c = {X3, y3, z3} to be coplanar may be
written as
X1 Yi Zi
X2 Y2 Z2 = 0.
X3 Y3 Z3
Answers
--
1. AM = c + -a or AM
-- = -
c-b -- = a + -b
- · BN or BN-- = - a-c
- ; Cr =t- =
2 2 ' 2 2
b + ~ or CP = b ; a . 2. IPI = lql since the diagonal of .the parallelogram bisects the
Fig. 3.1
Suppose that an arbitrary point M moves in the plane 1r. Whatever the
path of moving, the position vector r = OM varies so that the relation
. _..
prOJn°0M = p (3.1)
remains valid unless M leaves the plane. In other words, relation (3.1)
describes a property peculiar to all points of the pla~ i.e., relation (3.1)
is the equation of a plane. Observing that projn° OM= r•n°, we may
write (3.1) as
r•n° - p = 0. (3.2)
Equation (3.2) is called the normal vector equation of a plane. The posi-
tion vector r is sometimes referred to as a moving position vector.
48 3. The Line and the Plane
--------------------
ma! vector to this plane. This implies that the vector n = {A, B, C} is
a vector normal to the plane (3.4). Hence, A, Band Care easy to interpret
as coordinates of the vector n = {A, B, C} normal to the plane (3.4) (see
Fig. 3.2). Any other normal vector is obtained by multiplying n by a non-
zero scalar.
!I
Fig. 3.2
Fig. 3.3
4-9505
50 3. The Line and the Plane
\
\
\
\
)
/
/
/
/
/
Fig. 3.4
We easily see from Fig. 3.4 that one of the two dihedral angles is equal
to the angle 'P between the vectors n1 = (A1, B1, C1} and n2 =
{A2, B2, C2 }, whence we have
cos 'P
n1 •n2 + B1B2 + C1C2 A1A2
= ----,----,-----,---,- = ----;:=:;::==:;.::::=::::;:::---;::=:;:::==;:==::;:::• (3.10)
ln1I · 1°21 ✓Ar+ Bi+ C7 .-✓Ai+ Bi+ a
Conditions of perpendicularity and parallelism of two planes. If two
planes are perpendicular to each other, their respective normal vectors are
also perpendicular. Hence, there holds
81 • D2 =0
or
A1A2 + B1B2 + C1 C2 = 0.
This is the condition of perpendicularity of two planes.
3.2 Straight Line in a Plane 51
If two planes are parallel, their respective normal vectors are collinear,
i.e., ni = M2. Expressing this identity in terms of coordinates of ni and
02, we obtain
IJ:lc%+b
katan a
2. The distance between the point M* and the line L is the length of
the perpendicular segment M*N dropped from M* onto L (Fig. 3.10).
Let M*(x*, y*) be a given point and
x cos <P + y sin '/J - p =0
be a normal coordinate equation of L. Then the distance between M* and
Lis
Fig. 3.11
Fig. 3.14
x- Xo Y -Yo z - 7A>
I = t, ---= t, - - - = t.
m n
56 3. The Line and the Plane
Whence
x- Xo Y - Yo z- ZO
I m - n , (3.14)
where Xo, Yo, zo are the coordinates of Mo belonging to the line and I, m,
n are the direction numbers (/2 + m 2 + n 2 > 0).
Relation (3.14) is said to be the point direction equation of a line.
Any two equations obtained from (3.14), say
X - Xo Y - Yo Y - Yo Z - ZO
- - - - - - - and - - - -
! m m n '
define a line as a line of intersection of two planes.
Equation of a line passing through two given points. We wish to find
equation of a line passing through the points M1 (x1, Y1, z1) and
M2(X2, Y2, z2). Suppose we are seeking for the point direction equation of
a line. So we need to know the coordinates of any point belonging to the
line and the direction vector. Let us select M1 (x1, Y1, z1) as the point we
---.
need and define the direction vector as M1M2 = {x2 - X1, Y2 - Y1,
z2 - z1}. Then, the desired equation is
X - X1 Y - Yt Z - Zl
(3.15)
x-1 y z+l
-- ►
2- =
. -
1 = - 2- - .
General equation of a line. Reducing the general equation to the point
direction equation of a line. A line in space may be determined as a line
of intersection of any two distinct nonparallel planes.
Let two distinct nonparallel planes be specified by the equations
A1x + B1y + C1z + Di = 0 and A2x + B2Y + C2z + Di. = 0.
Then the system of the two equations
[
X - y +Z- 3 = 0, (3.17)
X + y - 2z + 1 = 0
to the point direction equation of a line.
◄ We find a point belonging to the line (3.17). Setting, for example, z = 0,
we arrive at the system
[
X - y = 3,
x+y=-1.
Whence, x = 1 and y = -2. Thus Mo(l, -2, 0) belongs to the line. ►-
The normal vectors to the planes are n1 = {1, -1, 1 } and
n2 = {l, 1, -2). Then the direction vector of the line is
i j k
S = n1 ,c n2 = 1 - 1 1 =i + 3j + 2k.
1 1 -2
Hence, the point direction equation of a line specified by the system
(3.17) is
X - 1 y +2 z
----=- ►
1 3 2·
Angle between a line and a plane. Let there be given a line
X - Xo _ Y - Yo _ Z - Zo (3.18)
I m n
and a plane
Ax + By + Cz + D = 0. (3.19)
58 3. The Line and the Plane
The angle 'P between the line and the plane is the smallest angle between
the line and its projection onto the plane (Fig. 3.15).
Let a be an angle between the line (3.18) and the vector normal to the
plane (3.19). Then
n•s
cos a = lnl • Isl '
where s is the direction vector of the line (3.18) (see Fig. 3.16).
Fig. 3.15
When the line (3.18) is parallel to the plane (3.19), t~e direction vector
s of the line is perpendicular to the vector n normal to the plane so that
n •s =0
or
Al + Bm + Cn = 0. (3.21)
- n
/
/
Fig. 3.16
3.3 Straight Line in Three-Dimensional Space 59
Exercises
(The angle between two given lines is that between any two lines which
pass through an arbitrary point in space and are parallel to the given lines.)
8. Reduce the equations of the line
2x - 3y - 3z + 9 = O,
[
X - 2y + Z + 3 = 0
f 2x - y + 3z - 1 = 0,
lsx + 4y - z - 7 = o.
10. Write the equation of the line which passes through the point
(1, -1, 0) and is perpendicular to the plane 2x - 3y + 5z - 7 = 0.
. o f (a) t he 1·1ne x - 7 = y - 4 = z - 5
. o f.1ntersect1on
11• F'1nd t he point 5 1 4
15. Write the equation of the plane which passes through the line
x - 3 _ Y + 4 _ z - 2 and is parallel to the line x ~ 5 -
2 1 -3
y-2 z-1
7 - 2 •
62 3. The Line and the Plane
Answers
LOOy+S=~OOx+~=Q1~-~+~-~=Q100x-~+~+
15 = O; (b) 2x - y - z = ~ 4. 3x - 6y + 2z - 49 = O. 5. 4x - y - 14z = 0.
6• x = yb = -z . 7• cos 'P = -
72. 8. -x = -y = -
5
Z +
-3; provi'ded t hat t he point
. 1ymg
. on t he
0
C 77 9 -1
x-2
line is (0, 0, ;_ 3). 9. (a) 0 = y+S
0 = z-3
1 or
[x-2=0
Y + s = 0' (b)
x-2
4 =
y + S = Z - 3 . (c) X - 2 = y + S = Z - 3 . lO. x - 1 = y + 1 = :_ .
-6 9 -11 17 13 2 -3 S
11. (a) (2, 3, l); (b) the line is parallel to the plane. 12. 4x + Sy - 2z = 0. 13. (S, -1, 0).
14. 1~ - 27y + 14z - 159 = 0. 15. 23x - 16y + 10: - 153 = 0. ·
Chapter 4
Curves and Surfaces of the Second Order
0 X
Fig. 4.1
O' I I'
0 I
Fig. 4.2
64 4. Curves and Surfaces of the Second Order
:..:.:....=..:...:=-:...=-:..=.=:......=;.;=~---------·--
00' = ai + /3j,
where a and /3 are the coordinates of the origin 0' on the x- and y-axes,
respectively.
Since
r= r' + 00',
we obtain
xi+ yj = (x'i + y'i) + (ai + {3j).
Whence
x = x' + a,
y = y I + {3.
Rotation of the axes of coordinates. Suppose that the x ' - and y '-axes
are obtained by rotating the x- and y-axes through an angle 'P so that the
origins of the two Cartesian coordinate systems coincide (Fig. 4.4).
We find coordinates of the unit vectors i' and j ' relative to the Cartesian
coordinate system 0xy (Fig. 4.5). It is easy to see that the coordinates of
the unit vector i' are the cosines of the angles 'P and ; - 'P made by the
unit vector i ' with the x- and y-axes, respectively. Thus, we may write
i' = i cos 'P + j sin 'P·
4.1 Changing the Axes of Coordinates in a Plane 65
Similarly, the coordinates of the unit vector j' are the cosines of the
angles cp + ; and cp so that
j' = -i sin cp + j cos cp.
xi + yj = X I i' + y j ''
I
then substituting the expressions for i' and j ' into the above identity, we
obtain
xi + yj = x' (i cos cp + j sin cp) + y' ( - i sin cp + j cos cp)
= (x' cos 'P - y' sin cp )i + (x' sin cp + y' cos cp )j.
X
0 x,x 1
Whence
x = x' cos 'P - y' sin 'P,
y = x' sin cp + y' cos cp.
♦
I
I
I
y I
x'
--- ---------
0 X
Fig. 4.7
Any change from one Cartesian coordinate system to another one with
the same unit distance may be made using in succession the translation,
rotation and reflection of the axes (Fig. 4. 7).
y
y
X X
1
The equation
F(x, y) =0
is said to be the equation of a curve of the second order.
While curves of the first order are straight lines (and only straight lines),
the quadratic polynomial equation provides a variety of curves of the se-
cond order. So it is helpful to precede our study of the general equation
of a curve of the second order with an investigation of some important
specific plane curves of the second order.
where a ~ b > 0.
Equation (4.1) is called the standard Cartesian equation of the ellipse
and the associated Cartesian coordinate system is called the standard Carte-
sian coordinate system.
y
Fig. 4.10
The circle
x2 + y2 = a2 (4.2)
is an ellipse with a = b. This enables us to regard the ellipse (4.1) as a plane
figure obtained by uniformly compressing the circle (4.2) with coefficient
b/a towards the x-axis (Fig. 4.10). In other words, equation (4.1) of an ellipse
is obtained by substituting (alb )y for y in the equation x 2 + y 2 = a2 • (The
I '
'I
68 4. Curves and Surfaces of the Second Order
y
y
-a o a I
// 0
I I /
I I /
/
L- --~------=-=--
-b
- -J ..- /
(-xo,Yo)
Fig. 4.11 Fig. 4.12
(2) For the ellipse (4.1) the x- and y-axes of the standard Cartesian coor-
dinate system are the axes of symmetry and the origin O of coordinates
is the centre of symmetry. This means that when Mo(X-0, Yo) belongs to
the ellipse, so do the points ( -X-O, Yo), ( -X-O, -yo) and (X-O, -Yo) (Fig. 4.12).
(3) If a > b, that is, unless the ellipse is a circle, the x- and y-axes of
the standard Cartesian coordinate system are the unique axes of symmetry
of the ellipse (4.1).
Set c = ✓ a2 - b 2 • It is easy to see that c < a. The points ( - c, 0) and
(c, 0) are called the left-hand and right-handfoci of the ellipse. The distance
between the foci is equal to 2c.
(4) The ellipse is a locus of points of a plane such that the sum of
the distances of these points from two given points (from the foci of the
ellipse) is constant (is equal to a given number) (Fig. 4.13).
69
The distance from M(x, y) to the left-hand and to the right-hand foci
of the ellipse are
Q1= ✓ (x+c) 2 +Y 2 and Qr= ✓ (x-c) 2 +y 2 .
Fig. 4.13
By the substitution
y2 = b(1 - ~)
2
we easily obtain
QI=
c2 C C
- x2 +2cx+a2 - -x+a =a+ -x,
a2 a a
y
y I
I
I
I
I
I
I
I _g_
e
'al-e
I
-a a I X X
I
I I
I I
I I
I I
I I
Fig. 4.14 Fig. 4.15
◄ Let M(x, y) be a point on the ellipse (4.1) (Fig. 4.15). The distances
from M(x, y) to the right-hand focus and to the right-hand directrix are
a
er = a - ex and dr = - - X.
e
Whence we obtain the desired result
er
dr = e.
Similarly, we obtain
_g!_ = a + ex = e.
di a +x
e
Now we consider the point (c, 0) and the line x = ae (c = ae).
The distance from the point M(x, y) to the given point (c, 0) and. to
_______ 4.4 The Hyperbola ______ ·--···--··-- 71
.
t he given 1·1ne y = -a are
e
a
✓ (x - c) 2 + y 2 and - X '
e
respectively.
We require that
✓ (x - c)2 + y2
I: I -x
= e.
Then
✓ (x - c)2 + y 2 = la - exl.
Squaring the above identity and setting b 2 = a2 - c2 , we easily obtain
x2 y2
-a2+ -b2
= 1.
x2 y2
-2=l+-2~l.
a b
Therefore lxl ~ a for any point lying on the hyperbola. ►
The points (±a, 0) are the vertices of the hyperbola.
(2) The hyperbola (4.3) lies in the interior of the vertical angles formed
by the lines y = ±bx, the x-axis being the bisector of these angles
a
(Fig. 4.17).
72 4. Curves and Surfaces of the Second Order
a r
Thus the hyperbola contains two parts called the branches of the
hyperbola.
The lines
x + Y = 0 and x - Y = 0
a b a b
are called the asymptotes of the hyperbola.
(3) The hyperbola contains points infinitely distant from the origin 0
of coordinates.
◄ For example, let M(x, y) be a point on the hyperbola and let IYI = n,
where n is an arbitrary positive number (Fig __ 4.18).
0 a+b I X
n~
d(M, N) =~ (x - ✓x2 - a 2 ) ·
';:-,_
(-Xo, · · · · · · · · · · · · · · · · · · · · · · ·· , oJ
/
'- ' /
' /
/
X
/0'
//
/
'-
'
.......... · · · · · · · · · · · · · Yo)
~ ~
;,,
Fig. 4.20
The axes of the standard Cartesian coordinate system are the unique
axes of symmetry of the hyperbola.
Set c = ✓ a2 + b 2 • It is easy to see that c > 0. The points ( - c, 0) and
(c, 0) are called the foci of the hyperbola. The distance between the foci
is 2c.
(7) The hyperbola is a locus of points of a plane such that for any point
the absolute value of the difference of its distances to two given points
(to the foci of the hyperbola) is constant (is equal to a given number).
◄ The proof is similar to that of Property 4 for the ellipse. For example,
we shall show that each point of the hyperbola possesses this property. Let
M(x, y) be a point lying on the hyperbola (4.3) (Fig. 4.21). Then the dis-
tances from M(x, y) to the foci of the hyperbola are
Qi = ✓ (X + C) 2 + y2 - C
a+-x
a
and
Qr = ✓ (X - C) 2 + y2 - a-~+
4.4 The Hyperbola 75
.
S1nce -C > 1,
a
a + £ x if x ~ a,
a
Qt=
- a - £ x if x ~ - a
a
!:I
Fig. 4.21
and
C
-a+-x if X ~ a,
a
Qr=
C
a--x if X ~ -a.
a
Whence
if X ~ a,
QI - Qr=
[_: if x~ -a.
Therefore we may write
le, - erl = 2a. ►
The number e = £ is called the eccentricity of the hyperbola (4.3). It
a
is easy to see that e > 1.
The lines
x+a =0 and x-a =0
e e
are called the directrices of the hyperbola (4.3).
76 4. Curves and Surfaces of the Second Order___ ___ . ---------··--------· - - - -
a 0 a X
-el le
I I
I I
I I
I I
I I
I I
I I
I I
Fig. 4.22
Fig. 4.23
4.5 The Parabola 77
Fig. 4.24
The hyperbola
x2 y2 -
a2 - b2 - -1 (4.4)
Properties of the parabola. (1) The parabola lies in a semi plane to the
right from the y-axis, i.e., x ~ 0 for any point lying on the parabola
(Fig. 4.26). The origin of coordinates belongs to the parabola and is called
the vertex of the parabola (4.5).
(2) The parabola contains points infinitely distant from the origin of
coordinates..
(3) The axis of abscissas of the standard Cartesian coordinate system
is the unique axis of symmetry of the parabola (4.5) (Fig. 4.27).
The axis of symmetry is called the axis of the parabola.
The point «,
The number p is sometimes called the focal parameter of the parabola.
0) is the focus, and the line x
the parabola (4.5).
=- i is the directrix of
(4) The parabola is a locus of points of a plane such that for any point
its distance from a given point (from the focus of the parabola) is equal
to its distance from a given line (the directrix of the parabola) (Fig. 4.28).
lj
- .E I r
2 I
I
I
I
I
◄ Let M(x, y) be a point lying on the parabola (4.5). The distances from
M(x, y) to the focus ("i ,0) and from M(x, y) to the directrix x = -i are
x+P
2
respectively.
Substituting 2px for y 2 , we easily obtain
4.6 Properties of Curves of the Second Order 79
The converse is also true Let M(x, y) be a point equidistant from the
point ~ , 0) and from the line x = - ~ , i.e.,
Fig. 4.29
For definiteness we assume that the point Mo lies in the first quadrant
of the coordinate plane, i.e., Xo > 0 and Yo > 0 (Fig. 4.29).
The part of the ellipse in the first quadrant is given by the equation
~
y=b✓ ,.-IT.
a1 j1 :4
Since the point Mo(Xo, Yo) belongs to the ellipse,
Yo=b✓ i-
~.
7
Whence we have
Xob 2
Y - Yo = - ~ - (x - Xo).
a yo
2
or
YYo - Y~ + PXo - px = 0.
Whence, observing that y~ = 2pXo, we arrive at the equation of a tan-
gent of the form
YYo = p(x + Xo).
Remark. Comparing standard Cartesian equations of the ellipse, hyper-
bola and parabola with the equations of respective tangents we see that
the latter can easily be obtained from the former. Indeed, substituting YYo
for y 2 and XXo for x 2 in the equations of the ellipse and hyperbola, and
x + Xo for 2x in the equation of the parabola, we immediately arrive at
the equations of the respective tangents. Notice that the point with coor-
dinates (Xo, Yo) lies on a plane curve.
Optical property of the ellipse. Let Mo(Xo, Yo) be a point lying on the
ellipse
x2 Y2
-a2+ -b2
=1.
Recall that the distance from Mo to the foci Fi and Fr of the ellipse are
e, = le.xo + al and Qr = leXo - al,
respectively.
We draw through Mo a tangent line given by the equation
XXo + YYo _ l
7 y-.
Fig. 4.30
It is easy to find the distances from the tangent to the foci Fi( - c, 0)
and Fr(c, 0) (Fig. 4.30). We have
CXo l
--+
a2
and hr=µ
x~ x~
where p, = II a4 + b4 is a normalizing factor.
Ii 9505
82 4. Curves and Surfaces of the Second Order
Fig. 4.31
Fig. 4.32
We see from Fig. 4.30 that the above ratios are equal to the sines of
the angles made by the segments FiMo and FrMo with the tangent. Since
the sines are equal so are the angles. Therefore a tangent at any point lying
on an ellipse makes equal angles with the line segments joining this point
with the foci of the ellipse. This property is sometimes called the optical
property of the ellipse because the rays from a source of light placed at
one of the foci are reflected from the "mirror" surface of the ellipse and
are focused into the other focus (Fig. 4.31).
4. 7 Classification of Curves of the Second Order 83
(a) (b)
Fig. 4.33
Let b-# 0.
Consider a Cartesian coordinate system Ox' y' obtained from the origi-
nal system Oxy by rotating the x- and y-axes through an angle cp (Fig. 4.34).
The new and original axes of coordinates are related as
x = x ' cos cp - y ' sin 'P, (4.9)
y = x' sin cp + y' cos 'P·
Substituting (4.9) into (4.8), we reduce the term 2bxy to the term
2b 'x' y', where
2b' = 2( - a sin cp cos cp + b(cos 2 'P - sin 2 cp) + c sin cp cos 'P)
= (c - a) sin 2'(J + 2b cos 2cp.
To eliminate 2b 'x 'y ' from the polynomial f(x' , y ') it suffices to set
2b ' = 0. Whence we have
a-c
cot 2'() = 2b .
lj
y'
y
I'
y
'f
a' X
X
0 r
Observe that a, b and c are known. From the above identity we may
find the value 'P of the angle through which we should rotate the original
axes of coordinates to eliminate the term 2b 'x 'y' from the polynomial
f(x' , y' ). In other words, we may always choose a Cartesian coordinate
system such that the original polynomial is reduced to the form
f(x, y) = ax2 + cy 2 + 2dx + 2ey + g,
where a2 + c2 > 0, and in what follows we shall consider the above poly-
nomial. For definiteness, we also set c ¢ 0, since substituting y for x, we
may always reduce the polynomial to a form with c ¢ 0.
4. 7 Classification of Curves of the Second Order 85
where - a and - {3 are the coordinates of the origin O' of the system
0 'XY in the original system Oxy.
Depending on the relations between the coefficients a, b, c, d and e,
the polynomial f(x, y) reduces to one of the following three kinds
(1) a ¢ 0, c ¢ 0. Then setting a =d and {3 = ~, we obtain
a C
F(X, Y) = AX2 + BY2 + C,
d2 e2
where A = a, B = b, and C = g - - - -.
a C
a = ~d ( g - e: ) and /j =~
we obtain
F(X, Y) = BY2 + 2DX,
where B = c and D = d.
(3) a = d = 0. Then setting
e
a= 0 and {3 =-
c
we obtain
F(X, Y) = BY2 + E,
e2
where B =c and E =g - -. ►
C
wh ere a 2 = -
C and b 2 = -
C (a ~ b > 0).
A B
Notice that in the XY-plane there exists no point whose coordinates
satisfy this equation.
(c) If C = 0, we obtain the equation
x2 y2
-+-=0
a2 b2 '
1 an d b2 = B
wh ere a 2 = A 1 (a~ b > 0).
where a 2 =- A
C and b 2 =B
C (a > 0, b > 0).
•> This curve is called the imaginary ellipse since its equation resembles the equation of
the real ellipse.
••> We speak of imaginary lines since the respective equation resembles the equation
defining two intersecting lines in a plane.
4.7 Classification of Curves of the Second Order 87
where a2 = A1 an d b2 =- 1.
B
This equation defines two intersecting lines
X Y X Y
---=0 and - + - = 0
a b a b
in a plane.
(2) F(X, Y) = BY2 + 2DX = 0, B · D -# 0.
Assume that B · D < 0. Substituting if necessary - X for X we always
reduce the polynomial equation to a form with B · D < 0. Then we obtain
an equation of the parabola
Y2 = 2pX,
where p = - D
B (p > 0).
(3) F(X, Y) = BY2 + E = 0, B -# 0.
Assume that B > 0. Then
(a) If E < 0, we obtain the equation
y2 - c2 = 0,
E
where c 2 = - B (c > 0).
This equation defines two parallel lines in a plane.
(b) If E > 0, we obtain the equation
y2 + c2 = 0,
where c 2 = ! (c > 0).
There is no plane which contains a point whose coordinates satisfy this
equation, called the equation of two imaginary parallel lines because it
resembles the equation defining two parallel lines.
(c) If E = 0, we obtain the equation
y2 = 0
which defines two coinciding lines in a plane.
We may determine what type of plane curve a polynomial equation
represents without making manipulations as given above. It suffices to de-
fine the signs of some expressions involving the coefficients of the poly-
nomial equation in question.
88 4. Curves and Surfaces of the Second Order
-
r
- Ellipse
'-- -- _/'
+ + Imaginary ellipse
~o Hyperbola
"'\ ~
-
/ ~ ·
/
-
~
~o Parabola
-
0 0 Pair of imaginary parallel
lines
The numbers D and A are called invariants since they are independent
of the Cartesian coordinate system set up in a plane.
Table 4.1 shows a classification of plane curves of the second order in
terms of D and A.
Fig. 4.36
z
'Y
0 X y
Revolving the curve -y around the z-axis generates a surface called the
surface of revolution (Fig. 4.38).
We find the equation of this surface, i.e., the equation to be satisfied
only by points lying on the surface.
Let Mo(Xo, 0, ~)bean arbitrary point on the curve -y (Fig. 4.39). When
the curve -y revolves around the z-axis the point Mo moves in a plane which
4.9 Classification of Surfaces 91
,.,,-1
.,, I
I
I
I
I
IJ
I
I
1o
~ I .......... J
0 X :
I .....
I /
/
◄ Indeed, if a point (x, y, z) lies on the cylindrical surface, then the point
(x, y, 0) belongs to the directing line -yo (Fig. 4.42). Hence the point
(x, y, 0) satisfies the equation
F(x, y) = 0.
On the other hand, this equation is also satisfied by the coordinates x and
y of the point (x, y, z). Therefore we may regard the equation
F(x, y) =0
as the equation of the cylindrical surface since it holds true for any point
on this surface. ►
Example. Let there be given a Cartesian coordinate system Oxyz in
three-dimensional space (Fig. 4.43). The equation
4.9 Classification of Surfaces 93
x2 y2
-+-=1
a2 b2
defines a cylindrical surface called the elliptic cylinder.
Remark. The equation
F(y, z) = 0
defines a cylindrical surface with directing line parallel to the x-axis, and
the equation
F(x, z) = O
defines a cylindrical surface with directing line parallel to the y-axis.
Conic surfaces. Suppose that we are given an arbitrary curve 'Y and a
point O outside 'Y· Let us draw lines through O and every point of 'Y. A
totality of lines thus obtained, that is a set of points lying on these lines,
is called the conic surface with the directing line -y and vertex O (Fig. 4.44).
Any line passing through the vertex O and a point on the directing line
'Y is called the generating line of the conic surface.
Consider a function F(x, y, z) of three variables x, y and z.
The function F(x, y, z) is called a homogeneous function of degree
q if for any t > 0 there holds
F(tx, ty, tz) = tq F(x, y., z).
94 4. Curves and Surfaces of the Second Order
0 X X
Fig. 4.47
called the ellipsoid of revolution, which gives an idea about the shape of
96 4. Curves and Surfaces of the Second Order
Fig. 4.48
0 X X
Fig. 4.49
l
0 X X
Fig. 4.50
z
lj h=-2
whose structures, i.e., shapes and mutual locations of the lines on the coor-
dinate planes, enable us to make a conclusion on the shape of the surface
itself.
Let us start with sections z = h = const parallel to the xy-plane. Depend-
ing on the values of h, we obtain three families of intersection lines, namely
(a) a family of hyperbolas
x2 y2
----= 1
(✓2ph )2 (✓2qh )2 '
where h > O;
(b) a family of conjugate hyperbolas
x2 y2
-;:::::===-= -1
(✓ -2ph )2 (✓ - 2qh) 2 '
where h < O;
(c) two intersecting straight lines
y2
----c=-= =0
(vq)2 '
provided that h = 0.
Fig. 4.53
Notice that these straight lines are asymptotes of all the hyperbolas of
the families (a) and (b), i.e., they are asymptotes of hyperbolas for any value
of h distinct from zero.
Mapping the intersection lines onto the xy-plane, we obtain the family
of lines shown in Fig. 4.52 from which we infer that the surface in question
is of a saddle shape (Fig. 4.53).
I'
100 4. Curves and Surfaces of the Second Order
x2 = 2p(z + 2~q)'
as shown in Fig. 4.54.
Similarly, cutting the surface by planes x = h, we obtain a family of
parabolas in the yz-plane
2
y2 = - 2q ( z - 2pq
h ) ,
0 I
Fig. 4.56
:.r.
;---- - ............
L_
-- ....
Fig. 4.57 Fig. 4.58
X
X
-----
Fig. 4.60
Fig. 4.59
102 4. Curves and Surfaces of the Second Order
Exercises
x2 y2
1. For the hyperbola 9 - -16 = 1 find; (a) the coordinates of
foci, (b) the eccentricity, (c) the equations of asymptotes and directrices,
(d) the equation of the conjugate hyperbola and its eccentricity.
2. Write down the equation of a parabola provided that the distance from
the focus to the vertex is equal to 3.
2 2
3. Write down the equation of the tangent to the ellipse ; 2 + ; 8 = 1
at the point M( 4, 3).
4. Identify the types and locations of plane curves given by the equations:
(a) x 2 + 2y + 4x - 4y = 0; (b) 6xy + 8y 2 - 12x - 26y + 11 = 0;
(c) x2 - 4xy + 4y 2 + 4x - 3y - 7 = 0; (d) xy + x + y = 0, (e) x2 -
5xy + 4y 2 + x + 2y - 2 = 0; (f) 4x 2 - 12xy + 9y 2 - 2x + 3y - 2 = 0.
Answers
= 35 , (c) y = ± 34 x, x -= ± 59 ; (d) 9x - 16
2 Y 2
1. (a) F1(-5, 0), Fr(5, O); (b) e -= -1,
~ ~
e = 45 . 2. y 2 = l2x. 3. 3x + 4y - 24 = 0. 4. (a) The ellipse 6 + - 3- = 1 with centre
2 2
at O ' ( - 2, 1) and the major axis O 'X parallel to the x-axis; (b) the hyperbola ~ - ~ =1
with centre at O' ( -1, 2) and the tangent of the angle between the axis O 'X and the x-axis
equal to 3; (c) the parabola Y2 = Js X with vertex at O' (3, 2), the vector of the O 'X-axis
directed to the vertex is { - 2, - 1 ); (d) the hyperbola with centre at O ' ( =-1~.J), the asymptotes
are parallel to the x- and y-axes; (e) a pair of intersecting straight lines x - y - 1 = 0 and
x - 4y + 2 = O; (f) a pair of parallel straight lines 2x - 3y + 1 = 0 and 2x - 3y - 2 = 0.
Chapter 5
Matrices. Determinants.
Systems of Linear Equations
5.1 Matrices
Definitions. An m x n matrix is an array of m•n numbers au (i = 1,
2, ... , m; j = 1, 2, ... , n) arranged in the rectangular form
au 0:12
0:21 0:22
A= (5.1)
Cim 1 Cim2 Cimn
Cimj
a12 lXIn
a22 a2n
A= . .. . . . . .. . .. . . . . . . .. .
lXnl lXn2 lXnn
[ - ----
Fig. 5.1
1 0 0
0 1 0
I= ..... . . . . . . . . . . '
(5.2)
0 0 1
is called an identity or unit matrix.
For any m x n matrix there exists a zero matrix and for any square
matrix of a given order n there exists a unit matrix.
We shall denote the set of all matrices of the type m x n by IRm x n with
the understanding that we are concerned with matrices whose elements are
5.1 Matrices 105
The matrices A = (au) and B = (/3u) are called equal if they are of the
same type and their elements occupying identical positions coincide, i.e., if
AEIRmxn, BEIRmxn
and
a;i=/3u (i= 1, 2, ... , m; j= 1, 2, ... , n).
In symbols, we write A = B.
Now we shall turn our attention to arithmetic operations on matrices.
Addition of matrices. Let A and B be two matrices of type m x n, that
.
IS,
A= (au) E IRmxn and B = (/3u) E IRmxn.
The sum of matrices A and B is the matrix C = (')'ij) E fRm x n whose
elements are
'"Yii = au + /3u (i = 1, 2, ... , m; j = 1, 2, ... , n). (5.3)
In symbols, we write C = A + B.
Multiplication of a matrix by a scalar. The product of a matrix A =
(au) E IRm x n by a scalar Ais the matrix B = (/3u) E IRm x n whose elements are
f3u=Aa;i (i= 1, 2, ... , m; j= 1, 2, ... , n). (5.4)
In symbols, we write B = AA.
By way of illustration we show how these operations are performed by
using notation (5.1 ):
O'.tj O'.In
0'.2j 0'.2n
J
/311 (312 /31j /31n
/321 (322 /32j f32n
. . . . ... . ... . . . . . . . . . . . . . . . .. ...
X
{3;1 {3;2 f3u /3in
. . . . . ... . . . . . . . . . . .. .. . .. . . .. . .
f3n1 f3n2 /3nj f3nn
J
"/11 "/12 "/lj "/In
"/21 "/22 "/2j "/2n
In general AB ~ BA.
Examples. (1) Let
A= (~ ~) and B = (~ ~)-
Then
CXI 1 CX12
CX13)
A = ( CX21 cx22 cx23 •
CX31 CX32 CX33
0'.11
( 0'.13)
A= a21 0'.23 ·
0'.31 0'.33
Then
0'.11 a12
A·I = ( a21 a22
0'.31 0'.32
0'.11 a12
0'.13)
= ( a21 a22 a23 = A.
0'.3 I 0'.32 Q'.33
Relation (5.6) explains why the matrix I is called the identity matrix.
The operation of matrix multiplication obeys associative and distribu-
tive laws, namely if A, B, C and D are square matrices of order n, then
(AB)C = A(BC),
A(B + C) = AB + AC,
(B + C)D = BD + CD.
◄ By way of illustration we show that A(B + C) = AB + AC. It is clear
that the three matrices AB, AC and A(B + C) are of the same order n.
Their elements in the (i, j)th position are, respectively,
~
n
~ O'.ik"(kj, (i,j= 1, 2, ... , n).
k=l
n
~ O'.ik(/3kj + "(kj),
k=l
5.1 Matrices 109
Then we obtain
n n n
~ a;k(f3kj + "(kj) = ~ a;kf3kj + ~ aik"(kj
k=l k=l k=l
and this proves the property in question.
Similar reasoning is applicable to the other two identities. ►
Remark. The operation of matrix multiplication may also be defined
for rectangular matrices.
◄ Let there be given matrices A = (a;k) E fRm x n and B = (f3kJ) E fRn x 1.
Then the elements of the matrix C = AB E fRm x I are
n
'Yii = ~ a;k/3ki (i = 1, 2, ... , m; j = l, 2, ... , /). (5.7)
k=l
Therefore the product of two rectangular matrices may be defined only
when the number of columns in the first factor is equal to the number
of rows in the second factor (Fig. 5.2) as is easily seen from (5.7). ►
•
6x5
Fig. 5.2
Products of rectangular matrices obey the laws (5.6) provided that the
corresponding operations of matrix multiplication make sense.
Example. Compute the product of the matrix
A =( ! !) by the matrU B = (: : : : ).
◄ Observe that the number of columns in A is equal to the number of
rows in B. This means that we may multiply A by B. Computing the
product, we obtain
AB=(! !)-(: : :
9·1 + 5·1 9·0 + 5.9 9·1 + 5.4 9·2 + 5·2)
= ( 1·1 + 9·1 1·0 + 9.9 1·1 + 9.4 1·2 + 9·2
8·1+6·1 8·0+6·9 8·1 + 6·4 8·2 + 6·2
14 45 29 28)
- ( 10 81 37 20 . ►
14 54 32 28
110 5. Matrices. Determinants. Systems of Linear Equations
Whence we obtain
n m m n
~ ~au=~ b au.
J=l i=l i=l j=l
a12 a1n
a22 a2n
A= (5.1)
am2 amn
A= G 2
6
3
7 :)
.
lS
1 5
2 6
A' -
3 7
4 8
It is important to observe that the element of A' in the (i, ))th position
coincides with the element of A in the U, 1)th position. The operation of
transposition interchanges rows and columns of the matrix A so that rows
in A become columns in A ' and columns in A become rows in A '. There-
fore if the matrix A has m rows and n columns, the transpose A' of A
contains n rows and m columns (Fig. 5.3).
6 x11
11x6
8 2
2
V
V
V
V
Fig. 5.3
(a) a + b = b + a,
(b) (a + b) + c = a + (b + c),
(c) a + 0 = 0 + a = a,
(d) A(a + b) = Ml + Ab,
(e) (A + µ) a = Ml + µa, (5.10)
(f) A(µa) = (Aµ)a,
(g) I •a = a,
the equation a + x = 0 being uniquely soluble for any row-vector a. In
(5.10) A and µ are arbitrary scalars, a, b, c and x are row-vectors (1 x n
matrices), 0 is a zero row-vector (a zero 1 x n matrix). We shall see in
Chap. 6 that conditions (5.10) define a linear vector space over a set of
row-vectors.
Now we shall introduce an important notion of linear dependence of
row-vectors.
Let a1, a2, ... , am be m row-vectors. A relation of the form
(5.11)
8-9505
114 5. Matrices. Determinants. Systems of Linear Equations .
-----
(i ~ D
is obtained from the matrix
A=(i ! ~)
by interchanging the second and third rows, and the matrix
1
0
0
is obtained from A by interchanging the first and second columns.
Adding the third row of A multiplied by - 2 to the first row of A,
we obtain the matrix
-12
1
6
Remark. It is easy to see that when the matrix A is obtained by applying
either of the elementary row operations to the matrix A, the transition from
A to A is achieved by the same row operation, that is, either by interchang-
ing the kth and /th rows or by multiplying the kth row by the scalar 1/ {3
or by adding the kth row multiplied by - 'Y to the /th row.
Now we shall turn our attention to the procedure of changing from
an arbitrary matrix to a matrix of simpler form by a finite sequence of
elementary row operations.
Let A = (au) E [Rm x n be a nonzero matrix.
Step 1. Since A is a nonzero matrix there exists at least one nonzero
element in A. Consequently, there exists at least one nonzero row in A.
We choose a nonzero row such that its first nonzero element occurs in a
column with the smallest number k1 ~ 1. Interchanging this row and the
first row of A, we reduce A to the matrix
(5.12)
5.1 Matrices 115
<Xik1 •
Adding scalar multiples of the first row of (5.12) by - (I) (1 = 2,
<X1k1
3, ... , m) to the corresponding rows, we obtain a matrix of the form
0
0 (5.13)
rv(l)
0 0 0 u.mn
Observe that the elements in the k1th row are zeros with the exception of
the element a~}/1 •
It may happen that rows of (5.13) are zero ones with the exception of
the first row. In this case the procedure terminates. If this is not the case,
i.e., if there exist nonzero rows in addition to the first row, the procedure
goes on.
Step 2. Similar to Step 1 we choose a row such that its first nonzero
element occurs in a column with the smallest number k2 (k1 < k2).
Then interchanging this row and the second row of (5.13), we obtain
0 0 (X~~I (X~~2 - 1 (X~~2 (X~~
0 0 0 0 ai:J 2 a~;/
rv(l)
0 0 0 0 u.mn
(5.14)
(I)
. 1es o f the second row bY -
Adding scalar mult1p a~:J
<Xikz (.
, = 3, 4, . . . '
2
8*
116 5. Matrices. Determinants. Systems of Linear Equations
0 1 1 1 1
0 0 -2 3 -4
A1= 0 0 0 0 0
0 0 0 0 0
Interchanging the third and fourth rows of A1, we obtain
0 1 1 1 1 1
0 0 -2 3 -4 5
A2 = 0 0 0 0 0 1
0 0 0 0 0 0
The matrix A2 is of schematic form. ►
(2) Reduce the matrix
3 -1 3 2 5
5 -3 2 3 4
A= 1 -3 -5 0 -7
7 -5 1 4 1 \
to a matrix of schematic form.
◄ Interchanging the first and third rows, we obtain
1 -3 -5 0 -7
5 -3 2 3 4
A1= 3 -1 3 2 5
7 -5 1 4 1
5.1 Matrices 117
0 0
118 5. Matrices. Determinants. Systems of Linear Equations
whose elements are zeros with the exception of those occupying the posi-
tions (1, 1), (2, 2), ... , (r, r), each of which is equal to unity.
Interchanging the columns in (5.15) so that the k1 th column replaces
the first one, the k2th column replaces the second one, etc., until the krth
column replaces the rth column, we obtain a matrix of the form
O'.tr O'.ln
0'.2r 0'.2n
(5.17)
O'.rr O'.rn
0
where &1 I ~ 0, &22 -:¢. 0, ... ' CXrr -:¢. 0.
For example, interchanging the third and fifth columns in As, we obtain
7 0 5
13 1 9
-1 0 0
0 0 0
0'.11 0 0 0 0
0 0'.22
0 0 0 O'.rn
0 0 0 0 0
where the first row contains only one nonzero element, &.11.
Similarly, operating on the rows 2, 3, ... , r, we obtain the matrix
&.11
&22 0 \
&;; (5.18)
0 CXrr
1 0 0 0 0
0 1 0 0 0
As= 0 0 1 0 0
.
0 0 0 0 0
i
.
J
1 •
•
1
••• 0 1 l
P;i = 1
•• .
~
1
1 . .. 0 ... j
• 1
•
•
1
0 .
1 l
"
1 j
0
120 5. Matrices. Determinants. Systems of Linear Equations
by interchanging the ith and jth rows. Observe that off-diagonal elements
of Pu are zeros with the exceptions of the elements in the (i, })th and
U, l)th positions.
(b) Matrices obtained from the corresponding identity matrices by sub-
stituting a nonzero scalar for a diagonal element. For example, the matrix
1
1
. . . . . /3 . . . . .
J
1
1
J
differs from the identity matrix by the element {3 ~ 0 in the U, })th position.
Notice that all off-diagonal elements of Dj are zeros.
(c) Matrices that differ from the corresponding identity matrices by one
off-diagonal element. For example, the matrix
i
1
Ru= I
•
1
l
•
\
also differs from the identity matrix by the same element but located in
the (i, j)th position. Observe that off-diagonal elements of Lij and Rij are
zeros with the exception of the element -y.
We point out here that for any matrix each elementary operation is
equivalent to pre- or post-multiplication of the matrix by a suitable elemen-
tary matrix.
5.1 Matrices
----------.~-----··--·------ - . 121
0'.11 a12
Q'.13)
A= ( a21 Q'.22 0'.23 ,
0'.31 0'.32 Q'.33
Recall that pre-multiplying A by the matrix
P23 =( g ~ I)
gives
0'.11 Q'.12
Q'.13)
Q'.33
B = ( a31 0'.32
a21 a22 0'.23
It is easy to see that the matrix B differs from A by the order of rows
and the matrix C differs from A by the order of columns.
Similarly we can verify the conditions (a) and (a') for the matrices
P12 =( ~ 0~ ~1 )
0
and P13 =( ~1 ~0 0~ ) .
122__ 5. Matrices. Determinants. Systems of Linear Equations
D2=(i i ~)-
Pre-multiplying A by D2, we have
D2A =
1
( ~
0
{3 O)(au
0 a21
a12
a22
0 1 a31 a32
(au
/3a21
a12
/3a22
ao)
/3a23 .
a31 a32 a33
AD2=
(a11
a21
a12
a22
a")C
a23 0
0
{3
a31 a32 a33 0 0
= (a11a21
/3a12
{3a22
a13)
0:'.23 •
a31 /3a32 a33
Similarly, we verify the conditions (b) and (b ') for the elementary ma-
trices
0 0
1 1
0 0
Notice that the conditions (c) and (c ') can easily be verified in a similar
way. ►
5.2 Determinants \
We associate with a square matrix a single number called a deter-
minant according to the following rules:
The determinant of the matrix of order 1
(an)
is equal to a11.
5.2 Determinants 123
(::: :::)
is a number equal to a11 a22 - a12a21.
In symbols, we write
a11 a12
det (au a12) = (5.19)
a21 a22 a21 a22
( a11a21
a12
a22
a13)
a23
a31 a32 a33
is a number equal to
a22 a23 a12 a13 a12 a13
a11
a32 a33
- a21
a32
+ a31
a33 a22 a23
+ Fig. 5.4
is a number equal to
D = a11M11 - a21M21 + ... + ( - It+ 1 an1Mn1, (5.22)
where M;i (i = 1, 2, ... , n) is the determinant of order n - I
Q'.13 O'.ln
0'.23 0'.2n
obtained by deleting from A the ith row and the first column.
In symbols, we write
◄ We have
IAI = au
0 0 0
Fig. 5.5
126 5. Matrices. Determinants. Systems of Linear Equations
Consider n numbers
n
D1 = ~ (-li+ 1auMu U= 1, 2, ... , n). (5.27)
;=1
It is easy to see that for j = 1 formula (5.27) coincides with (5.26). Let
us prove that D = D1 = D2 = . . . = Dn.
◄ We confine ourselves to the case when n = 3. By setting j = 2 in (5.27),
we have
D2 = - a12M12 + a22M22 - a32M32, (5.28)
where
a21 a23
M12 = = a21 a33 - a31 a23,
CT31 CT33
a11 a13
M22 = = a11a33 - a31a13, (5.29)
CTJI CT33
a11 CT13
M32 = = a11a23 - a13a21.
a21 a23
Whence we obtain
D2 = - a12(a12a33 - a31a23) + a22(a11a33 - a13a31)
n
~t = ~ ( - 1)' + j auMij (i = 1, 2, ... , n) (5.34)
J=I
n
D= ~ (-l)t+JauM;1 (i= 1, 2, ... , n), (5.35)
)=1
i.e., the determinant of any square matrix of order n admits expansion with
respect to the ith row.
◄ It is sufficient to prove that
~l = D. (5.36)
Again we confine ourselves to n = 3.
Using (5.34), we have
◄ The determinant of Pu is
l J
1
1
0 1 l
IPiJ I 1
1
1 0 J
1
0 1
I
1
I Pu I 1 . 0
1
after i - 1 steps.
Using this procedure further, we finally arrive at
0 1
I Pu I = l O = -1.
D= l.
O'.n l
Then
D = AD' + µD",
where the determinants
Whence
D = }.JJ' + µD 11
• ►
9-9505
130 5. Matrices. Determinants. Systems of Linear Equations
◄ Suppose that
a22 n'.2n
Q'.12 Q'. 1n
D=
n'.n2 n'.nn
is obtained from
ll'.ln
ll'.2n
D=
ll'.nl n'.n2 ll'.nn
we obtain
cx.;2 CX.in l
Cc'.i2 k
41'
132 5. Matrices. Determinants. Systems of Linear Equations
Observe that the ith and kth rows in J5 are identical. By property (a),
J5 = 0. Then expanding 15 with respect to the kth row, we arrive at the
desired identity
O'.i1Ak1 + O'.i2Ak2 + ... + O'.inAkn = 0 (i ~ k).
(Recall that cofactors of elements in any row do not depend on the values
of these elements.) ►
Computing determinants. When computing determinants we widely use
elementary (row and column) operations. It is important to point out that
being subjected to elementary operations either (a) or (b) or (c) the deter-
minant either reverses its sign or is multiplied by a scalar or remai~s un-
changed, respectively. Therefore the elementary operations are closely
related to the properties of determinants.
Example. Compute the determinant of
2 -5 4 3
3 -4 7 4
A= 4 -10 8 3
-3 2 -5 3
◄ Observe that a11 ~ 0 and divides only two elements of the first column
with non-zero remainders. To avoid division of elements of the matrix we
multiply the second row by - 2, the third row by -1 and the fourth row
by 2. Then we obtain
2 -5 4 3
-6 8 -14 -8
( - 2)( -1)21 A I - - 3 .
-4 10 -8
-6 4 -10 6
Step 1. Adding multiples by 3, 2 and 3 of the first row to the second,
third and fourth rows, respectively, we obtain
2 -5 4 3
0 -7 -2 1
41AI - 3 .
0 0 0
0 -11 2 15
Step 2. To avoid division we multiply the fourth row by - 7. Then we
have
2 -5 4 3
0 -7 -2 1
(-7)4 IAI - 0 3 .
0 0
0 77 -14 -105
5.3 Inverse Matrices 133
A= j;
Oljl Olji Oljn
Au Ant
A12 An2
B= .
Au An; l.
A1n Onn
134 5. Matrices. Determinants. Systems of Linear Equations
IAI. 0
AB=BA= IAI •I. (5.38)
0 IAI
By way of illustration we show that
AB= IAl•I.
◄ The element of AB occupying the position where the ith row and the
jth column meet is given by
n
'YU = ~ Ol;kAjk.
k=l
Setting i = j, we see that 'Yii is the result of the expansion of the deter-
minant of A with respect to the ith row, i.e.,
n
'Yii = ~ Ol;kAik = IA I.
k=l
Ali An;
(5.39)
IAI IAI
A1n Ann
IAI IAI
is called the inverse of the matrix A.
From (5.38) it follows that
AA - 1 = I, A - 1A = I. (5.40)
This means that the matrix A - 1 may be considered as a solution satisfy-
ing two matrix equations simultaneously
AX = I and XA = I,
5.3 Inverse Matrices 135
where
Xu X1j X1n
X= Xin
is an unknown matrix.
Let us show that A - 1 is the only solution satisfying both these equa-
tions simultaneously.
Assume that there exists a matrix C such that
AC = I and CA = I.
Pre-multiplying the former identity by A - 1 and post-multiplying the
latter by A - 1 , we have
A - 1 (AC) = (A - 1)1 and (CA)A - 1 = l(A - 1). (5.41)
By the associative law of multiplication we may write
A - 1 (AC) = (A - 1 A)C and (CA)A - 1 = C(AA - 1).
1 0
1
0 1
and thus complete the proof. ►
In matrix form this theorem states the following.
Theorem 5.5. Let A be an arbitrary nonsingular matrix. Then there exist
elementary matrices Q1, Q2, ... , Qk such that
Qk,Qk- I'.,. ·Q2·Q1 ·A = I. (5.44)
Post-multiplying (5.44) by A - 1, we obtain
Qk,Qk-1',,, ·Q2·Q1 = A - 1.
Pivotal condensation. Suppose that we are given a nonsingular matrix
A of order n. Let
(A I I) (5.45)
be a matrix of order n x 2n. If (5.45) is subjected to a sequence of elemen-
tary row operations or, equivalently, is premultiplied by a sequence of
elementary matrices, the matrix A reduces to the identity matrix and the
identity matrix I reduces to the matrix A - 1 inverse to A. Thus, by elemen-
tary row operations (5.45) is reduced to
(I I A - 1 ).
where I is th·e identity matrix obtained from A. (See the procedure employed
in Theorem 5.4.)
4. The matrix C = (-y;j) of order n (i, j = l, 2, ... , n) in (I IC) is the
inverse of the matrix A, i.e.,
C = A- 1•
Step 4. Subtract the fourth row from the other three rows:
1 1 1 0 3/4 1/4 1/4 -1/4
0 1 0 0 1/4 1/4 -1/4 -1/4
B3 ⇒ B4 = 0 0 1 0 1/4 -1/4 1/4 -1/4
0 0 0 1 1/4 1/4 -1/4 1/4
Step 5. Subtract the third row from the first one:
1 1 0 0 1/2 1/2 0 0
0 1 0 0 1/4 1/4 -1/4 -1/4
0 0 1 0 1/4 -1/4 1/4 -1/4
0 0 0 1 1/4 1/4 -1/4 1/4
Step 6. Subtract the second row from the first one:
1 0 0 0 1/4 1/4 1/4 1/4
0 1 0 0 1/4 1/4 -1/4 -1/4
Bs ⇒ B6 = -1/4 -1/4
0 0 1 0 1/4 1/4
0 0 0 1 1/4 -1/4 -1/4 1/4
Whence we conclude that
A - i =!A. ►
Let us choose in A k rows and k columns such that i1 < i2 < . . . < it and
ji < h < ... < }k refer to the chosen rows and columns, respectively, and
set up the kth order matrix
A=
Then the first r rows of A denoted by a 1, a2, ... , a, become the base rows
of A.
We shall prove that a1, a2, ... , a, are linearly independent. Let us sup-
pose the opposite, i.e., the rows a1, a2, ... , a, are linearly dependent. Then
one of the rows is a linear combination of the other r - 1 rows. Assume
that
This means that the rth row of the base minor M, is a linear combination
of the other rows of M,. Recall that any determinant such that one of its
rows is a linear combination of the others is equal to zero. Hence, M, = 0.
But this contradicts the definition of the base minor. Consequently, the
assumption on the linear dependence of a1, a2, ... , a, is false. Thus we
infer that a1, a2, ... , a, are linearly independent.
Now we show that each row of A can be represented as a linear combi-
nation of the other base rows of A. First, we ascertain that for any i and
j (1 ~ i ~ m, 1 ~ j ~ n) there holds
a11 CX1r CX1j
Ll=
a,1 CXrr CXrj
= 0. (5.47)
At=
- = r.
r ►
0
_ (1) (2) (r) 0
- ex tk1 cx.21c2 ••• cx,k, ~
•
0 (r)
CX.rk,
and any minor of order s (s > r) comprises the zero row and, consequently,
is equal to zero.
Thus, subjecting a matrix to a sequence of elementary operations, we
may easily and effectively define the rank of this matrix.
Example. Find the rank of
A = (~ - ~ ~ - i ~).
2 -1 1 8 2
◄ Step 1. Subtracting the multiples of the first row by 2 and 1 from the
second and third rows, respectively, we get
2 -1 3 -2
r(A) = r ( ~ -1 5
o oI -2 10
Step 2. Subtracting the second row multiplied by 2 from the third row,
we get
-1
0 I -I
3 -2 4)~5 - = 2. ►
0 0 0
0'.21 0'.2n
A=
Ci.ml O'.m2 O'.mn
(5.53)
/31
f32
b= and X =
/3m
The matrix A is called the coefficient matrix of the system (5.51), the
column-vector b is called the column-vector of constant terms and the
column-vector X is called the column-vector of unknowns. The matrix
A= (Alb)
is called the augmented matrix of the system (5.51). The solution of the
matrix equation (5.52) is the column-vector
,'l
,'2
f=
'Yn
5.5 Systems of Linear Equations 145
or
O'.I 1)'l + 0'.12 ')'2 + + O'.tr'Yr = f31,
a21 'YI + a22 ')'2 + + 0'.2r')'r = f32,
O!mI 'YI + O'.m2'Y2 + ••• + O!mr'Yr = f3m.
10-9505
146 5. Matrices. Determinants. Systems of Linear Equations
and
a{1X1 + a{2X2 + ... + a{nXn = {3{,
CX21X1 + CX22X2 + .. , + CX2nXn = /32,
I fl I
148 5. Matrices. Determinants. Systems of Linear Equations
an a12 U1n
a21 a22 a2n
A=
Uml Um2 Umn
In the previous sections we have shown that the elementary row opera-
tions may reduce the augmented matrix A to schematic form
A'-
a(r)
tJr+ 1
0
0
0
Similarly, the system (•) is reduced to the form with the augmented
matrix A.
5.5 Systems of Linear Equations 149
This means that there exists non-tuple of numbers turning the above equa-
tion into identity.
Let {3~'1 1 = 0. Then only the first r rows in A' are distinct from zero.
To simplify the notation we set
k2 = 2, k3 = 3, ... , k, = r.
(This may be made by renumbering Yi = X1, Y2 = Xk2, ..• , y, = Xk,, ...• )
The linear system takes the form
(1)
a11X1 + a11X2
(1) (1)
+ ... + a1,X, + ... + a1nXn(1)
= {3(1)
1 ,
(2)
a2t X2 + . . .
+ a2,
(2) (2) _ {3(2)
X, + . . . + Ct'.2,n Xn - 2 ,
00 Xr
a,, + ... + 00 Xn
Ct'.rn = {300
r ,
(n - 1) (n - 1) _ {3(n -
1)
Ct'.n-1 n-1Xn-l
'
+ O!n-1
'
nXn - n-1 ,
rv(n)x
u.nn n
= {3(n)
n ,
(1)
aP?x1 + aiYx2 + + ai9xr = 13p> - ai~~+I ')'r+I - - a in 'Yn,
a~~X2 + + a~~ Xr = t3f> - aft+ 1 'Yr+ 1 - -
(2)
0'.2n 'Yn,
Similarly to the case (1) we obtain the values of Xr, Xr - 1, . . . , X2, X1.
Since 'Yr+ 1, -y, + 2, . . . , 'Yn are arbitrary numbers we infer that the system
has infinitely many solutions. ►
Example. Solve the system
3x1 - 5x2 + 2x3 + 4x4 = 2,
[ 1x1 - 4x2 + X3 + 3x4 = 5,
5x1 + 7x2 - 4x3 - 6.x.i = 3.
◄ Let us arrange the augmented matrix
-5 2 4
-4
7
and reduce A to schematic form.
-4
1
-6
3
D
Step 1. To get the matrix with au = 1 we subtract the first row multi-
plied by 2 from the second row and interchange the first row and the ob-
tained second one. We have
6 -3 -5
-5 2 4
7 -4 -6
Subtracting the multiples of the first row by 3 and 5 from the second
and third rows, respectively, we get
1 6 -3 -5
( ol -23 11 19
o I -23 11 19
Step 2. Subtracting the second row from· the third one, we have
1 6 -3 -5
( ol -23 11 19
0 0 0 0
Since the rank of the coefficient matrix is equal to 2 and the rank of
the augmented matrix is equal to 3 we conclude that the linear system is
inconsistent. ►
5.5 Systems of Linear Equations 151
1 8 -7 12
0 -29 19 -39
0 -13 9 -17
0 -11 6 -16
Step 2. To simplify computations we subtract the third row multiplied
by 2 from the second row and then the fourth row from the third one.
We have
8 -7 12
-3 1 -5
-2 3 -1
-11 6 -16
Subtracting the third row from the second one, we get
1 8 -7 12
0 -1 -2 -4
0 -2 3 -1
-11 6 -16
Subtracting the multiples of the second rows by 2 and 11 from the third
and fourth rows, respectively, we obtain
152 5. Matrices. Determinants. Systems of Linear Equations
N
1 8 -7
-2
7
28
12
-4
7
28
Step 3. Subtracting the third row multiplied by 4 from the fourth row,
we have
8 -7 12
-1 -2 -4
0 7 7
0 0 0
Multiplying the third row by 1/7, we obtain
8 -7 12
-1 -2 -4
0 1 1
0 0 0
The system is consistent since the rank of the coefficient matrix is equal
to the rank of the augmented matrix, that is, r(A) = r(A) = 3. Since the
rank is equal to the number of unknowns the system has a unique solution.
Thus the original system is equivalent to the system
Xt + 8X2 - 7X3 = 12,
[ - X2 - 2x3 = -4,
X3 = 1.
From the third equation we have x 3 = 1. Substituting X3 = 1 into the
second equation, we have - X2 - 2 = -4. Whence, x2 = 2. Substituting
the values of x2 and X3 into the first equation, we get x1 + 16 - 7 = 12.
Whence X1 = 3. Thus the system has the unique solution x1 = 3, x2 = 2,
X3 = 1. ►
Example. Solve the system
3Xt - 2x2 + 5X3 + 4X4 = 2,
[ 6x1 - 4x2 + 4x3 + 3X4 = 3,
9x1 - 6x2 + 3x3 + 2X4 = 4.
◄ Arrange the augmented matrix
G : : i D·
Step 1. Subtracting the multiples of the first row by 2 and 3 from the
second and third rows, respectively, we have
5.5 Systems of Linear Equations 1S3
3 -2
f1l-12
( 0 0
5
-6 -5
-10
4
-i).
-2
Step 2. Subtracting the multiple of the second row by 2 from the third
row, we get
3 -2 5 4
( 0 0 -6 -5
0 0 0 0
The system is consistent (r(A) = r(A) = 2) and has infinitely many so-
lutions since the rank of the coefficient matrix is smaller than the number
of unknowns (r(A) < 4).
The original system is equivalent to the system
[
3xi - 2x2 + 5x3 + 4X4 = 2,
- 6X3 - 5X4 = -1.
Let us find the solution of this system.
Set x2 and X4 equal to arbitrary numbers -y2 and 'Y4. Then transposing
the corresponding terms of the system to the right, we have
[
3x1 + 5x3 = 2 + 2-y2 - 4-y4,
6x3 =1 - 5-y4.
From the last equation we get
X3 = 61 (1 - 5-y4),
a22 a2n
..... . . ... . . . .. . . . ... . . . . . . . .
O'.nl O'.n2 . . . f3n O'.nn
Xj = '
j - 1, 2, . . . ' n,
an a12 O'.tj a1n
A= (A lb)=
0 0 1 f3n
where A is a triangular matrix.
Step 3. By elementary row operations reduce (A I ii) to the form
1 0 0 "/1
(I I c) = 0 1 0 "/2
.............
1 1 'Yn
The n-tuple
X1 = 'Yl, X2 = 'Y2, ... , Xn = 'Yn
is the solution of the original system.
Homogeneous linear systems. The system of m linear equations in n
unknowns is called homogeneous if all constant terms are equal to zero,
1.e., the homogeneous linear system is given by
a11X1 + a12X2 + ... + a1nXn = 0,
a21X1 + a22X2 + ... + a2nXn = 0, (5.58)
am1X1 + am2X2 + ... + amnXn = 0.
We shall consider some important properties of homogeneous linear
systems.
(a) Any homogeneous system is consistent.
◄ The n-tuple X1 = 0, X2 = 0, ... , Xn = 0 is the trivial solution of any
homogeneous system. ►
(b) If the number n of the unknowns exceeds the number m of the equa-
tions the homogeneous system has nontrivial solutions.
◄ By definition the rank r of the system (5.58) satisfies the inequality
r ~ m < n. Whence we infer that the linear system is indeterminate. ►
(c) The sum of solutions of the homogeneous system (5.58) is a solution
of (5.58).
◄ Let -y{, -y 2, ... , -y; and -y{', -y,J,', ... , -y;' be two solutions of (5.58). This
means that
n n
~ a;;-yj = 0 and ~ aij-yj'= 0
j=l J=l
the n-tuple
' + 'YI,"
'YI 'Y2 '+ 'Y2," • • ,, 'Yn '+ 'Yn"
i.e., the sum of the solutions of (5.58), is also the solution of the homogene-
ous linear system (5.58). ►
(d) The product of the solution of (5.58) by an arbitrary number is also
a solution of (5.58).
5.5 Systems of Linear Equations 157
Whence we conclude that the n-tuple µ-y1, µ-y2, ... , µ-yn, i.e., the product
of the solution -y1, -y2, ... , 'Yn by the numberµ is the solution of the system
(5.58). ►
In matrix form, the homogeneous linear system may be written as
0
0
Xn 0
Allowing for renumbering of the unknowns we may consider X1, x2, ... ,
x, as the principal unknowns and the other unknowns as indeterminate or
disposal unknowns.
Let the rank r of the system (5.60) be smaller than the number n of
the unknowns, that is, r < n.
We compute n - r basic solutions by giving to the indeterminate
unknowns x, + 1, x, + 2, . . . , Xn the values from the following table
1 1 0 ... 0 0
2 0 1 ... 0 0
(5.61)
... . .. . .. . .. . .. . ..
n-r- 1 0 0 ... 1 0
n-r 0 0 ... 0 1
Each row in (5.61) determines the solution of the system (5.60) so that
we have the collection of n - r solutions
')'11 ,'12
')'21 ')'22
')'rl 'Yr2
1 0
0 1
0 0
0 0
5.5 Systems of Linear Equations 159
)'I, n - r - 1 )'1,n - r
)'2,n - r- 1 )'2,n - r
1 0
0 1
The solutions f1, r2, ... , r n - r are linearly independent.
◄ Consider the linear combination
µr+2
µn- 1
µn
It is easy to see that the linear combination (5.62) is equal to the zero
column-vector if and only if µr+ I = µr+ 2 = ... = µn - I = µn = 0. Hence,
it follows that the zero (trivial) solution of the system (5.60) is equal only
to the trivial linear combination of the solutions f I, f 2, . . . , f n - r• ►
The properties (c ') and (d ') imply that given arbitrary scalars µr+ 1,
µr+ 2, . . . , µn, the linear combination (5.62) is the solution of the system
(5.60).
◄ Let
r= (5.63)
µ,r + 1
Multiplying the solutions f1, f2, ... , fn-r byµ,+ 1, µ,+2, ... , µn,
respectively, and adding the multiples obtained, we get the solution of (5.60)
as the linear combination (5.62).
Comparing (5.62) and (5.63), we easily see that both comprise the same
values of the indeterminate unknowns µ, + 1, µ, + 2, ... , µn. Observe that
the values of the indeterminate unknowns uniquely determine the values
of the principal unknowns. Hence, the solutions (5.62) and (5.63) are identi-
cal and
(5.64)
Therefore the solutions r1, r2, ... , rn _, of the homogeneous system
(5.58) are such that (a) they are linearly independent and (b) any solution
of (5.58) may be represented as a linear combination of r 1, r2, ... , r n _ ,.
Definition. Any collection of n - r solutions of the homogeneous sys-
tem (5.58) that satisfy the conditions (a) and (b) given above is called the
fundamental system of solutions of (5.58).
Example. Compute the solutions of the system
3X1 - 2x2 + 2x3 + 4X4 = 0,
[ 6x1 - 4x2 + 4x3 + 3x4 = 0,
9x1 - 6x2 + 3x3 + 2x4 = 0.
◄ By the method of Gaussian elimination we have
(
3X1 - 2x2 + 5X3 + 4X4 = 0,
- 6X3 - 5X4 = 0.
Let us choose X2 and X4 as indeterminate unknowns and arrange the
table
X1 X2 X3 X4
2
- 1 0 0
3
1 5
0 1
18 6
a1 = 0, /31 = 0, Xn + 1 = 0.
Collecting all the formulas together, we may describe the method as
consisting of
(a) elimination stage
C;
ex; + 1 = - - - - - i = 1, 2, ... , n; a1 = 0,
b; + a;a; '
a;/3; - d;
{3; + 1 = - ---- , i = 1, 2, ... , n; /31 = 0,
b; + a;a;
_ _ -· __ 5.5 Systems_of_Linear Equations ___________________ ... 163
y~\.1 = yf) - r ( .±
J=l
o:;jyfJ - {3j) (i = 1, 2, ... , n). (5.69)
II*
164 5. Matrices. Determinants. Systems of Linear Equations
Yo= (~)
and the relative error e = 0.3, use the method of simple iteration to com-
pute the solution of the linear system
2x1 + X2 = 1,
[
X1 + 2x2 = 2,
or, in matrix form,
We have
2- A 1
1 2- A = (A - 2)2 - 1 = 0.
Whence A1 = 1 and A2 = 3.
Put
1
7 = - = 0.5.
2
Thus condition (5.70) is satisfied. Then from (5.68) we have
Whence
Exercises 165
Y= (g_75)
2
Exercises
1. Multiply the matrix A by the matrix B:
B= (D·
1 1 1 1
1 2 1 2
4. Reduce the matrices to schematic forms: (a) 1 1 3 1
1 2 1 4
1 2 3 4
3 4 5 6
(b)
5 6 7 8
31 23 55 42
166 5. Matrices. Determinants. Systems of Linear Equations
---
0 1 2 3
1 0 1 2
5. Compute the values of the determinants: (a) 2 1 0 1
3 2 1 0
1 2 3 4 1 2 3 4
3 6 8 11 3 4 5 6
(b) (c)
7 13 20 26 5 6 7 9
31 23 55 42 31 23 55 42
0 1 1 1
-1 0 1 1
(c) -1 -1 1
0
-1 -1 -1 0
- 3X1 + X2 + X3 = 0,
(d) ( 5x1 + x2 - 2x3 = 2,
-2X1 - 2x2 + X3 = -3.
3x1 + 2x2 + X3 = 0,
9. Compute the solutions of the systems: (a) ( 7x1 + 6x2 + 5x3 = 0,
5x1 + 4x2 + 3x3 = 0;
____EE.rcis~----- 167
Answers
1 2a ab+
2. ( 0 1 2b
2c) .
0 0 1
3. AB = (- 1), BA =( i -12 0)
-9 0 .
-3 0
1 1 1 1 1 2 3 4
0 1 0 1 0 -2 -4 -6
4. (a) 00 2 0 (b) 0 0 40 35
00 0 2 0 0 0 0
0 -1 1 -1
1 0 -1 1
(c) -1 1 0 -1
1 -1 1 0
1
1
or =A 1 +µ
0
X1
X2
(c) X3 =A +µ
X4
Xs
Chapter 6
Linear Spaces and Linear Operators
......
......
'' ......
' ' ......
a+b '
(a) (b)
Fig. 6.1
(a)
(b)
Fig. 6.2
Fig. 6.3
L(X) = [y .±
=
J=l
O!jXjlXj EX; O!j E IR; q = 1, 2, ... ]
6.2 Linear Subspaces 173
Properties of linear spans. (a) The linear span L(X) contains the set X.
(b) The linear span L(X) forms a linear subspace in V.
◄ This follows from the fact that the sum of linear combinations of vec-
tors in X and the multiples of linear combinations by an arbitrary number
are linear combinations of vectors in X. ►
(c) The linear span L(X) is the smallest linear subspace containing the
set X.
In other words, if the linear subspace W contains the set X, then W
also contains the linear span L(X) of X.
◄ Indeed, an arbitrary linear combination a1x1 + a2X2 + ... + aqXq of
vectors in X, being an element of L(X), is also contained in W. ►
1 ....
Fig. 6.4
Examples. (1) Let ~ = (1, 1, 0) and .,, = (1, 0, 1) be two vectors in the
linear space IR 3• Then the set of solutions of the equation
~· + ~2 - ~3 =0 (6.2)
is the linear span L(t 71) of the vectors ~ and .,,.
◄ Indeed, the triples (1, 1, 0) and (1, 0, 1) form the fundamental system
of solutions of the homogeneous equation (6.2). Hence each solution of
(6.2) is a linear combination of fundamental solutions (Fig. 6.4). ►
(2) Let C(- oo, oo) be a linear space of real-valued functions continuous
_174.. 6. Lin~ar Spaces and Lin~ru: QQeratorL. ____________
i.e., the vector Xq is a linear combination of the vectors x1, x2, ... , Xq.
Conversely, if one of the vectors is equal to the linear combination of
the others
/31X1 + /3-$-2 + ... + f3q- 1Xq- 1 = Xq
then transposing Xq to the left, we have the linear combination
{31X1 + {3-$-2 + ... + /3q-t'Xq-1 + (-l)Xq = ()
that contains at least one nonzero coefficient ( - 1 -;rt 0). This means that
the vectors x1, x2, ••• , Xq are linearly dependent. ►
Theorem 6.2. Let x1 , x2, ... , Xq be linearly independent vectors and let
y = a1x1 + a2X2 + ... + aqXq. Then the coefficients a1, a2, ... , aq are
uniquely defined by y.
6.4 Basis and Dimension ) 175
◄ Let
Then
a1X1 + a2,X2 + ... + aqXq = f31x1 + f32X2 + ... + {3qXq,
Whence
(a1 - f31)x1 + (a2 - f32)X2 + ... + (aq - {3q)Xq = 8.
Since the vectors x1, are
x2, ... , Xq linearly independent,
a1 - (31 = a2 - f32 = ... = aq - {3q = 0. Hence, a1 = (31, a2 = f32, ... ,
aq = {3q, ►
Theorem 6.3. A collection of vectors containing linearly dependent vec-
tors is linearly dependent.
Fig. 6.5
◄ Let X1, X2, ... , Xq, Xq + 1, ... , Xm be a collection of vectors such that the
first q vectors are linearly dependent. Then
a1X1 + a2,X2 + ... + aqXq =8
and not all a1, a2, ... , aq are equal to zero. Adding to this linear combina-
tion the multiples of Xq + 1, Xq + 2, ... , Xm by zero, we obtain the linear combi-
nation
a1X1 + a2,X2 + ... + aqXq + 0Xq+ 1 + ... + Oxm = 8,
where not all a; are equal to zero. ►
Example. The vectors in V3 are linearly dependent if and only if they
are coplanar (Fig. 6.5).
Fig. 6.6
By Theorem 6.2 the numbers ~1, ~ 2, ... , ~n, called the coordinates of
the vector x relative to the basis e, are uniquely defined.
n n
Let x = ~ ~ie; and y = ~ r,'e; be vectors in V. Then
i=l i=l
n n n
x + y = ~ ~;e; + ~ r,'e; = ~ (~; + r,'}e;
i=l i=l i=l
and, for any number a,
n n
ax =a ~ ~ie; = I; (a~;)e;.
i=l i=l
be expansions of the vectors x1, x2, .... , Xq relative to the basis e and let
1;l 1;! 1;~
l;r I;~ I;~
' ... , ..:
or, equivalently,
S;l I;} 1;i 0
'l\1
l;r + A2 l;i + ... + Aq 1;: - 0
(6.5)
1;1 I;~ 1;: 0
This means that the linear combination of column-vectors of coor-
dinates of x1, x2, ... , Xq is equal to the zero column-vector.
If we suppose that (6.5) holds, then reversing our arguments, we arrive
at formula (6.4). Whence we infer that if a nontrivial linear combination
of x1, x2, ... , Xq (i.e. a linear combination with A1, A2, ... , Aq not all equal
to zero) vanishes, then a nontrivial combination of column-vectors of the
12-9505
178 6. Linear Spaces antj Linear Operators
coordinates of x1, x2, ... , Xq with the same numbers /\1, A2, ... , l\q is equal
to the zero column-vector and vice versa. ►
Theorem 6.5. Let a basis in V comprise n vectors. Then any system
of m (m > n) vectors in V is linearly dependent.
◄ By Theorem 6.3 it suffices to consider the case of m = n + 1.
Let x1, x2, ... , Xn + 1 be arbitrary vectors in V. Expanding these vectors
relative to the basis e = (e1, e2, ... , en), we have
X1 = ~lei + ~Ie2 + ... + ~1en
X2 = ~½e1 + ~ie2 + ... + ~~en,
.....................................................
K=
........................
Whence a1 = a2 = ... = an = 0.
Besides, any vector~ = (e, e, ..., ~n) in IRn may be expressed as a linear
combination of e1, e2, ... , en, i.e.,
~ = eo, o, ... , o, o> + ~2(0, 1, ... , o, o>
+ ... + ~n(0, 0, ... , 0, 1) = (e, I/, ... , ~n).
This means that the dimension of [Rn is equal to n. ►
(2) Recall that the homogeneous linear system
a11X1 + a12X2 + ... + a1nXn = 0,
a21X1 + a22X2 + ... + a2nXn = 0,
(6.7)
because the linear independence of a1, a2, ... , ak implies that the nontrivial
linear combination
A1a1 + A2a2 + ... + Akak + µb = (J
involves µ "#- 0.
Suppose that each vector b in V admits a representation of the form
(6.7). In this case the vectors a1, a2, ... , ak form a basis in V by definition.
However, this contradicts to the fact that the number k of the vectors is
smaller than the dimension n of V. This means that there must exist a vector
ak + 1 in V such that the vectors a1, a2, ... , ak, ak + 1 are linearly independent.
If k + 1 = n the vectors a1, a2, ... , ak, ak + 1 form a basis in V.
If k + 1 < n we repeat the previous reasoning for the vectors a1, a2,
... , ak, ak + 1.
This process enables us to complement the collection of the vectors a1,
a2, ... , ak by the vectors ak+ 1, ak+2, ... , an ~o that these n vectors form
a basis in V. In other words, we may always construct a basis of V which
incorporates the basis of a given subspace of V. ►
Example. Construct the basis of IR4 by complementing the collection
of the vectors a1 = (1, 2, 0, 1) and a2 = (-1, 1, 1, 0).
◄ We choose the vectors a3 = (1, 0, 0, 0) and at = (0, 1, 0, 0) in IR 4 • Let
us show that the vectors a1, a2, a3, at form a basis of IR 4.
6.5 Changing a Basis
- -----·· - ---- -···- ----~- ---.·,-·~--···--·-··-·-·--· -·
181
. - - - -··· ···--- - - - ~-~~ --~-- -··--··-----~-------··•--
1 2 0 1
-1 1 1 0
A=
1 0 0 0
0 1 0 0
a12 <Xtn
a22 a2n
...............................
The matrix
1 1
a1 O'.n
A= ai a~
n
a2
describes the transition from the basis e to the basis e' and is called the
transition matrix.
Properties of transition matrices. (a) The determinant of A is not equal
to zero, i.e., det A ;r. 0.
Let us assume the converse, i.e., det A = 0. This means that the columns
in A are linearly dependent.
◄ Since the column-vectors in A are the column-vectors of the coor-
dinates of e{, ei., ... , e~ relative to the basis e, Theorem 6.4 implies that
e{ , ei , ... , e~ are linearly dependent vectors. However, this contradicts to
the fact that e' is the basis of V. Consequently, the assumption that
det A = 0 is false. Hence, det A ¢ 0. ►
(b) If t1, e, ...,
tn and t'1, t'2, ..., t'n are the coordinates of a vector
x relative to the bases e and e ', respectively, then
e t,1
t2 =A t'2 (6.10)
Theorem 6.8. For any two vectors x and y of the Euclidean space there
holds the inequality
(x, y)2 ~ (x, x)(y, y)
(the Cauchy-Schwarz inequality).
◄ If (x, x) = 0, then x = (J and the inequality holds since (8, y) = 0.
Let (x, x) ¢ 0. Then (x, x) > 0.
_184 ___ 6. __!j_near Spaces_ and Linear Operators-· ·····-·----· -·· _______ ·- .....
--- --
------ lx+Y
I
I y
I
I
I
(f; f ·) = [ 1 for ~ = !,
' 1 0 for 1 -;rt J.
6.7 Orthogonalization
Let f1, f2, ... , fk be linearly independent vectors in a real Euclidean
space. We describe the procedure of constructing a collection of k or-
thogonal vectors using f1, f2, ... , fk (Fig. 6.9).
◄ Set &1 = f 1. For the vector &2 = f2 - a1&1 to be orthogonal to g1 it is
necessary that the identity
0 = (f2, &1) - a1(&1, &1)
holds.
Whence
a1 =
186 6. Linear Spaces and Linear Operators
9,=f1
(a) (b)
Fig. 6.9
Using g1, g2, f3, we construct the vector g3 = f3 - f31g1 - f32g2 which
is orthogonal both to g1 and to g2. It suffices to require that the numbers
(31 and (32 satisfy the following conditions
0 = (f3, g1) - f31(g1, g1) and O = (f3, g2) - f32(g2, g2).
Whence
(f3, g1)
(3 1 = (g1, g1)
so that the vector
g3 = f3 _ (f3, gi) g1 _ (f3, g2)
(g1, g1) (g2, g2) gz
is orthogonal both to g1 and to g2.
Similarly, the vector
. _ f· _ (f;, g1) _ (f;, g2) _ _ (f;, g;- 1)
g, - r ( ) g1 ( ) g2 ... _(_ _ _ _)_ g; - 1
g1, g1 g2, g2 g;_ 1, g;-1
(i = 3, 4, ... , k)
is orthogonal to each of the vectors g1, g2, ... , g; _ 1.
Therefore the vectors g1, g2, ... , gk form a collection of orthogonal
vectors.
Dividing each vector g; (i = 1, 2, ... , k) by its length lg;I we obtain the
6.7 Orthogonalization 187
The basis e = (e1, e2, ... , en) of the Euclidean space is called orthonor-
mal if
(e;, ej) = ou = [ 01 ii = j
-;c. j (i, j = 1, 2, ... , n).
Fig. 6.10
so that
1 (1, 2, 0, 1)
b2 = ( -1, 1, 1, 0) - 6 = ( - 67 , 32 , 1) ·
1, - 6
188 6. Linear Spaces and Linear Operators
- - - - . ---- -- --- ------ --- --- ---
1
e2 = ( - 7, 4, 6, 1),
..J1o2
1
e3 = (6, -1, 7, 4).
..J1o2
The vectors e1, ez, e3 form an orthonormal basis of the Euclidean
space. ►
If we wish to compute the scalar product of two vectors in a Euclidean
space it is helpful to expand the vectors relative to the orthonormal basis
because in this case the desired product can be expressed by the simplest
formula.
__________ §.~_Qrthoc9_!_11_1tlimenJs of Linear Sub~~-~~----------- ________________1_8_9
we have
In particular
n
(x, x) = ~ (~;)2 •
;=1
Whence
◄ This follows from the fact that the zero vector is the only vector con-
tained both in W and in W .1 • ►
Property (b) implies that any vector x in V is uniquely expressed as
X =y+ Z,
Fig. 6.11
e, = ( Jz , 0, 0, - ~) and e2 = ( 0, ~ , - Jz , 0) ·
The vector
y =( ~ - ~) G.o. o. - Jz)
+ ( 1 - 1) (o, Jz , - ~ , o)
- (- ; , 0, 0, D + ( 0, - ; , ; , 0) = ; ( -1, -1, 1, 1)
Fig. 6.12
-
(2) The rule that sets every vector x in V into correspondence with a
vector AX in V, where ~ is a given scalar, is a linear mapping, called the
similarity mapping (Fig. 6.12).
(3) Let e = (e1, e2, ... , en) be a basis of V. The rule 9: V---+ V, which
associates every vector n
X = ee1
+ ,E2e2 + ••• + ~kek + ... + ~nen = I; fe;
i =1
in V with the vector
k
9x = ~1e1 + ~2e2 + ... + ~kek = I; ~;e;
i= 1
6.10 Linear Mappings 193
is a linear mapping
~: T2-+T2.
..... ...... \.
................ \.
.....
Fig. 6.13
(2 ') The image of the similarity mapping coincides with the linear
space V.
(3 ') The image of the projective mapping 9: V ➔ V is the subspace
Wk = L(e1, e2, ... , ek),
(4 ') The image of the differential mapping !iJ: T2 ➔ T2 coincides with
the linear space T 2.
Proposition. The image of the linear mapping N: V ➔ W is a linear
subspace of W.
◄ Let Y1 and Y2 be vectors in im d. This means that in V there exist vectors
x1 and x2 such that st'x1 = Y1 and Nx2 = Y2,
From the formula
A1Y1 + Ai.Y2 = At st'x1 + A2 Nx2 = d(A1x1 + AzX2)
it follows that any linear combination of the vectors Y1 and Y2 also belongs
to im N. ►
The dimension of the image of the linear mapping is called the rank
of the linear mapping.
In symbols, we write rank (N) to mean the rank of the linear mapping
N.
The linear mappings SJf': V ➔ W and !18 : V ➔ W are said to coincide
if for any vector x in V there holds Nx = !18 x.
In symbols, we write d = !18 to mean the coincidence of N and !18.
Theorem 6.11 (on construction of a linear mapping). Let V and W be
two linear spaces and e = (e1, e2, ... , en) be a basis in V, and let f 1,
f 2, ... , f n be arbitrary vectors in W.
Then there exists the only linear mapping
SJI: V ➔ W
such that
Nek = fk (k = 1, 2, ... , n). (6.13)
First, we prove that a linear mapping satisfying (6.13) exists.
◄ Consider the expansion of some vector x in V with respect to the
basis e
n'
k.ercfl
X
I
I
I
--1-~
I
I
I
I
I
i:-.:;_----~:Px
im ,f{
Fig. 6.14
dx = :t
k=l
~kfk =
k=l
:t ~k ~ ek = ~ ( ±
k=l
~kek) = ~ x.
Whence we conclude that the linear mappings d and ~ coincide. ►
From Theorem 6.11 it follows that we may define a linear mapping by
identifying its action on the base vectors of V.
13*
196 6. Linear Spaces and Linear Operators
AX =0
form the kernel of the mapping
d: fRn x 1 ➔ fRm x 1.
The dimension of the kernel of the linear mapping dis called the nul-
lity of Sil.
In symbols, we write dim ker N = nullity Ne
Notice that for any linear space d: V--+ W the following identity holds
rank N + nullity N = dim V. ( •)
Operations on linear mappings. Let d: V --+ W and ~ : V --+ W be two
linear mappings.
The sum of the linear mappings d and ~ is the mapping -67: V --+ W
such that given any x in V
£x=dx+~x.
It is easy to verify that the mapping -67 is linear. Indeed,
-67 (),.x + µ,y) = d(),.x + µ,y) + ~ (Ax + µ,y)
= A(.QtX + ~) + µ{dy + ~y) = A£X + µ,£y.
In symbols, we denote the sum of d and /Jd as -67 = d + 11d.
The product of the linear mapping d: V --+ W by an arbitrary number
a is a mapping ~ : V --+ W such that given any x in V
~x=adx.
6.11 Linear Operators 197
------ -----,
I I
I I
I I
I I
I I
0 {fi..8)1;
Fig. 6.15
Definition. The linear operators Ji?": V --+. V and !18 : V --+. V are said to
be equivalent if given any vector x in V there holds Ji?"x = !Jlx.
Let d: V -+ V be a linear operator.
The linear operator !18 : V -+ V is said to be an inverse of d if
(6.15)
where /: V--+. Vis the identity operator, i.e., /x = x for any vector x in V.
Theorem 6.12. For the linear operator d: V-+ V to be invertible it is
necessary and sufficient that the image of d coincide with the space V,
i.e., im d = V.
◄ Suppose, first, that there exists a linear operator !18 inverse to the given
linear operator d. _
Recall that im d of dis a subspace of V.
We show that an arbitrary vector y in V belongs to im d
Let x = ~y. By (6.15) we have
dx = d(~y) = (~)y = /y = y.
Whence we infer that the vector y is the image of the vector x = ~ y
and, consequently, y belongs to im d Therefore, im d = V. ►
Now we suppose that the image of .9/ coincides with the space V, i.e.,
imd= V. Then
rank d= dim V
6.11 Linear Operators 199
and the linear operator d maps a basis of V into another basis of V, i.e.,
d: e = (e1, e2, ... , en) ➔ f = (f1, 12, ... , ln),
where fk = dek (k = 1, 2, ... , n).
Consider a linear operator §8 such that
Bilk = ek (k = 1, 2, ... , n). (6.16)
(~)x = (~)
n
Ct ~kek) = 9J
n
(J, t.iafek)
n
= ~ ~k Bl(dek) = ~ ~k Bilk= ~ ~kek = x.
k=l k=l k=l
and
n n n
(~)x = ~ 11k d(Blfk) = ~ 11k dek = ~ 11kfk = x.
k=l k=l k=l
~· 1
,,,,,,.-- .............
/
I '-
I \
/1 ~2
I
/
_,/
-1
Fig. 6.16
The matrix
A= A(e) = ...........................
whose columns comprise the coordinates of the images of the base vectors
is called the matrix of the linear operator d relative to the basis e.
2
Examples. (1) Relative to the basis eo = 1, e1 = t, e2 = ~! , e3 =
3
~! the matrix D(e) of the differential linear operator ~: M3 ➔ M3 takes
6.12 Matrices of Linear Operators 201
the form
0 1 0 0
0 0 1 0
D(e) =
0 0 0 1
0 0 0 0
(2) Relative to the basis e1 = cos x, e2 = sin x the matrix D(e) of the
differential linear operator .!»: T2 -+ T2 takes the form
D(e) = ( _~ ~)•
since
and
Let
y=Jdfx
and let
n n
x = b ~'e; and y = b r/ek
i=1 k=l
which comprise the coordinates of x and y relative to the basis e are related
as
y(e) = A(e)x(e). (6.17)
◄ Indeed, since the expansion of the vector y relative to the basis e is
unique, comparing
with
202 6. Linear Spaces and Linear Operators
yields
n
k i
~ a;~ (k = 1, 2, ... , n).
i =1
Arranging these n identities in matrix form, we have
.r,1 al a½ a~ e
r, 2 aI a~ a~ ~2
Setting
n
'Y'I'= ~ {J'!/cxf (i, m = 1, 2, ... , n) (6.18)
k=l
we may write
C(e) = ("/'/'). (6.19)
On the other hand, since A(e) = (af) and B(e) = (/J'f) (6.18) and (6.19)
yield
C(e) = B(e)A(e). (6.20)
Thus, relative to the basis e, the matrix of the linear operator !J8d is
equal to B(e)A(e). ►
From the equivalence of the operation of multiplication of linear opera-
tors and that of multiplication of the respective matrices it easily follows
that the matrix of the linear operator d - 1 which is inverse to the opera-
tor d is the inverse of the matrix A of d
◄ Indeed, by definition of the inverse operator we have
det (A - ti)=
we can write
x(t) = det (A - ti) = ( - Itt" + 'Yn - 1t" - l + ... + -y1t + 'YO·
6.13 Eigenvalues and Eigenvectors 205
! (1) =0 X 1 = 0.
9 = _!}_
dt ·. T2 ➔ T2
has no eignvectors.
◄ Consider the trigonometric polynomial a cos t + {3 sin t.
Using (*), we have
9 (a cos t + {3 sin t) = A (a cos t + {3 sin t).
This means that
- a sin t + {3 cos t = A.a cos t + A{3 sin t,
or
(A{3 + a) sin t + (Aa - (3) cos t = 0,
which is fulfilled if and only if
a + A{3 = 0 and A.a - {3 = 0.
Whence a = {3 = 0. Consequently, the polynomial a cos t + {3 sin t is a
zero one and can not be the eigenvector of !!I. ►
206 6. Linear Spaces and Linear Operators
Theorem 6.15. The number >.. is the eigenvalue of the linear opera-
tor ,S>;/ if and only if >.. is the root of the characteristic polynomial x(t)
of d i.e. x(>..) = 0.
◄ Suppose that >.. is the root of the polynomial x(t) in which case
e 0
(A(e) - >..I) e -
0
tn 0
or
(al - o:1t2
>..)€1 + + ... + O:n1 tn =0
a~e + (o:i - >..)f + ... + o:~tn ::;:: 0,' (6.25)
................................................................
By virtue of (6.24) system (6.25) has the nonzero solution t1, t2, ... ,
tn and
n
X = ~ ~;e; -;c Ov.
i =1
Fig. 6.17
( ~ ~ ~)
0 0 0
and ( ~0 ~0 -1~ )
respectively. Then we may write the homogeneous linear systems as
rt x y
o= o
~: and rt ~ o.~:
-z =
The corresponding fundamental systems of solutions are
D=(~~~)-
0 0 0
Then the characteristic polynomial - A3 = 0 has the root A = 0 of mul-
tiplicity 3.
Consider the homogeneous linear system
{3=0]
2"( = 0
0=0
whose solution is 1, 0, 0. The triple 1, 0, 0 corresponds to the polynomial
of the zero degree which is the eigenvector of fg, ► ·
6.14 Adjoint Operators 209
14-9505
210 6. Linear Spac_es and Linear Operators
and
Whence
(x, ,yg y) = (x, -t'y)
and
(x, ~Y --t'y) = 0.
Since x is an arbitrary vector in V we infer that the vector ~ y - -ef'y
is orthogonal to every vector in V and, consequently, to itself. The latter
implies that ~Y --t'y = 6 and §&y = G'y. Hence, §& = -ef' since y is an ar-
bitrary vector in V. ►
From Property (a) it immediately follows that
(b) (aw")"' = ot);f/*, where o: is an arbitrary real number;
(c) (.w+ ~)* = .w'* + ~•;
(d) (N~)* = ~ * .w*;
(e) (d*)* = d
We shall also mention the other two important properties of adjoint
operators, namely
(f) Let e be an orthonormal basis of V. Then for the linear operators
N: V ➔ V and ~: V ➔ V to be mutually adjoint, i.e., to satisfy both
~ = N* and d = 11d * it is necessary and sufficient that relative to the basis
e the matrix of one of the operators, say B = B(e), be obtained by transpos•
ing the matrix A = A(e) of the other operator so that B = A'.
Notice that this property holds true only if A and B are matrices ar-
ranged relative to an orthonormal basis and violates otherwise.
(g) If a linear operator N is nonsingular so is the adjoint operator d*
of dand
(.w - 1)• = (.w'* ) - 1.
(100)
0 1 0 ,
0 0 0
since 9i = i, 9j = j and 9k = fJ. This means that the operator 9is sym-
metric.
Fig. 6.18
and
(x1, dx2) = (x1, A2X2) = A2(x1, x2).
Whence
A1 (x1, x2) = A2(X1, x2)
and C"-1 - A2)(x1, x2) = 0.
Since A1 - A2 ;e. 0 we arrive at
(X1, X2) = 0. ►
(e) Let d: V -+ V be a symmetric operator. Then in V there exists an
orthonormal basis e = (e1, e2, ... , en) comprising the eigenvectors of d
so that
de; = A;e; (i = 1, 2, ... , n),
(e;, ej) = ou (i, j = 1, 2, ... , n).
Turning back to the previous example we easily see that the triple
(i, j, k) is the desired orthonormal basis in V since the vectors i and j are
the eigenvectors of 9corresponding to the eigenvalue 1 (of multiplicity 2)
and k is the eigenvector corresponding to the eigenvalue 0.
(t) If a nonsingular operator d: V-+ V is symmetric so is its inverse
Jd'- 1: V-+ V.
Remark. All the eigenvalues of a nonsingular operator are distinct from
zero. Indeed, if A ;e. 0 is the eigenvalue of a nonsingular operator Ji{,
then 1/A is the eigenvalue of the inverse operator d - 1.
We shall say that a symmetric operator d' is positive if given any non-
zero vector x in V there holds (dx, x) > 0.
Properties of positive operators. (a) A symmetric operator Sif: V-+ V
is positive if and only if all the eigenvalues of dare positive.
(b) A positive operator is nonsingular.
(c) If an operator dis positive so is its inverse.
We shall also say that the function JJf (x, x) is defined in an n-dimensional
Euclidean space V.
We may also associate with any quadratic form d(x, x) the bilinear
form
n n
d(x, y) = I; I; au~ir,1, (6.35)
i = lj = 1
where ,,1, r,2, ... , 11" are the coordinates of the vector y relative to the
orthonormal basis e = (e1, e2, ... , en) so that
Y = '] e 1
1 2 n
+ 'Y/ e2 + ,,. + 'Y/ en ,
The form (6.35) is called bilinear since it is linear in both the argument
x and the argument y so that
.W(a1X1 + OL2X2, y) = a1N(x1, y) + a2d(x2, y)
and
d(x, f3tY1 + f32Y2) = f31d(x, Y1) + /32.w'(x, Y2),
where a1, .a2, /31 and f32 are arbitrary numbers.
The bilinear form (6.35) is symmetric since its value is independent of
the order in which x and y occur in (6.35), i.e.,
..Qf(y, x) = d(x, y).
Computing the value of d(x, y) for the base vectors, i.e., for x = ek,
y = em, we obtain
(6.36)
Whence it follows that the elements of the associated matrix A of the quad-
rati~ form (6.34) are the values of the bilinear form computed for the vec-
tors of the basis e.
The scalar product of two vectors in an n-dimensional coordinate space
IR"
6.16 Quadratic Forms 215
( CXQll CX22. • Q)
• CXnn •
Theorem 6.16. For any quadratic form defined over a Euclidean space
there exists an orthonormal basis relative to which the associated matrix
becomes diagonal.
◄ To prove this theorem we shall use the arguments that follow from
properties of symmetric operators.
We choose the orthonormal basis e = (e1, e2, ... , en) and consider the
linear operator N: V ~ V such that, relative to e, the matrix (o:j) of N'
is identical to the matrix (o:u) of a given quadratic form, i.e., cxJ = au,
Since (cxJ) is symmetric so is the operator d
Let us compute (Nx, x). Since the basis e is orthonormal we have
(Ne;, e1) = ex{= cx;J,
and
(Nx, x) = (N (.± ~;e;), .± tei)
l=l J=l
n n n n
= ~ ~ ~;~(Ne;, eJ) = ~ ~ o:;J~;t = N(x, x).
i =lj = l i = lj = l
Whence we infer that the quadratic form N (x, x) defined over a Euclidean
linear space V and the symmetric operator N acting in V are related as
N(x, x) = (Nx, x). (6.37)
Recall that for any symmetric operator and, in particular, for ~ in V
there exists an orthonormal basis f = (f1, f2, ... , fn) comprising the eigen-
216 6. Linear Spaces and Linear Operators
vectors of sf so that
sffk = Akfk (k = 1, 2, ... , n); (fk, fm) = Dkm• (6.38)
Notice that
(k = m),
(.Wfk, fm) = (Akfk, fm) = Aklikm = ( ~k (k ~ m).
Substituting the expansion of x
n
X = ~ r/fk
k=l
relative to the basis f = (f 1, f2, ... , fn), into (<w"x, x), we have
(~~ !).
1 1 0
To find the eigenvalues we must solve
-'A 1 1
1 -'A 1 = -'A3 +3'A+2=0
1 1 -'A
yielding 'A1 = 2 and 'A2,3 = -1.
217
Thus we have
,w(x, x) = 2.x2 - y2 - i,2 •
X + Y - 2z = 0.
All solutions of the system are proportional to the vector (1, 1, 1)'.
. vector 1s
H ence, t he unit . 7
1 = ( 1 , v3
Y3 1 , v3
1 )'
o:o
has two linearly independent solutions and we have to choose them so that
they become orthogonal.
The system reduces to the single equation x + y + z = 0. Then the
desired solutions are (1, - 2, 1)' and (1, 0, -1)' and the unit vectors are
J = (l/v'6, - 2/Y6, 1/Y6)' and k = (1/v2, 0, - l/v2) '.
It is easy to verify that both the vector Jand the vector k are orthogonal
to the vector I. (Notice that this result also follows from Property (d) of
a symmetric operator.)
Then the desired orthonormal basis comprises the vectors
i= i+j +k ';' _ i - 2j + k
{f ,J- v6
Remark. We may accept any n-dimensional Euclidean space as V.
However, of practical interest is a coordinate space !Rn whose elements are
all possible ordered n-tuples of real numbers ~ = (~ 1, f, ... , ~n). The basis
of !Rn comprises the vectors (1, 0, ... , 0, 0), (0, 1, ... , 0, 0), ... , (0, 0,
... , 1, 0) relative to which the scalar product of two vectors ~ = (~1, e,
218 6. Linear Spaces and Linear Operators
. . .' tn) and 'fJ = (17 1 , 17 2 , ••• , 17n) is given by the formula
<t
,,,> = 1 + f 11 2+ ... + t1,n.
e11
We shall describe the procedure that enables us to choose the basis rela-
tive to which a given quadratic form specified over an n-dimensional coor-
dinate space becomes diagonal.
n n
◄ Let sd' (x, x) = ~ ~ a;j~; ~ be a given quadratic form.
i = lj = 1
Un 1 Cin2 . . . Cinn
Cinl Cin2
has exactly k linearly independent solutions that form the fundamental sys-
tem of solutions. On normalizing the solutiovs we obtain k pairwise unit
vectors.
Repeating this process for the other eigenvalues, we obtain exactly n
pairwise orthogonal unit vectors that comprise the orthonormal basis f 1,
f2, .•• , fn of IR". Notice that the vectors corresponding to the distinct eigen-
values are orthogonal by virtue of Property (d) of a symmetric operator.
Step 4. Write down J:lf(x, x), relative to the basis f = (f1, f2, •.• , fn),
in the diagonal form
d(x, x) = A1(17 1)2 + 'X.2(172)2 + ... + An(17")2,
where x = 17 1f1 + 1,2f2 + ... + 17"fn. ►
6.16 Quadratic Forms 219
au a12
a12 a22
> 0.
a1n <X2n . . . ann
.,,k = ~k (k = 2, 3, . . ., n),
we obtain
n n
d(x, x) = a11(17 1) 2 + I: I: a;111'r,',
i = 2j= 2
where
We look now at
n n
Ni (x, x) = I: I: a[j17;rf.
i = 2j= 2
◄ By the substitution
X = i + y, y = i - Y, z= z
.w(x, x) is reduced to the form
.w(x, x) = 2i2 - 2y2 + 4iz = 2(i2 + 2iz) - 2y2
= 2(x + z) 2 - 2J2 - 2z2 •
Set x = i + i y = y, z = z. Then
.w(x, x) = 2i2 - 2y2 - 22 2 • ►
Remark. The major shortcoming of the process of completing the
square is that it involves the trasformations of coordinates which are not
orthogonal, that is, the new coordinates taken in pairs are not orthogonal.
On comparing the diagonal forms of 2xy + 2yz + 2zx obtained by ap-
plying the procedure that involves identification of an orthonormal basis
and the procedure of completing the square we easily see that in both cases
the number of positive terms remains unchanged and so does the num-
ber of negative terms. This is an important property of quadratic forms
called the law of inertia which states that for any quadratic form the num-
ber of positive terms remains the same in all its diagonal forms and so
does the number of negative terms and the number of zero terms. Thus
these numbers are independent of procedures applied to reduce a given
quadratic form to a diagonal form.
X _ a Y _ e
== X + Ti' ' = y + A2
the equation is reduced to the form
AtX2 + A2 Y2 + J = 0.
[}
Fig. 6.19
and compute the roots A1, >..2 and }..3 of the characteristic polynomial
a11 '- t a12 CX13
a12 a22 - t a23
CX13 CX23 CX33 - f
X -_ X- + -x-;-
a14
' y _ - + a24
- y ---x;- ' z __ z-
the general equation is reduced to the form
2 2 - -
A1X + A2 Y + 2a34Z + a44 = 0.
6.17 Curves and Surfaces of the Second Order 225
a44
X = X, y = Y, Z = Z + ---
2a34
Table 6.1
x2 y2
>w>-2 >0 >-1 ·a« >0 -+-= 1 Elliptic
a2 b2 cylinder
x2 y2
Al ·>-2 >0 >-1 ·a« < 0 - + - = -1 Empty set
a2 b2
x2 y2
>-1 ·>-2 >0 0'.44 = 0 -+-=0 z-axis
a2 b2
x2 y2
>-1·>-2<0 0'.44 ¢ 0 ---=1 Hyperbolic
a2 b2 cylinder
x2 y2
>-1 ·>-2 < 0 <l'.44 = 0 _ - _ =0 Pair of
a2 b2 intersecting lines
as
Exercises
(0 0 1)
0 1 0
1 0 0
be a matrix of an operator .er/ relative to the basis 1, t, t2 of the space M2.
Find the matrix of .rr/ relative to the basis formed by the polynomials
3( 2 + 2t, 5t 2 + 3t + 1, 7( 2 + 5t + 3.
IG*
228 6. Linear Spaces and Linear Operators
14. Compute the eigenvectors and the eigenvalues of the operators defined
by the matrices
-1
1 -2 .
1
15. Let an operator define the rotation of a plane through the angle ; .
Find the operator adjoint to the given one.
16. Convert the quadratic form 2x 2 + 5y 2 + 2z 2 - 4xy - 2xz + 4yz to the
diagonal form.
17. Specify what surfaces are given by the equations
(a) 7x2 + 6y 2 + 5z 2 - 4xy - 4yz - 6x - 24y - 18z + 30 = 0;
(b) x 2 + 5y 2 + z 2 + 2xy + 6xz + 2yz - 2x + 6y + 2z = O;
(c) 5x 2 - y 2 + z 2 + 4xy + 6xz + 2x + 4y + 6z - 8 = 0.
Answers
1. The collection of polynomials of order not exceeding 2. 2. (a) yes; (b) yes.
4. For example, (0, 1, 0, 0), (0, 0, 1, 0). 5. (1, 1, 1). 6. 4; x,, x,, x,, x.,. 7. ,r/4. 8.
-i,f). (-t, -;, -½), (t• -½, -;).9.y=(O, -3,5,2),z=(2, -2, -2,2).
0,
11. The basis of the image is y, = (2, 1, 1), Yi = ( -1, 2, 1). The basis of the kernel is z =
•> It has not yet been established whether a set of natural numbers n such that the equa-
tion x" + 2 + yn + 2 = zn + 2 has positive integral solutions is empty or not. (In other words,
it has not yet been established if Fermat's last theorem is true or false.)
7.1 Basic Concepts 231
lal = ( a ~f a ~ 0,
-a 1f a< 0.
The inequality lxl ~ a, where a > 0, is equivalent to the relation
implies that
lal - lbl ~ la - bj.
Similarly, the relation
lbl = l(b - a) + al ~ lb - al + lal = la - bl + lal
yields
la - bl ~ lbl - lal
or
lal - lbl ~ - la - bl.
From (*) and (**) it follows that
- la - bl ~ lal - lbl ~ la - bl.
Whence we obtain the desired inequality
llal - lbll ~ la - bl. ►
Absolute and relative errors. We shall introduce some notions that are
widely used whenever we apply numerical methods to compute approximate
solutions of mathematical problems.
Let a be a true value of some quantity and a* be an approximation
to a. We shall call a the exact number and a* the approximate number.
The simplest measure to estimate the precision of the approximate number
a* is the absolute error of a*. We say that a positive number .6.(a*) is the
absolute error of a* if
la - a* I ~ .6.(a*).
This definition of the absolute error is rather ambiguous. For example,
if both a and a* are known the absolute error of a* is exactly equal to
the absolute value of the difference between a and a*. However, we may
not know the value of a. In this case inequality(***) means that the abso-
lute value of the difference between a and a* does not exceed .6.(a*) and,
consequently, any other positive number larger than .6.(a*) may also be
regarded as the absolute error of a*.
The absolute error refers to the precision of approximation and tells
us nothing about the accuracy. For example, if we have made two measure-
ments of temperature with the same absolute error equal to 0.2 °C and
found the readings 1000 °C and 10 °C, both measurements are to the same
level of precision. However it is easy to see that the former is more accurate
than the latter.
The accuracy of an approximation refers to the relative error. We say
7.1 Basic Concepts 233
The absolute and relative errors are usually represented by numbers with
two or three significant digits.
Number line. Real numbers can be conveniently displayed as points on
a line (Fig. 7.1 ).
Suppose that we have drawn a line and fixed the direction, the origin
0 and the unit distance e for this line. To display a real number a on the
line we choose a point to the right of O such that the distance between
this point and the origin O is equal to a if the number a is positive. When
e
0 a
Fig. 7.1
Fig. 7.2
(ii) for any (whatever small) number e > 0 there exists a number x* EE
such that M - e < x* ~ M.
In other words, the supremum of Eis the least upper bound of E. We
shall denote the supremum of E by M = sup E or M = sup {x}.
xeE
If Eis a set unbounded above we put the supremum of E equal to + oo
and write sup E = + oo.
Infimum of a set. Let E be a set bounded below, i.e., let there exist
a number a such that a ~ x for all x E E. Then a is called a lower bound
of E. Clearly, any number smaller than a is also a lower bound of E.
Definition. The number m is called the infimum of E provided that
(i) for any x E E there holds x ~ m;
(ii) for any (whatever small) number e > 0 there exists a number x* EE
such that m ~ x* < m + e.
Thus, the infimum of E is the greatest lower bound of E. We shall
denote the infimum of E by m = inf E or m = inf {x}.
xeE
If E is a set unbounded below we put the infimum of E equal to - oo
and write inf E = - oo.
To illustrate the notions of supremum and infimum of a set we consider
the following examples. If E = [a, b] then inf E = a and sup E = b. If
E = (a, b) then again inf E = a and sup E = b. Notice that inf E and sup E
are contained in E in the former example and do not belong to E in the
latter one. For the set E = [ 1, ~ , ... , ! , ... ] we have inf E = 0 and
sup£= 1.
In conclusion we state the following theorem.
Theorem 7.1. A non-empty set of real numbers which is bounded above
has a supremum and a non-empty set of real numbers which is bounded
below has an infimum.
Logical symbols and connectives. To abbreviate the notation and simpli-
fy the definitions we shall use logical symbols and connectives.
The universal quantifier V is read "For every", "For any", "For each",
"For alP'.
The existential quantifier 3 is read "There is", "There exist", "There
exists at least one ..." or something equivalent.
We say that a declarative sentence is a statement if it can be true or
false but not both. For example, the sentences "Mathematics is a science",
"The number 2 is smaller than the number 3" and "The number 6 is a
prime" are statements whereas the sentences "Close the door" or "How
old are you"? are not. We shall denote the statements by Greek letters a,
~, 'Y, etc.
7.1 Basic Concepts 237
The implication a ~ (3 (read "if a then /3" or "a implies /3,,) is a false
statement if and only if a, called the antecedent, is true and /3, called the
consequent, is false. We have to distinguish the implication from the cause-
and-effeet relation. V nlike the latter the implication a ~ (3 is true whenever
the antecedent a is false. In other words, a false statement implies any state-
ment, e.g., "if 2 x 2 = 5 then the unidentified flying object has landed near
your house".
The equivalence a # (3 (read "a if and only if /3") means that the state-
ments a and /3 are logically equivalent.
The conjunction a A. (3 (read "a and /3") is a compound statement made
up of the statements a and (3 connected by the conjunction and. The con-
junction a A. (3 is regarded as a true statement if and only if both a and
(3 are true.
The disjunction av /3 (read "a or /3") is a compound statement made
up of the statements a and /3 connected by the conjunction or. The disjunc-
tion is thought of as true if and only if at least one of the statements is true.
Let a be a statement. The statement a (read "not a•) is called the nega-
tion of a, a being true if a is false and vice versa.
To negate a statement that involves the quantifiers we have to substitute
every universal quantifier for the existentional quantifier and vice versa and
replace the antecedent by its logical opposite so that if /3 ~ 'Y then
(3 ~ 'Y # (3 A. :y.
Necessary and sufficient conditions. Let the theorem "If the statement
a is true so is the statement /3" be true. The statements a and /3 that can
be compound statements are called the hypothesis and conclusion, respec-
tively. The theorem can be symbolized as the implication a ~ (3 and can
also be expressed as
a is a sufficient condition for (3
or
/3 is a necessary condition for a.
Now we shall find out what we mean when speaking of the necessary
and sufficient conditions.
Let /3 be a statement. We say that a statement a is a sufficient condition
for (3 if a implies /3 and a is a necessary condition for (3 if a follows from (3.
Let a and /3 be two statements given as follows
a: "The number x is equal to zero",
(3: "The product xy is equal to zero".
Then a~ (3.
◄ Indeed, for xy to be equal to zero it is sufficient that x be equal to
zero. For x to be equal to zero it is necessary that xy b"e equal to zero.
238 7. An Introduction to Analysis
But /3 is not a sufficient condition for a since x can be distinct from zero
when xy is equal to zero. ►
If a and /3 are the statements each of which implies the other, i.e., a ~ /3
and /3 ~ a, we say that each of a and /3 is the necessary and sufficient
condition for the other and write
a <=> ./3.
The following expressions all mean that a is the necessary and sufficient
condition for /3 and vice versa:
(a) for a to be true it is necessary and sufficient that /3 hold;
(b) a holds if and only if /3 is satisfied;
(c) a is true if and only if /3 is true.
Mathematical induction. It is not a rare case when a statement which
is true in some particular instances turns out to be false in general. For
example, if we compute the values of 991n 2 + 1 for the subsequent natural
numbers 1, 2, 3, ... , 10 10 we shall fail to get at least one value which is
equal to the square of a natural number. Based upon this experience we
might conjecture that the expression 99ln 2 + 1 will never produce squares
of natural numbers when n is natural. However this conclusion would be
false. The point is that the smallest n such that the value of 991n 2 + 1
becomes equal to the square of a natural number is extremely large, viz.,
n = 12055735790331359447442538767.
Against this background it seems reasonable to draw our attention to the
following problem. Let there be a statement which is true in some particular
cases. How can we prove that it is true in general without having to verify
it for each particular case that would be an impossible task?
An important tool which enables us to answer the imposed question
is the mathematical (complete) induction method based on the principle
of mathematical induction.
Principle of mathematical induction. Let a be a statement that is true
for certain n. Then a is true for all natural n provided that
(a) a is true for n = 1;
(b) if a is true for n = k then a is true -for n = k + 1.
This principle lays down the basis of mathematical reasoning.
To illustrate how the principle of mathematical induction works we shall
prove Bernoulli's inequality:
If h > - 1, then
(1 + ht ~ 1 + nh V n E rN. (*)
Clearly, inequality (*) is true for n = 1. Assume that (*) has been proved
for n = m > 1, i.e.,
(1 + h)m ~ 1 + mh.
7.2 Sequences of Numbers 239
g] = I I
l, 2 ' 3
I
' · · ·' n ' . . .'
{2n } = 2, 4, 8, ... , 2n, ... ;
{I} = 1, I, I, ... , I, ... ;
f (- l)n + t ] = 1 - _I_ _I_ .
t n2 ' 22 ' 32 ' ... ' ' ... '
*> We have to distinguish the sequence {On} from the set {On}. For example, {5} =
5, 5, ... , 5, ... is a sequence whereas the set ( 5} contains a single element, 5.
240 7. An Introduction to Analysis
We shall write
A = lim an or an --+ A
• •
a,
..
Fig. 7.3
Ian - A I < e is equivalent to A - e < an < A + e which implies that an
lies in the e-neighbourhood of A. Hence, A is the limit of {an} if, given
any e-neighbourhood of A, there exists a number N such that all an with
n > N are contained in this e-neighbourhood of A, i.e., in the open interval
(A - e, A + e). Thus, only the finite number of terms a1, a2, ... , aN can
be outside the open interval (A - e, A + e). Whence it follows that the
sequence with all terms equal to A, called the stationary sequence, has the
limit equal to A.
A sequence is said to converge if it has a finite limit and to diverge
otherwise.
Example. Consider the sequence [ n ; 1 ], a. = n ; 1 being the nth
n+l
term. Clearly, the larger n the closer to 1 is the fraction - - - --l+-1.
n n
This prompts us to assume that
. an = 1·Im -
1lm +-
n- 1 = 1.
n-+oo n-+oo n
◄ To prove that our assumption is true we shall take an arbitrary e > 0
and show that there exists a number N such that for all n > N
n + 1 1
Ian - 11 = ----1 =- <e. (7.2)
n n
From the inequality _!_ < e we have n > _!_ • Then for any natural N
n e
exceeding _!_ and all n > N there holds _!_ < e, i.e., (7 .2) is true. Hence,
e n
by virtue of the definiton of a limit we have lim an = 1. ►
n ➔ oo
7.2 Sequences of Numbers 241
choose any number N such that Ian - A I < e for all n > N. Therefore we
do not need to find the smallest number N satisfying (7.1 ).
How we shall prove two important theorems on limits of sequences.
Theorem 7.2 (Cauchy convergence criterion). For the sequence ai, a2,
... , an . . . to converge it is necessary and sufficient that for any e > 0
there exists N such that for all n > N and all m > N
Ian - aml < s.
A sequence {an ) satisfying Theorem 7.2 is said to be a Cauchy sequence.
Theorem 7.3 (on uniqueness of a limit). A sequence {an) can not have
two distinct limits.
◄ Let A be a limit of {an) and let B ~ A. To prove that B is not a limit
of {an) we take e so small that the e-neighbourhood of A and the e-
neighbourhood of B do not intersect. For the purpose it suffices to set
e=
IE-Al .
(Fig. 7.4).
3
A B
-
Fig. 7.4
+1
smaller than 1. The sequence {an}, where an = -n - -= 1 .
1 + - 1s the nth
n n
term, is bounded since 1 < an ~ 2 for all n.
Sometimes it is helpful to use a slightly different definition of a bound-
ed sequence, namely, a sequence {an } is called bounded if there exists a
number K > 0 such that
lanl ~ K Vn.
By using logical symbols we can write this definiton as
({an) is bounded)# 3K > 0 Vn lanl ~ K.
A definition of an unbounded sequence is easily obtained from the defi-
nition of a bounded sequence by interchanging the quantifiers and convert-
ing the inequality involved, i.e.,
( {an} is unbounded) # vK > 0 3n lanl > K.
Example. Prove that the sequence {2n } is unbounded.
◄ Evidently, for any K > 0 there exists n such that 2n > K, i.e.,
n > log2 K. Hence, the sequence {2n } is unbounded. ►
A sequence is said to diverge to oo, and we write lim an = oo, if, given
n-+oo
any, whatever large, M > 0 there exists a number N = N(M) such that
Ian I > M vn > N.
A sequence {an ) such that
VM > 0 3N vn > N an> M (an< -M)
is said to diverge to + oo ( - oo ). In this case we write lim an = + oo
( lim an = - oo). n-+oo
n-+oo
We shall say that a sequence which diverges to oo, + oo or - oo is infinite-
ly large. It has no finite limit satisfying the definition of a limit given above
since the symbols oo, + oo and - oo do not represent real numbers. In what
follows we shall mean a finite limit as a real number. It is also worth men-
tioning here that there exist unbounded sequences which are not infinitely
large. For example, the sequence ( n sin n21r Jis unbounded but does not
diverge to infinity.
Theorem 7.4. Every convergent sequence is bounded, i.e., there exist
numbers m and M such that for all n
m ~an~ M.
◄ Let lim an =A and let e > 0 be an arbitrary number. Then there exists
n-+oo
7 .2 Sequences of Numbers 243
N such that the open interval (A - e, A + e) contains all the terms an with
n > N and a1, a2, ... , aN are the only terms of the sequence which can
lie outside this interval (Fig. 7.5). Thus only a finite number of terms can
lie outside the interval and we can choose the smallest ii and the largest
~ of them. Now let m = min (a,_ A - e} be the smallest of the numbers
iJ. and A - e and let M = max (a, A + e } be the largest of the numbers
a and A + e. Then the closed interval [m, M] contains the terms a1, a2,
. . . , llN and the open interval (A - e, A + e). Since all the terms an with
n ~ N + 1 belong to (A - e, A + e) the closed interval [m, M] contains
all the terms of the sequence ( an }. Hence, (an } is a bounded sequence. ►
• -
Fig. 7.5
sequence (*) has a limit, say, equal to A. Then for any e > 0 say for e = 41 ,
there exists N such that
1
Ian - A I < 4 vn > N.
Since the terms of ( *) alternate between 0 and 1 there must hold
I0-Al=IAl< 41 and ll-Al< 41 Vn>N.
Whence, we have
1 1 1
1 = 1(1 - A)+ Al ~ 11 - Al + IAI < 4 + 4 = 2'
that is, 1 < ~, which is impossible. This means that our assumption on
convergence of (*) is false. Hence, the sequence (*) does not have a limit,
i.e., it diverges. ►
Operations on convergent sequences. We shall discuss a number of theo-
rems which define arithmetic operations on convergent sequences and
basic properties of limits of sequences.
16*
244 7. An Introduction to Analysis
Now let N be the largest of N1 and N2. Then for any n > N there hold
both (**) and (***). Hence, we have
l(an + bn) - (A + B)I = l(an - A) + (bn - B)I
Thus
Ve > 0 3N Vn >N ~ l(an + bn) - (A + B)I < e.
By virtue of the definition of a limit we infer that A + B is the limit
of the sequence {an + bn } . ►
Theorem 7.5 can easily be generalized to any finite number of conver-
gent sequences.
Analogously we can prove
Theorem 7.6. If {an} and { bn} are convergent sequences so is { an - bn}
and
lim (an - bn) = lim On - lim bn,
n--♦ oo
lim On
l• lln n->oo
n1:!1! On = lim bn ·
n->oo
7.2 Sequences of Numbers 245
On=(l+~)" (•)
be the nth term of sequence {an}. Then substituting 1, 2 and 3 for n in
(*), we get
a1 = 2, a2 = (1 l)2 2 4'
+ 2 = + !_ 03 ( 1 1) 2 1
3
= +3 1
= + 3 + 27 .
Thus a1 < a2 < a3.
Using the binomial theorem*), we can easily show that the sequence
( (1 + ~)"] is a bounded and monotonically increasing sequence and
e = lim
n-+oo
(1 + l)n.
n
The number e is an irrational number and it can only be approximated:
e = 2.7183 .... Sometimes, in working with complicated expressions, it is
convenient to use the number e as the base of logarithms. The logarithm
of the number x > 0 to the base e is called the natural logarithm and denot-
ed by lnx.
bn = ( I + !) an = ( I + ! r +
1
= ( n: l r +
1
•
bn1 = ( n + 1 )n + 1/ ( n + 2 )n + 2 = (n + 1)2n + 3
bn + 1 n n + 1 nn + 1(n + 2t + 2
(n + 1)2n+4. n
---------------~
(n + l){ [(n + 1) - l][(n + 1) + l]} n + 2
[l 1 ]n+2
+ (n + 1)2 - 1 '
we obtain
i.e., bn ~ bn + 1 •
Thus, the sequence {bn } is nonincreasing and bounded below. This
means that ( bn } has a limit and so does the sequence ( On } and
.
11m . bn
On = 1Im 1 = lim bn .
n--+oo n--+oo 1 + _ n--+oo
Let X be a set of real numbers x and let there be given a certain law
or rule which assigns a real number y to every number x in X. Then we
say that there is a function defined on X and write
y = f(x) or y = y(x), x EX*>.
The set x•is called the domain of a function and the set Y of values y
specified by the function is called the range of a function. The domain
is sometimes denoted by D(f) and the range by E(f) provided that the
range of a function contains values y for every x in X.
Sometimes a function is denoted by writing f instead of f(x).
A function is fully determined if there are given (i) its domain of defini-
tion X and (ii) a rule which associates every x in X with a certain value
Y = f(x).
The functions f and g are said to be equal if D(/) = D(g) and the identi-
ty f(x) = g(x) remains true for all x in D(/) = D(g). For example, the func-
tions y = x 2 , - oo < x < + oo, and y = x 2 , 0 ~ x ~ 1, are not equal since
their domains are distinct; these functions are equal on the closed interval
[O, 1].
Examples of functions. (a) A sequence {an} is a function of an integral
variable whose domain is the set of natural numbers, such that f(n) = an
(n = 1, 2, ... ).
(b) y = n! is defined on the set of natural numbers. The relation n!
(read n factorial) is equal to the product of all integers from 1 through
n, 1.e.,
n! = 1 x 2 x 3 x ... x n
and by convention O! = 1.
Sometimes the symbol !! is used. The relation (2n) !! is assumed to be
equal to the product of all even integers from 2 through 2n. For example,
8! ! = 2 x 4 x 6 x 8. The relation (2n - 1)! ! is equal to the product of all
odd integers from 1 through 2n - 1, e.g., 7 ! ! = 1 x 3 x 5 x 7.
1 for x >0
(c) y = sgn x = [ 0 for x = 0,
-1 for x <0
is defined at each point of the number line - oo < x < + oo. The domain
*> This notion describes a numeric function and can easily be generalized to the case
of arbitrary sets. Let M and N be two arbitrary sets. We say that there is a function f which
is defined on M and assumes values in N if every element in M is associated with one and
only one element in N. Thus the function f maps the set M on the set N and is sometimes
called the mapping. Notice that the extended notion of a function is fully applicable to the
case of sets of numbers.
'F u=n=ct=io=n=s-=o.::....fO=ne=-V.~a=ri=ab=l=-e.....,an....,,d"--'L"""'i"""'m.....,it.,,__s_______________ _ ___.,,,_242.
_ ____;7_:.;;•3'-....
of y = sgn x is the set containing only numbers -1, 0 and 1 (Fig. 7.6).
The abbreviation sgn x means sign um function.
(d) y = [x], where xis a real number and [x] is the largest integer not
exceeding x, i.e., [x] = n for n ~ x < n + 1, n = 0, ±1, ±2, .... This
function is defined at every point of the number line and its range is the
set of all integers (Fig. 7. 7).
0 X
_ _ _ _ _ _ __,.1
Fig. 7.6
lj
4 - -+---
3 ------
-- - - ---•----
2 ______ _._ _ _
_
1 --
• •
-2 -1
0 2 3 4 5 r
• . -1
• . -2
Fig. 7.7
01 (xoJ)
I
I
I
I f(x 0 }
I
I
I
I
1 2 X 0 Xo X
0 for x < 0,
x for O ~ x ~ 1,
f(x) =
2 - x for 1 < x ~ 2,
0 for x > 2.
(b) Graphic representation. The graph of a function y = f(x) is a locus
of points (x, f(x)) on the xy-plane whose abscissas are the values of x and
-----'-7""""'.3-"'F-"'u=nc=t~io=n"-so"""f-'O'--n---e_V.-"-ar-'ia___b___ m_it_s________ __ ___ _ _____ .25 I
le_a_nd_L1_·
ordinates are the values of y (Fig. 7 .9). We say that a function admits a
graphic representation when its graph is specified.
Notice that not all functions can be given by graphs. For example, the
Dirichlet function
D(x) = [1 for ~atio_nal x,
0 for 1rrat1onal x
does not admit a graphic representation. The domain of this function is
the number line and the range comprises two numbers O and 1.
(c) Tabular representation. A function is said to be specified in a tabular
form if it is represented by means of a table which contains numerical
values of the variable x and the corresponding values of y.
Limit of a function at a point. A limit is a basic concept of mathemati-
cal analysis.
Cauchy criterion for limits. Let f(x) be a function defined in a neigh-
bourhood n of a point .xo, except probably at .xo. Then a number A is a
limit of f(x) at Xo if, given any (whatever small) number e > 0, there exists
a number o > 0 such that
lf(x)-Al<e,
whenever Ix - Xo I < o and x ~ .xo.
To signify that the number A is a limit of f(x) at the point Xo we write
lim f(x) = A.
X--,. Xo
M1M for x < Xo and to the ordinates of the curve MM2 for x > Xo
(Fig. 7.10). And let the value /(Xo) be equal to the ordinate of the point
N. In other words, we assume that the graph of f(x) is obtained from the
"good" curve M1MM2 by replacing N for M.
Let us show that at .xo the function f(x) has a limit equal to the ordinate
A of M.
I)
Mz
•N
A+c
A-e
0 I
Fig. 7.10
◄ We choose any (whatever small) e > 0 and fix the points with the or-
dinates A, A - e, A + eon the y-axis. Let P and Q be the points of inter-
section of the graph of y = f(x) with the lines y = A - e and y = A + e
and let Xo - h1 and Xo + h2, where h1 > 0 and h2 > 0, be the abscissas of
P and Q,.respectively. We easily see for any x ~ Xo in the interval (.xo - h1,
Xo + h2) the value of f(x) lies between A - e and A + e, i.e., for all x ~ Xo
we have
A - e < f(x) < A + e
whenever Xo - h1 < x < Xo + h2.
Let o = min ( h1, h2} be the smallest of h1 and h2. Then the interval
(Xo - o, Xo + o) is contained in the interval (Xo - h1, .xo + h2). Hence, the
inequality A - e < f(x) < A + e or, equivalently,
lf(x) - Al < e
remains true for all x ~ Xo belonging to the interval (Xo - o, Xo + o), i.e.,
for all x satisfying the condition
o < Ix - .xol < o.
Whence it follows that lim f(x) = A. ►
x-+xo
7 .3 Functions of One Variable and Limits 253
f(x) = [x 2 for x ~ 0,
1 for x = 0.
◄ The function g(x) = x 2 , - oo < x < + oo, is equal to f(x) everywhere,
except at x = 0, and its limit at x = 0 is equal to zero, i.e., lim g(x) = 0
x---+O
(show this!). Hence, lim f(x) = 0. ►
x---+O
Problems. Write down the Cauchy criterion for the following limits
(1) lim f(x) = 5, (2) lim f(x) = 0,
x---+1 x---+O
(3) lim f(x) = 1, (4) lim f(x) = -2.
x-+ -2 x-+ -3
{f(xn)} which has no limit or two sequences {f(xn)} and {f(x;)} having
distinct limits.
◄ Consider the function f(x) = sin_!_ defined everywhere, except at the
X
point x = 0 (Fig. 7.12).
lj y= sin
r1
0 I
0 -1
We have
1
Xn = -1- ~ 0, x; =----
n1r n-+oo 7r 2 n-+oo
and 2 + n1r
f(xn) = sin n 1r = 0,
f(x~) = sin(; + 2n1r) = 1.
Hence we have found two sequences {Xn J and {x; } converging to x = 0,
for which the corresponding sequences of the values of f(x) converge
to different limits, viz., {f(xn)} converges to zero while {f(x;)} converges
to 1. By the sequential criterion this means that f(x) = sin_!_ has no limit
X
at X = 0. ►
Remark. It can be proved that both criteria for limits at a point are
equivalent.
Limit theorems. We shall derive some basic results which are of impor-
tance in examining limits of functions of one variable.
Theorem 7.10. Let J(x) have a limit at a point Xo. Then this limit is
unique.
◄ Let lim f(x) = A. We shall show that there exists no number B -;t. A
x-+xo
which is a limit of f(x) at Xo,
The statement lim f(x) -;t. B can be written as
x ➔ xo
Let e = IB ; AI > 0. Since lim f(x) = A, then for the chosen e > 0
x->xo
there exists o> 0 such that
lf(x) - A I < e vx, x ~ Xo, Ix - .xol < o.
Applying (*), we reduce the above statement to the following
lf(x) - Bl ~ e vx, x ~ Xo, Ix - .xol < o.
Hence, we have found e > 0 such that for any (whatever small) o > 0
there exists a point x ~ Xo for which O < Ix - .xol < o and lf(x) - Bl ~ e.
Whence it follows that B ~ lim f(x). ►
x->xo
A function f(x) is said to be bounded in a neighbourhood of a point
Xo if there exist numbers M > 0 and o > 0 such that
Notice that in general from the strict inequality f(x) < cp(x) it follows that
lim f(x) ~ lim cp(x).
x-+xo x->xo
lj y
Tf(x 0 )
I y='f(x)
I y='P{x)
I
I
I
I
~yaf(x)
I
I
0 X 0 X
Theorem 7.13. Let cp(x) ~ f(x) ~ t/;(x) be true for all x in a neighbour-
hood of Xo, except probably at Xo, and let cp(x) and tf;(x) have the same
limit A at Xo, Then f(x) also has the limit A at Xo (Fig. 7.14).
Limit of a function when the variable tends to infinity. Let f(x) be de-
fined at every point of the number line or at all x whenever !xi > K for
some K > 0 so that we can compute the values of f(x) for whatever large
magnitudes of x.
7.3 Functions of One Variable and Limits 257
provided that for any e > 0 there exists a number N > 0 such that
lf(x) - A I < e,
whenever lxl > N.
Substituting x > N and x < - N for lxl > N, we have A = lim f(x)
X-,. +oo
and A = lim f(x), respectively. Whence it follows that
X-,. -oo
A = lim f(x)
X-+ <X>
lj
re
A ~-----~-44-WM¼~~tBi:~~:;:;i~
/"
A-c
,/
-------~..L.L....L.L....<....L...L...L.L...."--'--L...L._L_.L....L....,<.....<...L...L._L_,L..L...:....L...<...-
0 N
Fig. 7.15
17-9505
258 7. An Introduction to Analysis
1 - 0 <e
x2 +1
to hold there must hold ~-1- - < e. Whence x 2 + 1 > _!_ and
lxl > j! I.
x2 + 1
Thus, if we choose N= F e
then
1
~---0 < e whenever !xi > N, i.e., A = 0 is the limit of the given
x2 +1
function as x-+ co.
Notice that the inequality_!_ - 1 ~ 0 is true only for e ~ 1. For e > 1
e
1
it immediately follows that 2 < e for all x E IR.
X +1
The graph of an even function y = 2
1 asymptotically tends to the
X +1
line y = 0 as x -+ ± co. ►
Problems. We leave it to the reader to interpret the following statements
in terms of inequalities involved in the definition of a limit
(1) lim f(x) = - 3; (2) lim f(x) = 1; (3) lim f(x) = 0.
x---+oo x---+ + oo x---+ -oo
0 X
Fig. 7.16
The relation x sin _!_ can be considered as the product of the functions
X
lj
lj=:X
:x
IJ=-x
Fig. 7.17
lf(x)I > 1
1 1
vx, x ~ Xo, Ix - .xol < o.
262 7. An Introduction to Analysis
1 1 2
f(x) - lf<x>I < 7AT·
Hence the function f(~) is bounded in a deleted neighbourhood of
Xo. ►
Theorem 7.16. If a(x) is an infinitesimal as x ➔ Xo and if f(x) has a
nonzero limit at Xo then the quotient ;g; is an infinitesimal as x-> Xo-
◄ Express a(x) as
f(x)
a(x) 1
f(x) = a(x) f(x)
lf(x)I > M
whenever Ix - Xo I < o and x ~ Xo- In this case we write
lim f(x) = oo.
x-+xo
We say that f(x) has an infinite limit as x ➔ Xo- We also say that a limit
of f(x) is equal to A with the understanding that A is any (whatever large)
finite number.
7.4 Infinitesimals and Infinities 263
respectively. 1
For example, the function /(x) =- defined at every x -:;e 0 is an infinity
X
as x-+ 0 (Fig. 7.18). The function f(x) =-\- defined at every x -:;e 0 is an
X
infinity tending to + 00 as x-+ 0 (Fig. 7.19).
y lj
lj =l
x2
X 0
Fig. 7.19
Fig. 7.18
0 IXo .ra+B .r
I
I
I
I
-M
Fig. 7.20
if, given any (whatever large) number M > 0, there exists a number N > 0
such that
lf(x)I > M
whenever lxl > N.
7.4 Infinitesimals and Infinities 265
lf(x)I > M = !
e
whenever Ix - .xol < o and x ;e Xo, in which case there will be defined a
function a(x) = /(~) and
( ) Qox°' + a1x" - 1 + . , , + Om
y X = 1 (ao ;e 0, bo ;e 0),
box" + b1x" - + . . . + bn
266 7. An Introduction to Analysis
oo, m > n,
ao? + a1?- 1 + . . . + Om ao
lim , m = n,
x-+oo boxn + b1xn - l + . . . + bn bo
0, m < n.
lim f(x) = A.
x-+xo
We put
a(x) = /(x) - A
or, equivalently,
f(x) = A + a(x)
and prove that a(x) is an infinitesimal as x-+ Xo.
Let e > 0 be an arbitrary number. Since lim f(x) = A, then for the
x-+xo
given e > 0 there exists o> 0 such that
lf(x) - Al< e
whenever Ix - Xo I< ~ and x ~ Xo,
7.5 Operations on Limits 267
Whence
f(x)cp(x) = [A + a(x)] [B + {J(x)] =AB+ Ba(x) + At3(x) + a(x){J(x).
By Theorem 7.15 B a(x), A {J(x) and a(x){J(x) are infinitesimals as
x -+ .xo. Then their sum is also an infinitesimal as x -+ Xo, Thus we have
represented the function f(x)cp(x) as the sum of the constant quantity AB
and an infinitesimal as x-+ .xo. By Theorem 7.19 we conclude that the func-
tion f(x)cp(x) has a limit equal to AB at Xo, i.e.,
lim [f(x) · cp(x)] = AB = lim f(x) · lim cp(x). ►
x-+xo x-+xo x-+xo
x2 - 1
(2) Compute lim 1 .
x-+ I X -
Observing that the definition of a limit at a point does not involve this
point and the value of the function at this point, we can exclude the point
x = 1 from consideration. This means that we can reduce the fraction by
x - 1 (assuming that x - 1 ~ 0). Then we have
x2 - 1
= X + 1, X ~ 1.
x- 1
Whence
lim x
2
- 1
1 = lim (x + 1) = 2. ►
x-+l X - x-+1
. ✓ 1 + x2 - 1
(3) Compute hm 2 •
x-+O X
1 1 ►
lim (✓ 1 + x 2 + 1) - 2 ·
x-+O
(4) Consider the function f(x) = sin x 2 which is an even function, de-
fined at every point of the number line and bounded everywhere so that
lsinx2 1 ~ 1, vx. This function vanishes at x = ± ~ , where n = 0,
1, 2, ....
◄ Consider two consecutive points at which f (x) vanishes. Let these
points be ~ and ✓ (n + l)1r and the distance between them be
✓(n + l)1r - ~ -
Now we compute lim (✓ (n + l)1r - ~ ) . Multiplying and dividing
n-+oo
(✓ (n + l)1r - ✓-,,;-) by (✓ (n + l)1r + ~), we obtain
270 7. An Introduction to Analysis
n ->oo
- lim 1r =0
n-oo ✓(n + l)1r + ✓n1r '
since the nominator of the above fraction is a constant quantity and the
denominator increases infinitely as n ~ oo. Thus the distance between two
consecutive points at which f(x) vanishes tends to zero as n ~ oo. Hence,
the function J{x) = sin x 2 is nonperiodic. ►
One-sided limits. Let f(x) be defined in an interval (a, .xo). Then a num-
ber A is said to be a left-hand limit of f(x) at Xo if, given any e > 0, there
exists 8 > 0 such that
lf(x) - Al < e,
whenever Xo - 8 < x < .xo.
In symbols, we write
lf(x)-Al<e
whenever Xo < x < X-O + 8.
In symbols, we write
A = lim f(x) or A = f(Xo + 0).
X->Xo- 0
◄ Let lim f(x) = A. Then given any c: > 0 there exists 8 > 0 such that
X--+ Xo
lf(x) - Al< c:
holds for all x (x ~ xo) in (xo - 8, Xo + 8).
Evidently, ( *) holds for all x in (xo - 8, xo) and all x in (xo, xo + 8)
so that by definition we have
A = lim f(x) and A = lim f(x).
x--+ Xo - 0 X--+ Xo +0
Conversely, let A = lim f(x) and A = lim f(x). Then given any
X--+ Xo - 0 X--+ Xo +0
c > 0 there exist 81 > 0 and 02 > 0 such that
lf(x) - Al< c:
whenever Xo - 81 < x < Xo and Xo < x < Xo + 02.
lj
0 r 0 X
= lflx11
for x ";t. 0, h
Examples. (1) Consider the function f(x) sown
for x = 0,
in Fig. 7.22.
We have
lim f(x) = lim f(x) = o~ limf(x) = 0.
x--+O-O x--+O+O x--+O
272 7. An Introduction to Analysis
We have
lim f(x) = 1 and lim f(x) =0 => 3lim/(x).
x---+0-0 x---+O+O . x---+O
lj
lj
0 I X
0
Fig. 7.25
becomes
lim f(Xo + AX) = f(Xo)
dX ➔ O
or, equivalently,
lim [f(Xo + AX) - f(Xo)] = 0.
.£1x ➔ O
Observe that f(Xo + AX) - f(Xo) = .6.y. Then (**) can be written as
lim .6.y = 0,
.t1x ➔ O
18-9505
274 7. An Introduction to Analysis
for all x
h
lf<x) - f(.xo)I < 2
or
- ~ < f(x) - f(.xo) < ~
whenever Ix - .xol < o. Whence
h h h
f(x) > f(Xo) - 2 = A + h - l = A + 2 ·
Thus, f(x) > A, vx E (Xo - o, Xo + o). ►
From the preceding theorem the following theorem can be derived.
Theorem 7.22. Let f(x) be continuous at Xo and let f(.xo) ~ 0. Then there
exists a neighbourhood (Xo - o, Xo + o) of Xo such that f(x) does not vanish
and is of the same sign at every point of the neighbourhood.
◄ To prove this theorem it suffices to put A = 0 in Theorem 7.21. ►
Continuity of elementary functions. Basic elementary functions are*)
(a) power function y = xa, where a is an arbitrary real number, x > 0;
(b) exponential function y = ax, a > 0, a ~ 1, - oo < x < + oo;
(c) logarithmic function y = loga x, a> 0, a ~ 1, x > 0,
(d) trigonometric functions y = sin x, - oo < x < + oo, y = cos x;
7r
- oo < x < + oo, y = tanx, x ~ 2 + n1r, n = 0, ±I, ±2, ... , y = cotx;
x ~ n 1r, n = 0, ± 1, ± 2,
... ;
(e) inverse trigonometric functions y = sin - 1 x, -1 ~ x ~ 1, y = "
cos- 1 x, -1 ~x ~ 1, y = tan- 1 x, -oo <x< +oo, y = cot- 1 x,
- oo < X < + oo.
We say that elementary functions are those obtained by means of a finite
number of arithmetic operations and a finite number of operations of con-
structing a function of a function. It is not hard to show that the basic
elementary functions are continuous at every point of their domains.
We prove that the function y = cos x is continuous at every point of
the domain - oo < x < + oo.
◄ First we prove the auxiliary inequality
length of the arc BC is equal to 2x. Since the arc length is greater than
the length of the subtending chord we have
2 sinx < 2x and sinx < x.
This inequality can be written as lsin xi < lxl for x E (0, 1r/2). The identities
Isin ( - x)I ::;:: I~ sin xi = lsin xi and I - xi = lxl imply that the inequality
lsinxl < lxl also holds for x E (- 1r/2, 0). Observe that(*) is also true for
all x E ( - 1r/2, 1r/2) since sin 0 = 0. If x~( - 1r/2, 1r/2) then lxl ~ 1r/2 > 1,
whereas Isin xi ~ 1 vx. Thus we conclude that the inequality
lsin xi ~ lxl
holds for any x.
Now we turn to the function y = cos x which is defined at each point
of the number line. Let x be an arbitrary point (x E IR) and ~ be an incre-
ment of x at x. Then the increment of y = cos x can be expressed as
0
1
◄ Suppose that O < x < 1r/2 and consider a circle of unit radius
(Fig. 7.27). From Fig. 7 .27 it is easily seen that
area of ~OAB < area of sector OAB < area of ~OAC.
These areas are equal to ~ sin x, ~ x and -J tan x, respectively. Then we have
which holds for x E (0, 1r /2) and also for x E ( - 1r /2, 0) since
sin ( - x) sin x
- - - - = - - and cos(-x) = cosx.
-X X
Thus both <,0(x) = cos x and l/;(x) = 1 have the limits equal to 1 at x = 0.
By Theorem 7.13 (•••)and(****) yield the identity we are seeking for. ►
278 7. An Introduction to Analysis
lim
n--oo
(1 + l)n
n
= e.
lim (1 + z) 11 z = e,
z--o
where z = 1, 1/2, 1/3, ....
We can verify that there exists lim (1 + x) 11x which is also equal to the
number e when x tends to 0 in an arbitrary way by running through any
sequence of real values distinct from 0, i.e.,
lim (1 + x) 11x = e.
Operations on functions continuous at a point. In this section we shall
develop a number of important properties of continuous functions.
Theorem 7.24. Let f(x) and cp(x) be defined at Xo and in a neighbour-
hood of XQ. If f(x) and <P(X) are continuous at Xo so are the sum f(x) + <P(X),
the difference f(x) - <P(X), the product f(x)cp(x) and, if cp(x) ~ 0, the quo-
tient f(x) .
<P(X)
By way of illustration we shall prove the quotient rule; all the other
rules can be proved in a similar way.
◄ Let f(x) and cp(x) be continuous at Xo and let cp(Xo) ~ 0. By Theo-
rem 7 .22 there exists a neighbourhood of Xo such that <P(X) ~ 0 for all x in this
neighbourhood. Thus, the function F(x) = ~~~ is defined at Xo and in
a neighbourhood of Xo- Since lim f(x) = f(Xo) and lim cp(x) = <P(Xo) ~ 0
x--> Xo X--> Xo
the quotient rule (see Theorem 7.20) is applicable here and we have
lim f(x)
lim F(x) = lim f(x) x--> Xo
/(Xo) = F(Xo).
x-->x 0 x ➔ xo
cp(X) lim cp(x) <p(Xo)
x--> Xo
at .xo. ►
Composite functions. Let E be an arbitrary subset of a number line
and let u = <p(X) be a function defined on E. We denote by E1 a set of
values of u corresponding to the values of x in E. Furthermore, let there
be defined a function y = f(u) on E1. Then every x EE corresponds to a
7 .6 Continuous Functions. Continuity at a Point 279
l1
X E X
Fig. 7.28
such that lcp(x) - A I < r, or, equivalently, lu - A I < 'Y/ for all x ~ X-0
whenever Ix - xol < o.
The inequality Iu - A I < T/ implies (*) which can be written as
lf[cp(x)] - J(A)I < e.
Thus, for any e > 0 there exists o> 0 such that
lflcp(x)] - J(A)I <e
whenever Ix - Xo I < o and x ~ Xo-
280 7. An Introduction to Analysis
This means that the number f(A) is a limit of the composite tuncuon
f ['P(X)] at Xo. ►
We see that when the conditions of Theorem 7.25 are fulfilled we have
lim f['P(x)] = f(A)
X ➔ Xo
or, equivalently,
limf[cp(x)] =f[lim cp(x)].
X ➔ Xo x---+xo
I
: limf(rJ
I X-+Xo
I
0 X 0 Xo X
Example. Let
f ~1 1 for x '#- 0,
f(x) = l for x = 0.
◄ We have
lim f(x) = lim f(x) = 0 ~ 1 = /(0),
X--+Xo-0 x--+xo+O
*> If f(x) is not defined at Xo the point Xo is also called the point of discontinuity.
282 7. An Introduction to Analysis
then the point Xo is a discontinuity at which f(x) has a jump (or a gap)
and f(.xo) does not need to be equal to either of the one-sided limits. In
this case the jump of f(x) at Xo is equal to the difference
f(Xo + 0) - f(Xo - 0) of the left- and right-hand limits of f(x) at .xo.
y
0 X
Fig. 7.31
kind. Thus, if X<l is a discontinuity of the second kind the function f(x)
at X<l does not have either a finite left-hand limit or a finite right-hand limit.
Examples. (1) If f(x) = -~ , x -;,±. 0, f(0) = 0, the point x = 0 is a discon-
x
tinuity of the second kind, for lim f(x) = - oo and lim f(x) = + oo.
x-o-o x-o+o
(2) Let f(x) = sin_!_ , x -;,±. 0, f(0) = 0.
X
◄ At x = 0 f(x) has neither finite nor infinite one-sided limits. We can
easily prove this by applying the sequential criterion for limits. Thus the
point x = 0 is a discontinuity of the second kind. ►
(3) For the Dirichlet function
of these intervals denoted by [a1, bi] is such that f(x) assumes the values·
. signs
o f opposite . . en d points.
at its . Clear1y, t h e point t = a1 + b 1 b ecomes
. <;;1
2
the midpoint of [a1, bi]. If /(~1) = 0 the theorem is proved. However it
may occur that f( ~1) ~ 0. Then we choose between the closed intervals
[a1, 6] and [~1, bi] the one at whose endpoints f(x) assumes the values
of opposite signs, denote this closed interval by [a2, bi], and repeat the
process as above.
It is clear that if we continue this process we finally either arrive at
a point a E (a, b) wheref(a) = 0, thus completing the proof of the theorem,
or we obtain an infinite sequence of the nested closed intervals
[a, b] :J [a1, bi] :J ... :J [an, bn] :J ....
All the closed intervals tend to zero as n ~ oo, i.e.,
lim (bn - an) = lim b -: a = O
x ➔ oo n ➔ oo 2
and on every closed interval f(x) assumes the values of opposite signs at
the endpoints.
Recall that the Cantor lemma implies that there exists a unique point
a contained in each [an, bn], n = 0, 1, 2, .... Let us prove that f(a) = 0.
We assume the converse, i.e., f(a) ~ 0, and for definiteness we set
f(a) > 0. Since f(x) is continuous at a E [a, b]. Theorem 7.22 implies that
there exists an open interval (a - o, a + o) in which f(x) is positive. As
a = lim an = lim bn we can choose sufficiently large n so that the closed
n ➔ oo n ➔ oo
interval [an, bn] is contained in the open interval (a - o, a + o). This means
that f(an) and f(bn) are of the same sign (both positive). However on every
closed interval [an, bn] f(an) and f(bn) are of opposite signs. Thus we have
a contradiction which makes our assumption of f(a) ~ 0 false. Hence,
J(a) = 0. Notice that a < a < b, a E [a, b] and the point a must be distinct
from both a and b since f(a) ~ 0 and f(b) ~ 0. ►
It is easy to interpret this theorem geometrically (Fig. 7.32) .. If
f(a)J(b) < 0 then the points A(a, f(a)) and B(b, f(b)) occur on different
semiplanes relative to the x-axis, and the graph of the continuous function
f(x) intersects the x-axis at least at one point between a and b.
It is important that f(x) must be continuous of [a, b]. If the function
is discontinuous at a point in [a, b] it may have a jump from a negative
value to a positive one without assuming a zero value as does the function
-1 for -1 ~ X < 0,
f(x) = [ 1
for 0 ~ X ~ 1,
shown in Fig. 7.33 ..
7.7 Continuity on a Closed Interval 285
y y
f{b)
0 a
I
I f(a)
L) -- I -1,
I
0 .T
I I
I
We can apply Theorem 7.27 when we wish to find out whether a poly-
nomial has a real root and, if any, to compute an approximate value of
the root.
Example. Consider the polynomial P3(x) = x 3 + x - 1.
◄ Since P3(x) is of odd degree it has at least one real root.
Evidently, we have P3(0) = -1 < 0 and P3(1) = 1 > 0, i.e., at the end-
points of [O, 1] P3(X) assumes the values of opposite signs. Hence P3(X)
has a real root contained in (0, 1).
At the midpoint ~1 = 1/2 of the closed interval [O, 1] we have
P3(1/2) = -3/8 < 0. Recall that P 3(1) > 0. This means that the desired
root is contained in the open interval (1/2, 1). Again we compute the value
of P3(X) at the midpoint of [1/2, l], i.e., at b = 3/4. We obtain P3(1/2) < 0
and P3(3/4) = 11/64 > 0. Hence the root of P3(X) is contained
in (1/2, 3/4). Working through this process we obtain a sequence of the
nested open intervals of decreasing lengths. In other words, at each step
of this process we improve the approximation of the root whose precision
is given by the length of the last open interval thus obtained. ►
286 7. An Introduction to Analysis
lf(x)I ~ K.
and
f(~) ~ f(x) ~ f(~).
It is important here that f(x) is continuous on [a, b]. For instance,
f(x) =x is continuous and bounded on ( -1, 1) but does not attain its
7.7 Continuity on a Closed Interval 287
f(xo) = 1. Analogously, f(x) does not attain its infimum inf x = -1.
XE( - 1,1)
lj
0 X
Fig. 7.34
It is important here that for every e > 0 there exists o > 0 such that
lf(x') - f(x" )I < e for all x ', x" belonging to (a, b) only under the condi-
tion Ix' - x" I < o. For example, the function f(x) = x is uniformly con-
tinuous on the number line. To verify this it suffices to put o = e.
It is easy to see that if f(x) is a uniformly continuous function on
(a, b), it is continuous at every point x E (a, b). The converse is not true.
For instance, f(x) = sin is continuous on (0, 1) and is not uniformly
1r
X 1 2
continuous on (0, 1). Indeed, let x; = - and x;' = 2 . Then we can
n n+ 1
choose n so that the value of
1 2 1
n 2n + 1 n(2n + 1)
becomes smaller than any o> 0 whereas
lf(x;) - f(x;' )I = 1 > e ve < 1.
Thus for e > 0, say for e = ; , and for any o > 0 there exist points
x; and x;' in (0, 1) such that Ix; - x;' I < oand lf(x;) - f(x;' )I > e. Hence
f(x) = sin 1r is not uniformly continuous on (0, 1).
X
ll·m a(x) =0
x-+xo {3(x) .
7.8 Comparison of Infinitesimals 289
For example, a(x) = 2x and {l(x) = x, x---+ 0 are infinitesimals -of the
same order since
(c) a(x) and {3(x) are incomparable infinitesimals as x---+ Xo if the ratio
a(x)
~--- - has neither finite nor infinite limit as x---+ .xo. For example,
{3(x)
a(x)
a(x) = x sin 1 and {3(x) = x are incomparable since the ratio {3(x) -
X
. 3 sin 2 x
h m -- --- -- = 3.
x-+O X2
19-9505
290 7. An Introduction to Analysis
We write
a(x) ~ /j(x), x ➔ Xo
sin - t x .
lim---=l~s1n- 1 x~x ' x ➔ O '
x--+O X
tan- 1 x -1
lim - - - =1~ tan x ~ x, x ➔ 0.
x--+O X
a1(x)
◄ Consider and express it as
/31 (X)
a1(x) a1(X) a(x) {3(x}
-
/31 (x) a(x) {3(x) /31 (x) ·
By the. hypothesis of the theorem
. a1(x) _ {3(x)
h m - - - 1 and lim = 1.
x--+xo a(x) x--+xo /31(X}
.
1Im a 1(x) . a1 (x) . a(x) . {3(x) A
-- = 11m - - · 1Im - - · 1Im - - = .
X--+Xo /31(X) X--+Xo a(X} X--+Xo {3(X) X--+Xo /31(X)
Whence we infer that the ratio ;: i;~ also has a limit equal to A at Xo-
Notice that the symbolic relations we have introduced are not equalities
in the ordinary sense. For instance, the "equality" sin x = 0(1),
- oo < x < + oo, does not imply that 0(1) = sin x.
The following rules for the symbol "o" can easily be verified:
(a) o(f(x)) + o(f(x)) = o(f(x)), x ➔ Xo, x E o;
(b) o(f(x)). O(<P(X)) = o(f(x). <P(X)), X ➔ Xo, XE n;
(c) o(o(f(x))) = o(f(x)), x ➔ .xo, x E 0.
◄ Indeed, let g1(x) = o(f(x)) and g2(x) = o(f(x)). Then (a) becomes
g1 (x) + g2(x) = o(f(x)), x ➔ Xo since
.
IIm g1 (x) + g2(x) . g1 (x) . g(x) ►
---,----- = 1Im - - + 1Im - - = 0
X---+Xo f(X) X---+Xo f(X) X---+Xo f(X) •
It is also easy to verify the following rules:
(d) o(f(x)) + 0(f(x)) = 0(f(x)), x ➔ .xo;
(e) o(f(x)) · 0(<P(x)) = o(f(x) · <P(x)), x ➔ .xo;
(f) 0(f(x)) • 0(<P(X)) = 0(f(x) • <P(x)), x ➔ .xo;
lim f(x) = 1.
x--+xo <P(X)
where x and y are arbitrary real numbers and i is the imaginary unity such
that i2 = -1.
The numbers x and y are called the real and imaginary parts of the
complex number z = x + iy, respectively. We denote the real and imaginary
parts of z by
x = Re z and y = Im z.
The complex number x + iO is thought of as identical to the real num-
ber x.
The complex numbers z1 = x1 + iy1 and z2 = x2 + iy2 are said to be
equal (z1 = z2) if and only if X1 = x2 and Y1 = Y2,
Operations on complex numbers. (a) Addition of complex numbers.
The sum of z1 = x1 + iy1 and z2 = X2 + iy2 is the complex number
Z= Zt + Z2 = (X1 + X2) + i(y1 + Y2).
The operation of addition of complex numbers obeys
(1) the commutative law
Zt + Z2 = Z2 + Zt
and
(2) the associative law
(z1 + z2) + z3 = z1 + (z2 + z3).
(b) Subtraction of complex numbers. For any complex numbers z1 and
z2 there exists the number z such that z1 = z + z2. Then the number z is
called the difference of z1 and z2 and is denoted by
Z= Zt - Z2 = (Xt - X2) + i(y1 - Y2).
Z1 = Z2Z.
The complex number z is called the quotient of z1 and z2 and is denoted
by z = ~- We shall derive the quotient formula for z1 and z2.
Z2
◄ Let zi = Xi + iyi, z2 = x2 + iy2 and z = x + iy. Then from (*) it fol-
lows that
xi = x2x - Y2Y and Yi = X2Y + xy2.
This system of equations is solvable with respect to x and y provided that
z2 -;t: 0. We have
. XiX2 + YiY2 X2Y1 - XiY2 ►
z = X + 1y = 2 2 +i 2 2 •
X2 + Y2 X2 + Y
The complex number
Z =X - IY
is called the complex conjugate of the complex number z = x + iy and
obeys the following rules:
Zi + Z2 = Z i + Z2 , Z i Z2 = Zi Z2 ,
CJ ~> : = -:-:-;:-,
=
zz = lzl 2 = x2 + y 2·
y
0 y
0
M
M
0 X
Fig. 7.35 Fig. 7.36
◄ We have
x = - sin ; < 0 and y = - cos ; < 0.
The principal value of the argument of z is
argz = -1r + tan- (cot;)
1 = -1r + tan- 1 [tan(; -;) ]
- -1r + tan (tan 38.,,) =
-1 -1r + 38., = - 5; .
Hence
Argz = - 85 1r + 2k1r, k = 0, ±1, ±2, ...
lzl = •27r
Sill - + COS 21r
- = 1. ►
8 8
0
z 1 +z 2
\
\
\
\
\
\
\
---- 'z 1
Fig. 7.37
◄ Indeed
z1z2 = r1ei 01 r2ei 02 = r 1(cos 01 + i sin 01)r2(cos 02 + i sin 02)
= r1 r2 [cos 01 cos 02 - sin 01 sin 02) + (sin 01 cos 02 + sin 02 cos 01)]
= r1r2(cos(01 + 02) + i sin (01 + 02)) = r 1r2 ei< 01 + 02).
Thus, multiplication of complex numbers involves multiplication of moduli
and addition of arguments, viz.,
and
arg (Z1Z2) = arg z1 + arg z2. ►
k = 0, 1, 2, ... , n - 1
7.9 Complex Numbers 301
or
n n 0 +27rk
; _ __
vz = vr e n , k = 0, 1, 2, ... , n - 1.
Whence
;ZE:6 tr .. tr {3 .I V3 + i
Wo =e = cos 6 + l sin 6 =2 +l2 2
. _
, S1r 5 7r .. 5 7r
W1 = e 6 = COS - - + l Sln - -
6 6
-ff .1 -V'S+ i
- -+ l- =
2 2 2
,_
• 371"
31r . . 31r
2 = COS 2 + l Sln 2 = -l
{Zn} if given any e > 0 there exists a number N = N(e) such that
lzn - zl < e
whenever n ~ N.
A sequence {Zn } is called convergent to a number z if z is the limit
of {Zn }. In symbols, we write
Z = lim Zn or Zn ~ Z (n ~ 00)
n -+oo
Exercises
1. Prove that the limit of [ ; 2] is equal to zero. Verify what n
satisfy the inequality 4n < e provided that (a) any e > 0, (b) e = 0.1 and
(c) e = 0.01.
2. Prove that the limit of [ n ] is equal to 1.
n +1
Exercises 303
~ 2n 3 +
n - 2 . n + 6 - n V 2
6. lim - - - - - - . 7. 11m ----;=====----;;-----;====-
n-+ oo n + I n-+ oo ✓ n 2 + n + I + n4 + 1 1/
8. lim
Vn 5 + 1 + 1/ n 2 + 2
5
n---->oo ~n4 + 3 + ✓ n3 + 5
(Hint. On evaluating a limit of a quotient of polynomials it is helpful to
divide the nominator and denominator by nP where p is the greatest expo-
nent in the denominator. This approach is also used when we evaluate limits
of fractions involving irrational functions.)
Compute the following limits
9. lim (
n---->oo
nl)\ _
n+. n.1
• 10. lim ( 21
n---->oo
+ 41 + . . . + 2 -¼-) .
11. lim ---4--o + 2 + ... + n). 12. limn ~osn! 13. lim (✓n + 2
n--->oo n n--->oo n + 1 n--->oo
x 2 -2x+1 x2 - 1 x 3 + 3x 2 + 2x
16. lim . 17. lim . 18. lim
x----> 1 X3 - X x---> - 1 x 2 + 3x + 2 x---> -2 x 2 -x-6
✓ 1 + x2 - 1 2 - ✓x - 3 V1 + x 2 - 1
19. lim 20. lim . 21. lim
X--->0 X X--->7 x 2 - 49 X---->0 x2
x3 + X l + x - 2x 3
22. lim . 23. lim 2 3•
x---->oo x 4 - 3x2 + l x---->oo 1 + X + 3x
24. lim sin 2x . 25. lim s'.n ;x (a, (3 are constants). 26. lim t~n;; .
x--->O X x--->O Sill X x--->O Sill
27. lim sin -1 3x . 28. lim tan -1 2x 29 lim s~n 3x . 30. lim 1 _- x2 •
x--->O X x--->O sin 3X • • x+ir Sill 2X x---> I Sin 7rX
. In (1 + x2)
I1m . lnx - 1
I1m In cosx
2 32. ----. 33. lim---x--
x--->O Sin 3X x--->e X - e x--->O x2
.
34. I1m ex2 - 1 35 1· In (1 - x) 36 11·m eax - ebx ( b
. 2 • • 1m sinx • • ---- a, are
x--->O Sin X x--->O e - 1 x-+O X
41. lim ( x 2 + 1
2
)x
2
•
X---->CO 2x + 1
42. Compute the modulus and the principal value of the argument of the
following:
(a) 4 + 3i, (b) -2 + 2v3i; (c) -7 - i;
(d) -cos; + isin-~; (e) 4- 3i.
43. Write down complex numbers in the trigonometric and exponential
forms:
(a) - 2; (b) 2i; (c) - i; (d) -V2 + i V2.
44. Compute the following:
(a) (2 - 2i)7; (b) (v3 - 3i)6; (c) ( !+ ~ r
45. Compute all the values of the roots:
(a) Yi; (b) Vl; (c) ~ i ; ( d) t' - 1 + i ; (e) ✓ 2 - 2v3i .
Answers
1 3
1. (a) n > -Ii; (b) n ~ 4; (c) n > 10. 3. I. 4. 0. 5. oo. 6. vi. 7. -1. 8. 0. 9. 0. 10. I.
11. 1/2. 12. 0. 13. 0. 14. - 2. 15. 1/2. 16. 0. 17. - 2. 18. -2/5. 19. 0. 20. -1/56. 21. 1/3.
22. 0. 23. -2/3. 24. 2. 25. a/{3. 26. 3/2. 27. 3. 28. 2/3. 29. -3/2 (put 1r - x = t). 30. 2/1r.
31. 1/9. 32. 1/e. 33. -1/2. 34. I. 35. -1. 36. a - b. 37. I. 38. 1/e. 39. e9• 40. e- 112 •
21r 1
41. 0. 42. (a) r = 5, 0 = tan- 1 3/4; (b) r = 4, 0 = ---; (c) r = 5../2, e = -1r + tan- 1 - ;
3 7
(d) r = l, 0 = -47r ., (e) r = 5, 0 = -tan _ I --3 . 43. (a) 2(cos 1r + i sin 1r),
·
2e' ... ;
5 4
(c) ± ( cos 1r . sm
. 7r) ( cos -31r- + 1. sm
. - 37!" ) ; V4 . 6 7r
8 - 1
8 , ± 8 8 (d) - 3
2
(1 + 1), v2 ( -cos-+
12
t = cp(.AX).
20-9505
306 8. Differential Calculus. Functions of One Variable
X
2 •
Sill -
1 for X ':;t=
0'
X
f(x) =
0 for x = 0.
(2) Let f(x) be a periodic function with period T. Prove that if f(x) has
a derivative then the derivative of f(x) is also a periodic function with peri-
od T.
(3) Prove that the derivative of an even function is also an even function
and the derivative of an odd function is an odd function provided that
these derivatives exist.
Geometric interpretation of the derivative. We consider the graph of
a function y = f(x) which is defined on (a, b) (Fig. 8.1) and choose the
points M(x, f(x)) and P(x + Ax, f(x + Ax)) on the curve y = f(x). We also
draw the line through the points M and P.
Suppose that the point P is moving along the curve y = f(x) towards
the point M (or, which is the same, Ax tends to zero) so that the line MP
is changing its position until MP coincides with the line MT. The line MT
which defines the limiting position of MP as Ll.x-+ 0 is called the tangent
to the curve y = f(x) at the point M. Notice that when the point P moves
to the point M the angle TMP tends to zer~.
From Fig. 8.1 it is easily seen that the slope k of the line MP is
k = tan a= t.
Let 'P ¢. 1r /2 be the angle between the tangent MT and the x-axis. Since
the slope of the tangent MT to the curve y = f(x) at M is the limit of the
slope of MP as the point P moves to the point M along the curve and,
consequently, as AX-+ 0, we get
tan 'P = lim tan a = lim Ay = lim f(x + Ax) - f(x)
P--+M .6.x-+O Ax ax--+O AX ·
8.1 Derivatives and Differentials 307
The latter limit, if it exists, is the derivative f' (x) and is equal to tan cp.
Therefore, the derivative f' (x) of the function y = f(x) is the slope of the
tangent to the curve y = f(x) at the point specified by the abscissa x.
0 I I +-..:1X
Fig. 8.1
1 (/' (xo) ~ 0)
f' (xo)
20*
308 8. Differential Calculus. Functions of One Variable
so that the equation of the normal to the curve y = f(x) at Mo(xo, Yo) is
X X
The ratio ~/ llt is called the mean velocity in the time interval llt. We
define the velocity v of the point at the moment t as the limit of the mean
velocity in the time interval ll.t as llt ➔ 0 and write
Hence the velocity v(t) is equal to the derivative of the distance s with
respect to time t, i.e., v(t) = s' (t).
Example. Let us consider the law of rectilinear motion s = t 2 where
the distance s is measured in metres and the time t is measured in seconds
and compute the v~locity at t = 3 s.
8.1 Derivatives and Differentials 309
t
<lx>O
respectively.
310 8. Differential Calculus. Functions of One Variable
This implies that the tangent to the curve y = f(x) at the point
(xc, f(xo)) is perpendicular to the x-axis. For example, if f(x) = Vi then
at x = 0 we have
.1y _ f(0 + L\x) - f(0) _ ~ _ I
L\x .1x - .1x - V<.1.x->2 .
Whence it is easy to see that .1y/.1x tends to + oo as .1x tends to zero in
an arbitrary manner. The tangent to the curve y = Vi at the point
0(0, 0) coincides with the y-axis (Fig. 8.4).
J
y=yx
Fig. 8.4
lj
M 0 (::,: 0 ,f(xoJ)
I
I
I
I
I
0 X
Fig. 8.5
0 X
Fig. 8.6
0 X 0 :r·
(1) (2)
y y
0 Io ..r 0 Io X
(3) {4)
Fig. 8.7
312 8. Differential Calculus. Functions of One Variable
~ = A + a (AX),
where A is independent of AX, 1.e., is constant at a given point x and
a(AX) -+ 0 as AX-+ 0.
Theorem 7.19 implies that
where a (AX) ~ 0 as AX ~ 0.
If y = f(x) is differentiable at x the linear part A AX of .1y, provided
that A ¢ 0, is called the differential of y = f(x) and is denoted by dy or
by df(x) so that
dy =A AX.
8.1 Derivatives and Differentials 315
When A -:;t; 0 we say that A Llx is the principal linear part of Liy since
a(Llx) Llx is an infinitesimal of higher order than A Llx as .1.x ~ 0. If A = 0
the differential is equal to zero.
By virtue of Theorem 8.1 we have A = f' (x). Then the differential dy
becomes
dy = f' (x) Llx.
Along with the notion of the differential of a function we can introduce
the notion of the differential of an independent variable x by putting
dx = AX. Then the differential of y = f(x) can be written as
dy = f' (x) dx.
Whence we have the Leibniz notation for the derivative: f' (x) = 1x .Thus
a derivative of f(x) can also be thought of as a quotient of two differentials
dy and dx.
We say that a function y = f(x) is differentiable on an open interval
(a, b) if f(x) is differentiable at every point of (a, b).
lj
X x+dx X
Fig. 8.8
Thus
(C)' = 0 and dC = 0. ►
t t t-= +
Since u(x) and v (x) are differentiable at x there exist the derivatives
u '(x) and u '(x) at x. Then each summand in ( *) has a limit as LU-+ 0
so that there exists a limit of the right-hand -side of ( *) equal to u '(x) +
u '(x). Hence, the left-hand side of (*) also has a limit as LU -+ 0. In other
words, there exists
lim Lly = y ' (x).
tix-+O ,1x
Since u(x) and v(x) are differentiable at x they are continuous at x so that
.t,.u and .t,.v tend to zero as AX-+ 0. Thus the right-hand side of ( *) has
a limit equal to v(x)u' (x) + u(x) v '(x) as AX-+ 0. Hence there exists a limit
of the left-hand side of ( * ), i.e., there exists lim .t,.y = y' (x). Evaluating
~x---oAX
the limit of ( *) as AX -+ 0, we obtain ·
y' (x) = (u(x) v(x))' = u' (x) v(x) + u(x) v '(x).
Whence
d(uv) = vdu + udv. ►
y , -_(u)'
- -_
V
u 'v - uv'
V
2 '
(v(x) ~ 0).
d (~) = v du : 2 u dv ,
Whence
'1u il.v
A vA..,.-uA..,.
~ = _L.l.,\.
___ L.I.,;\_
At" v2 + v .1v
Recall that u(x) and v(x) are differentiable at x. This implies that there
· t 1·1m il.u
ex1s A ..... = u
' (x ) an d 1·1m -il.v = v '(x) and il.v--+ 0 as At"--+ 0. Also,
ax - o L.I.,\. ax - odX
at a given point x the values of u and v are constant and, as stated, v (x) ~ 0.
I I
Hence there exists a limit on the left of ( *), i.e., there exists lim ~~ = y' (x) .
.1x --+ O L.lA
Evaluating the limit of ( *) as Lix" --+ 0, we obtain
y'(x) = (u(x))'
v(x)
= u'(x)v(x) - u(x)v'(x),
v 2 (x)
(v(x) ;t; 0). ►
ex - 1
Example. Find the derivative of y = ---
x2 +5
, _ (ex - I ) ' _ (ex - I)' (x 2 + 5) - (ex - l)(x 2 + 5)'
◄ y - x2 + 5 - (x 2 + 5) 2
that (a) f(x) + <i0(x), (b) f(x)<i0(x) and (c) ~~i) have no derivatives at Xo?
(Consider the following functions: (i) f(x) = Ix I, ip(x) = - Ix I, xo = 0;
(ii) f(x) = <,0(x) = Ix I, Xo = 0; (iii) f(x) = <,0(x) = Ix I + I, Xo = 0.)
Derivatives of some elementary functions. (I) Exponential Junction
y = ax (a > 0, a ;t; 1). This function is defined at every point of the number
line. Hence, for any x and any Lix" we have
ax ax
.1 .ix 1 .1x 1
lim ___.E = lim ax a - = ax lim a - = ax In a•
.1x --+ 0 ax ax --+ 0 LU" tJ.x ..... 0 LU"
Thus
320 8. Differential Calculus. Functions of One Variable
In particular, if a = e then
(ex), = ex.
(2) Logarithmic function y = ln x (x > 0). For any x > 0 and any Ax
such that x + AX > 0 we have
and
In
lim .dy = lim
AX
.::ix--+O t.x--+O
Thus
(In x)' - _!_
X
Using the identity
loga x = Io~ e In x (a > 0, a ~ 1)
we arrive at
lo~ e 1
(lo~ x) ' = lo~ e (In x)' - x - x In a
and, finally,
I
(lo~ x) ' - x In a ·
(3) Power function y = xa (a is an arbitrary real number). This function
is defined at least for all x > 0. Then
and
t.y = (x + t..x)a - xa = xa [ ( + I ~ r-I]
Recall that (1 + xAX)a - I - a XAX as t,,.x -+ 0 (see Chap. 7). Then
and
and
. dX
Sill Z
Lix cos (x + ~)-
2
. dX
lim ~Y = lim
AX--+ 0 AX AX--+ 0
Sl~ 2 COS (x + ~) = COS X.
2
Thus
(sin x)' = cos x.
Analogously
(cos x)' = - sin x.
Using formulas for derivatives of sin x and cos x, we easily arrive at
(tan x), = ( sin x) ' = (sin x)' cos x - sin x(cos x)' 1
2 = sec2 x
cos x cos 2 x COS X
so that
Analogously
21-9505
322 8. Differential Calculus. Functions of One Variable
~➔ <P' (x0 ) as .6.x ➔ 0. Therefore the right-hand side of ( **) has a limit
equal to f' (u0 ) <P' (x0 ) as .6.x ➔ 0. So there exists a limit of the left-hand
side of ( * *), i.e., lim Liy , which is the derivative of the composite function
ax-+ o .6.x
y = f[<P(x)] at xo with respect to the variable x.
Evaluating the limit of ( * *) as .6.x ➔ 0, we obtain
(f[ <P(X)] ); Ix= xo = f ' (uo) <P '(xo),
where f' (uo) designates the derivative of f(u) with respect to the variable
u at the point uo = <P(Xo) corresponding to the point Xo of the variable x.
This identity can be written as
dy _ dy du
dx-dudx
or
YxI
= YuUx·
I I ►
I
so that for x < 0 y; = -.
X
Thus
Remark. The chain rule is also applicable when we consider any finite
chain of functions. For example, if y = f(u), u = <p(Z) and z = l/;(x) so that
y = f{ <p [ l/;(x)]} and if there exist the derivatives f~, 'P;
and 1/;; then
21*
324 8. Differential Calculus. Functions of One Variable
0 a :r b .r
Fig. 8.9
IJ='I'( x) / /
/
// --
I
I
I
I
I
I
Fig. 8.10
Clearly, the equations y = f(x) and x = <,0(y) specify the same curve on
the xy-plane. If we display independent variables on the x-axis in both cases,
i.e., if we plot the functions y = f(x) and y = <,0(X) instead of y = f(x) and
x = <,0(y), their graphs will be symmetric relative to the line bisecting the
first and third quadrants of the coordinate plane (Fig. 8.10).
We say that y = f(x) is an increasing function on [a, b] if given any
x1 and x2 in [a, b] such that x1 < x2 there holds /(x1) < /(x2).
For instance, f(x) = x 3 is an increasing function for - oo < x < + oo.
326 8. Differential Calculus. Functions of One Variable
f3 B
X=f{y)
a
0
a - -- -
/3
0 a X
Notice that Lly = f(xo + AX) - f(xo). Then Lly ;r= 0 implies that .dX ;r= O;
otherwise the given point Xo would correspond to two distinct values
Yo = J(xo) and Yo + Lly which is impossible by the definition of a function.
Against this background we can write the ratio Ax/ Lly (Lly ?=- 0) as
.dX
----
1
Lly Lly/ AX.
f' (xo) ¢ 0. Hence there exists a limit of the quotient .1.y~ Ax as Lly ~ 0 (an_d,
consequently, as Llx ~ 0) and this limit is equal to 1 . Then ( *) implies
f' (xo)
that there exists a limit of : as Lly ~0 and
lim Ax= l
.1y-- o Lly f' (xo)
On the other hand, the limit of i as Lly ~ 0 is the derivative cp' (yo) of
x = cp(y) at the point y = Yo- Hence we have
or Xy, = -1 . ►
y;
It is easy to interpret Theorem 8.5 geometrically. If y = f(x) has
a derivative at xo then there exists a tangent to a graph of y = f(x) at
Mo(xo, f(xo)) and _if this tangent is not parallel to the x-axis, then it is also
a tangent to the curve x = cp(y) at Mo (Fig. 8.12). (Notice that the functions
y = f(x) and x = cp(y) are inverse to each other and are specified by the
same curve.) Looking at Fig. 8.12 we see that f' (xo) = tan a, cp' (yo) =
tan /3 and a + /3 = 1r/2 so that tan /3 = tan(1r/2 - a) = cot a = 1/tan a,
1.e.,
where -1 < x < 1 and the "+" sign of ✓ 1 - x 2 is chosen, since cos y >0
for all y E ( - 1r/2, 1r/2).
7T
2
I1 X r
I
I
I
I
I
I
7T
2
Hence
1
(sin - 1 x)' -1 <x< 1.
✓ 1 - x2 '
cos 2 y
Hence
(tan- 1 x)' = 1 for all x.
1 + x2
(c) To derive the formulas for the derivatives of cos- Ix and cot- Ix
it suffices to notice that
sin - 1 X + cos - l X = 1r/2'
tan - I x + cot - 1 x = 1r/2.
Whence
1
(cos- Ix)' for -1 < x < 1,
✓ 1 - x2
1
(cot - 1 x) ' =- 2 for all x.
1+ X
(cosh x)' = (
ex+ e-x) 1
= sinh x.
2
Also by definition
sinh x cosh x
tanh x = h and coth x = . h .
COS X Sin X
Using the quotient rule and the identity cosh 2 x - sinh 2 x = 1, we obtain
(tanh x), =( sinh x)' = cosh 2 x - sinh 2 x = 1
cosh x cosh 2 x cosh 2 x
and
(tan x)' - 1
cos 2 x
1
(cot x)' - (x-¢ n1r, n = 0, ±1, ±2, ... );
sin 2 x
(cos - i x)' = 1
( -1 < x < l);
✓ 1 - x2
(tan - 1 x) ' - - 1-
1 + x2 '
(cot - 1 x)' = 1
----=- .
1+x 2 '
(tanh x)' - 1 ·
cosh 2 x'
dy = l + a(dX)
dy f' (x) .
✓x + Ax = vx + _1c dX (x ~ 0).
2vx
For instance, for ✓ 3.9978 = ✓4 + ( -0.0022). Putting x = 4 and dX =
- 0.0022, we get
The set of all functions f(x) which are defined on (a, b) and have con-
tinuous nth derivatives at every point x E (a, b) is denoted by cn(a, b).
We say that a function /(x) is infinitely differentiable on (a, b) and write
/(x) E C°(a, b) if /(x) has derivatives of any order at each point x E
(a, b). For example, the functions ex, sin x and cos x are infinitely differen-
tiable on ( - oo, + oo). ►
(4) Compute all the derivatives of the function y = x 4 •
◄ We have y<O = 4x3, y<2> = 12x2, y<3> = 24x, y<4 > = 24. Since y<4> is a
constant all derivatives of higher orders are zero, i.e., y<5> = y<6> = . . . =
y<n) = 0. ►
To give a physical interpretation of the second derivative we consider
the law s = s(t) of a rectilinear motion of a point. Then the first derivative
s' (t) = v(t) specifies the velocity of the point at the moment t and the
second derivative s" (t) = v '(t) is the acceleration of the point at t.
334 8. Differential Calculus. Functions of One Variable
Leibniz formula. Let u(x) and v(x) have the nth derivatives. Then
(a) if y(x) = u(x) ± v(x) then y<n\x) = u<n)(x) ± v<n)(x);
(b) if y(x) = u(x)v(x) then
y I = u + uv
IV I'
y 11
. = u v + 2u ' v ' + uu ",
II
It is easy to notice that the formulas on the right resemble the binomial
formulas for (u + v) 1, (u + v) 2 and (u + v) 3 with the exponents indicating
how many times u and v are to be differentiated. The resemblance becomes
nearly complete if u and v are replaced by u<0 ) and v <0 >, standing for deriva-
tives of the zero order. Using the method of mathematical induction, we
can show that in general
X' =-
l
y y;
and
t in the open interval (a, /3). Suppose that there exists an inverse function
t = g(x) of x = 'P(t). Then y = 1P[g(x)] is a composite function of x. Fur-
thermore, we assume that 'P(t) and 1P(t) are differentiable at the point t E
(a, /3), 'P '(t) ¢ 0 and t = g(x) is differentiable at a point x corresponding
to t. Then by Theorem 8.3 the function y = 'If [g(x)] is differentiable at the
point x and
Yx' = Yt't'x•
On the other hand, by virtue of Theorem 8.5 we have
t'=_!_
X I
x,
lj
Fig. 8.15
so that
I 1 y/' I
Yx = Y,- = -
x/ x/
or
22-9505
338 8. Differential Calculus. Functions of One Variable
d 2 y _ d (dy) _ d
dx 2 - dx dx - dt
(if;''(t)(t)) dxdt
cp
if; " (t) cp ' ( t) - if; ' (t) 'P " (t) 1 t/; " (t) 'P ' (t) - if; ' (t) 'P " (t)
- (cp ') 2 (!) . cp' (t) = (cp ')3(t)
Hence
y;' = (yx')/
x/
and in general
[ (n - I)] ,
y?> = Yx , t ,
x,
where y = f(x) is given in a parametric form by the equations x = x(t) and
y = y(t).
Example. Compute d 2{ provided that y = f(x) is given by
dx
[
x = a(t - sin t),
y = a(l - cos t).
◄ We have
dy = dyldt = a sin t = cot!_
dx dxldt a(l - cost) 2
and
d 2y
d:x2
= _!!_ (dy) = !!_
dx dx dt
(cot !._)
2
dt
dx
= !!_
dt
(cot !._)
2
1
dxldt
1 1 1 1
--·-·------ ►
sjn2 !_ 2 a(l - cos t)
2 4a sin 4 ~
8.5 Mean Value Theorems 339
!J
0 a b X
Fig. 8.16
Since f(x) has a derivative f' (x) at every point of (a, b) there exists
a derivative f' (~) at ~ and
lim f(~ + ax) - /(~) = lim f(~ - ax) - /(~) = f' (~) .
.ix-+O ax .1x ➔ O -ax
.ix>O .ix>O
On the other hand,/(~) =Mis the maximum of f(x) on [a, b] so that
f(~ + ax) - /(~) ~ o and /(~ - ax) - /(~) ~ o.
Whence
f(~ + ~ - /(~) ~ o and f(~ - ax) - /(~) ~ o (ax > O).
-ax
22*
340 8. Differential Calculus. Functions of One Variable
-r----------------
a 2
b I
Fig. 8.17
By way of illustration we shall show that the conditions (1)-(3) are im-
portant and if they are violated Rolle's theorem may not be true. Consider
the function f(x) = Ix I, -1 ~ x ~ 1 (Fig. 8.18). For this function condi-
tion (2) is not satisfied as /(x) has no derivative at x = 0. In this case Roll e's
theorem is inapplicable as in ( -1, 1) there exists no point at which the
derivative f' (x) is equal to zero. Indeed, f' (x) = -1 if - 1 < x < 0 and
f' (x) = 1 if O < x < 1 while at x = 0 f' (x) does not exist. Looking at
Fig. 8.19 we notice that the function/(x) = x - [x] does not satisfy condi-
y
y
-1 1 X
and/' (n is the slope of the tangent toy = f(x) at the point with the abscis-
sa x = ~- Hence the mean value theorem states that if AB is a continuous
curve such that there is a tangent at every point between A and B then
there exists at least one point C(~, J(n) between A and B at which the
tangent to the curve AB is parallel to the straight line AB.
The formula
f(b) - f(a) = f, (~)
b-a
IJ
f(b)
0 a b X
Fig. 8.20
or
f(b) - f(a) = f' (~)(b - a), a < ~ <b
is also valid for a > b. Since ~ is in general unknown it is convenient to
represent it as
~ = a + O(b - a),
where () is a real number, 0 < () < 1. Then the above formula becomes
f(b) - f(a) = f' (a + O(b - a))(b - a), 0 < fJ < 1.
Replacing a and b by x and x + Llx, respectively, we get
Jif(x) = f(x + AX) - f(x) = f' (x + 0 Llx)Jix, 0 < () < 1.
This is the exact expression that relates an increment of y = f(x) and an
increment ax whereas the relative error of the approximate relation
Jif(x) =f' (x)Jix
8.5 Mean Value Theorems 343
tends to zero only as Llx -+ 0. Also notice that in the exact relation the
number 0 is in general unknown.
Example. By use of the mean value theorem prove that
I tan - 1 x2 - tan - 1 x1 I ~ Ix2 - xi I Vx1, x2.
◄ Consider the function f(x) = tan - 1 x. This function satisfies the condi-
tions of the mean value theorem on any [a, b]. Then for any X1 and x2
f(x2) - f(xi) = f' (~)(x2 - xi)
or
tan -1 x2 - tan -1 X1 = 1 (x2 - x1 ) ,
2
1+ ~
where the point ~ is between the points X1 and X2.
Whence
1
I tan - 1 x2 - tan - 1 x1 I = 2 lx2 - x1 I
1+ ~
and
I tan - 1 x2 - tan - 1 x1 I ~ Ix2 - xi I
1
since - - 2 ~ 1 for all ~. ►
1+ ~
Problem. Using the mean value theorem prove that
X
-1 -- < In (1 + x) < x, x > - 1.
+x
Theorem 8.8 (Cauchy mean value theorem). If the functions f(x) and
<,0(x) (1) are continuous on [a, b], (2) have derivatives f' (x) and <,0' (x) in
(a, b) and (3) if <,0 '(x) ~ 0 in (a, b) then there exists at least one point
~ in (a, b) such that
that satisfies the conditions of Rolle's theorem. Indeed, (1) F(x) is continu-
ous on [a, b] since f(x) and <P(X) are continuous on [a, b]; (2) F(x) has
a derivative F' (x) at every point in (a, b), since every summand in the right-
hand side of F(x) has a derivative in (a, b); and (3) F(a) = F(b) = 0.
From Rolle's theorem we conclude that between a and b there exists
~ such that F' (~) = 0. The derivative F' (x) of F(x) is
_ We can introduce the similar notions for limits at infinity, i.e., for limits
as x ➔ oo.
Theorem 8.9 (IJHospital's rule). Let f(x) and ',O(X) have derivatives f' (x)
and "°'
(x) in a neighbourhood (a - o, a + o) of a point a except probably
at a. Suppose that ~(x) and ~' (x) are not equal to zero in (a - o, a + o).
If limf(x) = 0 and lim ~(x) = 0 and if the quotient f ', ~x~ has a finite or
x➔a x➔a ~ X
infinite limit as x ➔ a then there exists
lim f(x) = lim f' (x) .
x ➔ a ',O(X) x ➔ a ',O' (x)
◄ Theorem 8.9 says nothing about the values of f(x) and ',O(X) at x = a.
We put f(a) = 0 and ~(a) = 0. Then limf(x) = f(a) and lim',O(X) = ~(a)
x ➔ a x-+a
and the functions f(x) and ~(x) become continuous at a so that on
[a, x] (or on [x, a]) where x is a point in (a - o, a + o), f(x) and ~(x)
satisfy the conditions of the Cauchy mean value theorem. Hence, there ex-
ists at least one point ~ = ~(x) between a and x such that
f(x) _ f(x) - f(a) _ f' (~)
---
~(x) ~(x) - ~(a) ~' (~)
If for some x there exist more than one such ~ we choose any of them.
The point ~ is dependent of x and ~ ➔ a as x ➔ a. As stated, the quo-
tient f' (x) has a finite or infinite limit as
"°, (x) x➔ a. This limit is independent
of how x tends to a. So the quotient f' (~) has a limit equal to a limit
',O' (~)
From the identities (•) and (••) we have obtained it follows that
lim /(x) = lim f'(x) . ►
x-+a',O(X) x-+a ',O' (X)
The above identity represents CHospital's rule which allows, under
some specific conditions, to replace a limit of a quotient of functions by
346 8. Differential Calculus. Functions of One Variable
On the other hand, the quotient of the derivatives f' (x) = 2x sin ! -
cp' (x) x
cos ! has no limit at x = 0. Hence, from the existence of lim f((x)) it does
X x-+O 'PX
lim <,0(x) = oo and if f' (x) has a finite or infinite limit as x--. a then there
x-+a <,0'(x)
exists
lim f(x) = Iim f, (x) .
1
x-+a<,O(X) x--+a<,0 (X)
Example.
. ( -X-
1Im - -1-)
x In-x---x-+- 1-
= II. m
x-+ 1 x- 1 In x x-+ 1 (x - 1) In x
1. In x 11• · 1/x 1
= }~ In x + 1 - 1/x = x~ 1/x + 1/x2 - 2·
(c) lim lf(x)],px for each of the following cases
x-+a
348 8. Differential Calculus. Functions of One Variable
◄ Put
Y = lf(x)]~(x).
Let us consider
In y = <,0(x) In f(x)
and evaluate
liminy = lim[<,0(x)Inf(x)].
x ➔ a x ➔ a
so that
lim y = e0 = I. ►
x ➔ 0+0
Theorem 8.11. Suppose that (1) functions f(x) and <,0(x) are defined for
all x such that lxl is sufficiently large; (2) lim f(x) = Iim <,0(x) = 0 or
x ➔ oo x ➔ oo
lim f(x) = oo and lim <,0(x) = oo; (3) there· exist derivatives f' (x) and
x➔m x➔ m
<,0' (x) (<,0' (x) ¢ O)for all x such that lxl is sufficiently large; (4) there exists
a finite or infinite limit of the quotient f' (x) as x ..... oo. Then
<,O '-(X)
To verify that Theorem 8.11 is true it suffices to put x = lit and use
the results of Theorem 8.9 and Theorem 8.10.
8. 7 Tusts for Increase and Decrease of a Function 349
Example.
x2 . 2x
lim xe - lIm - = lim 2x = 0.
x➔ +ao x ➔ + Cl0 ex x➔ + Cl0 e
Applying l!Hospital's rule repeatedly n times, we obtain
. nxn - I 1· n! 0
hm x = ... = Im - =
X➔ + ao e X➔ + ao ex
.
IIm ✓1 + x2 .
1Im ✓1 + x2 .
1Im x
--- = --- =
x➔ + ao x x➔ + ao 1 x ..... + ao ✓ 1 + x2
1
lim - - - - lim ✓ 1 + x 2 , etc.,
x ➔ +ao X
.
1Im ✓ 1 -+ -
-
x ➔ +ao
x2 =
X
1·Im
x ➔ +ao
Fi• 2
X
+ = 1.
Theorem 8.12. Let f(x) be continuous on [a, b] and let f(x) have a
derivative f' (x) on (a, b). For the function f(x) to be nondecreasing on
[a, b] it is necessary and sufficient that f' (x) ;:;: 0 for all x in (a, b).
Necessity. Letf(x) be nondecreasing on [a, b] (Fig. 8.21). We prove that
f' (x) ;:;: 0 on (a, b).
y=f(x)
lj I
I
I
I
I
I
I
I
f(x+,:1r)
I
f(r) I
f(x- I
-,:1rJ I
I
0 a X-L1I r r-t,:1r b X
Fig. 8.21
Notice that at any point x in (a, b) there exists a derivative f' (x). Then
from the above inequality it follows that
I
I
I
I
f(x 0 J I
I
0 X
Fig. 8.22
This means that there exists o> 0 such that for all tlX
/(Xo + AX) - /(xo) >0
tlX
whenever O < ItlX I < o.
Whence it follows that if O < ItlX I < o, AX and /(xo + AX") -- /(xo) are
of the same sign, namely, if AX < 0 then f(xo + AX) - /(xo) < 0, i.e.,
f(xo + AX) < f(xo), and if tlX > 0 then /(xo + AX) - /(xo) > 0, 1.e.,
/(Xo + AX) > /(xo).
By definition this means that f (x) increases at xo.
Using similar reasoning, we can show that if /' (xo) < 0 then /(x)
decreases at xo. ►
y y
r
0 .r
Fig. 8.23
Fig. 8.24
0 .r
Fig. 8.25
lj
Fig. 8.26
23-9505
354 8. Differential Calculus. Functions of One Variable
0 I
Fig. 8.27
whenever O < Ix - xo I < o. Then the point x 0 is called the point of strict
maximum (minimum) of a function. Here we do not assume that f(x) is
continuous at Xo. For example, the function
f (x) = [x 2 for x ~ 0
1 for x = 0
.
is not continuous at x = 0 butf(x) has a maximum at x = 0. Indeed, there
exists o > 0, say o = 1, such that f (x) - f(0) = f (x) - 1 < 0 for all x ~ 0
in ( -1, 1) (Fig. 8.28).
Problems. (1) Using the definitions of maximum and minimum, prove
that the function
_ [e - I/x2 for x -¢. 0
f(x) - 0 for x =0
has a minimum at x = 0, and the function
_ [xe - uxz for x -¢. 0
g(x) - 0 for x =0
has no extremum at x = 0.
8.8 Extrema of a Function 355
IJ
y
y=f(x)
I
I
I
I
f(x0 ) I
I
I
0
0 Io -rJ Io Xo+rJ X
Using similar reasoning we arrive at the same conclusion for f' (xo) < 0.
Hence, if there exists a derivative f' (x) at a point xo and if f' (xo) -;e. 0
then f(x) can have neither a maximum nor a minimum at xo. Thus f(x)
can have an ext rem um only at a point where its derivative f' (x) either is
equal to zero or does not exist. ►
Figure 8.30 offers a geometric illustration of Theorem 8.15. The func-
tion y = f(x) has extrema at the points x1, x2, X3 and X4, The derivative
f' (x) does not exist at x1 and X4 and f' (x) is equal to zero at x2 and X3.
The points where the necessary condition for extremum of a function
f(x) is satisfied are sometimes called the critical points of f(x). These are
the roots of the equation f' (x) = 0 and the points where f' (x) does not
exist (in particular, the points where f' (x) is an infinity). The points where
f' (x) = 0 are called the stationary points of f(x), since the rate of change
of the function f(x) at these points is zero.
Theorem 8.15 specifies only the necessary condition for extremum of
a function f(x) and it does not necessarily have a maximum or a minimum
23*
356 8. Differential Calculus. Functions of One Variable
at every critical point. For example, if /(x) = x 3 then f' (0) = 0 so that the
point x = 0 is a critical point. However f(x) = x 3 has no extremum at x = 0
since f(0) = 0 and f(x) < 0 for x < 0 and f(x) > 0 for x > 0, which means
that f(x) increases at x = 0.
The following theorems specify the sufficient conditions for a function
to have a maximum or a minimum at a point.
lj
0 r
Fig. 8.30
(xo - o, xo) and for all x in (xo, xo + o), then given any whatever small
(xo - o, xo + o) f(x) increases both on the left and on the right of xo so
that f(xo) is neither a maximum nor a minimum of f(x) in (xo - o,
xo + o),
i.e., f(x) has neither a maximum nor a minimum at xo.
For the sufficient conditions given by Theorem 8.16 and Theorem 8.17
to be satisfied it is important that the function f(x) be continuous at xo.
For example, if
-x for x < 0,
f(x)= [ X + 1 for x ~ 0,
then the derivative f' (x) does not exist at x = 0 (Fig. 8.32). The derivative
f' (x) changes its sign when x moves through x = 0; however, f(x) has no
extremum at x = 0 since there exists no neighbourhood of x = 0 where
f(O) = 1 would be either a maximum or a minimum of f(x). The point
is that the function f(x) is not continuous at x = 0.
!J
0 Io +!J X
0 .r.
lj
lj
0 X
f(x) =
x2 ( 2 - sin !) for x 7' 0,
0 for x = 0.
8.8 Extrema of a Function 359
!J= 3x 2
Fig. 8.35
Theorem 8.18. Let f(x) have the first and second derivatives at a point
xo and let f' (xo) = 0 and f" (xo) -;,1: 0. Then at Xo the function f(x) has a
maximum if f" (xo) < 0 and a minimum if f" (xo) > 0.
◄ Observe that the point Xo is a critical point of the function f(x) since
f' (xo) = 0. Let f" (xo) < 0. This implies that at Xo the first derivative f' (x)
of f(x) decreases so that there exists a neighbourhood (x0 - o, xo + o) of
xo such that f' (x) > f' (xo) = 0 for all x in (xo - o, xo) and f' (x) <
f' (xo) = 0 for all x in (xo, xo + o). Hence f' (x) changes its sign from posi-
tive to negative at xo when x moves through x 0 from left to right. Thus
f(x) has a maximum at xo.
Similar reasoning yields that f(x) has a minimum at x 0 if f" (xo) > 0
at Xo. ►
Theorem 8.18 enables us to lay down the following useful procedure
for investigating a function for extremum. First, we find all critical points
of a function as outlined in the general procedure given above. Second,
we compute the second derivative f" (x) at a critical point and determine
its sign if f" (x) exists. If at some critical point xo f" (xo) < 0,f(x) has a
360 8. Differential Calculus. Functions of One Variable
maximum at xo and if f" (xo) > 0 at Xo then f(x) has a minimum at xo.
If the second derivative either is equal to zero or does not exist at Xo then
we can decide on the extremum of f(x) at this point by using the first deriva-
tive of f(x) as specified by the general procedure.
Example. Investigate the function y = e - xz for extremum.
◄ We have y' = - 2xe - xz so that x = 0 is a critical point of f(x). We
find y" = - 2e - xz + 4x 2 e - xz and y" (0) = - 2 < 0 so that f(x) has a
maximum at x = 0 (Fig. 8.36). ►
Fig. 8.36
f(b)
0 I
Fig. 8.37
maximum Mat the endpoints of [a, b]. So, if we wish to find an absolute
maximum of a functionf(x) continuous on a closed interval [a, b] we have
to find all maxima of f(x) in an open interval (a, b) and the values of
f(x) at the endpoints of [a, b], i.e., f(a) and f(b), and choose the greatest
value as the absolute maximum of f(x) on [a, b].
The absolute minimum of a function f(x) continuous on [a, b] is the
least value of f(x) among all the minima of f(x) in (a, b) and the values
of f(a) and f(b). In the case of a function shown in Fig. 8.37 we have
M = f(b) and m = f(xo).
Example. Suppose that we have a square sheet of steel whose side is
a and we wish to make a box of maximal volume by cutting out four equal
squares at the vertices of the given sheet as shown in Fig. 8.38 and flanging
the sheet. How should we choose the size of the cut out squares to get
a box of maximal volume?
X
I
I
----------
I
I
___
I
__J
I
L---
r---
1
I
I
Fig. 8.38
lj
0 I
Fig. 8.39
!:I
0 .r
Fig. 8.40
0 r
Fig. 8.41
I)
0 r r
Fig. 8.42
Since the equation of the tangent to the curve is
Y - f(xo) = f' (xo)(x - xo)
we have
Y - Y = f(x) - [ f(xo) + f' (xo)(x - xo)]
= [ f(x) - f(xo)] - f' (xo)(x - Xo)
or
A =y - Y= [f(xo + h) - f(xo)] - f' (xo)h,
where h = x - xo.
Let f(x) have the second derivative f" (x) at xo and in some neighbour-
hood of x0 • Applying the mean value theorem (Theorem 8. 7) to the above
identity, we get
A = f' (xo + 8h)h - f' (xo)h = [ /' (xo + 8h) - f' (Xo)]h,
where (J = (J(h) and O < (J < 1.
8.9 Shape of a Curve. Points of Inflection 365
so that
. f' (xo + Oh) - f' (xo) _ f" ( )
I1m Oh - Xo
Oh--+ 0
and
f' (xo + Oh) - f' (xo) = f" (xo)·Oh + a(Oh)·Oh,
where a(0h) ~ 0 as h ~ 0.
Then we can write
~ =y- Y= [f"(xo) + a(Oh)]Oh 2 •
lj
andf" (x) =- ~ 1/?. Clearly, there exist no points wheref" (x) = 0 and
at x = 0 f" (x) does not exist. Let us investigate the sign off" (x) in a neigh-
bourhood of x = 0. We have f" (x) > 0 for all x in ( - o, 0), o > 0, and
f" (x) < 0 for all x in (0, o) so that the curve is convex downward on the
left of the point 0(0, 0) and convex upward on the right of 0(0, 0). Hence,
the point 0(0, 0) is the point of inflection of y = x 113 . The tangent to
this curve at 0(0, 0) is perpendicular to the x-axis, for lim /' (x) = + oo.
x-+O
Finally, we can state the following sufficient condition for a point to
be a point of inflection. Let y = f(x) have a tangent at Mo(Xo, f(xo)) which
can be parallel to the y-axis. Let f(x) have the second derivative continuous
in a neighbourhood of Xo except probably at xo. If f" (x) is equal to zero
or does not exist at xo and if/" (x) changes its sign when x passes through
xo then Mo(xo, f(xo)) is the point of inflection of y = f(x).
8.10 Asymptotes of a Curve 367
holds.
lj
Fig. 8.45
Fig. 8.46
y lj
0 0 X
y lj
0 X 0 X
Fig. 8.47
8.10 Asymptotes of a Curve 369
!J
-1 0 :r
Fig. 8.48
y=f(x)
Fig. 8.49
Then
and
lim [ f(x) - kx] = lim [b + a(x)] = b
x-++oo x-++oo
Fig. 8.50
x2
Example. Prove that the graph of y = has an oblique asymptote.
x- 1
2
◄ The graph of y = X x- has the vertical asymptote x = 1 (Fig. 8.50).
1
Let us write this function if /(x) = x 2 1~ 1 = x + 1 + x-1
x-
- 1 where
372 8. Differential Calculus. Functions of One Variable
1 tends to zero as x-+ oo. Hence, the function being considered can
x- 1
be expressed as
/(x) = x + 1 + a (x),
where a(x) = 1 -+ 0 as x-+ oo.
x- 1
x2
Whence we conclude that the graph of the function y = has the
x- 1
oblique asymptote y = x + 1. ►
It is worth noting that the graph of y = f(x) lies below its asymptote
if ~ = /(x) - kx - b < 0 and the graph lies above the asymptote if ~ > 0.
Horizontal asymptotes. If the function /(x) has the finite limit
lim /(x) = b ( lim /(x) = b) as ·x-+ + oo (x-+ - oo) then the line
x-+ + oo x-+ - oo
y = b is the horizontal asymptote of the right (left) branch of the graph
of y = f(x).
Clearly, a horizontal asymptote is a specific case of an oblique asymp-
tote provided that k = 0.
.r
Fig. 8.51
Examples. (1) Lety = 1/x. Since Jim 1/x = 0 the graph of the function
.l' ➔ 00
- ,r/2. Hence, the right-hand branch of the graph of y = tan - 1 x has the
asymptote y = 1r/2 and the left-hand branch has the asymptote y = - ,r/2
(Fig. 8.51).
8.11 Curve Sketching 373
. .
(3) Let y = sin x and y(O) = 1. Since lim sm x = 0 the line y = 0 is the
X x---+oo X .
horizontal asymptote of the graph of the function y = sm x (Fig. 8.52).
X
As easily follows from the above example the graph of the function
y = f(x) can intersect its asymptote infinitely many times.
Problem. Derive the condition for asymptotes of the graph of the rational
function to exist.
y
-311 0 377' r
Fig. 8.52
(c) The function is even, i.e., f( - x) = f(x) so that the graph is symmetric
relative to the y-axis. The function is not periodic. Since the function is
even it suffices to construct the graph for x ~ 0 and reflect this graph with
respect to the y-axis.
(d) The graph lies above the x-axis since y = 1 at x = 0 and y > 0 for
all x; y -:;e; 0.
(e) The graph has the horizontal asymptote y = 0 since lim f(x) =
x---->±oo
lim 1 2 = 0. There are no oblique asymptotes.
x-+±ool+x 2
(t) The first derivative of y is y' = - x 2 2 so that f(x) increases
(1 + X)
for x < 0 and decreases for x > 0. The point x = 0 is critical. The derivative
y' changes its sign from negative to positive when x moves through the
point x = 0 from left to right. Hence, the function has the maximum
y(O) = 1 at x = 0. This almost immediately follows from the inequality
f(x) = 1 2 ~ l, holding for all x.
1 +X
(g) The second derivative y" 1 - 1 ~\ vanishes at the points
= -2
(1 + X)
x = 1/v3 and x = - 1/v3. The graph is convex downward for x > 1/v3 and
convex upward for x < 11\"3 since y" > 0 in the former case and y" < 0
in the latter. Hence, x = 1/\"3 = v3/3 is the point of inflection. By virtue
of symmetry the point x = -1/\"3 = - ../3/3 is also the point of inflection.
It is convenient to represent the results in the tabular form
Table 8.1
X
(-~. ~) -
v3
3 (- ·)
~'
0
(·· ~) v3
3 ( -~)
~'
f'(x) + + + 0 - - -
-
f" (X) + 0 - -2 - 0 +
The arrows /' and \i refer to the increase and decrease of the function,
respectively. The graph of the function is given in Fig. 8.53.
2 1
(
2) Y = X + - .
X
(a) The domain is the number line except the point x = 0.
8.11 Curve Sketching 375
lim
x--+O+O
(x> + !) = + oo and
lim
x--+0-0
(x> + !) = -00
0 I
Fig. 8.53
(c) The function is neither even nor odd and it is not periodic.
•
(d) Putting y = 0, we get x 2 + -1 = 0 or - + -1 = O; whence x = -1
x 3-
X X
so that the graph of the function meets the x-axis at the point x = -1.
(e) There are neither oblique nor horizontal asymptotes since
lim /(x) =
x--+:1::00 X
lim
x--+:1::00
(x + --4) = ±
X
oo.
. denvat1ve
(f) Th e f ust . . 1s . y , = 2x - 1 = 2x3 - 1 so t h at t he pmnt
.
2 2
X X
at the point x = i-- ; hence, at this point the function has the minimum.
~
(g) The second derivative y" = 2<x~ ; 1) vanishes at x = -1 and
X
changes the sign from positive to negative when x passes through the point
x = -1 from left to right. Thus the point x = -1 is the point of inflection.
Since y" > 0 for x in ( - oo, -1) and for x in (0, + oo) and y" < 0 on
the interval -1 < x < 0 the graph is convex downward on the open inter-
,376 8.. Differential Calculus. Functions of One Variable
vais ( - oo, -1) and (0, + oo) and is convex upward on the open interval
-1 < X < 0.
The graph of the function is given in Fig. 8.54.
In x
(J) y =X +- .
X
(a) The domain is the positive number line x > 0.
(b) There are no discontinuities in the domain of definition.
Fig. 8.54
We have Iim
x-+O+O
(x + In x)
X
= - oo so that the line x = 0 passing
through the endpoint of the domain of definition is the vertical asymptote
of the graph of the function.
(c) The function is neither even nor odd nor periodic.
-(d) Putting y = 0, we have x + In x + In x = 0. The soiu-
= 0 and x 2
x
tions of these equations which can be obtained from the graph shown in
Fig. 8.55 define the point of intersection of the graph under investigation
with the x-axis.
8.11 Curve Sketching 377
(e) Since
lim f(x)
x--+ + 00 X
= lim
x--+ + 00
(1 + lnx x) = 1= k
2
and
. In x
lim [/(x) - kx] = I1m - - = 0,
x--+ + 00 x--+ + oo X
Fig. 8.55
. d • • • ,
= 1 + 21 In x
=x + 1 - In x
2 ur.
(f) Th e first envat1ve 1s y - - 2- 2 • ne can
X X X
easily see from Fig. 8.56 that x 2 + 1 > In x for all x > 0, Hence, y' > 0
for all x and /(x) increases on the interval (0, + oo) and has no extrema.
. . 2 1 2 In x 2 In x - 3
(g) The second denvat1ve y" = - - 3 - 3 + 3 = 3 van-
x X X X
ishes at the point x = e 312
and changes the sign from negative to positive
when x passes through x = e312 from left to right. Hence, x = e 312 is the
point of inflection.
The graph of the function is shown in Fig. 8.57.
1
(4) y = X + 2 .
X
(a) The domain is the number line except the point x = 0.
(b) The point x = 0 is the discontinuity of the second kind. The line
378 8. Differential Calculus. Functions of One Variable
Fig. 8.56
Fig. 8.57
8.11 Curve Sketching 379
and
lim [ f(x) - x] = lim -; = 0 = b.
x--+±o::> x--+±o::> X
Fig. 8.58
(5) y = Vx(x - 3) 2 •
(a) The domain is the number line.
(b) The function is continuous everywhere in the domain of definition
and its graph has no vertical asymptotes.
(c) The function is nonperiodic and neither even nor odd.
(d) The function vanishes at the points x = 0 and x = 3.
(e) The graph of the function has the oblique asymptote y = x - 2 since
and
lim
X--+±oo
[ /(x) - x] = lim
x--+±oo
['Vx(x - 3) 2 - x]
. x(x - 3) 2 - x3
hm 3 3 -2 = b.
x--+ ± 00 1/ x 2 (x - 3) 4 + x1/ x(x - 3) 2 + x2
lj
Fig. 8.59
vanishes at the point x = 1 and does not exist at the points x = 0 and
x = 3. y' is of the same sign on the left and on the right of the point
x = 0 so that the function has no extremum at x = 0. On the other hand
the first derivative changes its sign from positive to negative when x passes
through the point x = 1 from left to right so that the function has a maxi-
mum at x = 1.
At the point x = 3 the function has a minimum since the first derivative
changes its sign from negative to positive when x passes through the point
x = 3 from left to right.
(g) The second derivative
II 2
Y = - xs/3 (x _ 3)413
does not exist at the point x = 0 and changes its sign from positive to nega-
tive when x passes through the point x = 0 from left to right so that the
point x = 0 is the point of inflection and the graph of the function has
the tangent parallel to the y-axis at (0, 0). The point x = 3 is not the point
of inflection. The graph of the function, shown in Fig. 8.59, is convex up-
ward in the half-plane x > 0.
y y
0 0 b X
(a) (b)
y
0 0
(c) (d.}
Fig. 8.60
The point a1 at which the secant AB intersects the x-axis lies between
a and ~' being a better approximation to ~ than a. Substituting y = 0 into
the equation of the secant AB, we easily obtain
_ f(a)(b - a)
01 - a - f(b) - f(a) ·
Look at Fig. 8.59. It is easy to observe that at a1 the signs of /(x) and/" (x)
are opposite.
Now we draw the tangent to the curve y = f(x) at the point B(b, /(b)).
Notice that at this point /(x) and/" (x) are of the same sign; this is a very
important condition: if /(x) and/" (x) are of opposite signs at B(b, /(b))
8.12 Approximate Solution of Equations 383
it can happen that the process of approximating the root diverges. The
point b 1 of intersection of the tangent with the x-axis lies between b and
~ and offers a better approximation to ~ than b. The tangent is given by
the equation
Y - f(b) = f' (b)(x - b).
Then putting y = 0, we get
b1 =b - Pt~) (f' (b) ¢- 0).
0 X
Fig. 8.61
and
and
where
f(an - 1)(bn - 1 - lln - 1)
lln = lln - 1 - I'. b
J( n-1) -
f(
lln-1 ) ,
bn -- b n - -f(bn - -
i) ( ao = a, bo = b, n = 1' 2 ' 3' )
-1 f' (-
bn - 1) ··· ·
The sequences {an ) and {bn} are monotone and bounded; hence they
have limits. Let
lim an = a and lim bn = {3.
3 5
b1 = 2 - =
4 4 = 1 + 41 .
For n = 2 we get
1 1
a2 = 1 - 14 and b2 = 1 + 40 ,
which are the approximate values for ~ with the absolute error .d(~*) <
0.1. ►
8.13 Taylor's Theorem 385
25-9505
386 8. Differential Calculus. Functions of One Variable
Notice that Taylor's formula gives the value of P(x) at any point x
provided that the values of P(x) and all its derivatives at some point a
are known.
Taylor's formula for arbitrary functions. Now we consider a function
f(x) defined in a neighbourhood of x = a; the function f(x) may not be
a polynomial of degree (n - 1) but is supposed to have derivatives up to
order n in this neighbourhood.
Let us compute the values f(a), f' (a), ..-. , fn - 1>(a) and use them to
construct the function
_
Qn - 1 (x) - /(a) +
f 1 (a)
I! (x - a) + ... +
tn -
(n _
I) (a)
I)! (x - a)
n- .
1
Let us put
f(x) = Qn - 1 (x) + Rn (x).
This equality is called Taylor's formula for the function f(x) in the neigh-
bourhood of the point x = a or Taylor's formula for the function f(x) at
the point a, and Rn (x) is called the nth remainder of Taylor's series.
The remainder Rn (x) can be expressed in terms of the nth derivative
of the functionf(x). For the purpose we assume thatf(x) is not a polynomi-
al of degree (n - 1) and possesses continuous derivatives up to order
(n - 1) on [a, b] and there exists the nth derivative of f(x) in (a, b). Sub-
stituting b for x in Taylor's formula for the function f(x) at the point a,
we can write
f(b) = f(a) + f__'Ja) (b - a) + f" (a) (b - a) 2
1! 2!
fn-l)(a) n-1
+ ... + (n _ l)! (b - a) + Rn.
We shall represent R11 in the form
Rn= M(b - at,
where M is the quantity to be defined.
To this end we consider the auxiliary function
'P' (x) = -
~
[1' (x) - f' (x)
1!
+ f" (x) (b - x) - f" (x) 2(b - x) +
1! 2!
25*
388 8. Differential Calculus. Functions of One Variable
fn - 1) (x) n- 2 tn - 1\x) n- 2
+ ··• + (n _ 2)! (b - x) - (n _ I)! (n - I)(b - x)
Therefore
R = fn>(~) (b - a)n
n n.I '
where ~ lies between a and b and is the nth remainder as given by Lagrange.
If we put n = 1 Taylor's formula for f(x) becomes as given by Theo-
rem 8.7
f(b) = f(a) + [_~~g} (b - a)
or
f(b) - f(a) = f' (~)(b - a).
8.13 Taylor's Theorem 389
Taylor's formula remains valid for any points xo and x in [a, b] so that
we can write it in the form
fn-1\x)
+ ... + (n - l)~ (x - xor- I + Rn(X),
where Rn(X) = fn)~~) (x - xot and the point ~ lies between x and x 0 , or
n.
This formula is sometimes called Taylor's formula for the function f(x)
with the remainder as given by Peano. It is easy to notice that the error
of the approximation of f(x) by Taylor's formula is an infinitesimal of
higher order than (x - xot as x-+ xo. Hence, this formula is suitable to
apply if we wish to approximate f(x) at points sufficiently close to xo. So
it is somet.imes called the local Taylor's formula.
Maclaurin's formulas for some elementary functions. We shall apply
Maclaurin's formula
f(x) = /(0) + f' (0) x + f" (0) x2 + fn - 1)(0) n- I
l! 2! + (n - l)! x
+ f n\ 0x) Xn O< 0 < 1
n! '
to get the approximation of some elementary functions.
Examples. (1) The function f(x) = ex.
◄ We have
and in general
fm>(x) = sin (x + m ;) .
Whence
fm)(0) = sin m 1r = [ O
for m = 2k,
2 ( - l)k for m = 2k + 1,
and
Therefore, the terms with even powers of x vanish so that Taylor's polynomi-
al involving (2n + 1) terms is identically equal to that involving (2n + 2)
terms. Then applying Maclaurin's formula and putting n = 2k + 1, we
obtain
• X X3 X5 2k - I
sm x = TI - Jf + 5T + . . . + ( - l) k - 1
(2 k _ 1) ! + R 2k + 1 (x ),
X
where
X2k + l [ ]
R2k + 1 (x) = (2k + l)! sin 0x + (2k + 1) ; , 0 < 0 < 1.
lxl 2k + I
Clearly, I R2k + 1 (x) I ~ (2k + 1)! . ►
(3) The cosine function f(x) = cos x.
◄ We have
f(x) = cos x, f(O) = 1,
f' (x) = -sin x, f' (0) = 0,
f " (x) -- --cos x, f" (0) = -1
and in general
fm>(x) = cos (x + m ;)
so that
where
X2k +2
Evidently, IR2k + 2 (x) I ~ (2 k + 2)! . ►
Maclaurin's formulas for sin x and cos x are suitable when we wish to
approximate these functions with any predetermined errors. The Taylor ap-
proximations to the functions sin x and cos x in a neighbourhood of x = 0
are shown in Fig. 8.62 and in Fig. 8.63.
IJ
:r 0
-1
= --x - x + -x -
2 3
ln(l + x)
1 2 3
where
+ ... + a(a - 1) . . . (a - n + 2) n- 1 +R ( )
(n - l)! X n X'
where
(1 m _ m m(m - 1) 2 · m
+ X) - 1+ I! X + 2! X + ... + X VX.
x2 x4 n x2n
cos x = I - 2! + 4! - + ( -1) (2n)! + o(x2n + i), x-+ 0;
x2 x3
ln(l + x) = x - 2 + 3
(1 + x)a = 1 + ax + a(a2~ l) x2
+ ••• + a(a - 1) . . . (a - n
I
+,. I) X n + O( Xn), X-+ 0.
n.
. X - Sill X
Example. Evaluate hm 3 •
x--+O X
· X - (X - x' + 0 (x 4 ) )
- 1
. x - sm x _ 1.
1Im 3.
3 - Im 3
x--+O X x--+O X
of a(t) when t varies over its domain of definition is called the hodograph
of the vector function a(t). In general the hodograph of a(t) is a curve
which contains all points of the space where a(t) ends (Fig. 8.64). Observe
that the hodograph of the position vector r of a moving point coincides
with the trajectory L of this point.
The equation
r = r(t), a < t < {3
or
r(t) = <P(t)i + ,/;(t)j + -y(t)k
is called the vector equation of the curve L.
lj
0 L
The equations
X = <PU),
[ y = ,/;(t), a<t</3
z = -y(t),
are called the parametric equations of the curve L.
For example, the equations
= R cost,
X
[ y = R sin t, 0 ~ t < 21r (R, h are constant)
z = ht,
are the parametric equations which describe one turn of a screw line (or
a circular helix) in space (Fig. 8.65).
398 8. Differential Calculus. Functions of One Variable
da
dt
0
A
so that the vector a(t) tends to coincide with the vector A as t-+ to
(Fig. 8.66). Hence
( lim a(t) = A) ¢} ( lim I a(t) - A I = 0).
t---+ lo t---+ to
da(t) = hm
----- . -tia = hm a(t + tit) - a(t)
. --·-·-··-----.
dt .11-0 tit .11-0 tit
The vector function a(t) having a derivative at a point t is called
differentiable at t.
Now we shall investigate how the vector !; is directed. The point M 1
moves along the hodograph L towards the point M as tit ➔ 0 so that the
secant MM1 tends to the tangent to L at M. Hence, the derivative : is
If the functions cp(t), 1/;(t) and -y{t) have derivatives at a given point
t then every summand on the right has a limit as t:..t -+ 0 so that there exists
a limit of the left-hand side of the above expression, i.e., there exists the
derivative d~;t) . Evaluating the limit as t:..t -+ 0, we get
da
dt = -1
. t + J.R cos t + kh . ►
.R sm
1 (a(t), b(t)) = ( :: , b) + ( a, : ) .
. a unit
C oro II ary. If a ( t ) 1s . vector, I.e.,
. Ie (t ) I = 1, t h en de . perpend'Icu-
dt IS
lar to e.
Exercises 401
or
2 (!~, e) = 0.
6. y = (x 3 + 1) (s --;) ,
X
y' (1) =? 7. y = 3
~.
✓ 1+7
8. y .
= sm x - cos x. g. y = 1 Sill X . lO . y = Sill
· 2 x.
+ COS X
14. y = sin 1/x. 15. y = sin 5 2x. 16. y = sin (sin x). 17. y =x sin - 1 x.
.
18• y = X Sill X tan - 1
X.
19• y . - 1 -2 • 20•
= Sill y = tan - 1 X 2 •
X
In x
21. y = In 2 x. 22. y = x 2 Iog3 x. 23. y = 2 • 24. y = In tan x.
1+X
x + 2-t
= -I1-
3
25. y • 26. y = 9x, 27. y = xex. 28. y =- -- .
nx x
29. y = x3 - 3x_ 30. y = 10 3x + 1 • 31. Y = 5sinx. 32. y = sin (3x}.
26-9505
402 • 8. Differential Calculus. Functions of One Variable
33. y = sinh 3 x. 34. y = ✓ cosh x. 35. y = tanh (In x). 36. y = 3sinh 2 x.
58 _ f x = 2t - 1, 59 _ f x = a cos 3 t, 60 _ f x ee 2_1_',
lv = t3. lv = b sin 3 t. lv
2
Compute the second derivatives d { of the functions given parame-
dx
trically
61 • f x = I~ t, 62 _ f x = et c~s t,
l_y = t . l_y = et sin t.
Draw the graphs of the following functions
63. y = X - X •
3 x2 - 1
64. y = - - . 65. y = - - . 66. y =
x2 + 1 x3 - 4
•
2
X X X
-x e2(x+ 1) X - 1
67. y = xe . 68. y = 2(x + 1) . 69. y = 2 In -X- + 1.
Exercises 403
- 8 - x2 3 ,_____ 3 ...----
70. Y = ✓X 2 - 4
• 71. y = Vx(x - 1) 2 • 72. y = Vx(x - 2).
76• y -_ 1Ox + 10 ~
.1or x E [ - l, 21 •
2
X +2x+2
77. Expand the polynomial x 3 + 3x2 - 2x + 4 in powers of x + 1.
78. Expand the polynomial x 3 - 2x2 + 3x + 5 in powers of x - 2.
79. Use Taylor's formula to expand the function/(x) = 1/x in a neighbour-
hood of the point Xo = -1.
80. Use Maclaurin's formula to expand the function f(x) = x ex in a neigh-
bourhood of the point Xo = 0.
81. Use Maclaurin's formula to find the expansion of the function
f (x) = ex12 + 2 with the remainder o (xn).
82. The shape of a string hanging of its own weight alone is given by the
equation of the catenary y = a cosh x/a where a is a constant. Show that
given sufficiently small Ix I the string takes the shape of a parabola.
Use the derivatives of higher orders to investigate the shape for each
of the following functions in a neighbourhood of a given point xo.
83. y = sin 2 (x - 1) - x 2 + 2x, Xo = l.
84. y = cos x + cosh x, Xo = 0.
85. y = x 2 + 2 ln(x + 2), Xo = -1.
86. y = x 2 - 2ex - 1, Xo = 1.
Answers
4. Y' =- _l_ (1 +
2VX
!) .
X
S. Y' = 1 - x2 . 6. Y' (1)
(x2 + 1) 2
= 16. 7. Y' =- 2x
3(1 + x 2) 413
.
8. y' = cos x + sin x. 9. y' = 1 . 10. y' = sin 2x. 11. y' = tan4 x.
1 + cosx
1 1
12. y' = 5 c~s 5x. 13. y' = 6 cos (3x - 1). 14. y' = - - 2 cos-.
x X
26*
404 8. Differential Calculus. Functions of One Variable
X
15. y' = 10 sin 4 2x cos 2x. 16. y' = cos (sin x) cos x. 17. y' = sin - 1 x+ ---;:::====
✓ 1 - x2
. x sin x 2
18. y' = sm x tan - 1 x + x cos x tan - 1 x + - - . 19. y' = - --;:::::;===
1 + x2 Ix I ✓ x 2 - 4
20. y' = -~ . 21. y' = ~ In x. 22. y' = 2x log3 x + ~.
1 + x4 x In 3
1 + x2 - 2x In x ,
= -2- . 25. ,
= - -1- . 26. , _ nx
2
23. y' = -- --. 24. y y y - :, In 9.
x (1 + x 2)2 sin 2x x ln 2 x
, 2x(x In 2 - 1) + 2x 3 , 2 x
27. y' = ex(x + 1). 28. y = ----- - - - -- . 29. y = 3x - 3 In 3.
x2
30. y' = 3 · 103x + 1 In 10. 31. y' = 5sinx cos x In 5. 32. y' = 3x cos(3x) In 3.
• 2 h 34. y , sinh x 35. y , 1
33. y' = 3 smh x cos x. = ----;===. = --- --
2.Jcosh x x cosh 2 (In x)
38. y' .
= xsmx ( cos (In x) sin
+ -~- x) . 39. y' = 2x1 x -1 In x.
0
43. y' = 9x2 sin - i x. 44. y' = (sin - 1 x) 2 . 45. d y = - ✓ -- 2 tan x dx.
dx . 46. d•v
x5 cos 2 x
1
41. dy = 510 sin x cot x In 5 dx. 48. tan- 1 1.02 = 0.795. 49. y" = 6x. 50. y" (1) =
2
51. ym = 27a 3x In 3 a. 52. y'" = x 2(60 In X + 47). 53. y<n> = ex(x + n).
lj
X
X
lj
Fig. 8.70
406 8. Differential Calculus. Functions of One Variable
y
r
:r:
e21X+11
Y= 2(x+1)
0 X
Fig. 8.73
Exercises 407
lj
y=2ln x-1 +1
X
Fig. 8.74
-8-x 2
lj= '{xL,4
,4 X
I
I
I
I
I
Fig. 8.75
408 8. Differential Calculus. Functions of One Variable
!I
(1, 1)
Fig. 8.79
Fig. 8.76
(0, 2)
lj Fig. 8.80
( - 1, 1)
Fig. 8.77
Fig. 8.81
:r
(1,-1)
Corollary.
where all Ak are some constants, is called the linear combination of the
functions fk(x), k = 1, 2, ... , n. Therefore we can formulate this corollary
in the following way: the indefinite integral of a linear combination of any
finite number of functions is equal to the linear combination of the in-
definite integrals of these functions.
Properties (4) and (5) are sometimes called the linear properties of an
indefinite integral.
Some indefinite integrals involving elementary functions. Every formula
for a derivative of a given function, i.e., a formula of the form F' (x) = f(x)
can be represented in terms of the function f(x) and its antiderivative F(x)
as J f(x) dx = F(x) + C where C is an arbitrary constant. This enables
us to derive the integration formulas directly implied by the differentiation
formulas for basic elementary functions. These are:
Jraxdx = ax
In a + C: 0 < a ;t= 1.
J cosxdx = sinx + C
Jf dx
sin2 x = -cot x + C: x ;t= n1r (n = 0, ± 1, ±2, ... ).
Jf dx 1r
cos 2 x = tan x + C, x ;t= 2 + n1r (n = 0, ± 1, ± 2, ... ).
9.1 Basic Concepts and Definitions 413
dx
= In I x + ✓ x2 ± a 2 I +C (the "minus" sign re-
J ✓x2 ± a2
quires the condition I x I > I a I be fulfilled).
J .1-:X_xr = tan- 1 x + C
Jsinhxdx = coshx + C
Jcoshxdx = sinh x + C
rJ ___coshp_x___x = tanh x + C
2
rJ -sinh
.cl_x_ - = - coth x + C x ~ 0.
2 x '
cosx2dx, dx
-1-,0<x~l.
J J nx
414 9. Integral Calculus. The Indefinite Integral
◄ We have
◄ We have
◄ To prove the above identity we differentiate the left- and right-hand in-
tegrals with respect to x. On differentiating the left-hand integral, we get
(.\f(x) dx): = f(x).
Observe that t = i/;(x) is the inverse function of x so that
tX,_
-
1 _
--- -
1
-----
X/ cp (t) ' I
<P'(t)
Since the left- and right-hand integrals have the same derivative these
integrals define the same set of antiderivatives of the function/(x). Whence
it follows that the desired identity is true. ►
The indentity being considered lays down the method of integration
called integration by substitution. When using this method we have to
choose a suitable function cp(t) to simplify the computation of the original
integral.
Examples. Compute the following integrals.
r
J
✓ dx
x2 + a2
= In (x + ✓x 2 + a2 ) + C:
where C =C- In a. ►
(2) f dx,---- .
J (x + 2)✓x + 1
◄ Put x = t 2 - 1 so that dx = 2t dt and t = ✓ x + 1 . Then
rJ (x + 2)✓x
dx,----
+ 1
= 2 tan - 1 ✓ X + 1 + C. ►
exdx
✓ex+ 1
= 2) (t 2 - 1) dt
27-9505
418 9. Integral Calculus. The Indefinite Integral
pn x dx = x In x - )dx = x In x - x + C = x (In x - 1) + C. ►
j -Ja 2 - x2 dx = ~ + j + 2C.
= X ✓ a2 - x2 + a2 rJ--✓ dx_
a2 - x2
. - rJ✓ a 2 - x 2 dx + 2C
=x -J a 2 - x 2 + a 2 sin - 1 ~ - j -J a 2 - x2 dx + 2C.
We have obtained the equation in one unknown ) ✓ a 2 - x 2 dx. Solving
this equation, we get
r
J -J a 2 - i2 dx = ~ x -J a 2 - x2 +
- i-sin -
2
1 ; + C. ►
(a) j -J x2 + a 2 dx = ~ x -J x 2 + a 2 + ~2 - In (x + -J x 2 + a 2 ) + C;
(b) j -J x2 - a2 dx = ~ x -J x 2 - a2 - ~ In I x + -Ji2 - a2 I + C,
where Ix I > Ia I .
9.2 Methods of Integration 419
Then
Jeaxcos {3x dx = ~ e•xsin {3x - ; J e•xsin {3x dx.
Integrating the latter integral by parts gives: u = eO'x and du = sin {3x dx
cos {3x
so that du = aeaxdx and v = - {3 and
JeQX sin {3x dx = - ~ eax cos {3x + ~ Je•x cos /3X dx.
Substituting this identity into the preceding one, we obtain
( 1 +
a,2 )
{32 Jreax cos {3x dx = 7J2
eCiX
(a cos {3x + {3 sin {3x)
'27*
420 9. Integral Calculus. The Indefinite Integral
and
j eax cos (3x dx =
eax
~-~ (a
0'.2 + (32
cos (3x + (3 sin (3x) + C.
+ (3
2 (a sin (3x ·- (3 cos (3x) + C. ►
= Qn + 1(x) In X - Hn + 1(x) + C,
where Hn + 1(x) is a polynomial of degree (n + 1).
Example. Compute the integral ) (4x 3 + 2x) In x dx.
◄ Put u = In x and du = (4x 3 + 2x) dx so that du = ~ and u = i(4x 3 +
2x) dx = x 4 + x 2 • Then integration by parts gives
) (4x 3 + 2x) In x dx = (x 4 + x 2 ) ln x - ) (x 3 + x) dx
= (x4 + x 2 ) In x - x4 - x2 + C.
4 2
►
Integrals of the form _\P,, (x) tan - 1 ax dx and IPn(x)cot - 1 ax dx, where a
is a real number. These integrals are reduced to those of rational functions.
For example, if we put u = tan - 1 ax and du = Pn(x) dx in the former in-
tegral then du = ad~ 2 and u = Qn + ~ (x) and integration by parts
1 + ex X
gives
lPn(X) tan-
J
I axdx = Qn+ 1(x) tan- 1ax - a rJ 1Qn+ t~Xt
+ ex X
dx.
r + C. ►
Integrals of the form jPn(X) sin - 1 ax dx and) Pn(x) cos - 1 ax dx, where
a is a real number. To integrate these integrals by parts we put u = sin - 1 ax
(u = cos - 1 ax in the latter integral) and dv = Pn(X) dx. Then
( du = - ✓ adx
2 2
) ' V = Qn + I (x)
1- a x
and
rPn(X) sin- 1axdx = Qn+ 1(X) sin- 1ax - r O'. Qn+ i(X) dx.
J J ✓ 1 - a2X2
The integral in the right-hand side can be_ computed by using different
methods which will be discussed in detail in Sec. 9.4.
Example. Integrate by parts 2 sin - 1 x dx. Ix
dx x3
◄ Put u = sin - 1 x and du = x 2 dx. Then du = - , = = ~ and v = - 3- .
2 ✓1 - x
Integrating by parts, we get
J sm
x2 • - 1
x dx
3
= -x3- sm
. - 1
x - -31 J x 3 dx
✓ 1 - x2
By the substitution t = ✓ 1 - x 2 so that x 2 = 1 - t 2 and x dx = -t dt
the integral on the right is reduced to
r x3 dx = r x2 X dx = - r1 - t 2 t dt
J ✓1 - x2 J✓ 1- x2 J t
Finally we have
Jx sin -
2 1 x dx = ~3 sin - 1 x + ! (x" + 2)✓ I - x2 . ►
so that
[ AX 1 AX 1
JPn(x) e dx =" Pn(X) e - "
where P~(x) is a polynomial of degree (n - 1).
Then applying integration by parts to the right-hand integral we again
obtain the expression which can also be subjected to integration by parts.
Continuing this process n times we finally arrive at the integral
Je"' dx = ~ e"' + C.
Example. Compute the integral )(x 2 + 2x) exdx.
◄ Put u = x 2 + 2x and dv = exdx so that du = 2(x + 1) dx and v = ex.
Then integration by parts gives
)(x 2 + 2x) exdx = (x 2 + 2x) ex - 2)(X + 1) exdx.
To integrate the right-hand integral by parts we put u = x + 1 and
dv = exdx. Then du = dx and v = ex so that
)(X + l)exdx = (x + l)ex_ )exdx = (x + l)ex-ex + C = xex + C.
Finally, we have
)(X 2 + 2x) exdx = (x 2 + 2x) ex - 2xex + C = x 2 ex + C. ►
Integrals of this form can also be computed by the method of compar-
ing (unknown) coefficients. In this case we assume that the integral is equal
to the product of the nth degree polynomial
Qn(X) = bo + b1X + ... + bnXn
with unknown coefficients bo, b1, ... , bn by the function eAX, i.e., we assume
that
)Pn(X) eAXdx = Qn(X) eAX.
On differentiating the integral we get
Pn(X) eAX = Q~(x) eAX + AQn(X) eAX
whence, dividing by e>.x ~ 0, we get
Pn(X) = Q;(x) + A Qn(X).
Since the polynomial on the left is to be identically equal to the sum
of polynomials on the right, the coefficients of any power of x must be
equal; so comparing coefficients, we obtain the system of (n + 1) linear
equations in n unknowns bk. This system has a unique solution, for the
respective determinant is distinct from zero. To demonstrate how this
method works we shall compute J(x2 + 2x) exdx.
9.2 Methods of Integration 423
◄ Put
I(x 2 + 2x) exdx = (bo + b1x + b2x2) ex,
where bo, b1, bi are unknown coefficients.
On differentiating this identity, we get
(x 2 + 2x) ex = (b1 + 2b2x) ex + (bo + b1x + b2x 2) ex.
Whence, division by ex :;r: 0 gives
2.\" -+ x 2 = bo + b1x + b2x + b1 + 2b2x
or
2x + x 2 = (bo + bi) + (b1 + 2b2) x + b2x 2.
Comparing the coefficients of equal powers of x, we arrive at the system
of linear equations
x0 = bo + b1 ] 0
x 1 = b1 + 2b2 '2
x 2 = b2 1
whose solution is bo = 0, b1 = 0 and b2 = 1.
Hence the original integral becomes
j(X2 + 2x) exdx = x 2ex + C ►
Integrals of the form JPn(x) sin {3x dx and JPn(x) cos {3x dx, where {3
is a real number, {3 :;r: 0.
◄ Putting u = Pn(x) and du = sin {3x dx (du = cos {3x dx in the latter in-
Jrsin· {3x dx = -
COS
{3
{3X
and Jrcos {3x dx = Sill
{3
{3X
. ►
Integration by parts can also be helpful in dealing with some other in-
tegrals apart from those considered above. By way of illustration we shall
.
compute t he 1ntegra I Jr X. dx
2
sm x dx
◄ Putting u = x and du = - ~ - , we obtain du = dx and v =
sin 2 x
r dx - cot x so that
J sin 2 x -
rJ _xsind!__x = -
2
X cot X + rJcot X dx.
r .sm~ ~xx
J
= - X cot X + In I sin X I+ C. ►
where the coefficients ao, a1, ... , an are real numbers and a0 :;±. 0.
The polynomial Qn(x) is said to be monic if ao = 1.
The real number b is called the root of the polynomial Qn(X) if
Qn(b) = 0.
It is known from algebra that any real polynomial Qn(X) can be uniquely
factorized into a product of monic linear polynomial x - b and monic
quadratic polynomials x 2 + px + q, where p and q are real coefficients and
every monic quadratic polynomial is irreducible to a product of linear poly-
nomials, for it has no real roots. Then gathering equal factors if any, we
can write the monic polynomial Qn(X) in the form
where Rm_ n(x) and P(x) are real polynomials and P(x) 1s a proper ra-
Qn(X)
tional function. For example, if we apply the division algorithm to the ra-
. 1 function
. xs + 1 Ps (x) b .
tiona 2 - - - , we o ta1n
X + 1 Q2(X)
X3 - X
x2 + 1 )x5 + Ox4 + Ox 3 + Ox 2 + Ox + 1
xs + Ox4 + 1x 3
1x3 _+_0_x~2 -+-Ox_+_l
lx 3 - Ox 2 - lx
X + 1.
Hence, XS + 1 = X3 _ X + X + 1 so that R3(x) = x 3 - x, P(x) =
x2 + 1 x2 + 1
x + 1 1s
. a proper rat1ona
. l function.
x + 1 and 2
X + 1
Splitting a rational function into partial fractions. Partial fractions are
proper rational functions of the form
A A Mx + N and Mx + n
x - a ' (x - a )k ' x2 + px + q (x + px + qt
2
where Ai, A2, ... , A(\', Bi, B2, ... , Bf3, ... , Mi, Ni, M2, N2, ... , Mµ,,,
Nµs are real numbers not all equal to zero.
To define the coefficients in the numerators of the partial fractions we
multiply the left- and right-hand sides of the preceding expression by Qn(x)
and apply the method of comparing (unknown) coefficients, i.e., we com-
pare the coefficients of equal powers of x on the left and on the right thus
obtaining a system of linear equations in the desired unknowns. The solu-
tion of this system uniquely defines the coefficients we are seeking for.
Sometimes it is convenient to use another method of computing coeffi-
cients of partial fractions. If two polynomials are identical the identity
holds for any value of x; then multiplying both sides of the expansion given
in Theorem 9.3 by Qn(X) and substituting some specific values for x into
the identity thus obtained it is possible to get simple equations for the
unknown coefficients. The method is often useful when Qn(X) has only
simple real roots, substituting the values of the roots for x we arrive at
simple equations for the unknown coefficients.
To illustrate these methods we use some specific examples.
. 3x 2 - 6x + 2
Examples. (1) Split the proper rational funct10n 3 2 into
x - 3x + 2x
partial fractions.
◄ On factoring the denominator, we get
A+B+C=3 ]
- 3A - 2B - C = - 6 ,
2A = 2
whose solution yields A = 1, B = 1 and C = 1.
We obtain the same result by substituting the values of the roots of
the _denominator for x in ( *). Indeed, the roots are X1 = 0, X2 = 1 and
X3 = 2. Then the identity yields
= 2A
2 and A = 1 for x = 0,
- 1 = -B and B = 1 for x = 1,
2 = 2C and C = I for x = 2.
Hence
3x 2 - 6x + 2 = 1 + 1 1
x3 - 3x2 + 2x X X - 1 + X - 2 ►.
. t h e proper rat10na
(2) Sp 11t . I f unction
. s x3 +4 3x +3 1 2 into par-
x + 3x + 3x + x
tial fractions.
◄ The denominator can be factorized as
xs + 3x4 + 3x3 + x 2 = x 2 (x 3 + 3x 2 + 3x + 1) = (x + 1) 3x 2 •
Thus it has the repeated root x1 = 0 of multiplicity 2 and the repeated
root x2 = -1 of multiplicity 3. Then
_ _~~-.± 3x__±_
_!__ ____ _ -~t_ + A2 + B1 + B2 + B3
xs + 3x + 3x + x
4 3 2 x x 2 x + 1 (x + 1)2 (x + 1)3
Multiplying out, as before, we get
x 3 + 3x + 1 = A1x(x + 1)3 + A2(x + 1) 3 + B1x 2(x + 1)2
+ B2x 2(x + 1) + B3x2 ( **)
or
x 3 + 3x + 1 = (A1 + Bi)x4 + (3Ai + A2 + 2Bi + B2)x3
+ (3Ai + 3A2 + Bi + B2 + B3)X2 + (Ai + 3A2)x + A2.
The method of comparing coefficients gives
x4 A1 + Bi = 0
x3 3Ai + A2 + 2Bi + B2 = 1
x2 3A i + 3A2 + Bi + B2 + B3 = 0
Xi A I + 3A2 = 3 .
x0 A2 = 1
428 9. Integral Calculus. The Indefinite Integral
so that A1 = 0, A2 = l, B1 = 0, B2 = 0, B3 = - 3 and
x 3 + 3x + l 1 3
x5 + 3x4 + 3x 3 + x2 - x2 (x + 1) 3 •
(3) Split the proper rational function _.x__3_~ x +2 1 into partial frac-
2
(x + 1)
tions.
◄ The denominator has no real roots since x 2 + l does not vanish for
any value of x. Then there must hold
x 3 + x 2- +~--
---~- M1 x +--------
1 - ---- N1 + M2x + N2
(x 2 + 1)2 x2 + 1 (x 2 + 1)2
Whence
x3 + x2 + l = (Mix+ Ni)(x 2 + 1) + M2x + N2
or
x3 + x2 + 1= M1x 3 + N,x 2 + (Mi + M2)x + (Ni + N2).
Comparing the coefficients of equal powers of x, we get
Mi = 1, N1 = l, M1 + M2 = 0, Ni + N2 = 1.
Whence
Mi = 1, Ni = l, M2 = - 1, N2 = 0
9.3 Integrating Rational Functions 429
and, consequently,
x3 + x 2 + 1 + 1
X X
(x2 + 1)2 x + 1
2 (x2 + 1)2 . ►
It is worth noting that sometimes it can be easier to arrive at the desired
result without applying the method of comparing coefficients. For instance,
in example (2) after a little algebra we obtain
x 3 + 3x + 1 (x3 + 3x2 + 3x + 1) - 3x2
x 5 + 3x4 + 3x3 + x 2 x 2 (x + 1)3
(x + 1)3 - 3x 2 1 3
x 2 (x + 1) 3 - x2 (x + 1)3 .
Integrating partial fractions. A rational function can be uniquely
represented in the form of a sum of a polynomial (a zero polynomial in
the case of a proper rational function) and a proper rational function which
can be splitted into partial fractions. Since any real polynomial can be easily
computed by applying standard integration formulas for elementary func-
tions, integration of a rational function becomes integration of a sum of par-
tial fractions. So now we turn our attention to techniques of integrating
partial fractions.
Using standard integration formulas we can easily integrate partial frac-
tions whose denominators are monic linear polynomials or power functions
of them, namely
f A dx =A f d(x - a) =A I
In x - a I + C.
J x-a J x-a
f A dx =A f(x-a)-kd(x-a) = 1~ k (x - a)-k+ 1 + C
J (x - al J
A
------k~-I + C.
(1 - k )(x - a) -
Mx+N
To integrate a partial fraction of the form ~ - - - - we apply the
x 2 +px+q
method of completing the square in the denominator. This yields
x2 + px + q = [x2 + 2x i + (iYJ + q - ~y
- (x + iY + ( q - ~) ·
Since the second summand is positive we set it equal to a2 where
) 2
Mx + N dx -
_ ) M(t - i) 2 2
+N
dt
x + px + q t + a
_ M
- 2
f 2t dt
Jt + a 2 2
+ (N _ Jf __t +
Mp_)
2 2
d!____
a2
=M \ d(t 2 + a 2) + (N __
AfP) f dt __
2 J t + a
2 2 J t +a 2 2
= M ln(t 2 + a 2) +
2
(N - a !a Mp_)! tan -
2
1 + C
=M
2 ln(x
2 + px + q) + -~~JL tan - 1 /x + p___ + C
✓ 4q - p2 -v 4q - p2
To illustrate the method of integrating partial fractions of this form
we work through 2 2 - X
X + 4x + 6
r.
J
dx.
x 2 + 4x + 6 = (x 2 + 4x + 4) + 2 = (x + 2)2 + 2.
The substitution x + 2 = t yields dx = dt, x = t - 2 and
x2 + 4x + 6 = t + 2. (Observe that here a = 2.) Then
2 2
2 - X = ) - (t-
-2 ~ - 2)
--
) ~ - - - - dx dt
x 2 + 4x + 6 t2 + 2
_ 4 rdt _ 1 2r dt . r
- J t +2
2 2 J7+2-
= ~ tan - I ~ -1 ln(l 2 + 2) + C
= ·"'
tan -1 X +2 -
Y2 1 2
2v~ 2 ln(x + 4x + 6) + C. ►
_- -M
2
J(t 2 + a 2 ) - k d(t 2 + a 2 ) + ( Mp
N - -
2
-) J- -+dt-a2t
(t2
--
_
-
M
2(1 - k)(t2 + a2l- 1
+ (N _-2-
Mp ) f
J (t2 +
dt
a2l
or
Thus we have arrived at the recurrence relation which yields the value
of lk for any k (k = 2, 3, ... ). Indeed, the integral 11 is easily computed
by using a standard integration formula, i.e.,
11 = Jt 2
dt
+a
1
2 = - tan
a
-1 t
- + C.
a
For k = 2 the recurrence relation gives
12 =
Jf (t2 +dta2)2 = t
2a2(t2 + a2)
+ _fl_
2a2
t + 1 tan - 1 ~ + C.
2a 2(t 2 + a 2) ~ a
432 9. Integral Calculus. The Indefinite Integral
f x + 1 dx = _ 1 + 3t
J (x 2 - 4x + 5) 2 2(t 2 + l) 2(t 2 + 1)
+ -3 tan - 1 t + C = 3t - 1
------=----- + -32 tan - 1 t + C.
2 2(t 2 + 1)
Finally, the substitution x =t+ 2 gives-
28-9505
434 9. Integral Calculus. The Indefinite Integral
and
J
x 3 dx
(x- l)(x2 - 4)
=
1
x-1 J(1-!.
+ 2. 1
x-2 3
2
x!2 )dx
= x - 31 In I x - 1 I + 2 In I x - 2 I - 32 In I x + 2 I + C. ►
+ 1
(2) Jx X
4
2
- X
3 dx.
x 2 + 1 = A 1 + A2 _+ A3 + B
x4 - x 3 x x2 x3 x - I
Multiplying out and dividing by the denominator of the integrand, we
arrive at
or
x 2 + 1 = (A1 + B)x 3 + (-A1 + A2)x2 + (-A2 + A3)X -A3.
Comparing the coefficients of x of equal powers, we get A 1 + B = 0,
-A1 + A2 = 1, -A2 + A3 = 0 and -A3 = 1. Whence A1 = -2,
A2 = - 1, A3 = - 1 and B = 2. Then
x2 + 1 _l __1___1_ + 2
x4 - x3 x x2 x3 x - I
and
x
-
2
_l +
x 3
2 )dx
x - 1
- 2 In I x I + l +
X
----4-
2X
+ 2 In I x - 1 I + C. ►
X3 - X
(3) J (x 2
+ 1)
2 dx.
◄ Since the denominator has no real roots the integrand can be written as
x3 - X Mix + N1 + M2x + N2
(x 2 + 1)2 - x2 + 1 (x 2 + 1)2
Then
x3 - x = (M1x + Ni)(x2 + 1) + M2x + N2
9.4 Integrals Involving Irrational Functions 435
-- - - - - - - - - - - - - - - - - - - - - - · - · ·
or
x3 - x = M1x 3 + N1x 2 + (M1 + M2)x + (N1 + N2).
Comparing the coefficients, we have
M1 = 1, N1 = 0, M1 + M2 = - 1, N1 + N2 =0
so that
and
r
- Jl [ x
X3 - X 2x
dx - x - ] dx
J (x 2 + 1)2 2 + 1 (x 2 + 1)2
= 21 ln(x2 + 1) + 21 +C
X + 1
Observe that the integrand can easily be splitted into partial fractions
by using simple algebra as
x3 - x (x 3 + x) - 2x _ x(x 2 + 1) - 2x
(x 2 + 1)2 (x 2 + 1) 2 (x 2 + 1)2
X 2x
x2 + 1 (x2 + 1)2 . ►
where Pm(U1, u2, ... , Uk) and Qn(U1, u2, ... , Uk) are the mth and nth
degree polynomials in u1, u2, ... , Uk, respectively. Then R(u1, u2, ... ,
Uk) is a rational function in u1, u2, ... , uk; otherwise, it is irrational. For
example, a quadratic polynomial in variables Ui and u2 takes the form
P1 (u1, u2) = Aoo + A10U1 + Ao1U2 + A20UI + A11u1u2 + Ao2uI where the
coefficients Aoo, A10, Ao1, A20, Au, Ao2 are real numbers and Alo+
Ari + A52 ~ 0.
2' 3
It is easy to observe thatf(x, y) = x + 2{ 2 + xy is a rational function
x+xy +1
in variables x and y since this function is a ratio of the third degree poly-
nomial P3 (X. y) = x 2 + 2y 3 + xy and the fifth degree polynomial
✓x2 - 2xy + 3
Qs(x, y) = x + x y + 1, while/(x, y) = - - - - - - i s an irrational
3 2
x+y
function.
28*
436 9. Integral Calculus. The Indefinite Integral
Suppose that variables u1, u2, ... , Uk are some functions of a variable
x, i.e., u1 = f1 (x), Uz = fz(x), ... , Uk = fk(X). Then the function R l/1 (x),
f2(x), ... ,fk(X)] is a rational function in functionsf1(x),f2(x), .. .,fk(X).
x 2 + ✓x 2 + x + l
For example, f(x) = ✓ is a rational function in
2
x + l + 3 x + x + 1
x and in ✓x 2 +x+l so that f(x) = R(x, ✓x 2 +x+l ). It is worth noting
that f(x) = In x + : ~ is an irrational function in x and in ✓x 2 + 1
2 + Sill X
while it is a rational function in functions In x, e✓ x 2 + 1 and sin x 2 so that
f(x) = R(lnx, e✓x 2 + 1 , sinx2 ).
It is not hard to notice that sometimes integrals involving irrational
functions do not admit representations iq elementary functions. For exam-
ple, the integrals
called the elliptic integrals of the first and second kinds, respectively, can
not be expressed in elementary functions. However, by suitable substitutions
it is often possible to reduce integrals involving irrational functions to those
of rational functions. Below we shall consider techniques applicable in deal-
ing with integrals of some specific forms.
. a rationa
Is . I function
. In. x an d y = ax + db an d t h e coe ff'1c1ents
m
. a, b, c
ex+
and dare real numbers such that ad - be :;r: O; we leave aside the case when
a b ax+ b a b .
ad - be = 0, for then - = d and d = - = d so that the Integrand
c ex+ c
becomes a rational function only in x and its integrals have already been
studied in detail.
The substitution
ax+ b
c~ +d
yields
tm = ax+ b
ex+ d '
(ex+ d)tm =ax+ b anrl ax - cxtm = dtm - b
9.4 Integrals Involving________________
----~-------::. Irrational Functions _ 43 7
so that
is a rational function in t.
On differentiating x with respect to t, we have
dmtm - 1 (a - ctm) + cmtm - 1 (dtm - b)
dx = - - - - - - - - ~ - - - - - dt
(a - ctm)2
and further
dx = (ad - bc)mt:- 1 dt.
(a - ctm)
Then
m
ax+ bd )dX
ex+
= fR ( dtm - b
J a - ctm
JfR (x, m ax + : ) dx = F
ex+
(m ax +
ex+
!) + C.
◄
(1) 1J;; +; (2x ~ 3) 2 •
Put I = ~. 3
Then 2x - 3 = 2xl4 + 31 4 , X=-
2
1 + t4
1 - t4
-
= 3lf4 1t s + C = -Ts1(-~)
Jt dt = 15 ~- 2x+3
5
+ C. ►
X = t 12 , Vx = t 'Vi=
3, t4, {x = t 6 and dx = l2t 11 dt. Hence
r dx r 12t 11 dt r t 4 dt
J TxcVx +--v'x)- = J t3(t4 + t6)- = 12 J t+--i2-
= 12 rJ u 1 +l) t+ 1 dt = 12 Jr(t
4
-
2
2 - 1 + ___l__ ) dt
1 + t2
= 12 ( 1; - I+ tan - i 1) + C
We also have
✓
ax 2 + bx + c = t - \/ax = t - va -~--
!2 -
2\/a t + b
C
-
vat 2+ bt + cva
2\/at + b
Thus x, dx and ✓ ax 2 + bx + c are all rational functions in t and we
can write
where
Ri (t) =R( !2 - , \/at 2 + bt + c\/a) 2 \/at 2 + bt + cva
c
2vat + b 2\/at + b (2vat + b)2
is a rational function in t.
Observe that the substitution t = ✓ ax 2 + bx + c - Yax also reduces
the original integral to that involving rational functions.
Example. Compute the integral
[ dx
J ✓x 2 + c/
,---- t2 2
◄ Since a = 1 > 0 we put t = ✓ x 2 + cl + x. Then x = - a:
2t
2 2 !2 + 2
dx = t +
2t
/x
dt and ✓ x 2 + c/ = t -
!2 2
- a: -
2t 2t
a: so that
= [
J !2
2t
+ a:2
t 2 + c/ dt =
2!2
r dtt
J
= In I t I + C
= ln I x + ✓ x 2 + a. 2 I+C
Noting that x + ✓ x 2 + a. 2 > 0 for all x, we finally arrive at
r ✓x2dx+
J a.2
= In I (x + ✓x:2 + cl) I +C ►
Problem. Prove that
J✓ dx
X2 - CY.
2
= In Ix + ✓x 2 - a2 I + c; Ix I > I "' I ·
440 9. Integral Calculus. The Indefinite Integral
rJ (x -
dx
2)✓ 1 - x 2 '
lxl<L
Then
1
- t2
2
1 - x 2 = (1 + x)2 t 2 , 1 - x = (1 + x)t 2 , x = ,
1+ t
x- 2 =_3! 2 +1 ✓ i-x2 = 2t dx=- 4tdt
f2 + 1 ' 1 + t2 ' (1 + 12 )2
9.4 Integrals Involving Irrational Functions 441
and
= r (t 2 + l)(t 2 + 1)4t dt = 2 r dt
J (3t 2 + 1)2t(t 2 + 1)2 J 3t 2 + 1
ax2 + bx + c = a ( x 2 + 2x !a + : )
2 2
b b ) +c- - -
=a [ ( x 2 +2x--+-- b -]
2a 4a2 a 4a 2
= a[ (x + ty + 4a~;, b 2
]
=a (x + _!!_)2
2a
+ 4ac - b2
4a
=a (x + _!!_)2
2a
+ /JJ
'
4ac - b 2
where p = 4a
442 9. Integral Calculus. The Indefinite Integral
f dx f dt
J ✓ ax2 + bx + c - J ✓ at 2 + p '
where a and p are of opposite signs or both positive.
When a > 0 and p > 0 and when a < 0 and p < 0 the integral is ex-
pressed by a logarithmic function; when a < 0 and p > 0 the integral is
expressed by an arcsine as the following two examples illustrate.
Examples. (1) f✓ dx .
J x2 - 4x + 5
◄ Since x 2 - 4x + 5 = (x - 2) 2 + 1 then the substitution x - 2 = t gives
dx = dt and
f dx f dx f dt
J ✓ x 2 ...: 4x + 5 - J ✓ (x - 2) 2 + 1 - J ✓t2 + 1
= In (t + ✓ t 2 + 1 ) + C = In (x - 2 + ✓x 2 - 4x + 5 ) + C ►
(2) J dx
✓
6x - x 2
.
◄ Since 6x - x 2 = - [(x2 - 6x + 9) - 9) = 9 - (x - 3) 2 the substitution
x - 3 = t yields dx = dt and
J ✓6x-x
dx
----.===- =
2
J dt
----.===- . - 1 -t
= Sin + 0 · = SI•n- 1 - 3 + C. ►
x--
✓9-t 2 3 3
(b) Integrals of the form J ✓
Mx + N
ax 2 +bx+ c
.
dx, a~ 0. Noting that
J ✓x+l
-----,=====- dx = - -
l J -2x~2
--.====- dx = --
l J (6-2.x)- 8 dx
--;:===--
6x-x 2 2 ✓ 6x - x2 2 ✓ 6x - x 2
=4r dx _ _!__ r d(6x-x 2 ) =4sin-l x-3 - ✓6x2-x2 +C. ►
J ✓ 6x - x 1 2 J
✓ 6x - x2 3
(c) Integrals of the form f ✓ P.(x) dx where P.(x) is a poly-
J ax +bx+ c
2
+A f dx
n J ✓ ax 2 + bx + c '
where Qn _ 1 (x) is a polynomial of degree (n - l) with unknown coefficients
Ao' Al' ... , An - l' i.e.,
Qn - l (x) = Ao + Aix + ... + An - 1Xn - 1•
To compute the values of the unknown coefficients we differentiate the
above identity with respect to x so that
Pn(X) = Q~ - 1 (x)✓ ax2 + bx + c
✓ax 2 +bx+ c
+Qn- 1(X) ax+ b/2 + An
✓ ax1 + bx + c ✓ ax2 + bx + c
Multiplying throughout by ✓ ax 2 + bx + c , we get the identity
Whence
Ao = - 23 , A 1 = 21 , A2 = 21 .
Computing the integral in the right-hand side of ( **), we have
f dx · _ f d(x + 1)
J ✓x2 + 1x + 2 - J ✓ (x + 1)2 + 1
= In(x + I + ✓x 2 + 2x + 2 + C
Then the original integral becomes
f x2dx = X - 3 ✓x2 + 2x + 2
J ✓x2 +2x+2 2
+ ~ ln(x + 1 + ✓x2 + 2x + 2 ) + C ►
9.5 Integrals Involving Trigonometric Functions 445
COS X
cos 2X · 2X
- - sm
= - - 2- - - -
-
2 1 - tan 2 i 1 - t2
X
• cos 2 2 · 2X
+ sin 1 + tan 2 ; 1 + t2 '
2
x = 2 tan - 1 t, dx = 2dt 2
1+t
and
JR(sinx, cosx)dx = JRC !\, , :: ~:) //~ 2 = JR,(t)dt,
where R1 (t) is a rational function in t.
Example. Compute the integral f dx
J Sill X
◄ The substitution tan; = t yields
2dt
f dx f I + t2
J sinx J 2t
1+ (2
(a) Integrals of the form IR(sin x) cos x dx. Here the substitution
sin x = t gives cos x dx = dt so that the original integral becomes iR(t) dt.
For example, if we wish to compute the integral
\ cosxdx
J 4 + sin 2x '
then the substitution sin x = t yields dt= cos x dx and
cosxdx = \ dt =_! t i_+C=_!_ t (_!_ · ) +C ►
J\ 4+sin x J 4+t
2 2 2 an
-1
2 2 an
-1
2 sinx ·
(b) Integrals of the form )R(cos x) sin x dx. The substitution cos x = t
yields sin x dx = - dt so that the original integral becomes - IR(t) dt. For
example, using this substitution we can easily compute the following in-
tegral
\ sin x dx = _ \ dt = _ \ d(2 + t)
J 2 + COS X J2+t J 2+t
-ln(2 + t) + C = -ln(2 + cosx) + C.
(c) Integrals of the form )R (sin x, cos x) dx where the integrand R (sin x,
cos x) involves only even powers of sin x and cos x. Then the substi-
tution tan x = t gives x = tan - 1 t and dx = dt 2 so that sin 2 x and
1+ t
cos x being rational functions in tan x become rational functions in t. In-
2
deed, we have
sin 2 x tan 2 x t2
sin 2 x = -
cos 2 x + sin 2 x 1 + tan 2 x 1 + t2 '
cos 2 x 1 1
cos 2 x =
cos 2 x + sin 2 x 1 + tan 2 x 1 + ,2 '
so that
\ dx
J sin2 x + 4 cos2 x + 2 ·
9.5 Integrals Involving Trigonometric Functions 447
dt t2
◄ Put tanx = t. Then dx = sin 2 x = and
1 + t2 ' 1+ t2
1
cos 2 x = so that
1 + t2
) dx
sin x + 4 cos 2 x + 2
2 ) t2
+
1
2 + 2
4
dt
1 + t2
1 + t2 1+ t
_ 1
- 3 Jf t2+2
dt _
-
1 t -
3-fi an
1 _t_
Y2
+C= 1
3Y2 an
t - 1( tan x )
Y3
C ►
+ .
Integrals of the form Jsina x cosa x dx where a and (3 are real numbers.
We consider two cases when the integral admits a representation in ra-
tional functions.
(a) Let either a or (3 be an odd positive number. For definiteness we
put (3 = 2k + 1 where k > 0 is an integer. Then a is any real number. Using
the identity cos 2 x + sin 2 x = 1, we have
= f (1 2 - 2t 4 + t 6 ) dt =! t 3. - ~ t 5 + _!._ t 1 + C
J 3 5 7
= ]l .3
Sill X - S2Sill
.s l
X + ""j
-1
Sill X + C. ►
448 9. Integral Calculus. The Indefinite Integral
sin 3 x
(2) ) 2 dx.
COS X
◄ We have
r sin 3
X dx = r sin 2
X sinxdx = rl - cos 2 X sinxdx.
J cos 2 x J cos 2 x J cos 2 x
Put cos x = t. Then sin x dx = - dt and
) 1 - cos x sinxdx= ) t 12
2 2 - 1
dt=
cos 2 x
= t + -l + C = cosx + -l- + C. ►
t cosx
(3) r cos3 X dx.
J ✓sin x
◄ We have
=) c- ~os2xr (1 + ~os2x)"dx
= 2}+ n ) (I - cos 2xr(I + cos 2x}" dx.
Applying the binomial theorem to the factors (1 - cos 2x)m and
(1 + cos 2xt and multiplying the polynomials thus obtained we arrive at
9.5 Integrals Involving Trigonometric Functions 449
the integrand involving odd and even powers of cos 2x. The terms involving
odd powers of cos 2x are easily integrable as we have discussed in the
preceding section. To integrate the terms involving even powers of cos 2x
we apply to them the identity (*). This yields the terms involving powers
of cos 4x. Continuing this process we finally arrive at integrals of the form
Jcos kx dx where k > 0 is a positive number. These integr,als can be com-
puted without difficulty.
Suppose now that m = n. Then applying the identity
.
Sill x cos x =
1 sm
. 2x
2
we obtain
) sin 2 " x cos 2 " x dx = ) (sin x cos x )2" dx = ) (~ sin 2x Y" dx
JfSill
· 2 X COS4 X dX = Jf 1 - cos 2x ( 1 + cos 2x ) 2 d
2 2 X
= g1 (12 X - 1 Sill
S . 4X + 1 Sill
6 . 3 2x) + c. ►
(2) Jsin 2 x cos 2 x dx.
◄ We have
29-9505
450 9. Integral Calculus. The Indefinite Integral
Integrals of the form Jsin ax cos {3x d'x, Jcos ax cos {3x dx,
Jsin ax sin {3x dx where a :;r: (3. To compute these integrals it is helpful to
use the following trigonometric identities
Exercises
Apply the standard integration formulas to compute the following
integrals
1. fx2
Txdx. 2. f 1/.7 . f✓~ 3. dx. 4. f(vx + Jx )'ttx.
5. f
z<gx dx. 6. f 6' dx.
32x
f+ 7. 2x
Hf
5x
dx. 8. f cos2x d
cos 2 x sin 2 x x.
9. f sin2x
sinx cos 3 x
d x. lO.f + si? 7x
Sill 5X
sin 3x dx.
COS 2x
11. f (tan x - cot x)2dx.
f sinh 2x dx.
12.
f cos2 xdx sin2 x · 13· f sinh2 x dxcosh2 x · 14· J coshx
Exercises 451
21. fxeX 212 dx. 22. lx 2 sin x33 dx. 23. f✓ x dx. 24. f dx .
J J J 1 - x2 J X In X
25.
JW<~\ ¼) . u. JW ;x-Tx . 27 " J✓/: 1 .
28. f f' f dx. 29. x2(x - 1) 18 dx. 30. f x-.J dx . 31. fx'-fx+Tdx.
J e -1 J J x-1 J
32. rJ Vx + 1 dx. 33. Jr✓ 1 x'- x dx. 34. Jr 1 +dxx 35. Jr✓ 1 x- x dx.
3 X
2
X 4 •
2
6
36. rJ In_sm2x
tan dx. 37. JIncotx
X d 38. J sin etan2x dx. X. 3
X
sin x COS X
39. J
tan - xe<tan- 2x i
dx. 40.
J lnx dx.
2
1
>
2
1+x x(4 + In x)
Use integration by parts to compute the following integrals
54. f 2 dx 55 f dx 56 f dx 57. J
dx
J 5 + 2x · • J 2 - 3x · • J (3x + 5) (1 - 2x)
5 •
3 •
69 f dx 70 f x2 - 3x d 71 f dx 72 f dx
· Jx + x 3 4 • • J (x + l)(x- 1)2 x. · J x(x + 1) 2 · · Jx 1
4 -
73. f : dx .
JX - 1
~
76. j(]~= x);I + x)' j: + ~ dx. 77. j ~T+x (1
d_x
+ x)2
78. f dx . 79. f dx . 80. f dx
J ✓x2 - 4x J ✓4x - x 2 J ✓x2 - 4x - 5 ·
84. j ✓
x + 1
dx. 85.
j ✓
x - 3
dx. 86.
j 2x 2 + 3x + 2
--=====-- dx.
1 + 6x - x 2 1 + 6x - x 2 ✓ x 2 + 2x + 2
87. f 1-:; 2x- 3x' dx. 88. f ✓' + I dx. 89. f ✓x> dx . 90. f✓ x• dx.
J 1- x 2 J x2 +2 J 1- x2 J 1+ x 2
97. j 1
1
+ cos x - sin
. x dx.
- COSX + SlnX
98. Jcos xdx. 3 99. Jsin xdx.
5
Exercises 453
103.1 sin 2x cos 2 x dx. 104. 1 sin4 x dx. 105. 1 sin4 x cos 2 x dx.
106. f dx
J sin4 x ·
107. r cos2
J sin4x
X dx. 108. f dx
J sin2 X cos4 X
109. 1 dx
cos 4 x - sin4 x ·
110. 1 sin 5x cos x dx. 111. 1 sin x cos 5x dx.
112. 1 cos 7xcos 3xdx. 113. 1 sin 15x sin !Ox dx. 114. 12cos X cos 3
X dx.
Answers
1. 0.3 x 1013 + C. 2. 4'Vi + C. 3. -~ x 1518 + C.
15
4. ~
.t.
+ 2x + ln I x I + C.
In ✓ 2x + ✓ 4x 2
1 2x - 3
17. 1
6 tan
-I2x
3 + C. 18 .
12 In 2x + 3
+ C. 19. + 9 + C.
20.½sin- 1 ~x+C. 21. ~ 12 + C. 22. -cos ~ 3 +C. 23. - ✓ 1-x2 + C. 24. Inllnxl +C.
3 3 t
31. 28 (x + 1)413 (4x - 3) + C. 32. 10 (x + 1)213 (2x - 3) + C. 33. -(2 + x 2 )v l -x 2 +C.
34.~ tan- 1 x 2 +C. 35.! sin- 1 x 3 +C. 36.¼ln 2 tanx+C. 37.ln lnsinxl +C. 38.½etan 2 x+c. I
1
2 e<tan-•)zx + C. 40.
39.- ln ✓4 + ln 2 x + C. 41. -(x + l)e-x + C. 42. xln~ - 1 2x + C.
In 2
43. ¼ sin 2x -1 cos 2x + C. 44. xex + C. 45. x 2x + C. 46. x 2e-x + C. 47. x tan- 1x -
x2 X
50. x tan x + In I cos x I + C. 51. - 2 + x tan x + In I cos x I + C. 52. 2 (cos In x +
1 1 1
In Ix + 3 I + C. 68. --+C. 69. - --- + In - - ---
X
-- + C.
3(x + 1) 3 x+ X 2x 2 X + 1
70. - 1
--1 - + In I x + 1 I + C. 71. In Ix I + C.
x- ✓x2+1
72. 4
1
In
x-1
X + 1
- 1-tan_,
2
x + C. 73. l
3
(1n Ix - 1 I
✓x2+ x + 1
+ v'3 tan - 1 2x+l)+c.
v3
74. ,Ji +x +C. 75. 2✓ 1 + x + In
-Jl + X - 1
+C. 76.
X
✓ 1 - x2
+ C.
r·
1-x -Jl + X + 1
83. ✓ x 2 - 6x - 1 + 4 In Ix - 3 + ✓ x 2 - 6x - I I + C. 84. 2✓ 1 + 6x -- x 2 +
._,x-3+c
4 sm --- . 85. - ✓ 1+6x-x 2 +C. 86. x✓x2 +2x+2 +C.
v'io
X X
2 _ 1 5 tan 2 +4 1 2 tan- - 1
2
91. In ( 3 + tan 2 ; ) + C. 92. 3 tan 3 + C. 93. 5 In +C.
X
2 tan 2 + 1
2 2tan~+l
94. - - tan - 1 ----- + C. 95. ln(l + sin 2 x) + C. 96. _!_ In 5 - sin x +C
✓-f5 ✓-f5 4 1 - sin x
Exercises 455
tan x
97. -x + 2 ln
2 · x -
+ C. 98 . sm l.3
3 sm x + C. 99. - 51 cos s x + 3
2 cos 3 x -
X
1 + tan 2
. 3 x - I sm
cos x + C. 100. 31 sm . s x + C. 101 . I cos7 x - 1 cos s x + C. 102• 1
5 1 5 3 cos 3 x
1 x sin 4x 3 1 1 x
cos x + C. 103. 8- 32 + C. 104. 8 x - 4 sin 2x + 32 sin 4x + C. 105. 16 -
1 3
3 tan x - 2 cot 2x + C. 109. - 2
1 1n tan ( x - ,r)
4 + C. 110• - cos126x _ co~ 4x + C.
111 • _ co:26x + co~ 4x + C. 112_ si;~ox + si~4x + C. 113. _ sin5~5x + si~gx +C.
y
B
Fig. 10.1
can write
n
Qn = f(~1)AX"1 + f(6)AX"2 + · · · + f(~n)AX"n = ~ f(~k)AX°k,
k=I
Thus we have arrived at the limits of the same form as before; if these
two limits exist they are called the definite integrals of the functions f(x)
b T
and /(t) denoted by the symbols ) f(x) dx and ) /(t) dt, respectively.
a to
where f(~k) is the value of f(x) at the point ~k E [xk _ 1, Xk]- This sum is
called the integral (Riemann) sum for /(x) determined by the given partition
of [a, b] and the given selection for this partition; hence a value of Sn
is dependent on the partition of [a, b] and the selection for the given parti-
tion, i.e., it depends on the way of dividing tlie interval [a, b] into subinter-
vals [Xk - 1, Xk] and choosing points ~kin these subintervals, k = 1, 2, ... , n.
Let A be the largest of the lengths of the subintervals [Xk _ 1, Xk], k = 1,
2, ... , n, i.e., A = max /1xk, We say that a number J is the limit of in-
n I~k~n
tegral sums ~ f(~k)l1xk for f(x) on [a, b] if given any number e > 0 there
k=l
exists a number o> 0 such that
n
~ f(~k)/1xk - J <e
k=l
for every partition of [a, b] with 11xk < o, k = 1, 2, ... , n, i.e., for every
partition with A < o, and every selection ~k, k = 1, 2, ... , n.
_______ 10.1 Basic Concepts and Definitions_ 459
In symbols, we write
n
J = lim I; f(~k) iixk.
}..-+Ok=I
The numbers a and b are called the lower limit and the upper limit
of the integral, respectively; xis called the variable of integration. The func-
tion f(x) is called the integrand and the expression f(x) dx is called the ele-
ment of integration.
It is worth noting that by virtue of the definition the definite integral
remains unchanged when at any point c in [a, b] the value f(x) of the in-
tegrand is replaced by some other number. In other words, if we replace
the integrand f(x) by the function
g(x) = ff(x) for x E [a, b], x ~ c,
lA for x = c,
where A ~ f(c), then
b b
1f(x)dx = 1g(x)dx.
a a
This is also true if f(x) is modified at any finite number of points in [a, b].
The definition above applies only if a < b; it is also convenient to in-
a
elude the cases a =b and b < a. We put I f(x) dx = 0 for b = a and
a
a b
) f(x)dx =- 1f(x)dx for b < a.
b a
b
Example. Compute ) dx.
a
460 10. Integral Calculus. The Definite Integral
the sum ~ f(~k) tuk remains the same for the chosen ~1 in [Xo, xi]. This
k=2
implies that the integral sum Sn has no finite limit as max tuk -+ 0, i.e.,
l~k~n
f(x) is not integrable on [a, b]. Whence it follows that if f(x) is integrable
on [a, b] then f(x) is bounded on [a, b]. ►
Remark. If a function is bounded on [a, b] it is not necessarily integra-
ble on this interval; in other words, a function can be bounded on [a, b]
but not integrable on [a, b]. For example, the Dirichlet's function
n
2, ... , n, and Sn = I; 0 · AXk = 0 for every selection of irrational points
k=l
~k- Hence, given any arbitrarily small X. = max LU°k the integral sum Sn
l~k~n
is equal either to 1 or to O; this means that Sn has no limit as X. ~ 0, i.e.,
f(x) is not integrable on [O, 1]. ►
We shall give without proof three theorems that outline sufficient condi-
tions for a function to be integrable on a closed interval.
Theorem 10.2. If the function f(x) is continuous on the closed interval
[a, b] then f(x) is integrable on [a, b].
For example, the function f(x) = e - xi is continuous on [O, a] where a
is an arbitrary number and, consequently, f(x) is integrable on [O, a], i.e.,
+ 00
there exists the definite integral I e- xi dx of this function.
a
Theorem 10.3. If the function f(x) is defined and monotone on the
closed interval [a, b] then f(x) is integrable on [a, b].
It is worth noting that all values of the function f(x) which is monotone
on [a, b] lie between the numbersf(a) andf(b); sof(x) is bounded on [a, b].
Theorem 10.4. Let a function f(x) be bounded on a closed interval
[a, b] and let f(x) have a finite number of discontinuities (of the first or
second kind) on [a, b]. Then f(x) is integrable on [a, b].
For example, the function
sin~ for x -:;r:- 0
f(x) = (
1 for x =0
1s integrable on the closed interval [O, 1) since lf(x)I ~ 1 for all x in
[O, l], i.e., f(x) is bounded on [O, 1), and f(x) has the only discontinuity
(of the second kind) at x = 0.
a
J f(x) dx = J f(t) dt =
a
I f(u) du.
a
462 10. Integral Calculus. The Definite Integral
(2) The constant factor can be taken outside the integral sign so that
b b
i Af(x) dx = A f f(x) dx, A = const.
a a
a
i Af(x) dx = !i~ k~I Af(~k) ilxk
n b
= A lim ~ f(~k) ilxk = A ) f(x) dx. ►
A-Ok=I a
(3) The definite integral of the sum (difference) of two functions is equal
to the sum (difference) of the integrals of these functions, i.e.,
b b b
) 1/1 (x) ± f2(x)] dx = J fi (x) dx ± ) f2(x) dx.
a a a
b n
n n
= lim ~ f1(~k) ilxk ± lim ~ f2(~k) AXk
A-Ok=l A-Ok=l
b b
= i f1(x) dx± f f2(x) dx. ►
a a
where the sums on the right are associated with the partitions of [a, c]
and of [c, b], respectively.
lj
b=Xn X
Fig. 10.2
b n m
Whence
b C C C b
1f(x) dx = 1f(x) dx - 1f(x) dx = 1f(x) dx + 1f(x) dx.
a a b a c
Given /(x) > 0 and a < c < b this property implies that the area of the
curvilinear figure aABb is equal to the sum of the areas of the curvilinear
figures acCA and cbBC as easily seen from Fig. 10.2 ►
(5) Let the functions f(x) and g(x) be such that f(x) ~ g(x) on [a, b].
Then
b b
) f(x) dx ~ 1g(x) dx.
a a
This means that integration preserves inequalities between functions.
◄ Since f(x) ~ g(x) at every point x in [a, b] then given any partition
of [a, b] and any selection ~k, k = 1, 2, ... , n, there holds
n n
~ f(~k) AXk ~ ~ g(~k) AXk.
k=l k=I
'j
n a X
Fig. 10.3
so that
b b
) f(x) dx ~ ) IJ(x)I dx. ►
a a
(7) Let m and M be the minimum and maximum values of f(x), respec-
tively, on [a, b] where a < b. Then
b
m(b - a) ~ ) /(x) dx ~M(b - a).
a
Observe that
b b
) m dx = m ) dx = m(b - a)
a a
and
b b
) M dx = M ) dx = M(b - a).
a a
Whence
b
m(b - a)~ ) f(x)dx ~ M(b - a). ►
a
lJ
0 a 6 X
Fig. 10.4
211"
dx
Examples. (1) Evaluate the integral )
✓ 10 + 6 sinx
0
◄ Since
. 1 1
m = min = 0.25
o~x~2'11" ✓ 10 + 6 sin x ✓ 10 + 6 sinx X=-
'If
2
and
1 1
M= max = 0.50.
O~x~h ✓ lo + 6 sinx ✓ 10+6sinx X=
3
'IT
2
Then Property 7 yields
211"
2,r X 0.25 ~ dx
) ----;:=====- ~ 21r X 0.50
✓ 10 + 6 sinx
0
and
211'
dx
7r ~ ) --;:::::::=====- ~
2 7r. ►
✓ 10 + 6 sinx
0
(2) Evaluate which of the integrals
1 1
J e - x dx2
and J e - x dx
0 0
is larger without direct computations of their values.
10.3 Fundamental Theorems for Definite Integrals 467
- ----- --- - ·----------------
Whence
b
) f(x) dx
m ~-a----~M
b-a
because b - a> 0.
Put b
b ~a 1
a
f(x) dx = µ.,
where m ~ µ ~ M.
Since f(x) is continuous on [a, b], it at~ains all intermediate values be-
tween m and M. This implies that there exists a point x = ~ in [a, b] such
that f(~) = µ, i.e.,
b
) f(x) dx b
Put
~-a
b = 0, 0 ~ 0 ~ 1.
-a
Then ~ = a + (b - a)0 and we rewrite the conclusion of this theorem as
b
1f(x) dx = (b - a)f [a + (b - a)0], 0 ~ 0 ~ 1.
a
I)
0 a b X
Fig. 10.5
To interpret the mean value theorem for integrals we look at Fig. 10.5.
The curvilinear figure abBA is bounded by the graph AB of the function
f(x) which is nonnegative on [a, b] (a < b), the x-axis and the vertical lines
b
x = a and x = b. We let Qi denote the area of abBA; then 1f(x) dx = Qi.
a
For the rectangle abNM the base is [a, b] and the height is equal to the
ordinate of the point C(t /(~)) so the area Qi of abNM is Qi = (b - a)/(~).
The mean value theorem tells us that there exists a point C(t /(~)) on the
graph AB such that Qi = Qi.
10.3 Fundamental Theorems for Definite Integrals 469
The number
b
M[/(x)] = b ~a ) f(x)dx
a
M[sinx] =
7r-
I O
Jf sinxdx =_!_(-cos
7r
1r + cosO) = ~.
7r
►
0
Since f(x) is continuous on [a, b], this integral exists at every x in [a, b].
Consequently, the integral becomes a function of its upper limit x. We let
F(x) denote this function, i.e.,
X
x+ dX X x+ dX
M = F(x + AX) - F(x) = ) f(t)dt - ) f(t)dt = ) f(t) dt
a a a
a a x+ dx x+ dx
+ ) f(t) dt = 1f(t) dt + 1 f(t) dt= 1 f(t) dt.
X X a X
that is,
F' (x) = f(x) or ( I dt)'
f(t) = f(x) for all x in [a, b]. ►
Remark. If f(x) is a continuous function on [a, b] then for every x in
[a, b] there holds
)f(x) dx = 1f(t) dt + C,
a
◄ Recall that
r
Jx dx = T
x2 x2
+ C and F(x) = T + C.
Then
4
r x2 4 42 22
J x dx = 2 2 =2 - 2 = 6. ►
2
11"
◄ We have
11"
where F(x) is an antiderivative of /(x) on [a, b ], i.e., F' (x) = /(x) for all
x in [a, b].
Consider the composite function 4'(t) = F [',O(t)] in ton [a, {l]. By the
chain rule of differentiation of a composite function we obtain
<l>'(t) = F' [',O(t)] 'P'(t) =f [',O(t)]'P'(t).
Hence 4'(t) is an antiderivative of the function/[',O(t)] 'P' (t) continuous
on [a, {l]. Then the Newton-Leibniz theorem yields the desired result,
namely
(j
I /[',O(t)] ',O' (t) dt = 4'(/1) - cl,(a)
a
b
= F['P(/1)] - F[',O(a)] = F(b) - F(a) = ) /(x) dx. ►
a
J ✓a2 - x2 dx = a1 J cos t dt 2 = a2
J
1 + cos 2t
2
dt
0 0 0
11" 11"
e
ln2x
=2 ( 02 t
2
0
1 . 2
+ Z SID t :) -
1ra2
4
. ►
(2)
J
1
X
dx.
474 10. Integral Calculus. The Definite Integral
j ln 2 x
x dx =
j t dt = Tt
1
2 3 1
= 31 . ►
0
I 0
j ✓e• - I
t2 dt
1 + t2
=2 l
J
0 0
l 2
1+ t
) dt=2 (t 1-
0
tan - l t 01)
= 2(1 - tan - 1 1) =2 - ; . ►
1
(4) ) (2x3 - l)✓x4 - 2x + l dx.
0
j
1
(2x3 - I)✓
,-----
x 4 - 2x I dx + = ~
0
j ,ft dt =
0 I
a
1f(x)dx =
[2) a
0
f(x)dx if f(x) is even,
-a 0 if f(x) is odd.
10.4 Evaluating Definite Integrals 475
Recall that for an even function /(x) we have /(x) = /( -x) so that
a a
J f(x) dx = 2 J /(x) dx. Similarly, for an odd function /(x), i.e., for
-a 0
a
f(-x) = -/(x), ) /(x) dx = 0. ►
-a
'Ir
J uv' dx = uv
a
I! - J vu' dx.
a
476 10. Integral Calculus. The Definite Integral
e
=(x - 1r) cos x I~ - sin x I~= ► 1r.
(2) J lnx ~
X
2-dx.
I
dx dx 1 and in-
◄ Put u = lnx and dv = - 2- . Then du= - - and v =
X X x
tegration by parts gives
e e
e e e
lnx 1 dx lnx 1
dx = - -lnx +
J
I
x2 X
1 J
I
x2 - X
I X
1 1
+1 = 1 - 2e- 1 • ►
e e
Q =
a
Jx
r 2 x3 a a3
dx = -3- o - -3-·
0
10.5 Computing Areas and Volumes by Integration 477
0 a g X
Fig. 10.6
Suppose that f(x) is a negative function on the closed interval [a, b],
where a < b, i.e., f(x) < 0 on [a, b]. Then the region (the plane figure)
bounded by the graph of y = f(x), the lines x = a and x = b and the x-axis
b
lies below the x-axis (see Fig. 10.8) and i f(x) dx < 0. In this case the area
a
Q of the plane figure abBA is
b b
Q= - i f(x) dx or Q = i f(x) dx
a a
Example. Compute the area of the plane figure bounded by the parabo-
la y = x 2 - 2x and the x-axis (Fig. 10.9).
y y
a g
0 X
X A
◄ Since y ~ 0 on (0, 2] the plane figure lies below the x-axis, the desired
area 1s
2 2
x3 2
4
and Q =3 . ►
!/
Fig. 10.9
Let a function f(x) change its sign when x passes through a point
c E (a, b ). Then the plane figure bounded by the graph of y = f(x), the
x-axis and the lines x = a and x = b can be regarded as being made up
of two plane figures lying above and below the x-axis (Fig. 10.10). In this
case the area Q of the plane figure is
C b
Q = Qi + Q2 = i f(x) dx +
a
) f(x) dx
C
For example, for the plane figure bounded by the parabola y = 1 - x2,
the line x = 2 and the x- and y-axes (Fig. 1O.l 1) is
1 2
Q= (1 - x 2 )dx + (1 - x 2 ) dx
J
0
J 1
1 x3 1 2 x3 2
=X
3
+ X
3
0 0 1 1
1 8
=1-- + 2-1-- +l - .
-2
3 3 3
10.5 Computing Areas and Volumes by Integration 479
0 r
Fig. 10.10
Fig. 10.11
Suppose now that f(x) and g(x) are continuous functions on [a, b],
where a < b, and f(x) > g(x) > 0. Let the graphs of y = f(x) and y = g(x)
intersect at the points A and B (Fig. 10.12). Then the area Q of the plane
figure bounded by the graphs of y = f(x) and y = g(x) is equal to the differ-
ence of the area Qi of the plane figure aACBb and the area Q2 of the
plane figure aADBb. Thus
b b b
Q = f f(x) dx - f g(x) dx or Q = f [f(x) - g(x)] dx.
a a a
y
8
0 a 6 X
Fig. 10.12
480 10. Integral Calculus. The Definite Integral
0 X
Fig. 10.13
◄ The abscissas of the points A and B where the parabolas intersect are
the solutions of the equation 4x - x 2 = x 2 - 4x + 6 or x 2 - 4x + 3 = O
whose roots are x1 = 1 and x2 = 3; hence the lower and upper limits of
integration are a = 1 and b = 3. Then the desired area Q is
3 3
3 2 3
= 4x2 - - x3
3
- 6x
1 1
If we put x = 'P(f) and y = t/;(t) then we get the following expression for
the area Q of the plane figure specified by parametric equations
(3
Q= j t/;(t) 'PI (t) dt.
Q'
0 a b X
t=d t=/3
Fig. 10.14
Example. Compute the area of the ellipse given by the parametric equa-
tions x = a cost and y = b sin t, 0 ~ t < 21r (a, b > 0).
◄ Since the ellipse is symmetric relative to the x-axis and to the y-axis
it suffices to compute the area of the ellipse in the first quadrant; the
desired area is
a
Q= 4 j ydx.
0
Example. Compute the area of the plane figure bounded by the parabo-
la x = 2 - y - y 2 and the y-axis (Fig. 10.16).
y !I
d,____,_____,~~~~~~~
X= g(y)
0 X
Problems. (1) Compute the area of the plane figure bounded by the
parabola y 2 = 2x + 1 and the line x - y - 1 = O;
(2) Compute the area of the plane figure bounded by the curves
y = sin - 1 x and y = cos - 1 x and the x-axis. (Hint. Write the equations of
the curve in the form x = g(y).) ·
10.5 Computing Areas and Volumes by Integration 483
Fig. 10.17
0 p
Fig. 10.18
To compute the area of the curvilinear sector OABO we divide the sector
into n subsectors by drawing n rays '/J = a = <Po; '/J = 'Pt, ... , '/Jn - 1;
'/J = (3 = '/Jn from the pole. Let A'fJ1, A'/>2, ... , A'/Jn be angles between the
rays. We let '(pk denote the angle formed by the rays '/Jk - 1 and '/Jk and Qk
the position vector associated with '(pk. Consider the circular sector of
radius Qk with central angle A'/Jk (Fig. 10.18). The area of this circular sector
is equal to AQk = ; QkA'/Jk or AQk =; j2(rpk)A'/Jk since Qk = f('Pk)-
When we replace each curvilinear subsector by the corresponding circu-
lar sector we obtain the plane figure made up of n circular sectors; its area
Qn is
31*
484 10. Integral Calculus. The Definite Integral
n n
k=l k =1
We denote the largest fj.<Pk by A = max fj.<Pk • If we make n tend to in-
1 ~k~n
finity, so that A ➔ 0, i.e., if we divide a given curvilinear sector into smaller
subsectors, then the plane figure made up of circular sectors comes closer
to the curvilinear sector OABO. Thus we may regard the limit of the area
Qn as A = max /j.<Pk ➔ 0 as the area of the curvilinear sector OABO
l~k~n
provided that this limit exists and is independent of a partition of the closed
interval [ec, /3] and of a selection 'Pk associated with a given partition, k = I,
2, ... 1 n, 1.e.,
n
Clearly, the sum ~ ~ / 2 ( 'Pk) fj.<Pk is an integral sum for the function
k =1
~/ 2 (<P) which is continuous on [a, /3] since f(<P) is continuous on [a, /3].
Hence, the limit of this sum as A ➔ 0 exists and is equal to the definite
(3
(jJ
2a p
Fig. 10.19
Q = ~2 r
21r
0
(1 + cos cp)2dcp = ~ 1
0
(1 + 2 cos 'ii' + cos 2 'P) d'()
'II
I
I
b X
Fig. 10.20
to the x-axis and directing line is the line of intersection of the solid surface
with the perpendicular plane x = ~k (see Fig. 10.20). The volume ~Vk of
this cylinder is the product of its height !:ak and the area Q(~k) of its base
so that ~ Vk = Q(~k) AXk, where ~k E [xk _ 1, Xk]. The volume of all n
cylinders thus obtained is
n n
Vn = ~ ~vk = ~ Q(~k) AX°k,
k=I k=I
If the limit of the sum on the right exists as ~ = max AX"n ~ 0 we set
l~k~n
it equal to the volume of the solid in question so that
n
Notice that this sum is an integral sum for the function Q(x) which is con-
tinuous on the closed interval [a, b]; hence, the above limit exists and is
equal to the definite integral
b
V = f Q(x) dx.
a
,--
\
1
y
Fig. 10.21
◄ The cross section of the ellipsoid with the plane perpendicular to the
x-axis at x is the ellipse (Fig. 10.21)
Y2 z2 x2
-+-=1--
b2 c2 a2
10.5 Computing Areas and Volumes by Integration 487
or
Y2 z2
-------+-------=!
Q(x) = 1rbc ( I - :, ) .
lj
8
0 X
0 X
Fig. 10.23
Fig. 10.22
2
( 1 - x02 ) dx = 3
4 1rabc.
The cross section of the solid of revolution with the plane perpendicular
to the x-axis at x is the circle whose area is Q(x) = 1ry 2 = 1rf2 (x); hence,
the volume of the solid of revolution is
b b
V = 1r 1/ 2 (x) dx or V = 1r 1y 2 dx.
a a
Fig. 10.24
points by line segments AM1, M1 Mi, ... , Mn - 1B with lengths M1, &2,
... , ~Sn, respectively, we obtain the polygonal line AM1M2 ... Mn - 1B
.____,
inscribed in the arc AB. The length Sn of this polygonal line is
n
Sn = &1 + &2 + , , , + as"n = t
k=l
as"k,
provided that this limit exists and is independent of choices of the points
M1, M2, ... , Mn- I on the arc AB.
Arc length of a curve in Cartesian coordinates. Let an arc AB be speci-
fied by the equation y = f(x) where the function f(x) has a continuous
derivative f' (x) on a closed interval [a, b ]. Consider a partition of [a, b]
into n subintervals [Xk _ 1, Xk], k = 1, 2, ... , n by choosing points such that
Xo = a< X1 < X2 < ... < Xk- I < Xk < ... < Xn = b.
y
Fig. 10.25
Recall that f' (x) is continuous on [a, b]. This implies that
✓ 1 + lf' (x)] 2 is a continuous function on [a, b]. Hence the integral sum
on the right of ( *) has a limit S as max dsk ~ 0 so that
1 ::s;;k,:;;;;n
n ~---- b
S= lim I; ✓ 1 + [/' (fa)] 2 AXk = I ✓ 1 + [/' (x)] 2 dx
max..isy-->Ok =1 J
a
l ::s;;k,:;;;;n
y' = sinh x.
Noting that cosh 2 x - sinh 2 x = 1, we obtain
✓ 1 + (y') 2 = ✓ 1 + sinh 2 x = ✓cosh 2 x = coshx (coshx > 0)
so that
a a
S= ) cosh x dx = sinh x = sinh a. ►
0 0
y=cosh x
0 I
-----oc::-¥-_ _ __.____ _
(1 X
(2) Find the length of the ellipse given by the parametric equations
x = a cost, y = b sin t, 0 ~ t < 21r (0 < b ~ a).
◄ Since x; = -a sin t and y/ = b cost and the ellipse is symmetric rela-
tive to the coordinate axes, formula (•••) gives
1r/2
S =4 I ✓a 2 sin2 t + b 2 cos 2 t dt
0
..-/2
=4 I ✓a (1 -
2 cos 2 I) + b 2 cos 2 t dt
0
..-12 --------- 1r/2
=4 I✓ a2 - (a 2 - b 2 ) cos 2 t dt = 4a I ✓1 - e 2 cos 2 t dt,
0 0
492 10. Integral Calculus. The Definite Integral
✓ a2 - b2
where e = ---- is the eccentricity of the ellipse, 0 ~ e < 1.
a
The definite integral on the right is called the elliptic integral; it is worth
noting here that the Newton-Leibniz theorem is inapplicable for computing
this integral since the antiderivative is not an elementary function. ►
Notice_ that the substitution t =; - T yields
Fig. 10.28
(3) Find the length of one arc of the cycloid specified by the parametric
equations (Fig. 10.28)
x = a (t - sin t), y = a (1 - cos t), 0 ~ t < 21r (a > 0).
◄ Applying formula (***), we have
S =a j✓
21r ,----------
(I - cos t) 2 + sin t dt = a
2 j ✓2
21r
- 2 cost dt
0 0
21r 21r
=a j .J4 sin ~ dt = j 2
2a
. t
s1n 2
dt
0 0
2,r
= 2a j sin ~ dt =
2,r
t
-4a cos- = 8a. ►
2 0
0
'--'
Arc length of a curve specified by its polar equation. Let AB be an
arc of a curve specified by its polar equation e = /(cp).on a closed interval
[a, ~] where .f(cp) has a continuous derivative/' (cp). To find the length
10.6 Computing Arc Lengths by Integration 493
of the arc AB we derive the parametric equations of the curve. Recall that
the relations between the polar and Cartesian coordinates are given by
x = e cos 'P and y = e sin 'P· Replacing e by f('P), we get the parametric
equations x = f( 'P) cos 'P and y = f( 'P) sin 'P that describe the given curve.
We use here the polar angle 'P as the parameter.
On differentiating the parametric equations, we have
x~ = f' ('P) cos 'P - f('P) sin 'P
and
y~ = f' ('P) sin 'P + f('P) cos 'P·
Squaring both identities and adding (x;) 2 with (y;) 2 , we get
(x;)2 + (y;)2 = [/' ('P)]2 + [/('P)]2.
or, equivalently,
/3
s= ~ ✓ e2 + (e, )2 d'P,
a
Example. Find the length of the cardioid given by the polar equation
e = a (1 + cos 'P), a > 0 (see Fig. 10.19).
◄ On differentiating the original equation we obtain e ' = - a sin 'P =
f' ('P), 0 ~ 'P ~ 21r, so that by symmetry
,r
0 0
'Ir
Now we consider an arc of the curve y = f(x) shown in Fig. 10.29. Sup-
pose that the function /(x) has a continuous derivative/' (x) on the closed
interval [a, b]. We let AM denote the arc bounded by the points A(a, f(a))
494 10. Integral Calculus. The Definite Integral
dS(x)
~---
dx
--- =
dx
J
✓ + [f,(t)] 2 dt)
-d -( - 1 = ✓ l + L/,(x)] 2
Q
or
~~f = J (-~Y
1 +
Whence the length dS of the arc AM becomes
ij
dx
0 X x+ dx )(
Fig. 10.29
as being nearly equal to the length of the line segment MN of the tangent
MT to the given curve at the point M, i.e., ~ = dS.
When a curve is specified by the parametric equations
x = cp(t), y = 1/; (t), to ~ t ~ T,
where cp(t) and 1/;(t) have continuous derivatives on [to, T], then
dS = ✓ [cp'(t)] 2 + [i/;'(t)] 2 dt
or
dS = ✓ (x/ ) 2 + (y/ ) 2 . dt.
If we put the parameter t equal to the length S of a variable arc, i.e.,
x = cp(S) and y = 1/;(s), then the above formulas become
dx )
( dS
2 ( dy )2
+ dS = 1.
For a curve given by the polar equation e = f(cp), ex ~ 'P ~ {3, where
f( cp) is a function having a continuous derivative oh the closed interval
[ex, {3] we have
dS = ✓ (/ + <e, )2 dcp.
10.7 Applications of the Definite Integral
,
Work done by a variable force. Suppose we wish to define a work
done by a force F in moving a particle M along the x-axis from point a
to point b (a < b ). Recall from the course of general physics that the work
W done by the constant force F is the product of the absolute value F
of this constant force by the distance S = b - a, i.e., W = FS, provided
that F is directed along the x-axis.
Let a force F acting on a particle M along the x-axis be a continuous
function in x (F = F(x)) on the closed interval [a, b] of the x-axis. We parti-
tion the closed interval [a, b] into n subintervals with lengths AX1, AX2,
... , AXn by choosing points in [a, b] such that Xo = a < Xi < X2 < ... <
Xn = b. Assume that ~k is an arbitrary point in the subinterval [Xk- 1, Xk]
and the force F is constant on [Xk - 1, Xk] so that F = F(~k)- Then given
a sufficiently small AXk the work awk done by F (~k) in moving the particle
from Xk - 1 to Xk is LiWk = F(~k) AX°k and the sum
n n
Wn = I; awk = I; F(~k) AXk
k=l k=l
W = Jk q~f 2
dr = kq1112 J~ = kq1112 ( - !) :
= kq1 Q2 (-r11- - - 1-) .
r2
►
The mass and centre of gravity of a rod of varying density. Consider
a rod of varying density. Think of the rod as a closed interval [a, b] of
the x-axis and suppose that the linear density of the rod is defined as a
function e = e(x).
We partition the closed interval [a, b] into n subintervals [xk _ 1, Xk],
k = 1, 2, ... , n, by choosing the points
Xo < Q < Xt < X2 < , .. < Xn-1 < Xn =- b.
We let ~k denote an arbitrary point in the kth subinterval [Xk- t, Xk],
k = 1, 2, ... , n, and consider the sum
Clearly, each summand gives the approximate value of the mass of the
part of the rod corresponding to the subinterval [Xk _ 1, Xk]. Then this sum
determines the approximate value of the total mass of the rod so that we
10.7 Applications of the Definite Integral 497
n
can define the total mass m as the limit of ~ e(~k) LUk as max LU°k ~ 0
k=I 1::;;k::;;n
b
which is equal to the integral ) e(x) dx. Thus, the total mass m of the
Ci
---
Let us partition the closed interval [a, b] into n subintervals [Xk _ 1, xk]
by choosing the points a = Xo < X1 < ... < b = Xn and compute the mass
mk corresponding to the kth subinterval, k = 1, 2, ... , n. Clearly,
Xk
mk = ) e(x) dx. Then applying the mean value theorem (Theorem 10.5),
Xk- I
we get
where Xk - 1 ~ ~k ~ Xk,
Assume that a particle with mass mk is located at some point ~k in
the subinterval [Xk - 1, Xk], Then we can replace the rod by a system of
n particles with masses mk located at the points 6, b, ... , ~n in the closed
interval [a, b]. Since
I;
k=l
n
mk = I;
n
r
x
k=lxk-1
e(x)dx =
b
I e(x)dx = m,
a
1 1
m = 1xdx = 2 .
0
+ . .. + f(Xn-d + f(Xn) (
Xn - Xn - 1
)
2
n-l
where f(Xk- i) and f(xk) are the bases of the trapezoids and
Xk - Xk-1 = -b-a
- - are t h.elf h.h
e1g ts.
n
Thus we arrive at the formula
I
b n-1
called the trapezoidal approximation. Notice that the larger is n the higher
precision of approximation is obtained.
lj
Xn-1 b=Xn X
Fig. 10.30
It is worth noting that given a function /{x) having the continuous sec-
ond derivative /" (x) on [a, b] the absolute error of approximation does
not exceed the number
M (b - a)3
12n 2 '
where M= max lf"(x)l-
a(;,x(;,b
32*
500 10. Integral Calculus. The Definite Integral
f
j X
dx I
+
= ln(x + I) 1
o
= ln2"' 0.69315.
0
Q= J (Ax 2 + Bx + C) dx = A - 3-
~ h
0
+B2
~ h
0
+ Cx
h
0
0
= A - h3- + B 2h +
3
= 6h (2Ah 2 +
2
Ch 3Bh + 6C).
parabola, we obtain
h1 h
Yo = C, Y1 = A 4 + B 2 + C, Y1 = Ah 1 + Bh + C.
Whence
2Ah 1 + 3Bh + 6C = Yo + 4y1 + Y1
so that
h
Q =6 (Yo + 4y1 + Yz).
------- y= Ax 2 +Bx +C
0 h h X
2
Fig. 10.31
b
Let us consider the definite integral ) f(x) dx where f(x) is a continuous_
a
nonnegative function on [a, b]. We divide [a, b] into 2n regular subintervals
(notice that 2n is even) by the points
a = Xo < X1 < X1 < ... < X1n - 1 < X1n - I < X1n = b
and write down the original integral in the form
1f(x) dx =
b ~
J f(x) dx +
~
1f(x) dx + t
... + J f(x) dx.
a Xo
We let A, M1, M1, ... , M2n - 2, M2n - 1, B denote the points of intersec-
tion of the vertical lines x = Xk, k = 0, 1, 2, ... , 2n, with the graph of
the function y = f(x); let Yo, Y1, Y2, ... , Y2n - 2, Y2n - 1, Yin be the ordinates
of the respective intersection points. If we draw a parabola with vertical
axis of symmetry through every triple of points M2k - 2, M2k - 1, M2k (k = 1,
2, ... , n) we get n curvilinear plane figures bounded above by the parabolas
(Fig. 10.32). Since the area of the curvilinear plane figure corresponding
b-a
to the subinterval [X2k- 2, X2k] with length h = - - - is approximately
n
502 10. Integral Calculus. The Definite Integral
equal to the area of the respective "parabolic" trapezoid, formula (*) gives
1
x2k
Jf /(x) dx ~ b - a
6n [Yo + Y2n + 2 (Y2 + Y4 + . . . + Y2n - 2)
a
+ 4(y1 + Y3 + ... + Y2n - 1)].
This is the parabolic approximation which is sometimes called Simp-
son's approximation.
Fig. 10.32
0
with 2n = 4.
Exercises 503
◄ Consider the regular partition of the closed interval [0, 1] by the points
Xo = 0, xi = 0.25, x2 = 0.50, X3 = 0.75, X4· = 1. The values of the function
Jf dx b-a
x + l :::::: 6n [Yo + Y4 + 2y2 + 4 (Y1 + y3)] =
0
Let us estimate the error of the result thus obtained. The integrand
1 24
1 has the fourth derivative j< >(x) = so that
f(x) = 4
5
X + (X + 1)
24
= 24.
(1 + x)5
Hence the error is at most 24 < 0.0005. Comparing Simpson's
4
2880 X 2
approximation with the exact value of the integral we conclude that the
absolute error in Simpson's approximation is less than 0.0001 as it has been
predicted above.
The computations of the trapezoidal approximation and Simpson's ap-
1
· · to t h e 1ntegra
prox1mat1on · 1 Jf x dx
+ 1 prove t h at t h e 1atter 1s
. more precise
.
0
than the farmer. ►
Exercises
Compute the following integrals by applying the Newton-Leibniz
theorem
4 2e
J(rx- Jx)
11'
1.
1
f x vx dx.
0
2
5
2.
0
dx. 3. j
e
dx
X
4.
7r/2
) cos~dx.
l/v2 1 3
3
5.
) ✓
dx
1- x2
6. ) 1+
dx
x2 . 7. ) lxl dx.
-2
8. ) xdx
✓x + 1
-1/../2 0
0
-3 e4 1
~
dx
~
xdx
~
dx 10.
9. 11.
✓25 + 3x x✓ lnx ✓ 1 - x2
0 e 0
504 10. Integral Calculus. The Definite Integral
4 2 1
In (x + ✓ 9 + x 2 )
12.
i
0
✓9 + x2
dx. 13.
i
0
xdx
✓ 1 + x4
14.
i
0
xdx
✓ 4 - x4
1r/2
1r/2
COS X - SinX
15.
I
0
3cos 2 x sin 2x dx. 16.
J cosx + sinx
dx.
e
7r/ 12
i
1
cos (In x)
17. J In sin 2x • cot 2x dx. 18. dx. 19.
J sinh 2 xdx.
X
0 0
1
ln2 ln3
20. tanhxdx. 21. ) tanh 2 xdx.
J
0 0
Integrate by parts each of the following integrals
21r 1 1
22. ) x sin x dx.
0
23.
I In (1 + x) dx.
0
24.
J0 xsinh xdx
l/v'2 1
25. J sin - 1x dx. 26.
J
x3 exl/2 dx.
0 0
Compute the areas of the plane figures bounded by the given curves
and the indicated axes
27. y = x 2 + 2x - 3 (parabola), y = x + 3 (line).
28. y = 2x - x 2 (parabola), y = -x (line).
29. y = x 2 - l (parabola), x = 2 (line); the x- and y-axes.
30. y = x 2 - 3x - 4, y = 4 + 3x - x 2 (parabolas).
31. y = x - l, y = 1, y = In x.
32. y = e - X, y = eX, x = 1.
33. y = x 3 , y = 8; the y-axis.
34. x = a cos 3 t and y = a sin 3 t, a > 0 (astroid).
35. x = a (t - sin t) and y = a (l - cost), a > 0 (an arch of a cycloid); the
axis of abscissas.
36. x = a (2 cost - cos 2t) and y = a (2 sin t - sin 2t), a > 0 (cardioid).
37. e = a sin cp, a > 0. 38. e = a sin 2cp, a > 0. 39. e = 2 + sin 'P·
Compute the volume of the solid generated by revolving the given curves
or the given plane figures around the indicated axis
40. y = sin x, 0 ~ x ~ 1r; around the x-axis.
41. y = sin 2 x, 0 ~ x ~ 71'; around the x-axis.
x2 Y2
42. -
a2 + - = 1 (ellipse); around the x-axis.
b2
Exercises 505
Compute the length of the arc of the given curve between the indicated
points
x2
47. y = 2 ; from (0, 0) to (1, 1/2).
48. y = U (semicubical parabola); from the origin of coordinates to
A (1, 1).
49. y = lnx; from x = v3 to x = VS.
50. y = In sin x; from x = 7r/3 to x = 71"12.
51. x = a cos 3 t and y = a sin 3 t, a > 0 (astroid; define the total length).
t3
52. x = - t and y = t 2 + 2· from t = 0 to t = 3.
3 '
53. x = et cos t and y = et sin t; from t = 0 to t = In 71".
54. e = a e"", a > 0, (logarithmic spiral); inside the circle e ~ a.
55. e = a sin cp, a > 0; (total length).
56. e = acp, a > 0, (spiral of Archimedes); the length of the first winding.
Answers
5 4 1r 1r 8 2
1. - . 2. - . 3. In 2. 4. - I. 5. . 6. - . 7. 6.5. 8. - . 9. - - . 10. 2. 11. 1.
17 3 2 4 3 3
3 2 1 r.;:.- 1r 2 In 2 . . 2
12. - In 3. 13. - In (4 + "17 ). 14. - . 15. - - . 16. 0. 17. - . 18. sm 1. 19. smh 1.
2 2 4 In 3 4
5 4 -I 7r+4-\f'2 _C
20. -In-. 21. In 3 - - . 22. -21r. 23. In 4 - 1. 24. e . 25. - - - - . 26. 2 - ve.
4 5 2
125 125 3 2 2
27. - . 28. 4.5. 29. 2. 30. - . 31. e - 2.5. 32. 2 cosh 1 - 2. 33. 12. 34. - 1ra . 35. 31ra .
6 3 8
2 1ra 2 1ra 2 9 2 1f 2 2 3
1ra s 7r
4
36. 61ra . 37. - - . 38. - - . 39. - 1r. 40. - 2 . 41. - 1r . 42. - 7f{Jb . 43. - 30 . 44. - .
2 4 2 8 3 2
h2 ) 1 13 ✓ 13 - 8
45. 7f{J 2.Jpij. 46. 1rabh ( 1 + ~ . 47. 2 [\'2 + In (1 + v'2)]. 48. 27
1 3 1
49. 1 + - In- . 50. - In 3. 51. 6a. 52. 12. 53. (1r - 1)\12. 54. a\12. 55. 1ra.
2 2 2
Let a function f(x) be defined for all x ~ a and integrable, say continu-
ous, on every finite closed interval a ~ x ~ b where a is a given number
and b (b ~ a) is any arbitrary number. To deduce what we mean by writing
+oo
I f(x)dx (*)
a
which is called the improper integral, we consider the function J(b) =
b
I /(x) dx of the variable b (b ;;;::: a).
a
Definition. If the function J(b) has a finite limit L as b--+ + oo the im-
proper integral (*) is said to converge.
In this case we write
+oo b
lim r /(x) dx = L.
J /(x) dx = b-++oo
a
J a
Suppose that the function J(b) has no (finite) limit as b --+ + oo. Then
the improper integral (*) is said to diverge; in this case no numerical value
is assigned to this integral.
11.l Integrals with Infinite Limits of Integration 507
+00
j dx
1+ X
2
_
-
.
11m
b-+ + oo
j
b
dx
1+ X
2 = .
11m tan - t b
b-++oo
= 21r . ►
0 0
+oo
(2) The improper integral I cos x dx diverges.
0
b
◄ Indeed, the integral I cos x dx = sin b has no limit as b ~ + oo; hence,
it diverges. ► 0
(3) Let two charges Q1 and Q2 be both positive so that they repel each
other. By the Coulomb's law the absolute value F of the force of electrostat-
ic interaction of two point charges in vacuum is given by
F = kq1q2
r2 '
where r is the distance between the charges and k is a constant.
Suppose that the charge q1 is located at the point Mo which is chosen
as the origin of some reference frame and the charge q2 is located at the
point M. We let r1 denote the distance between the ponts M and Mo and
compute the work done in moving the charge Q2 from M to infinity.
◄ The desired work Wis defined as the improper integral
+oo +oo
W = j k~? 2
dr = kq1 Q2 j dr
-2.
r
r
J
dr = lim
r2 b ➔ + oo
r dr
Jr 2
= lim
b-+ + oo
(-l)
r
r=b
r=r1
= lim
b-+ +oo
(
- 1- -
r1
!)
b
1
There1ore,
c W -- -kqi
-- . charge. Then W
Qi . Let q2 be a unit kqi
= --; th"1s
n ~
quantity is called the potential of the field induced by the charge Q1. ►
+oo
J
l d:
X
= lim
b->+oo
1
x• -a b bl -a 1
1- Cc'. 1- Cc'. 1 - Cc'.
Thus the original integral converges if a > 1. For a < 1 the integral J xadx
1
has no finite limit as b -+ + oo; in other words the integral diverges if a < 1.
Now let a = 1. Then
b b
l dx = f dx = In b.
J xa J X
1 1
b
the integral
f dx ( converges if a > 1,
J
1
t
xa diverges if a ~ 1.
y
cr=1
-r----a:<1
----a:=1
a>1
0 1 b X
Fig. 11.1
1 b X
Fig. 11.2
I
+ 00
dx
Remark. Notice that given any a ~ o> 0 the integral - - converges
X0t
a
for a > 1 and diverges for a ~ 1.
The definition of the improper integral
+ 00 b
1/(x) dx = lim
b-++oo
r /(x) dx
j
a a
+oo +00
(1) If the integral 1f(x) dx converges so does the integral 1 'A./(x) dx
a a
where 'A. is any real number and
+00 +oo
) 'A/(x) dx = 'A 1f(x) dx.
a a
+ 00 + 00
(2) If both 1/(x) dx and 1<P(X) dx converge so does
a a
+00
) (f(x) + 'P(x)) dx and
a
Both integrals on the right have limits as b ➔ + oo; hence, the integral on
+oo
the left also has a limit as b ➔ + oo, i.e., 1(f(x) + <P(X)) dx converges.
a
Evaluating the limit of the above identity as b ➔ + oo, we obtain the desired
result. ►
+oo
Problem. Let the integral 1(f~x) + 'P(x)) dx converge. Determine
a
+00 +cxi
whether or not the improper integrals ) f(x) dx and ) 'P(x) dx converge.
a a
(3) Let u(x) and v(x) be continuously differentiable functions on the
line x ~ a. Then
+oo +00 + 00
I u dv = (u(x) v(x))
a a
) v du (integration by parts)
a
provided that at least two of the three terms involved make sense (notice
that in this case the third term also makes sense).
+ 00
Example. Integrate by parts Jn = I x'2e - x dx, where n is a natural
0
number or zero.
11.2 Integrals of Nonnegative Functions 511
◄ We have
+00 +00 +00
In = I xn e - x dx =- (xn e - X) +n j xn - le - x dx = nJn - 1 ,
0 0 0
n = 1, 2, ....
Notice that
+00
Jo = J e - x dx = 1
a
we get In = n ! ►
Also, since /(x) ~ ip(x) for all x ~ a then given any b > a there holds
b b
j /(x) dx ~ j ip(x) dx.
a a
512 1I. Improper Integrals
b
The value of the integral J cp(x) dx is not larger than that of the in-
a
+ 00
tegral I cp(x) dx; the latter integral converges by the hypothesis. Hence,
a
J 1 + x z + sin x
e- x2
Example. Investigate the improper integral 4 dx.
0
-x2
◄ Notice that for all x ~
. Ji(x) = - - -
0 the function . sue h
e - - - 1s
2 4 1 + x + sin x
that
e-x2 1
Q. < 1 + x 2 + sin4 x ~ 1 + X2 = <P(X).
+ 00
◄ Indeed, let
cpx
lim
x ..... +oo
ft~
= k > 0. By the definition of the limit this
k
means that given any number e > 0, say e = 2 > 0, there exists a number
N such that for all x ~ N
f(x) _ k k
<s=-
cp(x) 2
or, equivalently,
33-9505
514 11. Improper Integrals
converge.
2x 2 + 1
◄ The integrand f(x) = - - - - - 1s positive for all x ~ 1. Write
x 3 + 3x + 4
down f(x) as
+ 1/x2 2
f(x) = x + 3/x + 4/x2 •
Whence· we conclude that for large x the function f(x) is similar to the
function 2/x.
+oo +oo
+4
= 2 rt 0. ►
+ 00
Theorem 11.3. Let there exist a number a > 1 such that for all suffi-
ciently large x
+oo
where Mis independent of x and M > 0. Then the integral J f(x) dx con-
a
verges.
If f(x) ~ M where M is independent of x and M > 0, whenever x is
X
+oo
sufficiently large, then the integral ) f(x) dx diverges.
a
◄ Let O ~ f(x) ~ ~(a> 1) for all x ~A > max (a, O}. Since the in-
+oo · X ·
= ~
tegral
Jf M dx where a > 1 converges then putting <P(X)
~ X
in The-
.
A
11.3 Absolutely Convergent Improper Integrals 515
+ 00
orem 11.1, we conclude that so does J f(x) dx. Whence it follows that
+00 A
J f(x) dx converges, for
a
b A b
) f(x) dx = J f(x) dx + J f(x) dx
a a A
b b
and the integrals J f(x) dx and J f(x) dx must simultaneously have finite
a A
limits as b ~ + oo.
M
Now let f(x) ~ - (M > 0) for all x ~ A > max {a, 0}. Since the in-
+ 00 X
tegral } ':dx diverges then, by virtue of Theorem 11.1, so does the in-
A
+oo +oo
tegral f f(x) dx; this implies that the integral J f(x) dx also diverges. ►
A a
+oo
x-2
Example. Investigate the integral } 3z dx.
x +x +2x+5
3
◄ For x ~ 3 we have
X - 2 X 1
0 < ~ -2 ~ - - - - <3- = -
x3 + x + 2x + 5 x x2 •
+ 00
The integral }
3
!~ converges smce "' = 2 > I; hence so does the
original integral. ►
+ 00
Theorem 11.4. If the integral J lf(x)I dx converges so does the integral
+00 a
j f(x) dx.
a
+ 00
◄ Let J lf(x)I dx converge, i.e.,
a
b
lim
b-++oo
J lf(x)I dx = L <
a
+ oo.
Since given any x in the domain of f(x) there holds
- lf(x)I ~ f(x) ~ lf(x)I
then
0 ~ lf(x)I + f(x) ~ 2lf(x)I.
+00
The integral J lf(x)I dx converges by the hypothesis; hence, so does
a
+00 +00
the integral j 2lf(x)I dx =2 j lf(x)I dx. Then by virtue of (•) Theo-
a a
+00
rem 11.1 yields that the integral J (f(x) + lf(x)I) dx converges. This means
b a
that the integral j (f(x) + lf(x)I) dx has a finite limit as b ~ + oo.
a
Evidently, we have for all x ~ a
f(x) = (f(x) + lf(x)I) - lf(x)I.
Whence for any b > a there holds
b b b
J f(x) dx = J (f(x) + lf(x)I) dx - J lf(x)I dx.
a a a
Both integrals on the right have finite limits as b ~ + oo. Hence, the integral
b + oo
J f(x) dx also has a finite limit as b ~ + oo, i.e., the integral J f(x) dx
a a
converges. ►
+ Theorem 11.1 and the results concerning the integrals of the form
I ::
00
leads to the following convergence test for the integral +r f(x) dx.
1 Theorem 11.5. Let there exist a number a > I such that for all suffi-
ciently large x there holds
M
lf<x>I ~ -
Xa.
,
11.3 Absolutely Convergent Improper Integrals 517
+oo
where Mis independent of x and M > 0. Then the integral J f(x) dx con-
a
verges absolutely.
◄ Suppose that the above condition holds for all x ~ A > max {a, 0}.
+ 00
+ 00
= cos 1 j cosx
----,2=--
X
dx.
1
+Joo COS X
The integral - ~ - dx converges absolutely; hence, it is convergent.
x2
1
Thus both expressions on the right of the above identity are finite so that
(1) the method of integration by parts is justified and (2) the integral
+oo
+ 00
implies that
b b b
j lsinxl
x dx ~ 2
I
Jf dxx _ 21
Jf cos 2x
x dx.
1 1 1
+ 00 b
J ~
The integral diverges, for lim
00
b-- + oo Jf
1
dx - + oo. Applying in-
X
. by parts to t h e 1ntegra
tegrat1on . 1 ~ -
cos - dx, we 1n1er
-2x . c t h at 1t
. converges.
X
1
Now we turn our attention to Dirichlet's convergence test which speci-
fies the sufficient conditions for the improper integral to converge.
Theorem 11.6 (Dirichlet's convergence test). If a function is continuous
and has a bounded antiderivative F(x) for x ~ a_ and if a function g(x)
is continuously differentiable and monotonically decreases for x ~ a and
+oo
if Iim g(x)
x-+ +oo
= 0, then the integral I f(x) g(x) dx converges.
a
The proof of this theorem can easily be found elsewhere in the relevant
literature.
By way of illustration we apply Dirichlet's convergence test to the in-
+joo sin X
tegral --dx,
Xa
a> 0.
1
Cauchy Principal Value 519
◄ The function f(x) = sin x has the bounded antiderivative F(x) = - cos x
for all x; g(x) = 1/xa (a > 0) is a continuously differentiable function for
x ~ 1 and g(x) monotonically decreases. and tends to zero as x ~ co. Hence,
all the conditions of Theorem 11.6 are satisfied so that the integral in ques-
tion converges. ►
+ 00
Problem. Show that the Fresnel integral 1sin x 2 dx converges. (Hint.
0
Use the substitution x 2 = t .)
b b
1f(x)dx = lim
a-+ -oo
) f(x) dx
-oo a
b
and say that 1f(x) dx converges if this limit exists; otherwise the improper
-00
or
+ 00 NZ
1f(x)dx = lim
N1-++oo 1 f(x) dx,
- 00
N2-+ + oo -Ni
+oo
i.e., when N1 = N2 = N. Then we say that the improper integral I /(x) dx
-oo
converges in the sense of the Cauchy principal value.
+ 00
Example. Investigate the integral J xdx
1 + x2 •
-oo
520 11. Improper Integrals
◄ We have 1
N1
xdx
-----=-
I + x2
1 1 + N;
= - I n - - ~ whence it follows that the in-
2 1 + Nf '
tegral 1
-Ni
---=-
xdx
1 + x2
has no limit as N1 ➔ + oo and N2 ➔ + oo so that the
+ 00
integral J -1xdx
- - diverges. On the other hand we have
+x 2
- 00
+ 00 N
xdx xdx 1 1 + N2
C.p.v. - lim = -In = 0.
-
J
00
1 + x2 N-+ + oo J
-N
1 + x2 2 1 + N2
Hence, the integral in question converges in the sense of the Cauchy prin-
cipal value. ►
0
1
◄ The integrand f(x) = ----;===- 1s a continuous function on any
✓ 1 - x2
[0, 1 - e] where e > 0. Hence it 1s integrable. On the other hand
!J
0 a b-~ b x
Fig. 11.3
so that the improper integral in question converges. Notice that the substi-
11"/2
This integral is called convergent if the finite limit on the right exists; other-
wise it is divergent.
1
dx
Example. Investigate the integral )
0
Xl -a 1 EI - a
1
1- a e 1- a 1- a
then
1
lim
£-+0+0
r
J
dx
------
1
1
1
dx
If a > 1 the integral - - has no finite limit as e -+ 0 + 0.
xa
e
Let a = 1. Then
1 1
~
dx
) = ) - In e -+ + oo as e -+ 0 + 0.
X
e e
)
0
dx [converges
xa diverges
if
if
a < 1,
a ~ 1.
11.5 Improper Integrals of Unbounded Functi9_n--'--s_ _ _ _ _ _ _ _ _ 523
lim (
t;l---+0+0
CrJel f(x) dx + ~J f(x) dx]
t: 2 ---+0+0 a c+ t:2
= lim
t:1 ->O+O
(c-t,J f(x) dx +
c+e2
1
J
f(x) dx].
e2-> 0 + 0
Notice that the limit on the right must exist as e1 and e2 tend inde-
pendently to zero.
Sometimes the improper integral thus defined has no meaning while
there exists its Cauchy principal value given by the formula
C.p.v.
b
J f(x) dx = e~f?lo
[ c-e
1 f(x) dx + b ]
J f(x) dx ,
a a c+e
where e > 0 is the same in both integrals on the right. In this case we say
that the improper integral converges in the sense of its Cauchy principal
value.
b
dx
- - = (In Ix - cl)
x-c a
a
b
b - c e1
+ (In Ix - cl) = I n - - - + In - .
c - a e2
The limit of the right-hand side as both e1 and e2 in an arbitrary manner
tend to zero does not exist. Put e1 = e2 = e. Then as e --+ 0 + 0 the limit
of the right-hand side exists and is equal to the Cauchy principal value
526 11. Improper Integrals
Exercises
Using the definition of the improper integral determine whether or
not the following integrals converge
+ 00 + 00 + 00
+
xdx
. x + I
2 00
dx
1. dx. J dx.
J
0
x4 + I
2.
J
1
X
3 3.
0
X e-xl 4.
J2 xlnx
xdx + I
X x 2 dx
5. 6. dx. 7.
J 1
x3 + x + I · J
1
x 2 +x+5 J
0
x4 + x 2 + I
2
+ 00 - 00 - 00
dx x tan - 1x dx
8.
J
1
1/x 2 + I
9.
J
0
1/x 4 + 1
dx. 10.
J
2
xlna X
(a is a real
number). 11.
+ 00
12. Evaluate the improper integral J x 2n + 1 e - xz dx where n is a positive
0
natural number.
Exercises 527
1 +co
dx tan - 1 x
23. . 24 . dx, a ~ 0.
J
0
ex - cosx J
0
Xa
+co +co
25.
Xa
- - ~ dx (a and (3 are real numbers). 26.
rx cosx
----dx.
J
0
1 + x 13 J
0
X + 1
Answer
not defined at the point x = 0 and is unbounded as x --+ 0 for sufficiently large a > 0. We
divide the interval of integration into two parts so that the first subinterval would contain
the point of discontinuity of f(x) at x = 0 and the second one would be appropriate to examine
the behaviour of f(x) as x --+ + oo. Consider, for example, the semiclosed intervals (0, 1) and
528 11. Improper Integrals
j
0
tan - Ix
x°'
dx = j
0
tan - x
1
x°'
dx +
J
tan - 1 x
x°'
dx.
Consider the first integral on the right. Recall that tan - 1 x ~ x as x ..... O; hence,
tan - 1 x 1
f(x) is similar to - - - near the point x = 0. This implies that for the integral
x°' X°' - I
I
j
0
tan - 1 x
- - - - dx to converge it is necessary that a - 1
x°'
< 1 or a < 2. The integrand
- I
f(x) = tanx" x of the second integral on the right behaves similarly to the function~
x°'
as x ..... + oo since tan - 1 x ..... '!!" as x ..... + oo. For this integral to converge it is necessary that
2
a > 1. Combining both conditions we infer that both integrals on the right converge only
if 1 < a < 2; this is the condition for the original integral to converge. 25. Converges if
a > -1 and {3 - a > 1. 26. Converges conditionally. Hint: apply the Dirichlet convergence
test.
Chapter 12
Functions of Several Variables
n
e(M',M")= ~(xl(-x/;)2.
k=l
lj
Fig. 12.1
X
Fig. 12.2
Fig. 12.3
12.1 Basic Notions and Notation 531
mean here a set of all points M(x1, x2 , •.. , Xn) in Rn such that
x?-e;<x;<x?+e;, e;>O, i= 1, 2, ... , n.
Putting n = 1, we obtain the e-neighbourhood Xo - e < x < Xo + e of
the point Xo well familiar to us from the calculus of functions of one varia-
ble; for n = 2 we get the plane figure bounded by the rectangle with sides
2e1 and 2e2 (sides are excluded, Fig. 12.3) and for n = 3 we have the (open)
solid bounded by the parallelepiped with centre at Mo(Xo, Yo, zo) and edges
2e1, 2e2 and 2e3 (Fig. 12.4).
I eM0
)---------
/
/
/
/
/
Fig. 12.4
Fig. 12.5
2 X
lj
In symbols we write
A = lim f(M) or A = lim f(x, y ).
M-+Mo x-+xo
y->yo
Fig. 12.8
Whence
2k
f(x, kx) ~ 2 as x ~ 0
1+ k
so that for different values of k the limiting values of the function are
different. This means that the given function has no limit at the point
0(0, 0). ►
2
(3) The function f(x, y) = 4x y 2 is defined everywhere on the xy-
x +y
plane except at the origin 0(0, 0) of coordinates. Again we investigate the
behaviour of this function provided that (x, y) approaches zero along the
lines y = kx, x -;t:. 0.
◄ We have
kx 3
f(x, kx) = 4 2 2 , x ;;t 0
X + k X
so that f(x, kx) ~ 0 as x ~ 0. Hence, f(x, kx) has a limit equal to zero
for any line y = kx, i.e., for any line along which the point tends to the
origin of coordinates. If we put y = x 2 then f(x, x2 ) = 1/2, x ,;t 0. This
means that the limit exists when a point tends to the origin by moving
along the parabola y = x 2 • However this limit is equal to 1/2. Thus the
given function has no limit at the point 0(0, 0). ►
Theorem 12.1. Let f(M) and cp(M) have limits at the point Mo. Then
at Mo there exist the limits of the sum f(M) + cp(M), the difference
/(M) - ,p(M), the product f(M),p(M) and the quotient ;~~ (provided
that lim 'P(M) -;t:. 0) and
M-+Mo
lim /(M)
M-+Mo
lim
M-+Mo
/(M)
'{)(M) lim '{)(M)
M-+Mo
(J~~o cp(M) '#:- O).
It is sometimes helpful to use the following definition of the limit which
is equivalent to the preceding one.
Definition. Let a function /(M) be defined in some deleted neighbour-
hood O of a point Mo, i.e., in some neighbourhood of Mo which does not
contain Mo. The number A is called the limit of the function /(M) at Mo
536 12. Functions of Several Variables
This means that the limit of f(x, y) depends upon the order in which we
tend the variables to their limiting values.
Definition. Let a function f(M) be defined at a point Mo(Xo, Yo) and
in some neighbourhood n of Mo. The function f(M) is called continuous
at Mo(Xo, Yo) if lim f(M) = f(Mo) or, equivalently, if lim f(x, y) =
M--+ Mo X--+ Xo
y->yo
f(Xo, Yo). We assume here that the point M(x, y) approaches Mo(Xo, Yo)
in an arbitrary mode, being always contained in the domain of f(M).
Using the "eo" definition of continuity of a function at a point we
say that a function f(M) defined at the point Mo and in some neighbour-
hood O of Mo is continuous at Mo if given any e > 0 there exists o > 0
such that for all points ME O there holds lf(M) - f(Mo)I < e whenever
e(M, Mo)< o.
We can also give slightly different definition of continuity of a function
at a point Mo. We let AX and .dy denote the increm~nts of the independent
variables x and y in moving from the point Mo(Xo, Yo) to the point
M(x, y). Let .dz = f(Xo + AX, Yo + .dy) - f(Xo, Yo) be the increment in z =
f(x, y) correspondng to AX and .dy. Then the expression
lim f(x, y) = f(Xo, Yo)
x-+ Xo
y-+yo
becomes equivalent to
lim ~ = 0,
.:1x--+ 0
.:1y --+ 0
12.2 Limits and Continuity 537
lj
0 X
Fig. 12.9
The increment
Axz = f(x + AX, y) - f(x, y)
is called the partial increment in z caused by the increment AX in x.
We let ~ denote the ratio of the partial incn;ment in z to the respec-
tive increment in x; clearly, this ratio is a function of AX.
Definition. If the ratio ~ has a finite limit as AX ➔ 0 this limit is
called the partial derivative of the function z = f(x, y) at the given point
(x, y) with respect to the independent variable x.
We shall denote the partial derivative of z = f(x, y) with respect to x
by writing
Therefore we put
az = lim .dxZ
ax ~x--o .dX
or, equivalently,
F'( . f(x + ax, y) - f(x, y)
Jx X, y) = 1Im ---------.
~x--o .dX
Analogously, we have
az = lim .dyZ = lim _fi_(x_,_Y_+_.d_J'_)_fi_(_x_,_Y_).
ay ~y--o .dy ~y--0 .dy
Let u = f(x1, X2, ... , Xn) be a function of n variables. Then
au
--=
. f(X1 ,X2 ,.,.,Xk - 1 ,Xk + Lll"k ,Xk + 1 , ••• ,Xn) - f(X1 ,X2 , ... ,Xk - 1 ,Xk ,... ,Xn)
1I m------------------------.
axk ~Xk _. O .dXk
Noting that .dxZ and .dyZ are computed by regarding x and y, respective-
ly, as constants, we can give the following definitions of partial derivatives.
The partial derivative of a function z = f(x, y) with respect to the varia-
ble x is an ordinary derivative with respect to x computed by regarding
y as a constant; similarly, the partial derivative of z = f(x, y) with respect
to y is an ordinary derivative with respect to y computed by regarding x
as a constant. Whence it follows that ordinary and partial derivatives are
subject to the same laws of differentiation.
Example. Compute the partial derivatives of the function z = eXJ'.
◄ We have
-
az = yeXJ' and -
az = xeXJ' ►
ax ay
Remark. If a function z = f(x, y) has partial derivatives with respect
to every varia~le at some point, z = f(x, y) is not necessarily continuous
at this point. For example, the function
xy 2 2
2 2 ' X + y ;e 0,
[
X +y
f(X, y) = 0, X=y = 0
is not continuous at the point 0(0, 0) but has both a partial derivative
with respect to x and that with respect toy sincef(x, 0) = 0 andf(0, y) = 0
so that -
aJ = 0 and - aJ = 0.
ax (0,0) ay (0,0)
540 12. Functions of Several Variables
Fig. 12.10
or
a ax+ (3 J).y = ee,
ax J).y
where e =a -- + (3 - - depends upon ax and J).y and tends to zero as
e e
Ax -+ 0 and J).y -+ 0 or, using the abbreviated notation, as e -+ 0.
Thus formula (*) expressing the condition for the function z = f(x, y)
to be differentiable becomes
i).z =A ax + B J).y + e e,
where e = e(e) -+ 0 as e -+ 0. For instance, turning to the previous example,
we have .dZ = 2x Ax+ 2y J).y + (ax)2 + (J).y) 2 = 2x ax+ 2y J).y + e2
where e(e) = g.
Necessary conditions for a function to be differentiable. We shall prove
the following important theorems.
Theorem 12.4. If a function z = f(x, y) is differentiable at some point
then f(x, y) is continuous at this point.
◄ Indeed, if z = f(x, y) is differentiable at a point (x, y) then the incre-
ment in z at (x, y) corresponding to the increments ax and J).y admits a
representation of the form
J).z =A ax + B J).y + a ax + (3 J).y,
where A and Bare constant at (x, y) and a-+ 0 and (3-+ 0 as ax-+ 0 and
J).y -+ 0, whence lim J).z = 0. This means that the function z = f(x, y) is
~x-> 0
~y _,. 0
continuous at the point (x, y). ►
Theorem 12.5. If a Junction z = f(x, y) is differentiable at a point then
. 1d .
Ji(x, y ) h as partla .
erzvallves az az at th.ts pomt.
ax an d ay .
◄ Indeed, let z = f(x, y) be differentiable at a point (x, y). Then the incre-
ment J).z corresponding to the increments ax and J).y can be written as
.dZ = A ax + B J).y + a(ax, J).y)ax + (3(ax, -1).y)J).y.
Put ax ;c 0 and J).y = 0. Then the above formula gives
J).xz =A ax + a(AX, 0)ax.
Whence
i).x Z
ax = A + a(ax, 0).
A .
11m i).x Z
= --.
Ax-+O ax
12.3 Partial Derivatives and Differentials 543
This implies that at the point (x, y) there exists a partial derivative of
z = f(x, y) with respect to x and
az
ax
=A
.
By similar reasoning we infer that at the point (x, y) there exists a partial
derivative of z = f(x, y) with respect to y and !~ = B. ►
It is worth mentioning here that Theorem 12.5 states that there exist
partial derivatives at the point (x, y ); however, it tells nothing about con-
tinuity and the behaviour of the partial derivatives at (x, y) and in a neigh-
bourhood of the point (x, y).
From Theorem 12.5 it follows that
az az
dZ = ax .6.x + ay dy + a AX + {3 dy.
dz = I 2y dy = dx + 2y dy
-----,2=- dx + 2 2 ►
x+y x+y x+y
Analogously, if u = f(x1, x2, ... , Xn) is a differentiable function of n
independent variables then
12.4 Derivatives of Composite Functions 545
The relation
dx z = f;(x, y) dx
is called the partial differential of a function z = f(x, y) with respect to
the variable x, and
dyz = J;, (x, y) dy
is called the partial differential of z = f(x, y) with respect to the variable y.
Using the above formulas for dz, dxz and dyz, we can write the differen-
tial of z = f(x, y) as
dz = dx z + dy z,
i.e., the total differential of z = f(x, y) is the sum of partial differentials.
Notice that in general the total increment Az in z = f(x, y) is not equal
to the sum of the increments in x and y.
Suppose that a function z = f(x, y) is differentiable at the point
(x, y) and dz '#- 0 at (x, y). Then the total increment
az az
Az = ax .:ix+ ay Ay + a(Ax, Ay).:ix + {3(Ax, Ay)Ay
az az
dz= ax ~ + ay ~y + a~+ (3 dy,
where a(~, ~y) and /3(~, ~y) tend to zero as ~--+ 0 and dy--+ 0. If
we put a(0, 0) = 0 and {3(0, 0) = 0 for ~ = dy = 0 then a(~, dy) and
{3(~, ~y) will be continuous as ~ = dy = 0.
Consider
~z az ~ az ~Y ~ dy
dt = ax M + ay M + a -z;F + f3 -z;F· (*)
Both factors of each summand on the right have limits as ~t --+ 0. In-
. I d envatlves
d eed , t he part1a . . az an d ay
ax az are constant at t h e given
. .
point
(x, y). By the hypothesis there exist limits
. -
11m ax- = -
dx = c,o , ( t ) an d 11m
. -~Y
- = - dy = .,. ,( )
.,,. t .
~t-+O ~t dt ~t-+O ~t dt
Since ddx and ddy exist at a point t the functions x = cp(t) and y = t/;(t)
t t -
are continuous at this point. This implies that as ~t--+ 0 both ~ and ~Y
tend to zero and, hence, so do a(~, ~y) and /3(~, ~y).
Thus the right-hand side of (*) has a limit as ~t --+ 0 equal to
:: : + :; dt .
This means that the left-hand side of(•) also has
. · as JJ.t--+
a I1m1t A Osot hat t here exists
· 1·1m ~~z = -d.
dz · EvaIuat1ng
· t he 1·1m1ts
·
~t- O JJ.t t
in (•) as dt ~ 0, we obtain
dz = az dx + az dy ►
dt ax dt ay dt .
12.4 Derivatives of Composite Functions 547
and dz = az + az dy = _ y x 2x 1 ►
dx ax ay dx x2 + y + x2 + y
2 2 = 1+ x2 .
Now we draw our attention to differentiation of a composite -function
of several variables. Consider the function z = f(x, y) where x = c,o(I;, 17)
and y = i/;(t 11) so that z = z(t 11) = /(c,o(I; 11), 1/;(€, 11)). Let continuous par-
. l d . . ax ax ay d ay .
tla envat1ves af' a11 , af an ¼ exist at a ·point (t 11) and let
f(x, y) be differentiable at the corresponding point (x, y) where x = c,o(t 11)
and y = if;(t 11). 0
We show that the composite function z = z(I;, 11) has the derivatives 0~
and :: at the point(~. 71) and derive the respective formulas. Notice that
this case is almost similar to that considered before. Indeed, on differentiat-
ing z with respect to I; the variable 17 is regarded as a constant; this implies
that x and y become functions of one variable E, i.e., x = ',O(I;, c) and
y = if;(~, c) so that formula (••) is fully applicable here.
Using (••), we obtain
oz oz ax oz oy
a1; = ax a1; + ay a1; ·
35*
548 12. Functions of Several Variables
Analogously
az az ax az ay
-
a,,, =ax- -
a +ay- a- •
11 11
au = aJ + aJ az and au = aJ + . aJ az
ax ax az ax ay ay az ay .
dz
az
= -ax dx + -
az dy.
oy
Comparing the above formula with (*) we infer that the total differential
of the function z = f(x, y) is expressed by formulas of the same form when
the variables x and y are independent and when x and y are functions of
some other variables. Thus the form of the expression for the total differen-
tial of a function of two and more variables remains the same (invariant)
in both these cases.
Remark. Based upon the previous results it is easy to verify that the
differentiation formulas
d(x ± y) = dx ± dy,
d(xy) = xdy + ydx,
d(x) = ydx - xdy.
y y2
remain valid for x and y being differentiable functions of any finite number
of variables, i.e., for x = cp(t 71, !;, ... ) and y = t/;(t 71, !; , ... ).
Yo such that the point (Xo, Yo) satisfies the original equation, then the line
z = y and the curve z = Xo + e sin y intersect at a unique point and vice
versa. Let us draw the graphs of z = y and z = Xo + e sin y in the zy-plane
(Fig. 12.11).
The graph of z = x + e sin y where x is thought of as a parameter is
obtained by translating the graph of the curve z = e sin y along the z-axis.
It is easy to see that the graphs of z = y and z = x + e sin y meet at the
only point whose ordinate y is a function of x implicitly specified by the
equation y - x - e sin y = 0, 0 < e < 1. This relationship can not be ex-
pressed in elementary functions. ►
z = x0 + e: s[n y
z = c sin y
Fig. 12.11
(ii) the function F(x, y) vanishes at the point (Xo, Yo) so that
F(Xo, Yo) = 0;
... ) there exist
(111 . contmuous
. . I d envatlves
partla . . aF
ax an d aF . D;
ay m
Then given any sufficiently small positive number e there exists a neigh-
bourhood Xo - 0o < Xo < Xo + 0o of Xo, 0o > 0, where there exists a unique-
ly defined *> continuous function y = f(x) (Fig. 12.12) such that Yo = f(Xo),
IY - Yol < e and the equation F(x, f(x)) = 0 becomes an identity, 1.e.,
F(x, f(x)) = 0
for all x in the given neighbourhood of Xo-
I
I
l(ro,Yo}I
I I
I I
I I
I
Fig. 12.12
•> We regard y = f(x) as "uniquely defined" in the sense that the coordinates of any
point which lies on the curve F(x. y) = 0 and is contained in the neighbourhood
0 = (Xo - &i < x < Xo + &i, Yo - e < y < Yo + e) of the point (Xo, Yo) are related by the
equation y = f(x).
12.5 Implicit Functions 553
-dF(x,
- dx - - = -aF
-y(x)) -
ax
aF dy
+ -- --
ay dx.
Therefore for y = f(x) we get
aF + aF dy =O
ax ay dx .
Whence
dy
dx
Suppose that <P(t), V1(t) and w(t) have continuous derivatives <P' (t), VI' (t)
and w '(t) at every t such that a < t < {3. We leave aside singular points
of L where cp' 2 (t), + VI ' 2 (t) + w ' 2 (t) = 0 and consider a regular
point Mo(Xo, y 0 , Zo) on L specified by the value to oft, to E (a, /3). Then
the vector -r ;::: x' (to)i + y' (t0 )j + z' (to)k lies on the tangent to the curve
L at the point Mo(Xo, Yo, Zo).
Now we choose a regular point P of a surface S and draw through P
a curve L lying in S. Let the curve be given, as before, by the parametric
equations
X = <P(t),
[ y = i/; (t), a < t < {3
z = w(t),
and let <PU), V1(t) and w(t) have continuous derivatives that nowhere in a(/3)
vanish simultaneously.
By the definition the tangent to L at Pis called the tangent to the sur-
face S at P.
If the above parametric equations are substituted into the equation
F(x, y, z) = 0 of the surface S the latter becomes an identity with respect
to t so that F(<P(t), 1/;(t), w(t)) = 0 since the curve L lies in the surface S.
Differentiating this identity as a composite function of t, we get
aF dx + aF dy + aF dz =O
ax dt oy dt oz dt .
The expression on the left is a scalar product of the vectors
oF . aF . aF k d dx . dy . dz k
n = ax I + oy J + az an T = dt I + dt J + dt .
The vector -r is a tangent vector to the curve L at the point P. The vector
n being independent of the shape of the curve passing through the point
P depends only on the coordinates of P and on the shape of the function
F(x, y, z).
Since P is a regular point the length of n
the tangent plane F(x, y, z) = 0 at the point P. Whence we get the equation
of the tangent plane to the surface F(x, y, z) = 0 at the regular point
Po(Xo, Yo, Zo) as
(Xo,Yo)
(x - Xo) + (!1 )
Y (Xo,Yo)
(y - Yo).
(Xo,Yo>
~+ ( a/)
ay (Xo,Yo)
Example. Write down the equations of the tangent plane and the normal
to the surface z = x2 + y 2 at the point 0(0, 0, 0).
◄ We have /(x, y) = x2 + y 2 ·so that :~ = 2x and :; = 2y; these
derivatives vanish at the point (0, 0), i.e., J;(o, 0) = J;(o, 0) = 0. Then the
equation of the tangent plane becomes
z- 0 = 0(x - 0) + 0(y - 0),
i.e., z = 0 (the tangent plane is the xy-plane).
The equations of the normal are
x-0
0
-
y-0
0
z-0 or
1
[xy=0
=0
The derivatives J; and J;: are called the mixed partial derivatives; the
former is computed by differentiating the given function first with respect
to x and then with respect toy and the latter is computed by differentiating
the function first with respect to y and then to x.
The partial derivatives of the third and higher orders can be defined
in a similar way.
Example. Compute the first and second derivatives of the function
z = x3y2 - xy3.
◄ We have
a 2z 2 2 a 2z 2 2 ►
ay ax = 6x y - 3y ' ax ay = 6x y - 3y .
X=y=O
have mixed partial derivatives J; and J;~ which are not continuous at the
point 0(0, 0) so that J;(o, 0) = -1 and J;~(0, 0) = I.
In general, the mth mixed partial derivatives (m ~ 2) of a function
u = f(x1, xi, ... , Xn) have the same value at a given point if they are con-
tinuous at this point and are distinct from each other by the order in which
the function is differentiated with respect to its variables.
Differentials of higher orders. Let z = f(x, y) be a function of variables
x and y defined on the domain D.
If z = f(x, y) is differentiable on the domain D then the total differen-
tial of z = f(x, y) at the point (x, y) E D corresponding to the increments
dx and dy in the variables x and y is given by the formula
az az
dz = ax dx + ay dy,
560 12. Functions of Several Variables
= d(;; ) dx + d ( :; ) dy.
Applying the formula for the total differential to aaz and az , we get
X oy
d( :; ) = :x (:: )dx + :y (:ndy = !) dx + a~ :x dy, 2
d=
2 a a
( -dx+-dy a2
) 2 =--dx+2
2
a
a2 dxdy+--
a2 dy 2 .
ax ay ax2 xay ay 2
Multiplying both sides termwise by z and inserting z into the nominators
of the "fractions" on the right, we get the formula previously deduced,
namely
a2 z a2z a2-
z 2
d 2z = - - 2 dx 2 + 2--dxdy + - 2 dy.
ax axay ay
Formulas for differentials of the third, fourth, etc., orders can be derived
in a similar way. In general, the total differential of the nth order denoted
by dnz is the total differential of the total (n - l)th differential so that
dnz = d(dn - 1 z).
If z = f(x, y) E cn(D) has continuous partial derivatives up to the nth
order derivatives then there exists the total differential of the nth order
given by the formula
36-9505
562 12. Functions of Several Variables
Fig. 12.15
+ <P(n)~(Jt) tn,
n.
where O < (} < 1.
Put t = 1. Then
<P>
'P(O)
a Ax + aya Ay)pf(x, y)
= ( ax x = Xo' p = 0, 1, ... , n - 1.
Y = Yo
We put t = 0. Then
1 ( a
+ . . . + (n - 1)! ax AX +
a
au Ay
)n- f(x, y)
1
x = Xo
J y = Yo
1 ( a
+ n! a
ax AX + ay Ay f(x, y)
)n X = Xo + 0Ax'
y =Yo+ (J~
Rn
1 (
= nf a
ax AX+ -a-.dy
a )nf(x,. y) X = Xo + 8Jlx
y Y =Yo+ 8t:.y
f(Xo + ax, Yo + .1y) = f(Xo, Yo) + J; (Xo, Yo) AX + J;, (Xo, Yo) .1y
1 1 1
f(x, y) = (1 - x)(l - y) - 1- x 1-y
= (1 + x + x 2 + o(x2 ))(1 + y + y 2 + o(y2 ))
= 1 + x + y + x2 + xy + y 2 + o(e 2 ),
where e2 = x2 + y 2 • ►
566 12. Functions of Several Variables
z
z
!I
-
_.-
1
--- -
.r Fig. 12.18
IJ=Yo
Fig. 12.19
Fig. 12.20
Points where az/ax and az/ay are zero or do not exist are called the
critical points of the function z = f(x, y), points where az/ax and az/ay
are zero are called its stationary points.
The theorem only gives the necessary conditions for an extremum. For
example the function z = x2 - y 2 has derivatives az/ax = 2x, ozloy = - 2y,
which become zero at x = y = 0. But at 0(0, 0) this function has no ex-
tremum.
12.9 Extrema of a Function of Several Variables
-----------------~---------~--~--------
569
A/= ; (A ~ 2 + 2B ~ Ay + C Ay 2 ) X = Xo + 8/lX"
Y =Yo+ 8~
If at the stationary point (Xo, Yo) AC - B 2 > 0 and A > 0 (C > 0), then
for all sufficiently small jaxj and j..1.yj we have
..1./ = /(Xo + ~, Yo + ..1.y) - /(Xo, Yo) ~ 0,
and so at (Xo, Yo) the function f(x, y) has a minimum. ►
Examples. (1) Examine for extremum the function
z = x2 + 2y 2 - 2x + 4y - 6.
◄ Using the necessary conditions for an extremum, we find the stationary
points. For this purpose, we find the partial derivatives oz/ax and oz/oy
and equate them to zero.
We obtain the system of equations
oz
-=2x-2=0
ax '
oz
T = 4y + 4 = o.
Hence x = 1, y = -1, so that Mo(l, -1) is a stationary point.
We now make use of the Theorem 12.12.
We have
A
o2 z = 2, B
Mo - ox2 Mo Mo
= -a z
2
C 2 = 4,
Mo qy Mo
C
az2
=0
- ayz '
Mo Mo
A
a2 z = 0, B
Mo - ax2 Mo Mo
C
a2 z =0
Mo - ayz Mo
If the equation of the curve Lis cp(x, y) = 0, then the problem on find-
ing a conditional extremum of z = f(x, y) of the curve L can be formulated
as follows: find extrema of the function z = f(x, y) in D under the condi-
tion that cp(x, y) = 0.
Therefore, in finding conditional extrema of the function z = f(x, y)
the arguments x and y can no longer be regarded as independent variables.
They are connected by the relation cp(x, y) = 0, which is known as a con-
straint equation.
To clarify the difference between unconditional and conditional extrema
we will take an example. The unconditional maximum of the function
z = l - x2 - y 2 (Fig. 12.22) is equal to unity and it is achieved at the point
(0, 0). Corresponding to it is the point M, i.e., the vertex of the paraboloid.
We now add the constraint equation y = l/2. The conditional maximum
will, clearly, be equal to 3/4. It is achieved at the point (0, 1/2), and cor-
57 4 12. Functions of Several Variables
Fig. 12.23
:: = J;(x, y) + A <P;(x, y) = 0,
!~ = J;(x, y) + A,p;(x, y) = 0,
aF
aA = <P(x, y) = o,
from which we find the values A and the coordinates x and y of the possible
extremum points.
The question of the existence and nature of a conditional extremum
is solved by examining the sign of the second differential of the Lagrange
function
From the first two equations of the system we will obtain x = y, and then
from the third equation of the system (constraint equation) we find
x = y = 1/2 (the coordinates of a possible extremum). It appears that
A = -1. The Lagrange function will thus be
F(x, y; -1) = x2 + y 2 - x - y + I.
2 0
For it F;~ = 2, F;; = 2, F;/y = 0, so that D = = 4 > 0 and
0 2
F;~ = 2 > 0, i.e., the point Mo(l/2, 1/2) is a conditional minimum of the
function z = x2 + y 2 subject to the condition x + y = I. ►
The fact that there is no unconditional extremum for the Lagrange func-
tion F(x, y) does not yet imply that there is no conditional extremum for
the function f(x, y) in the presence of the constaint <P(X, y) = 0.
Example. Find an extremum of the function z = xy subject to the condi-
tion y - x = 0.
◄ We write the Lagrange function
F(x, y; A) = xy + A(Y - x)
and also a system to find A and the coordinates of possible extrema:
F; = y - A= 0,
F; = x +A= 0, (**)
F{ = y - x = 0.
From the first two equations we find x + y = 0 and obtain the system
fx + y = 0,
l_y - x = 0,
whence x = y = 0. Then, A = 0. The Lagrange function will thus be
F(x, y; 0) = xy.
At the point (0, 0) F(x, y; 0) has no unconditional extremum; however,
z = xy has a conditional extremum when y = x. Indeed, we then have
z = x2, and so at the point (0, 0) there is a conditional minimum. ►
37-9505
578 12. Functions of Several Variables
where m < n.
We form the Lagrange function
F(x1, X2, • • . , Xn) = f(x1, X2, ... , Xn) + At '{)I (x1, X2, ... , Xn)
+ A2'{)2(X1, X2, ... , Xn) + ... + Xm'Pm(X1, X2, ... , Xn),
where A1, A2, ... , >w, are constant multipliers to be found.
Equating to zero all the partial derivatives of the first order of F and
attaching to the resultant equations the constraint equations (** ), we will
obtain a system of n + m equations from which we find A1, A2, ... , >w,
and the coordinates x1, x2, ... , Xn of the possible points of conditional
extremum. The question of whether or not the points obtained by the
Lagrange method are conditional extrema can often be solved from physical
or geometrical considerations.
Absolute maximum and minimum of a continuous function. We would
like to find the largest or smallest value of a function z = J(x, y) (i.e., its
absolute maximum or minimum), the function being continuous on some
closed region D. By Theorem 12.3 in this region there is a point (Xo, Yo)
at which the function attains an absolute maximum (minimum). If the point
(Xo, Yo) Ii~within the region D, then the functionfhas a maximum (mini-
mum) in D, so that in this case the point of interest is among the critical
points of f(x, y). However, the function may also attain its absolute maxi-
mum (minimum) at the boundary of the region.
Therefore, to find the absolute maximum (minimum) of the function
z = f(x, y) in the bounded closed region D we will have to find all maxima
(minima) of the function within the region, and also at the boundary of
the region. The largest (smallest) of these numbers will be the desired abso-
lute maximum (minimum) of z = f(x, y) in D. We now turn to a differentia-
ble function.
12.9 Extrema of a Function of Several Variables 579
Fig. 12.24
37*
580 12. Functions of Several Variables
Exercises
Find the domains of the following functions:
1. z= sin- 1 ~ + -Jxy. 2. z = ✓ 1 - x2 + ✓ 1 -y2 . 3. z= ✓sin(x2+y 2 ).
4. z =
x-y
1 + !Y . 5. z = In (x 2 + y). 6. z = xy + Vx2 + y 2 - R2 +
/In 2R
2
2 . 7. z = cot 7r(X + y). 8. (a) z = ✓sinx siny ; (b) z =
✓ X +y
✓ sin x - 1 + ✓ sin y - 1 .
(k = const)
1
x2 _ Y2 . - xz + y2
17. (a) lim x2 + Y2 , (b) lim e
x-+O x-+O x2 + y2 .
y--+O y--+O
Exercises 581
x+y
18. Show that the function z = - - - has no limit as x --+ 0 and y --+ 0.
x-y
Consider the behaviour of the function on the straight lines y = kx.
Find the sets of discontinuity points for the following functions.
19. (a) z= 2
2
2 ; (b) z = In✓ x2 + y 2 •
X +y
1 1
20. (a) z = --------,=---------,,- ;
(b) z = (x _ y )2 .
1 - x2 - Y2
1
21. (a) z = cos - ; (b) z = xy.
xy
1
22. z = 2 2 •
sin 1rx + sin 1ry
Find partial derivatives of the functions and their total differentials:
23. z = x3 + y 2 - 2.xy. 24. z = tan - 1 yx .
25. z = e-xly_ 26. z = In (x + lny).
27. u = xy + yz + xz. 28. u = ✓x 2 + y 2 + z 2 •
(b) z = x2 + y 2, where x = u + v, y = u - v. . az
Find iJu and av .
az
Using the formula for the derivative of a composite function of two
variables, find :; and :; of the functions:
34. (a) z = f(u), where u = sin - 1 xy + yx ,
. X
(b) z = f(u), where u = sin - + etanzy.
y
582 12. Functions of Several Variables
37. In tan Y Y = b.
X X
38 • X
2
y + .
Sill
- 1 X
- + -1 = 0.
y y
40. Find the slope of the tangent to the curve x 2 + y 2 = lOy at its intersec-
tion with the straighf line x = 3.
41. Find the points where the tangent to the curve x 2 + y 2 + 2.x -
2y - 2 = 0 is parallel to the x-axis.
. az az
Find ax and ay :
42. X COS y +yCOS Z + Z COS X = 1.
x2 Y2 z2
43. -2 + -2 -2 + = 1.
a b c
Write the equations of the tangent plane and normal to the surfaces:
44. z = x2 + 2y 2 at the point (1, 1, 3).
x2 Y2 z2 .
45. - 2 +- 2 +- 2 = 1 at the point (Xo, Yo, Zo).
a b c
46. z = sin x cosy at the point ('1r 14, 1r/4, 1/2).
47. z = x2 + y 2 + 2.xy at the point (1, 1, 4).
48. x2 + y 2 + xyz - 3 = 0 at the point (1, 1, 1).
49. Form the equations of the tangent planes to the surface x2 + 2y 2 +
3z 2 = 21 parallel to the plane x + 4y + 6z = 0.
Find three or four first terms in the Tuylor expansions of the functions:
50. f(x, y) = ex cosy in the neighbourhood of the point (0, 0).
51. f(x, y) = ex In (1 + y) in the neighbourhood of the point (0, 0).
52. f(x, y) = x" in the neighbourhood of the point (1, 1).
53. f(x, y) = tan - 1 t-
Y in the neighbourhood of the point (0, 0).
+ xy
54. f(x, y) = ex+y in
the neighbourhood of the point (1, -1).
Using the definition of the extremum of a function examine for extre-
mum the foil owing functions:
55. z = 1 - (x - 2)4 - (y - 3)4 at the point (2, 3).
56. z = (x - 2)4 + (y - 3)4 at the point (2, 3).
57. z = x4 - y 4 at the point (0, 0).
Exercises 583
Answers
0 ~ X ~ 2, [- 2 ~ X ~ 0,
1. [ and 2. The square formed by the segments of the
y ~ 0, y ~ 0.
straight lines x = ± 1 and y = ± 1, including its sides. 3. The family of concentric circles
21rk ~ x2 + y 2 ~ (2k + l)1r, k = 0, 1, 2, .... 4. The entire plane save for the points on
the straight lines y = x and y = 0. 5. The part of the plane above the parabola y = -x2 •
6. The points on the circle x2 + y 2 = R 2 • 7. The entire plane save for the straight lines
x + y = n, n = 0, ± l, ± 2, ....
8. (a) r sin x ~ 0,
.
smy ~ 0 ,
which yields
sin x ~ 0,
or [ . which yields
smy ~ 0 ,
(2k - 1),r ~ x ~ 2k1r, k = 0, ±1, ±2, ... '
[
(2m - l)1r ~ y ~ 2m1r, m = 0, ±1, ±2, ... ,
The domain is the hatched squares (Fig. 12.25)
sin x - 1 = 0,
(b) [ . l
smy - = O, which yields
[
x, = ! + 2/c,r, k = 0, ±1, ±2, ... '
Ym =- + 2m1r, m = 0, ±1, ±2,
2
The function is defined at the points Mkm = (Xk, Ym), 9. (a) Straight lines parallel to the
line x + y = O; (b) concentric circles with centre at the origin of coordinates. 10. (a) Parabo-
584 12. Functions of Several Variables
!I
::r
Fig. 12.25
las y = Cx2; (b) parabolas y = Cvx. 11. (a) Parabolas y = C - x 2 (C > O); (b) Hyperbolas
xy = C, where ICI ~ 1. 12. Planes parallel to the plane x + y + z = 0. 13. For u > 0 one-
sheet hyperboloids of revolution about the z-axis; for u < 0 two-sheet hyperboloids of revolu-
tion about the z-axis; both families of surfaces are separated by the cone x 2 + y 2 -
z 2 = 0. 14. (a) -1/4; (b) 0. 15. (a) l; (b) 2. 16. (a) ek; (b) 0. 17. (a) no limit; (b) 0.
(1 + k) x l - k .
18. We put y = kx, then z = - - - - = ---, x ;c 0. Fork= -1 we have hm z = 0,
(1 - k)x 1+ k x-o
for k = 1/2 lim z = 3, and for k = 3 lim z = -2, so that the given function has no limit
at the point (0, 0). 19. (a) The point (0, O); (b) the point (0, 0). 20. (a) The discontinuities
form the circle x2 + y 2 = 1; (b) the discontinuities form the line y = x. 21. (a) The points
of discontinuities lie on the x- and y-axes; (b) 0 (empty set). 22. All points (m, n), where
. az . .2 az ·
m and n are integers. 23. ax = 3x- - 2y; ay = 2y - 2x; dz = (3x 2 - 2y) dx + 2(y -
x) dy. 24. -
oz
= -2-y -2; -az = - X
dz = y dx - X dy
25. -
oz
= - ! e - x/y ·,
2 ; 2 2 •
oy x +y oy X2 +y X +Y OX Y
!!z X e - x/y !iz 1 az 1
;y = y2 e-x/y;dz = y2 (-ydx + xdy).26. ;x = X + lny; oy = y(x + lny);
y dx + dy au au au du = (y + z) dx +
dz = Y (X + 1ny)" 27. - 0-X =y + z; -ay = X + z; -
oz
=x+y;
au X au y au
(x + z) dy + (x + y)dz. 28. ax = ✓x2 + Yz + z2 ; oy = ✓ xi + Yi + zz --
oz
Exercises 585
z x dx + y dy + z dz oz . . az
-~~~~~~~~~~; du=--~---_-_-_-_-_-_-_-_-- 29. - = smh (x 2y+smhy) 2xy;
✓ x2 + Yi + z2 ✓ x2 + Yi + z2 ax ay
sinh (x 2y+sinh y)(x 2 + cosh y); 2
dz = sinh (x y + sinh y)[2xy dx + (x + cosh y) dy].
2
O; - az = 1; (b)-
au
dx
az = 4u; -az = 4v. 34. (a)-=/
av
az
ox
, (u) ---;::===-+-
Y
✓ l _ x2y2
1] ;
Y
ax
l dx
yx!'- I - yx lny
~- -- 39. y' = - - - - . 40. At the point M1 ( - 3; 1),
(1 _ x2 y2)✓y2 _ xi + xy xyx- 1 -x!' In x
az
Y ' = 3/4; at the point M 2(3; 9), y' = -3/4. 41. M1(-l; 3), M2(-l; -1). 42. - ax =
1 1
51. Y + 21 (2xy - T) + 31 (3x2y - 3.xy2 + 2y 3 ). 52. 1 + (x - l)+(x - l)(y - 1) +
1 1 x-y
2 (x - 1) (y - 1). 53. Hint: Use the formula tan - - -- tan - 1 x - tan - 1 y. We
2
1 + xy
obtain: x - Y - 31 (x3 - y3) + _51 (x5 - ys). 54.1 + [(x - l)+(y+l)]+ [(x-l)+(y+l)]2 +
2!
38-9505
586 12. Functions of Several Variables
point (1, 0). 61. No extremum. 62. Zmin = 0 at point (1, 1/2). 63. Zmin = - 2 at point
e
( - 2, 0). 64. The absolute maximum z = 1 at the points (1, 0) and ( -1, 0). The absolute minimum
z = -1 at the points (0, 1) and (0, -1). 65. The absolute maximum z = 4 at the point
(2, 1). The absolute minimum z = -64 at the point (4, 2). 66. x = y =½v, ½v,
z = ~ Vzv. 67. Cube with a side a= j.
Appendix I
Elementary Functions
1. Power functions. The power function y = x°', where a is any real
number, is defined for all x > 0; this function monotonically increases if
a > 0 and monotonically decreases if a < 0 as shown in Figs. I.I and 1.2,
res pecti vel y.
y y
1
1
0 1 X 1 X
Fig. I.l Fig. 1.2
y y
0 X
Fig. 1.4
Fig. 1.3
y
0 X X
Fig. 1.7
Elementary Functions 589
Fig. 1.8
y=sinx
Fig. 1.9
y
Fig. 1.10
The cosine funcion y = cos xis a periodic function with period T = 21r
defined for all x (Fig. 1.10). The graph of this function is obtained from
that of the sine function by translating the latter by - ; along the x-axis.
The tangent function y = tan xis a periodic function with period T = 1r
I
I
I
I
I
I
I
I
I
I
I ,r 3,r
1-2 2
.r
Fig. I.11
Elementary Functions 591
y=cot x
-1( 1T X
I I
I I
I I
I I
I I
I
I I
I I
I I
I I
I I
Fig. 1.12
y y
1T X X
2
-1
Fig. 1.13
592 Appendix I
lj
0 r
-1
2"-
-------~
!
1 X
Fig. 1.14
y
lj
rr :x .:r
12
I
I
I
I
I
I
I
Fig. 1.15
(3) y = tan - 1 x.
Let us consider the tangent function y = tan x on the open interval
( - 1r/2, 1r/2), ·where this function monotonically increases. Then the range
of y = tan xis the interval ( - oo, + oo) so that there exists the inverse func-
Elementary Functions 593.·
tion x = tan - 1 y defined at every point of the number line; the values of
x = tan - 1 y fill in the interval [ - 1r/2, 1r/2]. The graph of y = tan - 1 x is
shown in Fig. 1.15.
(4) y = cot- 1 x.
As before we consider the cotangent function y = cot x on the interval
(0, 1r), where y = cot x monotonically decreases. The range of y = cot x
is the interval ( - oo, + oo) so that there exists the inverse function
x = cot - 1 y defined at every point of the number line; the values of
x = cot - 1 y fill in the interval (0, 1r). The graph of y = cot - 1 xis shown
in Fig. 1.16.
y y
rr
X
1"
I
I
I
I
I
I
I
I Fig. 1.16
I
0 X
Fig. 1.18
Fig. 1.17
594 Appendix I
y y=coth x
lj=tanh :r
X 0 X
y=coth x
Fig. 1.19
Fig. 1.20
Quadratic form, 213 System(s) of linear equations, Taylor's formula, 385, 564
associated matrix of, 213 143 for functions, 386 ·
bilinear form associated augmented matrix of, 144 for polynomials, 385
with, 214 coefficients of, 144 Theorem
diagonalization of, 219 coefficient matrix of, 144 Cauchy mean value, 343
law of inertia for, 221 Cramer's rule for, 154 mean value, 341
positive-definite, 219 method of Gaussian elimina- Newton-Leibniz, 471
Quantifier tion for, 148-150 on construction of a linear
existential, 236 quadratic, 154 mapping, 191
universal, 236 Scalar product, 34 Rolle's, 339
basic properties of, 34-36 Third-order determinant, 20
Scalar square of a vector, 36 cofactor of, 22
Rational function, 424 Scalar triple product, 43 minor of, 21
proper, 425 geometric interpretation of, Total differential
real, 425 43 geometric interpretation of,
Reflection of axes of coor- Second-order determinant, 19 557
dinates, 65 Sequence of numbers, 239 Transition matrix, 182
Relations between Cartesian and bounded, 241 properties of, 182-183
polar coordinates, 18 bounded above, 241 Translation of axes of coor-
Relations between infinitesimals bounded below, 241 dinates, 63
and infinities, 265 Cauchy convergence criterion Trapezoidal approximation, 498
Rotation of axes of coordinates, of, 241 Triangle inequality, 184
64 infinitely large, 242
Row-vector(s), 103 limit of, 239
linear combination of, 112 stationary, 240 Unit vector(s), 29
linear dependence of, 112 Set(s), 229 as orthonormal basis, 31
linearly independent, 112 bounded, 235 Unitary space, 191
nontrivial linear combination bounded above, 235
of, 112 bounded below, 235 Vector(s)
trivial linear combination of, connected, 531 addition of, 26
112 countable, 230 direction of, 557
disjoint, 230 direction cosines of, 37
empty, 230 equivalent, 29
Space equal, 229 components of, 30
n-dimensional real coor- equivalent, 230 coordinates of, 30
dinate, 169 finite, 230 coplanar, 43
Euclidean, 183 greatest lower bound of, 236 fixed, 24
linear, 168 infimum of, 236 free, 24
linear complex, 168 infinite, 230 linear operations on, 26
linear real, 168 least upper bound of, 236 moving position, 48
unitary, 191 lower bound of, 236 normal, 49
vector, 168 one-to-one correspondence orthogonal, 185
Standard Cartesian coordinate between, 230 orthonormal, 185
system, 67 open, 531 position, 31
Surface(s) operations on, 230 scalar product of, 34
classification of, 90-95 proper subset of, 229 scalar square of, 36
conic, 93 supremum of, 235 sliding, 25
cylindrical, 91 unbounded above, 235 sum of, 26
equation of, 89 unbounded below, 235 unit, 29
of revolution, 90 unconnected, 531 Vector function, 396
of the second order, 89-90 upper bound of, 235 continuity of, 398
Surface(s) of the second order Significant digit, 233 differentiation of, 399
equation of, 90 accurate, 233 limit of, 398
standard equations of, 95-102 Simpson's approximation, 502 Vector product, 39
Symmetric operator Subset, 229 Vector space, 168
properties of, 212-213 proper, 229 Vector triple product, 45
10 THE READER
Mir Publishers would be grateful for your CO!llments on the con-
tent, translation and design of this book. We would also be pleased to
receive any other suggestions you may wish to make.
Our address is:
Mir Publishers
2 Pervy Rizhsky Pereulok
1-110, GSP, Moscow, 129820
USSR