This action might not be possible to undo. Are you sure you want to continue?

BooksAudiobooksComicsSheet Music### Categories

### Categories

Scribd Selects Books

Hand-picked favorites from

our editors

our editors

Scribd Selects Audiobooks

Hand-picked favorites from

our editors

our editors

Scribd Selects Comics

Hand-picked favorites from

our editors

our editors

Scribd Selects Sheet Music

Hand-picked favorites from

our editors

our editors

Top Books

What's trending, bestsellers,

award-winners & more

award-winners & more

Top Audiobooks

What's trending, bestsellers,

award-winners & more

award-winners & more

Top Comics

What's trending, bestsellers,

award-winners & more

award-winners & more

Top Sheet Music

What's trending, bestsellers,

award-winners & more

award-winners & more

P. 1

Dovermann - A Summary of Calculus|Views: 64|Likes: 1

Published by scottswallows

See more

See less

https://www.scribd.com/doc/95763900/Dovermann-A-Summary-of-Calculus

09/11/2014

text

original

- Basic Concepts
- 1.1 Real Numbers and Functions
- 1.2 Limits
- 1.2.1 Two important estimates
- 1.3 More Limits
- 1.4 Continuous Functions
- 1.5 Lines
- 1.6 Tangent Lines and the Derivative
- 1.6.1 Derivatives without Limits
- 1.7 Secant Lines and the Derivative
- 1.8 Diﬀerentiability implies Continuity
- 1.9 Basic Examples of Derivatives
- 1.10 The Exponential and Logarithm Functions
- 1.11 Diﬀerentiability on Closed Intervals
- 1.12. OTHER NOTATIONS FOR THE DERIVATIVE 21
- 1.12 Other Notations for the Derivative
- 1.13 Rules of Diﬀerentiation
- 1.13.1 Linearity of the Derivative
- 1.13.2 Product and Quotient Rules
- 1.13.3 Chain Rule
- 1.13.4 Hyperbolic Functions
- 1.13.5 Derivatives of Inverse Functions
- 1.13.6 Implicit Diﬀerentiation
- 1.14 Related Rates
- 1.15. EXPONENTIAL GROWTH AND DECAY 41
- 1.15 Exponential Growth and Decay
- 1.16. MORE EXPONENTIAL GROWTH AND DECAY 43
- 1.16 More Exponential Growth and Decay
- 1.17 The Second and Higher Derivatives
- 1.18 Numerical Methods
- 1.18.1 Approximation by Diﬀerentials
- 1.18.2 Newton’s Method
- 1.18.3 Euler’s Method
- 1.19. TABLE OF IMPORTANT DERIVATIVES 63
- 1.19 Table of Important Derivatives
- Global Theory
- 2.1 Cauchy’s Mean Value Theorem
- 2.2. UNIQUE SOLUTIONS OF DIFFERENTIAL EQUATIONS 67
- 2.2 Unique Solutions of Diﬀerential Equations
- 2.3. THE FIRST DERIVATIVE AND MONOTONICITY 69
- 2.3 The First Derivative and Monotonicity
- 2.3.1 Monotonicity on Intervals
- 2.3.2 Monotonicity at a Point
- 2.4 The Second Derivative and Concavity
- 2.4.1 Concavity on Intervals
- 2.4.2 Concavity at a Point
- 2.5 Local Extrema and Inﬂection Points
- 2.6 Detection of Local Extrema
- 2.7 Detection of Inﬂection Points
- 2.8 Absolute Extrema of Functions
- 2.9 Optimization Story Problems
- 2.10 Sketching Graphs
- Integration
- 3.1 Properties of Areas
- 3.2 Partitions and Sums
- 3.2.1 Upper and Lower Sums
- 3.2.2 Riemann Sums
- 3.3 Limits and Integrability
- 3.3.1 The Darboux Integral and Areas
- 3.3.2 The Riemann Integral
- 3.4 Integrable Functions
- 3.5 Some elementary observations
- 3.6 Areas and Integrals
- 3.7 Anti-derivatives
- 3.8 The Fundamental Theorem of Calculus
- 3.8.1 Some Proofs
- 3.9 Substitution
- 3.9.1 Substitution and Deﬁnite Integrals
- 3.10 Areas between Graphs
- 3.11 Numerical Integration
- 3.12 Applications of the Integral

**Karl Heinz Dovermann
**

Professor of Mathematics

University of Hawaii

July 28, 2003

c Copyright 2003 by the author. All rights reserved. No part of this

publication may be reproduced, stored in a retrieval system, or transmit-

ted, in any form or by any means, electronic, mechanical, photocopying,

recording, or otherwise, without the prior written permission of the author.

Printed in the United States of America.

This publication was typeset using A

M

S-T

E

X, the American Mathemat-

ical Society’s T

E

X macro system, and L

A

T

E

X2

ε

. The graphics were produced

with the help of Mathematica

1

.

This is an incomplete draft which will undergo further changes.

1

Mathematica Version 2.2, Wolfram Research, Inc., Champaign, Illinois (1993).

Contents

Preface v

1 Basic Concepts 1

1.1 Real Numbers and Functions . . . . . . . . . . . . . . . . . . 1

1.2 Limits . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2

1.2.1 Two important estimates . . . . . . . . . . . . . . . . 4

1.3 More Limits . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6

1.4 Continuous Functions . . . . . . . . . . . . . . . . . . . . . . 8

1.5 Lines . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9

1.6 Tangent Lines and the Derivative . . . . . . . . . . . . . . . . 10

1.6.1 Derivatives without Limits . . . . . . . . . . . . . . . 12

1.7 Secant Lines and the Derivative . . . . . . . . . . . . . . . . . 13

1.8 Diﬀerentiability implies Continuity . . . . . . . . . . . . . . . 14

1.9 Basic Examples of Derivatives . . . . . . . . . . . . . . . . . . 15

1.10 The Exponential and Logarithm Functions . . . . . . . . . . . 18

1.11 Diﬀerentiability on Closed Intervals . . . . . . . . . . . . . . . 20

1.12 Other Notations for the Derivative . . . . . . . . . . . . . . . 21

1.13 Rules of Diﬀerentiation . . . . . . . . . . . . . . . . . . . . . 22

1.13.1 Linearity of the Derivative . . . . . . . . . . . . . . . . 22

1.13.2 Product and Quotient Rules . . . . . . . . . . . . . . . 23

1.13.3 Chain Rule . . . . . . . . . . . . . . . . . . . . . . . . 26

1.13.4 Hyperbolic Functions . . . . . . . . . . . . . . . . . . 29

1.13.5 Derivatives of Inverse Functions . . . . . . . . . . . . . 30

1.13.6 Implicit Diﬀerentiation . . . . . . . . . . . . . . . . . . 34

1.14 Related Rates . . . . . . . . . . . . . . . . . . . . . . . . . . . 38

1.15 Exponential Growth and Decay . . . . . . . . . . . . . . . . . 41

1.16 More Exponential Growth and Decay . . . . . . . . . . . . . 43

1.17 The Second and Higher Derivatives . . . . . . . . . . . . . . . 48

1.18 Numerical Methods . . . . . . . . . . . . . . . . . . . . . . . . 48

i

1.18.1 Approximation by Diﬀerentials . . . . . . . . . . . . . 48

1.18.2 Newton’s Method . . . . . . . . . . . . . . . . . . . . . 51

1.18.3 Euler’s Method . . . . . . . . . . . . . . . . . . . . . . 53

1.19 Table of Important Derivatives . . . . . . . . . . . . . . . . . 63

2 Global Theory 65

2.1 Cauchy’s Mean Value Theorem . . . . . . . . . . . . . . . . . 65

2.2 Unique Solutions of Diﬀerential Equations . . . . . . . . . . . 67

2.3 The First Derivative and Monotonicity . . . . . . . . . . . . . 69

2.3.1 Monotonicity on Intervals . . . . . . . . . . . . . . . . 69

2.3.2 Monotonicity at a Point . . . . . . . . . . . . . . . . . 75

2.4 The Second Derivative and Concavity . . . . . . . . . . . . . 76

2.4.1 Concavity on Intervals . . . . . . . . . . . . . . . . . . 77

2.4.2 Concavity at a Point . . . . . . . . . . . . . . . . . . . 80

2.5 Local Extrema and Inﬂection Points . . . . . . . . . . . . . . 81

2.6 Detection of Local Extrema . . . . . . . . . . . . . . . . . . . 83

2.7 Detection of Inﬂection Points . . . . . . . . . . . . . . . . . . 88

2.8 Absolute Extrema of Functions . . . . . . . . . . . . . . . . . 90

2.9 Optimization Story Problems . . . . . . . . . . . . . . . . . . 92

2.10 Sketching Graphs . . . . . . . . . . . . . . . . . . . . . . . . . 98

3 Integration 105

3.1 Properties of Areas . . . . . . . . . . . . . . . . . . . . . . . . 105

3.2 Partitions and Sums . . . . . . . . . . . . . . . . . . . . . . . 107

3.2.1 Upper and Lower Sums . . . . . . . . . . . . . . . . . 108

3.2.2 Riemann Sums . . . . . . . . . . . . . . . . . . . . . . 111

3.3 Limits and Integrability . . . . . . . . . . . . . . . . . . . . . 112

3.3.1 The Darboux Integral and Areas . . . . . . . . . . . . 112

3.3.2 The Riemann Integral . . . . . . . . . . . . . . . . . . 115

3.4 Integrable Functions . . . . . . . . . . . . . . . . . . . . . . . 116

3.5 Some elementary observations . . . . . . . . . . . . . . . . . . 118

3.6 Areas and Integrals . . . . . . . . . . . . . . . . . . . . . . . . 121

3.7 Anti-derivatives . . . . . . . . . . . . . . . . . . . . . . . . . . 122

3.8 The Fundamental Theorem of Calculus . . . . . . . . . . . . . 124

3.8.1 Some Proofs . . . . . . . . . . . . . . . . . . . . . . . 126

3.9 Substitution . . . . . . . . . . . . . . . . . . . . . . . . . . . . 128

3.9.1 Substitution and Deﬁnite Integrals . . . . . . . . . . . 131

3.10 Areas between Graphs . . . . . . . . . . . . . . . . . . . . . . 133

3.11 Numerical Integration . . . . . . . . . . . . . . . . . . . . . . 136

3.12 Applications of the Integral . . . . . . . . . . . . . . . . . . . 142

ii

3.13 The Exponential and Logarithm Functions . . . . . . . . . . . 144

3.13.1 Other Bases . . . . . . . . . . . . . . . . . . . . . . . . 148

4 Trigonometric Functions 151

iii

iv

Preface

In these notes we like to summarize calculus.

v

vi PREFACE

Chapter 1

Basic Concepts

Introduction

In this chapter we introduce limits and derivatives. These are basic concepts

of calculus. We provide some rules for their computations.

1.1 Real Numbers and Functions

We assume that the reader is familiar with the real numbers (denoted by R)

and the operations of addition and multiplication. A real number is either

positive, negative, or zero. This allows us to order the real numbers. If

x and y are real numbers, then x is larger than y (i.e., x > y) if x − y is

positive.

Until further notice, we will work with real valued functions in one real

variable. Their domains, the sets on which these functions are deﬁned, are

subsets of the real numbers, and they take values in R. The range of a

function is a set in which the function takes values. The image of a function

f consists of all those points y in the range for which there exists an x in

the domain of f, such that f(x) = y.

We will make frequent use of the absolute value function.

|x| =

_

¸

_

¸

_

x if x > 0

−x if x < 0

0 if x = 0

The distance between two points a and b on the real line is |a − b|, and

{x ∈ R | |x − a| < } is the set of all real numbers whose distance from

1

2 CHAPTER 1. BASIC CONCEPTS

a is less than . Expressed as an interval this set is (a − , a + ). For

computations with absolute values it is worth noting that, for any two real

numbers x and x

|x · y| = |x| · |y|, |x +y| ≤ |x| +|y|, and ||x| −|y|| ≤ |x −y|. (1.1)

The ﬁrst inequality is referred to as triangle inequality, and the last one is

a variation of it.

Every now and then we will allude to the completeness of the real line,

which means that every bounded subset of the real line has a least upper

bound. This property is crucial for calculus, but arguments using it are too

diﬃcult for an introductory course on the subject.

1.2 Limits

Limits are a central tool in calculus and other areas of mathematics. We

discuss them in this section.

Deﬁnition 1.1. Let f be a function and L a real number. We say that

L = lim

x→a

f(x) (1.2)

if for all > 0 there exists a δ > 0, such that |f(x) − L| < whenever x is

in the domain of f and 0 < |x −a| < δ.

The equation in (1.2) reads as L is the limit of f(x) as x approaches

a. We also say that f(x) approaches or converges to L as x approaches a.

An intuitive interpretation is that the expected value of f(x) at x = a is L,

based on the values of f(x) for x near a.

In all but a few degenerate cases, limits are unique if they exist.

Proposition 1.2. Suppose that f(x) has a limit at x = a, then this limit is

unique, provided that the domain of the function f contains points arbitrarily

close to a.

1 2

The latter assumption in the proposition is satisﬁed if the domain of f

contains an interval, and either a belongs to this interval or a is an end point

1

Expressed in mathematical language this means, for all δ > 0 there is a point b in the

domain of f, such that 0 < |b −a| < δ.

2

Some authors do not apply the concept of a limit at isolated points of the domain of

a function, points for which there are no other arbitrarily close points in the domain of

the function.

1.2. LIMITS 3

of it. To avoid intricate language, we make this kind of an assumption for

the remainder of this section. Taking limits is compatible with the basic

algebraic operations in the following sense.

Proposition 1.3. Assume that the domains of the functions f(x) and g(x)

both contain an interval of the form (d, a) or (a, e) where d < a < e. Suppose

that

lim

x→a

f(x) = L and lim

x→a

g(x) = M.

and that c is a constant. Then

lim

x→a

(f +g)(x) = M +L

lim

x→a

cf(x) = cM

lim

x→a

(f · g)(x) = M · L

lim

x→a

(f/g)(x) = M/L provided that L = 0.

As a special case we obtain the following useful observation:

lim

x→c

f(x) = L if and only if lim

x→c

(f(x) −L) = 0. (1.3)

Proposition 1.4 (Pinching Theorem). Assume that the domains of the

functions f(x), g(x), and h(x) all contain an interval of the form (d, a) or

(a, e) where d < a < e and that f(x) ≤ h(x) ≤ g(x). If

lim

x→a

f(x) = L = lim

x→a

g(x),

then the limit of h(x) exists as x approaches a, and it is equal to L.

For many functions the computation of limits is no challenge.

Proposition 1.5. If f(x) is a polynomial, a rational function, or a trigono-

metric function and f(a) is deﬁned, then

lim

x→a

f(x) = f(a).

The following limits are important in the calculations of some derivatives.

lim

x→0

cos x −1

x

= 0, lim

x→0

sin x

x

= 1. and lim

x→a

x

n

−a

n

x −a

= na

n−1

. (1.4)

Hints: The ﬁrst two limits follow easily from the estimates in Theo-

rem 1.7, discussed in the following subsection. The last assertion can be

proved using synthetic division, at least if n is an integer.

4 CHAPTER 1. BASIC CONCEPTS

1.2.1 Two important estimates

In preparation of the proof of Theorem 1.7 we show

Theorem 1.6. If h ∈ [−π/4, π/4], then

| sin h| ≤ |h| ≤ | tan h|. (1.5)

Proof. In Figure 1.1 you see part of the unit circle. For h ∈ [−π/4, π/4] we

set C = (cos h, sin h). Given two points X and Y in the plane, the distance

between them is denoted by XY . We denote by

¯

BC the length of the arc

(part of the unit circle) between B and C.

O A B

C

D

E

Figure 1.1: The unit circle

We ﬁnd that | sin h| = AC ≤ |h| =

¯

BC because going from C straight

down to the x-axis is shorter than following the circle from C to the x-axis.

Secondly, to show that |h| =

¯

BC ≤ | tan h| = BD, imagine that you roll

the circle along the vertical line through B until the point C touches it in the

point E. We use the process of rolling the circle along the line to measure

|h|. In particular, |h| = BE. It appears to be clear

3

that BE ≤ BD. This

3

Here our argument relies on intuition. A rigorous argument requires work. One can

show that the area of a disk with radius one is π. From this is follows by elementary

geometry that the area of the slice of the disk with vertices O, B and C has area |h|/2.

This slice is contained in the triangle with vertices O, B and D, and the area of the slice

is (tan |h|)/2. It follows that |h| ≤ tan |h|.

1.2. LIMITS 5

veriﬁes that |h| ≤ tan |h|, the second inequality which we claimed in the

theorem.

Theorem 1.7. If h ∈ [−π/4, π/4], then

4

|1 −cos h| ≤

h

2

2

and |h −sin h| ≤

h

2

2

. (1.6)

Proof of Theorem 1.7. In Figure 1.2 you see half of a circle of radius 1

centered at the origin, and a triangle with vertices A, B, and C. Let

h ∈ [−π/4, π/4] be the number for which we want to show the inequal-

ity and C = (cos h, sin h). Denote by XY the length of the straight line

segment between the points X and Y . Let

¯

BC be the length of the arc

(part of the unit circle) between B and C.

A B

C

D

Figure 1.2: The unit circle

From the picture we read oﬀ that

AB = 2, DB = (1 −cos h),

¯

BC = |h|, and BC ≤

¯

BC.

Using similar triangles we see AB/BC = BC/DB and (BC)

2

= AB×DB.

In other words

2(1 −cos h) = AB ×DB = (BC)

2

≤ (

¯

BC)

2

= h

2

.

4

The inequalities hold without the restriction on h, but we only need them on an

interval around zero. Restricting ourselves to this interval simpliﬁes the proofs somewhat.

6 CHAPTER 1. BASIC CONCEPTS

The ﬁrst estimate in (1.6) is an immediate consequence.

If h = 0, then both sides of the second inequality in (1.6) are zero,

verifying the assertion in this case. If 0 = h ∈ [−π/4, π/4], then Theorem 1.6

tells us that

| sin h| ≤ |h| ≤ | tan h| =

| sin h|

cos h

hence 0 ≤ cos h ≤

sin h

h

≤ 1.

Subtracting the terms in this inequality from 1 we ﬁnd

0 ≤ 1 −

sin h

h

≤ 1 −cos h ≤ 1.

Using our previous estimate for |1 − cos h| and our assumption that |h| ≤

π/4 < 1, we conclude that

¸

¸

¸

¸

h −sin h

h

¸

¸

¸

¸

≤ |1 −cos h| ≤

h

2

2

≤

h

2

.

The second estimate claimed in the theorem is an immediate consequence.

1.3 More Limits

The material in the previous, ﬁrst section about limits suﬃces for a while.

In some situations one would like to modify the deﬁnition in Section 1.2,

and we do so in this section. The ﬁrst two limits express how the function

behaves as we approach a point a from the right or left. They are called the

right and left hand limits. The next two limits express what happens as the

variable tends to plus or minus inﬁnity. We call them limits at inﬁnity. The

last two limits allow us to express that the values of a function tend to plus

or minus inﬁnity. We call them inﬁnite limits.

Deﬁnition 1.8. Let f be a function and L a real number. We say that

L = lim

x→a

+

f(x)

if for all > 0 there exists a δ > 0, such that |f(x) − L| < whenever x is

in the domain of f and a < x < a +δ.

Deﬁnition 1.9. Let f be a function and L a real number. We say that

L = lim

x→a

−

f(x)

if for all > 0 there exists a δ > 0, such that |f(x) − L| < whenever x is

in the domain of f and a −δ < x < a.

1.3. MORE LIMITS 7

For example, if f(x) = sign(x) = x/|x|, then

lim

x→a

+

f(x) = 1 and lim

x→a

−

f(x) = −1.

We can consider what happens to the values of a function f(x) as x

approaches ∞ or −∞.

Deﬁnition 1.10. Let f be a function and L a real number. We say that

L = lim

x→∞

f(x)

if for all > 0 there exists a number M, such that |f(x) −L| < whenever

x is in the domain of f and x > M.

Deﬁnition 1.11. Let f be a function and L a real number. We say that

L = lim

x→−∞

f(x)

if for all > 0 there exists a number M, such that |f(x) −L| < whenever

x is in the domain of f and x < M.

For example

lim

x→∞

1

x

= 0 and lim

x→−∞

1

1 +x

2

= 1.

Deﬁnition 1.12. Let f be a function and a a real number. We say that

lim

x→a

f(x) = ∞

if for all M there exists a δ > 0 such that f(x) > M whenever x is in the

domain of f and 0 < |a −x| < δ.

In other words, we can make sure that the value of f(x) is larger than

any given number M, no matter how large, by taking x close to a.

Deﬁnition 1.13. Let f be a function and a a real number. We say that

lim

x→a

f(x) = −∞

if for all M there exists a δ > 0 such that f(x) < M whenever x is in the

domain of f and 0 < |a −x| < δ.

In the last two deﬁnitions a may be replaced by a

±

, so that we approach

a from the left or right, and a can be replaced by ±∞.

For example

lim

x→0

+

1

x

= ∞ and lim

x→∞

√

x = ∞.

8 CHAPTER 1. BASIC CONCEPTS

1.4 Continuous Functions

We deﬁne continuous functions and discuss a few of their basic properties.

The class of continuous functions will play a central role later.

Deﬁnition 1.14. Let f be a function and c a point in its domain. The

function is said to be continuous at c if for all > 0 there exists a δ > 0,

such that |f(c) − f(x)| < whenever x belongs to the domain of f and

|x−c| < δ. A function f is continuous if it is continuous at all points in its

domain.

In most cases the condition in Deﬁnition 1.14 says that

lim

x→c

f(x) = f(c). (1.7)

In fact, this equation holds whenever there are points in the domain of f

arbitrarily close to c. See the footnote to Proposition 1.2. If c is an isolated

point in the domain of f, i.e., there are no other points in the domain of f

arbitrarily close to c, then the function is always continuous at c.

Polynomials, rational functions, and trigonometric functions are contin-

uous. One can produce many more continuous functions through standard

operations on functions.

Proposition 1.15. Let f and g be continuous functions. Then f +g, f · g,

f/g and f ◦ g are continuous, wherever these functions are deﬁned.

The clarify the remark about the domain in the proposition, we note that

the function (f +g)(x) = f(x) +g(x) is deﬁned for those x for which both f

and g are deﬁned. The same statement holds for (f · g)(x) = f(x) · g(x). To

determine the domain of f/g one needs to exclude those points where g is

zero. For the composition (f ◦ g)(x) = f(g(x)) on needs that g takes values

in the domain of f.

One may also reverse the order of applying a continuous function and

calculating a limit:

lim

x→c

f(g(x)) = f

_

lim

x→c

g(x)

_

, (1.8)

provided the natural technical assumption hold, i.e., g is deﬁned at points

arbitrarily close to c, f is deﬁned for all g(x) where x is in the domain of g

and close to c, and f is continuous at lim

x→c

g(x).

Theorem 1.16 (Intermediate Value Theorem). Suppose that f is de-

ﬁned and continuous on the closed interval [a, b]. If C is in between f(a)

and f(b), then there exists a c ∈ [a, b], such that f(c) = 0.

1.5. LINES 9

E.g., suppose that p(x) = x

3

−x

2

+2x −1. The polynomial is certainly

a continuous function, p(0) = −1 and p(1) = 1. According to the theorem

there exists some c ∈ (0, 1), such that p(c) = 0.

Theorem 1.17 (Extreme Value Theorem). Let f be deﬁned and con-

tinuous on the closed interval [a, b]. Then there exist points c and d in [a, b],

such that f(c) ≤ f(x) ≤ f(d) for all x ∈ [a, b].

Expressed in words, the theorem says that a continuous function on a

closed interval assumes a smallest and largest value.

The Intermediate Value and Extreme Value theorem are typically proved

in an introductory analysis course. They are equivalent to the completeness

of the real line. We mentioned this property of the real numbers in Sec-

tion 1.1.

1.5 Lines

In general, a line consists of the points (x, y) in the plane which satisfy the

equation

ax +by = c (1.9)

for some given real numbers a, b and c, where it is assumed that a and b

are not both zero. The line is vertical if and only if b = 0. If b = 0 we may

rewrite the equation as

y = −

a

b

x +

c

b

= mx +B. (1.10)

The number m is called the slope of the line, and B is the point in which the

line intersects the y-axis, also called the y-intercept. Given any two points

(x

1

, y

1

) and (x

2

, y

2

) in the plane, the line through them has slope

m =

y

2

−y

1

x

2

−x

1

.

For our purposes, the most useful version of the equation of a line is its

point-slope formula. The equation of a line with slope m through the point

(x

1

, y

1

) is

y = m(x −x

1

) +y

1

. (1.11)

10 CHAPTER 1. BASIC CONCEPTS

1.6 Tangent Lines and the Derivative

We like to introduce the concept of tangent lines. To be able to express

ourselves concisely, let us say

Deﬁnition 1.18. A point c is an interior point of a subset B of R if there

is an open interval I, such that c ∈ I ⊆ B.

We give a ﬁrst deﬁnition for a tangent line.

Deﬁnition 1.19. Suppose f(x) is a function and c is an interior point of

its domain. We call a line t(x) the tangent line to the graph of f(x) at x = c

if t(x) is the best linear approximation of f(x) on some open interval around

c, i.e., the line t(x) is closer to the graph of f(x) than any other line for all

x in some open interval around c.

For a given function and an interior point c in its domain there may or

may not be a tangent line, but it there is a tangent line, then it is unique.

Although the term ‘best linear approximation near c’ gives an excellent

intuitive picture what a tangent line is, this deﬁnition is hard to work with.

It is easier to work with a more concrete deﬁnition.

Deﬁnition 1.20. Suppose f(x) is a function and c is an interior point of

its domain. We call a line t(x) the tangent line to the graph of f(x) at x = c

if

lim

x→c

f(x) −t(x)

x −c

= 0. (1.12)

The equation in (1.12) expresses in a precise form in which sense the

tangent line is close to the graph of f(x) near c. Not only does f(x) −t(x)

converge to zero as x approaches c, it does so even when divided by x −c.

We use tangent lines to deﬁne the concept of diﬀerentiability and the

derivative.

Deﬁnition 1.21. Suppose f(x) is a function and c is an interior point of

its domain, and assume that there is a tangent line to the graph of f(x) at

x = c. Then we say that f(x) is diﬀerentiable at c. We call the slope of the

tangent line the derivative of f(x) at c, and we denote it by f

(c).

Utilizing the notation in the previous deﬁnition we can write down the

equation of the tangent line to the graph of f(x) at x = c in point-slope

form:

t(x) = f

(c)(x −c) +f(c). (1.13)

1.6. TANGENT LINES AND THE DERIVATIVE 11

To diﬀerentiate a function means to ﬁnd its derivative.

By deﬁnition, an open set is a set, such that each of its points is an

interior point.

Deﬁnition 1.22. Suppose the domain of the function f(x) is an open set.

Then say that f(x) is diﬀerentiable if it is diﬀerentiable at each point of

its domain. We consider f

**(x) as a function, whose domain consists of all
**

those points where f(x) is diﬀerentiable.

Example 1.23. Let p(x) = 2x

4

−3x

2

+5. Find the tangent line t(x) to the

graph of p(x) at x = −2 and p

(−2).

Solution: As a ﬁrst step we expand p in powers of u = (x + 2). To do

so, we substitute u−2 for x and expand p in powers of u. You are expected

to ﬁll in some of the arithmetic steps.

p = 2(u −2)

4

−3(u −2)

2

+ 5

= 2(u

4

−8u

3

+ 24u

2

−32u + 16) −3(u

2

−4u + 4) + 5

= 2u

4

−16u

3

+ 45u

2

−52u + 25

Reversing the substitution, replacing u by (x + 2), we ﬁnd:

p(x) = 2(x + 2)

4

−16(x + 2)

3

+ 45(x + 2)

2

−52(x + 2) + 25.

We assert that t(x) = −52(x + 2) + 25 and p

(−2) = −52.

For t(x) as proposed, we see that

¸

¸

¸

¸

p(x) −t(x)

x −c

¸

¸

¸

¸

=

¸

¸

¸

¸

p(x) −t(x)

x + 2

¸

¸

¸

¸

= |2(x + 2)

3

−16(x + 2)

2

+ 45(x + 2)|

≤ 65|x + 2| (provided |x + 2| ≤ 1)

This estimate shows that (p(x) − t(x))/(x − c) converges to zero as x ap-

proaches c = −2. By deﬁnition, this means that t(x) is the desired tangent

line. Its slope is p

(−2) = −52. ♦

The example is generic. We can use any polynomial p(x) and point x = c

and write p(x) in powers of (x −c). Say, the result is

p(x) = A

n

(x −c)

n

+· · · +A

1

(x −c) +A

0

.

The technique used in the example, suitably generalized, shows that

t(x) = A

1

(x −c) +A

0

12 CHAPTER 1. BASIC CONCEPTS

is the tangent line to the graph of p(x) at x = c, and p

(c) = A

1

. Eventually

we will ﬁnd a more eﬃcient method for diﬀerentiating polynomials, but we

have shown that

Proposition 1.24. Polynomials are diﬀerentiable.

1.6.1 Derivatives without Limits

Without a doubt, the deﬁnition of a limit is the most diﬃcult one in a

ﬁrst semester of calculus, and it is interesting to explore ways to develop

calculus, rigorously, without the limit concept. One can do this by replacing

the condition in (1.12) by a slightly stronger one.

Deﬁnition 1.25. Suppose f(x) is a function and c is an interior point of

its domain. We call a line t(x) the tangent line to the graph of f(x) at x = c

if there exists and open interval I around c and a number A, such that

|f(x) −t(x)| ≤ A(x −c)

2

(1.14)

for all x ∈ I.

With this deﬁnition fewer functions will be diﬀerentiable than with the

one given in Deﬁnition 1.20, but this is not crucial.

The inequality in (1.14) can be rewritten as

q(x) = t(x) −A(x −c)

2

≤ f(x) ≤ t(x) +A(x −c)

2

= p(x),

where the parabolas q(x) and p(x) are deﬁned by the expressions they are

adjacent to. All four function f(x), q(x), p(x), and t(x) have the same value

at x = c. In an example, this situation is shown in Figure 1.3. There you

see the function f(x) = sin x, the parabola p(x) (dotted and open upwards),

the parabola q(x) (dotted and open downwards), and the tangent line t(x)

(dashed). The parabolas p(x) and q(x) touch each other without crossing,

and the picture shows how they ‘hug’ each other. There is very little space

left between p(x) and q(x), and f(x) and t(x) are squeezed in between them.

In this sense, the graphs of f(x) and t(x) have to be close to each other near

x = c.

A pedagogical advantage of the approach is that one does not have to

understand limits before one can understand the deﬁnition of the derivative.

There is also a geometric picture which illustrates the concept of closeness,

tangent line, and derivative. The condition in (1.14) is also more accessible

to computer assisted algebra than the limit deﬁnition. In terms of algebraic

geometry (1.14) at least alludes to a divisibility condition.

1.7. SECANT LINES AND THE DERIVATIVE 13

0.5 1 1.5 2

0.25

0.5

0.75

1

1.25

1.5

Figure 1.3: Sine Function and Tangent Line between two Parabolas

1.7 Secant Lines and the Derivative

Often a diﬀerent approach is taken to motivate and introduce the derivative.

Theorem 1.26. Suppose f is a function and c is an interior point of its

domain. If f is diﬀerentiable at c, then

f

(c) = lim

x→c

f(c) −f(x)

c −x

.

Proof. This is obvious once one uses the expression for the tangent line in

(1.13) and substitutes it in the expression in (1.12) inside the limit.

f(x) −t(x)

x −c

=

f(x) −f(c)

x −c

−f

(c). (1.15)

Apply limits to both sides of the equation and the assertion follows.

Let us explain the situation geometrically. Suppose a and b are distinct

points in the domain of the function f. The line through (a, f(a)) and

(b, f(b)) is called a secant line, and its slope (f(a) − f(b))/(a − b) is called

14 CHAPTER 1. BASIC CONCEPTS

the average rate of change of f over the interval [a, b]. In (1.15) we are

considering the slopes of secant lines through (c, (f(c)) and (x, f(x)), and

then we take the limit as x approaches c. The theorem asserts that for a

diﬀerentiable function this limit of the slopes of secant lines is the slope of

the tangent line. For the obvious reason f

**(c) is called the rate of change or
**

instantaneous rate of change of f at c.

Many authors introduce the derivative as the limit of the slopes of secant

lines, call t(x) = f

**(x − c) + f(c) the tangent line, and possibly illustrate
**

that the tangent line is close to the graph in the sense of Deﬁnition 1.20.

-0.1 -0.05 0.05 0.1

-0.0075

-0.005

-0.0025

0.0025

0.005

0.0075

Figure 1.4: f(x) = x

2

sin(1/x)

-0.01 -0.005 0.005 0.01

-0.0001

-0.00005

0.00005

0.0001

Figure 1.5: f(x) = x

2

sin(1/x)

It is misleading to say that the graph of f(x) looks like, or resembles, a

line near c. Eventually you will be able to show that the function

f(x) =

_

x

2

sin(1/x) if x = 0

0 if x = 0

is diﬀerentiable everywhere on the real line. You see part of its graph over

two diﬀerent intervals in Figure 1.4 and 1.5. By no stretch of imagination

will you say that the graph of the function looks like a line.

1.8 Diﬀerentiability implies Continuity

It is worth pointing out that

Theorem 1.27. If a function is diﬀerentiable at a point, then it is contin-

uous at this point.

1.9. BASIC EXAMPLES OF DERIVATIVES 15

Proof. Denote the function by f(x) and the point of diﬀerentiability by c.

By assumption we have the derivative f

(c) and

lim

x→c

_

f(x) −f(c)

x −c

−f

(c)

_

= 0.

Then certainly

lim

x→c

[(f(x) −f(c)) −f

(c)(x −c)] = 0.

Because f

**(c)(x−c) converges to zero as x approaches c, so does (f(x)−f(c)).
**

This implies that lim

x→c

f(x) = f(c) and that f(x) is continuous at c.

-2 -1 1 2

0.5

1

1.5

2

Figure 1.6: The absolute value function

The converse of the theorem is false. There are continuous functions

which are not diﬀerentiable. E.g., the function f(x) = |x| is continuous,

but it is not diﬀerentiable at x = 0. It is apparent from the graph (see

Figure 1.6) that there is not line close to the graph of this function near

x = 0.

We can also give an analytic argument. According to the deﬁnition of

diﬀerentiability, we have to study the diﬀerence quotients (|x|−|0|)/(x−0) =

|x|/x. They are 1 if x > 0 and −1 if x < 0. There is no number these

diﬀerence quotients converge to, and f(x) = |x| is not diﬀerentiable at x = 0.

1.9 Basic Examples of Derivatives

Let us use the deﬁnitions and work out a few derivatives.

16 CHAPTER 1. BASIC CONCEPTS

Example 1.28. If f(x) = x

n

and n is a non-negative integer, i.e., n = 0,

1, 2, . . . , then f

(x) = nx

n−1

.

Proof. Suppose that n ≥ 2. Then

lim

x→c

x

n

−c

n

x −c

= lim

x→c

(x

n−1

+x

n−2

c +· · · xc

n−2

+c

n−1

) = nc

n−1

The cases n = 0 and n = 1 are even easier and left to the reader.

Example 1.29. If f(x) = 1/x, then f

(x) = −1/x

2

.

Proof. Suppose c = 0.

lim

x→c

1

x

−

1

c

x −c

= lim

x→c

c −x

xc(x −c)

= −

1

c

2

.

Example 1.30. If f(x) =

√

x and x > 0, then f

(x) = 1/(2

√

x).

Proof.

lim

x→c

√

x −

√

c

x −c

= lim

x→c

x −c

(x −c)(

√

x +

√

c)

=

1

2

√

c

.

Remark 1. Eventually we will see that if f(x) = x

a

for any real number

a, then f

(x) = ax

a−1

, generalizing all of the examples above.

Exercise 1. Suppose that f(x) =

√

ax +b and ax +b > 0. Show that

f

(x) =

a

2

√

ax +b

.

The tangent line to the graph of f(x) at x = c is then

t(x) =

a

2

√

ac +b

(x −c) +

√

ac +b.

Verify that

|f(x) −t(x)| ≤

a

2

2(

√

ac +b)

3

(x −c)

2

. (1.16)

1.9. BASIC EXAMPLES OF DERIVATIVES 17

The estimate in (1.16) shows diﬀerentiability in the sense of Deﬁni-

tion 1.25, and provides an explicit error estimate, a bound on the diﬀerence

between the function and its tangent line.

Example 1.31. Show that sin

**x = cos x. For this equation to hold, the
**

angle x needs to be measured in radians.

Proof. Below we will set x = c +h and x −c = h.

lim

x→c

_

sin x −sin c

x −c

_

= lim

h→0

_

sin(c +h) −sin c

h

_

= lim

h→0

_

sinc cos h + cos c sin h −sin c

h

_

= lim

h→0

_

sinc(cos h −1) + cos c sin h

h

_

= sin c · lim

h→0

cos h −1

h

+ cos c · lim

h→0

sin h

h

= cos c.

For computation of the limits in the second to last line see (1.4).

The tangent line to the graph of the sine function at x = c is

t(x) = cos c(x −c) + sin c.

It is left as an exercise for the reader to show that

| sin x −t(x)| ≤ (x −c)

2

(1.17)

The steps are essentially the same as in the proof above. The estimate in

(1.17) does not only show diﬀerentiability in the sense of Deﬁnition 1.25,

but it provides an explicit error estimate, a bound on the diﬀerence between

the function and its tangent line.

Exercise 2. If f(x) = cos x, then f

**(x) = −sin x. The details are similar
**

to the ones in Example 1.31. Furthermore, if

t(x) = sin c(x −c) + cos x

is the tangent line to the graph of f(x) at x = c, then

|f(x) −t(x)| ≤ (x −c)

2

.

18 CHAPTER 1. BASIC CONCEPTS

1.10 The Exponential and Logarithm Functions

The exponential and logarithm are of great importance and we do not want

to delay their introduction any further. Still, technically we are not quite

prepared for it and at a later point we have to revisit the introduction to ﬁll

in details.

Suppose a is a positive real number and a = 1. For any rational number

r = p/q (p and q are integers) one can deﬁne a

r

=

q

√

a

p

. First we take a p-th

power and then a q-root. In this sense we have a function h(r) = a

r

, whose

domain consists of all rational numbers. This function is monotonic. More

precisely, h(r) is increasing if a > 1 and decreasing when 0 < a < 1.

Theorem-Deﬁnition 1.32. Let a be a positive number, a = 1. There

exists exactly one monotonic function, called the exponential function with

base a and denoted by exp

a

(x), which is deﬁned for all real numbers x such

that exp

a

(x) = a

x

whenever x is a rational number. Furthermore, a

x

> 0 for

all x, so that the domain of the exponential function is (−∞, ∞). For every

number y > 0 there exists exactly one number x, such that exp

a

(x) = y, so

we use (0, ∞) as the range of the exponential function exp

a

(x).

It is common, and we will follow this convention, to use the notation

a

x

for exp

a

(x) also if x is not rational. The arithmetic properties of the

exponential function, also called the exponential laws, are collected in our

next theorem. The theorem just says that the exponential laws, which you

previously learned for rational exponents, also hold in the generality of our

current discussion.

Theorem 1.33 (Exponential Laws). For any positive real number a and

all real numbers x and y

a

x

a

y

= a

x+y

a

x

/a

y

= a

x−y

(a

x

)

y

= a

xy

If x is the unique solution of the equation a

x

= y, then we set

log

a

(y) = x. (1.18)

We just deﬁned a function log

a

(y). It is called the logarithm function with

base a, and by construction it is the inverse of the exponential function

exp

a

(x). More explicitly,

a

log

a

y

= y and log

a

(a

x

) = x

1.10. THE EXPONENTIAL AND LOGARITHM FUNCTIONS 19

for all x ∈ R and all y > 0. The domain of the logarithm function is

(0, ∞) and its range is (−∞, ∞). It is increasing if a > 1 and decreasing if

0 < a < 1.

Corresponding to the exponential laws in Theorem 1.33 we have the laws

of logarithms. One set of laws implies the other one, and vice versa.

Theorem 1.34 (Laws of Logarithms). For any positive real number a =

1, for all positive real numbers x and y, and any real number z

log

a

(xy) = log

a

(x) + log

a

(y)

log

a

(x/y) = log

a

(x) −log

a

(y)

log

a

(x

z

) = z log

a

(x)

In Figures 1.7 and 1.8 you see parts of the graphs of the exponential and

logarithm functions with base 2.

-1 -0.5 0.5 1 1.5

0.5

1

1.5

2

2.5

Figure 1.7: exp

2

(x)

0.5 1 1.5 2 2.5 3

-1

-0.5

0.5

1

1.5

Figure 1.8: log

2

(x)

The Euler number e as base

There is one number which is preferrable as base over the others. This

irrational number is called the Euler number (named after Leonard Euler)

and denoted by e, and e ≈ 2.718281828. We will deﬁne it precisely later.

Deﬁnition 1.35. The exponential function is the exponential function for

the base e. It is denoted by exp(x) or e

x

. Its inverse is the natural logarithm

function. It is denoted by ln(x). So exp(x) = exp

e

(x) and ln(x) = log

e

(x).

20 CHAPTER 1. BASIC CONCEPTS

Eventually we will see

exp

(x) = exp(x) and ln

(x) =

1

x

. (1.19)

The derivative of the exponential function is the exponential function, and

the derivative of the natural logarithm function is 1/x.

Other Bases

Finally, let us relate the exponential and logarithm functions for diﬀerent

bases to those with base e. For any positive number a (a = 1),

Theorem 1.36.

a

x

= e

x lna

and log

a

x =

ln x

ln a

.

These identities follow from the exponential laws and the laws of loga-

rithms.

1.11 Diﬀerentiability on Closed Intervals

In Deﬁnition 1.22 we deﬁned what it means that a function is diﬀerentiable

on an open set. There are situations in which one would like to apply the

notion of diﬀerentiability to functions with other kinds of domains. Let us

formalize the idea of extending functions.

Deﬁnition 1.37. Suppose that I and J are subsets of the real line R and

I ⊆ J, that I is the domain of a function f, and that J is the domain of

a function F. We call F an extention of f if it agrees with f on I, i.e.,

F(x) = f(x) for all x ∈ I.

Deﬁnition 1.38. A function f is said to be diﬀerentiable on a subset I

of R if it extends to a diﬀerentiable function F on an open set. We set

f

(x) = F

(x) for all x ∈ I.

Without some restrictions on I, a function may be diﬀerentiable without

the derivative being well deﬁned. The least technical and for our purposes

suﬃcient solution is captured in

Proposition 1.39. Suppose the function f is deﬁned on an interval I, the

interval is neither empty nor a single point, and f extends to a diﬀerentiable

function F on an open interval containing I, then f

(x) = F

(x) is unique

for all x ∈ I.

1.12. OTHER NOTATIONS FOR THE DERIVATIVE 21

We are mostly concerned with deﬁning diﬀerentiability for functions

whose domain is a closed interval [a, b], where a < b. Some authors use

one-sided limits and one-sided derivatives to contemplate derivatives at the

end points of the interval. Our discussion is less painful, and it lends itself

more to generalizations in higher dimensions.

Let us discuss two examples. The function f(x) = x

2

with domain [0, 1]

is diﬀerentiable. It extends to the diﬀerentiable function F(x) = x

2

with

the open set (−∞, ∞) as its domain. In contrast, the function g(x) =

√

x

is not diﬀerentiable on the interval [0, ∞). The only sensible candidate for

the tangent line to the graph of g(x) at the point (0, 0) is a vertical line.

The slope of this line is not a real number and we do not have a derivative.

(The function g(x) is diﬀerentiable if we use (0, ∞) as domain.)

1.12 Other Notations for the Derivative

There are diﬀerent notations for the derivative of a function. Physicists will

indicate a derivative with respect to time by a dot. E.g., if x is a function

of time, then they will write ˙ x(t) instead of x

**(t). Leibnitz’ notation for the
**

derivative of a function f of a variable x is

df

dx

. We will use it frequently. Ex-

pressing the derivatives of the exponential and natural logarithm functions

this way (see (1.19)) we have:

If y(x) = e

x

, then

dy

dx

= y = e

x

, and if y(x) = ln x, then

dy

dx

=

1

x

.

This notation is not always speciﬁc enough. The expression dy/dx stands

for the derivative of y with respect to x, and that is a function. The ex-

pression does not tell where dy/dx is evaluated. To be speciﬁc about this

aspect, it makes sense to write (compare Example 1.31):

If y(x) = sinx, then

dy

dx

(x) = cos x.

In this notation x plays two roles. It is the name of the variable of y as

well as the name of the variable of the derivative of y. This in acceptable

because it won’t lead to confusion. Instead of

df

dx

(x) we also write

d

dx

f(x).

This is particularly convenient if f stands for a larger expression as in

d

dx

sin x = cos x or

d

dx

e

x

= e

x

.

22 CHAPTER 1. BASIC CONCEPTS

1.13 Rules of Diﬀerentiation

We discuss formulas for calculating the derivative of a composite function

from the derivatives of its constituents. These formulas, together with the

knowledge of the derivatives of some basic functions, turn the process of

diﬀerentiation for many functions into an algorithm, a rather mechanical

process. You can do it even on the computer, which means that no “under-

standing” is required. You are expected to learn the basic rules, be able to

apply the accurately, and practice many examples. In the last section of this

chapter we summarize the computational results of this section. We collect

the rules established in this section and tabulate the derivatives of many of

the important functions which we considered.

1.13.1 Linearity of the Derivative

Diﬀerentiation is compatible with addition of functions and multiplication

with a constant. In a more mathematical language one says that diﬀeren-

tiation is linear. Let f and g be functions, and assume that both of them

are diﬀerentiable at x. Let c be a real number. Then f + g and cf are

diﬀerentiable at x and their derivatives are given by

(f +g)

(x) = f

(x) +g

(x) and (cf)

(x) = cf

(x). (1.20)

In Leibnitz’ notation this reads

d

dx

(f +g)(x) =

df

dx

(x) +

dg

dx

(x) and

d

dx

(cf)(x) = c

df

dx

(x). (1.21)

In words, the derivative of a sum of functions is the sum of the derivatives,

and the derivative of a multiple of a function is the multiple of the derivative.

Example 1.40. Diﬀerentiate

h(x) = x

2

+ 3e

x

.

Solution: Set f(x) = x

2

, g(x) = e

x

and c = 3. Then h(x) = f(x) + 3g(x).

Previously we found that f

(x) = 2x and that g

(x) = e

x

, see (1.19). We

conclude that

h

(x) =

dh

dx

(x) = 2x + 3e

x

. ♦

1.13. RULES OF DIFFERENTIATION 23

Example 1.41. Diﬀerentiate log

a

x, the logarithm functions for an arbi-

trary positive base a, a = 1.

Solution: Recall that log

a

x =

ln x

ln a

, see Theorem 1.36. In this sense

log

a

x = cf(x) where c = 1/ ln a and f(x) = ln x. We stated previously that

ln

**x = 1/x, see (1.19). Using the linearity of the derivative, we ﬁnd
**

log

a

x =

d

dx

_

ln x

ln a

_

=

1

ln a

ln

x =

1

ln a

×

1

x

=

1

xln a

. ♦

Suppose f and g are deﬁned and diﬀerentiable on an set. Thinking of f

and g more as functions, and not so much as functions evaluated at a point,

we may omit (x) from the notation. Then the diﬀerentiation rules are

(f +g)

= f

+g

or

d

dx

(f +g) =

df

dx

+

dg

dx

(1.22)

and

(cf)

= cf

or

d

dx

(cf) = c

df

dx

. (1.23)

Example 1.42. Find the derivative of an arbitrary polynomial.

Solution: A polynomial is a ﬁnite sum of multiples of non-negative

powers of the variable, i.e., a function of the form

f(x) = a

n

x

n

+a

n−1

x

n−1

+· · · +a

1

x +a

0

,

where the a

i

are constants. Using Example 1.28 and the linearity of the

derivative we see right away that

f

(x) = na

n

x

n−1

+ (n −1)a

n−1

x

n−2

+· · · +a

1

.

Here is a speciﬁc example, a special case of the formula which we just

derived.

If f(x) = 4x

5

−3x

2

+ 4x + 5, then f

(x) = 20x

4

−6x + 4. ♦

1.13.2 Product and Quotient Rules

Next we state the product and the quotient rule. They allow us to calculate

the derivatives of products and quotients of functions. Again, let f and g

be functions, and assume that both of them are diﬀerentiable at x. For the

quotient rule assume in addition that g(x) = 0. Then the product fg and

the quotient f/g are diﬀerentiable at x and their derivatives are given by

24 CHAPTER 1. BASIC CONCEPTS

(fg)

(x) = f

(x)g(x) +f(x)g

(x) (1.24)

_

f

g

_

(x) =

f

(x)g(x) −f(x)g

(x)

[g(x)]

2

. (1.25)

In Leibnitz’ notation these formulas become

d

dx

(fg)(x) =

df

dx

(x)g(x) +f(x)

dg

dx

(x) (1.26)

d

dx

_

f

g

_

(x) =

df

dx

(x)g(x) −f(x)

dg

dx

(x)

[g(x)]

2

. (1.27)

Example 1.43. Diﬀerentiate the function h(x) = x

2

ln x.

Solution: Write h(x) = f(x)g(x) with f(x) = x

2

and g(x) = ln x.

Then f

(x) = 2x and g

**(x) = 1/x, see (1.19). Putting this into the product
**

formula yields

h

(x) = f

(x)g(x) +f(x)g

(x) = 2xln x +x

2

1

x

= x(2 ln x + 1). ♦

Example 1.44. Find the derivative of the rational function.

r(x) =

x

2

−5

x

3

+ 1

.

Solution: We set p(x) = x

2

− 5 and q(x) = x

3

+ 2. Then p

(x) = 2x

and q

(x) = 3x

2

. According to the quotient rule

r

(x) =

2x(x

3

+ 1) −(x

2

−5)3x

2

(x

3

+ 1)

2

=

−x

4

+ 15x

2

+ 2x

(x

3

+ 1)

2

. ♦

Example 1.45. The formula

d

dx

x

n

= nx

n−1

for all integer powers n. If n ≤ −1, then we domain of the function is R\{0},

the real line with the origin removed.

Solution: We veriﬁed this formula for n ≥ 0 in Example 1.28. Let n be

a negative integer and m = −n. Then

d

dx

x

n

=

d

dx

_

1

x

m

_

=

0 · x

m

−1 · mx

m−1

x

2m

=

−m

x

m+1

= nx

n−1

.

1.13. RULES OF DIFFERENTIATION 25

Example 1.46. Find the derivative of

f(x) = tan x.

Solution: We express f(x) as a quotient of two functions, f(x) =

sin x/ cos x, and apply the quotient rule. Use that sin

**x = cos x (see Exam-
**

ple 1.31) and cos

**x = −sinx (see Exercise 2 on page 17). We ﬁnd
**

tan

x =

sin

xcos x −sin xcos

x

cos

2

x

=

cos

2

x + sin

2

x

cos

2

x

=

1

cos

2

x

= sec

2

x.

(1.28)

Some books and computer programs will give this result in a diﬀerent form.

Based on the relevant trigonometric identity, they write

tan

x = 1 + tan

2

x. (1.29)

That draws our attention to the fact that the function f(x) = tan x satisﬁes

the diﬀerential equation

f

(x) = 1 +f

2

(x). ♦

Example 1.47. Diﬀerentiate the function

f(x) = sec x.

Solution: We write the function as a quotient: f(x) = 1/ cos x. The

function is deﬁned for all x for which cos x = 0, i.e., for x not of the form

nπ + 1/2, where n is an integer. We apply the quotient rule, using that

cos

**x = −sin x (see Exercise 2 on page 17), and that the derivative of a
**

constant vanishes. We ﬁnd

sec

x =

sin x

cos

2

x

=

sin x

cos x

·

1

cos x

= tan xsec x. ♦ (1.30)

Suppose f and g are deﬁned and diﬀerentiable on an open set. Thinking

of f and g again more as functions, and not so much as functions evaluated

at a point, we may once more omit (x) from the notation. Then the product

rule and quotient rule become

(fg)

= f

g +fg

or

d

dx

(fg) =

df

dx

g +f

dg

dx

(1.31)

and, wherever g(x) = 0,

_

f

g

_

=

f

g −fg

g

2

or

d

dx

_

f

g

_

=

df

dx

g −f

dg

dx

g

2

. (1.32)

Here g

2

is the square of the function g, given by g

2

(x) = [g(x)]

2

.

26 CHAPTER 1. BASIC CONCEPTS

1.13.3 Chain Rule

Let f and g be functions, and suppose that the domain of f contains the

range of g, so that the composition (f ◦ g)(x) = f(g(x)) is deﬁned for all x

in the domain of g. Set h = f ◦ g, so that h(x) = f(g(x)). The chain rule

says that whenever g is diﬀerentiable at x and f is diﬀerentiable at g(x),

then h(x) is diﬀerentiable at x and

h

(x) = (f ◦ g)

(x) = f

(g(x))g

(x). (1.33)

In Leibnitz’ notation the chain rule says that

dh

dx

(x) =

d

dx

f(g(x)) =

df

du

(g(x))

dg

dx

(x). (1.34)

Example 1.48. Diﬀerentiate the function

h(x) = e

x

2

+1

.

Solution: We write h = f ◦ g as a composition of two functions, with

g(x) = x

2

+ 1 and f(u) = e

u

. Remember that f

(u) = f(u) = e

u

and

g

(x) = 2x. In particular, f

(g(x)) = e

x

2

+1

. The chain rule tells us that

h

(x) = f

(g(x))g

(x) = 2xe

x

2

+1

.

In the last expression we reversed the order of the factors to make the

expression more readable. ♦

Example 1.49. Let u(x) be a diﬀerentiable function.

If f(x) = e

u(x)

then f

(x) = u

(x)e

u(x)

.

Here are some speciﬁc examples:

d

dx

e

2x+5

= 2 e

2x+5

d

dx

e

sin x

= cos x e

sin x

d

dx

e

tan x

= sec

2

x e

tan x

. ♦

Example 1.50. Combining Example 1.45 with the chain rule we ﬁnd

d

dx

u

n

(x) = nu

(x)u

n−1

(x)

1.13. RULES OF DIFFERENTIATION 27

for all integers n, assuming only that u is diﬀerentiable at x and u(x) = 0 if

n ≤ −1.

Solution: Set g(x) = u(x) and f(u) = u

n

. Then

h(x) = f(g(x)) = u

n

(x).

According to Example 1.45, f

(u) = nu

n−1

. The chain rule tells us now that

h

(x) = f

(g(x))g

(x) = n(g(x))

n−1

g

(x) = nu

(x)u

n−1

(x).

We reordered the expressions so that the expression is more readable.

To be speciﬁc, here are concrete examples:

d

dx

(3x + 5)

8

= 8(3x + 5)

8−1

· 3 = 24(3x + 5)

7

d

dx

(x

2

+ 1)

25

= 25(x

2

+ 1)

24

· 2x = 50x(x

2

+ 1)

24

d

dx

tan

3

x = 3 sec

2

xtan

2

x

d

dx

cos

2

x = 2 cos x(−sin x) = −2 cos xsin x

d

dx

sec

5

x = 5 sec

4

xsec xtan x = 5 sec

5

xtan x. ♦

Example 1.51. Diﬀerentiate the function ln |u| for u = 0.

Solution: We asserted that ln

**u = 1/u for positive values of u, see
**

(1.19). So, suppose that u < 0. Then u = −|u| and ln |u| = ln(−u). The

chain rule tells us that, for u < 0,

d

du

ln |u| =

1

|u|

d

du

(−u) = (−1)

1

−u

=

1

u

.

This means that for all non-zero u

d

du

ln |u| =

1

u

. ♦ (1.35)

More generally, envoking the chain rule

d

dx

ln|u(x)| =

u

(x)

u(x)

, (1.36)

assuming that u is diﬀerentiable and nowhere zero on its domain. E.g.,

d

dx

ln |x

2

−4| =

2x

x

2

−4

28 CHAPTER 1. BASIC CONCEPTS

for all x = ±2.

We push matters a bit further. We use the formulae for diﬀerentiating

the exponential and natural logarithm functions. Eventually we will verify

them independently.

Consider a function u which is diﬀerentiable and nowhere zero on its

domain and q any real number. Then

If f(x) = |u(x)|

q

then f

(x) = q

u

(x)

u(x)

|u(x)|

q

. (1.37)

The assertion follows from (1.36), Example 1.49 and the exponential laws.

f

(x) =

d

dx

e

ln f(x)

=

d

dx

e

ln(|u(x)|

q

)

=

d

dx

e

q ln|u(x)|

=

_

d

dx

(q ln |u(x)|)

_

e

q ln |u(x)|

= q

u

(x)

u(x)

|u(x)|

q

.

Here is a concrete example:

d

dx

¸

¸

¸

¸

1

2

−sin x

¸

¸

¸

¸

5

= 5

−cos x

1

2

−sin x

¸

¸

¸

¸

1

2

−sin x

¸

¸

¸

¸

5

whenever sin x = 1/2. Speciﬁcally, we have to exclude all x of the form

π

6

+ 2nπ and

5π

6

+ 2nπ, where n is an arbitrary integer.

For diﬀerentiable functions which are everywhere positive on their do-

main and any real number q the diﬀerentiation formula in (1.37) specializes

to

d

dx

u

q

(x) = qu

(x)u

q−1

(x). (1.38)

For example:

d

dx

(sin x)

1/2

=

cos x

2

√

sin x

for x ∈ (0, π) and

d

dx

(sec

2

x + 5)

π

= 2π sec

2

xtan x(sec

2

x + 5)

π−1

for x ∈ (−π/2, π/2).

1.13. RULES OF DIFFERENTIATION 29

Using the tricks from above, we get the following derivatives:

d

dx

a

x

= a

x

ln a (Assume a > 0. Hint: a

x

= e

x lna

)

d

dx

x

x

= (1 + ln x)x

x

(Assume x > 0, x = 1. Hint: x

x

= e

x lnx

)

d

dx

x

sinx

=

_

sin x

x

+ cos xln x

_

x

sinx

(Assume x ∈ (0, π/4)).

To diﬀerentiate a composition of more than two diﬀerentiable functions

we apply the chain rule repeatedly. E.g.,

d

dx

f(g(h(x))) = f

(g(h(x))

d

dx

g(h(x)) = f

(g(h(x)) · g

(h(x)) · h

(x).

For example

d

dx

e

√

x

2

+1

= e

√

x

2

+1

·

1

2

√

x

2

+ 1

· 2x =

xe

√

x

2

+1

√

x

2

+ 1

d

dx

tan

3

(5x

2

−x + 5) = 3 tan

2

(5x

2

−x + 5) sec

2

(5x

2

−x + 5) · (10x −1)

1.13.4 Hyperbolic Functions

The exponential function may be used to deﬁne the hyperbolic sine and

cosine.

sinhx =

1

2

_

e

x

−e

−x

¸

& cosh x =

1

2

_

e

x

+e

−x

¸

(1.39)

You are invited to verify that

cosh

2

x −sinh

2

x = 1.

Conversely, one can show that any point (u, v) on the hyperbola

u

2

−v

2

= 1

can be expressed as (±cosh x, sinh x) for some x ∈ (−∞, ∞). These obser-

vations motive the attribute ‘hyperbolic’.

It is elementary to compute the derivatives of the hyperbolic functions:

sinh

x = cosh x and cosh

x = sinhx.

30 CHAPTER 1. BASIC CONCEPTS

One may also deﬁne other hyperbolic functions

tanh x =

sinhx

cosh x

, coth x =

cosh x

sinhx

, sech x =

1

cosh x

, and csch x =

1

sinh x

.

As a routine application of the rules of diﬀerentiation, you may calculate

the derivatives of these functions. There are identities for these hyperbolic

functions, comparable to the identities for the trigonometric functions. You

can ﬁnd them in any table of mathematical formulas, or you can work them

out yourself.

1.13.5 Derivatives of Inverse Functions

Let us recall. Two functions f and g are said to be inverses of each other

(or each function is the inverse of the other one) if the domain of f is equal

to the range of g, the domain of g is equal to the range of f, and

g(f(x)) = x and f(g(y)) = y (1.40)

for all x in the domain of f and all y in the domain of g. A few essential

properties of inverse functions are listed in

Proposition 1.52. Suppose f and g are inverses of each other.

1. The graph of g is obtained from the graph of f by reﬂection at the

diagonal.

2. If f is increasing, then so is g. If f is decreasing, then so is g.

3. If f is continuous, then it is monotonic (increasing or decreasing) on

any interval in its domain.

4. If f is continuous and I is an interval in the domain of f, then J =

f(I), the image of I under the map f, is an interval. If I is an open

interval, then J is an open interval.

Some parts of this proposition are elementary, others are consequences

of the intermediate value theorem.

For example, the function f(x) = cos x maps the interval [0, π] to the

inteval [−1, 1]. The function f(x) = e

x

maps the interval (−∞, ∞) to the

interval (0, ∞). The function tan x maps the interval (−π/2, π/2) to the

interval (−∞, ∞). It is customary to deﬁne its inverse arctan x as a func-

tion from (−∞, ∞) to (−π/2, π/2). The function sin x maps the interval

[−π/2, π/2] to the interval [−1, 1]. Its inverse arcsin x is typically used with

1.13. RULES OF DIFFERENTIATION 31

-1.5 -1 -0.5 0.5 1 1.5

-1

-0.5

0.5

1

Figure 1.9: sin x on [−π/2, π/2]

-1 -0.5 0.5 1

-1.5

-1

-0.5

0.5

1

1.5

Figure 1.10: arcsin y on [−1, 1]

domain [−1, 1], and its range is [−π/2, π/2]. You see the graph of these two

functions in Figures 1.9 and 1.10.

The relation between the derivative of a function and its inverse is spelled

out in our next theorem.

Theorem 1.53. Let f be a diﬀerentiable and invertible function which is

deﬁned on an open interval (a, b), and denote the image of f by (A, B).

Denote the inverse of f by g. Then g is diﬀerentiable at all points y ∈ (A, B)

for which f

**(g(y)) = 0. For these values of y and for x such that f(x) = y
**

the derivative is given by:

g

(y) =

1

f

(g(y))

or g

(f(x)) =

1

f

(x)

.

Proof. We will not give a formally complete proof of the diﬀerentiability

assertion. Still, if the line t(x) is close to the graph of the function f(x) at

the point (x, f(x)) and y = f(x), then its reﬂection T(x) at the diagonal

is close to the graph of the function g(x) at the point (f(x), x) = (y, g(y)).

We need that T(x) is not vertical, and this is assured by the assumption

that t(x) is not horizontal. With the role of x and y being interchanged,

the slope of t(x) is the reciprocal of the slope of T(x). This provides the

formula for the derivative. Actually, this is also easy to calculate.

By deﬁnition we have f(g(y)) = y for all y ∈ (A, B). Diﬀerentiate both

sides of the equation. We ﬁnd

f

(g(y))g

(y) = 1 and g

(y) =

1

f

(g(y))

,

32 CHAPTER 1. BASIC CONCEPTS

as claimed. If y = f(x), then g(y) = g(f(x)) = x, and we obtain the second

version of the formula for the derivative of the inverse of the function:

g

(f(x)) =

1

f

(x)

.

We apply the theorem to ﬁnd some important derivatives.

Example 1.54. Assume that the natural logarithm function is diﬀeren-

tiable and that ln

**x = 1/x, as asserted in (1.19). Show that the exponential
**

function is diﬀerentiable and that

d

dy

e

y

= e

y

.

Solution: By deﬁnition, the exponential function is the inverse of the

natural logarithm function ln. Set f(x) = ln x and g(y) = e

y

in Theo-

rem 1.53. We note that ln

**(x) = 0 for all x in (0, ∞), the domain of the
**

natural logarithm. The theorem says that the exponential function is dif-

ferentiable and provides the formula for the derivative:

d

dy

e

y

=

1

ln

(e

y

)

=

1

1/e

y

= e

y

,

as claimed. ♦

Example 1.55. Show that the function g(y) = arctan y (the inverse of

f(x) = tan x) is diﬀerentiable, and that

d

dy

arctan y =

1

1 +y

2

.

According to standard conventions we use (−∞, ∞) as the domain and

(−π/2, π/2) as the range for arctan.

Solution: The function f(x) = tan x is diﬀerentiable on its entire do-

main, and f

(x) = sec

2

x is nowhere zero. Theorem 1.53 tells us that

g(y) = arctan y is diﬀerentiable on its entire domain (−∞, ∞). The the-

orem also provides us with the formula for the derivative:

arctan

(y) =

1

tan

(arctan y)

=

1

sec

2

(arctan y)

= cos

2

(arctan y).

All we need to do now is to ﬁgure out what cos

2

(arctan y) is. To do this

we draw a triangle in which we identify the available data. We refer to the

notation in Figure 1.11.

1.13. RULES OF DIFFERENTIATION 33

y

1

u

A B

Figure 1.11: An informative triangle

There you see a rectangular triangle, the right angle is at the vertex

B. The angle at the vertex A is called u. The adjacent side to this angle is

chosen to be of length 1, and the opposing side of length y. So, by deﬁnition,

tan u = y and arctan y = u.

By the theorem of Pythagoras, the length of the hypotenuse is

_

1 +y

2

.

Then

cos u =

1

_

1 +y

2

and cos

2

(arctan y) =

1

1 +y

2

.

The conclusion is that

arctan

(y) =

1

1 +y

2

. (1.41)

This is exactly what we claimed. ♦

Combined with the chain rule, and assuming the diﬀerentiability of u(x),

we ﬁnd a slightly more general formula:

d

dx

arctan(u(x)) =

u

(x)

1 +u

2

(x)

. (1.42)

34 CHAPTER 1. BASIC CONCEPTS

For example:

d

dx

arctan(x

2

+ 5) =

2x

1 + (x

2

+ 5)

2

d

dx

arctan(sin x) =

cos x

1 + sin

2

x

.

The reader is invited to verify the formulas for the other inverse trigono-

metric functions arcsin x, arccos x, arccot x, and arcsec x as they are given

in Table 1.3 on page 63. For example

Exercise 3. It is customary to think of arcsin x as a function from [−1, 1]

to [−π/2, π/2]. Show that arcsin x is diﬀerentiable on (−1, 1), and that its

derivative is

d

dx

arcsin x =

1

√

1 −x

2

.

We may once more improve on this formula. Let u(x) be a diﬀerentiable

function which is deﬁned on an open interval, and suppose that |u(x)| < 1.

Then, using the chain rule, we ﬁnd that

d

dx

arcsin(u(x)) =

u

(x)

_

1 −u

2

(x)

. (1.43)

For example:

d

dx

arcsin(3x) =

3

√

1 −9x

2

if x ∈ (−1/3, 1/3)

d

dx

arcsin(x

2

) =

2x

√

1 −x

4

if x ∈ (−1, 1)

1.13.6 Implicit Diﬀerentiation

Until now we considered functions which were given explicitly. I.e., we were

given an equation y = f(x), where f(x) is some instruction which assigns a

value to x. The points on the graph of f are the points which satisfy the

equation. Consider the equation

(x

2

+y

2

)

2

= x

2

−y

2

. (1.44)

The solutions of this equation form a curve

5

in the plane called a lemniscate,

see Figure 1.12. Parts of this curve look like the graph of a function, such

1.13. RULES OF DIFFERENTIATION 35

-1 -0.5 0.5 1

-0.3

-0.2

-0.1

0.1

0.2

0.3

Figure 1.12: Lemniscate

as the points for which y ≥ 0. Without solving the equation for y, we still

like to calculate the slope of curve at one of its points. This process is called

implicit diﬀerentiation.

Let us start out with an example which we have studied before.

Example 1.56. The unit circle consists of all points which satisfy the equa-

tion x

2

+y

2

= 1. Find the slope of the tangent line to the unit circle at the

point (1/2,

√

3/2).

Solution: We write y = y(x) to emphasize that y as a function of x.

Diﬀerentiating both sides of the equation of the circle we get

2x + 2y

dy

dx

= 0 or

dy

dx

=

−x

y

.

Plugging in the coordinates of the speciﬁed point, we ﬁnd that

dy

dx

¸

¸

¸

¸

(1/2,

√

3/2)

=

−1

√

3

.

We used a diﬀerent way to indicate at which point we evaluate the derivative

because we had to specify the x and the y coordinate of the point. ♦

5

We will rely on the readers intuitive idea of a curve in the plane.

36 CHAPTER 1. BASIC CONCEPTS

Example 1.57. Find the slope of the tangent line to the lemniscate

(x

2

+y

2

)

2

= x

2

−y

2

,

and ﬁnd the coordinates of the points where the tangent line is horizontal.

Solution: You see a picture of the lemniscate in Figure 1.12. As in

Example 1.56, we consider y as a function of x and diﬀerentiate both sides

of the equation. We ﬁnd

2(x

2

+y

2

)(2x + 2y

dy

dx

) = 2x −2y

dy

dx

.

Bring all terms with a factor dy/dx to the left hand side of the equation and

those without to the right hand side.

(2y(x

2

+y

2

) +y)

dy

dx

= x(1 −2(x

2

+y

2

)).

Finally we get an explicit expression for

dy

dx

in terms of x and y:

dy

dx

=

x(1 −2(x

2

+y

2

))

2y(x

2

+y

2

) +y

=

x(1 −2(x

2

+y

2

))

y(2(x

2

+y

2

) + 1)

.

Given any point (x, y) with y = 0 on the lemniscate, we can plug it into the

expression for

dy

dx

and we get the slope of the curve at this point.

E.g, the point (x, y) = (

1

2

,

1

2

_

−3 + 2

√

3) is a point on the lemniscate,

and at this point the slope of the tangent line is

dy

dx

=

2 −

√

2

√

3

_

−3 + 2

√

3

.

This speciﬁc calculation takes a bit of arithmetic skill and eﬀort to carry

out.

The tangent line is horizontal whenever

dy

dx

= 0. A quick look at Fig-

ure 1.12 tells us that we may ignore points where x = 0 or y = 0. That

means that

dy

dx

= 0 whenever

1 −2(x

2

+y

2

) = 0 or x

2

+y

2

=

1

2

.

Substitute x

2

+ y

2

=

1

2

, and y

2

=

1

2

− x

2

into the equation of the curve.

Then we get an equation in one variable:

1

4

= x

2

−

_

1

2

−x

2

_

or x

2

=

3

8

and y

2

=

1

8

.

1.13. RULES OF DIFFERENTIATION 37

The points at which the tangent line to the lemniscate is horizontal are

(x, y) = (±

√

6

4

, ±

√

2

4

) ≈ (±.6124, ±.3536). ♦

Example 1.58. Suppose you drop a circle of radius 1 into a parabola with

the equation y = 2x

2

. At which points will the circle touch the parabola?

6

-1 -0.5 0.5 1

0.5

1

1.5

2

2.5

3

Figure 1.13: Ball in a Cup.

Solution: You see a picture of the problem in Figure 1.13. The crucial

observation in this example is, that the tangent line to the parabola and the

circle will be the same at the point of contact.

Suppose the coordinates of the center of the circle are (0, a), then its

equation is x

2

+ (y − a)

2

= 1. Diﬀerentiating the equation of the parabola

with respect to x, we ﬁnd that

dy

dx

= 4x. Diﬀerentiating the equation of the

circle with respect to x, we get

2x + 2(y −a)

dy

dx

= 0.

Assuming that

dy

dx

is the same for both curves at the point of contact, we

substitute

dy

dx

= 4x into the second equation. After some impliﬁcations we

6

More sensibly, drop a ball of radius 1 into a cup whose vertical cross section is the

parabola y = 2x

2

.

38 CHAPTER 1. BASIC CONCEPTS

ﬁnd:

x(1 + 4(y −a)) = 0.

The ball it too large to ﬁt into the parabola and touch at (0, 0). So we may

assume that x = 0. Solving the equation 1 + 4(y − a) = 0 for y, we ﬁnd

that the y coordinate of the point of contact is y = a −

1

4

. We substitute

this expression into the equation of the circle and ﬁnd that the x coordinate

of the point of contact is x = ±

√

15

4

. Substituting this into the equation of

the parabola, we ﬁnd that y =

15

8

at the point of contact. In summary, the

circle touches the parabola in the points

(x, y) =

_

±

√

15

4

,

15

8

_

. ♦

Exercise 4. Consider the curve given by the equation

x

3

+y

3

= 1 + 3xy

2

.

Find the slope of the curve at the point (x, y) = (2, −1).

Exercise 5. Consider the curve given by the equation x

2

= sin y. Find the

slope of the curve at the point with coordinates x = 1/

4

√

2 and y = π/4.

Exercise 6. Repeat Example 1.57 with the curve given by the equation

y

2

−x

2

(1−x

2

) = 0. You ﬁnd a picture of this Lissajous ﬁgure in Figure 1.14.

1.14 Related Rates

Many times you encounter situations in which you have two related variables,

you know at which rate one of them changes, and you like to know at which

rate the other one changes. In this section we treat such problems.

Example 1.59. Suppose the radius of a ball changes at a rate of 2 cm/min.

At which rate does its volume change when r = 20 cm?

Solution: Denote the volume of the ball by V and its radius by r. We

use t to denote the time variable. We consider V as a function of r as well

as t. The formula for the volume of a ball is V (r) =

4π

3

r

3

. According the

the chain rule:

dV

dt

=

dV

dr

dr

dt

= 4πr

2

dr

dt

.

With r = 20 and

dr

dt

= 2 we get

dV

dt

= 3200π cm

3

/min. This is the rate at

which the volume of the ball changes with respect to time. ♦

1.14. RELATED RATES 39

-1 -0.5 0.5 1

-0.4

-0.2

0.2

0.4

Figure 1.14: y

2

−x

2

(1 −x

2

) = 0

Example 1.60. Suppose a particle moves on a circle of radius 10 cm and

centered at the origin (0, 0) in the Cartesian plane. At some time the particle

is at the point (5, 5

√

3) and moves downwards at a rate of 3 cm/min. At

which rate does it move in the horizontal direction?

Solution: The equation of the circle is x

2

+y

2

= 100. We consider both

variables, x and y, as functions of the time variable t. Implicit diﬀerentiation

of the equation of the circle gives us the equation

2x

dx

dt

+ 2y

dy

dt

= 0.

In the given situation x = 5, y = 5

√

3, and

dy

dt

= −3. We ﬁnd that

dx

dt

= 3

√

3,

so that the particle is moving to the right at a rate of 3

√

3 cm/min. ♦

Example 1.61. Pressure (P) and volume (V ) of air at room temperature

are related by the equation

7

PV

1.4

= C.

7

Boyle-Mariotte described the relation between the pressure and volume of a gas. They

derived the equation PV

γ

= C. It is called the adiabatic law. The constant γ depends on

the molecular structure of the gas and the temperature. For the purpose of this problem,

we suppose that γ = 1.4 for air at room temperature.

40 CHAPTER 1. BASIC CONCEPTS

Here C is a constant. At some instant t

0

the pressure of the gas is 25 kg/cm

2

and the volume is 200 cm

3

. Find the rate of change of P if the volume

increases at a rate of 10 cm

3

/min.

Solution: We consider P as a function of V . Diﬀerentiation of the

equation yields

dP

dV

V

1.4

+ 1.4PV

.4

= 0 or

dP

dV

= −

1.4P

V

.

According to the chain rule

dP

dt

=

dP

dV

dV

dt

= −

1.4P

V

dV

dt

.

Substituting the given information we ﬁnd that the pressure decreases at a

rate of 1.75 kg/cm

2

sec. ♦

Example 1.62. The mass M of a particle at velocity v, as perceived by an

observer in resting position, is

M(v) =

m

_

1 −v

2

/c

2

,

where m is that mass at rest and c is the speed of light. This formula is from

Einstein’s special theory of relativity. At which rate is the mass changing

when the particle’s velocity is 90% of the speed of light, and increasing at

.001c per second?

Solution: According to our rules of diﬀerentiation

dM

dv

=

mvc

(c

2

−v

2

)

3/2

.

Applying the chain rule and substituting the values, we ﬁnd

dM

dt

=

dM

dv

dv

dt

=

mvc

2

1000(c

2

−v

2

)

3/2

=

9

√

19m

3610

≈ .010867m.

The perceived mass increases at a rate of approximately 1% of its mass at

rest. ♦

Exercise 7. A ladder, 7 m long, is leaning against a wall. Right now the

foot of the latter is 1 m away from the wall. You are pulling the foot of the

ladder further away from the wall at a rate of .1 m/sec. At which rate is

the top of the ladder sliding down the wall?

1.15. EXPONENTIAL GROWTH AND DECAY 41

1.15 Exponential Growth and Decay

An idealistic, but very useful model for population growth is the Malthusian

Law

A

(t) = aA(t). (1.45)

It says that the rate of change of a population is proportional to its size. We

denoted the proportionality factor by a. We saw that the functions A(t) =

Ce

at

are solutions of this equation, and it can be shown that on an interval

any solution is of this form. We also say that A(t) grows exponentially and

a is the relative growth rate.

The equation in (1.45) is an example of a diﬀerential equation, an equa-

tion which involves a function and its derivatives, and the unknown is a

function.

We may specify the value of A at some time t

0

, say A

0

= A(t

0

). Then

we have an initial value problem

A

(t) = aA(t) and A

0

= A(t

0

). (1.46)

Theorem 1.63. On an interval which contains t

0

the function

A(t) = A

0

e

a(t−t

0

)

is the unique solution

8

of the initial value problem in (1.46).

The essential aspects of dealing with (1.46) are addressed in

Example 1.64. Suppose the size of a population of bacteria in a laboratory

experiment is C

1

= 5, 000 at time t

1

= 2 and C

2

= 7, 000 at time t

2

= 5.

Here time is measured in hours since the beginning of the experiment.

1. Find the relative growth rate a of the population.

2. Find the formula for the size of the population at any time t ≥ 0.

3. Predict the size of the population at time t = 10.

4. Find the time at which the population reaches 8, 000.

5. Find the time within which the population doubles

9

.

8

That the function satisﬁes the diﬀerential equation follows from (1.19), which we still

need to prove. The uniqueness assertion follows from Proposition 2.9 on page 68.

9

Note that the doubling time depends only on the relative growth rate a.

42 CHAPTER 1. BASIC CONCEPTS

Solution: We denote the size of the population at time t by A(t). The

theorem tells us that A(t) = A

0

e

a(t−t

0

)

, where A

0

= A(t

0

).

1. To calculate the relative growth rate a observe that

C

2

C

1

=

A

0

e

a(t

2

−t

0

)

A

0

e

a(t

1

−t

0

)

= e

a(t

2

−t

1

)

and ln

_

C

2

C

1

_

= a(t

2

−t

1

).

We ﬁnd that

a =

ln C

2

−ln C

1

t

2

−t

1

=

ln 1.4

3

≈ .11.

The population grows at a rate of about 11% per hour.

2. and 3. The size of the population at any time t ≥ 0 is

A(t) = 5000e

a(t−2)

,

where a is as above. Substituting t = 10 we ﬁnd that A(10) ≈ 12, 264.

4. Suppose the size of the population reaches 8, 000 at time t

1

, then

8000 = 5000e

a(t

1

−2)

or ln(8/5) = a(t

1

−2) or t

1

=

ln(1.6)

a

+ 2 ≈ 6.2

The size of the population reaches 8, 000 about 6.2 hours into the experiment.

5. Suppose at some time t

0

the size of the population is A

0

= A(t

0

) and

T hours later the size of the population is 2A

0

= A(t

0

+T). Then

A(t

0

+T) = A

0

e

aT

= 2A

0

or e

aT

= 2 and aT = ln 2.

Thus the doubling time is T =

ln 2

a

≈ 6.18 hours.

Consider a radioactive substance. Experiments have shown that the

rate at which radioactive decays occur is proportional to the amount of

radioactive material present. This rate is proportional to the rate at which

the amount of the material decreases. Suppose t denotes time and A(t) the

amount of radioactive substance at time t. The experience which we just

described can be expressed as a diﬀerential equation

A

(t) = −kA(t). (1.47)

The minus sign in the equation is included so that k will be positive. The

half-life T of a radioactive substance is the time within which half of it

decays. As in the computation of the doubling time in the previous example,

one ﬁnds

T =

ln 2

k

. (1.48)

1.16. MORE EXPONENTIAL GROWTH AND DECAY 43

In the late 1940ies Willard Libby invented (or discovered) the method of

carbon-14 dating. He was awarded the Nobel price for it. In brief, the idea is

as follows. Carbon-14 occurs naturally in the atmosphere, and the amount

is believed to have been essentially constant for a long time (until recent

nuclear testing). All living organisms absorb it. Within a living organism

there is an equilibrium. The amount which is absorbed equals the amount

which decays. The level of the equilibrium is characteristic for the organism,

or a part thereof (e.g. wood from an oak or a human bone). After death

no more carbon-14 is absorbed, and the carbon-14 which was present at the

time of death decays. The half-life of carbon-14 has been determined to be

about 5568 years. For many organisms one also knows how many carbon-14

decays to expect at the time of death. Measuring the number of decays in

a dead organism allows us to determine the time of death, approximately.

We explain the process in a numerical example.

Example 1.65. Suppose we measure 6.68 carbon-14 decays per minute and

gram in a certain kind of wood at the time of death of the tree. Suppose

dead wood of the same kind shows 1.8 decays per minute and gram. How

long ago did the tree die?

Solution: Let t

0

= 0 be the time of death of the tree, and t

1

the present

time, measured in years. The number of decays to be expected t years after

death is

A(t) = 6.68e

−

ln 2

5568

t

.

We have that A(t

1

) = 1.8. From this we calculate:

ln

1.8

6.68

= −

ln 2

5568

t

1

or t

1

= −

5568

ln 2

ln

_

1.8

6.68

_

≈ 10, 534. (1.49)

The tree died approximately 10, 500 years ago.

1.16 More Exponential Growth and Decay

More generally than in (1.45), consider the diﬀerential equation

f

(t) = af(t) +b, (1.50)

where a and b are constants, and a = 0. A time independent solution (steady

state solution) of this equation is f(t) = −b/a.

44 CHAPTER 1. BASIC CONCEPTS

Theorem 1.66. Functions of the form

f(t) = ce

at

−

b

a

are solutions of the diﬀerential equation in (1.50). Here c denotes an arbi-

trary constant. On an interval every solution of (1.50) is of this form.

We obtain a unique solution if we add an initial condition to the diﬀerential

equation in (1.50).

Theorem 1.67. On an interval which contains t

o

, the function

f(t) =

_

y

0

+

b

a

_

e

a(t−t

0

)

−

b

a

is the unique solution of the initial value problem

f

(t) = af(t) +b and f(t

0

) = y

0

.

Remark 2. It is not hard to verify that the given functions are solutions of

the respective problems. The uniqueness assertion is a minor modiﬁcation

of Proposition 2.9 on page 68.

Let us apply these ideas to solve some problems. The important aspects

are to translate the given information into a mathematical equation. The

rest will be routine calculation.

Example 1.68. On graduation day the balance of your student loan is

$15,000. Interest is added at a rate of .5% per month, and you are repaying

the loan at a rate of $ 200.00 per month. Analyze the future of the loan.

Solution: As variable we use time, denoted by t and measured in

months. We set t = 0 at the time of graduation. This is the time at

which you start to repay the loan. Denote the balance of your loan at time

t by B(t). The balance increases at a rate of .005B(t) due to interest being

added and decreases at a rate of $200.00 per month due to payments which

you make. In summary, we have the initial value problem

B

**(t) = .005B(t) −200 and B(0) = 15, 000.
**

According to Theorem 1.67 the solution of the initial value problem is

B(t) =

_

15, 000 +

−200

.005

_

e

.005t

−

−200

.005

= −25, 000e

.005t

+ 40, 000.

1.16. MORE EXPONENTIAL GROWTH AND DECAY 45

For example, B(T) = 0 if

T =

1

.005

ln

_

40

25

_

≈ 94.

After approximately 94 months (7 years and 10 months) you repaid the

loan. Your total payments were $18,800, so that you paid the principal plus

$3,800 in interest. ♦

Example 1.69. You are absorbing a medication at a rate of 3 mg per hour.

(You can keep this rate constant with a skin patch.) The liver metabolizes

the medication at a rate of 4% per hour. Analyze the amount of medication

in your body at any time.

Solution: We use time as independent variable, denote it by t and

measure it in hours. We denote by t = 0 the time when we start taking

the medication. Let A(t) denote the amount of medication in your body,

measured in milligrams. Then A(t) increases at a rate of 3 mg per hour

because you are taking in medication and at the same time A(t) decreases

at a rate of .04A(t) due to your liver metabolizing the medication. We have

the initial value problem

A

(t) = −.04A(t) + 3 and A(0) = 0.

The solution of this problem is

A(t) = −75e

−.04t

+ 75.

For example, after 12 hours there will be about 28.6 milligram of medication

in your body. It will take slightly more than 40 hours before the amount of

medication in your body reaches 60 milligrams. The steady state solution

of the problem is A(t) = 75. The amount of medication will stabilize at this

amount with time. ♦

Example 1.70 (Newton’s Law of Cooling). Suppose you have an ob-

ject whose temperature is diﬀerent from the temperature of its surround-

ings. With time, the temperature of the object will approach the one of

its surroundings. We discuss how this happens, at least under idealized

circumstances.

Think of a cup of coﬀee. You stir the coﬀee gently so that the tem-

perature in the cup remains homogeneous and almost no energy is added

46 CHAPTER 1. BASIC CONCEPTS

through the process of stirring.

10

Denote the temperature of the coﬀee by

T. It is a function of time, so that we write T(t). Newton’s law of cooling

says that the rate at which the heat is transferred, and with this the rate

of change of temperature of the coﬀee, is proportional to the temperature

diﬀerence. If K is the temperature of the surroundings, then

T

(t) = a(T(t) −K) = aT(t) −aK. (1.51)

Setting b = −aK, this is the diﬀerential equation in (1.50).

Let us work out a numerical example. At time t = 0, just after you

poured the coﬀee into your cup, its temperature is 95 degrees Celsius. Five

minutes later the temperature has dropped to 80 degree, while you stir it

slightly and patiently. The room temperature is 25 degrees.

1. Determine the function T(t).

2. Find t

1

, such that T(t

1

) = 70 degrees Celsius.

Solution: To apply Theorem 1.67, we set t

0

= 0, y

0

= 95, and K = 25.

Note that −b/a = K. Putting all of this into the formula for the solution of

the initial value problem, we get that

T(t) = (95 −25)e

at

+ 25 = 70e

at

+ 25.

To determine a we use that

T(5) = 80 = 70e

5a

+ 25,

and we conclude that a =

1

5

ln

_

55

70

_

≈ −.0482. Using these data, Equa-

tion (1.51) says that the temperature of the coﬀee drops at a rate of about

.048 degrees per minute for each degree of diﬀerence between the tempera-

ture of the coﬀee and the room temperature. Having a numerical value for

a gives us an explicit expression for the temperature T as a function of t:

T(t) = 70e

−.0482t

+ 25.

10

The physics of heat transfer changes substantially if you take a solid object, such

as a turkey in the oven. The temperature in the solid object will not be homogenous,

the outside warms up much faster than the inside. In addition, the speciﬁc heat (the

amount of energy needed to increase the temperature of one unit of the material by one

degree) varies. It is diﬀerent for fat, protein, and bone. Furthermore, the speciﬁc heat

is highly temperature dependent for substances like protein. That means, a in (1.51)

depends on the temperature T. All of this leads to a signiﬁcantly diﬀerent development

of the temperature inside a turkey as you roast it for your Thanksgiving dinner.

1.16. MORE EXPONENTIAL GROWTH AND DECAY 47

We like to ﬁnd out the time t

1

for which

T(t

1

) = 70e

−.0482t

1

+ 25 = 70.

Solving the equation for t

1

, we ﬁnd that t

1

≈ 9.17. That means that the

temperature drops to 70 degrees approximately 9.17 minutes after pouring

it. ♦

Exercise 8. A chemical factory is located on the banks of a river. Down

stream from the factory is a lake, and the river is the only contributor to

the lake. Assume that the amount of water carried by the river is the same

all year around, and the amount of water in the lake is 10 times the amount

of water carried by the river per year. In negotiations which the EPA, the

owner has agreed to an acceptable level of 2.5 mg per m

3

of a pollutant in

the lake. After a major accident the level has risen to 15 mg per m

3

. As a

remedy, the factory owner proposes to reduce the emission of pollution so

that the level of pollutant in the river is only 1.5 mg per m

3

. It is assumed

that the pollutant is distributed uniformly in the lake at any time.

1. Let P(t) denote the amount of pollutant (measured in mg per m

3

) in

the lake at time t. Let t

0

= 0 be the time just after the accident and

at which the clean-up strategy is implemented. State the initial value

problem for P(t).

2. Find the function P(t).

3. At which time will the level of pollution be back to 2.5 mg per m

3

?

Exercise 9. The population of an endangered species of birds on Kauai

decreases at a relative rate of 25% per year. Currently, at time t

0

= 0, the

population is estimated to be 700 birds. A government agency raises the

species in captivity and releases birds into the wild at a rate of 80 birds per

year. Denote the size of the population at time t by P(t), where t denotes

time and is measured in years.

1. State the initial value problem for P(t).

2. Find the function P(t).

3. At which time will the population drop to 500 birds?

4. What is the long term estimate for the population of this species in

the wild?

48 CHAPTER 1. BASIC CONCEPTS

1.17 The Second and Higher Derivatives

Let f(x) be a function which is deﬁned on an open set. If the function is

diﬀerentiable at each point of its domain, then f

**(x) is again a function with
**

the same domain as f(x). We may ask whether the function f

(x) is dif-

ferentiable. Its derivative, wherever it exists, is called the second derivative

of f. It is denoted by f

**(x). This process can be iterated. The deriva-
**

tive of the second derivative is called the third derivative, and denoted by

f

**(x), etc. We will make use of the second derivative. Leibnitz’s notation
**

for the second derivative of a function f(x) is d

2

f/dx

2

. Here is a sample

computation in which you are invited to ﬁll in the details:

d

2

dx

2

e

sin x

=

d

dx

cos xe

sin x

= (−sin x + cos

2

x)e

sin x

.

Exercise 10. Find the second derivatives of the following functions:

(1) f(x) = 3x

3

+ 5x

2

(2) g(x) = sin 5x

(3) h(x) =

_

x

2

+ 2

(4) i(x) = e

5x

(5) j(x) = tan x

(6) k(x) = cos(x

2

)

(7) l(x) = ln 2x

(8) m(x) = ln(x

2

+ 3)

(9) n(x) = arctan 3x

(10) o(x) = sec(x

3

)

(11) p(x) = ln

2

(x + 4)

(12) q(x) = e

cos x

(13) r(x) = ln(tan x)

(14) s(x) = e

x

2

−1

(15) t(x) = sin

3

x.

1.18 Numerical Methods

In this section we introduce some methods for numerical computations.

Their common feature is, that for a diﬀerentiable function we do not make

a large error when we use the tangent line to the graph instead of the graph

itself. This rather casual statement will become clearer when you look at

the individual methods.

1.18.1 Approximation by Diﬀerentials

Suppose x

0

is an interior point of the domain of a function f(x) and f(x)

is diﬀerentiable at x

0

. Assume also that f(x

0

) and f

(x

0

) are known. The

method of approximation by diﬀerentials provides an approximate values

f(x

1

) if x

1

is near x

0

. We use the symbol ‘≈’ to stand for ‘is approximately’.

One uses the formula

f(x

1

) ≈ f(x

0

) +f

(x

0

)(x

1

−x

0

). (1.52)

1.18. NUMERICAL METHODS 49

On the right hand side in (1.52) we have l(x

1

), the tangent line to the graph

of f(x) at (x

0

, f(x

0

)) evaluated at x

1

. In the sense of the deﬁnition of the

tangent line in Section 1.6, f(x

1

) is close to l(x

1

) for x

1

near x

0

.

Example 1.71. Find an approximate value for

3

√

9.

Solution: We set f(x) =

3

√

x, so we are supposed to ﬁnd f(9). Note

that

f

(x) =

1

3

x

−2/3

, f(8) = 2, and f

(8) =

1

12

.

Formula (1.52), applied with x

1

= 9 and x

0

= 8, says that

3

√

9 = f(9) ≈ 2 +

1

12

(9 −8) =

25

12

≈ 2.0833.

Your calculator will give you

3

√

9 ≈ 2.0801. ♦

Example 1.72. Find an approximate value for tan 46

◦

.

Solution: We carry out the calculation in radial measure. Note that

46

◦

= 45

◦

+ 1

◦

, and this corresponds to π/4 + π/180. Use the function

f(x) = tan x. Then f

(x) = sec

2

x, f(π/4) = 1, and f

(π/4) = 2. Formula

(1.52), applied with x

1

= (π/4 +π/180) and x

0

= π/4 says

tan 46

◦

= tan

_

π

4

+

π

180

_

≈ tan

_

π

4

_

+ sec

2

_

π

4

__

π

180

_

= 1 +

π

90

≈ 1.0349.

Your calculator will give you tan 46

◦

≈ 1.0355. ♦

Exercise 11. Use approximation by diﬀerentials to ﬁnd approximate values

for

(1)

5

√

34 (2) tan 31

◦

(3) ln 1.2 (4) arctan 1.1.

In each case, compare your answer with one found on your calculator.

We have been causal in (1.52) insofar as we have not estimated (provided

an upper bound for) the error which we make using the right hand side of

(1.52) instead of of the actual value of the function on the left hand side.

The inequality in Deﬁnition 1.25 on page 12 provides us with an estimate.

According to this slightly more demanding deﬁnition, diﬀerentiability of the

function f(x) means that there exist numbers A and d > 0, such that

|f(x

1

) −[f(x

0

) +f

(x

0

)(x

1

−x

0

)| ≤ A(x

1

−x

0

)

2

whenever |x

1

−x

0

| < d. Thus, if we know A and d, then we can approximate

the error as long as |x

1

−x

0

| < d.

50 CHAPTER 1. BASIC CONCEPTS

Example 1.73. Find an approximate value for sin31

◦

and estimate the

error.

Solution: Set f(x) = sinx. Then f

(x) = cos x, f(π/6) = 1/2, and

f

(π/6) =

√

3/2. Measuring angles in radians we set x

0

= π/6 and x

1

=

π/6 +π/180. Applying the formula in (1.52), we ﬁnd

sin 31

◦

≈ sin

π

6

+

π

180

cos

π

6

=

1

2

_

1 +

√

3

π

180

_

≈ .515115.

The calculator will tell that sin 31

◦

≈ .515038.

From the computation in Example 1.31 on page 17 we also know that we

may use A = 1 and d = π/4 in the diﬀerentiability estimate. The estimate

assures us that the error is at most

(x

1

−x

0

)

2

=

_

π

180

_

2

≤ .000305.

Comparison of the actual and approximate value conﬁrm this. ♦

Example 1.74. Use approximation by diﬀerentials to ﬁnd an approximate

value of

√

10 and give an upper bound for the error.

Solution: We use f(x) =

√

x and x

0

= 9. The f

(x) = 1/(2

√

x),

f(x

0

) = 3, and f

(x

0

) = 1/6. The formula in (1.52) tells us that

√

10 = f(10) ≈ f(9) +f

(9)(10 −9) = 3 +

1

6

≈ 3.16666.

The calculator will give you

√

10 ≈ 3.16228.

For the error estimate we may use

A =

1

2(

√

x

0

)

3

and any d > 0, see (1.16). The estimate assures us that the error is at most

1

2(

√

x

0

)

3

(x

1

−x

0

)

2

=

1

54

.

The actual error is again substantially less than this. ♦

Exercise 12. Use approximation by diﬀerentials to ﬁnd approximate values

for

(1) cos 28

◦

(2)

√

26 (3) sin 47

◦

.

In each case, estimate also the maximal error which you may have made by

using the method of approximation by diﬀerentials.

1.18. NUMERICAL METHODS 51

1.18.2 Newton’s Method

Newton’s method is designed to ﬁnd the zeros of a function. You have learned

how to solve linear and quadratic equations, i.e., ﬁnding the zeros of func-

tions of degree 1 and 2. More sophisticated methods allow you to ﬁnd the

exact solutions of polynomial equations of degree three and four. For poly-

nomials of degree greater or equal to 5 and most other functions there are

no general methods for ﬁnding their roots.

Newton’s method works as follows. Suppose we want to ﬁnd a zero of a

diﬀerentiable function f(x), i.e., we want to ﬁnd some x, such that f(x) = 0.

Suppose that by some means we know that such an x exists, and that x

0

is

not far from x. Then we set

x

1

= x

0

−

f(x

0

)

f

(x

0

)

, x

2

= x

1

−

f(x

1

)

f

(x

1

)

, x

3

= x

2

−

f(x

2

)

f

(x

2

)

, etc. (1.53)

and in general

x

n+1

= x

n

−

f(x

n

)

f

(x

n

)

. (1.54)

Geometry of Newton’s Method: Let us give a geometric explanation

for the formlas. Given any x

0

at which f is deﬁned and diﬀerentiable, we

obtain the tangent line l(x) to the graph of f at this point. Then x

1

, as

given in (1.53), is the point at which l(x) intersects the x-axis. Speciﬁcally,

l(x) = f

(x

0

)(x −x

0

) +f(x

0

), and l(x

1

) = 0 if x

1

= x

0

−

f(x

0

)

f

(x

0

)

.

This means that we accept that the tangent line is close to the graph of

the function, and instead of ﬁnding the zero of the function itself, we ﬁnd

the zero of the tangent line. The process is then iterated.

Let us calculate

√

A, i.e., the positive root of the function f(x) = x

2

−A.

Then f

(x) = 2x, and

x

n+1

= x

n

−

f(x

n

)

f

(x

n

)

= x

n

−

x

2

n

−A

2x

n

=

x

2

n

+A

2x

n

=

1

2

_

x

n

+

A

x

n

_

(1.55)

If we use A = 3 and x

0

= 2, then we ﬁnd

x

1

=

1

2

_

2 +

3

2

_

=

7

4

, x

2

=

1

2

_

7

4

+

12

7

_

=

97

56

and x

3

=

18817

10864

.

We summarize the computation in Table 1.1. In the ﬁrst column you ﬁnd

the subscript n. In the following two columns you ﬁnd the values of x

n

, once

52 CHAPTER 1. BASIC CONCEPTS

expressed as a fraction of integers, once in decimal form. In the last column

you see the square of x

n

. At least x

2

3

is rather close to 3. Your calculator

will give you 1.73205080757 as an approximate value of

√

3. You see that

our value for x

3

is rather precise. In fact, if you carry the calculation one

step further and ﬁnd x

4

, then the accuracy of this approximation of

√

3 will

exceed the accuracy of most calculators. The numbers in the last column

show that we are making rapid progress in ﬁnding a good approximation of

√

3.

n x

n

x

n

x

2

n

0 2 2.0000000000 4.0000000000

1 7/4 1.7500000000 3.0625000000

2 97/56 1.7321428571 3.0003188775

3 18817/10864 1.7320508100 3.0000000085

Table 1.1: The Babylonian Method

More than 4000 years ago the Babylonians used the outermost expres-

sions in (1.55)

x

n+1

=

1

2

_

x

n

+

A

x

n

_

to ﬁnd good approximations of square roots, expressed as rational numbers.

We refer to the described procedure as the Babylonian method.

Let us consider one more example to illustrate Newton’s method. Find

a solution of the equation

xsin x = cos x.

Equivalently, we may say, ﬁnd a root of the function f(x) = xsinx −cos x.

Step 1: Let us make sure that there is a root of the function to be found.

Observe that f(0) = −1 < 0 and f(π/2) = π/2 > 0. The intermediate value

theorem tells us that f(x), as a continuous funtion, has a root in the interval

(0, π/2). Let us call this root x.

Step 2: Let us come up with a ﬁrst guess for a root. Considering the

values of f at the end points of the interval, we guess that x

0

= 1 is not

1.18. NUMERICAL METHODS 53

too far away from the root, which we know to exist by Step 1. Actually

f(1) ≈ .3.

Step 3: Let us improve the guess: Set

x

1

= x

0

−

f(x

0

)

f

(x

0

)

≈ .8645.

Your calculator will tell you that f(x

1

) ≈ .00874. You see that f(x

1

) is

much closer to zero than f(x

0

), and in this sense we expect that x

1

is much

closer to the root x of f(x) than x

0

. We made progress ﬁnding x.

Step 4: Repeat Step 3 and calculate x

2

, x

3

, . . . . The distance between

x and x

n

will decrease rapidly as n increases.

We explained Newton’s method because we want to illustrate the power

of the concept ‘tangent line.’ A full discussion of Newton’s method requires

mathematical tools which are not available to us at this time. In general,

many interesting phenomena can occur. Still, the principle problem is as

follows. Suppose that f(x) is a diﬀerentiable function and f(x) = 0 and x

0

is given. Suppose that x

n

for n ≥ 1 are computed according to (1.54). Do

the x

n

tend (converge) to x, and how fast?

For completeness sake, we give an answer. Consider an interval I =

[x−a, x +a], and suppose that |f

(x)| ≥ m and |f

(x)| ≤ M on I. Suppose

x

n

∈ I and |x −x

n

| <

2m

M

. A theorem from advanced calculus asserts that

|x −x

n+1

| ≤

M

2m

(x −x

n

)

2

. (1.56)

We illustrate the theorem by applying it to the previous example. Ob-

serve that f(.8) < 0 and f(.9) > 0. This tells us that x ∈ [.8, .9]. Let us

set a = .2, so that I ⊂ J = [.6, 1.1]. On J, and with this also on I, we

have that |f

(x)| ≥ m = 1.5 and |f

**(x)| ≤ M = 2.5. You are invited to
**

verify these estimates using technology. In (1.56) we use that

M

2m

< 1. As

ﬁrst guess we used x

0

= 1, so that we know that |x − x

0

| < .2. The quoted

theorem asserts that |x − x

1

| < .04. If we repeat the process, then we see

that |x −x

2

| < .0016 and |x −x

3

| < .00000256. This illustrates that the x

n

approach x rapidly.

There is one feature of Newton’s method which helps. You may say

that with each iteration you make a fresh start, and in this sense previous

round-oﬀ errors don’t carry over.

1.18.3 Euler’s Method

Euler’s method is designed to ﬁnd, by numerical means, an approximate

solution of the following kind of problem:

54 CHAPTER 1. BASIC CONCEPTS

Problem 1. Find a function y(t) which satisﬁes

y

=

dy

dt

= F(t, y) and y(t

0

) = y

0

. (1.57)

Here F(t, y) denotes a given function in two variables, and t

0

and y

0

are

given numbers.

The ﬁrst condition on y in (1.57) is a ﬁrst order diﬀerential equation. It is

an equation which involves a function and its derivative, and the unknown is

the function. The second condition is called an initial condition. It speciﬁes

the value of the function at one point. For short, the problem in (1.57) is

called an initial value problem.

Approach in one step: Suppose you want to ﬁnd y(T) for some T = t

0

.

Then you might try the formula

y(T) ≈ y(t

0

) +y

(t

0

)(T −t

0

) = y

0

+F(t

0

, y

0

)(T −t

0

). (1.58)

The tangent line to the graph of y at (t

0

, y

0

) is

l(t) = y(t

0

) +y

(t

0

)(t −t

0

),

so that the middle term in (1.58) is just l(T). The ﬁrst, approximate equality

in (1.58) expresses the philosophy that the graph of a diﬀerentiable function

is close to its tangent line, at least as long as T is close to t

0

. To get the

second equality in (1.58) we use the diﬀerential equation and initial condition

in (1.57), which tell us that

y

(t

0

) = F(t

0

, y(t

0

)) = F(t

0

, y

0

).

The Logistic Law

The diﬀerential equation in our next example is known as the logistic law

of population growth. In the equation, t denotes time and y(t) the size of

a population, which depends on t. The constants a and b are called the

vital coeﬃcients of the population. The equation was ﬁrst used in popula-

tion studies by the Dutch mathematician-biologist Verhulst in 1837. The

equation reﬁnes the Malthusian law for population growth (see (1.45)).

In the diﬀerential equation, the term ay expresses that population growth

is proportional to the size of the population. In addition, the members of

the population meet and compete for food and living space. The probability

of this happening is proportional to y

2

, so that it is assumed that population

growth is reduced by a term which is proportional to y

2

.

1.18. NUMERICAL METHODS 55

Example 1.75. Consider the initial value problem:

dy

dt

= ay −by

2

and y(t

0

) = y

0

, (1.59)

where a and b are given constants. Find an approximate value for y(T).

Remark 3. An exact solution of the initial value problem in (1.59) is given

by the equation

y(t) =

ay

0

by

0

+ (a −by

0

)e

−a(t−t

0

)

(1.60)

This is not the time to derive this exact solution, though you are invited to

verify that it satisﬁes (1.59). We are providing the exact solution, so that

we can see how well our approximate values match it.

Solution: Setting F(t, y) = ay −by

2

, you see that the diﬀerential equa-

tion in this example is a special case of the one in (1.57). According to the

formula in (1.58) we ﬁnd

y(T) ≈ y

0

+ (ay

0

−by

2

0

)(T −t

0

). (1.61)

We expect a close approximation only for T close to t

0

. ♦

Let us be even more speciﬁc and give a numerical example.

Example 1.76. Consider the initial value problem.

dy

dt

=

1

10

y −

1

10000

y

2

and y(0) = 300. (1.62)

Find approximate values for y(1) and y(10).

Solution: Substituting a = 1/10, b = 1/10000, t

0

= 0, and y

0

= 300

into the solution in (1.61), we ﬁnd that

y(1) ≈ 300 +

_

300

10

−

300

2

10000

_

(1 −0) = 321.

According to the exact solution in (1.60), we ﬁnd that

y(t) =

3000

3 + 7e

−t/10

.

Substituting t = 1, we ﬁnd the exact value y(1) = 321.4; this number is

rounded oﬀ. So, our approximate value is close.

For T = 10 the formula suggests that y(10) ≈ 510. According to the

exact solution for this initial value problem, y(10) = 538.1. For this larger

value of T, the formula in (1.61) gives us a less satisfactory result. ♦

56 CHAPTER 1. BASIC CONCEPTS

Multi-step approach: We like to ﬁnd a remedy for the problem which

we discovered in Example 1.76 for T further away from t

0

. Consider again

Problem 1 on page 54. We want to get an approximate value for y(T). For

notational convenience we assume that T > t

0

. Pick several t

i

between t

0

and T:

t

0

< t

1

< t

2

< · · · < t

n

= T.

Starting out with t

0

and y(t

0

), we use the one step method from above to get

an approximate value for y(t

1

). Then we pretend that y(t

1

) is exact, and we

repeat the process. We use t

1

and y(t

1

) to calculate an approximate value for

y(t

2

). Again we pretend that y(t

2

) is exact and use t

2

and y(t

2

) to calculate

y(t

3

). Iteratively, we calculate [t

i+1

, y(t

i+1

)] from [t

i

, y(t

i

)] according to the

formula in (1.58):

[t

i+1

, y(t

i+1

)] = [t

i+1

, y(t

i

) +F(t

i

, y(t

i

))(t

i+1

−t

i

)] (1.63)

We continue this process until we reach T.

For reasonably nice

11

expressions F(t, y) the accuracy of the value which

we get for y(T) will increase with n, the number of steps we make (at least if

all steps are of the same length). On the other hand, in an actual numerical

computation we also make round-oﬀ errors in each step, and the more steps

we make the worse the result might get. Experience will guide you in the

choice of the step length.

Example 1.77. Consider the initial value problem

dy

dt

=

1

10

y −

1

10000

y

2

and y(0) = 10. (1.64)

1. Apply the multi-step method to ﬁnd approximate values for y(t) at

t

1

= 5, t

2

= 10, t

3

= 15, . . . , t

20

= 100. Arrange them in a table.

2. Graph the points found in the previous step together with the actual

solution of the initial value problem.

Solution: As points in the multi-step process we use

t

0

= 0, t

1

= 5, t

2

= 10, t

3

= 15, t

4

= 20, . . . , t

20

= 100.

11

We do not want to make this term precise, but the F(t, y) in Example 1.75 is of this

kind.

1.18. NUMERICAL METHODS 57

t y(t) & t y(t) & t y(t)

0 10.00 35 153.96 70 857.73

5 14.95 40 219.09 75 918.74

10 22.31 45 304.62 80 956.07

15 33.22 50 410.55 85 977.07

20 49.28 55 531.55 90 988.27

25 72.70 60 656.05 95 994.07

30 106.41 65 768.87 100 997.02

Table 1.2: Solution of Problem 1.77

For each t

i

(0 ≤ i ≤ 19) we use the formula

y(t

i+1

) = y(t

i

) + 5

_

y(t

i

)

10

−

y

2

i

(t

i

)

10000

_

and calculate y(t

1

), y(t

2

), y(t

3

), . . . , y(t

20

) consecutively. We summarize

the calculation in Table 1.2.

In Figure 1.15 you see the graph of the exact solution of the initial value

problem. You also see the points from Table 1.2. The points suggest a graph

which does follow the actual one reasonably closely. But you see that we are

deﬁnitely making errors, and they get worse as t increases

12

. You may try

a shorter step length. The points will follow the curve much more closely if

you use t

1

= 1, t

2

= 2, t

3

= 3, . . . , t

100

= 100 in your calculation. ♦

Steady States: Let us consider some very speciﬁc solutions of our initial

value problem in (1.57):

y

=

dy

dt

= F(t, y) and y(t

0

) = y

0

.

Suppose F(y

0

, t) = 0 for all t. Then the constant function y(t) = y

0

is a

solution of the problem. Such a solution is called a steady state solution.

12

It is incidental that the points eventually get closer to the graph again. This is due

to the speciﬁc problem, and will not occur in general.

58 CHAPTER 1. BASIC CONCEPTS

20 40 60 80 100

200

400

600

800

1000

Figure 1.15: Illustration of Euler’s Method

Example 1.78. Find the steady states of the diﬀerential equation (see

(1.50) in Section 1.16)

f

(t) = af(t) +b. (1.65)

Solution: Apparently, f

**(t) = 0 if and only if f(t) = −b/a. So the
**

constant function f(t) = −b/a is the only steady state of this diﬀerential

equation. ♦

In review of Example 1.68 in Section 1.16, you see that the steady state

in that example is B(t) = 40, 000. I.e., if your loan balance is $40,000.00, the

bank charges you interest at a rate of .5% per month, and you are repaying

the loan at a rate of $ 200.00 per month, then the principal balance of your

account will stay unchanged. Your payments cover exactly the occuring

interest charges.

Example 1.79. For the logistic law (see Equation (1.59))

dy

dt

= F(y, t) = ay −by

2

= y(a −by)

1.18. NUMERICAL METHODS 59

we ﬁnd that F(y, t) = 0 if and only if y = 0 or y = a/b. There are two

steady state solutions: y

u

(t) = 0 and y

s

(t) = a/b.

Let us interpret these steady state solutions for the speciﬁc numerical

values of a = 1/10 and b = 1/10, 000 in Example 1.77. If the initial value

y

0

of the population is positive, then the population size will tend to and

stabilize

13

at y(t) = a/b = 1, 000. In this sense, y

s

(t) = a/b = 1, 000 is a

stable steady state solution. It is also referred to as the carrying capacity.

It tells you which size population of the given kind the speciﬁc habitat will

support.

If the initial value y

0

is negative, then y(t) will tend to −∞ as time

increases. If y

0

= 0, then y(t) will not tend to the steady state y(t) = 0. In

this sense, y(t) = 0 is an unstable steady state. ♦

Exercise 13. Consider the initial value problem

y

(t) = −50 +

1

2

y(t) −

1

2000

y

2

(t) and y

0

= y(0) = 200. (1.66)

To make the problem explicit, you should think of a population of deer in a

protected wildlife preserve. There are no predators. The deer are hunted at

a rate of 50 animals per year. The population has a growth rate of 50% per

year. Reproduction takes place at a constant rate all year round. Finally,

the last term in the diﬀerential equation accounts for the competition for

space and food.

1. Use Euler’s method to ﬁnd the population size over the next 30 years.

Proceed in 1 year steps. Tabulate and plot your results.

2. Guess at which level the population stabilizes.

3. Repeat the ﬁrst two steps of the problem if hunting is stopped.

4. Repeat the ﬁrst two steps of the problem if the initial population is

100 animals.

5. Find the steady states of the original equation in which hunting takes

place. I.e., ﬁnd for which values of y you have that y

= 0? You will

ﬁnd two values. Call the smaller one of them Y

u

and the larger one

Y

s

. Experiment with diﬀerent initial values to see which of the steady

states is stable, and which one is unstable.

13

The common language meaning of these expressions suﬃces for the purpose of our

discussion, and the mathematical deﬁnition of ‘tends to’ and ‘stabilizes at’ only make these

terms precise.

60 CHAPTER 1. BASIC CONCEPTS

Orthogonal Trajectories

Let us explore a diﬀerent kind of application. Suppose we are given a family

F(x, y, a) = 0 of curves. In Figure 1.16 you see a family of ellipses

C

a

: F(x, y, a) = x

2

+ 3y

2

−a = 0. (1.67)

There is one ellipse for each a > 0. We like to ﬁnd curves D

b

which in-

tersect the curves C

a

perpendicularly. (We say that D

b

and C

a

intersect

perpendicularly in a point (x

1

, y

1

), if the tangent lines to the curves at this

point intersect perpendicularly.) We call such a curve D

b

an orthogonal tra-

jectory to the family of the C

a

’s. You also see one orthogonal trajectory in

Figure 1.16.

-6 -4 -2 0 2 4 6

-6

-4

-2

0

2

4

6

Figure 1.16: Orthogonal Trajectory to Level Curves

Let us explain where this type of situation occurs. Suppose the curves

C

a

are the level curves in a crater. Here a represents the elevation, so that

the elevation is constant along each curve C

a

. The orthogonal trajectory

gives a path of steepest descent. A new lava ﬂow which originates at some

point in the crater will follow this path.

Suppose that each ellipse represents an equipotential line of an electro-

magnetic ﬁeld. The orthogonal trajectory provides you with a path which

is always in the direction of the most rapid change of the ﬁeld. A charged

particle will move along an orthogonal trajectory.

1.18. NUMERICAL METHODS 61

Suppose a stands for temperature, so that along each ellipse the tem-

perature is constant. In this case the curves are called isothermal lines

14

.

A heat seeking bug will, at any time, move in the direction in which the

temperature increases most rapidly, i.e., along an orthogonal trajectory to

the isothermal lines.

Suppose a stands for the concentration of a nutrient in a solution. It is

constant along each curve C

a

. On their search for food, bacteria will follow

a path in the direction in which the concentration increases most rapidly.

They will move along an orthogonal trajectory.

Example 1.80. Find orthogonal trajectories for the family of ellipses

C

a

: F(x, y, a) = x

2

+ 3y

2

−a = 0. (1.68)

Solution: Diﬀerentiating the equation for the ellipses, we get

2x + 6y

dy

dx

= 0 or

dy

dx

=

−x

3y

.

The slope of the tangent line to a curve C

a

at a point (x

1

, y

1

) is

−x

1

3y

1

. If

a curve D

b

intersects C

a

in (x

1

, y

1

) perpendicularly, then we need that the

slope of the tangent line to D

b

at this point is

3y

1

x

1

. Thus, to ﬁnd an orthog-

onal trajectory to the family of the C

a

’s we need to ﬁnd functions which

satisfy this diﬀerential equation. If we also require that the orthogonal tra-

jectory goes through a speciﬁc point (x

0

, y

0

), then we end up with the initial

value problem

dy

dx

=

3y

x

and y(x

0

) = y

0

.

This is exactly the kind of problem which we solved with Euler’s method. In

this particular example it is not diﬃcult to ﬁnd solutions for the diﬀerential

equation. They are functions of the form y(x) = bx

3

. The orthogonal

trajectory shown in Figure 1.16 has the equation y = x

3

/25. There is one

orthogonal trajectory which does not have this form, and this is the curve

x = 0.

Let us apply Euler’s method to solve the problem. Let us ﬁnd approxi-

mate values for the initial value problem

dy

dx

=

3y

x

and y(1) =

1

25

.

14

The idea of isothermal lines, and with this the method in all of these applications,

was pioneered by Alexander von Humbold (1769–1859).

62 CHAPTER 1. BASIC CONCEPTS

Use x

0

= 1, x

1

= 1.2, x

2

= 1.4, . . . , x

20

= 5.

We set (x

0

, y

0

) = (1, 1/25) and calculate (x

n

, y

n

) according to the for-

mula

y

n

= y

n−1

+.2

3y

n−1

x

n−1

for n = 1, 2, . . . , 20.

Without recording the results of this calculation, we graphed the points in

Figure 1.16. ♦

Exercise 14. Consider the family of hyperbolas:

C

a

: x

2

−5y

2

+a = 0.

There is one hyperbola for each value of a, only for a = 0 the hyperbola

degenerates into two intersecting lines.

1. Graph several of the curves C

a

.

2. Find the diﬀerential equation for an orthogonal trajectory.

3. Use Euler’s method to ﬁnd points on the orthogonal trajectory through

the point (3, 4). Use the points x

0

= 3, x

1

= 3.2, x

2

= 3.4, . . . ,

x

20

= 7. Plot the points (x

n

, y

n

) in your ﬁgure.

4. Check that the graph of y(x) = bx

−5

is an orthogonal trajectory to the

family of hyperbolas for every b. Determine b, so that the orthogonal

trajectory passes through the point (3, 4), and add this graph to your

ﬁgure.

1.19. TABLE OF IMPORTANT DERIVATIVES 63

1.19 Table of Important Derivatives

f(x) f

(x) Assumptions

x

q

qx

q−1

q a natural number, or x > 0

e

x

e

x

x ∈ (−∞, ∞)

ln |x| 1/x x ∈ (−∞, ∞), x = 0

sin x cos x x ∈ (−∞, ∞)

cos x −sin x x ∈ (−∞, ∞)

tan x sec

2

x all x for which tan x is deﬁned

cot x −csc

2

x all x for which cot x is deﬁned

sec x sec xtan x all x for which sec x is deﬁned

csc x −csc xcot x all x for which csc x is deﬁned

arctan x

1

1+x

2

x ∈ (−∞, ∞)

arcsin x

1

√

1−x

2

x ∈ (−1, 1), arcsin x ∈ (−π/2, π/2)

arccos x

−1

√

1−x

2

x ∈ (−1, 1), arccos x ∈ (0, π)

arccot x

−1

1+x

2

x ∈ (−∞, ∞), arccot x ∈ (0, π)

arcsec x

1

|x|

√

x

2

−1

x < −1 or x > 1, arcsec x ∈ (0, π/2) ∪ (π/2, π)

arccsc x

−1

|x|

√

x

2

−1

x < −1 or x > 1, arcsec x ∈ (−π/2, 0) ∪ (0, π/2)

Table 1.3: Some Derivatives

64 CHAPTER 1. BASIC CONCEPTS

Chapter 2

Global Theory

So far we studied the local behaviour of a function. All concepts related to

the behaviour of a function near a point. In this chapter we will use local in-

formation about a function to draw global conclusions. We will discuss some

uniqueness properties of solutions of diﬀerential equations. Then we discuss

geometric properties of graphs, their monotonicity and concavity. We apply

these ideas to the study of extrema of functions. With this information it is

possible to sketch graphs capturing their essential features.

The fundamental result which allows us to do this is referred to as

Cauchy’s mean value theorem. Augustin-Louis Cauchy (1789–1857) was

one of the great mathematicians of the 19-th century. He made major con-

tributions to make calculus a rigorous mathematical theory.

2.1 Cauchy’s Mean Value Theorem

It is useful to make the following

Deﬁnition 2.1. Let f(x) be a function which is deﬁned on the interval

[a, b]. Then we call

f(b) −f(a)

b −a

the average rate of change of f over the interval [a, b].

For example, the average rate of change of f(x) = x

2

over the interval

[0, 2] is 2. The average rate of change of f(x) = sin x over [0, π/2] is 2/π

and over [0, π] it is 0.

65

66 CHAPTER 2. GLOBAL THEORY

Theorem 2.2 (Chauchy’s Mean Value Theorem). Let f be a real val-

ued function which is deﬁned and continuous on the interval [a, b] and dif-

ferentiable on (a, b), where a < b. Then there exists a number c ∈ (a, b) such

that

f

(c) =

f(b) −f(a)

b −a

.

In words, the theorem asserts that the average rate of change over an

interval is equal to the rate of change at some point in the interval. For

example, the average rate of change of f(x) = x

2

over the interval [−2, 1] is

−1, and f

(−1/2) = −1.

The following special case of the theorem, called Rolle’s theorem (named

after Michel Rolle (1652–1719)), is of particular interest.

Theorem 2.3 (Rolle’s Theorem). Let f be a real valued function which

is deﬁned and continuous on the interval [a, b] and diﬀerentiable on (a, b),

where a < b. If f(a) = f(b), then there exists a number c between a and b

(i.e., a < c < b) such that

f

(c) = 0.

We are not going to say anything about the proof of these two theorems,

except that Cauchy’s theorem and Rolle’s theorem are equivalent (each is

an easy consequence of the other one), and that the proof of both of them

depends heavily on the completeness

1

of the real numbers. We are also not

interested in ﬁnding the points c, as they occur in the two theorems. We

are interested in more general consequences.

Corollary 2.4. Let f be a real valued function which is deﬁned and contin-

uous on an interval I. If f

**(x) = 0 for all interior points x of I, then f is
**

constant on this interval. In other words, there exists a number d such that

f(x) = d for all x ∈ I.

Proof. A diﬀerent formulation of the claim is that f(a) = f(b) for all a,

b ∈ I. We prove this statement using Cauchy’s theorem. If f(a) = f(b),

then a = b and there exists some c ∈ (a, b), such that

f

(c) =

f(b) −f(a)

b −a

= 0.

But this contradicts the assumption that f

**(c) = 0 for all c ∈ I, and the
**

corollary is proved.

1

We discussed this property of the real numbers in Section 1.1.

2.2. UNIQUE SOLUTIONS OF DIFFERENTIAL EQUATIONS 67

We are going to use the following corollary frequently.

Corollary 2.5. Let h and g be functions which are deﬁned and continuous

on an interval I. If h

(x) = g

**(x) for all x ∈ I, then h and g diﬀer by a
**

constant, i.e., there exists a number d such that

h(x) = g(x) +d

for all x ∈ I.

Proof. Apply the previous corollary to f(x) = h(x) −g(x).

Deﬁnition 2.6. Suppose the function f(x) is deﬁned on the interval I. We

call a function F(x) with domain I an antiderivative of f if F

(x) = f(x)

for all x ∈ I.

Using this notion, we can reformulate Corollary 2.5.

Corollary 2.7. Suppose h and g are antiderivatives of a function f, deﬁned

on an interval. Then h and g diﬀer by a constant.

2.2 Unique Solutions of Diﬀerential Equations

Corollary 2.4 implies

Proposition 2.8. If the function F(x) is deﬁned on an interval I and

F

**(x) = 0 for all x ∈ I, then F(x) is constant on I.
**

In other words, on intervals the only solutions of the diﬀerential equation

F

**(x) = 0 are the constant functions.
**

More generally, if you like to ﬁnd all antiderivatives F(x) of a function

f(x) on an interval, then it suﬃces to ﬁnd one antiderivative H(x). Any

antiderivative F(x) is of the form H(x) + c where c is a constant. The

constant c is referred to as integration constant. For the time being you

depend on being able to guess such a function H(x). By diﬀerentiating

H(x) you can check whether you guessed right.

For example, any antiderivative F(x) of the function f(x) = 2x on the

real line (−∞, ∞) is of the form F(x) = x

2

+ c where c is a constant. Any

antiderivative F(x) of the function f(x) = sec

2

x on the interval (−π/2, π/2)

is of the form F(x) = tan x +c.

Typically, the integration constant is determined by an initial condition.

Suppose we like to solve the initial value problem

f

(x) = cos x and f(0) = 1.

68 CHAPTER 2. GLOBAL THEORY

Our ﬁrst conclusion is that f(x) = sin x + c. This follows from the above

because (sin x + c)

**= cos x. Next we substitute x = 0 in the equation.
**

Then we see that f(0) = c = 1. The solution of the initial value problem is

f(x) = sin x + 1.

Of particular importance to our discussion of expenential growth and

decay is

Proposition 2.9. Every solution f(x) of the diﬀerential equation

f

(x) = af(x)

on an interval is of the form f(x) = ce

ax

for some constant c.

Proof. We asserted in (1.19), and will eventually prove, that all functions of

the form f(x) = ce

ax

satisfy the diﬀerential equation. We want to see that

these are the solutions.

Let f(x) be any function which satisﬁes the diﬀerential equation on some

interval. Consider the function

h(x) = f(x)e

−ax

.

As a product of diﬀerentiable functions, h is diﬀerentiable. Its derivative is

h

(x) = f

(x)e

−ax

−af(x)e

−ax

= af(x)e

−ax

−af(x)e

−ax

= 0.

Corollary 2.4 tells us that h(x) is a constant function. Calling the constant

c we ﬁnd that

f(x) = ce

ax

.

This means that all solutions of the diﬀerential equation f

(x) = af(x) are

of the form f(x) = ce

ax

, where c is a constant.

Proposition 2.10. The initial value problem

f

(x) = af(x) and f(x

0

) = C

has a unique solution on an interval containing x

0

. In fact

f(x) = Ce

a(x−x

0

)

.

Proof. By the previous proposition we know that the solution is of the form

f(x) = ce

ax

for some c. Substituting the initial condition we obtain

C = f(x

0

) = ce

ax

0

.

Thus c = Ce

−ax

0

and f(x) = ce

ax

= Ce

−ax

0

e

ax

= Ce

a(x−x

0

)

.

2.3. THE FIRST DERIVATIVE AND MONOTONICITY 69

Remark 4. The uniqueness of the solution of an initial value problem as in

the previous proposition is not only of theoretical importance. Imagine that

you study the growth rate of a strain of bacteria, as we did in Example 1.64

on page 41. Before you can publish your result, it must be certain that your

experiment can be reproduced at a diﬀerent time in a diﬀerent location.

That is a requirement which any experiment in science must satisfy. If there

is more than one mathematical solution to your problem, then you have to

expect that the experiment can go either way, and this would invalidate your

experiment.

2.3 The First Derivative and Monotonicity

One of the interesting properties of a function is whether it is increasing or

decreasing. We might want to ﬁnd out whether the part of a population

which is infected with a disease is increasing or decreasing. We might want

to know how the level of pollution in a body of water is changing. The ﬁrst

derivative of a function gives us information of this kind.

2.3.1 Monotonicity on Intervals

Recall that a function f is called increasing if f(b) > f(a) whenever b > a.

It is called decreasing if f(b) < f(a) whenever b > a. A function is called

monotonic if it is either increasing or decreasing.

Theorem 2.11. Suppose that the function f is deﬁned and continuous on

the interval I.

1. If f

**(x) > 0 for all x ∈ I, then f is increasing on I.
**

2. If f

**(x) < 0 for all x ∈ I, then f is decreasing on I.
**

3. More generally, the conclusions in (1) and (2) still hold if in each

ﬁnite interval J ⊂ I there are only ﬁnitely many points at which the

assumption on f

(x) is not satisﬁed.

2

Proof. We show (1). Let a and b be points in I, and suppose that a < b.

Cauchy’s theorem says that there exists a point c, a < c < b, such that

f

(c) =

f(b) −f(a)

b −a

.

2

It is permissable that f is not diﬀerentiable at a few points in J, or that f

(x) = 0.

It is not possible that f

**(x) < 0 at some point in the interval, and f(x) is increasing on
**

the interval.

70 CHAPTER 2. GLOBAL THEORY

We have that f

**(c) > 0 and b − a > 0, and it follows that f(b) − f(a) > 0.
**

This means that f(b) > f(a). The proof of the second claim is similar. We

leave it and the generalization of both statements to the reader.

For example, log

2

x =

_

ln x

ln 2

_

=

1

x ln2

> 0 for all x ∈ (0, ∞). In the

computation we used (1.19), Theorem 1.36, and that ln2 > 0. It follows

from Theorem 2.11 that log

2

x is increasing on x ∈ (0, ∞). You see part of

the graph of the function in Figure 1.8.

The exponential function exp

a

x = a

x

is increasing on (−∞, ∞) if a > 1

and decreasing if 0 < a < 1. To see this, observe that a

x

= e

x ln a

and

d

dx

a

x

= (ln a)a

x

. Furthermore, a

x

> 0 and lna > 0 if a > 1 and ln a < 0

if 0 < a < 1. Now Theorem 2.11 implies our assertion. You may also

want to have a look at the graph of the exponential function with base 2 in

Figure 1.7.

The function f(x) = 1/x is deﬁned and diﬀerentiable on the set of all

nonzero real numbers, and its derivative is f

(x) = −1/x

2

. In particular

f

**(x) < 0 for all nonzero real numbers. According to Theorem 2.11, f(x)
**

is decreasing on the interval (−∞, 0), and that f(x) is decreasing on the

interval (0, ∞). The function is not decreasing on the union of the two

intervals. The example illustrates that it is crucial in Theorem 2.11 that we

deal with functions which are deﬁned and diﬀerentiable on an interval.

The function f(x) = tan x, deﬁned on (−π/2, π/2), has as its deriva-

tive f

(x) = sec

2

x, and the derivative is positive. Consequently, f(x) is

increasing on (−π/2, π/2). Its inverse g(x) = arctan x, deﬁned on (−∞, ∞),

has as its derivative g

(x) =

1

1+x

2

, which is positive on (−∞, ∞), so that

g(x) = arctan x is increasing on (−∞, ∞). As a general priciple, one may

show that the inverse of an increasing function is increasing.

Example 2.12. For a three dimensional solid we set E = A/V , where

A denotes the surface area and V the volume. For example, for a ball

E(r) = (4πr

2

)/(

4

3

πr

3

) = 3/r, where r denotes the radius. Then E

(r) < 0.

The same principle holds for other shapes, E decreases as we enlarge the

solid without changing its shape. What does this have to do with the size

of animals?

Warm blooded animals living in cold climates need to preserve their body

temperature. The total amount of heat stored in the body is proportional to

the volume, while the heat loss is proportional to the surface area. The ratio

of volume to surface area increases as the animal gets larger, so that for warm

blooded animals it is of advantage to be large if they live in cold climates.

In hot climates they need to give oﬀ heat, so that it is of advantage to be

2.3. THE FIRST DERIVATIVE AND MONOTONICITY 71

small. Natural selection (Darwinism) should favor the larger specimens of

a warm blooded species in a cold climate and smaller ones in a hot climate.

You can observe this phenomenon in real life.

For cold blooded animals the converse holds. They absorbe heat so

that they body reaches a temperature at which they can be active. In cold

climates it helps to be small, because then the surface area is relatively large,

compared to the volume. In hot climates cold blooded animals can aﬀord to

be large, as it is easy to reach and maintain the temperature at which they

can be active. The argument is again consistent with real life.

Needless to say, there are other mechanisms to increase the surface area

of a body than decreasing its size, and the maintenance of the body temper-

ature is only one factor which inﬂuences the size of specimens of a species.

Larger animals need more food, are stronger, cannot hide so well, and are of-

ten less agile. All of these factors need to be taken into account to determine

the optimal size of an animal. ♦

So far we have only discussed examples where we used (1) and (2) of

Theorem 2.11. Let us show how to use the conclusion in (3). To apply it

we need to determine intervals on which a function does not change signs.

We recall a procedure which works well for continuous functions.

Deﬁnition 2.13. Suppose f(x) is a function. We call a point x

0

on the

real line exceptional if either f(x

0

) = 0 or f(x

0

) is not deﬁned.

The following result is an immediate consequence of the Intermediate

Value Theorem, see Theorem 1.16 on page 8. Expressed casually it says

that a continuous function can change signs only at exceptional points.

Proposition 2.14. Suppose f(x) is continuous and f(x) has no exceptional

points in the interval (x

0

, x

1

). Then f(x) > 0 for all points in the interval

(x

0

, x

1

), or f(x) < 0 for all points in the interval (x

0

, x

1

). In particular, if

f(x) is positive at one point in the interval, then it is positive at all points

in the interval. If f(x) is negative at one point in the interval, then it is

negative at all points in the interval.

Example 2.15. For example, consider the function

f(x) =

x

2

(x

2

−4)

x

2

+ 2x −15

=

x

2

(x −2)(x + 2)

(x −3)(x + 5)

.

The zeros of the numerator, and with this the zeros of f(x), are x = 0,

x = 2, and x = −2. The zeros of the denominator, i.e., the points where

f(x) is not deﬁned, are x = 3 and x = −5.

72 CHAPTER 2. GLOBAL THEORY

According to the proposition, the sign of f(x) remains unchanged on

each of the intervals (−∞, −5), (−5, −2), (−2, 0), (0, 2), (2, 3) and (3, ∞).

Counting signs of the factors in the expression for f(x), we see f(x) is

positive on the interval (−∞, −5), negative on (−5, −2), positive on (−2, 0)

and on (0, 2), negative on (2, 3), and positive on (3, ∞). You see that the

sign changes at some, but not all, exceptional numbers. ♦

Exercise 15. Find intervals on which the following functions do not change

signs. Decide whether the functions are positive or negative on these inter-

vals.

(1) f(x) = x

3

−x

2

−5x −3 (2) g(x) =

x

x

3

+ 5x

2

−4x −20

.

We are ready to discuss the monotonicity of functions whose derivative

vanishes at some points.

Example 2.16. Find intervals of monotonicity for the function

f(x) = 3x

2

+ 5x −4.

-3 -2 -1 1 2 3 4

-5

5

10

15

20

25

30

Figure 2.1: A quadratic polyno-

mial, f(x) = 3x

2

+ 5x −4

-2 2 4

-20

-15

-10

-5

5

Figure 2.2: A cubic polynomial,

p(x) = x

3

−3x

2

−9x + 3

Solution: We graphed the function in Figure 2.1. Its derivative is

f

(x) = 6x + 5. In particular, f

(x) > 0 if x ∈ (−5/6, ∞). So f

(x) > 0

for all points x ∈ [−5/6, ∞), except at x = −5/6. Theorem 2.11 (3) says

that f is increasing on the interval [−5/6, ∞). By a similar argument, f is

decreasing on the interval (−∞, −5/6]. ♦

2.3. THE FIRST DERIVATIVE AND MONOTONICITY 73

Example 2.17. Find intervals of monotonicity for the degree three poly-

nomial (for a graph see Figure 2.2)

p(x) = x

3

−3x

2

−9x + 3

Solution: The function is deﬁned and diﬀerentiable on the real line. Its

derivative is

p

(x) = 3x

2

−6x −9 = 3(x

2

−2x −3) = 3(x −3)(x + 1).

Counting the signs of the factors we see that p

(x) is positive on (−∞, −1)

and on (3, ∞). We conclude that p(x) is increasing on the interval [3, ∞)

and that it is increasing on the interval (−∞, −1]. The derivative is negative

on the interval (−1, 3). The theorem implies that p(x) is decreasing on the

interval [−1, 3]. ♦

Example 2.18. Find intervals of monotonicity for the rational function

f(x) =

x

2

+ 3x

x −1

.

Solution: The simpliﬁed expression for the derivative of f is

f

(x) =

(x + 1)(x −3)

(x −1)

2

.

We see that the exceptional points for f

(x) are x = 1, x = −1 and x = 3. We

conclude that f

**(x) does not change signs on the intervals (−∞, −1), (−1, 1),
**

(1, 3), and (3, ∞). Counting the signs of the factors of f

(x), we conclude

that f

(x) > 0 on the intervals (−∞, −1) and (3, ∞), and f

(x) < 0 on the

intervals (−1, 1) and (1, 3). Observe that f(x) is deﬁned and diﬀerentiable

on the entire real line with the only exception of x = 1. We conclude that

f(x) is increasing on the (−∞, −1] and [3, ∞). The function is decreasing

on the intervals [−1, 1) and (1, 3]. ♦

Example 2.19. Find intervals on which the function

f(x) = sin 2x + 2 sin x

is monotonic. Restrict your discussion to the interval [0, 2π].

Solution: We diﬀerentiate the function and rewrite the expression for

the derivative so that it is easier to ﬁnd its exceptional points.

f

(x) = 2 cos 2x + 2 cos x

= 2[2 cos

2

x + cos x −1]

= 4(cos x + 1)

_

cos x −

1

2

_

.

74 CHAPTER 2. GLOBAL THEORY

To see the second equality we used that cos 2x = 2 cos

2

x − 1. Then we

solved the quadratic equation in terms of cos x. We ﬁnd exceptional points

where cos x = −1 (i.e., x = π) and where cos x =

1

2

(i.e., x =

π

3

and x =

5π

3

).

1 2 3 4 5 6

-2

-1

1

2

3

4

Figure 2.3: A function and its derivative.

Observe that f is diﬀerentiable on [0, 2π], and that f

(x) = 0 at the end

points of this interval. This provides us with the intervals [0, π/3), (π/3, π),

(π, 5π/3) and (5π/3, 2π] on which f

**does not change sign. Checking the
**

sign of f

(at one point) in each of the intervals, we ﬁnd that f

(x) > 0 for

x ∈ [0, π/3) and x ∈ (5π/3, 2π], and f

(x) < 0 for x ∈ (π/3, π) and (π, 5π/3).

We conclude that f is increasing on the interval [0, π/3] and [5π/3, 2π]. The

function is decreasing on the interval [π/3, 5π/3], and in this interval there

are three points at which f

(x) is not positive.

You may conﬁrm the calculation by having a look at Figure 2.3. There

you see the graph of the function (solid line) and the graph of its derivative

(dashed line). As you see, wherever f

**(x) is positive, there f(x) is increasing.
**

Wherever f

**(x) is negative, there f(x) is decreasing. ♦
**

Exercise 16. Find intervals on which the function f increases and intervals

on which f decreases. In the last two problems, (g) and (h), restrict yourself

2.3. THE FIRST DERIVATIVE AND MONOTONICITY 75

to the interval [0, 2π].

(a) f(x) = 3x

2

+ 5x + 7

(b) f(x) = x

3

−3x

2

+ 6

(c) f(x) = (x + 3)/(x −7)

(d) f(x) = x + 1/x

(e) f(x) = x

3

(1 +x)

(f) f(x) = x/(1 +x

2

)

(g) f(x) = cos 2x + 2 cos x

(h) f(x) = sin

2

x −

√

3 sin x

2.3.2 Monotonicity at a Point

It is quite natural to ask what it means that a function is increasing at a

point, and how this concept is related to the one of being increasing on an

interval. We address both questions in this subsection.

Deﬁnition 2.20. Suppose f is a function and c is an interior point of its

domain. We say that f is increasing at c if, for some d > 0,

f(x) < f(c) for all x ∈ (c −d, c) and f(x) > f(c) for all x ∈ (c, c +d).

We say that f is decreasing at c if this statement holds with the inequalities

reversed.

Expressed informally, to the left of c the function is smaller and to the

right of c it is larger than at c, at least for a while.

Being increasing or decreasing at a point c is a local property. We are

making a statement about the behavior of the function on some open interval

which contains c. Being increasing on an interval is a global property. For

the global property the interval is given to us. For the local property we may

chose the, possibly rather small, interval. The global property has to hold

for any two points in the given interval. For the local property we compare

f(x) to f(c) where c is ﬁxed and x is any point in an open interval around

c which we may chose.

Theorem 2.21. Suppose f is a function which is deﬁned on an open inter-

val I. Then f is increasing (decreasing) on I if and only it it is increasing

(decreasing) at each point in I.

This theorem establishes the relation between the local and the global

property. The ‘only if’ part is not diﬃcult to show, but the ‘if’ part uses

some deeper facts about ﬁnite closed intervals. Our second result gives us a

valuable tool to detect monotonicity of functions at a point.

76 CHAPTER 2. GLOBAL THEORY

Proposition 2.22. Let f be a function and c an interior point of its do-

main. If f is diﬀerentiable at c and f

(c) > 0, then f is increasing at c. If

f

(c) < 0, then f is decreasing at c.

Remark 5. A function does not have to be diﬀerentiable to be increasing.

Graph the function f(x) = 2x+|x| to convince yourself of this fact. A func-

tion can be diﬀerentiable and increasing at a point x, even if the assumptions

of Proposition 2.22 do not hold, i.e., f(x) = x

3

is increasing at x = 0, but if

f

**(0) = 0. A function can also be increasing at a point x, but there is not
**

open interval which contains x such that the function is increasing on this

interval.

Remark 6. The ideas of of a function being increasing or decreasing at

a point may be generalized to cover domains of functions which are half-

closed or closed intervals, and where we like to make a statement about the

behavior of a function at an endpoint. We have no speciﬁc needs for such

statements, but the motivated reader is encouraged to explore them.

2.4 The Second Derivative and Concavity

We like to capture the property of a graph being bent upwards or downwards.

Secant lines will either be required to lie above or below the graph, and the

rates of change will be either increasing of decreasing. These properties can

be described globally over intervals and locally at points. You may use the

graphs in Figures 2.4 and 2.5 as illustrations of the discussion.

-2 -1 1 2

2

4

6

8

10

Figure 2.4: Concave Up

-2 -1 1 2 3 4

-8

-6

-4

-2

2

4

Figure 2.5: Concave Down

2.4. THE SECOND DERIVATIVE AND CONCAVITY 77

2.4.1 Concavity on Intervals

Let f(x) be a function and let (a, f(a)) and (b, f(b)) be two distinct points

on its graph. The line through these two points is

l(x) = f(a) +

f(b) −f(a)

b −a

(x −a).

If we restrict l(x) to x ∈ [a, b], then we get the secant line through the two

point, i.e., the line segment joining the two points.

Deﬁnition 2.23. Let f be a function which is deﬁned on an interval I.

We say that f is concave up on I if f(c) < l(c) for all a, b in I and

c ∈ (a, b). Here l(x) is the secant line through (a, f(a)) and (b, f(b)). The

inequality expresses that between the points a and b the secant line lies above

the graph. We say that f is concave down on I if f(c) > l(c) for all a, b

in I and c ∈ (a, b). The inequality expresses that between the points a and b

the secant line lies below the graph.

We state a theorem which provides you with assumptions under which a

function is concave up or down. We will not provide a proof of the theorem.

Theorem 2.24. Let f be a function which is deﬁned on an interval I.

1. Suppose that f(x) is diﬀerentiable on I. If f

(x) is increasing on I,

then f(x) is concave up on I. If f

(x) is decreasing on I, then f(x) is

concave down on I.

2. Suppose that f(x) is twice diﬀerentiable

3

on I. If f

(x) > 0 for all x

in I, then f(x) is concave up on I. If f

**(x) < 0 for all x in I, then
**

f(x) is concave down on I.

3. More generally, the conclusions in (2) still hold if in each ﬁnite interval

J ⊂ I there are only ﬁnitely many points at which the assumption

f

(x) > 0, resp. f

(x) < 0, is not satisﬁed.

For example, the function shown in Figure 2.4 is q(x) = x

2

−2x +3. Its

second derivative is q

**(x) = 2 > 0. Theorem 2.24 (2) says that q is concave
**

3

Strictly speaking, so far we can consider being ‘twice diﬀerentiable’ only for functions

which are deﬁned on open intervals. More generally, we proceed as in Section 1.11. We say

that f(x) is twice diﬀerentiable on I, if f(x) extends to a function F(x) which is deﬁned

on an open interval J which contains I, and F(x) is twice diﬀerentiable on J. The second

derivative will be unique at all points in I if I is not empty and does not consist of exactly

one point.

78 CHAPTER 2. GLOBAL THEORY

up on (−∞, ∞). The function shown in Figure 2.5 is g(x) = −x

2

+ 5x −1,

and its second derivative is g

**(x) = −2 < 0. Theorem 2.24 (2) says that q
**

is concave down on (−∞, ∞).

The function ln x is concave down on the interval (0, ∞). To see this, you

may use that ln

(x) = −1/x

2

< 0 on (0, ∞) and apply Theorem 2.24 (2).

Alternatively, you may note that the derivative ln

x = 1/x is decreasing on

(0, ∞) and apply Theorem 2.24 (1). The exponential function exp(x) = e

x

is

concave up on (−∞, ∞). To see this, you may note that exp

(x) = exp(x) >

0 and apply Theorem 2.24 (2). You may also use that exp

(x) is increasing

on the real line, and then quote Theorem 2.24 (1) to derive the desrired

conclusion. Finally, you may observe that a function is concave up if its

inverse is convave down

4

. So, ln x being concave down implies that exp(x)

is concave up.

Let us look at examples where we apply condition Theorem 2.24 (3).

Example 2.25. Study the concavity properties of the function

p(x) = x

3

−3x

2

−9x + 3.

Solution: You ﬁnd the graph of this function in Figure 2.2. Its second

derivative is p

(x) = 6x−6 = 6(x−1). We see that p

(x) > 0 for x ∈ (1, ∞),

and p

(x) < 0 for x ∈ (−∞, 1). This means that p

(x) > 0 for all x ∈ [1, ∞)

with only one exception, x = 1. Theorem 2.24 (3) tells us that p(x) is

concave up on the interval [1, ∞). Similarly, p

(x) < 0 for x ∈ [−∞, 1) with

only one exception, x = 1. One deduces that f(x) is concave down on the

interval (−∞, 1]. ♦

Consider the function tan x. You may verify that tan

x = 2 sec

2

xtan x.

In particular, tan

x < 0 for x ∈ (−π/2, 0) and tan

x > 0 for x ∈ (0, π/2).

Theorem 2.24 (3) implies that tan x is concave down on (−π/2, 0] and con-

cave up on [0, π/2). You may conﬁrm these statements visually by inspecting

a graph of the tangent function. You are invited to study the concavity of

the other trigonometric and hyperbolic functions.

Remark 7. You may consider the spread of a desease. Denote the number

of infected people by I(t). It may be scary if I

(t) > 0, i.e., I(t) increases.

It is worse, and often true in the early stages of an epedemic, if I

(t) > 0.

4

If f and g are inverses of each other, then the graph of one of the functions is obtained

from the one of the other one by reﬂection at the diagonal x = y. In this process, secant

lines which are above the graph turn into secant lines below the graph. Thus, if f is

concave up, then g is concave down, and vice versa.

2.4. THE SECOND DERIVATIVE AND CONCAVITY 79

This means that I

**(t) increases, and the desease spreads at an increasing
**

rate. Medical professional will not necessarily wait for the time when I(t),

the number of infected people, starts decreasing. When I

(t) turns negative,

then I

**(t) decreases. The spread or the desease slows. One may hope that
**

eventually I

**(t) becomes negative, so that the actual number of sick people
**

decreases. The point at which I

**(t) changes signs from being positive to
**

being negative may be considered the turning point in the spread of the

desease. One of the recent presidents was confused by a subtle argument of

this kind

5

.

Let us look at this phenomena in a concrete example. Earlier we con-

sidered the logistic equation

y

= ay −by

2

.

See Example 1.75 and the graph of a solution of this diferential equation in

Figure 1.15. Use implicit diﬀerentiation to ﬁnd the second derivative:

y

= ay

−2byy

= (a −2by)y

.

We see that y

= 0 if y

**= 0 or y = a/(2b). The ﬁrst case occurs if y = 0
**

or y = a/b. We called y = a/b the carrying capacity of the system, and it

was the stable equilibrium point. The inﬂection occurs when y is half the

carrying capacity. As long as y is less than a/(2b), the population grows at

an increasing rate. If a/(2b) < y < a/b, then growth slows. You see the

turning point in the graph in Figure 1.15. For a while the population seems

to explode, but after a while it levels oﬀ so that it does not exceed a the

carrying capacity.

Exercise 17. Find intervals on which the following functions are concave

up, resp., concave down.

1. f(x) = x

3

−4x

2

+ 8x −7

2. g(x) = x

4

+ 2x

3

−3x

2

+ 5x −2

3. h(x) = x + 1/x

4. i(x) = 2x

4

−x

2

5. j(x) = x/(x

2

−1)

6. k(x) = 2 cos

2

x −x

2

for x ∈ [0, 2π].

5

During a televised presidential debate, one of the candidates said (see the New York

Times from October 8th, 1984, page B6): “Some of these facts and ﬁgures just don’t add

up. Yes, there has been an increase in poverty but it is a lower rate of increase than it was

in the preceding years before we got here. It has begun to decline, but it is still going up.”

80 CHAPTER 2. GLOBAL THEORY

2.4.2 Concavity at a Point

The notion of being concave up or down was deﬁned for functions which are

deﬁned on intervals. Still, we got a picture how the function has to look

like near a point, and this is the behavior which we like to capture in a

deﬁnition.

Deﬁnition 2.26. Let f be a function and c an interior point

6

of its do-

main. We say that f is concave up, resp., concave down, at c if there exists

an open interval I and a line l, called a support line, such that l(c) = f(c)

and

f(x) > l(x), resp., f(x) < l(x),

for all x ∈ I with x = c.

0.5 1 1.5 2

1

2

3

4

Figure 2.6: Concave up at •

1 2 3 4 5

-0.5

0.5

1

1.5

2

Figure 2.7: Concave down at •

In other words, we are asking for a line l(x), such that the graph lies

on one side of the graph, at least near c. If the graph is above the line,

then the function is concave up, if it is below, then the function is concave

down. We assume that the graph and the line agree at c. You see this

situation illustrated in two generic pictures in Figures 2.6 and 2.7. One

shows a function which is concave up at the indicated point, one shows a

function which is concave down.

Our next theorem tells us how to detect concavity, and it tells us how

to ﬁnd the support line if the function is diﬀerentiable.

6

The idea of an interior point was deﬁned in Deﬁnition 1.18 on page 10.

2.5. LOCAL EXTREMA AND INFLECTION POINTS 81

Theorem 2.27. Let f be a function and c an interior point of its domain.

1. If f

is increasing at c or if f

(c) > 0, then f is concave up at c.

2. If f

is decreasing at c or if f

**(c) < 0, then f is concave down at c.
**

3. If f is diﬀerentiable and concave up or down at c, then there is only

one support line, and this line is the tangent line to the graph of f at

c.

The sign of the second derivative of a functions tells us whether a function

is concave up or down at a point. If the second derivative is zero, then the

test is inconclusive. The function can be concave up, down, or neither.

In general, there can be many support lines at any given point, but if the

function is diﬀerentiable at c, then the support line is unique. It is the

tangent line. So, for a diﬀerentiable function which is concave up or down

at a point, we can draw the tangent line easily. We just hold the ruler

against the graph.

For example, the function f(x) = x

5

−7x

4

+2x

3

+2x

2

−5x+4 is concave

down at x = 2 because f

(2) = −148 < 0.

To relate concavity properties on an interval to those at each point in

the interval we state, without proof, the following theorem.

Theorem 2.28. Let f be a function which is deﬁned on an open interval

(a, b). Then f is concave up (resp., down) on (a, b) if and only if f is concave

up (resp., down) at each point in (a, b).

2.5 Local Extrema and Inﬂection Points

We are going to discuss two types of points which are particularly important

in the discussion of (graphs of) functions. As we like to apply local properties

of the function, we focus on interior points is the domain of the function.

Deﬁnition 2.29 (Local Extrema). Let f be a function and c an interior

point in its domain

7

. We say that f has a local maximum, resp. minimum,

at c if

f(c) ≥ f(x), resp. f(c) ≤ f(x),

for all x in some open interval I around c. In this case we call f(c) a local

maximum, resp. minimum, of f. A local extremum is a local maximum or

minimum.

82 CHAPTER 2. GLOBAL THEORY

-2 -1 1 2

-2

-1

1

2

3

Figure 2.8: A local minimum

-1 -0.5 0.5 1

-1.5

-1

-0.5

0.5

1

1.5

Figure 2.9: An Inﬂection Point

In other words, f has a local maximum of f(c) at c, if f(c) is the largest

value compared the values at points near c. The function shown in Figure 2.8

has a local minimum at x = −1. We will study tests which allow us ﬁnd local

extrema soon. We do not need any test to see that f(x) = |x| has a local

minimum at x = 0, and f(x) = −(x − 1)

2

has a local maximum at x = 1.

The vertex of a parabola is always a local extremum, a local minimum if the

coeﬃcient of x

2

is positive, and a local maximum if the coeﬃcient of x

2

is

negative.

Deﬁnition 2.30 (Inﬂection Points). Let f be a function and c an inte-

rior point of its domain. We call c an inﬂection point of f if the concavity

of f changes at c. I.e., for some numbers a and b with a < c < b, we have

that f is concave up on the interval (a, c] and concave down on [c, b), or vice

versa.

Soon we will develop tests which detect inﬂections points. No test is

required to see that f(x) = tan x has an inﬂection point at x = 0. The

function is concave down on the interval (−π/2, 0] and concave up on the

interval [0, π/2). So the concavity changes at x = 0 and that means that

there is an inﬂection point at x = 0. You see the graph of this function in

Figure 2.9.

7

According to Deﬁnition 1.18 on page 10 this means that f(x) is deﬁned for all x in

some open interval around c.

2.6. DETECTION OF LOCAL EXTREMA 83

2.6 Detection of Local Extrema

We will discuss how to detect local extrema. The ﬁrst result excludes many

points. Typically, there are very few points where local extrema can occur.

Theorem 2.31. Let f be a function and c an interior point of its domain. If

f is diﬀerentiable at c and f

**(c) = 0, then f does not have a local extremum
**

at c. In other words, if f has a local extremum at c, then f is either not

diﬀerentiable at c or f

(c) = 0.

To have an abbreviation for the points which are recognized as important

in this theorem, it is customary to say:

Deﬁnition 2.32 (Critical Points). Let f be a function and c an interior

point of its domain. We say that c is a critical point of f if f is diﬀerentiable

at c and f

(c) = 0, or if f is not diﬀerentiable at c.

Theorem 2.31 provides us with a necessary condition. If a function has

a local extremum at c, then c is a critical point of the function. No local

extrema can occur at points which are not critical. The test does not give

a suﬃcient condition for a local extremum. If c is a critical point of the

function, then the function need not have a local extremum at c. It makes

sense to introduce one more word.

Deﬁnition 2.33 (Saddle Points). Let f be a function and c an interior

point of its domain. We say that c is a saddle point of f if f is diﬀerentiable

at c and f

**(c) = 0, but f does not have a local extremum at c.
**

Proof of Theorem 2.31. Suppose that f is diﬀerentiable at c and f

(c) > 0.

Proposition 2.22 on page 76 tells us that there exists some positive number

d, such that f(x) < f(c) for all x ∈ (c − d, c), and f(x) > f(c) for all

x ∈ (c, c +d). So, there are points x to the left of and arbitrarily close to c

such that f(x) < f(c), and there are points x to the right of and arbitrarily

close to c such that f(x) > f(c). This means, by deﬁnition, that f does not

have a local extremum at c. If f

**(x) < 0, then the same argument applies
**

with inequalities reversed. If f

(c) = 0, then either f

(c) > 0 or f

(c) < 0,

and in neither case we have an extremum at c.

Neither the exponential function nor the logarithm function have local

extrema. To see this, observe that these functions are diﬀerentiable on their

domain, and their derivatives exp

x = exp x and ln

x = 1/x are every-

where nonzero. These functions have no critical points, and according to

Theorem 2.31 they have no local extrema.

84 CHAPTER 2. GLOBAL THEORY

Example 2.34. Find the local extrema of the function

q(x) = x

2

−2x + 3.

Solution: The function is diﬀerentiable for all real numbers x, and

q

(x) = 2x −2 = 2(x −1).

So q

**(x) = 0 if x = 1. The only point at which we can have a local extremum,
**

i.e., the only critical point, is x = 1. If we write the function in the form

q(x) = (x −1)

2

+ 2,

then we see that q does indeed that a local minimum at x = 1. You should

conﬁrm this result by having a look at Figure 2.10, where this function is

graphed. ♦

-1 1 2 3

2

4

6

8

Figure 2.10: A local minimum

-2 -1 1 2

-8

-6

-4

-2

2

4

6

8

Figure 2.11: A saddle point

Example 2.35. Show that the function g(x) = x

3

has a saddle point at

x = 0.

Solution: The function g(x) is everywhere diﬀerentiable, and its only

critical point is at x = 0, which is the only zero of g

(x) = 3x

2

. Obviously,

g(x) > 0 for all x ∈ (0, ∞) and g(x) < 0 for all x ∈ (−∞, 0). This means

that there is no local extremum at x = 0. As g

(0) = 0 and there is no

local extremum at x = 0, the function has a saddle point at this point. This

saddle point is shown in Figure 2.11. ♦

2.6. DETECTION OF LOCAL EXTREMA 85

Let us formulate a criterion which conﬁrms that a function has a local

extremum at a point c. It gives us a suﬃcient condition for a local extremum

at c.

Theorem 2.36. Suppose c is an interior point of the domain of a function

f, and suppose that for some d > 0 the function is increasing on (c−d, c] and

decreasing on [c, c + d). Then f has a local maximum at c. If the function

is decreasing on (c − d, c] and increasing on [c, c + d), then f has a local

minimum at c.

Taking advantage of the information provided by the ﬁrst derivative, we

obtain the following test.

Theorem 2.37 (First Derivative Test). Suppose f is a function which

is deﬁned and diﬀerentiable on (c − d, c + d) for some d > 0, and c is a

critical point.

1. If f

(x) > 0 for all x ∈ (c − d, c) and f

(x) < 0 for all x ∈ (c, c + d),

then f has a local maximum at c.

2. If f

(x) < 0 for all x ∈ (c − d, c) and f

(x) > 0 for all x ∈ (c, c + d),

then f has a local minimum at c.

3. If f

**(x) > 0 for all x ∈ (c−d, c)∪(c, c+d), then f has a saddle point at
**

c. This conclusion also holds if f

**(x) < 0 for all x ∈ (c−d, c)∪(c, c+d).
**

Let us illustrate the use of the theorem with an example.

Example 2.38. Find the local extrema of the function

f(x) = x

3

−3x

2

+ 2x + 2.

Solution: We diﬀerentiate f(x) and express f

(x) as a product of linear

factors:

f

(x) = 3x

2

−6x + 2 = 3

_

x −

_

1 +

√

3

3

___

x −

_

1 −

√

3

3

__

It is easy to determine where the factors are zero, positive and negative. We

conclude that f

(x) = 0 if x = 1 ±

√

3/3, f

**(x) is positive on the intervals
**

(−∞, 1 −

√

3/3) and (1 +

√

3/3, ∞), and f

**(x) is negative on the interval
**

(1 −

√

3/3, 1 +

√

3/3). You can see graphs of f and f

in Figures 2.12 and

2.13

86 CHAPTER 2. GLOBAL THEORY

0.5 1 1.5 2

1.2

1.4

1.6

1.8

2.2

2.4

Figure 2.12: f(x) = x

3

− 3x

2

+

2x + 2

0.5 1 1.5 2

-1

1

2

3

4

Figure 2.13: f

(x) = 3x

2

−6x +2

The only only critical points of f are at x = 1±

√

3/3, and these are the

only points where a local extremum can occur. Based on the sign of f

(x) on

intervals to the left and right of these two critical points we see that f has

a local maximum at x = 1 −

√

3/3 and a local minimum at x = 1 +

√

3/3.

♦

Exercise 18. Find the local extrema of the following function:

(1) f(x) =

x

2

+ 3x

x −1

(2) g(x) = sin2x + 2 sin x for x ∈ [0, 2π].

Hint: We discussed the monotonicity properties of these functions in Exam-

ples 2.18 and 2.19.

Exercise 19. Find the local extrema of the following functions. In the last

two problems, (g) and (h), restrict yourself to the interval [0, 2π].

(a) f(x) = 3x

2

+ 5x + 7

(b) f(x) = x

3

−3x

2

+ 6

(c) f(x) = (x + 3)/(x −7)

(d) f(x) = x + 1/x

(e) f(x) = x

3

(1 +x)

(f) f(x) = x/(1 +x

2

)

(g) f(x) = cos 2x + 2 cos x

(h) f(x) = sin

2

x −

√

3 sin x

Hint: You discussed the intervals of monotonicity for these functions in

Exercise 16.

2.6. DETECTION OF LOCAL EXTREMA 87

We may use the second derivative to detect the change of sign of the ﬁrst

derivative, as it is called for in the assumptions in Theorem 2.37.

Theorem 2.39 (Second Derivative Test). Let f be a function and c an

interior point in its domain. Assume also that f

(c) and f

**(c) exist and that
**

f

(c) = 0. If f

(c) > 0, then f has a local minimum at c. If f

(c) < 0, then

f has a local maximum at c.

To apply the theorem to the detection of the local extrema of a diﬀer-

entiable function f(x), we diﬀerentiate f and ﬁnd the critical points, the

zeros of f

(x). Then we diﬀerentiate f

(x). The sign of f

at the critical

points tells us whether we found a local minimum or a local maximum. If

f

(c) = f

**(c) = 0, then the test is inconclusive. There may or may not be a
**

local extremum at c. Furthermore, the function f can have a local extremum

at c, and the assumptions of the test are not satisﬁed. In this sense, the test

provides us with a suﬃcient condition for the existence of a local extremum

at a point. It does not provide us with a necessary condition.

Example 2.40. Find the local extrema of the function (for a graph, see

Figure 2.2 on page 72)

p(x) = x

3

−3x

2

−9x + 3.

Solution: We calculated the ﬁrst derivative,

p

(x) = 3x

2

−6x −9 = 3(x + 1)(x −3).

The critical points of the function are x = −1 and x = 3. Furthermore,

p

(x) = 6x −6 = 6(x −1).

In particular, p

(−1) = −12 and p

**(3) = 12. The second derivative test
**

tells us that we have a local maximum at x = −1, because this is a critical

point and p

**(−1) < 0. We also have a local minimum at x = 3 because at
**

this critical point the second derivative of the function is positive. ♦

Proof of the Second Derivative Test. First, let us assume that f

(c) = 0 and

f

**(c) > 0. We will show that f has a local minimum at c. The assumption
**

that f

**(c) = 0 means that the tangent line to the graph of f at (c, f(c))
**

is horizontal. Its equation is l(x) = f(c). The assumption that f

(c) > 0

means that f is concave up at c (see Theorem 2.27 (1)). Spelled out explicitly

this means that

f(x) > l(x) = f(c)

88 CHAPTER 2. GLOBAL THEORY

for some positive number d and for all x ∈ (c − d, c) ∪ (c, c + d). In other

words, f has a local minimum at c.

The proof that f has a local maximum at c if f

(c) = 0 and f

(c) < 0 is

similar. We leave it to the reader.

Exercise 20. Find the critical points and the local extrema.

(a) f(x) = 4x

2

−7x + 13

(b) f(x) = x

3

−3x

2

+ 6

(c) f(x) = x + 3/x

(d) f(x) = x

2

(1 −x)

(e) f(x) = |x

2

−16|

(f) f(x) = x

2

/(1 +x

2

).

2.7 Detection of Inﬂection Points

We deﬁned an inﬂection point to be a point at which the concavity of a

function changes. If we know where the function is concave up and down,

then we can just answer this question. We want to detect inﬂection points

more eﬃciently. A theorem provides a necessary and a suﬃcient condition

for the existence of an inﬂection point. Let us start out with an example.

Example 2.41. Find the the inﬂection points of the function

g(x) = x

3

−4x

2

+ 3x −5.

-0.5 0.5 1 1.5 2 2.5

-0.5

0.5

1

1.5

2

2.5

Figure 2.14: The graph of g.

-0.5 0.5 1 1.5 2 2.5

-2

2

4

6

8

Figure 2.15: The graph of g

.

2.7. DETECTION OF INFLECTION POINTS 89

You see the graph of g in Figure 2.14 and the one of g

in Figure 2.15.

We calculate the ﬁrst and second derivative of g:

g

(x) = 3x

2

−8x + 3 and g

(x) = 6x −8.

From the formula for the second derivative we conclude that

g

(x) < 0 if x ∈ (−∞, 4/3) and that g

(x) > 0 if x ∈ (4/3, ∞).

This means that g is concave down on the interval (−∞, 4/3] and concave

up on [4/3, ∞). By deﬁnition, we have an inﬂection point at x = 4/3. You

see the inﬂection point indicated as a dot in Figure 2.14. You also see that

g

**(x) has a local extremum at the same point. ♦
**

Theorem 2.42. Let f be a function and c an interior point of its domain.

Suppose that the ﬁrst and second derivatives of f exist at c.

1. If f has an inﬂection point at c, then f

(c) = 0.

2. If f

(c) = 0, f

(c) exists and f

**(c) = 0, then f has an inﬂection
**

point at c.

Example 2.43. Find the inﬂection points of

f(t) = 2t

4

−6t

3

+ 5t

2

−7t + 4.

Solution: We calculate the second derivative of the function and ﬁnd

f

(t) = 24t

2

−36t + 10.

According to the theorem, we have to ﬁnd the zeros of f

(x) to determine

where an inﬂection point can be. The roots are

t =

3

4

±

1

12

√

21 =

9 ±

√

21

12

.

Now, let us check whether there are inﬂection points at either of these values

for t. We calculate the third derivative of f:

f

(t) = 48t −36.

We could plug t = (9 ±

√

21)/12 into the expression for f

, but this is a

bit cumbersome. We see right away that f

(t) = 0 exactly if t = 3/4, and

this means that f

(9 ±

√

21)/12) = 0. The theorem says that the inﬂection

points of f(t) are at t = (9 ±

√

21)/12. ♦

90 CHAPTER 2. GLOBAL THEORY

Apparently, our ability it ﬁnd inﬂection points of a function is limited by

our ability to ﬁnd the zeros of its second derivative. If we are given graphical

information, then this quite easy.

Example 2.44. Find the inﬂection points of the function

f(x) =

_

1.2 +x

2

−3(sin x)

3

.

-3 -2 -1 0 1 2 3

0.5

1

1.5

2

2.5

3

Figure 2.16: The graph of f.

-3 -2 -1 1 2 3

-2

2

4

6

8

Figure 2.17: The graph of f

.

Apparently, it will take an eﬀort to calculate the second derivative of

this function, and it will be nearly impossible to ﬁnd the zeros of f

. Any

reasonable software has no problem with this. We asked the computer to

graph f and f

**for x ∈ [−3, 3]. You see the graphs in Figures 2.16 and 2.17.
**

A look at the graph of f barely reveals some of the inﬂection points, but

the graph of f

**shows them clearly. Zooming in on parts of the graph f will
**

not improve this. At least in this example, the graph of f

tells us much

more about the concavity of the function f than its own graph. ♦

Exercise 21. Discuss the relation between the inﬂection points of a function

f and the local extrema of its derivative f

.

2.8 Absolute Extrema of Functions

We said that a function f has a local maximum at c if its value at c is largest

in comparison to the values at point near c. In many cases we like to ﬁnd the

maximal value of a function, and where it occurs, anywhere in the domain

of the function. This concept is captured in

2.8. ABSOLUTE EXTREMA OF FUNCTIONS 91

Deﬁnition 2.45. Let f be a function, and c a point in its domain. We say

that f has an absolute maximum at c if f(x) ≤ f(c) for all x in the domain

of f. Then we call f(c) the absolute maximum of f. If f(x) ≥ f(c) for all

x in the domain of f, then we say that f has an absolute minimum at c,

and we call f(c) the absolute minimum of f.

A diﬀerent expression is to say that the function assumes its absolute

extremum at c.

Theorem 2.46. A continuous function on a closed interval [a, b] assumes

its absolute maximum and minimum either at a critical point or at an end-

point of the interval.

Proof. In Theorem 1.17 we asserted that a continuous function assumes

its absolute maximum at some point in the interval. If the function does

not assume its absolute maximum at an endpoint, then it does so at some

interior point c, and the function has a local maximum at c. If f is not

diﬀerentiable at c, the c is critical. If f is diﬀerentiable at c, then f

(c) = 0

by Theorem 2.31, and c is critical as well. The argument for the absolute

minimum is left to the reader.

Example 2.47. Find the absolute extrema of the function

f(x) = x

3

−5x

2

+ 6x + 1

for x ∈ [0, 4].

1 2 3 4

2

4

6

8

Figure 2.18: x

3

−5x

2

+ 6x + 1.

1 2 3 4

-2.5

2.5

5

7.5

10

12.5

Figure 2.19: 3x

2

−10x + 6

Solution: According to Theorem 2.46, the absolute extrema of the func-

tion occur either at one of the end points x = 0, x = 4, or at a critical

92 CHAPTER 2. GLOBAL THEORY

point. In fact, f(0) = 1 and f(4) = 9. The critical points, i.e., the zeros of

f

(x) = 3x

2

− 10x + 6, are x = (5 ±

√

7)/3. Approximate values of these

roots are 2.5486 and .7848. You may also check that f

(x) = 6x −10, and

f

((5 +

√

7)/3) > 0 and f

((5 −

√

7)/3) < 0.

The second derivative test tells us that the function has a local minimum at

x = (5 +

√

7)/3 and a local maximum at x = (5 −

√

7)/3. The approximate

values of the function at these points are

f((5 +

√

7)/3) = 3.1126 and f((5 −

√

7)/3) = .3689.

Comparing the values of f(x) at these four points, we conclude that the

function assumes its absolute maximum of 9 at x = 4, and its absolute

minimum of approximately .3689 at x = (5 −

√

7)/3.

You may compare our calculation with the graphs of f in Figure 2.18

and the one of f

in Figure 2.19. ♦

Exercise 22. Find the absolute extrema of the functions on the indicated

intervals.

(a) f(x) = x

2

−5x + 2 for x ∈ [0, 5]

(b) f(x) = x

3

+ 3x

2

−5x + 2 for x ∈ [−3, 2.5]

(c) f(x) =

√

2 +x/

√

1 +x for x ∈ [0, 5]

(d) f(x) = cos 2x + 2 cos x for 0 ≤ x ≤ 2π

(e) f(x) = sinx + cos x for 0 ≤ x ≤ 2π

2.9 Optimization Story Problems

Many real-life problems are formulated as optimization problems. Calculus

helps us to solve these optimization problems. To avoid lenghty introduc-

tions to real-life problems, we content ourselves with problems of an algebraic

or geometric nature. We consider a few examples and give some problems

for practice.

Example 2.48. Cut a string of length 50 centimeters into two pieces. Use

one piece as the perimeter of an equilateral triangle and the other one as the

perimeter of a disk. How long should each piece be, so that the combined

2.9. OPTIMIZATION STORY PROBLEMS 93

area of the triangle and the circle is minimal? How long should each piece

be, so that the combined area of the triangle and the circle is maximal?

In our solution we will go through several steps.

Introduction of notation: There are many ways to set up the notation

to solve this problem. Among them we say that the side length of the triangle

is a and the radius of the circle is r.

Express information as equations: The perimeter of the triangle

will be 3a and the perimeter of the circle will be 2πr. This means that

3a + 2πr = 50 and a =

50 −2πr

3

.

The height of the triangle is h =

a

√

3

2

, and its area is

a

2

√

3

4

. The area of the

disk is πr

2

. The combined area of the triangle and disk is

A =

a

2

√

3

4

+πr

2

=

√

3

4

_

50 −2πr

3

_

2

+πr

2

.

For this to make sense, we need that 0 ≤ r ≤ 25/π.

Formulate the problem mathematically: Find the absolute mini-

mum (maximum) of the function

A(r) =

√

3

4

_

50 −2πr

3

_

2

+πr

2

.

for r ∈ [0, 25/π].

Solve the mathematical problem: The derivative of A(r) is

A

(r) = −

4π

3

·

√

3

4

_

50 −2πr

3

_

+ 2πr =

−π

√

3

_

50 −2πr

3

_

+ 2πr,

and A

(r) = 0 if and only if r = 50/(2π + 6

√

3). We note that A(r) is a

parabola which is open upwards. The critical point, which we just found, is

where the local minimum occurs. It is also the absolute minimum of A(r) on

any interval which contains the critical point. For the end points we have:

A(0) ≈ 120.28 amd A(25/π) ≈ 198.94.

Answer the original question: The combined area of the disk and

the triangle will be minimal if r = 50/(2π + 6

√

3), and it will be maximal

of r = 25/π. In the latter case, all string is used for the circle. ♦

Exercise 23. Repeat the previous example with

1. a disk and a square.

94 CHAPTER 2. GLOBAL THEORY

2. an equilateral triangle and a square.

3. a regular hexagon and a square.

4. a disk and half an equilateral triangle (the angles at 30, 60 and 90

degrees).

5. two geometric shapes of your own choice.

Example 2.49. Construct an open box from a rectangular piece of card

board of length L and width W. What are the dimensions of the box with

the largest possible volume?

In our solution we will go through several steps.

Clariﬁcation and introduction of notation: We construct the box

by making an incision at a 45 degree angle at each corner. Then we fold

up a strip of width x along each side

8

. For yourself, draw a picture of

this production process, and convince yourself that any box obtained by a

diﬀerent process will have smaller volume. To simplify matters, we call the

longer side of the rectangle L and the shorter one W.

Express information as equations: As we folded up a strip of width

x, the box will have width W −2x, length L −2x, height x, and volume

V (x) = (W −2x)(L −2x)x = WLx −2(L +W)x

2

+ 4x

3

.

By construction, x ≥ 0, x ≤ W/2, and x ≤ L/2, in fact x ≤ W/2.

Formulate the problem mathematically: Find the absolute maxi-

mum of the function

V (x) = WLx −2(L +W)x

2

+ 4x

3

for x ∈ [0, W/2].

Solve the mathematical problem: At the end points of the interval

V vanishes, i.e., V (0) = V (W/2) = 0. On the interior of the interval the

function is positive. The derivative of V is

V

(x) = WL −4(W +L)x + 12x

2

.

The zeros of V

are at

x =

1

6

_

(L +W) ±

_

L

2

+W

2

−LW

_

.

8

You could have cut out a square of size x ×x at each corner.

2.9. OPTIMIZATION STORY PROBLEMS 95

The function has an inﬂection point at (W +L)/6, to the right of which V

**is positive and V is concave up, and to the left of which V is concave down.
**

We conclude that V has a local maximum at

x =

1

6

_

(L +W) −

_

L

2

+W

2

−LW

_

.

As the function V (x) has only one local maximum in the interval, the local

maximum is the same as the absolute maximum.

Answer the original question: The box with the largest volume will

have a height of

x =

1

6

_

(L +W) −

_

L

2

+W

2

−LW

_

.

Its width will be W −2x and its length L −2x. ♦

Exercise 24. Repeat the previous example with speciﬁc numbers for the

width and length of the piece of card board.

Exercise 25. Start out with an equilateral piece of card board with side

length a. Make incisions at the corners, and fold up strips along the edges.

You will get an open box whose base is an equilateral triangle. How broad

should the folded up strips be, so that the volume of the box is maximal?

Exercise 26. Modify the problem from above, constructing a box with a

round base from a circular piece of card board.

Exercise 27. What is the largest possible volume for a right circular cone

of slant height a?

Example 2.50. Determine the rectangle of maximal area which can be

placed between the x-axis and the graph of the function f(x) = sin x.

Solution: Draw a graph of sin x so that you can follow the discussion.

Convince yourself that the vertices of the rectangle should be (x, 0), (π −

x, 0), (x, sin x) and (π − x, sin x) for some x ∈ [0, π/2]. The width of the

rectangle is π −2x and its height is sin x, so that its area is

A(x) = (π −2x) sin x.

We need to ﬁnd the absolute maximum for this function for x ∈ [0, π/2].

The ﬁrst derivative of this function is A

(x) = −2 sin x + (π − 2x) cos x.

After a simple algebraic simpliﬁcation, you ﬁnd that

A

**(x) = 0 if and only if tan x =
**

π −2x

2

.

96 CHAPTER 2. GLOBAL THEORY

Find an approximate solution of the equation using Newton’s method or your

calculator. A fairly good approximation of the zero of A

(x) is x

0

= .710462.

Convince yourself

9

that this is the only zero of A

(x) for x ∈ [0, π/2]. We

conclude that x

0

is the only critical point of A(x).

You may calculate A

(x). Substituting x

0

you will see that A

(x

0

) < 0.

It follows from the second derivative test that A(x) has a local maximum

at x

0

. Apparently A(x) = 0 at the end points x = 0 and x = π/2 of the

interval. This tells us that A(x) assumes its absolute maximum at x

0

.

With this, the ﬁnal answer to our problem is: The rectangle of maximal

area which can be placed between the x-axis and the graph of the sine

function will have a width of approximately π −2x

0

= 1.72066 and a height

of sinx

0

= .652183. Its area will be about 1.12218. ♦

To ﬁnd the absolute extrema of a continuous function on an interval of

the form [a, b] we could inspect the values of the function at the critical

points and at a and b. It allows us to decide whether a local extremum is

also an abolute one. Our next result allows us to do the same even if the

interval is not closed and bounded. The assumptions of this theorem are

satisﬁed in many applied problems.

Theorem 2.51. Suppose f is deﬁned on an interval I.

(a) If f is concave up on I and has a local minimum at x

0

, then f assumes

its absolute minimum at x

0

.

(b) If f is concave down on I and has a local maximum at x

0

, then f

assumes its absolute maximum at x

0

.

Example 2.52. Find the absolute minimum of the function

f(x) = x +

1

x

for x ∈ (0, ∞).

Solution: We calculate the ﬁrst and second derivative of f(x):

f

(x) = 1 −

1

x

2

and f

(x) =

2

x

3

.

We ﬁnd that f

(x) = 0 if x = 1, and that f

(x) > 0 for all x in (0, ∞). So f

has a local minimum at x = 1, and f is concave up on (0, ∞). Theorem 2.51

tells us that the absolute minimum of the function is f(1) = 2. ♦

9

One possible argument is that tan x is increasing on the interval [0, π/2), and that

π−2x

2

is decreasing. So these functions can intersect in only one point.

2.9. OPTIMIZATION STORY PROBLEMS 97

Exercise 28. Find the largest possible area for a rectangle with base on

the x-axis and upper vertices on the curve y = 4 −x

2

.

Exercise 29. A rectangular warehouse will have 5000 m

2

of ﬂoor space and

will be separated into two rectangular rooms by an interior wall. The cost of

the exterior walls is $ 1,000.00 per linear meter and the cost of the interior

wall is $ 600.00 per linear meter. Find the dimensions of the warehouse that

minimizes the construction cost.

Exercise 30. One side of a rectangular meadow is bounded by a cliﬀ, the

other three sides by straight fences. The total length of the fence is 600

meters. Determine the dimensions of the meadow so that its area is maximal.

Exercise 31. Draw a rectangle with one vertex at the origin (0, 0) in the

plane, one vertex on the positive x-axis, one vertex on the positive y-axis,

and one vertex on the line 3x + 5y = 15. What are the dimensions of a

rectangle of this kind with maximal area?

Exercise 32. Two hallways, one 8 feet wide and one 6 feet wide, meet at a

right angle. Determine the length of the longest ladder that can be carried

horizontally from one hallway into the other one.

Exercise 33. Inscribe a right circular cylinder into a right circular cone of

height 25 cm and radius 6 cm. Find the dimensions of the cylinder if its

volume is the be a maximum.

Exercise 34. A right circular cone is inscribed in a sphere of radius R.

Find the dimensions of the cone if its volume is to be maximal.

Exercise 35. Find the dimensions of a right circular cone of minimal vol-

ume, so that a ball of radius 10 centimeters can be inscribed.

Exercise 36. Consider a triangle in the plane with vertices (0, 0), (a, 0),

and (0, b). Suppose that a and b are positive, and that (2, 5) lies on the line

through the points (a, 0), and (0, b). What should the slope of the line be,

so that the area of the triangle is minimal?

Exercise 37. Minimize the cost of the material needed to make a round

drum with a volume of 200 liter (i.e., .2 m

3

) if

(a) the drum has a bottom and a top, and the same material is used for

the top, bottom and sides.

(b) the drum has no top (but a bottom) and the same material is used for

the bottom and sides.

98 CHAPTER 2. GLOBAL THEORY

(c) the drum has a bottom and a top, the same material is used for the

top and bottom, and the material for the top and bottom is twice as

expensive as the material for the sides.

(d) the situation is as in the previous case, but the top and the bottom

are cut out of squares, and the left over material is recycled for half

its value.

Exercise 38. Consider a box with a round base and no lid whose interior

is subdivided into six wedge shaped sectors. Which shape should it have,

so that its volume is maximal, assuming you are allowed a ﬁxed amount of

material? More speciﬁcally determine the ratio of radius and height which

will maximize the volume.

Exercise 39. Design a roman window with a perimeter of 4 m which admits

the largest amount of light. (A roman window has the shape of a rectangle

capped by a semicircle.)

Exercise 40. A rectangular banner has a red border and a white center.

The width of the border at top and bottom is 15 cm, and along the sides

10 cm. The total area is 1 m

2

. What should be the dimensions of the banner

if the area of the white area is to be maximized?

Exercise 41. A power line is needed to connect a power station on the

shore line to an island 2 km oﬀ shore. The point on the coast line closest to

the island is 6 km from the power station, and, for all practical purposes, you

may suppose that the shore line is straight. To lay the cable costs $40,000

per kilometer under ground and $70,000 under water. Find the minimal

cost for laying the cable.

Exercise 42. Consider the distance D(x) between a point P(x) = (x, f(x))

on the graph of a diﬀerentiable function f(x) and a point Q = (x

0

, y

0

)

not on this graph. Suppose D(x) has a local minimum at x

1

. Then the

tangent line to the graph of f at x

1

intersects the line joining P(x

1

) and Q

perpendicularly.

2.10 Sketching Graphs

The techiques which we developed so far provide us with some valuable tools

for graphing functions. Let us make a list of data which we may determine,

so that we can sketch a graph rather precisely. Going through the following

2.10. SKETCHING GRAPHS 99

program is also a good review of the material which we developed in this

chapter.

Useful information for graphing a function: We call the function

f(x).

(a) Plot some points on the graph, such as the y-intercept. If the function

is given on a closed interval, plot the values at its endpoints.

(b) Plot the zeros of the function. If you cannot ﬁnd the zeros by analytical

means, try it numerically (Newton’s method).

(c) If possible, decide on which intervals the function is positive, resp.,

negative.

(d) Find the ﬁrst derivative f

(x) of f(x).

(e) Repeat (b) and (c) with f

**(x) in place of f(x). Intervals on which
**

f

**(x) is positive give you intervals on which f(x) is increasing, and
**

intervals on which f

**(x) is negative give you intervals on which f(x)
**

is decreasing. The zeros of f

**(x) provide you with the critical points
**

of f(x). Plot the critical points (x and y value), and keep track of the

intervals on which the function is increasing, resp., decreasing.

(f) Find the second derivative f

(x) of f(x).

(g) Repeat (b) and (c) with f

**(x) in place of f(x). Intervals on which
**

f

**(x) is positive give you intervals on which f(x) is concave up, and
**

intervals on which f

**(x) is negative give you intervals on which f(x)
**

is concave down. Find the inﬂection points of the function, i.e., the

points where the concavity changes. Plot the inﬂection points (x and y

value), and keep track of the intervals on which the function is concave

up, resp., concave down.

(h) Decide at which critical points of f(x) the function has a saddle point

or local extremum, and whether it is a minimum or a maximum.

If you now draw a graph which exhibits all of the properties which you

gathered in the course of the suggested program, then your graph will look

very much like the graph of f(x). More importantly, the graph will have all

of the essential features of the graph of f(x). Let us go through the program

in an example.

100 CHAPTER 2. GLOBAL THEORY

Example 2.53. Discuss the graph of the function

f(x) = x

4

−2x

3

−3x

2

+ 8x −4 for x ∈ [−3, 3].

Solution: To make the discussion a little easier, we note that

f(x) = (x −1)

2

(x

2

−4) = (x −1)

2

(x −2)(x + 2). (2.1)

You should verify this by multiplying out the expression for f(x) in (2.1).

(a): Plot the y intercept of the function and its values at the end points

of the given interval: f(−3) = 80, f(0) = −4 and f(3) = 20.

(b): As a polynomial, the function f(x) is diﬀerentiable on the given

interval. The only exceptional points are its zeros. Having written f(x) as

in (2.1), we see right away that f(x) = 0 if and only if x = −2, x = 1, or

x = 2. Plot these x-intercepts.

(c): Counting the signs of the factors of f(x), we see that f(x) is positive

on the intervals [−3, −2) and (2, 3], and negative on (−2, 1) and (1, 2).

(d): We calculate the derivative of f(x):

f

(x) = 2(x −1)(x

2

−4) + (x −1)

2

2x = 2(x −1)(2x

2

−x −4).

We based the calculation on the description of f(x) in (2.1). In the ﬁrst

step we applied the product rule, and then we used elementary algebra.

(e): We use the quadratic formula to ﬁnd the zeros of the factor 2x

2

−x−4

in the expression for f

(x). They are (1 ±

√

33)/4. This allows us to factor

the expression for f

(x), and we ﬁnd:

f

(x) = 4(x −1)

_

x −

1

4

[1 +

√

33]

__

x −

1

4

[1 −

√

33]

_

.

We conclude that:

• f

**(x) is negative on the interval [−3, (1−
**

√

33)/4) and f(x) is decreas-

ing on [−3, (1 −

√

33)/4].

• f

**(x) is positive on the interval ((1−
**

√

33)/4, 1) and f(x) is increasing

on [(1 −

√

33)/4, 1].

• f

**(x) is negative on the interval (1, (1+
**

√

33)/4) and f(x) is decreasing

on [1, (1 −

√

33)/4].

• f

**(x) is positive on the interval ((1+
**

√

33)/4, 3] and f(x) is increasing

on [(1 +

√

33)/4, 3].

2.10. SKETCHING GRAPHS 101

• f(x) has a critical point and local minimum at (1 −

√

33)/4 ≈ −1.19,

a critical point and local maximum at x = 1, and a critical point and

local minimum at (1 +

√

33)/4 ≈ 1.69.

The values of the function at its three critical points are approximately:

f(

1 −

√

33

4

) ≈ −12.39 & f(1) = 0 & f(

1 +

√

33

4

) ≈ −.54.

Plot these points.

(f): We rewrite the ﬁrst derivative as f

(x) = 4x

3

− 3x

2

− 3x + 4, and

ﬁnd

f

(x) = 12x

2

−12x −6.

(g): We use the quadratic formula to ﬁnd the zeros on f

(x) and factor

it:

f

(x) = 12

_

x −

1

2

[1 +

√

3]

__

x −

1

2

[1 −

√

3]

_

.

We conclude that:

• f

**(x) is positive on the interval [−3, (1 −
**

√

3)/2) and f(x) is concave

up on [−3, (1 −

√

3)/2]

• f

**(x) is negative on the interval ((1 −
**

√

3)/2, (1 +

√

3)/2) and f(x) is

concave down on [(1 −

√

3)/2, (1 +

√

3)/2]

• f

**(x) is positive on the interval ((1+
**

√

3)/2, 3] and f(x) is concave up

on [(1 +

√

3)/2, 3]

• f(x) has inﬂection points at x = (1 −

√

3)/2 ≈ −.37 and at x =

(1 +

√

3)/2 ≈ 1.37.

The values of the function at its inﬂection points is approximately:

f(

1 −

√

3

2

) ≈ −7.21 & f(

1 −

√

3

2

) ≈ −.29.

Plot these points.

(h): At this point we could use the second derivative test to ﬁnd at which

critical points the function has local extrema, but we decided this already

based on ﬁrst derivative behaviour in (e).

102 CHAPTER 2. GLOBAL THEORY

Let us gather and organize our information. We consider the interval:

I

1

= [−3, −2]

I

2

=

_

−2,

1 −

√

33

4

_

I

3

=

_

1 −

√

33

4

,

1 −

√

3

2

_

I

4

=

_

1 −

√

3

2

, 1

_

I

5

=

_

1,

1 +

√

3

2

_

I

6

=

_

1 +

√

3

2

,

1 +

√

33

4

_

I

7

=

_

1 +

√

33

4

, 2

_

I

8

= [2, 3] .

We tabulate the which properties hold on which interval. It should be

understood, that at some end points of intervals the function is zero.

Property I

1

I

2

I

3

I

4

I

5

I

6

I

7

I

8

Sign pos neg neg neg neg neg neg pos

Monotonicity dec dec inc inc dec dec inc inc

Concavity up up up down down up up up

Table 2.1: Properties of the Graph

In Figure 2.20 you see the graph of the function. We have shown it on

a slightly smaller interval, as the values at the endpoint a comparetively

large. Showing all of the graph would show less clearly what happens near

the intercept, extrema, and inﬂection points. The dots indicate the points

which we suggests to plot.

In Figure 2.21 you see the graph of f on an even smaller interval, and

parts of the graphs of f

and f

**. You can use them to see that f is decreasing
**

where f

is negative, f is concave down where f

is negative, etc. ♦

Exercise 43. In analogy with the previous example, discuss the function

f(x) = (x −1)(x −2)(x + 2) = x

3

−x

2

−4x + 4

on the interval [−3, 2.5]. In addition, ﬁnd the absolute extrema of this

function.

2.10. SKETCHING GRAPHS 103

-3 -2 -1 1 2 3

-10

-5

5

10

Figure 2.20: The Graph

-2 -1 1 2

-10

-5

5

10

Figure 2.21: f, f

, f

**Exercise 44. In analogy with the previous example, discuss the function
**

f(x) = x

3

−3x + 2

on the interval [−2, 2]. In addition, ﬁnd the absolute extrema of this func-

tion.

Exercise 45. In analogy with the previous example, discuss the function

f(x) = 2 sin x + cos 3x

on the interval [0, 2π]. In addition, ﬁnd the absolute extrema of this function.

You may have to apply Newton’s method to ﬁnd zeros of f, f

, and f

.

104 CHAPTER 2. GLOBAL THEORY

Chapter 3

Integration

We will introduce the ideas of the deﬁnite and the indeﬁnite integral. Sup-

pose that f is a function which is deﬁned and bounded on the interval [a, b].

If it exists, then the deﬁnite integral of f over the interval [a, b] is a real

number. It is denoted by

_

b

a

f(x) dx.

The deﬁnition is set up, so that for a non-negative function it makes sense

to think of the integral as the area of the region bounded by the graph of

the function, the x-axis, and the lines x = a and x = b.

The indeﬁnite integral of a function f is the family (set) of all antideriva-

tives of f, i.e. all functions whose derivative is f. For important classes of

functions one may utilize deﬁnite integrals to construct antiderivatives. The

Fundamental Theorem of Calculus relates deﬁnite integrals and antideriva-

tives.

To be concrete, consider the function f(x) = x

2

e

−x

, shown in Figure 3.1,

and ﬁnd the area of the region Ω bounded by the graph of f(x), the lines

x = 1 and x = 5, and the x-axis.

3.1 Properties of Areas

So far, we only know the area of some simple regions, like rectangles. We

will denote the area of a region Ω by Area(Ω). Whatever concept of area we

have in mind, it should have the following properties:

• The area of a rectangle is the product of the lengths of its sides.

105

106 CHAPTER 3. INTEGRATION

1 2 3 4 5 6

0.1

0.2

0.3

0.4

0.5

0.6

Figure 3.1: f(x) = x

2

e

−x

• Suppose that Ω

1

and Ω

2

are regions in the plane, and that the area of

each of them is deﬁned.

If Ω

1

⊆ Ω

2

, then Area(Ω

1

) ≤ Area(Ω

2

).

• Suppose that Ω

1

and Ω

2

are regions in the plane, and that the area

of each of them is deﬁned. If the regions Ω

1

and Ω

2

do not intersect,

then the area of the union Ω

1

∪ Ω

2

of Ω

1

and Ω

2

is deﬁned, and

Area(Ω

1

∪ Ω

2

) = Area(Ω

1

) + Area(Ω

2

).

Suppose for a moment, that the region under the graph shown in Fig-

ure 3.1 has an area. In Figure 3.2 you see a rectangle R

l

with area .6, which

is contained in Ω. In Figure 3.3 you see a rectangle R

u

with area 2.24 which

contains Ω. The ﬁrst two principles tell us that

Area(R

l

) = .6 ≤ Area(Ω) ≤ Area(R

u

) = 2.24.

From above principles one may derive another one, which occurs fre-

quently in our upcoming constructions:

3.2. PARTITIONS AND SUMS 107

1 2 3 4 5 6

0.1

0.2

0.3

0.4

0.5

0.6

Figure 3.2: A rectangle R

l

con-

tained in Ω

1 2 3 4 5 6

0.1

0.2

0.3

0.4

0.5

0.6

Figure 3.3: A rectangle R

u

con-

taining Ω

• Suppose the region R in the plane is the union of a ﬁnite number of

rectangles R

1

, . . . , R

n

and any two of them intersect at most in an

edge. Then Area(R) is deﬁned, and it is equal to the sum of the areas

of the regions R

1

, . . . , R

n

:

Area(R) = Area(R

1

) +· · · + Area(R

n

).

3.2 Partitions and Sums

We like to reﬁne the approach to calculating areas of regions which we

started in the previous section. We do so by partitioning the interval before

applying the ideas from above, and then we add up what we get over the

individual intervals.

A partition of an interval [a, b] is of a collection is points {x

j

| 0 ≤ j ≤ n},

such that

a = x

0

≤ x

1

≤ · · · ≤ x

n−1

≤ x

n

= b.

The interval [a, b] is partitioned into n intervals [x

j−1

, x

j

] with 1 ≤ j ≤ n.

108 CHAPTER 3. INTEGRATION

3.2.1 Upper and Lower Sums

As before, f denotes a function which is deﬁned and bounded on [a, b]. On

each interval we pick numbers m

j

and M

j

, such that

m

j

≤ f(x) ≤ M

j

for all x ∈ [x

j−1

, x

j

].

We deﬁne the lower sum to be

S

l

= m

1

(x

1

−x

0

) +m

2

(x

2

−x

1

) +· · · +m

n

(x

n

−x

n−1

). (3.1)

and the upper sum to be

S

u

= M

1

(x

1

−x

0

) +M

2

(x

2

−x

1

) +· · · +M

n

(x

n

−x

n−1

). (3.2)

These sums depend on the choice of partition and the choices for the m

j

and M

j

.

1 2 3 4 5 6

0.1

0.2

0.3

0.4

0.5

0.6

Figure 3.4: A union of rectangles

contained in Ω

1 2 3 4 5 6

0.1

0.2

0.3

0.4

0.5

0.6

Figure 3.5: A union of rectangles

containing Ω

Let us return to the example of the function f(x) = x

2

e

−x

on the interval

[1, 4]. In the computation of the lower sum we use the partition

x

0

= 1 < x

1

= 2 < x

2

= 3 < x

3

= 4 < x

4

= 5

of the interval. We also pick m

1

= .35, m

2

= .43, m

3

= .28 and m

4

= .16.

This leads to a lower sum S

l

= 1.22. In the computation of the upper sum

we use the partition

x

0

= 1 < x

1

= 3 < x

2

= 4 < x

3

= 5

3.2. PARTITIONS AND SUMS 109

of the interval. We also pick M

1

= .55, M

2

= .45 and M

3

= .3. This leads

to an upper sum S

u

= 1.85. The m

j

and M

j

represent the heights of the

rectangles in Figures 3.4 and 3.5, and we trust these ﬁgures to show that

m

j

≤ f(x) and f(x) ≤ M

j

on the respective interval.

As before, let Ω denote the region under the graph. Then the union of

the rectangles shown in Figure 3.4 is contained in Ω, and the union of the

rectangles shown in Figure 3.5 contains Ω. Thus, if Ω has an area, the our

principles tell us that

S

l

= 1.22 ≤ Area(Ω) ≤ S

u

= 1.85.

In fact the only number greater or equal to all lower sums and smaller

or equal to all upper sums is

5

e

−

37

e

5

, and this will be the area of the region

Ω. Here e is the Euler number.

Example 3.1. Let us ﬁnd upper and lower sums for the function

f(x) = x

3

−7x

2

+ 14x −8

for x ∈ [.5, 4.5]. In contrast to the function in the previous example, this

function is not non-negative.

1 2 3 4

-3

-2

-1

1

2

3

4

Figure 3.6: Rectangles for calcu-

lating an upper sum.

1 2 3 4

-3

-2

-1

1

2

3

4

Figure 3.7: Rectangles for calcu-

lating a lower sum.

Solution: For the purpose of calculating an upper sum, we partitioned

the interval [.5, 4.5] using the intermediate points x

0

= .5, x

1

= 1.1, x

2

= 2.4,

x

3

= 3.8, and x

4

= 4.5. As numbers M

i

(so that M

i

≥ f(x) for x ∈ [x

i−1

, x

i

])

110 CHAPTER 3. INTEGRATION

we chose M

1

= .3, M

2

= .7, M

3

= −.9, and M

4

= 4.4. These data are shown

in Figure 3.6. With these choices, the upper sum is

S

u

= .3(1.1 −.5) +.7(2.4 −1.1) + (−.9)(3.8 −2.4) + 4.4(4.5 −3.8)

= 2.91.

In Figure 3.6 you see four rectangles. Their areas are combined to calculate

the upper sum. The areas of the ones above the x-axis are added, the ones

below the axis are subtracted, in accordance with the sign of the M

i

.

In the calculation of the lower sum we partitioned [.5, 4.5] using x

0

= .5,

x

1

= .8, x

2

= 2.3, x

3

= 4.2, and x

4

= 4.5. As numbers m

i

(so that

m

i

≤ f(x) for x ∈ [x

i−1

, x

i

]) we chose m

1

= −2.7, m

2

= −.8, m

3

= −2.2,

and m

4

= 1.3. These data are shown in Figure 3.7. With these choices we

calculate a lower sum of

S

l

= −2.7(.8 −.5) + (−.8)(2.3 −.8) + (−2.2)(4.2 −2.3) + 1.3(4.5 −4.2)

= −5.8.

In Figure 3.7 you see four rectangles. Their areas are combined to calculate

the lower sum. The areas of the ones above the x-axis are added, the ones

below the axis are subtracted, in accordance with the sign of the m

i

.

In summary, you see that we still combine areas of rectangles in the

calculation of the upper and lower sum, only that, depending on the sign

of the M

i

or m

i

, these rectangles are either above or below the x-axis, and

depending on this, their areas are either added or subtracted. ♦

Let us make a simple albeit important observation:

Theorem 3.2. Let f be a function which is deﬁned and bounded on a closed

interval [a, b]. Let S

l

be any lower sum of f and S

u

any upper sum. Then

S

l

≤ S

u

.

Let us repeat the statement of the theorem to emphasize its meaning.

Whichever partition of the interval [a, b] and whichever m

i

we use in the

calculation of the lower sum S

l

and whichever partition of the interval and

whichever M

i

we use in the calculation of the upper sum S

u

, the lower sum

is always smaller or equal to the upper sum. To see this, one reﬁnes the

partitions for the upper and lower sum computation so that they become

the same. Then one notes that m

i

≤ M

i

for all i.

3.2. PARTITIONS AND SUMS 111

3.2.2 Riemann Sums

Suppose once again that f(x) is a function which is deﬁned on the interval

[a, b]. Pick once more a partition

a = x

0

≤ x

1

≤ · · · ≤ x

n−1

≤ x

n

= b

of the interval. In each subinterval, pick a point x

j

∈ [x

j−1

, x

j

]. Then we

deﬁne the Riemann Sum

S

R

= f(x

1

)(x

1

−x

0

) +f(x

2

)(x

2

−x

1

) +· · · +f(x

n

)(x

n

−x

n−1

). (3.3)

We leave it to the reader to contemplate

Proposition 3.3. Let f be a function which is deﬁned and bounded on a

closed interval [a, b]. Let S

l

be any lower sum of f, S

u

any upper sum, and

S

R

any Riemann sum. Then

S

l

≤ S

R

≤ S

u

.

1 2 3 4 5 6

0.1

0.2

0.3

0.4

0.5

0.6

Figure 3.8: Representing a Riemann Sum

To be more concrete, let us return to the example f(x) = x

2

e

−x

on the

interval [1, 5]. Let us use the partition

x

0

= 1 < x

1

=

√

3 < x

2

= 5.

In the two interval of this subdivision we pick the points x

1

=

√

2 ∈ [1,

√

3]

and x

2

= π ∈ [

√

3, 5]. As Riemann sum we obtain

S

R

= f(x

1

)(x

1

−x

0

) +f(x

2

)(x

2

−x

1

) ≈ 1.749741.

In Figure 3.8 you see the picture illustrating the computation. There are

two rectangles, their bases are the intervals in the subdivision, and their

heights are f(x

1

) and f(x

2

). The sum of the areas of these rectangles is the

Riemann sum.

112 CHAPTER 3. INTEGRATION

3.3 Limits and Integrability

The idea is to reﬁne the partitions in our previous construction, so that in

the limit our sums can be justiﬁably called the area of the region under the

graph, if the function is non-negative. The speciﬁcs depend on which sums

we are working with.

3.3.1 The Darboux Integral and Areas

As we discussed earlier, whatever choices we make in the calculation of

lower and upper sums S

l

and S

u

, we always have that S

l

≤ S

u

. A crucial

additional fact is stated in the next result.

Theorem 3.4. Let f be a function which is deﬁned and bounded on a closed

interval [a, b]. There exists a real number Y , such that

S

l

≤ Y ≤ S

u

for all lower sums S

l

and upper sums S

u

of f.

Idea of Proof. To deduce the theorem from the completeness of the real

numbers, one observes that the set of all lower sums of f has a least upper

bound. Call it Y

l

. The set of all upper sums of f has a greatest lower

bound. Call it Y

u

. Apparently, Y

l

≤ Y

u

. Then Y is any number such that

Y

l

≤ Y ≤ Y

u

.

We are now prepared to deﬁne the concept of integrability of a function.

Deﬁnition 3.5. Let f be a function which is deﬁned and bounded on a

closed interval [a, b]. If there is exactly one number Y , such that

S

l

≤ Y ≤ S

u

for all lower sums S

l

and all upper sums S

u

of f, then we say that f is

integrable over the interval [a, b]. In this case, the number Y is called the

integral

1

of f for x between a and b. It is also denoted by

_

b

a

f(x) dx.

1

To distinguish it from the result of a diﬀerent, but typically equivalent, construction

we should Y the Darboux integral.

3.3. LIMITS AND INTEGRABILITY 113

Remark 8. For completeness sake and later use, let us explain what hap-

pens when a function is not integrable. In this case there are at least two

diﬀerent numbers, and with this an entire interval, between all upper and

lower sums. So, a function over a closed interval [a, b] is not integrable if

and only if the exists a positive number D such that S

u

− S

l

≥ D for any

lower sum S

u

and any upper sum S

u

.

On the other hand, a function is integrable if for every positive number

D there is an upper sum S

u

and a lower sum S

l

such that S

u

−S

l

< D.

Example 3.6. Explore upper sums, lower sums, and integrability for the

function f(x) = x

2

on the interval [0, 1].

Solution: Fix a natural number n and set

x

0

= 0 < x

1

=

1

n

< x

2

=

2

n

< · · · < x

n−1

=

n −1

n

< x

n

=

n

n

= 1.

This is an equidistant partition of the interval [0, 1], all subintervals have

the same length 1/n.

For the upper sums we pick

M

1

= f(x

1

) =

_

1

n

_

2

, M

2

= f(x

2

) =

_

2

n

_

2

, M

3

= f(x

3

) =

_

3

n

_

2

, . . .

and M

j

= f(x

j

) =

_

j

n

_

2

in general. Apparently, M

j

≥ f(x) for all x ∈

[x

j−1

, x

j

] because f(x) is increasing on [0, 1]. Without proof, we use that

1

2

+ 2

2

+ 3

2

+· · · +n

2

=

n(n + 1)(2n + 1)

6

.

We calculate the upper sum

S

u

= M

1

(x

1

−x

0

) +M

2

(x

2

−x

1

) +· · · +M

n

(x

n

−x

n−1

)

=

_

1

n

_

2

×

1

n

+

_

2

n

_

2

×

1

n

+· · · +

_

n

n

_

2

×

1

n

=

1

n

3

_

1

2

+ 2

2

+· · · n

2

¸

=

n(n + 1)(2n + 1)

6n

3

=

1

3

+

1

2n

+

1

6n

2

For the lower sums we pick

m

1

= f(x

0

) = 0, m

2

= f(x

1

) =

_

1

n

_

2

, m

3

= f(x

2

) =

_

2

n

_

2

, . . .

114 CHAPTER 3. INTEGRATION

-0.2 0.2 0.4 0.6 0.8 1 1.2

0.25

0.5

0.75

1

1.25

Figure 3.9: Rectangles for calcu-

lating a lower sum.

-0.2 0.2 0.4 0.6 0.8 1 1.2

0.25

0.5

0.75

1

1.25

Figure 3.10: Rectangles for calcu-

lating an upper sum.

and m

j

= f(x

j−1

) =

_

j−1

n

_

2

in general. The resulting lower sum is

S

l

=

1

3

−

1

2n

+

1

6n

2

For n = 5 you see the rectangles whose areas are the summands in the

lower and upper sums in Figures 3.9 and 3.10.

Using the expressions for S

u

and S

l

you see that S

u

− S

l

= 1/n. We

do not only see that S

l

≤

1

3

≤ S

u

, but also that Y = 1/3 is the only real

number, so that S

l

≤ Y ≤ S

u

for all natural numbers n. According to the

deﬁnition this means, that f(x) = x

2

is integrable over the interval [0, 1]

and that

_

1

0

x

2

dx =

1

3

. ♦

We motivated our introduction of upper and lower sums by our quest to

deﬁne the concept of area. Our answer is formulated as a

Deﬁnition 3.7. Let f be a function which is deﬁned, bounded, and non-

negative on a closed interval [a, b]. Let Ω be the region bounded by the graph

of f, the x-axis, and the lines x = a and x = b. If f is integrable over this

interval, then we say that the region Ω has an area and

Area(Ω) =

_

b

a

f(x) dx.

3.3. LIMITS AND INTEGRABILITY 115

The upper and lower sum were constructed such that if there is any

justiﬁcation to assigning an area to Ω then

S

l

≤ Area(Ω) ≤ S

u

.

For an integrable function there is exactly one real number between the

lower and upper sums, so this is the only number which we can call the area

of Ω.

For example, the area of the region Ω bounded by the graph of the

function f(x) = x

2

, the x-axis, and the lines x = 0 and x = 1 is

Area(Ω) =

_

1

0

x

2

dx =

1

3

. ♦

3.3.2 The Riemann Integral

Earlier we introduced the idea of a Riemann sum. Consider an interval [a, b]

and a function f(x) deﬁned on it. We picked a partition

P : a = x

0

≤ x

1

≤ x

2

≤ · · · x

n−1

≤ x

n

= b,

which broke [a, b] up into smaller interval [x

j−1

, x

j

]. In each of the subin-

tervals we picked a point x

j

∈ [x

j−1

, x

j

], and set

S

R

= f(x

1

)(x

1

−x

0

) +f(x

2

)(x

2

−x

1

) +· · · +f(x

n

)(x

n

−x

n−1

).

We want to consider a limit Riemann sums. This is trickier than for

functions, because there are a lot of choices which we make to deﬁne such a

sum. We deﬁne the norm of the partition P to be

|P| = max{x

j

−x

j−1

| 1 ≤ j ≤ n},

in other words, the norm of P is the length of the longest of the intervals

[x

j−1

, x

j

].

Deﬁnition 3.8 (Limit for Riemann Sums). Suppose the function f(x)

is deﬁned on [a, b]. We say that

L = lim

|P|→0

S

R

if for all > 0 there exists a δ > 0, such that |L−S

R

| < whenever |P| < δ.

If the limit of the S

R

exists, then we say that f is Riemann integrable over

[a, b], call L the Riemann integral of f, and write

L = lim

|P|→0

S

R

=

_

b

a

f(x) dx.

116 CHAPTER 3. INTEGRATION

Thus L = limS

R

if we can force S

R

to be close to L, as close as we like,

by making the partition ﬁne, by making each subinterval no longer that

some number.

It is worth pointing out and not very diﬃcult to show the following

proposition.

Proposition 3.9. Suppose the function f is deﬁned on the interval [a, b].

Then f is Riemann integrable if and only if it is Darboux integrable. If

deﬁned, the Riemann and the Darboux integral are the same.

3.4 Integrable Functions

We like to provide a supply of integrable functions. Our ﬁrst result is typi-

cally proved in an analysis course.

Theorem 3.10. Suppose f is deﬁned and continuous on [a, b]. Then f is

integrable over [a, b].

According to this theorem, polynomials are integrable over any interval

of the form [a, b]. Rational functions (i.e., functions of the form p(x)/q(x)

where p(x) and q(x) are polynomials) are integrable over intervals of the

form [a, b] as long as q does not vanish anywhere on the interval. The

trigonometric functions (sin, cos, tan, cot, sec, and csc) are integrable on

intervals where the functions are deﬁned. Arbitrary powers of a variable,

f(x) = x

α

, are integrable. One just needs to make sure that the function is

deﬁned on the interval [a, b]. For any real number α it suﬃces to assume that

a > 0. For any real α ≥ 0, it suﬃces to assume a ≥ 0. For rational numbers

α = p/q, where p and q are integers and q is odd, it suﬃces to assume

0 ∈ [a, b]. For non-negative integers α no assumption needs to be made on a

and b. Just making sure that the resulting functions are deﬁned everywhere

on [a, b], the functions just mentioned may be added, subtracted, multiplied,

divided, and composed, and one still ends up with integrable functions.

Let us introduce another class of functions for which we can prove that

they are integrable.

Deﬁnition 3.11. Suppose f(x) is a function. We say that f(x) is non-

decreasing if f(x

1

) ≤ f(x

2

) whenever x

1

and x

2

are in the domain of f(x)

and x

1

≤ x

2

. We say that f(x) is non-increasing if f(x

1

) ≥ f(x

2

) whenever

x

1

≤ x

2

.

Proposition 3.12. Let [a, b] be a closed interval and let f be deﬁned and

non-increasing or non-decreasing on [a, b]. Then f is integrable on [a, b]. In

particular, monotonic (increasing or decreasing) functions are integrable.

3.4. INTEGRABLE FUNCTIONS 117

Proof. We will use Darboux integrability. Let us assume that the function

f is non-decreasing on the interval. The non-increasing case is left as an

exercise. Take any partition of the interval:

a = x

0

< x

1

< · · · < x

n

= b.

The reader may justify why we can use the same partition in the computation

of the upper and lower sum. For i = 1, . . . , n we set

m

i

= f(x

i−1

) & M

i

= f(x

i

).

Then, because f is non-decreasing,

m

i

≤ f(x) ≤ M

i

for all x ∈ [x

i−1

, x

i

].

We use the m

i

and M

i

to compute upper and lower sums. Let ∆ be the

largest value of the x

i

−x

i−1

. Then

S

u

−S

l

= [M

1

(x

1

−x

0

) +· · · +M

n

(x

n

−x

n−1

)]

−[m

1

(x

1

−x

0

) +· · · +m

n

(x

n

−x

n−1

)]

= (M

1

−m

1

)(x

1

−x

0

) +· · · + (M

n

−m

n

)(x

n

−x

n−1

)

≤ [(M

1

−m

1

) + (M

2

−m

2

) +· · · + (M

n

−m

n

)] ∆

= (M

n

−m

1

)∆

= [f(b) −f(a)]∆

The inequality in the computation follows from the choice of ∆. The

second to last equality follows because M

i−1

= m

i

for all i = 2, . . . , n.

Many terms in the computation cancel. Given any positive number D, we

can make the partition ﬁne enough so that [f(b) −f(a)]∆ < D. According

to our Remark 8 this means that f is integrable over the interval, as we

claimed.

We illustrate the steps in the proof in a concrete example. In Figure 3.11

you see the upper and lower sum. The lower sum is the sum of the areas

of the darkly shaded rectangles. The upper sum is the sum of the areas of

the lightly and darkly shaded rectangles. The diﬀerence between the upper

and the lower sum is the sum of the lightly shaded rectangles shown in

Figure 3.12. We can combine these areas by sliding the rectangles sideways

so that they form one column. Its height will be f(b) −f(a). Its width may

vary, but in the widest place it is no wider than ∆, the width of the largest

interval in the partition of [a, b]. That means, the diﬀerence between the

upper and the lower sum is at most [f(b) −f(a)]∆. As above, we conclude

that the function is integrable.

118 CHAPTER 3. INTEGRATION

0.2 0.4 0.6 0.8 1

-0.2

0.2

0.4

0.6

0.8

1

1.2

Figure 3.11: Rectangles for calcu-

lating a lower and an upper sum.

0.2 0.4 0.6 0.8 1

-0.2

0.2

0.4

0.6

0.8

1

1.2

Figure 3.12: Rectangles for calcu-

lating the diﬀerence between an

upper and a lower sum.

Remark 9. There are functions which are not integrable over any interval

of the form [a, b] with a < b.

Remark 10. Here we only discuss integrability of function over closed ﬁnite

intervals, i.e., intervals of the form [a, b]. The discussion of integrability of

functions over intervals which are not of this form, e.g., half-open intervals

like [a, b) or unbounded closed intervals like [a, ∞), requires additional ideas

and techniques which we are not ready to discuss yet.

3.5 Some elementary observations

In spite of our success calculating some integrals using upper and lower

sums and the deﬁnition, this is certainly not the way to go in general. To

integrate “well behaved” functions we want a theory which allows us to

calculate integrals more easily. We have to develop a few basic tools. These

are fairly straight forward consequences of the deﬁnition of the integral.

Proposition 3.13. If the function f is deﬁned at a, then

_

a

a

f(x) dx = 0 (3.4)

Proof. The reader should contemplate the proposition.

3.5. SOME ELEMENTARY OBSERVATIONS 119

Proposition 3.14. Let [a, b] be a closed interval, c a point between a and

b, and f a function which is deﬁned on the interval. Then

_

c

a

f(x) dx +

_

b

c

f(x) dx =

_

b

a

f(x) dx. (3.5)

Implicitly in the formulation of the proposition is the statement that f

is integrable over [a, b] if and only if it is integrable over the intervals [a, c]

and [c, b]. If one of the sides of Equation (3.5) exists, then so does the other

one.

Idea of Proof. Use c as one of the points in the partition. The remaining

details are left to the reader.

As an immediate consequence of Propositions 3.12 and 3.14 we ﬁnd

Corollary 3.15. Let f be deﬁned on the interval [a, b]. Suppose that we

can partition the interval into a ﬁnite number of intervals such that f is

non-increasing or non-decreasing on each of them. Then f is integrable on

[a, b].

We can also extend Theorem 3.10.

Deﬁnition 3.16. Suppose that f is deﬁned on an interval [a, b]. We call f

piecewise continuous if there is a partition

a = x

0

< x

1

< · · · < x

n−1

< x

n

= b

such that f is continuous on the open intervals (x

j−1

, x

j

) for all 1 ≤ j ≤ n,

and the one-sided limits (see Section 1.3)

lim

x→x

+

j−1

f(x) and lim

x→x

−

j

f(x).

exist and are ﬁnite.

Corollary 3.17. If f is a piecewise contiuous function on [a, b], then f is

integrable on [a, b].

Idea of Proof. According to Proposition 3.14 we may break the problem up,

and consider it over each of the intervals [x

j−1

, x

j

] separately. On each of

these smaller intervals, we can change the deﬁntion of the function at a point

or two and make it continuous. This changes neither the integrability nor

the value of the integral. So the assertion follows from Theorem 3.10.

120 CHAPTER 3. INTEGRATION

Deﬁnition 3.18. Let f be deﬁned and integrable on the interval [a, b]. Then

_

b

a

f(x) dx = −

_

a

b

f(x) dx.

This deﬁnition is convenient and consistent with what we have said so

far about the integral. The approach to integrals via lower and upper sums

could also be generalized to include integrals

_

b

a

where b < a, leading to

exactly this formula.

Using the deﬁnition of the integral it is not diﬃcult to show:

Proposition 3.19. Let [a, b] be a closed interval and c a scalar. Suppose

that f and g are integrable over the interval. Then f +g and cf are integrable

over [a, b] and

_

b

a

(f(x) +g(x)) dx =

_

b

a

f(x) dx +

_

b

a

g(x) dx

and

_

b

a

cf(x) dx = c

_

b

a

f(x) dx.

We mention a few useful estimates for integrals.

Proposition 3.20. If f is integrable over [a, b], and f(x) ≥ 0 for all x ∈

[a, b], then

_

b

a

f(x) dx ≥ 0.

Proof. The proof is left to the reader.

Corollary 3.21. If h and g are integrable over [a, b], and g(x) ≥ h(x) for

all x ∈ [a, b], then

_

b

a

g(x) dx ≥

_

b

a

h(x) dx.

Proof. Use that f(x) = g(x) −h(x) ≥ 0 for all x ∈ [a, b].

Proposition 3.22. Let [a, b] be a closed interval and f integrable over [a, b].

Then the absolute value of f is integrable over [a, b], and

¸

¸

¸

¸

_

b

a

f(x) dx

¸

¸

¸

¸

≤

_

b

a

|f(x)| dx. (3.6)

The proof of this proposition is elementary, though a bit tricky.

3.6. AREAS AND INTEGRALS 121

3.6 Areas and Integrals

Let us return to the relation between areas and integrals. Suppose f(x) is

a non-negative integrable function over an interval [a, b]. If Ω is the area

bounded by the graph of f(x), the x-axis, and the lines x = a and x = b,

then

Area(Ω) =

_

b

a

f(x) dx.

The question is, what happens if f(x) is not non-negative?

Let f be a function which is deﬁned and bounded on a closed interval

[a, b] and Ω the set of points which lie between the graph of f(x) and the

x-axis for a ≤ x ≤ b. We decompose Ω into the union of two sets, Ω

+

and Ω

−

. Speciﬁcally, Ω

+

consist of those points (x, y) in the plane for which

a ≤ x ≤ b and 0 ≤ y ≤ f(x), and Ω

−

of those points for which a ≤ x ≤ b and

f(x) ≤ y ≤ 0. Then Ω is the union of the sets Ω

+

and Ω

−

. We decompose

the region between the x-axis and the graph into the part Ω

+

above the

x-axis and the part Ω

−

below it. Making use of this notation, we have:

Proposition 3.23. If f is integrable, then the areas of the regions Ω

+

and

Ω

−

are deﬁned

2

and

_

b

a

f(x) dx = Area(Ω

+

) −Area(Ω

−

). (3.7)

Idea of Proof. We deﬁne two functions:

f

+

(x) =

_

f(x) if f(x) ≥ 0

0 if f(x) ≤ 0

and f

−

(x) =

_

f(x) if f(x) ≤ 0

0 if f(x) ≥ 0

It is elementary, though a bit tricky, to show that the integrability of f(x)

implies the integrability of f

+

(x) and f

−

(x). Apparently, f = f

+

+ f

−

, so

that the additivity of the integral implies that

_

b

a

f(x) dx =

_

b

a

f

+

(x) dx +

_

b

a

f

−

(x) dx. (3.8)

According to Deﬁnition 3.7 we have

Area(Ω

+

) =

_

b

a

f

+

(x) dx. (3.9)

2

If you want to be formal, then you have to ﬂip the region Ω

−

to lie above the x-axis.

Only then have we addressed the question of it having an area.

122 CHAPTER 3. INTEGRATION

Let −Ω

−

be the area obtained by ﬂipping Ω

−

up, i.e., we take its mirror

image along the x-axis. This process does not change areas, so Area(Ω

−

) =

Area(−Ω

−

). The function −f

−

(x) is non-negative, and −Ω

−

is bounded by

the graph of −f

−

(x), the x-axis, and the lines x = a and x = b. According

to Deﬁnition 3.7 and our elementary properties of the integral we have

Area(Ω

−

) = Area(−Ω

−

) =

_

b

a

−f

−

(x) dx = −

_

b

a

f

−

(x) dx. (3.10)

Our claim follows now by substituting the results in (3.9) and (3.10) into

(3.8).

For example,

_

π/2

−π/2

sin x dx = 0 because the graph bounds congruent

regions above and below the x-axis.

3.7 Anti-derivatives

Consider a function f(x) with domain I. In Deﬁnition 2.6 we called a func-

tion F(x) with domain I an antiderivative of f(x) if F

(x) = f(x). Having

an anti-derivative of a function will (typically) make it easy to integrate it

over a closed interval.

Remember that any antiderivatives F

1

and F

2

of a function f on an

interval I diﬀer only by a constant (see Corollary 2.5). In other words,

there exists a constant c, such that

F

1

(x) = F

2

(x) +c for all x ∈ I.

Deﬁnition 3.24. Let f be a function which is deﬁned on an interval I, and

suppose that f has an antiderivative. The set of all antiderivatives of f is

called the indeﬁnite integral of f. It is denoted by

_

f(x) dx.

Given a function f and an antiderivative F of it, we typically write

_

f(x) dx = F(x) +c. (3.11)

In this expression c stands for an arbitrary constant. Diﬀerent values for c

result in diﬀerent functions. Allowing all real numbers as possible values for

c, we understand the the right hand side of (3.11) as a set of functions. The

constant c in the expression is referred to as integration constant.

3.7. ANTI-DERIVATIVES 123

Example 3.25. Given a function f(x) we might know or guess a function

F(x), such that F

**(x) = f(x). Then we can write down the indeﬁnite
**

integral of f in the form F(x) + c. You can check the correctness of your

guess by diﬀerentiation. You may want to consult Table 1.3 on page 63 to

come up with ideas for antiderivatives. Here are some examples.

_

1 dx = x +c

_

√

x dx =

2

3

x

3/2

+c

_

sin x dx = −cos x +c

_

cos x dx = sin x +c

_

dx

1 +x

2

= arctan x +c

_

x dx =

1

2

x

2

+c

_

x

n

dx =

1

n + 1

x

n+1

(n = −1)

_

sec

2

x dx = tan x +c

_

sec xtan x dx = sec x +c

_

dx

√

1 −x

2

= arcsin x +c

Using the linearity of the diﬀerentiation (see the diﬀerentiation rules

in (1.20)), it is easy to produce more examples. E.g.

_

5x

2

−2 cos x dx =

5

3

x

3

−2 sin x +c.

Occasionally, an additional idea is required before we can see the anti-

derivative. E.g., using the trigonometric identity cos

2

x = (1 + cos 2x)/2,

we ﬁnd that

_

cos

2

x dx =

1

2

_

(1 + cos(2x)) dx =

1

2

_

x +

1

2

sin(2x)

_

+c.

Using a diﬀerent trigonometric identity we ﬁnd

_

(1 + cot

2

x) dx =

_

csc

2

x dx = −cot x +c. ♦

We shall explore additional ideas for ﬁnding antiderivatives at a later.

The reader may practice ﬁnding some antiderivatives for the functions in

the next exercise. As you go through them you are expected to learn, or

pick up some new ideas as you go along.

124 CHAPTER 3. INTEGRATION

Exercise 46. Find the following indeﬁnite integrals:

(a)

_

3 dx

(b)

_

(x + 4) dx

(c)

_

(x

2

−5) dx

(d)

_

cos 2x dx

(e)

_

(3 +x)

3

dx

(f)

_

(3 + 2x)

5

dx

(g)

_

1

x

3

dx

(h)

_

csc

2

x dx

(i)

_

(1 + tan

2

x) dx

(j)

_

csc xcot x dx

(k)

_

sin

2

x dx

(l)

_

sec

2

(3x) dx

(m)

_

e

x/3

dx

(n)

_

2x

x

2

+ 1

dx

(o)

_

(4 −3x)

5

dx

(p)

_

cos(4 −3x) dx

(q)

_

2x

(x

2

+ 3)

2

dx

(r)

_

xsec

2

(x

2

+ 5) dx

3.8 The Fundamental Theorem of Calculus

Our ﬁrst result provides us with a large class of functions which have an-

tiderivatives.

Theorem 3.26. Continuous functions, deﬁned over intervals, have anti-

derivatives. More speciﬁcally, suppose that a function f is deﬁned and con-

tinuous over the interval I. Let a ∈ I. Then

f(x) =

d

dx

_

x

a

f(t) dt

for all x ∈ I.

The major tool for calculating integrals, and the grand conclusion of our

discussion of antiderivatives is the Fundamental Theorem of Calculus.

Theorem 3.27 (Fundamental Theorem of Calculus). Suppose that f

is a continuous function over a closed interval [a, b] and that F is an an-

tiderivative of f. Then

_

b

a

f(x) dx = F(b) −F(a).

For example, F(x) = −cos x is an antiderivative of f(x) = sinx, so that

the Fundamental Theorem of Calculus tells us that

_

π

0

sinx dx = −cos(π) −(−cos(0)) = −(−1) −(−1) = 2.

3.8. THE FUNDAMENTAL THEOREM OF CALCULUS 125

As another example, note that F(x) = tan x is an anti-derivative of f(x) =

sec

2

x, so that the Fundamental Theorem of Calculus tells us that

_

π/4

0

sec

2

x dx = tan(π/4) −tan(0) = 1.

Remark 11 (Notational Convention). One commonly uses the nota-

tion

F(x)

¸

¸

¸

b

a

= F(b) −F(a).

This is quite convenient. E.g., we write

sin x

¸

¸

¸

π

0

= sinπ −sin0.

If there are ambiguities due to the length of the expression to which this

construction is applied, we also use the notation shown in the following

example:

_

x

3

−5x

2

+ 2x −8

_

5

3

= p(5) −p(3)

where p(x) = x

3

−5x

2

+ 2x −8.

Using this notation, we calculate that

_

3

−2

(x

2

−2x + 5) dx =

_

x

3

3

−x

2

+ 5x

_

3

−2

=

95

3

.

Other examples are

_

π/4

0

sec xtan x dx = sec x

¸

¸

¸

π/4

0

=

√

2 −1

and

_

π/3

π/4

csc xcot x dx = −csc x

¸

¸

¸

π/3

π/4

=

_

−2

√

3

3

_

−(−

√

2) =

√

2 −

2

√

3

3

.

The reader is invited to practice a few examples.

126 CHAPTER 3. INTEGRATION

Exercise 47. Evaluate the following deﬁnite integrals:

(a)

_

1

0

(3x + 2) dx

(b)

_

2

1

6 −t

t

3

dt

(c)

_

5

2

2

√

x −1 dx

(d)

_

0

1

(t

3

−t

2

) dt

(e)

_

π/4

π/6

csc xcot x dx

(f)

_

−1

−1

7x

6

dx

(g)

_

π

0

1

2

cos x dx

(h)

_

π

0

cos(x/2) dx

(i)

_

2

−2

|x

2

−1| dx

(j)

_

π/2

0

cos

2

x dx

(k)

_

π/2

0

sin

2

(2x) dx

(l)

_

π/4

0

sec

2

x dx

3.8.1 Some Proofs

Because of their importance, we like to prove Theorem 3.26 and the Funda-

mental Theorem of Calculus.

Proof of the Fundamental Theorem of Calculus. Essentially, the desired re-

sult is an easy consequence of Theorem 3.26. Let F(x) be any anti-derivative

of f(x) on I, and H(x) =

_

x

a

f(t) dt the one provided by Theorem 3.26. In

particular, F

(x) = H

**(x) = f(x). Cauchy’s Theorem (see its application in
**

Corollary 2.5) tells us that F and H diﬀer by a constant. For some constant

c and all x ∈ I:

H(x) =

_

x

a

f(t) dt = F(x) +c (3.12)

We can ﬁnd out the value for c by substituting x = a in this equation. In

particular, we ﬁnd that

_

a

a

f(t) dt = 0 = F(a) +c or c = −F(a).

Using this calculation of c and substituting x = b in (3.12), we obtain

_

b

a

f(t) dt = F(b) −F(a),

as claimed.

3.8. THE FUNDAMENTAL THEOREM OF CALCULUS 127

Proof of Theorem 3.26. Because we assumed continuity of f on the interval

I, it follows from Theorem 3.10 that

F(x) =

_

x

a

f(t) dt

exists. So it is our task to show that F is diﬀerentiable at x, and that

F

**(x) = f(x). Using Theorem 1.26, after adjusting the notation to ﬁt the
**

current setting, the task becomes to show that

f(x) = lim

h→0

F(x +h) −F(x)

h

= lim

h→0

1

h

__

x+h

a

f(t) dt −

_

x

a

f(t) dt

_

= lim

h→0

1

h

_

x+h

x

f(t) dt.

Here we assume that x is not an endpoint of I, so that x and x+h are both

in I. We omit (leave to the reader) the modiﬁcations of the proof which are

required in the case where x is an endpoint of I.

According to the Extreme Value Theorem (see Theorem 1.17) there are

points c and d between x and x +h, such that

f(c) = m(h) ≤ f(x) ≤ f(d) = M(h) (3.13)

for all t between x and x + h. The points c and d may not be uniquely

determined by h, but m and M are. It follows from (3.13) and Corollary 3.21

that

m(h) · h =

_

x+h

x

m(h) dt ≤

_

x+h

x

f(t) dt ≤

_

x+h

x

M(h) dt = M(h) · h,

and with this that

m(h) ≤

1

h

_

x+h

x

f(t) dt ≤ M(h).

Continuity of f(x) implies that

lim

h→0

m(h) = f(x) = lim

h→0

M(h).

It follows from a pinching argument (see Proposition 1.4) that

lim

h→0

_

x+h

x

f(t) dt = f(x),

and this is exactly what we needed to show.

128 CHAPTER 3. INTEGRATION

3.9 Substitution

In some cases it is not that easy to ‘see’ an antiderivative of the function one

likes to integrate. Substitution is a method which, when applied correctly,

will simplify the expression for the function you like to integrate. You hope

that you can ﬁnd an antiderivative for the simpliﬁed expression. The method

is based on the chain rule for diﬀerentiation. Sometimes this method is

helpful, other times it is not. Your success with this method depends greatly

on experience, i.e., practice.

We explain the method. Let F and g be functions which are deﬁned and

diﬀerentiable on an interval I. Set F

**= f. Then, according to the chain
**

rule,

d

dx

F(g(x)) = f(g(x))g

(x).

Assume that f and g

are continuous on I. Then f(g(x))g

(x) is continuous

as well. We may take antiderivatives of both sides of our previous equation,

and conclude that

_

f(g(x))g

(x) dx = F(g(x)) +c. (3.14)

The variable for the functions f and F is often called u, and this means in

context that u = g(x).

Let us give a few examples to illustrate how this method can be put

to use. There are no general rules what substitution must be used, rather

success justiﬁes the means. Working through the examples will teach you

how to apply this method in some typical situations. It will give you at least

some experience which you may then rely on in similar examples.

For example,

_

(2x −3)

3

dx =

1

2

_

(2x −3)

3

· 2dx =

1

8

(2x −3)

4

+c.

Here we used g(x) = 2x −3, g

(x) = 2, f(u) = u

3

, and F(u) =

u

4

4

.

There is a pattern, a way to use the notation, which can be applied to

write down the steps in an integration using substitution eﬃciently. Setting

u = g(x) we write

du = g

(x)dx,

3.9. SUBSTITUTION 129

instead of g

(x) = du/dx

3

. Suppose also that F is an anti-derivative of f,

so F

**= f. Then the pattern for calculating an integral via substitution is
**

_

f(g(x))g

(x) dx =

_

f(u) du = F(u) +c = F(g(x)) +c. (3.15)

In the ﬁrst step of this calculation we carry out the substitution, in the

second one we ﬁnd the anti-derivative, and in the third one we reverse the

substitution. We make use of this notation in our next example.

For example, we calculate that

_

x

_

x

2

+ 2 dx =

1

2

_

_

x

2

+ 2 · 2xdx

=

1

2

_

√

u du

=

1

3

u

3/2

+c

=

1

3

(x

2

+ 2)

3/2

+c.

We used the substitution u = x

2

+ 2. Then

du

dx

= 2x, or du = 2xdx.

We calculate that

_

t

2

(t + 1)

7

dt =

_

(u −1)

2

u

7

du

=

_

(u

2

−2u + 1)u

7

du

=

_

(u

9

−2u

8

+u

7

) du

=

1

10

u

10

−

2

9

u

9

+

1

8

u

8

+c

=

1

10

(t + 1)

10

−

2

9

(t + 1)

9

+

1

8

(t + 1)

8

+c.

Here we used the substitution u = t + 1. Then du = dx and t = u −1.

We may have to use a substitution and a trigonometric identity to solve

3

We do not attach any particular meaning to the symbols dx and du in their own right.

The equation du = g

**(x)dx helps us to write down what happens when we perform the
**

substitution as in the ﬁrst equality in (3.15). Thought of as inﬁnitesimals or diﬀerentials,

these symbols have a meaning, but this is beyond the scope of these notes.

130 CHAPTER 3. INTEGRATION

an integration problem:

_

2xsin

2

(x

2

+ 5) dx =

_

sin

2

u du

=

1

2

_

[1 −cos 2u] du

=

1

2

_

u −

1

2

sin2u

_

+c

=

1

2

_

(x

2

+ 5) −

1

2

sin[2(x

2

+ 5)]

_

+c.

We used the substitution u = x

2

+ 5, so that du = 2xdx and the identity

sin

2

α = [1 −cos 2α]/2.

Find the substitution which we used in the following computation, and

check the details:

_

sec

2

xtan x dx =

_

sec x · sec xtan x dx

=

_

u du

=

1

2

u

2

+c

=

1

2

sec

2

x +c.

Sometimes we have to apply the method of substitution twice, or more

often, to work out an integral. Here is an example.

_

(x

2

+ 1) sin

3

(x

3

+ 3x −2) cos(x

3

+ 3x −2) dx =

1

3

_

sin

3

ucos u du

=

1

3

_

v

3

dv

=

1

12

v

4

+c

=

1

12

sin

4

u +c

=

sin

4

(x

3

+ 3x −2)

12

+c

In the computation we used the substitution u = x

3

+ 3x − 2. Then du =

3(x

2

+1) dx. In a second substitution we set v = sin u. Then dv = cos u du.

3.9. SUBSTITUTION 131

Here are two examples, which are important in the context of integrating

rational functions. In the ﬁrst example we assume that a = 0, and we use

the subsitution x = au. The dx = adu.

_

dx

x

2

+a

2

dx =

_

adu

a

2

u

2

+a

2

=

1

a

_

du

u

2

+ 1

=

1

a

arctan(u) +c

=

1

a

arctan

_

x

a

_

+c

Adding another idea, we calculate

_

dx

x

2

+ 2x + 5

=

_

dx

(x + 1)

2

+ 4

=

_

du

u

2

+ 4

=

1

2

arctan

_

x + 1

2

_

+c.

We used the substitution u = x+1, and then we proceeded as in the previous

example.

3.9.1 Substitution and Deﬁnite Integrals

Let us now explore how substitution is used to calculate deﬁnite integrals.

Assuming as before that f and g

**are continuous on the interval [a, b], we
**

have

_

b

a

f(g(x))g

(x) dx =

_

g(b)

g(a)

f(u) du. (3.16)

To see this, observe that f has an anti-derivative, which we again denote

by F. Then

_

b

a

f(g(x))g

(x) dx = F(g(x))

¸

¸

¸

b

a

= F(u)

¸

¸

¸

g(b)

g(a)

=

_

g(b)

g(a)

f(u) du.

The ﬁrst identity is obtained as a combination of the Fundamental Theorem

of Calculus and (3.14). The second one is obvious, and the third one is

another application of the Fundamental Theorem of Calculus.

Let us apply this formula in a few examples.

_

1

0

(x

2

−1)(x

3

−3x + 5)

3

dx =

1

3

_

3

5

u

3

du =

1

12

u

4

¸

¸

¸

3

5

= −

136

3

.

132 CHAPTER 3. INTEGRATION

We used the substitution u = x

3

− 3x + 5. Then du = (3x

2

− 3) dx, and

1

3

du = (x

2

−1) dx. To obtain the limits for the integral we calculate u(0) = 5

and u(1) = 3.

Another example is

_

π/4

0

cos

2

xsin x dx = −

_

√

2/2

1

u

2

du = −

1

3

u

3

¸

¸

¸

√

2/2

1

= −

1

3

_

1 −

√

2

4

_

.

We use the substitution u = cos x. Then −du = sinx dx. If x = 0, then

u = 1, and if x = π/4, then u =

√

2/2.

Incorporating one of our previous techniques, we calculate

_

2

0

x(x + 1)

6

dx =

_

3

1

(u −1)u

6

du =

_

3

1

u

7

−u

6

du =

3554

7

.

We use the substitution u = x + 1. Then du = dx and x = u − 1. If x = 0,

then u = 1, and if x = 2, the u = 3.

Similarly,

_

√

8

0

x

3

_

x

2

+ 1 dx =

1

2

_

9

1

(u −1)

√

u du =

1

2

_

9

1

_

u

3/2

−u

1/2

_

du =

596

15

.

We use the substitution u = x

2

+ 1. Then

1

2

du = x dx and x

2

= u −1. For

the limits we calculate, if x = 0, then u = 1, and if x =

√

8, then u = 9.

Finally,

_

1

0

_

1 −x

2

dx =

_

π/2

0

_

1 −sin

2

ucos u du =

_

π/2

0

cos

2

u du =

π

4

.

We use the substitution x = sin u. Then dx = cos u du. If x = 0, then

u = 0, and if x = 1, then u = π/2. For our given values of x, there are other

possible values for u, but they will lead to the same results.

Remark 12. The graph of f(x) =

√

1 −x

2

is the northern part of a circle.

Using x ∈ [0, 1] means that we calculated the area under this graph in the

ﬁrst quadrant, i.e., the area of one forth of the disk of radius 1. You were

told long time ago in school, that the area of this unit disk is π, so that the

result of the calculation is hardly surprising.

There is a more serious matter. Is the example genuine, or did we assume

the answer previously? By deﬁnition, π is the ratio of the circumference of

a circle by its diameter. In our calculation of the derivative of the sine and

cosine functions we used the estimate that | sin h − h| ≤ h

2

/2. When we

3.10. AREAS BETWEEN GRAPHS 133

showed this, we used that |h| ≤ | tan h| for h ∈ [−π/4, π/4]. A typical proof

of the latter inequality starts out by ﬁrst showing that the area of the unit

disk is π. This means, we assumed the result in the example, we did not

derive it.

Exercise 48. Find the following integrals:

(a)

_

dx

√

2x + 1

(b)

_

t

(4t

2

+ 9)

2

dt

(c)

_

t(1 +t

2

)

3

dt

(d)

_

2s

3

√

6 −5s

2

ds

(e)

_

b

3

x

3

√

1 −a

4

x

4

dx

(f)

_

π

0

xcos x

2

dx

(g)

_

x

2

√

x + 1 dx

(h)

_

x + 3

√

x + 1

dx

(i)

_

sin

2

(3x) dx

(j)

_

π/2

0

cos

2

x dx

(k)

_

π/4

π/6

sec(2x) tan(2x) dx

(l)

_

1/2

0

dx

4 +x

2

(m)

_

sec

2

x

√

1 + tan x

dx

(n)

_

√

1 + sinxcos x dx

(o)

_

r

0

_

r

2

−x

2

dx

3.10 Areas between Graphs

Previously we related the integral to areas of a region under a graph. This

idea can be generalized to the discussion of areas of regions between two

graphs. Let us look at an example.

Example 3.28. Calculate the area of the region between the graphs of the

functions f(x) = x

2

and g(x) =

√

1 −x

2

.

Solution To get a better understanding, we draw the two graphs, see

Figure 3.13. Now you see the region between the two graphs whose area we

want to calculate. We call the region Ω.

The graphs intersect in two points. To ﬁnd their x-coordinates, we solve

the equation

f(x) = x

2

= g(x) =

_

1 −x

2

.

After squaring the equation and solving it for x

2

, we ﬁnd x

2

=

−1±

√

5

2

.

Only the + sign occurs as x

2

≥ 0. Taking the square root, we ﬁnd the

x-coordinates of the points where the curves intersect:

A = −

¸

−1 +

√

5

2

and B =

¸

−1 +

√

5

2

.

134 CHAPTER 3. INTEGRATION

-1 -0.5 0.5 1

0.2

0.4

0.6

0.8

1

Figure 3.13: Region between two

graphs

0.5 1 1.5 2 2.5 3

-1

-0.5

0.5

1

Figure 3.14: Region between two

graphs

To get the area of the region under the graph of f(x) and g(x) over the

interval [A, B] we can calculate the appropriate integrals. To get the area of

the region Ω between the graphs, we take the area of the region under the

graph of g(x) and subtract the area of the region under the graph of f(x).

Concretely:

Area(Ω) =

_

B

A

g(x) dx −

_

B

A

f(x) dx =

_

B

A

(g(x) −f(x)) dx ≈ 1.06651.

The numerical value was obtained by computer. You are invited to work

out the integral with the help of the Fundamental Theorem of Calculus to

verify the result. ♦

Some problems are a bit more subtle.

Example 3.29. Find the area of the region between the graphs of the func-

tions f(x) = cos x and g(x) = sin x for x between 0 and π.

Solution: The region Ω between the graphs is shown in Figure 3.14.

The region breaks up into two pieces, the region Ω

1

over the interval [0, π/4]

on which f(x) ≥ g(x), and the region Ω

2

over the interval [π/4, π] where

g(x) ≥ f(x). We calculate the areas of the regions Ω

1

and Ω

2

separately.

3.10. AREAS BETWEEN GRAPHS 135

In each case, we proceed as in the previous example:

Area(Ω

1

) =

_

π/4

0

(cos x −sin x) dx = (sin x + cos x)

¸

¸

¸

π/4

0

=

√

2 −1

Area(Ω

2

) =

_

π

π/4

(sin x −cos x) dx = −(sin x + cos x)

¸

¸

¸

π

π/4

= 1 +

√

2.

In summary we ﬁnd:

Area(Ω) = Area(Ω

1

) + Area(Ω

2

) = 2

√

2.

An additional remark may be in place. When we compared integrals and

areas, we had to take into account where the function is non-negative, resp.,

non-positive. Here we did not. We took care of this aspect by breaking up

the interval into the part where f(x) ≥ g(x) and the part where g(x) ≥ f(x).

♦

Our general deﬁnition for the area between two graphs is as follows.

Deﬁnition 3.30. Suppose f(x) and g(x) are integrable functions over an

interval [a, b]. Let Ω be the region between the graphs of f(x) and g(x) for

x between a and b. The area of Ω is

Area(Ω) =

_

b

a

|f(x) −g(x)| dx.

This deﬁnition generalizes Deﬁnition 3.7 on page 114. The deﬁnition is

also consistent with the intuitive idea of the area of a region, and it incor-

porates and generalizes Proposition 3.23 on page 121. Taking the absolute

value of the diﬀerence of f(x) and g(x) allows us avoid the question where

f(x) ≥ g(x) and where g(x) ≥ f(x). Typically this problem gets addressed

when the integral is calculated. In some problems a and b are explicitly

given, in others you have to determine them from context. In all cases it is

good to graph the functions before calculating the area of the region between

them. Having the correct picture in mind helps you to avoid mistakes.

Exercise 49. Sketch and ﬁnd the area of the region bounded by the curves:

(a) y = x

2

and y = x

3

.

(b) y = 8 −x

2

and y = x

2

(c) y = x

2

and y = 3x + 5.

(d) y = sin x and y = πx −x

2

.

(e) y = sin x and y = 2 sin xcos x for x between 0 and π.

136 CHAPTER 3. INTEGRATION

3.11 Numerical Integration

The Fundamental Theorem of Calculus provided us with a highly eﬃcient

method for calculating deﬁnite integrals. Still, for some functions we have

no good expression for its anti-derivative. In such cases we may have to

rely on numerical methods for integrating. Let us take such a function, and

show some methods for ﬁnding an approximate value for the integral.

We describe diﬀerent ways to ﬁnd, by numerical means, approximate

values for the integral of a function f(x) over the interval [a, b]:

_

b

a

f(x) dx.

In all of the diﬀerent approaches we partition the interval into smaller ones:

a = x

0

< x

1

< · · · < x

n−1

< x

n

= b.

Left and Right Endpoint Method: In the left endpoint method we

ﬁnd the value of the function at each left endpoint of the intervals of the

partition. We multiply it with the length of the associated interval, and

then add up the terms. Explicitly, we calculate

I

L

= f(x

0

)(x

1

−x

0

) +f(x

1

)(x

2

−x

1

) +· · · +f(x

n−1

)(x

n

−x

n−1

). (3.17)

In the right endpoint method we proceed as we did on the left endpoint

method, only we use the value of the function at the right endpoint instead

of the left endpoint:

I

R

= f(x

1

)(x

1

−x

0

) +f(x

2

)(x

2

−x

1

) +· · · +f(x

n

)(x

n

−x

n−1

). (3.18)

Both expressions provide us with speciﬁc examples of Riemann sums.

Example 3.31. Use the left and right endpoint method to ﬁnd approximate

values for

_

2

0

e

−x

2

dx.

Solution: Set f(x) = e

−x

2

and choose the partition:

x

0

= 0 < x

1

=

1

2

< x

2

= 1 < x

3

=

3

2

< x

4

= 2.

3.11. NUMERICAL INTEGRATION 137

Then x

k

−x

k−1

= 1/2 for k = 1, 2, 3, and 4. Formula (3.17) for I

L

specializes

to

I

L

=

f(0) +f(1/2) +f(1) +f(3/2)

2

≈ 1.126039724.

Formula (3.18) for I

R

specializes to

I

R

=

f(1/2) +f(1) +f(3/2) +f(2)

2

≈ .6351975438.

0.5 1 1.5 2

0.2

0.4

0.6

0.8

1

Figure 3.15: Use left end points

0.5 1 1.5 2

0.2

0.4

0.6

0.8

1

Figure 3.16: Use right end points

Apparently, I

L

and I

R

are calculated by combining the areas of certain

rectangles. In our case the values of f(x) are all positive and all of the

rectangles are above the x axis, so the areas of the rectangles are all added.

Note also, that our speciﬁc function f(x) is decreasing on the interval [0, 2],

so that I

L

is an upper sum for the function f(x) over the interval [0, 2], and

I

R

is a lower sum. In this sense, we have

I

R

= .6351975438 ≤

_

2

0

e

−x

2

dx ≤ I

L

= 1.126039724.

The function and the rectangles whose areas are added to give us I

L

and I

R

are shown in Figure 3.15 and Figure 3.16. ♦

Midpoint and Trapezoid Method: We may try and improve on the

endpoint methods. In the midpoint methods, we use the value of the function

at the midpoints of the intervals of the partition. That should be less bias.

138 CHAPTER 3. INTEGRATION

We use the same partition and notation as above. Then the formula for the

midpoint method is:

I

M

= f

_

x

0

+x

1

2

_

(x

1

−x

0

) +· · · +f

_

x

n

+x

n−1

2

_

(x

n

−x

n−1

). (3.19)

In the trapezoid method we do not take the function at the average (i.e.

midpoint) of the end points of the intervals in the partition, but we average

the values of the function at the end points. Speciﬁcally, the formula is

I

T

=

f(x

0

) +f(x

1

)

2

(x

1

−x

0

) +· · · +

f(x

n−1

) +f(x

n

)

2

(x

n

−x

n−1

). (3.20)

It is quite easy to see that

I

T

=

I

L

+I

R

2

. (3.21)

Let us explain the reference to the word trapezoid. For simplicity, sup-

pose that f(x) is non-negative on the interval [a, b]. Consider the trapezoid

of width (x

1

− x

0

) which has height f(x

0

) at its left and f(x

1

) at its right

edge. The area of this trapezoid is

f(x

0

)+f(x

1

)

2

(x

1

− x

0

). This is the ﬁrst

summand in the formula for I

T

, see (3.20). We have such a trapezoid over

each of the intervals in the partition, and their areas are added to give I

T

.

Expressed diﬀerently, we can draw a secant line through the points

(x

0

, f(x

0

)) and (x

1

, f(x

1

)). This gives us the graph of a function T(x)

over the interval [x

0

, x

1

]. Over the interval [x

1

, x

2

] the graph of T(x) is the

secant line through the points (x

1

, f(x

1

)) and (x

2

, f(x

2

)). Proceeding in

the fashion, we use appropriate secant lines above all of the intervals in the

partition to deﬁne the function T(x) over the entire interval [a, b]. Then

I

T

=

_

b

a

T(x) dx.

This integral is easily computed by the formula in (3.20).

Example 3.32. Use the midpoint and trapezoid method to ﬁnd approxi-

mate values for

_

2

0

e

−x

2

dx.

Solution: We use the same partition of [0, 2] as in Example 3.31. The

formula for I

M

(see (3.19)) specializes to

I

M

=

f(.25) +f(.75) +f(1.25) +f(1.75)

2

≈ .8827889485.

3.11. NUMERICAL INTEGRATION 139

As for the endpoint methods, I

M

is the combined area of certain rectangles.

Their heights are the values f(x

i

) at the midpoints of the intervals of the

partition. Their width are the lengths of the intervals of the partition. You

see the rectangles for this calculation in Figure 3.17.

0.5 1 1.5 2

0.2

0.4

0.6

0.8

1

Figure 3.17: Use midpoints

0.5 1 1.5 2

0.2

0.4

0.6

0.8

1

Figure 3.18: Trapezoid Method

Based on our previous calculations and Formula (3.21) we ﬁnd

I

T

=

I

L

+I

R

2

≈ .8806186341.

We illustrated this calculation in Figure 3.18. There you see the function

f(x) = e

−x

2

and ﬁve dots on the graph. The dots are connected by straight

line segments. These line segments form the graph of a function T(x), and

I

T

is the area of the region under this graph. So

I

T

=

_

2

0

T(x) dx. ♦

Simpson’s Method: In Simpson’s method we combine the endpoint

and midpoint methods in a weighted fashion. Again, we use the same no-

tation for the function and the partition as above. The speciﬁc formula for

an approximate value of the integral of f(x) over [a, b] is

I

S

=

1

6

_

f(x

0

) + 4f

_

x

0

+x

1

2

_

+f(x

1

)

_

(x

1

−x

0

) +· · ·

+

1

6

_

f(x

n−1

) + 4f

_

x

n−1

+x

n

2

_

+f(x

n

)

_

(x

n

−x

n−1

)

(3.22)

140 CHAPTER 3. INTEGRATION

It is quite easy to see that

I

S

=

I

L

+ 4I

M

+I

R

6

=

I

T

+ 2I

M

3

.

Let us explain the background to Simpson’s method. We deﬁne a func-

tion P(x) over the interval [a, b] by deﬁning a degree 2 polynomial on each

of the intervals of the partition. The polynomial over the interval [x

k−1

, x

k

]

is chosen so that it agrees with f(x) at the end points and at the midpoint

of this interval. Simpson’s method is a reﬁnement of the Trapezoid method.

In one method we use two points on the graph and connect them by a

straight line segment. In the other one we use three points on the graph and

construct a parabola through them. With some work one can show that

I

S

=

_

b

a

P(x) dx.

0.5 1 1.5 2

0.2

0.4

0.6

0.8

1

Figure 3.19: Simpson’s Method

Example 3.33. Use Simpson’s method to ﬁnd an approximate value for

_

2

0

e

−x

2

dx.

3.11. NUMERICAL INTEGRATION 141

Solution: We use the same partition of [0, 2] as in Example 3.31. The

formula for I

S

(see the special case of (3.22)) specializes to

I

S

=

I

L

+ 4I

M

+I

R

6

≈ .88206555104,

where I

L

, I

M

and I

R

are as above.

You see the method illustrated in Figure 3.19. There you see the graphs

of two functions, the function f(x) = e

−x

2

and the function P(x) from the

discussion of Simpson’s method. Only the thickness of the line suggests that

there are two graphs of almost identical functions. ♦

Example 3.34. Compare the accuracy of the various approximate values

of

_

2

0

e

−x

2

dx.

Solution: We compare the approximate values for the integral obtained

by the diﬀerent formulas. We partition the interval [0, 2] into n intervals

of the same length, and vary n. We tabulate the results. They should be

compared with an approximate value for the integral of

0.882081390762421.

n = 1 n = 10 n = 100 n = 1000

I

L

2.0000000 0.9800072469 0.891895792451 0.883063050702697

I

R

0.0366313 0.7836703747 0.872262105229 0.881095681980474

I

M

0.7357589 0.8822020700 0.882082611663 0.882081402972833

I

T

1.0183156 0.8818388108 0.882078948840 0.882081366341586

I

S

0.8299445 0.8820809836 0.882081390722 0.882081390762417

Table 3.1: Approximate Values of the Integral

Simpson’s method is more accurate than the other ones. E.g., Simpson’s

method with n = 4 gives a result which is better than the left and right

endpoint method with n = 1000. Even if you use the midpoint and trapezoid

method with n = 1000, then the result is far less accurate that Simpson’s

method with n = 100. ♦

142 CHAPTER 3. INTEGRATION

Remark 13. It is important that we keep the number n of intervals into

which we partition [a, b] small. It does not only keep the number of overall

computations small. In each computational step we expect to make a round-

oﬀ error, and these may add up. The fewer computations we make, the

smaller the cummulative round-oﬀ error will be.

Exercise 50. Proceed as in Example 3.34 and compare the diﬀerent meth-

ods applied to the calculation of

_

π/2

0

sin x dx = 1.

3.12 Applications of the Integral

In Deﬁnition 3.7 and Proposition 3.23 we related deﬁnite integrals to areas.

Based on the context, this can have a more concrete meaning. Consider a

function f(t) on an interval [a, b] and the integral

I =

_

b

a

f(t) dt.

If f(t) stands for the rate at which a drug is absorbed, then I is the total

amount of the drug which has been absorbed in the time interval [a, b]. If

f(t) stands for the speed with which you travel, then I stands for the total

distance which you traveled during the time interval [a, b]. You are invited

to come up with more interpretations. In addition, the following deﬁnition

expresses the common notion of the average value of a function.

Deﬁnition 3.35. Suppose that f(t) is an integrable function over the in-

terval [a, b]. Then the quantity

f

av

:=

1

b −a

_

b

a

f(t) dt

is called the average value of f(t) over the interval [a, b].

For example, the average value of the sine function f(x) = sinx over the

interval [0, π] is 2/π.

Let us explore the diﬀerent aspects of integration in an example.

Example 3.36. The river Little Brook ﬂows into a reservoir, referred to

as Beaver Pond by the locals. The amount of water carried by the river

depends on the season. As a function of time, it is

g(t) = 2 + sin

_

πt

180

_

.

3.12. APPLICATIONS OF THE INTEGRAL 143

We measure time in days, and t = 0 corresponds to New Year. The units

of g(t) are millions of liter of water per day. Water is released from Beaver

Pond at a constant rate of 2 million liters per day. At the beginning of the

year, there are 200 million liters of water in the reservoir.

(a) How many liter of water are in Beaver Pond by the end of April?

(b) Suppose F(t) tells how much water there is in the reservoir on day t

of the year. Find F(t).

(c) At which rate does the amount of water in the reservoir change at the

beginning of September?

(d) On which days will there be 250 million liters of water in Beaver Pond?

(e) At which amount of water will the reservoir crest?

(f) On the average, by how much has the amount of water in Beaver Pond

increased per day during the ﬁrst three months of the year?

Solution: Water enters and leaves the pond. The net rate entering is

f(t) = g(t) −2 = sin

_

πt

180

_

millions of liters per day.

We obtain the total change of the amount of water in the reservoir by inte-

grating f(t). Set

A(T) =

_

T

0

f(t) dt.

On the T-th day of the year, the total amount of water in Beaver Pond is

F(T) = 200 +

_

T

0

f(t) dt = 200 +

180

π

_

1 −cos

_

πT

180

__

millions of liters.

This answers (b). By the end of April, after 120 days, there are

F(120) = 200 +

180

π

_

1 −cos

2π

3

_

≈ 238.2

millions of liters of water in the pond. This answers (a).

The rate at which the amount of water in the pond changes is F

(t) =

f(t). At the beginning of September, after 240 days, the rate of change is

144 CHAPTER 3. INTEGRATION

f(240) ≈ −.866. The pond is losing water at a rate of 866, 000 liters per

day.

To answer (d), we like to know for which T we have F(T) = 250. We

solve the equation for T:

250 = 200 +

180

π

_

1 −cos

_

πT

180

__

or cos

_

πT

180

_

= 1 −

5π

18

.

We apply the function arccos to both sides of the last equation and ﬁnd

T =

180

π

arccos

_

1 −

5π

18

_

≈ 88, or 272.

On the 88-th and 272-nd day of the year there will be 250 millions of liters

of water in the reservoir.

To ﬁnd at which amount the reservoir crests, we have to ﬁnd the max-

imum value of F(t). This occurs apparently when cos(πt/180) = −1 or

t = 180. The pond crests at mid-year, and then the amount of water in it

is about 314.6 millions of liters of water. This answers (e).

After three months or 90 days there are about 257.3 millions of liters

of water in Beaver Pond. Within this time, the amount of water has in-

creased by 57.3 millions of liters. On the average, the amount of water in

the reservoir increased by about 640,000 liters per day. ♦

Exercise 51. A pain reliever has been formulated such that it is absorbed

at a rate of 600 sin(πt) (mg/hr) by the body. Here t measures time in hours,

t = 0 at the time you take the medication, and the absorption process is

complete at time t = 1.

(a) What is the total amount of the drug which is absorbed?

(b) Find a function F(t), such that F(t) tells how much medication has

been absorbed at time t.

(c) A total of 150 mg of the medication has to be absorbed before the

drug is eﬀective. How long does it take until this threshold is reached?

3.13 The Exponential and Logarithm Functions

In Section 1.10 we introduced the exponential function exp(x) = e

x

and the

natural logarithm function ln x. At the time we only stated that they exist

because we did not have the tools to properly deﬁne them. We will now ﬁll

in the details. Many of the routine calculations are formulated as exercises.

3.13. THE EXPONENTIAL AND LOGARITHM FUNCTIONS 145

Deﬁnition 3.37. Let x ∈ (0, ∞). The natural logarithm of x is deﬁned as

ln x =

_

x

1

dt

t

. (3.23)

Theorem 3.38. The natural logarithm function is diﬀerentiable on its en-

tire domain (0, ∞), its derivative is

ln

x =

1

x

,

and ln x is increasing on (0, ∞).

Proof. The function 1/x is deﬁned and continuous on (0, ∞). According

to Theorem 3.10 this means that lnx is deﬁned for all x in (0, ∞). Theo-

rem 3.26 tells us that ln

**x = 1/x. According to Theorem 2.11, the function
**

is increasing because its derivative ln

x > 0 for all x > 0.

Let us also verify one of the central equations for calculating with loga-

rithms, the third rule in Theorem 1.34.

Proposition 3.39. For any x, y > 0,

ln(xy) = ln x + ln y. (3.24)

Proof. We need a short calculation. Here x and y are ﬁxed positive numbers.

We use the substitution u =

t

x

, so that du =

1

x

dt. For the adjustment of

the limits of integration, observe that t/x = u = 1 when t = x, and that

t/x = u = y when t = xy. Then

_

xy

x

dt

t

=

_

xy

x

1

(t/x)

1

x

dt =

_

y

1

du

u

= ln y.

Using this calculation we deduce that

ln(xy) =

_

xy

1

dt

t

=

_

x

1

dt

t

+

_

xy

x

dt

t

= ln x + ln y.

This is exactly our claim.

Exercise 52. Show:

(1) ln 1 = 0.

(2) ln(1/y) = −ln y for all y > 0.

146 CHAPTER 3. INTEGRATION

(3) ln(x/y) = ln x −ln y for all x, y > 0.

Exercise 53. Show that ln 4 > 1. Hint: Using the partition

1 = x

0

< 2 = x

1

< 3 = x

2

< 4 = x

3

,

ﬁnd a lower sum S

l

for the function 1/t over the interval [1, 4] so that S

l

> 1.

We can now deﬁne the Euler number:

Deﬁnition 3.40. The number Euler number e is the unique number such

that

ln e = 1 or, equivalently,

_

e

1

dt

t

= 1.

For this deﬁnition to make sense, we have to show that there is a number

e which has the property used in the deﬁnition. To see this, observe that

ln 1 = 0 < 1 < ln 4. Because ln x is diﬀerentiable, it follows from the

Intermediate Value Theorem (see Theorem 1.16) that there is a number e

for which ln e = 1. It also follows that 1 < e < 4.

Proposition 3.41. For every real number x there exists exactly one positive

number y, such that

ln y = x (3.25)

Proof. Observe that ln(e

n

) = n and ln(1/e

n

) = −n for all natural num-

bers n. So all integers (whole numbers) are values of the natural logarithm

function. Every real number x lies between two integers. According to the

Intermediate Value Theorem, every real number is a value of the function

ln y. We saw that ln y is an increasing function. This means that, for any

given x, the equation ln y = x has at most one solution. Taken together it

means that it has a unique solution.

Exercise 54. Show that

ln(a

r

) = r ln a

for all positive numbers a and all rational numbers r, i.e., numbers of the

form r = p/q where p and q are integers and q = 0.

In summary, we have seen that

Corollary 3.42. The natural logarithm function ln x is a diﬀerentiable, in-

creasing function with domain (0, ∞) and range (−∞, ∞), and ln

x = 1/x.

3.13. THE EXPONENTIAL AND LOGARITHM FUNCTIONS 147

We are now ready to deﬁne the exponential function.

Deﬁnition 3.43. Given any real number x, we deﬁne exp(x) to be the

unique number for which

ln(exp(x)) = x, (3.26)

i.e., y = exp(x) is the unique solution of the equation ln(y) = x. This

assignment (mapping x to exp(x)) deﬁnes a function, called the exponential

function, with domain (−∞, ∞) and range (0, ∞).

Exercise 55. Show that the exponential function exp and the natural log-

arithm function ln are inverses of each other. In addition to the equation in

(3.26), you need to show that

exp(ln(y)) = y (3.27)

for all y ∈ (0, ∞).

Summarizing this discussion, and adding some observations which we

have made elsewhere, we have:

Proposition 3.44. The exponential function exp(x) is a diﬀerentiable, in-

creasing function with domain (−∞, ∞) and range (0, ∞), and the exponen-

tial function is its own derivative, i.e., exp

(x) = exp(x).

Exercise 56. Show for all real numbers x and y that:

(1) exp(0) = 1

(2) exp(1) = e

(3) exp(x) exp(y) = exp(x +y)

(4) 1/ exp(y) = exp(−y)

(5) exp(x)/ exp(y) = exp(x −y).

Hint: Use the results of Exercise 52, the deﬁnition of e in Deﬁnition 3.40,

and that the exponential and logarithm functions are inverses of each other.

Exercise 57. Show that exp(r) = e

r

for all rational numbers r. Hint: Use

Exercise 54 and that the exponential and logarithm functions are inverses

of each other.

148 CHAPTER 3. INTEGRATION

The expression e

r

makes sense only if r is a rational number. If r = p/q

then we raise e to the r-th power and take the q-th root of the result. For

an arbitrary real number we set

e

x

= exp(x). (3.28)

This is consistent with the meaning of the expression for rational exponents

due to Exercise 57, and it deﬁnes what we mean by raising e to any real

power.

3.13.1 Other Bases

So far we discussed the natural logarithm function and the exponential func-

tion with base e. We now expand the discussion to other bases.

Deﬁnition 3.45. Let a be a positive number, a = 1. Set

log

a

x =

ln x

ln a

and exp

a

(x) = exp(xln a). (3.29)

We call log

a

(x) the logarithm function with base a and exp

a

(x) the expo-

nential function with base a. For the function log

a

we use the domain (0, ∞)

and range (−∞, ∞). For the exponential function exp

a

we use the domain

(−∞, ∞) and range (0, ∞).

Exercise 58. Show

(1) ln a > 0 if a > 1 and lna < 0 if 0 < a < 1.

(2) log

a

(x) and exp

a

(x) are diﬀerentiable functions.

(3) log

a

(x) and exp

a

(x) are increasing functions if a > 1.

(4) log

a

(x) and exp

a

(x) are decreasing functions if 0 < a < 1.

Exercise 59. Suppose a > 0 and a = 1. Show that

(a) exp

a

(log

a

(y)) = y for all y > 0.

(b) log

a

(exp

a

(x)) = x for all real numbers x.

Taken together, the speciﬁcations for the domains and ranges for the

functions exp

a

and log

a

and the results from Exercise 59 tell us that

Corollary 3.46. Suppose a > 0 and a = 1. The functions exp

a

and log

a

are inverses of each other.

3.13. THE EXPONENTIAL AND LOGARITHM FUNCTIONS 149

Exercise 60. Suppose a > 0 and a = 1. Show the laws of logarithms:

(a) log

a

1 = 0 and log

a

a = 1.

(b) log

a

(xy) = log

a

x + log

a

y for all x, y > 0.

(c) log

a

(1/y) = −log

a

y for all y > 0.

(d) log

a

(x/y) = log

a

x −log

a

y for all x, y > 0.

Exercise 61. Suppose a > 0 and a = 1. Show the exponential laws:

(1) exp

a

(0) = 1 and exp

a

(1) = a

(2) exp

a

(x) exp

a

(y) = exp

a

(x +y)

(3) 1/ exp

a

(y) = exp

a

(−y)

(4) exp

a

(x)/ exp

a

(y) = exp

a

(x −y).

Exercise 62. Suppose a > 0, a = 1, and r is a rational number. Show

log

a

(a

r

) = r and exp

a

(r) = a

r

.

We rephrase a convention which we made previously for e. Suppose

a > 0 and a = 1. The expression a

r

makes sense if r is a rational number.

If r = p/q then we raise a to the r-th power and take the q-th root of the

result. For an arbitrary real number we set

a

x

= exp

a

(x). (3.30)

This is consistent with the meaning of the expression for rational exponents

due to Exercise 62, and it deﬁnes what we mean by raising a to any real

power. Equation 3.30 specializes to the one in Equation 3.28 if we set a = e.

It is also a standard convention to set

1

x

= 1 and 0

x

= 0

for any real number x. Typically 0

0

is set 1.

We can now state an equation which is typically considered to be one of

the laws of logarithms:

Exercise 63. Suppose a > 0, a = 1, x > 0, and z is any real number. Then

log

a

(x

z

) = z log

a

(x).

150 CHAPTER 3. INTEGRATION

We are now ready to ﬁll in the details for one of the major statements

which we made in Section 1.10. We are ready to prove

Theorem 3.47. Let a be a positive number, a = 1. There exists exactly one

monotonic function, called the exponential function with base a and denoted

by exp

a

(x), which is deﬁned for all real numbers x such that exp

a

(x) = a

x

whenever x is a rational number.

Proof. In this section we constructed the function exp

a

(x), and this function

has all of the properties called for in the theorem. That settles the existence

statement. We have to show the uniqueness statement, i.e., there in only

one such function.

Suppose f(x) is any monotonic function and f(r) = a

r

= exp

a

(r) for

all rational numbers r. We have to show that f(x) = exp

a

(x) for all real

numbers x. We leave the veriﬁcation of this assertion to the reader. Here one

uses that f(x) and exp

a

(x) are monotonic, and that exp

a

(x) is continuous.

Chapter 4

Trigonometric Functions

In this section we discuss the radian measure of angles and introduce the

trigonometric functions. These are the functions sine, cosine, tangent, et. al.

We collect some formulas relating these functions.

Arc Length and Radian Measure of Angles: Consider the unit

circle (a circle with radius 1) centered at the origin in the Cartesian plane.

It is shown in Figure 4.1. We take a practical approach to measuring the

length of an arc on this circle. We imagine that we can straighten it out,

and measure how long it is. It requires some work to introduce the idea of

the length of a curve in a mathematically rigorous fashion.

Deﬁnition 4.1. The number π is the ratio between the circumference of a

circle and its diameter.

This deﬁnition goes back to the Greeks. Stated diﬀerently it says, that

the circumference of a circle of radius r is 2πr. Observe that the ratio

referred to in the deﬁnition does not depend on the radius of the circle.

Consider an angle α between the positive x-axis and a ray which origi-

nates at the origin of the coordinate system and intersects the unit circle in

the point p. We like to ﬁnd the radian measure of the angle α. Consider an

arc on the unit circle which starts out at the point (1, 0) and ends at p, and

suppose its length is s. Then

α = ±s (radians). (4.1)

The + sign is used if the arc goes counter clockwise around the circle. The

− sign is used if it proceeds clockwise. We may also consider arcs which

wrap around the circle several times before they end at p. In this sense, the

radian measure of the angle α is not unique, but any two radian measures

of the angle diﬀer by an integer multiple of 2π.

151

152 CHAPTER 4. TRIGONOMETRIC FUNCTIONS

-1 -0.5 0.5 1

-1

-0.5

0.5

1

(cos t, sin t)

Figure 4.1: The unit circle

Conversely, let t be any real number. We construct the angle with radian

measure t. Starting at the point (1, 0) we travel the distance |t| along the

unit circle (here |t| denotes the absolute value of t). By convention, we travel

counter clockwise if t is positive and clockwise if t is negative. In this way

we reach a point p on the circle. Let α be the angle between the positive

x-axis and the ray which starts at the origin and intersects the unit circle

in p. This angle has radian measure t.

Comparison of Angles in Degrees and Radians: We suppose that

you are familiar with measuring angles in degrees. The measure of half a

revolution (a straight angle) comprises π radians and 180 degrees. So, one

degree corresponds to π/180 ≈ 0.017453293 radians, and one radian corre-

sponds to 180/π ≈ 57.29577951 degrees. We have the conversion formula

x degrees =

π

180

x radians. (4.2)

Trigonometric Functions: Let t be once more a real number. Starting

at the point (1, 0) we travel the distance |t| along the unit circle, counter

clockwise if t is positive and clockwise if t is negative. In this way we reach

a point p = (x(t), y(t)) on the circle, and we set

x(t) = cos t and y(t) = sin t. (4.3)

153

This deﬁnes the functions sint and cos t. You see the construction im-

plemented in Figure 4.1. You can ﬁnd the graphs of the sine and cosine

functions on the interval [0, 2π] in Figures 4.2 and 4.3.

1 2 3 4 5 6

-1

-0.5

0.5

1

Figure 4.2: f(x) = sin x

1 2 3 4 5 6

-1

-0.5

0.5

1

Figure 4.3: f(x) = cos x

The other trigonometric functions, tangent (tan), cotangent (cot), secant

(sec), and cosecant (csc) are deﬁned as follows:

tan x =

sin x

cos x

cot x =

cos x

sin x

sec x =

1

cos x

csc x =

1

sin x

(4.4)

To make sure you have some idea about the behavior of the tangent

and cotangent function we provided two graphs for each of them. They

are drawn over diﬀerent parts of the domain to show diﬀerent aspects. See

Figure 4.4 to Figure 4.7. You can see the graphs of the secant and cosecant

functions in Figure 4.8 and 4.9.

A small table with angles given in degrees and radians, as well as the

associated values for the trigonometric functions is given in Table 4.1. If the

functions are not deﬁned at some point, then this is indicated by ‘n/a’. Older

calculus books may still contain tables with the values of the trigonometric

functions, and there are books which were published for the speciﬁc purpose

of providing these tables. This is really not necessary anymore because any

scientiﬁc calculator gives those values to you with rather good accuracy.

Trigonometric Functions deﬁned at a right triangle: Occasionally

it is more convenient to use a right triangle to deﬁne the trigonometric

functions. To do this we return to Figure 4.1. You see a right triangle with

vertices (0, 0), (x, 0) and (x, y). We may use a circle of any radius r. The

154 CHAPTER 4. TRIGONOMETRIC FUNCTIONS

-3 -2 -1 1 2 3

-40

-20

20

40

Figure 4.4: tan x on [−π, π]

-1 -0.5 0.5 1

-2

-1

1

2

Figure 4.5: tan x on [−1.1, 1.1]

-3 -2 -1 1 2 3

-40

-20

20

40

Figure 4.6: cot x on [−π, π]

1.5 2 2.5

-1.5

-1

-0.5

0.5

1

1.5

Figure 4.7: cot x on [

π

2

−1,

π

2

+1]

-3 -2 -1 1 2 3

-10

10

20

Figure 4.8: sec x on [−π, π]

-3 -2 -1 1 2 3

-20

-10

10

20

Figure 4.9: csc x on [−π, π]

155

degrees radians sin x cos x tan x cot x sec x csc x

0 0 0 1 0 n/a 1 n/a

30 π/6

1

2

√

3

2

√

3

3

√

3

2

√

3

3

2

45 π/4

√

2

2

√

2

2

1 1

√

2

√

2

60 π/3

√

3

2

1

2

√

3

√

3

3

2

2

√

3

3

90 π/2 1 0 n/a 0 1 n/a

120 2π/3

√

3

2

−

1

2

−

√

3 −

√

3

3

−2

2

√

3

3

135 3π/4

√

2

2

−

√

2

2

−1 −1 −

√

2

√

2

150 5π/6

1

2

−

√

3

2

−

√

3

3

−

√

3 −

2

√

3

3

2

180 π 0 −1 0 n/a −1 n/a

Table 4.1: Values of Trigonometric Functions

right angle is at the vertex (x, 0) and the hypotenuse has length r. Let α

be the angle at the vertex (0, 0). In the following the words adjacent and

opposing are in relation to α. Then

sin α =

opposing side

hypothenuse

cos α =

adjacent side

hypothenuse

tan α =

opposing side

adjacent side

cot α =

adjacent side

opposing side

sec α =

hypothenuse

adjacent side

csc α =

hypothenuse

opposing side

Trigonometric Identities: There are several important identities for

the trigonometric functions. Some of them you should know, others you

should be aware of, so that you can look them up whenever needed. From

the theorem of Pythagoras and the deﬁnitions you obtain

sin

2

x + cos

2

x = 1, sec

2

x = 1 + tan

2

x, csc

2

x = 1 + cot

2

x. (4.5)

The following identities are obtained from elementary geometric observa-

156 CHAPTER 4. TRIGONOMETRIC FUNCTIONS

tions using the unit circle.

sin x = sin(x + 2π) = sin(π −x) = −sin(−x)

cos x = cos(x + 2π) = −cos(π −x) = cos(−x)

cos x = sin(x +

π

2

) = −cos(x +π) = −sin(x +

3π

2

)

sin x = −cos(x +

π

2

) = −sin(x +π) = cos(x +

3π

2

)

You should have seen, or even derived, the following addition formulas in

precalculus.

sin(α +β) = sin αcos β + cos αsin β (4.6)

sin(α −β) = sin αcos β −cos αsin β (4.7)

cos(α +β) = cos αcos β −sinαsin β (4.8)

cos(α −β) = cos αcos β + sinαsin β (4.9)

tan(α +β) =

tan α + tan β

1 −tan αtan β

(4.10)

tan(α −β) =

tan α −tan β

1 + tan αtan β

(4.11)

These formulas specialize to the double angle formulas

sin2α = 2 sin αcos α and cos 2α = cos

2

α −sin

2

α (4.12)

From the addition formulas we can also obtain

sinαsin β =

1

2

[cos(α −β) −cos(α +β)] (4.13)

sinαcos β =

1

2

[sin(α −β) + sin(α +β)] (4.14)

cos αcos β =

1

2

[cos(α −β) + cos(α +β)] (4.15)

which specialize to the the half-angle formulas

sin

2

α =

1

2

[1 −cos 2α] and cos

2

α =

1

2

[1 + cos 2α] (4.16)

c Copyright 2003 by the author. All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, recording, or otherwise, without the prior written permission of the author. Printed in the United States of America. This publication was typeset using AMS-TEX, the American MathematA ical Society’s TEX macro system, and L TEX 2ε . The graphics were produced 1. with the help of Mathematica This is an incomplete draft which will undergo further changes.

1

Mathematica Version 2.2, Wolfram Research, Inc., Champaign, Illinois (1993).

Contents

Preface 1 Basic Concepts 1.1 Real Numbers and Functions . . . . . . . . 1.2 Limits . . . . . . . . . . . . . . . . . . . . . 1.2.1 Two important estimates . . . . . . 1.3 More Limits . . . . . . . . . . . . . . . . . . 1.4 Continuous Functions . . . . . . . . . . . . 1.5 Lines . . . . . . . . . . . . . . . . . . . . . . 1.6 Tangent Lines and the Derivative . . . . . . 1.6.1 Derivatives without Limits . . . . . 1.7 Secant Lines and the Derivative . . . . . . . 1.8 Diﬀerentiability implies Continuity . . . . . 1.9 Basic Examples of Derivatives . . . . . . . . 1.10 The Exponential and Logarithm Functions . 1.11 Diﬀerentiability on Closed Intervals . . . . . 1.12 Other Notations for the Derivative . . . . . 1.13 Rules of Diﬀerentiation . . . . . . . . . . . 1.13.1 Linearity of the Derivative . . . . . . 1.13.2 Product and Quotient Rules . . . . . 1.13.3 Chain Rule . . . . . . . . . . . . . . 1.13.4 Hyperbolic Functions . . . . . . . . 1.13.5 Derivatives of Inverse Functions . . . 1.13.6 Implicit Diﬀerentiation . . . . . . . . 1.14 Related Rates . . . . . . . . . . . . . . . . . 1.15 Exponential Growth and Decay . . . . . . . 1.16 More Exponential Growth and Decay . . . 1.17 The Second and Higher Derivatives . . . . . 1.18 Numerical Methods . . . . . . . . . . . . . . i v 1 1 2 4 6 8 9 10 12 13 14 15 18 20 21 22 22 23 26 29 30 34 38 41 43 48 48

. . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . .9. . . .2 Partitions and Sums . . . 2. . .3 Limits and Integrability . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.3 Euler’s Method . . . . . 2. . . . . . .1. . 2. . . . . . . . . ii . . . . . . . . . .7 Anti-derivatives . . . . 3. . 3.8. . . . . .1 Monotonicity on Intervals . . . . 3 Integration 3. . . . .18. . . . . . . . . . . . . . . . 3. . . .2 Newton’s Method . . . . . . . . . . . . . . . . . . .18. . . .7 Detection of Inﬂection Points . . . . . . .3. . . . . .2 Riemann Sums .4. . . .6 Areas and Integrals . . . . . . . . . . . . . . . . . . . . . . . 2. . . . . . .4 Integrable Functions . . .3. . . . . . . . . . . . 3.5 Some elementary observations . . . . . . . . . . . . . 3. .2 The Riemann Integral . . . . . . . . . .2 Concavity at a Point . . . . . . . . . . . . . . . . . . . . . 3. . . . . . . 3. . . . . . . . . . . . . . . . . .2 Monotonicity at a Point . . . . . . . . . . . 2. . . . . . . . . . . . . . . . 3. . . . . . 2. . . . . . . . 3. . .4 The Second Derivative and Concavity . .1 The Darboux Integral and Areas . . . . . . . . . . . . . . . . . . . . . . . . . .1 Substitution and Deﬁnite Integrals 3. . . . .8 Absolute Extrema of Functions . . . . . . . . . . . . . .1 Concavity on Intervals . . . . 2.12 Applications of the Integral . . . . . . . . . . . . . . . . . 2. . . .3 The First Derivative and Monotonicity . . . . . . . . . . .1 Approximation by Diﬀerentials 1. . . . . . . . . .6 Detection of Local Extrema . . . . .9 Optimization Story Problems . 2.2. . . . . . . . . . . . . . .3. . .11 Numerical Integration . . . . .10 Sketching Graphs . . . . . . . . . . . . . . . . . . . . . 2. . . . . . . . . . . . . . . . 3. . . .18. . . . . . . . .1 Cauchy’s Mean Value Theorem . . . . . . . . . . . .1 Some Proofs . . 3. 3. . . . . . . . . . . . . . . . . 3. . . . . . . . . . . . . . . . . . .10 Areas between Graphs . . . . . . . .2 Unique Solutions of Diﬀerential Equations 2. . . . . . . . . . . . . . . . . 48 51 53 63 65 65 67 69 69 75 76 77 80 81 83 88 90 92 98 105 105 107 108 111 112 112 115 116 118 121 122 124 126 128 131 133 136 142 2 Global Theory 2. . . . . . 3. . . .4. . . . . .9 Substitution . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .19 Table of Important Derivatives . . . . . . . .8 The Fundamental Theorem of Calculus . . . . . . . 2. . . .1 Upper and Lower Sums . . 3. . . . . . . . . . 1. .5 Local Extrema and Inﬂection Points . 2. . . . . . . . . .2. . . . . . . . . . . . . . . . . . . .1 Properties of Areas . . . . . . . . . . . . . . . . 3. . . . . . . . . . . .3. . . . . . . . .

. . 144 3. . . . . . . . .13 The Exponential and Logarithm Functions .13. . 148 4 Trigonometric Functions 151 iii . . . . . . . . . . . .3. . . . . .1 Other Bases . . . . . . .

iv

Preface

In these notes we like to summarize calculus.

v

vi

PREFACE

x > y) if x − y is positive. we will work with real valued functions in one real variable. such that f (x) = y.. negative. 1. A real number is either positive. The image of a function f consists of all those points y in the range for which there exists an x in the domain of f . If x and y are real numbers. We will make frequent use of the absolute value function.e. are subsets of the real numbers. and they take values in R.1 Real Numbers and Functions We assume that the reader is familiar with the real numbers (denoted by R) and the operations of addition and multiplication. Their domains. These are basic concepts of calculus.Chapter 1 Basic Concepts Introduction In this chapter we introduce limits and derivatives. We provide some rules for their computations. then x is larger than y (i. Until further notice. x if x > 0 |x| = −x if x < 0 0 if x = 0 The distance between two points a and b on the real line is |a − b|. the sets on which these functions are deﬁned. This allows us to order the real numbers. or zero. The range of a function is a set in which the function takes values. and {x ∈ R | |x − a| < } is the set of all real numbers whose distance from 1 .

points for which there are no other arbitrarily close points in the domain of the function. Suppose that f (x) has a limit at x = a.1) |x · y| = |x| · |y|. which means that every bounded subset of the real line has a least upper bound. based on the values of f (x) for x near a.2 Limits Limits are a central tool in calculus and other areas of mathematics. such that |f (x) − L| < in the domain of f and 0 < |x − a| < δ. but arguments using it are too diﬃcult for an introductory course on the subject. This property is crucial for calculus. provided that the domain of the function f contains points arbitrarily close to a. whenever x is The equation in (1. We say that (1. for all δ > 0 there is a point b in the domain of f . for any two real numbers x and x (1. In all but a few degenerate cases. and the last one is a variation of it.1 2 The latter assumption in the proposition is satisﬁed if the domain of f contains an interval.2 CHAPTER 1. Deﬁnition 1.2) L = lim f (x) x→a if for all > 0 there exists a δ > 0. For computations with absolute values it is worth noting that. a + ). and either a belongs to this interval or a is an end point Expressed in mathematical language this means. such that 0 < |b − a| < δ. Let f be a function and L a real number. 1. BASIC CONCEPTS a is less than . |x + y| ≤ |x| + |y|.1. 1 . Proposition 1. limits are unique if they exist. then this limit is unique.2. and ||x| − |y|| ≤ |x − y|. An intuitive interpretation is that the expected value of f (x) at x = a is L.2) reads as L is the limit of f (x) as x approaches a. Every now and then we will allude to the completeness of the real line. We also say that f (x) approaches or converges to L as x approaches a. We discuss them in this section. 2 Some authors do not apply the concept of a limit at isolated points of the domain of a function. The ﬁrst inequality is referred to as triangle inequality. Expressed as an interval this set is (a − .

. x→a then the limit of h(x) exists as x approaches a. e) where d < a < e.5. LIMITS 3 of it. discussed in the following subsection.3. and h(x) all contain an interval of the form (d.4) cos x − 1 = 0. Proposition 1. If x→a lim f (x) = L = lim g(x). The following limits are important in the calculations of some derivatives. or a trigonometric function and f (a) is deﬁned. x→0 x lim and lim xn − an = nan−1 . For many functions the computation of limits is no challenge. x→0 x lim sin x = 1. Taking limits is compatible with the basic algebraic operations in the following sense.4 (Pinching Theorem). we make this kind of an assumption for the remainder of this section.2. Suppose that x→a lim f (x) = L and x→a lim g(x) = M. Then x→a lim (f + g)(x) = M + L x→a x→a lim cf (x) = cM lim (f · g)(x) = M · L x→a lim (f /g)(x) = M/L provided that L = 0. g(x). (1.3) x→c lim f (x) = L if and only if x→c lim (f (x) − L) = 0. a rational function. Proposition 1. Assume that the domains of the functions f (x) and g(x) both contain an interval of the form (d. a) or (a.1. a) or (a.7. and it is equal to L. Proposition 1. then x→a lim f (x) = f (a). x→a x − a Hints: The ﬁrst two limits follow easily from the estimates in Theorem 1. If f (x) is a polynomial. To avoid intricate language. and that c is a constant. e) where d < a < e and that f (x) ≤ h(x) ≤ g(x). Assume that the domains of the functions f (x). The last assertion can be proved using synthetic division. at least if n is an integer. As a special case we obtain the following useful observation: (1.

From this is follows by elementary geometry that the area of the slice of the disk with vertices O. D E C O A B Figure 1. to show that |h| = BC ≤ | tan h| = BD. It appears to be clear3 that BE ≤ BD. We denote by BC the length of the arc (part of the unit circle) between B and C. Given two points X and Y in the plane. π/4] we set C = (cos h. Secondly. Proof.5) | sin h| ≤ |h| ≤ | tan h|. One can show that the area of a disk with radius one is π. and the area of the slice is (tan |h|)/2. B and D. the distance between them is denoted by XY .4 CHAPTER 1. We use the process of rolling the circle along the line to measure |h|. 3 .1 Two important estimates In preparation of the proof of Theorem 1. A rigorous argument requires work.7 we show Theorem 1. For h ∈ [−π/4.1 you see part of the unit circle. This slice is contained in the triangle with vertices O. sin h). |h| = BE. imagine that you roll the circle along the vertical line through B until the point C touches it in the point E. If h ∈ [−π/4.1: The unit circle We ﬁnd that | sin h| = AC ≤ |h| = BC because going from C straight down to the x-axis is shorter than following the circle from C to the x-axis.6. then (1. This Here our argument relies on intuition. BASIC CONCEPTS 1. In Figure 1. B and C has area |h|/2. In particular. It follows that |h| ≤ tan |h|. π/4].2.

LIMITS 5 veriﬁes that |h| ≤ tan |h|. B. BC = |h|.7. In other words 2(1 − cos h) = AB × DB = (BC)2 ≤ (BC)2 = h2 .2: The unit circle From the picture we read oﬀ that AB = 2. π/4]. (1. sin h). DB = (1 − cos h).6) |1 − cos h| ≤ C A D B Figure 1. but we only need them on an interval around zero. Using similar triangles we see AB/BC = BC/DB and (BC)2 = AB × DB.1. Denote by XY the length of the straight line segment between the points X and Y . π/4] be the number for which we want to show the inequality and C = (cos h. 4 The inequalities hold without the restriction on h. and BC ≤ BC. 2 2 Proof of Theorem 1. Let h ∈ [−π/4. Let BC be the length of the arc (part of the unit circle) between B and C.7. In Figure 1. the second inequality which we claimed in the theorem. and C. then4 h2 h2 and |h − sin h| ≤ . . If h ∈ [−π/4. Restricting ourselves to this interval simpliﬁes the proofs somewhat. Theorem 1.2 you see half of a circle of radius 1 centered at the origin. and a triangle with vertices A.2.

Let f be a function and L a real number. They are called the right and left hand limits. then Theorem 1. 1. π/4]. The ﬁrst two limits express how the function behaves as we approach a point a from the right or left. In some situations one would like to modify the deﬁnition in Section 1.8. Deﬁnition 1. We say that L = lim f (x) x→a− if for all > 0 there exists a δ > 0. then both sides of the second inequality in (1.6) are zero.6 tells us that | sin h| sin h | sin h| ≤ |h| ≤ | tan h| = hence 0 ≤ cos h ≤ ≤ 1. Let f be a function and L a real number.2. such that |f (x) − L| < in the domain of f and a − δ < x < a. whenever x is Deﬁnition 1. ﬁrst section about limits suﬃces for a while. BASIC CONCEPTS The ﬁrst estimate in (1. The last two limits allow us to express that the values of a function tend to plus or minus inﬁnity.6 CHAPTER 1. such that |f (x) − L| < in the domain of f and a < x < a + δ. The next two limits express what happens as the variable tends to plus or minus inﬁnity.9.6) is an immediate consequence. we conclude that h − sin h h2 h ≤ |1 − cos h| ≤ ≤ . We call them limits at inﬁnity. cos h h Subtracting the terms in this inequality from 1 we ﬁnd 0 ≤1− sin h ≤ 1 − cos h ≤ 1. If 0 = h ∈ [−π/4. If h = 0. We call them inﬁnite limits. verifying the assertion in this case. h Using our previous estimate for |1 − cos h| and our assumption that |h| ≤ π/4 < 1. whenever x is . h 2 2 The second estimate claimed in the theorem is an immediate consequence.3 More Limits The material in the previous. and we do so in this section. We say that L = lim f (x) x→a+ if for all > 0 there exists a δ > 0.

then x→a+ 7 lim f (x) = 1 and x→a− lim f (x) = −1. Let f be a function and a a real number. and a can be replaced by ±∞. For example √ 1 lim = ∞ and lim x = ∞. In other words. We can consider what happens to the values of a function f (x) as x approaches ∞ or −∞. Let f be a function and a a real number. We say that x→a lim f (x) = −∞ if for all M there exists a δ > 0 such that f (x) < M whenever x is in the domain of f and 0 < |a − x| < δ. In the last two deﬁnitions a may be replaced by a± . we can make sure that the value of f (x) is larger than any given number M . For example x→∞ whenever lim 1 = 0 and x x→−∞ lim 1 = 1. We say that L = lim f (x) x→−∞ if for all > 0 there exists a number M . Let f be a function and L a real number. Deﬁnition 1.11.10. such that |f (x) − L| < x is in the domain of f and x < M .13.1. We say that x→a lim f (x) = ∞ if for all M there exists a δ > 0 such that f (x) > M whenever x is in the domain of f and 0 < |a − x| < δ. + x x→∞ x→0 . MORE LIMITS For example.3. such that |f (x) − L| < x is in the domain of f and x > M . no matter how large. whenever Deﬁnition 1. Let f be a function and L a real number. 1 + x2 Deﬁnition 1. if f (x) = sign(x) = x/|x|. by taking x close to a. We say that L = lim f (x) x→∞ if for all > 0 there exists a number M . Deﬁnition 1. so that we approach a from the left or right.12.

we note that the function (f + g)(x) = f (x) + g(x) is deﬁned for those x for which both f and g are deﬁned.e. Suppose that f is deﬁned and continuous on the closed interval [a. Proposition 1. See the footnote to Proposition 1.7) x→c lim f (x) = f (c). b]. Let f and g be continuous functions. . To determine the domain of f /g one needs to exclude those points where g is zero.16 (Intermediate Value Theorem). The same statement holds for (f · g)(x) = f (x) · g(x). g is deﬁned at points arbitrarily close to c.. and trigonometric functions are continuous. then the function is always continuous at c. such that f (c) = 0. Theorem 1.4 Continuous Functions We deﬁne continuous functions and discuss a few of their basic properties. Then f + g. f is deﬁned for all g(x) where x is in the domain of g and close to c. One can produce many more continuous functions through standard operations on functions. If C is in between f (a) and f (b). Let f be a function and c a point in its domain.8 CHAPTER 1. In most cases the condition in Deﬁnition 1.8) x→c lim f (g(x)) = f lim g(x) . The clarify the remark about the domain in the proposition. i. then there exists a c ∈ [a. and f is continuous at limx→c g(x). In fact. rational functions.15. BASIC CONCEPTS 1. there are no other points in the domain of f arbitrarily close to c. A function f is continuous if it is continuous at all points in its domain..2. Polynomials. For the composition (f ◦ g)(x) = f (g(x)) on needs that g takes values in the domain of f . The class of continuous functions will play a central role later. wherever these functions are deﬁned. this equation holds whenever there are points in the domain of f arbitrarily close to c. Deﬁnition 1.14 says that (1. One may also reverse the order of applying a continuous function and calculating a limit: (1. f · g. i. b]. If c is an isolated point in the domain of f . such that |f (c) − f (x)| < whenever x belongs to the domain of f and |x − c| < δ.14.e. The function is said to be continuous at c if for all > 0 there exists a δ > 0. x→c provided the natural technical assumption hold. f /g and f ◦ g are continuous.

9) ax + by = c for some given real numbers a.11) y = m(x − x1 ) + y1 . Theorem 1. such that p(c) = 0. .1. b]. They are equivalent to the completeness of the real line. LINES 9 E. According to the theorem there exists some c ∈ (0. b b The number m is called the slope of the line.17 (Extreme Value Theorem). y1 ) is (1. If b = 0 we may rewrite the equation as (1.1. The Intermediate Value and Extreme Value theorem are typically proved in an introductory analysis course. where it is assumed that a and b are not both zero.5 Lines In general.10) a c y = − x + = mx + B. 1. y1 ) and (x2 .g. 1). suppose that p(x) = x3 − x2 + 2x − 1. b]. Expressed in words. The equation of a line with slope m through the point (x1 . the most useful version of the equation of a line is its point-slope formula. such that f (c) ≤ f (x) ≤ f (d) for all x ∈ [a. b and c. Given any two points (x1 . b]. x2 − x 1 For our purposes. the theorem says that a continuous function on a closed interval assumes a smallest and largest value. We mentioned this property of the real numbers in Section 1. and B is the point in which the line intersects the y-axis. Let f be deﬁned and continuous on the closed interval [a. Then there exist points c and d in [a..5. a line consists of the points (x. The line is vertical if and only if b = 0. p(0) = −1 and p(1) = 1. y2 ) in the plane. y) in the plane which satisfy the equation (1. also called the y-intercept. the line through them has slope m= y2 − y1 . The polynomial is certainly a continuous function.

19. BASIC CONCEPTS 1. We call a line t(x) the tangent line to the graph of f (x) at x = c if t(x) is the best linear approximation of f (x) on some open interval around c.20.21. and we denote it by f (c). this deﬁnition is hard to work with. Not only does f (x) − t(x) converge to zero as x approaches c. We call a line t(x) the tangent line to the graph of f (x) at x = c if (1. Suppose f (x) is a function and c is an interior point of its domain. i. Then we say that f (x) is diﬀerentiable at c. let us say Deﬁnition 1. x−c The equation in (1.e.10 CHAPTER 1.12) expresses in a precise form in which sense the tangent line is close to the graph of f (x) near c. such that c ∈ I ⊆ B. To be able to express ourselves concisely. Deﬁnition 1. the line t(x) is closer to the graph of f (x) than any other line for all x in some open interval around c.18. Deﬁnition 1. Deﬁnition 1.13) t(x) = f (c)(x − c) + f (c). It is easier to work with a more concrete deﬁnition. Although the term ‘best linear approximation near c’ gives an excellent intuitive picture what a tangent line is. . and assume that there is a tangent line to the graph of f (x) at x = c. then it is unique.. Suppose f (x) is a function and c is an interior point of its domain. We use tangent lines to deﬁne the concept of diﬀerentiability and the derivative. but it there is a tangent line.6 Tangent Lines and the Derivative We like to introduce the concept of tangent lines.12) x→c lim f (x) − t(x) = 0. Utilizing the notation in the previous deﬁnition we can write down the equation of the tangent line to the graph of f (x) at x = c in point-slope form: (1. We give a ﬁrst deﬁnition for a tangent line. Suppose f (x) is a function and c is an interior point of its domain. A point c is an interior point of a subset B of R if there is an open interval I. it does so even when divided by x − c. We call the slope of the tangent line the derivative of f (x) at c. For a given function and an interior point c in its domain there may or may not be a tangent line.

Deﬁnition 1. Say. whose domain consists of all those points where f (x) is diﬀerentiable. shows that t(x) = A1 (x − c) + A0 .23. By deﬁnition. an open set is a set. By deﬁnition.22. The technique used in the example. Its slope is p (−2) = −52. Suppose the domain of the function f (x) is an open set. We can use any polynomial p(x) and point x = c and write p(x) in powers of (x − c). For t(x) as proposed. To do so. suitably generalized. the result is p(x) = An (x − c)n + · · · + A1 (x − c) + A0 . Find the tangent line t(x) to the graph of p(x) at x = −2 and p (−2). Let p(x) = 2x4 − 3x2 + 5. You are expected to ﬁll in some of the arithmetic steps. replacing u by (x + 2). TANGENT LINES AND THE DERIVATIVE 11 To diﬀerentiate a function means to ﬁnd its derivative. Then say that f (x) is diﬀerentiable if it is diﬀerentiable at each point of its domain.6. we see that p(x) − t(x) x−c = p(x) − t(x) x+2 (provided |x + 2| ≤ 1) = |2(x + 2)3 − 16(x + 2)2 + 45(x + 2)| ≤ 65|x + 2| This estimate shows that (p(x) − t(x))/(x − c) converges to zero as x approaches c = −2. We consider f (x) as a function. this means that t(x) is the desired tangent line. such that each of its points is an interior point. ♦ The example is generic. Example 1.1. Solution: As a ﬁrst step we expand p in powers of u = (x + 2). we ﬁnd: p(x) = 2(x + 2)4 − 16(x + 2)3 + 45(x + 2)2 − 52(x + 2) + 25. We assert that t(x) = −52(x + 2) + 25 and p (−2) = −52. we substitute u − 2 for x and expand p in powers of u. p = 2(u − 2)4 − 3(u − 2)2 + 5 = 2(u4 − 8u3 + 24u2 − 32u + 16) − 3(u2 − 4u + 4) + 5 = 2u4 − 16u3 + 45u2 − 52u + 25 Reversing the substitution.

14) is also more accessible to computer assisted algebra than the limit deﬁnition.3.24. without the limit concept. and the picture shows how they ‘hug’ each other. The parabolas p(x) and q(x) touch each other without crossing. In an example. The condition in (1. and t(x) have the same value at x = c. Polynomials are diﬀerentiable. With this deﬁnition fewer functions will be diﬀerentiable than with the one given in Deﬁnition 1. |f (x) − t(x)| ≤ A(x − c)2 . the deﬁnition of a limit is the most diﬃcult one in a ﬁrst semester of calculus.14) for all x ∈ I. the graphs of f (x) and t(x) have to be close to each other near x = c. p(x). A pedagogical advantage of the approach is that one does not have to understand limits before one can understand the deﬁnition of the derivative. BASIC CONCEPTS is the tangent line to the graph of p(x) at x = c. In this sense. There you see the function f (x) = sin x. We call a line t(x) the tangent line to the graph of f (x) at x = c if there exists and open interval I around c and a number A. The inequality in (1. There is very little space left between p(x) and q(x). tangent line. and p (c) = A1 . and the tangent line t(x) (dashed).12 CHAPTER 1. this situation is shown in Figure 1. 1.14) at least alludes to a divisibility condition. but we have shown that Proposition 1.12) by a slightly stronger one. q(x). Suppose f (x) is a function and c is an interior point of its domain. Eventually we will ﬁnd a more eﬃcient method for diﬀerentiating polynomials. rigorously. but this is not crucial. There is also a geometric picture which illustrates the concept of closeness.14) can be rewritten as q(x) = t(x) − A(x − c)2 ≤ f (x) ≤ t(x) + A(x − c)2 = p(x). the parabola q(x) (dotted and open downwards). and f (x) and t(x) are squeezed in between them. such that (1.20. and it is interesting to explore ways to develop calculus. where the parabolas q(x) and p(x) are deﬁned by the expressions they are adjacent to. In terms of algebraic geometry (1. One can do this by replacing the condition in (1.1 Derivatives without Limits Without a doubt.6.25. All four function f (x). Deﬁnition 1. and derivative. the parabola p(x) (dotted and open upwards).

5 2 Figure 1.25 0.3: Sine Function and Tangent Line between two Parabolas 1.7 Secant Lines and the Derivative Often a diﬀerent approach is taken to motivate and introduce the derivative.7.1.26. then f (c) = lim f (c) − f (x) .75 0.25 1 0. (1. and its slope (f (a) − f (b))/(a − b) is called . c−x x→c Proof.5 1. Suppose f is a function and c is an interior point of its domain. The line through (a. Theorem 1. This is obvious once one uses the expression for the tangent line in (1.15) f (x) − t(x) f (x) − f (c) = − f (c). x−c x−c Apply limits to both sides of the equation and the assertion follows. f (b)) is called a secant line. Let us explain the situation geometrically.12) inside the limit. If f is diﬀerentiable at c. SECANT LINES AND THE DERIVATIVE 13 1.5 1 1.5 0.13) and substitutes it in the expression in (1. f (a)) and (b. Suppose a and b are distinct points in the domain of the function f .

005 0.005 -0. a line near c.00005 Figure 1.5. or resembles.05 0. b].01 0. You see part of its graph over two diﬀerent intervals in Figure 1.27. . BASIC CONCEPTS the average rate of change of f over the interval [a.0075 -0. and possibly illustrate that the tangent line is close to the graph in the sense of Deﬁnition 1.1 -0. call t(x) = f (x − c) + f (c) the tangent line.0001 0. The theorem asserts that for a diﬀerentiable function this limit of the slopes of secant lines is the slope of the tangent line. (f (c)) and (x. f (x)).005 0.20.14 CHAPTER 1.005 0. If a function is diﬀerentiable at a point.01 -0.8 Diﬀerentiability implies Continuity It is worth pointing out that Theorem 1.4: f (x) = x2 sin(1/x) Figure 1. 1. By no stretch of imagination will you say that the graph of the function looks like a line.00005 0.4 and 1. Eventually you will be able to show that the function f (x) = x2 sin(1/x) 0 if x = 0 if x = 0 is diﬀerentiable everywhere on the real line. 0. In (1.15) we are considering the slopes of secant lines through (c. and then we take the limit as x approaches c. For the obvious reason f (c) is called the rate of change or instantaneous rate of change of f at c.0025 -0. Many authors introduce the derivative as the limit of the slopes of secant lines.0025 -0.0001 -0. then it is continuous at this point.05 -0.0075 0.5: f (x) = x2 sin(1/x) It is misleading to say that the graph of f (x) looks like.1 -0.

2 1. This implies that limx→c f (x) = f (c) and that f (x) is continuous at c. x−c Then certainly x→c lim [(f (x) − f (c)) − f (c)(x − c)] = 0. and f (x) = |x| is not diﬀerentiable at x = 0. the function f (x) = |x| is continuous.6: The absolute value function The converse of the theorem is false. so does (f (x)−f (c)).6) that there is not line close to the graph of this function near x = 0. We can also give an analytic argument.5 1 0.9.g..9 Basic Examples of Derivatives Let us use the deﬁnitions and work out a few derivatives. By assumption we have the derivative f (c) and x→c lim f (x) − f (c) − f (c) = 0. There are continuous functions which are not diﬀerentiable. They are 1 if x > 0 and −1 if x < 0. we have to study the diﬀerence quotients (|x|−|0|)/(x−0) = |x|/x.1. but it is not diﬀerentiable at x = 0. BASIC EXAMPLES OF DERIVATIVES 15 Proof. It is apparent from the graph (see Figure 1. 1. E. . Because f (c)(x−c) converges to zero as x approaches c.5 -2 -1 1 2 Figure 1. According to the deﬁnition of diﬀerentiability. Denote the function by f (x) and the point of diﬀerentiability by c. There is no number these diﬀerence quotients converge to.

Example 1. . . 2 ax + b The tangent line to the graph of f (x) at x = c is then √ a t(x) = √ (x − c) + ac + b. . then f (x) = −1/x2 . √ Exercise 1. = lim x→c x − c x→c (x − c)( x + c) 2 c lim Remark 1. If f (x) = xn and n is a non-negative integer.28. . n = 0. then f (x) = axa−1 . 2. 2 ac + b Verify that (1. generalizing all of the examples above. Suppose c = 0. then f (x) = nxn−1 . Proof. Show that a f (x) = √ . Then xn − cn = lim (xn−1 + xn−2 c + · · · xcn−2 + cn−1 ) = ncn−1 x→c x − c x→c lim The cases n = 0 and n = 1 are even easier and left to the reader. Suppose that f (x) = ax + b and ax + b > 0. lim 1 x x→c −1 c−x 1 c = lim = − 2. 3 2( ac + b) .. If f (x) = 1/x. Proof.29.16 CHAPTER 1. i. Example 1. x→c xc(x − c) x−c c √ √ x and x > 0.16) |f (x) − t(x)| ≤ a2 √ (x − c)2 .e. If f (x) = Proof. Suppose that n ≥ 2. then f (x) = 1/(2 x).30. 1. √ √ x− c x−c 1 √ √ = √ . Eventually we will see that if f (x) = xa for any real number a. BASIC CONCEPTS Example 1.

The details are similar to the ones in Example 1. Exercise 2. then |f (x) − t(x)| ≤ (x − c)2 . Proof. then f (x) = − sin x.17) | sin x − t(x)| ≤ (x − c)2 The steps are essentially the same as in the proof above.1.4). a bound on the diﬀerence between the function and its tangent line.31. and provides an explicit error estimate. Show that sin x = cos x.25. the angle x needs to be measured in radians. if t(x) = sin c(x − c) + cos x is the tangent line to the graph of f (x) at x = c. BASIC EXAMPLES OF DERIVATIVES 17 The estimate in (1. lim sin x − sin c x−c = = = = = sin(c + h) − sin c h→0 h sin c cos h + cos c sin h − sin c lim h→0 h sin c(cos h − 1) + cos c sin h lim h→0 h cos h − 1 sin h sin c · lim + cos c · lim h→0 h→0 h h cos c. If f (x) = cos x.25. but it provides an explicit error estimate. Example 1. It is left as an exercise for the reader to show that (1. For this equation to hold.16) shows diﬀerentiability in the sense of Deﬁnition 1.9.17) does not only show diﬀerentiability in the sense of Deﬁnition 1. a bound on the diﬀerence between the function and its tangent line. The estimate in (1. Furthermore.31. Below we will set x = c + h and x − c = h. The tangent line to the graph of the sine function at x = c is t(x) = cos c(x − c) + sin c. lim x→c For computation of the limits in the second to last line see (1. .

then we set (1. First we take a p-th power and then a q-root. Theorem-Deﬁnition 1. The arithmetic properties of the exponential function.18) loga (y) = x. √ any rational number For r = p/q (p and q are integers) one can deﬁne ar = q ap . More precisely. ax > 0 for all x.33 (Exponential Laws). The theorem just says that the exponential laws. to use the notation ax for expa (x) also if x is not rational. More explicitly. Theorem 1. such that expa (x) = y. which is deﬁned for all real numbers x such that expa (x) = ax whenever x is a rational number. It is common. Let a be a positive number. called the exponential function with base a and denoted by expa (x). For every number y > 0 there exists exactly one number x. Furthermore.18 CHAPTER 1. aloga y = y and loga (ax ) = x . This function is monotonic. are collected in our next theorem.10 The Exponential and Logarithm Functions The exponential and logarithm are of great importance and we do not want to delay their introduction any further. so we use (0. a = 1. whose domain consists of all rational numbers. Suppose a is a positive real number and a = 1. h(r) is increasing if a > 1 and decreasing when 0 < a < 1. also called the exponential laws. In this sense we have a function h(r) = ar . also hold in the generality of our current discussion. We just deﬁned a function loga (y). and we will follow this convention. For any positive real number a and all real numbers x and y ax ay = ax+y ax /ay = ax−y (ax )y = axy If x is the unique solution of the equation ax = y. ∞) as the range of the exponential function expa (x). It is called the logarithm function with base a. BASIC CONCEPTS 1. and by construction it is the inverse of the exponential function expa (x). There exists exactly one monotonic function. ∞). which you previously learned for rational exponents. technically we are not quite prepared for it and at a later point we have to revisit the introduction to ﬁll in details. Still. so that the domain of the exponential function is (−∞.32.

5 1.10.1.7 and 1. The domain of the logarithm function is (0.34 (Laws of Logarithms).7: exp2 (x) Figure 1. ∞). So exp(x) = expe (x) and ln(x) = loge (x). for all positive real numbers x and y.5 -1 -1 -0. It is denoted by ln(x).5 0. 2.5 0. Corresponding to the exponential laws in Theorem 1. It is increasing if a > 1 and decreasing if 0 < a < 1.33 we have the laws of logarithms.8 you see parts of the graphs of the exponential and logarithm functions with base 2. For any positive real number a = 1.718281828.5 2 1. and vice versa. ∞) and its range is (−∞. This irrational number is called the Euler number (named after Leonard Euler) and denoted by e. . THE EXPONENTIAL AND LOGARITHM FUNCTIONS 19 for all x ∈ R and all y > 0. The exponential function is the exponential function for the base e.5 1 1. Theorem 1.5 1 0. We will deﬁne it precisely later. Deﬁnition 1.5 3 Figure 1.35. One set of laws implies the other one.5 1 -0.5 0. Its inverse is the natural logarithm function. It is denoted by exp(x) or ex . and e ≈ 2.8: log2 (x) The Euler number e as base There is one number which is preferrable as base over the others.5 2 2. and any real number z loga (xy) = loga (x) + loga (y) loga (x/y) = loga (x) − loga (y) loga (xz ) = z loga (x) In Figures 1.5 1 1.

The least technical and for our purposes suﬃcient solution is captured in Proposition 1. Deﬁnition 1. For any positive number a (a = 1). F (x) = f (x) for all x ∈ I. and f extends to a diﬀerentiable function F on an open interval containing I.11 Diﬀerentiability on Closed Intervals In Deﬁnition 1. then f (x) = F (x) is unique for all x ∈ I. 1. and that J is the domain of a function F . let us relate the exponential and logarithm functions for diﬀerent bases to those with base e. and the derivative of the natural logarithm function is 1/x. Let us formalize the idea of extending functions. .e. There are situations in which one would like to apply the notion of diﬀerentiability to functions with other kinds of domains.39.36. Suppose that I and J are subsets of the real line R and I ⊆ J. i. ln a These identities follow from the exponential laws and the laws of logarithms. Other Bases Finally. the interval is neither empty nor a single point. Theorem 1.19) exp (x) = exp(x) CHAPTER 1. Without some restrictions on I. x The derivative of the exponential function is the exponential function. that I is the domain of a function f . Deﬁnition 1.20 Eventually we will see (1. BASIC CONCEPTS and ln (x) = 1 . a function may be diﬀerentiable without the derivative being well deﬁned.37. Suppose the function f is deﬁned on an interval I.38. ax = ex ln a and loga x = ln x .22 we deﬁned what it means that a function is diﬀerentiable on an open set. A function f is said to be diﬀerentiable on a subset I of R if it extends to a diﬀerentiable function F on an open set. We call F an extention of f if it agrees with f on I.. We set f (x) = F (x) for all x ∈ I.

∞) as its domain. it makes sense to write (compare Example 1. then dy dy 1 = y = ex . dx In this notation x plays two roles. This in acceptable df d because it won’t lead to confusion. ∞) as domain. E. In contrast. and it lends itself more to generalizations in higher dimensions. b]. 0) is a vertical line.) 1.1. Physicists will indicate a derivative with respect to time by a dot. then dy (x) = cos x. Instead of dx (x) we also write dx f (x). To be speciﬁc about this aspect. (The function g(x) is diﬀerentiable if we use (0.. if x is a function of time. The slope of this line is not a real number and we do not have a derivative. Leibnitz’ notation for the ˙ df derivative of a function f of a variable x is dx . and if y(x) = ln x.12 Other Notations for the Derivative There are diﬀerent notations for the derivative of a function. We will use it frequently.31): If y(x) = sin x. Let us discuss two examples. The only sensible candidate for the tangent line to the graph of g(x) at the point (0. This is particularly convenient if f stands for a larger expression as in d sin x = cos x dx or d x e = ex .g. then = . 1] is diﬀerentiable. then they will write x(t) instead of x (t). The expression does not tell where dy/dx is evaluated. It is the name of the variable of y as well as the name of the variable of the derivative of y. dx dx x This notation is not always speciﬁc enough. It extends to the diﬀerentiable function F (x) = x2 with √ the open set (−∞. Our discussion is less painful. OTHER NOTATIONS FOR THE DERIVATIVE 21 We are mostly concerned with deﬁning diﬀerentiability for functions whose domain is a closed interval [a. The expression dy/dx stands for the derivative of y with respect to x. Some authors use one-sided limits and one-sided derivatives to contemplate derivatives at the end points of the interval.12.19)) we have: If y(x) = ex . Expressing the derivatives of the exponential and natural logarithm functions this way (see (1. The function f (x) = x2 with domain [0. and that is a function. dx . ∞). where a < b. the function g(x) = x is not diﬀerentiable on the interval [0.

19). Example 1. and the derivative of a multiple of a function is the multiple of the derivative. be able to apply the accurately. 1.1 Linearity of the Derivative Diﬀerentiation is compatible with addition of functions and multiplication with a constant. You can do it even on the computer. Then h(x) = f (x) + 3g(x). and assume that both of them are diﬀerentiable at x.13 Rules of Diﬀerentiation We discuss formulas for calculating the derivative of a composite function from the derivatives of its constituents. Let c be a real number. dx dx (1. In the last section of this chapter we summarize the computational results of this section. and practice many examples. In a more mathematical language one says that diﬀerentiation is linear. Previously we found that f (x) = 2x and that g (x) = ex .21) and In words. g(x) = ex and c = 3. the derivative of a sum of functions is the sum of the derivatives.20) (f + g) (x) = f (x) + g (x) and (cf ) (x) = cf (x). We collect the rules established in this section and tabulate the derivatives of many of the important functions which we considered. Let f and g be functions.13. Then f + g and cf are diﬀerentiable at x and their derivatives are given by (1. These formulas. turn the process of diﬀerentiation for many functions into an algorithm. which means that no “understanding” is required. Diﬀerentiate h(x) = x2 + 3ex . ♦ dx . You are expected to learn the basic rules. see (1.22 CHAPTER 1. In Leibnitz’ notation this reads d df dg (f + g)(x) = (x) + (x) dx dx dx d df (cf )(x) = c (x). BASIC CONCEPTS 1. together with the knowledge of the derivatives of some basic functions. We conclude that h (x) = dh (x) = 2x + 3ex .40. a rather mechanical process. Solution: Set f (x) = x2 .

1. Again.22) and (1. see (1. a special case of the formula which we just derived. If f (x) = 4x5 − 3x2 + 4x + 5. a = 1. then f (x) = 20x4 − 6x + 4.. Then the product f g and the quotient f /g are diﬀerentiable at x and their derivatives are given by . and assume that both of them are diﬀerentiable at x. Then the diﬀerentiation rules are (1. We stated previously that ln x = 1/x.41. In this sense ln a loga x = cf (x) where c = 1/ ln a and f (x) = ln x.2 Product and Quotient Rules Next we state the product and the quotient rule. They allow us to calculate the derivatives of products and quotients of functions.36.e.13. Thinking of f and g more as functions. we ﬁnd loga x = d dx ln x ln a = 1 1 1 1 ln x = × = .19).23) (cf ) = cf or d df (cf ) = c . RULES OF DIFFERENTIATION 23 Example 1. For the quotient rule assume in addition that g(x) = 0. Here is a speciﬁc example. and not so much as functions evaluated at a point. a function of the form f (x) = an xn + an−1 xn−1 + · · · + a1 x + a0 .42. Using the linearity of the derivative. see Theorem 1. let f and g be functions. Find the derivative of an arbitrary polynomial. Using Example 1. Solution: Recall that loga x = ln x . i. Diﬀerentiate log a x. Solution: A polynomial is a ﬁnite sum of multiples of non-negative powers of the variable. ♦ 1. the logarithm functions for an arbitrary positive base a. we may omit (x) from the notation. ♦ ln a ln a x x ln a Suppose f and g are deﬁned and diﬀerentiable on an set.28 and the linearity of the derivative we see right away that f (x) = nan xn−1 + (n − 1)an−1 xn−2 + · · · + a1 . dx dx (f + g) = f + g or d df dg (f + g) = + dx dx dx Example 1. where the ai are constants.13.

25) (x) = In Leibnitz’ notation these formulas become (1. ♦ (x3 + 1)2 (x3 + 1)2 Example 1. Let n be a negative integer and m = −n. Find the derivative of the rational function.43. BASIC CONCEPTS (1. the real line with the origin removed. [g(x)]2 (1. then we domain of the function is R\{0}.24) (f g) (x) = f (x)g(x) + f (x)g (x) f g f (x)g(x) − f (x)g (x) . The formula d n x = nxn−1 dx for all integer powers n. Putting this into the product formula yields 1 = x(2 ln x + 1). According to the quotient rule r (x) = 2x(x3 + 1) − (x2 − 5)3x2 −x4 + 15x2 + 2x = . h (x) = f (x)g(x) + f (x)g (x) = 2x ln x + x2 r(x) = x2 − 5 . Solution: We veriﬁed this formula for n ≥ 0 in Example 1. Diﬀerentiate the function h(x) = x2 ln x. Then p (x) = 2x and q (x) = 3x2 . x3 + 1 ♦ Solution: We set p(x) = x2 − 5 and q(x) = x3 + 2.45. [g(x)]2 (1. x = m 2m dx dx x x x . see (1. Then f (x) = 2x and g (x) = 1/x.26) d (f g)(x) = dx d dx f g (x) = df dg (x)g(x) + f (x) (x) dx dx df dx (x)g(x) dg − f (x) dx (x) .28.44. Solution: Write h(x) = f (x)g(x) with f (x) = x2 and g(x) = ln x.19). If n ≤ −1.27) Example 1.24 CHAPTER 1. x Example 1. Then d n 0 · xm − 1 · mxm−1 d −m 1 = = m+1 = nxn−1 .

Solution: We write the function as a quotient: f (x) = 1/ cos x. where n is an integer. i. ♦ 2x cos cos x cos x Suppose f and g are deﬁned and diﬀerentiable on an open set.31) (f g) = f g + f g or d df dg (f g) = g+f dx dx dx d dx f g df dx g dg − f dx . f (x) = sin x/ cos x. given by g2 (x) = [g(x)]2 . and not so much as functions evaluated at a point. Find the derivative of f (x) = tan x.28) sin x cos x − sin x cos x cos2 x + sin2 x 1 = = = sec2 x. and apply the quotient rule. they write tan x = (1. and that the derivative of a constant vanishes. The function is deﬁned for all x for which cos x = 0. for x not of the form nπ + 1/2.e.31) and cos x = − sin x (see Exercise 2 on page 17). We ﬁnd sin x sin x 1 sec x = (1. Then the product rule and quotient rule become (1. Thinking of f and g again more as functions.46.13.47. RULES OF DIFFERENTIATION Example 1.. We ﬁnd (1. That draws our attention to the fact that the function f (x) = tan x satisﬁes the diﬀerential equation f (x) = 1 + f 2 (x).1. . g2 ♦ and. Use that sin x = cos x (see Example 1. we may once more omit (x) from the notation. We apply the quotient rule.30) = · = tan x sec x.32) f g f g − fg = g2 or = Here g2 is the square of the function g. Diﬀerentiate the function f (x) = sec x. Example 1. wherever g(x) = 0. Based on the relevant trigonometric identity. using that cos x = − sin x (see Exercise 2 on page 17).29) tan x = 1 + tan2 x. 25 Solution: We express f (x) as a quotient of two functions. (1. 2x 2x cos cos cos2 x Some books and computer programs will give this result in a diﬀerent form.

BASIC CONCEPTS 1.48.26 CHAPTER 1. In the last expression we reversed the order of the factors to make the expression more readable. dx dx du dx Example 1. with g(x) = x2 + 1 and f (u) = eu .34) dh d df dg (x) = f (g(x)) = (g(x)) (x).45 with the chain rule we ﬁnd d n u (x) = nu (x)un−1 (x) dx . so that the composition (f ◦ g)(x) = f (g(x)) is deﬁned for all x in the domain of g. Here are some speciﬁc examples: d 2x+5 = 2 e2x+5 e dx d sin x = cos x esin x e dx d tan x = sec2 x etan x . If f (x) = eu(x) then f (x) = u (x)eu(x) . ♦ e dx Example 1. Let u(x) be a diﬀerentiable function. Remember that f (u) = f (u) = eu and 2 g (x) = 2x. and suppose that the domain of f contains the range of g.33) h (x) = (f ◦ g) (x) = f (g(x))g (x). Set h = f ◦ g. ♦ Example 1. In Leibnitz’ notation the chain rule says that (1. so that h(x) = f (g(x)). f (g(x)) = ex +1 .13. Diﬀerentiate the function h(x) = ex 2 +1 . The chain rule says that whenever g is diﬀerentiable at x and f is diﬀerentiable at g(x). In particular. The chain rule tells us that h (x) = f (g(x))g (x) = 2xex 2 +1 .3 Chain Rule Let f and g be functions. Solution: We write h = f ◦ g as a composition of two functions.49. then h(x) is diﬀerentiable at x and (1.50. Combining Example 1.

51. So.45.19). Solution: Set g(x) = u(x) and f (u) = un . envoking the chain rule (1.g.1. du |u| du −u u This means that for all non-zero u (1.13. E. Then h(x) = f (g(x)) = un (x). d 1 d 1 1 ln |u| = (−u) = (−1) = . f (u) = nun−1 . The chain rule tells us now that h (x) = f (g(x))g (x) = n(g(x))n−1 g (x) = nu (x)un−1 (x). According to Example 1. Diﬀerentiate the function ln |u| for u = 0. Then u = −|u| and ln |u| = ln(−u).36) d u (x) ln |u(x)| = . The chain rule tells us that. see (1. for u < 0.. To be speciﬁc.35) d 1 ln |u| = . RULES OF DIFFERENTIATION 27 for all integers n. We reordered the expressions so that the expression is more readable. Solution: We asserted that ln u = 1/u for positive values of u. assuming only that u is diﬀerentiable at x and u(x) = 0 if n ≤ −1. d 2x ln |x2 − 4| = 2 dx x −4 . ♦ du u More generally. ♦ Example 1. here are concrete examples: d (3x + 5)8 dx d 2 (x + 1)25 dx d tan3 x dx d cos2 x dx d sec5 x dx = 8(3x + 5)8−1 · 3 = 24(3x + 5)7 = 25(x2 + 1)24 · 2x = 50x(x2 + 1)24 = 3 sec2 x tan2 x = 2 cos x(− sin x) = −2 cos x sin x = 5 sec4 x sec x tan x = 5 sec5 x tan x. suppose that u < 0. dx u(x) assuming that u is diﬀerentiable and nowhere zero on its domain.

π/2). For diﬀerentiable functions which are everywhere positive on their domain and any real number q the diﬀerentiation formula in (1. d q u (x) = qu (x)uq−1 (x). where n is an arbitrary integer. We use the formulae for diﬀerentiating the exponential and natural logarithm functions. Speciﬁcally. Consider a function u which is diﬀerentiable and nowhere zero on its domain and q any real number.37) specializes to (1. BASIC CONCEPTS for all x = ±2. dx d (sec2 x + 5)π = 2π sec2 x tan x(sec2 x + 5)π−1 dx .36). u(x) − cos x 1 =51 − sin x 2 2 − sin x 5 whenever sin x = 1/2.37) If f (x) = |u(x)|q then f (x) = q u (x) |u(x)|q . f (x) = = = = = Here is a concrete example: d 1 − sin x dx 2 5 d ln f (x) e dx d ln(|u(x)|q ) e dx d q ln |u(x)| e dx d (q ln |u(x)|) eq ln |u(x)| dx u (x) q |u(x)|q .28 CHAPTER 1. Then (1. Example 1. we have to exclude all x of the form π 5π 6 + 2nπ and 6 + 2nπ. π) and for x ∈ (−π/2.38) For example: d (sin x)1/2 = dx cos x √ 2 sin x for x ∈ (0. We push matters a bit further. u(x) The assertion follows from (1. Eventually we will verify them independently.49 and the exponential laws.

we get the following derivatives: d x a = ax ln a (Assume a > 0. π/4)).13. . It is elementary to compute the derivatives of the hyperbolic functions: sinh x = cosh x and cosh x = sinh x.13. sinh x) for some x ∈ (−∞. v) on the hyperbola u2 − v 2 = 1 can be expressed as (± cosh x. Hint: xx = ex ln x ) dx d sin x sin x = x + cos x ln x xsin x (Assume x ∈ (0.39) sinh x = 1 x e − e−x 2 & cosh x = 1 x e + e−x 2 You are invited to verify that cosh2 x − sinh2 x = 1.. d d f (g(h(x))) = f (g(h(x)) g(h(x)) = f (g(h(x)) · g (h(x)) · h (x).g. one can show that any point (u.1. These observations motive the attribute ‘hyperbolic’. Hint: ax = ex ln a ) dx d x x = (1 + ln x)xx (Assume x > 0. E. dx dx For example √ d √x2 +1 1 xe x +1 2 = e x +1 · √ e · 2x = √ dx 2 x2 + 1 x2 + 1 √ 2 d tan3 (5x2 − x + 5) = 3 tan2 (5x2 − x + 5) sec2 (5x2 − x + 5) · (10x − 1) dx 1. RULES OF DIFFERENTIATION Using the tricks from above. ∞). dx x 29 To diﬀerentiate a composition of more than two diﬀerentiable functions we apply the chain rule repeatedly. (1. x = 1.4 Hyperbolic Functions The exponential function may be used to deﬁne the hyperbolic sine and cosine. Conversely.

∞). 2. the function f (x) = cos x maps the interval [0.40) g(f (x)) = x and f (g(y)) = y for all x in the domain of f and all y in the domain of g. A few essential properties of inverse functions are listed in Proposition 1. cosh x sinh x cosh x sinh x As a routine application of the rules of diﬀerentiation. Its inverse arcsin x is typically used with . π/2) to the interval (−∞. ∞) to (−π/2. or you can work them out yourself. sech x = .5 Derivatives of Inverse Functions Let us recall. Some parts of this proposition are elementary. then so is g. 1. 1. If f is increasing. The function sin x maps the interval [−π/2. others are consequences of the intermediate value theorem. then J = f (I). The graph of g is obtained from the graph of f by reﬂection at the diagonal.52. For example. and csch x = . π/2] to the interval [−1. If f is continuous and I is an interval in the domain of f . BASIC CONCEPTS One may also deﬁne other hyperbolic functions tanh x = sinh x cosh x 1 1 . 4. and (1. If I is an open interval. Suppose f and g are inverses of each other. then J is an open interval. If f is continuous. The function f (x) = ex maps the interval (−∞.30 CHAPTER 1. then it is monotonic (increasing or decreasing) on any interval in its domain. It is customary to deﬁne its inverse arctan x as a function from (−∞. You can ﬁnd them in any table of mathematical formulas. is an interval.13. Two functions f and g are said to be inverses of each other (or each function is the inverse of the other one) if the domain of f is equal to the range of g. the image of I under the map f . ∞). the domain of g is equal to the range of f . If f is decreasing. 1]. then so is g. π/2). comparable to the identities for the trigonometric functions. The function tan x maps the interval (−π/2. 1]. 3. you may calculate the derivatives of these functions. coth x = . ∞) to the interval (0. π] to the inteval [−1. There are identities for these hyperbolic functions.

10.5 -1 -0. Then g is diﬀerentiable at all points y ∈ (A. We will not give a formally complete proof of the diﬀerentiability assertion. 1] domain [−1. f (x)) and y = f (x). and its range is [−π/2.5 0. f (x) Proof.5 0.5 1 -0. Denote the inverse of f by g. π/2]. RULES OF DIFFERENTIATION 1 1. and this is assured by the assumption that t(x) is not horizontal.5 31 1 0. and denote the image of f by (A. the slope of t(x) is the reciprocal of the slope of T (x).5 Figure 1. Still.5 -1 -0.5 -0. Theorem 1.5 -1 -1 -1. x) = (y. g(y)). B). You see the graph of these two functions in Figures 1. Let f be a diﬀerentiable and invertible function which is deﬁned on an open interval (a. We need that T (x) is not vertical.9 and 1. By deﬁnition we have f (g(y)) = y for all y ∈ (A. then its reﬂection T (x) at the diagonal is close to the graph of the function g(x) at the point (f (x).5 -1.5 1 1. if the line t(x) is close to the graph of the function f (x) at the point (x. With the role of x and y being interchanged. We ﬁnd f (g(y))g (y) = 1 and g (y) = 1 .5 0.13.9: sin x on [−π/2. This provides the formula for the derivative. f (g(y)) . B). this is also easy to calculate. B) for which f (g(y)) = 0. The relation between the derivative of a function and its inverse is spelled out in our next theorem. Actually.1.10: arcsin y on [−1. b). For these values of y and for x such that f (x) = y the derivative is given by: g (y) = 1 f (g(y)) or g (f (x)) = 1 . Diﬀerentiate both sides of the equation. π/2] Figure 1.53. 1].

π/2) as the range for arctan.11. f (x) We apply the theorem to ﬁnd some important derivatives. ∞) as the domain and (−π/2. The theorem says that the exponential function ferentiable and provides the formula for the derivative: d y 1 1 = ey . arctan y = dy 1 + y2 According to standard conventions we use (−∞. then g(y) = g(f (x)) = x.55.53. and f (x) = sec2 x is nowhere zero.53 tells us that g(y) = arctan y is diﬀerentiable on its entire domain (−∞. Assume that the natural logarithm function is diﬀerentiable and that ln x = 1/x.54. tan (arctan y) sec2 (arctan y) All we need to do now is to ﬁgure out what cos2 (arctan y) is. as asserted in (1. the exponential function is the inverse natural logarithm function ln.32 CHAPTER 1. . Example 1. Show that the exponential function is diﬀerentiable and that d y e = ey . Theorem 1. dy Solution: By deﬁnition. If y = f (x). ∞). We note that ln (x) = 0 for all x in (0. ∞).19). The theorem also provides us with the formula for the derivative: arctan (y) = 1 1 = = cos2 (arctan y). and we obtain the second version of the formula for the derivative of the inverse of the function: 1 g (f (x)) = . and that d 1 . e = = y) dy 1/ey ln (e as claimed. To do this we draw a triangle in which we identify the available data. Set f (x) = ln x and g(y) = ey in rem 1. Solution: The function f (x) = tan x is diﬀerentiable on its entire domain. the domain natural logarithm. ♦ of the Theoof the is dif- Example 1. We refer to the notation in Figure 1. Show that the function g(y) = arctan y (the inverse of f (x) = tan x) is diﬀerentiable. BASIC CONCEPTS as claimed.

By the theorem of Pythagoras. and assuming the diﬀerentiability of u(x). The angle at the vertex A is called u. So. 1 + y2 This is exactly what we claimed. we ﬁnd a slightly more general formula: (1. the length of the hypotenuse is Then cos u = The conclusion is that (1. 1 + y2 1 1+ y2 and cos2 (arctan y) = 1 .1.42) d u (x) arctan(u(x)) = .11: An informative triangle There you see a rectangular triangle. tan u = y and arctan y = u. 1 + y2. by deﬁnition.41) arctan (y) = ♦ 1 . and the opposing side of length y. Combined with the chain rule. RULES OF DIFFERENTIATION 33 y u A 1 B Figure 1. dx 1 + u2 (x) . The adjacent side to this angle is chosen to be of length 1. the right angle is at the vertex B.13.

and arcsec x as they are given in Table 1. we were given an equation y = f (x). 1 + sin2 x The reader is invited to verify the formulas for the other inverse trigonometric functions arcsin x.12. Consider the equation (1. 1). 1/3) 1 − 9x2 2x √ if x ∈ (−1. Show that arcsin x is diﬀerentiable on (−1.44) (x2 + y 2 )2 = x2 − y 2 . It is customary to think of arcsin x as a function from [−1. The solutions of this equation form a curve5 in the plane called a lemniscate. BASIC CONCEPTS d arctan(x2 + 5) = dx d arctan(sin x) = dx 2x 1 + (x2 + 5)2 cos x .43) For example: d arcsin(3x) = dx d arcsin(x2 ) = dx 3 if x ∈ (−1/3.13. For example Exercise 3. and suppose that |u(x)| < 1.6 Implicit Diﬀerentiation Until now we considered functions which were given explicitly. Then. Let u(x) be a diﬀerentiable function which is deﬁned on an open interval. see Figure 1. Parts of this curve look like the graph of a function. π/2]. we ﬁnd that (1. 1. arccot x.34 For example: CHAPTER 1.. 1] to [−π/2. arccos x. arcsin x = √ dx 1 − x2 We may once more improve on this formula. The points on the graph of f are the points which satisfy the equation. such .e. and that its derivative is d 1 . 1) 1 − x4 √ d arcsin(u(x)) = dx u (x) 1 − u2 (x) . I.3 on page 63. where f (x) is some instruction which assigns a value to x. using the chain rule.

5 1 Figure 1. Find the slope of the tangent line to the unit circle at the √ point (1/2.1 -1 -0.3 0. The unit circle consists of all points which satisfy the equation x2 + y 2 = 1.12: Lemniscate as the points for which y ≥ 0. Solution: We write y = y(x) to emphasize that y as a function of x. 3/2).2 -0. Diﬀerentiating both sides of the equation of the circle we get 2x + 2y dy = 0 or dx dy −x = . ♦ 5 We will rely on the readers intuitive idea of a curve in the plane.5 -0. Let us start out with an example which we have studied before. Without solving the equation for y. dx y Plugging in the coordinates of the speciﬁed point.2 0. .1 -0.1. This process is called implicit diﬀerentiation. we still like to calculate the slope of curve at one of its points. we ﬁnd that dy dx √ (1/2. 3/2) −1 =√ . 3 We used a diﬀerent way to indicate at which point we evaluate the derivative because we had to specify the x and the y coordinate of the point.13.3 0.56. Example 1. RULES OF DIFFERENTIATION 35 0.

√ E. BASIC CONCEPTS Example 1. A quick look at Figure 1. dx 2y(x2 + y 2 ) + y y(2(x2 + y 2 ) + 1) Given any point (x. 1 −3 + 2 3) is a point on the lemniscate.36 CHAPTER 1. dy The tangent line is horizontal whenever dx = 0. dx dx Bring all terms with a factor dy/dx to the left hand side of the equation and those without to the right hand side. y) = ( 1 . (2y(x2 + y 2 ) + y) dy = x(1 − 2(x2 + y 2 )).56. and ﬁnd the coordinates of the points where the tangent line is horizontal. That dy means that dx = 0 whenever 1 − 2(x2 + y 2 ) = 0 or 1 x2 + y 2 = . Solution: You see a picture of the lemniscate in Figure 1. and y 2 = 1 − x2 into the equation of the curve.g. we can plug it into the dy expression for dx and we get the slope of the curve at this point.57. 2 2 and at this point the slope of the tangent line is √ dy 2− 2 =√ √ . dx 3 −3 + 2 3 This speciﬁc calculation takes a bit of arithmetic skill and eﬀort to carry out. we consider y as a function of x and diﬀerentiate both sides of the equation. 2 Substitute x2 + y 2 = 1 . 8 .12 tells us that we may ignore points where x = 0 or y = 0. As in Example 1. the point (x. Find the slope of the tangent line to the lemniscate (x2 + y 2 )2 = x2 − y 2 . y) with y = 0 on the lemniscate. dx dy dx Finally we get an explicit expression for in terms of x and y: dy x(1 − 2(x2 + y 2 )) x(1 − 2(x2 + y 2 )) = = . We ﬁnd 2(x2 + y 2 )(2x + 2y dy dy ) = 2x − 2y . 2 2 Then we get an equation in one variable: 1 = x2 − 4 1 − x2 2 or x2 = 3 8 and 1 y2 = .12.

we get 2x + 2(y − a) dy = 0. RULES OF DIFFERENTIATION The points at which the tangent line to the lemniscate is horizontal are √ √ 6 2 (x.3536). then its equation is x2 + (y − a)2 = 1. we dy substitute dx = 4x into the second equation. Suppose the coordinates of the center of the circle are (0. y) = (± . dx dy Assuming that dx is the same for both curves at the point of contact. The crucial observation in this example is.1.58.5 0.5 1 0. we ﬁnd that dx = 4x. that the tangent line to the parabola and the circle will be the same at the point of contact. . Diﬀerentiating the equation of the circle with respect to x.6124. At which points will the circle touch the parabola?6 3 2.5 -1 -0.± ) ≈ (±.5 1 Figure 1.13. ♦ 4 4 37 Example 1. After some impliﬁcations we 6 More sensibly.13. Diﬀerentiating the equation of the parabola dy with respect to x.5 2 1. Suppose you drop a circle of radius 1 into a parabola with the equation y = 2x2 . ±.13: Ball in a Cup. Solution: You see a picture of the problem in Figure 1. a). drop a ball of radius 1 into a cup whose vertical cross section is the parabola y = 2x2 .

Consider the curve given by the equation x2 = sin y. The formula for the volume of a ball is V (r) = 4π r 3 . In summary. Solving the equation 1 + 4(y − a) = 0 for y. According the 3 the chain rule: dV dr dV dr = = 4πr 2 . BASIC CONCEPTS x(1 + 4(y − a)) = 0.14.59. At which rate does its volume change when r = 20 cm? Solution: Denote the volume of the ball by V and its radius by r. Find the √ slope of the curve at the point with coordinates x = 1/ 4 2 and y = π/4. dt dr dt dt With r = 20 and dr = 2 we get dV = 3200π cm3 /min. the 8 circle touches the parabola in the points √ 15 15 (x. y) = (2. . We use t to denote the time variable. The ball it too large to ﬁt into the parabola and touch at (0. 0).38 ﬁnd: CHAPTER 1. you know at which rate one of them changes. Exercise 5.14 Related Rates Many times you encounter situations in which you have two related variables. −1). Substituting this into the equation of the parabola. So we may assume that x = 0. y) = ± . You ﬁnd a picture of this Lissajous ﬁgure in Figure 1. This is the rate at dt dt which the volume of the ball changes with respect to time. ♦ . ♦ 4 8 Exercise 4.57 with the curve given by the equation y 2 −x2 (1−x2 ) = 0. Suppose the radius of a ball changes at a rate of 2 cm/min. Consider the curve given by the equation x3 + y 3 = 1 + 3xy 2 . 1. We consider V as a function of r as well as t. we ﬁnd that y = 15 at the point of contact. Repeat Example 1. Exercise 6. Example 1. In this section we treat such problems. and you like to know at which rate the other one changes. We substitute 4 this expression into the equation √ the circle and ﬁnd that the x coordinate of 15 of the point of contact is x = ± 4 . we ﬁnd that the y coordinate of the point of contact is y = a − 1 . Find the slope of the curve at the point (x.

y = 5 3. ♦ 2x Example 1. and dy = −3. dt dt √ so that the particle is moving to the right at a rate of 3 3 cm/min. dt dt √ √ In the given situation x = 5. x and y. Implicit diﬀerentiation of the equation of the circle gives us the equation dx dy + 2y = 0.4 0. as functions of the time variable t. At which rate does it move in the horizontal direction? Solution: The equation of the circle is x2 + y 2 = 100. The constant γ depends on the molecular structure of the gas and the temperature. It is called the adiabatic law.2 0.2 -1 -0.1. 0) in the Cartesian plane. For the purpose of this problem. RELATED RATES 39 0. We consider both variables.60.14. 5 3) and moves downwards at a rate of 3 cm/min. Boyle-Mariotte described the relation between the pressure and volume of a gas.5 1 -0.14: y 2 − x2 (1 − x2 ) = 0 Example 1. We ﬁnd that dx = 3 3. They derived the equation P V γ = C. we suppose that γ = 1.4 for air at room temperature.5 -0. Pressure (P ) and volume (V ) of air at room temperature are related by the equation7 P V 1.61. 7 .4 = C.4 Figure 1. Suppose a particle moves on a circle of radius 10 cm and centered at the origin (0. At some time the particle √ is at the point (5.

This formula is from Einstein’s special theory of relativity. ♦ Exercise 7. we ﬁnd √ dM 9 19m dM dv mvc2 = = = ≈ .4P =− . Right now the foot of the latter is 1 m away from the wall.62. The mass M of a particle at velocity v. ♦ Example 1. BASIC CONCEPTS Here C is a constant. A ladder.75 kg/cm2 sec.4P V . dV V where m is that mass at rest and c is the speed of light. is M (v) = m 1 − v 2 /c2 . dt dv dt 3610 1000(c2 − v 2 )3/2 The perceived mass increases at a rate of approximately 1% of its mass at rest. dt dV dt V dt Substituting the given information we ﬁnd that the pressure decreases at a rate of 1. Solution: We consider P as a function of V . as perceived by an observer in resting position. is leaning against a wall.010867m. and increasing at . At which rate is the mass changing when the particle’s velocity is 90% of the speed of light. dP 1.1 m/sec. At some instant t0 the pressure of the gas is 25 kg/cm2 and the volume is 200 cm3 . Find the rate of change of P if the volume increases at a rate of 10 cm3 /min.4P dV = =− . Diﬀerentiation of the equation yields dP 1. You are pulling the foot of the ladder further away from the wall at a rate of . At which rate is the top of the ladder sliding down the wall? .40 CHAPTER 1.4 V + 1.4 = 0 or dV According to the chain rule dP dP dV 1.001c per second? Solution: According to our rules of diﬀerentiation dM mvc . 7 m long. = 2 dv (c − v 2 )3/2 Applying the chain rule and substituting the values.

Suppose the size of a population of bacteria in a laboratory experiment is C1 = 5.64. Predict the size of the population at time t = 10.15.45) is an example of a diﬀerential equation.46). say A0 = A(t0 ). 4. Find the relative growth rate a of the population. but very useful model for population growth is the Malthusian Law (1. We saw that the functions A(t) = Ceat are solutions of this equation. It says that the rate of change of a population is proportional to its size. and it can be shown that on an interval any solution is of this form. 1.15 Exponential Growth and Decay An idealistic. On an interval which contains t0 the function A(t) = A0 ea(t−t0 ) is the unique solution8 of the initial value problem in (1. 000 at time t2 = 5.63. 000. We denoted the proportionality factor by a. 8 . We may specify the value of A at some time t0 . Find the time at which the population reaches 8. Find the time within which the population doubles9 .1.19). 2. Then we have an initial value problem (1. We also say that A(t) grows exponentially and a is the relative growth rate.46) A (t) = aA(t) and A0 = A(t0 ). 000 at time t1 = 2 and C2 = 7. Theorem 1. That the function satisﬁes the diﬀerential equation follows from (1. Here time is measured in hours since the beginning of the experiment. 5. The equation in (1. which we still need to prove. and the unknown is a function.45) A (t) = aA(t). EXPONENTIAL GROWTH AND DECAY 41 1. an equation which involves a function and its derivatives. 3.9 on page 68. Find the formula for the size of the population at any time t ≥ 0. The uniqueness assertion follows from Proposition 2. The essential aspects of dealing with (1. 9 Note that the doubling time depends only on the relative growth rate a.46) are addressed in Example 1.

Experiments have shown that the rate at which radioactive decays occur is proportional to the amount of radioactive material present. This rate is proportional to the rate at which the amount of the material decreases. and 3. 000 at time t1 . The minus sign in the equation is included so that k will be positive. where A0 = A(t0 ). To calculate the relative growth rate a observe that C2 A0 ea(t2 −t0 ) = = ea(t2 −t1 ) C1 A0 ea(t1 −t0 ) We ﬁnd that a= ln C2 − ln C1 ln 1. k .48) T = ln 2 . The theorem tells us that A(t) = A0 ea(t−t0 ) . Then A(t0 + T ) = A0 eaT = 2A0 Thus the doubling time is T = ln 2 a or eaT = 2 and aT = ln 2. 264. Substituting t = 10 we ﬁnd that A(10) ≈ 12. then 8000 = 5000ea(t1 −2) or ln(8/5) = a(t1 − 2) or t1 = ln(1. Consider a radioactive substance. 5. The population grows at a rate of about 11% per hour.6) + 2 ≈ 6. BASIC CONCEPTS Solution: We denote the size of the population at time t by A(t). As in the computation of the doubling time in the previous example. Suppose t denotes time and A(t) the amount of radioactive substance at time t. t2 − t1 3 and ln C2 C1 = a(t2 − t1 ).4 = ≈ . 2. 1.2 a The size of the population reaches 8. Suppose the size of the population reaches 8.42 CHAPTER 1. The size of the population at any time t ≥ 0 is A(t) = 5000ea(t−2) . The half-life T of a radioactive substance is the time within which half of it decays. Suppose at some time t0 the size of the population is A0 = A(t0 ) and T hours later the size of the population is 2A0 = A(t0 + T ).47) A (t) = −kA(t).18 hours.11. one ﬁnds (1. ≈ 6. where a is as above. 000 about 6.2 hours into the experiment. The experience which we just described can be expressed as a diﬀerential equation (1. 4.

45).8 decays per minute and gram.68 5568 or t1 = − 5568 ln ln 2 1. Suppose we measure 6. Suppose dead wood of the same kind shows 1. and the amount is believed to have been essentially constant for a long time (until recent nuclear testing). the idea is as follows. where a and b are constants. He was awarded the Nobel price for it. After death no more carbon-14 is absorbed. and the carbon-14 which was present at the time of death decays. A time independent solution (steady state solution) of this equation is f (t) = −b/a. We have that A(t1 ) = 1. approximately.8 6. Within a living organism there is an equilibrium. All living organisms absorb it.68e− 5568 t . ln 2 The tree died approximately 10. How long ago did the tree die? Solution: Let t0 = 0 be the time of death of the tree. 534.68 ≈ 10. Example 1. 500 years ago.50) f (t) = af (t) + b.8 ln 2 =− t1 6. We explain the process in a numerical example. In brief.16. The number of decays to be expected t years after death is A(t) = 6. The half-life of carbon-14 has been determined to be about 5568 years. Carbon-14 occurs naturally in the atmosphere. and t1 the present time. 1. The amount which is absorbed equals the amount which decays. measured in years.49) ln 1. wood from an oak or a human bone).1.8. .68 carbon-14 decays per minute and gram in a certain kind of wood at the time of death of the tree. and a = 0. MORE EXPONENTIAL GROWTH AND DECAY 43 In the late 1940ies Willard Libby invented (or discovered) the method of carbon-14 dating. or a part thereof (e.16 More Exponential Growth and Decay More generally than in (1. From this we calculate: (1. Measuring the number of decays in a dead organism allows us to determine the time of death.g.65. For many organisms one also knows how many carbon-14 decays to expect at the time of death. consider the diﬀerential equation (1. The level of the equilibrium is characteristic for the organism.

In summary.005t + 40.005 . This is the time at which you start to repay the loan. Solution: As variable we use time.005t − −200 = −25. Remark 2.00 per month due to payments which you make. On graduation day the balance of your student loan is $15.68. It is not hard to verify that the given functions are solutions of the respective problems.005 e. The balance increases at a rate of . Let us apply these ideas to solve some problems. Example 1.005B(t) − 200 and B(0) = 15. On an interval every solution of (1.67. On an interval which contains to .50). We obtain a unique solution if we add an initial condition to the diﬀerential equation in (1.50). Denote the balance of your loan at time t by B(t). the function f (t) = y0 + b a ea(t−t0 ) − b a is the unique solution of the initial value problem f (t) = af (t) + b and f (t0 ) = y0 . Here c denotes an arbitrary constant.5% per month. Interest is added at a rate of . denoted by t and measured in months.66. 000. Theorem 1.9 on page 68. We set t = 0 at the time of graduation.44 CHAPTER 1. 000.50) is of this form.00 per month.67 the solution of the initial value problem is B(t) = 15. 000e. Functions of the form f (t) = ceat − b a are solutions of the diﬀerential equation in (1. .005B(t) due to interest being added and decreases at a rate of $200. we have the initial value problem B (t) = . 000 + −200 . and you are repaying the loan at a rate of $ 200. The rest will be routine calculation. BASIC CONCEPTS Theorem 1. The uniqueness assertion is a minor modiﬁcation of Proposition 2. According to Theorem 1.000. Analyze the future of the loan. The important aspects are to translate the given information into a mathematical equation.

Suppose you have an object whose temperature is diﬀerent from the temperature of its surroundings. With time.005 40 25 ≈ 94.04t + 75.) The liver metabolizes the medication at a rate of 4% per hour. We denote by t = 0 the time when we start taking the medication. Analyze the amount of medication in your body at any time. Solution: We use time as independent variable. Your total payments were $18. (You can keep this rate constant with a skin patch. You stir the coﬀee gently so that the temperature in the cup remains homogeneous and almost no energy is added A(0) = 0. The amount of medication will stabilize at this amount with time.04A(t) + 3 and The solution of this problem is A(t) = −75e−. Let A(t) denote the amount of medication in your body.69. after 12 hours there will be about 28.16. 45 After approximately 94 months (7 years and 10 months) you repaid the loan. Think of a cup of coﬀee. the temperature of the object will approach the one of its surroundings. denote it by t and measure it in hours.1. ♦ Example 1.800. so that you paid the principal plus $3.04A(t) due to your liver metabolizing the medication. For example. It will take slightly more than 40 hours before the amount of medication in your body reaches 60 milligrams. MORE EXPONENTIAL GROWTH AND DECAY For example.6 milligram of medication in your body. . The steady state solution of the problem is A(t) = 75. We discuss how this happens. B(T ) = 0 if T = 1 ln . ♦ Example 1. at least under idealized circumstances. measured in milligrams.800 in interest. You are absorbing a medication at a rate of 3 mg per hour. We have the initial value problem A (t) = −.70 (Newton’s Law of Cooling). Then A(t) increases at a rate of 3 mg per hour because you are taking in medication and at the same time A(t) decreases at a rate of .

Determine the function T (t).10 Denote the temperature of the coﬀee by T . protein. The physics of heat transfer changes substantially if you take a solid object.0482. the speciﬁc heat (the amount of energy needed to increase the temperature of one unit of the material by one degree) varies.46 CHAPTER 1. All of this leads to a signiﬁcantly diﬀerent development of the temperature inside a turkey as you roast it for your Thanksgiving dinner. and bone. is proportional to the temperature diﬀerence. we get that T (t) = (95 − 25)eat + 25 = 70eat + 25. To determine a we use that T (5) = 80 = 70e5a + 25.0482t + 25. Furthermore. just after you poured the coﬀee into your cup. we set t0 = 0. If K is the temperature of the surroundings. and with this the rate of change of temperature of the coﬀee. 2. and K = 25. Having a numerical value for a gives us an explicit expression for the temperature T as a function of t: T (t) = 70e−. such that T (t1 ) = 70 degrees Celsius. a in (1. At time t = 0. Find t1 . It is diﬀerent for fat. 1.51) depends on the temperature T . Putting all of this into the formula for the solution of the initial value problem. Let us work out a numerical example.51) T (t) = a(T (t) − K) = aT (t) − aK. Five minutes later the temperature has dropped to 80 degree. while you stir it slightly and patiently. Using these data. and we conclude that a = 1 ln 55 ≈ −. the speciﬁc heat is highly temperature dependent for substances like protein. It is a function of time. y0 = 95. its temperature is 95 degrees Celsius. this is the diﬀerential equation in (1. Note that −b/a = K. BASIC CONCEPTS through the process of stirring.67. then (1.51) says that the temperature of the coﬀee drops at a rate of about . Newton’s law of cooling says that the rate at which the heat is transferred. In addition.50). The temperature in the solid object will not be homogenous. 10 . Setting b = −aK. That means. so that we write T (t).048 degrees per minute for each degree of diﬀerence between the temperature of the coﬀee and the room temperature. Solution: To apply Theorem 1. such as a turkey in the oven. Equa5 70 tion (1. The room temperature is 25 degrees. the outside warms up much faster than the inside.

State the initial value problem for P (t). we ﬁnd that t1 ≈ 9.5 mg per m3 ? Exercise 9. In negotiations which the EPA. Assume that the amount of water carried by the river is the same all year around. MORE EXPONENTIAL GROWTH AND DECAY We like to ﬁnd out the time t1 for which T (t1 ) = 70e−. 1. 3. at time t0 = 0. Let P (t) denote the amount of pollutant (measured in mg per m3 ) in the lake at time t. After a major accident the level has risen to 15 mg per m3 .17. As a remedy. It is assumed that the pollutant is distributed uniformly in the lake at any time. Let t0 = 0 be the time just after the accident and at which the clean-up strategy is implemented. The population of an endangered species of birds on Kauai decreases at a relative rate of 25% per year. 2. the factory owner proposes to reduce the emission of pollution so that the level of pollutant in the river is only 1. Find the function P (t).0482t1 + 25 = 70.5 mg per m3 of a pollutant in the lake. ♦ Exercise 8.16. At which time will the level of pollution be back to 2. A government agency raises the species in captivity and releases birds into the wild at a rate of 80 birds per year.1. What is the long term estimate for the population of this species in the wild? . A chemical factory is located on the banks of a river. Find the function P (t). That means that the temperature drops to 70 degrees approximately 9. Denote the size of the population at time t by P (t). the owner has agreed to an acceptable level of 2. State the initial value problem for P (t). the population is estimated to be 700 birds. 2.17 minutes after pouring it. 47 Solving the equation for t1 . Currently. At which time will the population drop to 500 birds? 4. and the river is the only contributor to the lake. 1. Down stream from the factory is a lake.5 mg per m3 . 3. and the amount of water in the lake is 10 times the amount of water carried by the river per year. where t denotes time and is measured in years.

etc.52) f (x1 ) ≈ f (x0 ) + f (x0 )(x1 − x0 ). BASIC CONCEPTS 1.18. This rather casual statement will become clearer when you look at the individual methods. Here is a sample computation in which you are invited to ﬁll in the details: d2 sin x d e = cos xesin x = (− sin x + cos2 x)esin x . 1. Assume also that f (x0 ) and f (x0 ) are known. is called the second derivative of f . If the function is diﬀerentiable at each point of its domain.48 CHAPTER 1. then f (x) is again a function with the same domain as f (x). This process can be iterated. 1. One uses the formula (1. .18 Numerical Methods In this section we introduce some methods for numerical computations. The derivative of the second derivative is called the third derivative.1 Approximation by Diﬀerentials Suppose x0 is an interior point of the domain of a function f (x) and f (x) is diﬀerentiable at x0 . Leibnitz’s notation for the second derivative of a function f (x) is d2 f /dx2 . that for a diﬀerentiable function we do not make a large error when we use the tangent line to the graph instead of the graph itself. 2 dx dx Exercise 10. It is denoted by f (x). Find the second derivatives of the following functions: (1) f (x) = 3x3 + 5x2 (2) g(x) = sin 5x (3) h(x) = x2 + 2 (4) i(x) = e5x (5) j(x) = tan x (6) k(x) = cos(x2 ) (7) l(x) = ln 2x (8) m(x) = ln(x2 + 3) (9) n(x) = arctan 3x (10) o(x) = sec(x3 ) (11) p(x) = ln2 (x + 4) (12) q(x) = ecos x (13) r(x) = ln(tan x) (14) s(x) = ex 2 −1 (15) t(x) = sin3 x. We may ask whether the function f (x) is differentiable. We use the symbol ‘≈’ to stand for ‘is approximately’. and denoted by f (x).17 The Second and Higher Derivatives Let f (x) be a function which is deﬁned on an open set. The method of approximation by diﬀerentials provides an approximate values f (x1 ) if x1 is near x0 . Its derivative. We will make use of the second derivative. Their common feature is. wherever it exists.

1. Find an approximate value for tan 46◦ . the tangent line to the graph of f (x) at (x0 . then we can approximate the error as long as |x1 − x0 | < d.71. f (x0 )) evaluated at x1 . Thus.52) instead of of the actual value of the function on the left hand side. and f (π/4) = 2.52). and this corresponds to π/4 + π/180. x 3 f (8) = 2.25 on page 12 provides us with an estimate. NUMERICAL METHODS 49 On the right hand side in (1. Formula (1. f (π/4) = 1.6.1.18. diﬀerentiability of the function f (x) means that there exist numbers A and d > 0.52). 12 12 ♦ Your calculator will give you 9 ≈ 2. The inequality in Deﬁnition 1. compare your answer with one found on your calculator. so we are supposed to ﬁnd f (9). if we know A and d.52) we have l(x1 ). We have been causal in (1. Note that f (x) = 1 −2/3 . such that |f (x1 ) − [f (x0 ) + f (x0 )(x1 − x0 )| ≤ A(x1 − x0 )2 whenever |x1 − x0 | < d. 12 Formula (1. Use the function 46 f (x) = tan x. In each case. 180 90 ♦ Your calculator will give you tan 46◦ ≈ 1. √ Example 1. applied with x1 = (π/4 + π/180) and x0 = π/4 says tan 46◦ = tan π π π π + ≈ tan + sec2 4 180 4 4 π π =1+ ≈ 1.72. According to this slightly more demanding deﬁnition.0349. Solution: We carry out the calculation in radial measure.2 (4) arctan 1. says that √ 3 9 = f (9) ≈ 2 + √ 3 1 25 (9 − 8) = ≈ 2. Note that ◦ = 45◦ + 1◦ .0833. Exercise 11.0801. applied with x1 = 9 and x0 = 8. In the sense of the deﬁnition of the tangent line in Section 1. f (x1 ) is close to l(x1 ) for x1 near x0 . and f (8) = 1 . . Use approximation by diﬀerentials to ﬁnd approximate values for √ 5 (1) 34 (2) tan 31◦ (3) ln 1. Then f (x) = sec2 x. √ Solution: We set f (x) = 3 x. Example 1. Find an approximate value for 3 9.0355.52) insofar as we have not estimated (provided an upper bound for) the error which we make using the right hand side of (1.

BASIC CONCEPTS Example 1. The formula in (1. Solution: Set f (x) = sin x.52) tells us that √ 1 10 = f (10) ≈ f (9) + f (9)(10 − 9) = 3 + ≈ 3. 6 180 6 2 180 The calculator will tell that sin 31◦ ≈ . Then f (x) = cos x. Measuring angles in radians we set x0 = π/6 and x1 = π/6 + π/180.73.31 on page 17 we also know that we may use A = 1 and d = π/4 in the diﬀerentiability estimate. Use approximation by diﬀerentials to ﬁnd an approximate √ value of 10 and give an upper bound for the error. Applying the formula in (1.16228. √ √ Solution: We use f (x) = x and x0 = 9. . we ﬁnd sin 31◦ ≈ sin √ π π π π 1 + cos = 1+ 3 ≈ . In each case. ♦ Exercise 12. From the computation in Example 1. ♦ Comparison of the actual and approximate value conﬁrm this. and f (x0 ) = 1/6. The estimate assures us that the error is at most 1 1 √ 3 (x1 − x0 )2 = . The estimate assures us that the error is at most (x1 − x0 )2 = π 180 2 ≤ . Find an approximate value for sin 31◦ and estimate the error. estimate also the maximal error which you may have made by using the method of approximation by diﬀerentials. Example 1.52).000305.16).515115.515038. and √ f (π/6) = 3/2. see (1.50 CHAPTER 1. f (x0 ) = 3. f (π/6) = 1/2. 2( x0 ) 54 The actual error is again substantially less than this. 6 √ The calculator will give you 10 ≈ 3.74. The f (x) = 1/(2 x). Use approximation by diﬀerentials to ﬁnd approximate values for √ (1) cos 28◦ (2) 26 (3) sin 47◦ .16666. For the error estimate we may use A= 1 √ 3 2( x0 ) and any d > 0.

Newton’s method works as follows.53). i. NUMERICAL METHODS 51 1. f (x0 ) x 2 = x1 − f (x1 ) . we obtain the tangent line l(x) to the graph of f at this point. √ Let us calculate A. f (x2 ) and in general (1. and instead of ﬁnding the zero of the function itself. f (xn ) Geometry of Newton’s Method: Let us give a geometric explanation for the formlas. as given in (1.55) xn+1 = xn − f (xn ) x2 − A x2 + A 1 = n = = xn − n f (xn ) 2xn 2xn 2 xn + A xn If we use A = 3 and x0 = 2.18. i. In the following two columns you ﬁnd the values of xn .54) xn+1 = xn − f (xn ) . such that f (x) = 0. The process is then iterated.. f (x0 ) This means that we accept that the tangent line is close to the graph of the function. 10864 We summarize the computation in Table 1. and (1. and that x0 is not far from x. etc.e.e.53) x1 = x0 − f (x0 ) . and l(x1 ) = 0 if x 1 = x0 − f (x0 ) . For polynomials of degree greater or equal to 5 and most other functions there are no general methods for ﬁnding their roots. Suppose that by some means we know that such an x exists.2 Newton’s Method Newton’s method is designed to ﬁnd the zeros of a function. is the point at which l(x) intersects the x-axis. f (x1 ) x 3 = x2 − f (x2 ) .e.1. the positive root of the function f (x) = x2 − A. once . In the ﬁrst column you ﬁnd the subscript n. Suppose we want to ﬁnd a zero of a diﬀerentiable function f (x). l(x) = f (x0 )(x − x0 ) + f (x0 ). Then f (x) = 2x.1. Given any x0 at which f is deﬁned and diﬀerentiable. You have learned how to solve linear and quadratic equations. then we ﬁnd x1 = 1 2 2+ 3 2 7 = .. 4 x2 = 1 2 7 12 + 4 7 = 97 56 and x3 = 18817 . ﬁnding the zeros of functions of degree 1 and 2. More sophisticated methods allow you to ﬁnd the exact solutions of polynomial equations of degree three and four. Then x1 . Then we set (1. we ﬁnd the zero of the tangent line.. i. we want to ﬁnd some x. Speciﬁcally.18.

BASIC CONCEPTS expressed as a fraction of integers.7321428571 3. we guess that x0 = 1 is not . x2 n n xn 0 1 2 3 2 7/4 97/56 18817/10864 xn 2. once in decimal form.0625000000 1. ﬁnd a root of the function f (x) = x sin x − cos x.0003188775 1. Equivalently. Find a solution of the equation x sin x = cos x. Step 1: Let us make sure that there is a root of the function to be found. You see that our value for x3 is rather precise. has a root in the interval (0. Let us consider one more example to illustrate Newton’s method.55) xn+1 = 1 2 xn + A xn to ﬁnd good approximations of square roots. The numbers in the last column show that we are making rapid progress in ﬁnding a good approximation of √ 3. Observe that f (0) = −1 < 0 and f (π/2) = π/2 > 0. then the accuracy of this approximation of 3 will exceed the accuracy of most calculators.0000000000 1. π/2). if you carry the calculation one √ step further and ﬁnd x4 . In the last column you see the square of xn . Considering the values of f at the end points of the interval.73205080757 as an approximate value of 3.0000000085 Table 1. At least x2 is rather close to 3. The intermediate value theorem tells us that f (x). Step 2: Let us come up with a ﬁrst guess for a root. as a continuous funtion.1: The Babylonian Method More than 4000 years ago the Babylonians used the outermost expressions in (1. we may say. Your calculator 3 √ will give you 1. In fact. Let us call this root x.0000000000 4.7320508100 3. We refer to the described procedure as the Babylonian method.52 CHAPTER 1. expressed as rational numbers.7500000000 3.

In (1. Actually f (1) ≈ . the principle problem is as follows. As ﬁrst guess we used x0 = 1.18. then we see that |x − x2 | < .8645. and suppose that |f (x)| ≥ m and |f (x)| ≤ M on I. If we repeat the process.00000256. and in this sense we expect that x1 is much closer to the root x of f (x) than x0 . NUMERICAL METHODS 53 too far away from the root. Suppose that f (x) is a diﬀerentiable function and f (x) = 0 and x0 is given. 2m We illustrate the theorem by applying it to the previous example. which we know to exist by Step 1.1].2. Do the xn tend (converge) to x.’ A full discussion of Newton’s method requires mathematical tools which are not available to us at this time.3 Euler’s Method Euler’s method is designed to ﬁnd.5 and |f (x)| ≤ M = 2.0016 and |x − x3 | < .18. . We explained Newton’s method because we want to illustrate the power of the concept ‘tangent line. You are invited to M verify these estimates using technology. Step 4: Repeat Step 3 and calculate x2 .3. and how fast? For completeness sake. You see that f (x1 ) is much closer to zero than f (x0 ).00874.04. The quoted theorem asserts that |x − x1 | < . Let us set a = . This tells us that x ∈ [.9) > 0.8) < 0 and f (. (1. . You may say that with each iteration you make a fresh start. we give an answer.56) we use that 2m < 1. and with this also on I.1. There is one feature of Newton’s method which helps. by numerical means.5. .2. 1. This illustrates that the xn approach x rapidly.54). Consider an interval I = [x − a. Still. . Observe that f (. x + a]. and in this sense previous round-oﬀ errors don’t carry over. . Step 3: Let us improve the guess: Set x 1 = x0 − f (x0 ) ≈ . so that we know that |x − x0 | < .56) |x − xn+1 | ≤ 1. f (x0 ) Your calculator will tell you that f (x1 ) ≈ . On J.6. so that I ⊂ J = [. we have that |f (x)| ≥ m = 1. Suppose that xn for n ≥ 1 are computed according to (1. We made progress ﬁnding x.9]. an approximate solution of the following kind of problem: . The distance between x and xn will decrease rapidly as n increases. In general. Suppose xn ∈ I and |x − xn | < 2m . x3 .8. many interesting phenomena can occur. A theorem from advanced calculus asserts that M M (x − xn )2 .

so that it is assumed that population growth is reduced by a term which is proportional to y 2 . y(t0 )) = F (t0 . The equation was ﬁrst used in population studies by the Dutch mathematician-biologist Verhulst in 1837. The ﬁrst condition on y in (1. y0 ). y0 )(T − t0 ). In the equation. The tangent line to the graph of y at (t0 . t denotes time and y(t) the size of a population.57) is called an initial value problem. The Logistic Law The diﬀerential equation in our next example is known as the logistic law of population growth. For short. y0 ) is l(t) = y(t0 ) + y (t0 )(t − t0 ). In addition. which tell us that y (t0 ) = F (t0 . To get the second equality in (1. y) denotes a given function in two variables.58) we use the diﬀerential equation and initial condition in (1. which depends on t. The second condition is called an initial condition. approximate equality in (1. the members of the population meet and compete for food and living space. Then you might try the formula (1. Approach in one step: Suppose you want to ﬁnd y(T ) for some T = t0 . The equation reﬁnes the Malthusian law for population growth (see (1.54 CHAPTER 1.45)).58) is just l(T ). y) dt and y(t0 ) = y0 .58) y(T ) ≈ y(t0 ) + y (t0 )(T − t0 ) = y0 + F (t0 . BASIC CONCEPTS Problem 1.58) expresses the philosophy that the graph of a diﬀerentiable function is close to its tangent line. Here F (t. and the unknown is the function. at least as long as T is close to t0 .57) is a ﬁrst order diﬀerential equation.57). The ﬁrst. The probability of this happening is proportional to y 2 . and t0 and y0 are given numbers. The constants a and b are called the vital coeﬃcients of the population. the term ay expresses that population growth is proportional to the size of the population. It speciﬁes the value of the function at one point. so that the middle term in (1. Find a function y(t) which satisﬁes (1. . In the diﬀerential equation.57) y = dy = F (t. It is an equation which involves a function and its derivative. the problem in (1.

Remark 3.75. ♦ Let us be even more speciﬁc and give a numerical example.4.60). So. 3 + 7e−t/10 Substituting t = 1.1.61). so that we can see how well our approximate values match it.58) we ﬁnd (1. the formula in (1. Solution: Setting F (t.59) is given by the equation ay0 y(t) = (1.1. (1. we ﬁnd that y(1) ≈ 300 + 300 3002 − 10 10000 (1 − 0) = 321. According to the exact solution for this initial value problem.59) dy = ay − by 2 dt and y(t0 ) = y0 .61) gives us a less satisfactory result.62) dy 1 1 = y− y2 dt 10 10000 and y(0) = 300. For T = 10 the formula suggests that y(10) ≈ 510. this number is rounded oﬀ. According to the formula in (1. Solution: Substituting a = 1/10. t0 = 0.76. b = 1/10000. Find an approximate value for y(T ). Consider the initial value problem. We expect a close approximation only for T close to t0 . For this larger value of T . Find approximate values for y(1) and y(10). ♦ . our approximate value is close. you see that the diﬀerential equation in this example is a special case of the one in (1. we ﬁnd that y(t) = 3000 . Consider the initial value problem: (1.59). y(10) = 538. We are providing the exact solution.57). 55 where a and b are given constants. though you are invited to verify that it satisﬁes (1. and y0 = 300 into the solution in (1.18.61) 2 y(T ) ≈ y0 + (ay0 − by0 )(T − t0 ). y) = ay − by 2 . NUMERICAL METHODS Example 1. Example 1. we ﬁnd the exact value y(1) = 321. According to the exact solution in (1. An exact solution of the initial value problem in (1.60) by0 + (a − by0 )e−a(t−t0 ) This is not the time to derive this exact solution.

. 11 We do not want to make this term precise. . t2 = 10. Experience will guide you in the choice of the step length. and we repeat the process. .75 is of this kind. BASIC CONCEPTS Multi-step approach: We like to ﬁnd a remedy for the problem which we discovered in Example 1. For notational convenience we assume that T > t0 . We use t1 and y(t1 ) to calculate an approximate value for y(t2 ). y) the accuracy of the value which we get for y(T ) will increase with n. Then we pretend that y(t1 ) is exact. Consider again Problem 1 on page 54. Starting out with t0 and y(t0 ). For reasonably nice11 expressions F (t. t20 = 100. we calculate [ti+1 . t1 = 5. Solution: As points in the multi-step process we use t0 = 0. y(ti )] according to the formula in (1. . t3 = 15. y) in Example 1. but the F (t. . 1. t3 = 15. in an actual numerical computation we also make round-oﬀ errors in each step. and the more steps we make the worse the result might get. . Example 1. y(ti ))(ti+1 − ti )] We continue this process until we reach T . t4 = 20.77. Arrange them in a table. On the other hand. we use the one step method from above to get an approximate value for y(t1 ).56 CHAPTER 1. . We want to get an approximate value for y(T ). . y(ti+1 )] = [ti+1 . Apply the multi-step method to ﬁnd approximate values for y(t) at t1 = 5. Iteratively. Consider the initial value problem (1. y(ti ) + F (ti .76 for T further away from t0 . y(ti+1 )] from [ti .63) [ti+1 . the number of steps we make (at least if all steps are of the same length). Pick several ti between t0 and T : t0 < t1 < t2 < · · · < tn = T. Graph the points found in the previous step together with the actual solution of the initial value problem. t20 = 100.58): (1. t2 = 10. 2. .64) dy 1 1 = y− y2 dt 10 10000 and y(0) = 10. Again we pretend that y(t2 ) is exact and use t2 and y(t2 ) to calculate y(t3 ).

74 80 956.2: Solution of Problem 1. y(t3 ). The points will follow the curve much more closely if you use t1 = 1.07 100 997.41 Table 1. t100 = 100 in your calculation. y) dt and y(t0 ) = y0 . . . ♦ Steady States: Let us consider some very speciﬁc solutions of our initial value problem in (1. t) = 0 for all t. y(t2 ).2.28 72.96 40 219.70 t y(t) & t y(t) 35 153.77 For each ti (0 ≤ i ≤ 19) we use the formula y(ti+1 ) = y(ti ) + 5 2 y(ti ) yi (ti ) − 10 10000 and calculate y(t1 ). Then the constant function y(t) = y0 is a solution of the problem.95 22. .07 90 988.18. This is due to the speciﬁc problem. 12 It is incidental that the points eventually get closer to the graph again.27 95 994. and will not occur in general. . . You also see the points from Table 1. The points suggest a graph which does follow the actual one reasonably closely.15 you see the graph of the exact solution of the initial value problem.09 45 304. .31 33.05 65 768. In Figure 1.57): y = dy = F (t. . . We summarize the calculation in Table 1.22 49. . NUMERICAL METHODS 57 t 0 5 10 15 20 25 y(t) & 10. You may try a shorter step length.07 85 977.00 14. t3 = 3.73 75 918.55 55 531. Suppose F (y0 .1. Such a solution is called a steady state solution.62 50 410.87 70 857. But you see that we are deﬁnitely making errors. t2 = 2. y(t20 ) consecutively. and they get worse as t increases12 .02 30 106.2.55 60 656.

and you are repaying the loan at a rate of $ 200. Example 1. Your payments cover exactly the occuring interest charges.000.15: Illustration of Euler’s Method Example 1.58 CHAPTER 1. For the logistic law (see Equation (1. I. So the constant function f (t) = −b/a is the only steady state of this diﬀerential equation.78. Solution: Apparently.59)) dy = F (y.16) (1.68 in Section 1. 000. Find the steady states of the diﬀerential equation (see (1. if your loan balance is $40. you see that the steady state in that example is B(t) = 40.50) in Section 1..16.00. the bank charges you interest at a rate of . ♦ In review of Example 1.65) f (t) = af (t) + b.79. BASIC CONCEPTS 1000 800 600 400 200 20 40 60 80 100 Figure 1. t) = ay − by 2 = y(a − by) dt .00 per month. f (t) = 0 if and only if f (t) = −b/a. then the principal balance of your account will stay unchanged.e.5% per month.

then y(t) will tend to −∞ as time increases. the last term in the diﬀerential equation accounts for the competition for space and food.77. To make the problem explicit. There are two steady state solutions: yu (t) = 0 and ys (t) = a/b. Finally. 3. y(t) = 0 is an unstable steady state. 000 is a stable steady state solution. It is also referred to as the carrying capacity. If the initial value y0 of the population is positive. Use Euler’s method to ﬁnd the population size over the next 30 years.e. NUMERICAL METHODS 59 we ﬁnd that F (y.18. you should think of a population of deer in a protected wildlife preserve. Consider the initial value problem (1. Proceed in 1 year steps. The common language meaning of these expressions suﬃces for the purpose of our discussion. 13 .66) 1 1 2 y (t) = −50 + y(t) − y (t) 2 2000 and y0 = y(0) = 200. Call the smaller one of them Yu and the larger one Ys . ♦ Exercise 13. ﬁnd for which values of y you have that y = 0? You will ﬁnd two values. 5. If the initial value y0 is negative. and which one is unstable. The deer are hunted at a rate of 50 animals per year. I. Repeat the ﬁrst two steps of the problem if hunting is stopped. In this sense. If y0 = 0. The population has a growth rate of 50% per year. ys (t) = a/b = 1. Experiment with diﬀerent initial values to see which of the steady states is stable. It tells you which size population of the given kind the speciﬁc habitat will support. 000. Reproduction takes place at a constant rate all year round. Repeat the ﬁrst two steps of the problem if the initial population is 100 animals.1. then y(t) will not tend to the steady state y(t) = 0. Guess at which level the population stabilizes. 000 in Example 1. There are no predators. Let us interpret these steady state solutions for the speciﬁc numerical values of a = 1/10 and b = 1/10. Tabulate and plot your results. In this sense. and the mathematical deﬁnition of ‘tends to’ and ‘stabilizes at’ only make these terms precise. Find the steady states of the original equation in which hunting takes place. then the population size will tend to and stabilize13 at y(t) = a/b = 1. 1.. t) = 0 if and only if y = 0 or y = a/b. 4. 2.

16. In Figure 1. There is one ellipse for each a > 0. The orthogonal trajectory provides you with a path which is always in the direction of the most rapid change of the ﬁeld. Suppose we are given a family F (x.67) Ca : F (x. a) = 0 of curves. .16: Orthogonal Trajectory to Level Curves Let us explain where this type of situation occurs. A new lava ﬂow which originates at some point in the crater will follow this path.) We call such a curve Db an orthogonal trajectory to the family of the Ca ’s. y. 6 4 2 0 -2 -4 -6 -6 -4 -2 0 2 4 6 Figure 1. Suppose the curves Ca are the level curves in a crater. (We say that Db and Ca intersect perpendicularly in a point (x1 . Suppose that each ellipse represents an equipotential line of an electromagnetic ﬁeld. y1 ). A charged particle will move along an orthogonal trajectory. BASIC CONCEPTS Orthogonal Trajectories Let us explore a diﬀerent kind of application.60 CHAPTER 1. We like to ﬁnd curves Db which intersect the curves Ca perpendicularly.16 you see a family of ellipses (1. Here a represents the elevation. a) = x2 + 3y 2 − a = 0. so that the elevation is constant along each curve Ca . if the tangent lines to the curves at this point intersect perpendicularly. y. You also see one orthogonal trajectory in Figure 1. The orthogonal trajectory gives a path of steepest descent.

If 3y a curve Db intersects Ca in (x1 .. a) = x2 + 3y 2 − a = 0. then we end up with the initial value problem dy 3y = dx x and y(x0 ) = y0 . i. Find orthogonal trajectories for the family of ellipses (1.e. Thus.80. was pioneered by Alexander von Humbold (1769–1859). to ﬁnd an orthogx onal trajectory to the family of the Ca ’s we need to ﬁnd functions which satisfy this diﬀerential equation. A heat seeking bug will. and this is the curve x = 0. Let us ﬁnd approximate values for the initial value problem dy 3y = dx x and y(1) = 1 . . The orthogonal trajectory shown in Figure 1. It is constant along each curve Ca .68) Ca : F (x. There is one orthogonal trajectory which does not have this form. They are functions of the form y(x) = bx3 . NUMERICAL METHODS 61 Suppose a stands for temperature.18. so that along each ellipse the temperature is constant. Example 1. y0 ). we get 2x + 6y dy = 0 or dx dy −x = .1. This is exactly the kind of problem which we solved with Euler’s method. move in the direction in which the temperature increases most rapidly. y. They will move along an orthogonal trajectory. at any time. and with this the method in all of these applications. bacteria will follow a path in the direction in which the concentration increases most rapidly. Suppose a stands for the concentration of a nutrient in a solution. In this case the curves are called isothermal lines14 . Let us apply Euler’s method to solve the problem. In this particular example it is not diﬃcult to ﬁnd solutions for the diﬀerential equation. along an orthogonal trajectory to the isothermal lines. On their search for food. If we also require that the orthogonal trajectory goes through a speciﬁc point (x0 . y1 ) is −x11 .16 has the equation y = x3 /25. Solution: Diﬀerentiating the equation for the ellipses. 25 14 The idea of isothermal lines. dx 3y The slope of the tangent line to a curve Ca at a point (x1 . then we need that the slope of the tangent line to Db at this point is 3y11 . y1 ) perpendicularly.

Graph several of the curves Ca . Use the points x0 = 3. Find the diﬀerential equation for an orthogonal trajectory. 4).4. . 3. . y0 ) = (1. x20 = 7. x1 = 3. . yn ) according to the formula yn = yn−1 + . x2 = 3. yn ) in your ﬁgure. . 1/25) and calculate (xn . 2. Check that the graph of y(x) = bx−5 is an orthogonal trajectory to the family of hyperbolas for every b. .2 3yn−1 xn−1 for n = 1.2. . . . BASIC CONCEPTS Use x0 = 1. 4). x1 = 1.16. . and add this graph to your ﬁgure. . There is one hyperbola for each value of a. 2. 4. . only for a = 0 the hyperbola degenerates into two intersecting lines. Plot the points (xn . . . Consider the family of hyperbolas: Ca : x2 − 5y 2 + a = 0. Determine b. Use Euler’s method to ﬁnd points on the orthogonal trajectory through the point (3. 20. ♦ Exercise 14. 1. We set (x0 . x20 = 5.4. so that the orthogonal trajectory passes through the point (3. we graphed the points in Figure 1. x2 = 1.2. Without recording the results of this calculation.62 CHAPTER 1.

π/2) ∪ (π/2. arcsec x ∈ (0. 1). ∞). x = 0 x ∈ (−∞. 1). ∞) x ∈ (−∞. 0) ∪ (0. π) x < −1 or x > 1. arccos x ∈ (0. ∞).19 Table of Important Derivatives f (x) xq ex ln |x| sin x cos x tan x cot x sec x csc x arctan x arcsin x arccos x arccot x arcsec x arccsc x f (x) qxq−1 ex 1/x cos x − sin x sec2 x − csc2 x sec x tan x 1 1+x2 √ 1 1−x2 √ −1 1−x2 −1 1+x2 √1 |x| x2 −1 −1 √ |x| x2 −1 Assumptions q a natural number. arccot x ∈ (0. ∞) x ∈ (−1.1. π/2) Table 1. arcsec x ∈ (−π/2. π) x < −1 or x > 1.3: Some Derivatives . TABLE OF IMPORTANT DERIVATIVES 63 1. π/2) x ∈ (−1. ∞) x ∈ (−∞. or x > 0 x ∈ (−∞.19. ∞) all x for which tan x is deﬁned all x for which cot x is deﬁned all x for which sec x is deﬁned − csc x cot x all x for which csc x is deﬁned x ∈ (−∞. π) x ∈ (−∞. arcsin x ∈ (−π/2.

64 CHAPTER 1. BASIC CONCEPTS .

b]. π] it is 0. With this information it is possible to sketch graphs capturing their essential features. Let f (x) be a function which is deﬁned on the interval [a. All concepts related to the behaviour of a function near a point. π/2] is 2/π and over [0. We will discuss some uniqueness properties of solutions of diﬀerential equations.1 Cauchy’s Mean Value Theorem It is useful to make the following Deﬁnition 2. He made major contributions to make calculus a rigorous mathematical theory. The average rate of change of f (x) = sin x over [0. We apply these ideas to the study of extrema of functions. b]. Then we call f (b) − f (a) b−a the average rate of change of f over the interval [a. Augustin-Louis Cauchy (1789–1857) was one of the great mathematicians of the 19-th century. The fundamental result which allows us to do this is referred to as Cauchy’s mean value theorem. 2] is 2. Then we discuss geometric properties of graphs. For example. 65 . their monotonicity and concavity. In this chapter we will use local information about a function to draw global conclusions. the average rate of change of f (x) = x2 over the interval [0. 2.1.Chapter 2 Global Theory So far we studied the local behaviour of a function.

66 CHAPTER 2.e. If f (a) = f (b). Theorem 2.1. such that f (c) = f (b) − f (a) = 0. 1] is −1. where a < b. b] and diﬀerentiable on (a.3 (Rolle’s Theorem). then a = b and there exists some c ∈ (a. where a < b. We prove this statement using Cauchy’s theorem. The following special case of the theorem. For example. If f (x) = 0 for all interior points x of I. as they occur in the two theorems. except that Cauchy’s theorem and Rolle’s theorem are equivalent (each is an easy consequence of the other one). b−a In words. b) such that f (c) = f (b) − f (a) . the average rate of change of f (x) = x2 over the interval [−2. called Rolle’s theorem (named after Michel Rolle (1652–1719)). then there exists a number c between a and b (i. and f (−1/2) = −1. Corollary 2. Let f be a real valued function which is deﬁned and continuous on the interval [a. We are interested in more general consequences. We are also not interested in ﬁnding the points c. Proof. b). is of particular interest. Let f be a real valued function which is deﬁned and continuous on an interval I. 1 We discussed this property of the real numbers in Section 1.. We are not going to say anything about the proof of these two theorems. If f (a) = f (b). b−a But this contradicts the assumption that f (c) = 0 for all c ∈ I. there exists a number d such that f (x) = d for all x ∈ I.2 (Chauchy’s Mean Value Theorem). and that the proof of both of them depends heavily on the completeness1 of the real numbers. b ∈ I. the theorem asserts that the average rate of change over an interval is equal to the rate of change at some point in the interval.4. b). then f is constant on this interval. and the corollary is proved. b] and differentiable on (a. GLOBAL THEORY Theorem 2. a < c < b) such that f (c) = 0. In other words. Then there exists a number c ∈ (a. A diﬀerent formulation of the claim is that f (a) = f (b) for all a. . b). Let f be a real valued function which is deﬁned and continuous on the interval [a.

7. We call a function F (x) with domain I an antiderivative of f if F (x) = f (x) for all x ∈ I.4 implies Proposition 2. 67 Corollary 2. UNIQUE SOLUTIONS OF DIFFERENTIAL EQUATIONS We are going to use the following corollary frequently. the integration constant is determined by an initial condition. then it suﬃces to ﬁnd one antiderivative H(x). The constant c is referred to as integration constant. Then h and g diﬀer by a constant. then h and g diﬀer by a constant. any antiderivative F (x) of the function f (x) = 2x on the real line (−∞. we can reformulate Corollary 2. Suppose we like to solve the initial value problem f (x) = cos x and f (0) = 1. . π/2) is of the form F (x) = tan x + c.5. deﬁned on an interval. Deﬁnition 2.2.8. By diﬀerentiating H(x) you can check whether you guessed right. Proof.2 Unique Solutions of Diﬀerential Equations Corollary 2. Apply the previous corollary to f (x) = h(x) − g(x). More generally. then F (x) is constant on I. 2. If h (x) = g (x) for all x ∈ I.6. Any antiderivative F (x) of the function f (x) = sec2 x on the interval (−π/2.e. Let h and g be functions which are deﬁned and continuous on an interval I. Suppose the function f (x) is deﬁned on the interval I. Any antiderivative F (x) is of the form H(x) + c where c is a constant. If the function F (x) is deﬁned on an interval I and F (x) = 0 for all x ∈ I. Corollary 2.. For example. Suppose h and g are antiderivatives of a function f . In other words.5. Typically.2. ∞) is of the form F (x) = x2 + c where c is a constant. if you like to ﬁnd all antiderivatives F (x) of a function f (x) on an interval. Using this notion. on intervals the only solutions of the diﬀerential equation F (x) = 0 are the constant functions. i. there exists a number d such that h(x) = g(x) + d for all x ∈ I. For the time being you depend on being able to guess such a function H(x).

Proposition 2. Its derivative is h (x) = f (x)e−ax − af (x)e−ax = af (x)e−ax − af (x)e−ax = 0. In fact f (x) = Cea(x−x0 ) . Let f (x) be any function which satisﬁes the diﬀerential equation on some interval. Proof. The solution of the initial value problem is f (x) = sin x + 1. h is diﬀerentiable.9. Then we see that f (0) = c = 1. .10. and will eventually prove. Thus c = Ce−ax0 and f (x) = ceax = Ce−ax0 eax = Cea(x−x0 ) . We want to see that these are the solutions. Proof. This follows from the above because (sin x + c) = cos x. Substituting the initial condition we obtain C = f (x0 ) = ceax0 . As a product of diﬀerentiable functions.68 CHAPTER 2. Of particular importance to our discussion of expenential growth and decay is Proposition 2. Consider the function h(x) = f (x)e−ax . Calling the constant c we ﬁnd that f (x) = ceax . Every solution f (x) of the diﬀerential equation f (x) = af (x) on an interval is of the form f (x) = ceax for some constant c. The initial value problem f (x) = af (x) and f (x0 ) = C has a unique solution on an interval containing x0 . We asserted in (1. Corollary 2. This means that all solutions of the diﬀerential equation f (x) = af (x) are of the form f (x) = ceax . that all functions of the form f (x) = ceax satisfy the diﬀerential equation.19). GLOBAL THEORY Our ﬁrst conclusion is that f (x) = sin x + c. Next we substitute x = 0 in the equation. By the previous proposition we know that the solution is of the form f (x) = ceax for some c. where c is a constant.4 tells us that h(x) is a constant function.

2.3. THE FIRST DERIVATIVE AND MONOTONICITY

69

Remark 4. The uniqueness of the solution of an initial value problem as in the previous proposition is not only of theoretical importance. Imagine that you study the growth rate of a strain of bacteria, as we did in Example 1.64 on page 41. Before you can publish your result, it must be certain that your experiment can be reproduced at a diﬀerent time in a diﬀerent location. That is a requirement which any experiment in science must satisfy. If there is more than one mathematical solution to your problem, then you have to expect that the experiment can go either way, and this would invalidate your experiment.

2.3

The First Derivative and Monotonicity

One of the interesting properties of a function is whether it is increasing or decreasing. We might want to ﬁnd out whether the part of a population which is infected with a disease is increasing or decreasing. We might want to know how the level of pollution in a body of water is changing. The ﬁrst derivative of a function gives us information of this kind.

2.3.1

Monotonicity on Intervals

Recall that a function f is called increasing if f (b) > f (a) whenever b > a. It is called decreasing if f (b) < f (a) whenever b > a. A function is called monotonic if it is either increasing or decreasing. Theorem 2.11. Suppose that the function f is deﬁned and continuous on the interval I. 1. If f (x) > 0 for all x ∈ I, then f is increasing on I. 2. If f (x) < 0 for all x ∈ I, then f is decreasing on I. 3. More generally, the conclusions in (1) and (2) still hold if in each ﬁnite interval J ⊂ I there are only ﬁnitely many points at which the assumption on f (x) is not satisﬁed.2 Proof. We show (1). Let a and b be points in I, and suppose that a < b. Cauchy’s theorem says that there exists a point c, a < c < b, such that f (c) =

2

f (b) − f (a) . b−a

It is permissable that f is not diﬀerentiable at a few points in J, or that f (x) = 0. It is not possible that f (x) < 0 at some point in the interval, and f (x) is increasing on the interval.

70

CHAPTER 2. GLOBAL THEORY

We have that f (c) > 0 and b − a > 0, and it follows that f (b) − f (a) > 0. This means that f (b) > f (a). The proof of the second claim is similar. We leave it and the generalization of both statements to the reader.

1 For example, log2 x = ln x = x ln 2 > 0 for all x ∈ (0, ∞). In the ln 2 computation we used (1.19), Theorem 1.36, and that ln 2 > 0. It follows from Theorem 2.11 that log2 x is increasing on x ∈ (0, ∞). You see part of the graph of the function in Figure 1.8. The exponential function expa x = ax is increasing on (−∞, ∞) if a > 1 and decreasing if 0 < a < 1. To see this, observe that ax = ex ln a and d x x x dx a = (ln a)a . Furthermore, a > 0 and ln a > 0 if a > 1 and ln a < 0 if 0 < a < 1. Now Theorem 2.11 implies our assertion. You may also want to have a look at the graph of the exponential function with base 2 in Figure 1.7. The function f (x) = 1/x is deﬁned and diﬀerentiable on the set of all nonzero real numbers, and its derivative is f (x) = −1/x2 . In particular f (x) < 0 for all nonzero real numbers. According to Theorem 2.11, f (x) is decreasing on the interval (−∞, 0), and that f (x) is decreasing on the interval (0, ∞). The function is not decreasing on the union of the two intervals. The example illustrates that it is crucial in Theorem 2.11 that we deal with functions which are deﬁned and diﬀerentiable on an interval. The function f (x) = tan x, deﬁned on (−π/2, π/2), has as its derivative f (x) = sec2 x, and the derivative is positive. Consequently, f (x) is increasing on (−π/2, π/2). Its inverse g(x) = arctan x, deﬁned on (−∞, ∞), 1 has as its derivative g (x) = 1+x2 , which is positive on (−∞, ∞), so that g(x) = arctan x is increasing on (−∞, ∞). As a general priciple, one may show that the inverse of an increasing function is increasing.

Example 2.12. For a three dimensional solid we set E = A/V , where A denotes the surface area and V the volume. For example, for a ball E(r) = (4πr 2 )/( 4 πr 3 ) = 3/r, where r denotes the radius. Then E (r) < 0. 3 The same principle holds for other shapes, E decreases as we enlarge the solid without changing its shape. What does this have to do with the size of animals? Warm blooded animals living in cold climates need to preserve their body temperature. The total amount of heat stored in the body is proportional to the volume, while the heat loss is proportional to the surface area. The ratio of volume to surface area increases as the animal gets larger, so that for warm blooded animals it is of advantage to be large if they live in cold climates. In hot climates they need to give oﬀ heat, so that it is of advantage to be

2.3. THE FIRST DERIVATIVE AND MONOTONICITY

71

small. Natural selection (Darwinism) should favor the larger specimens of a warm blooded species in a cold climate and smaller ones in a hot climate. You can observe this phenomenon in real life. For cold blooded animals the converse holds. They absorbe heat so that they body reaches a temperature at which they can be active. In cold climates it helps to be small, because then the surface area is relatively large, compared to the volume. In hot climates cold blooded animals can aﬀord to be large, as it is easy to reach and maintain the temperature at which they can be active. The argument is again consistent with real life. Needless to say, there are other mechanisms to increase the surface area of a body than decreasing its size, and the maintenance of the body temperature is only one factor which inﬂuences the size of specimens of a species. Larger animals need more food, are stronger, cannot hide so well, and are often less agile. All of these factors need to be taken into account to determine the optimal size of an animal. ♦ So far we have only discussed examples where we used (1) and (2) of Theorem 2.11. Let us show how to use the conclusion in (3). To apply it we need to determine intervals on which a function does not change signs. We recall a procedure which works well for continuous functions. Deﬁnition 2.13. Suppose f (x) is a function. We call a point x0 on the real line exceptional if either f (x0 ) = 0 or f (x0 ) is not deﬁned. The following result is an immediate consequence of the Intermediate Value Theorem, see Theorem 1.16 on page 8. Expressed casually it says that a continuous function can change signs only at exceptional points. Proposition 2.14. Suppose f (x) is continuous and f (x) has no exceptional points in the interval (x0 , x1 ). Then f (x) > 0 for all points in the interval (x0 , x1 ), or f (x) < 0 for all points in the interval (x0 , x1 ). In particular, if f (x) is positive at one point in the interval, then it is positive at all points in the interval. If f (x) is negative at one point in the interval, then it is negative at all points in the interval. Example 2.15. For example, consider the function f (x) = x2 (x2 − 4) x2 (x − 2)(x + 2) = . 2 + 2x − 15 x (x − 3)(x + 5)

The zeros of the numerator, and with this the zeros of f (x), are x = 0, x = 2, and x = −2. The zeros of the denominator, i.e., the points where f (x) is not deﬁned, are x = 3 and x = −5.

Find intervals of monotonicity for the function f (x) = 3x2 + 5x − 4. 30 5 25 20 15 10 5 -3 -2 -1 -5 1 2 3 4 -2 -5 -10 -15 -20 2 4 Figure 2. GLOBAL THEORY According to the proposition.1: A quadratic polynomial. ∞). x (1) f (x) = x3 − x2 − 5x − 3 (2) g(x) = 3 . and positive on (3. ♦ Exercise 15. negative on (2. −5/6]. 3) and (3. x + 5x2 − 4x − 20 We are ready to discuss the monotonicity of functions whose derivative vanishes at some points. ∞). Theorem 2. p(x) = x3 − 3x2 − 9x + 3 Solution: We graphed the function in Figure 2.16. You see that the sign changes at some. ∞). but not all. ♦ .11 (3) says that f is increasing on the interval [−5/6. (2. we see f (x) is positive on the interval (−∞. ∞). 0). f is decreasing on the interval (−∞. (−2. Find intervals on which the following functions do not change signs. 2). f (x) = 3x2 + 5x − 4 Figure 2. positive on (−2. Its derivative is f (x) = 6x + 5. 0) and on (0. In particular. By a similar argument. negative on (−5. f (x) > 0 if x ∈ (−5/6. Counting signs of the factors in the expression for f (x).72 CHAPTER 2. exceptional numbers. −5). Decide whether the functions are positive or negative on these intervals. Example 2. except at x = −5/6. −2). −5). 3).2: A cubic polynomial. (−5. So f (x) > 0 for all points x ∈ [−5/6. (0. −2). the sign of f (x) remains unchanged on each of the intervals (−∞. ∞). 2).1.

THE FIRST DERIVATIVE AND MONOTONICITY 73 Example 2. The function is decreasing on the intervals [−1. 1) and (1. −1]. 1) and (1. 3]. x = −1 and x = 3. f (x) = 2 cos 2x + 2 cos x = 2[2 cos2 x + cos x − 1] = 4(cos x + 1) cos x − 1 2 . We conclude that f (x) is increasing on the (−∞. ∞). −1). (x − 1)2 We see that the exceptional points for f (x) are x = 1. . The theorem implies that p(x) is decreasing on the interval [−1. ∞). ♦ Example 2. Find intervals of monotonicity for the degree three polynomial (for a graph see Figure 2. (1. (−1. 1). Counting the signs of the factors of f (x). Its derivative is p (x) = 3x2 − 6x − 9 = 3(x2 − 2x − 3) = 3(x − 3)(x + 1). Restrict your discussion to the interval [0. Observe that f (x) is deﬁned and diﬀerentiable on the entire real line with the only exception of x = 1. −1) and on (3. 2π].17. 3). We conclude that p(x) is increasing on the interval [3. The derivative is negative on the interval (−1. ∞).18. Find intervals of monotonicity for the rational function x2 + 3x .19. Solution: We diﬀerentiate the function and rewrite the expression for the derivative so that it is easier to ﬁnd its exceptional points. ∞). 3). Counting the signs of the factors we see that p (x) is positive on (−∞. 3). We conclude that f (x) does not change signs on the intervals (−∞. and (3. x−1 Solution: The simpliﬁed expression for the derivative of f is f (x) = f (x) = (x + 1)(x − 3) . −1] and [3. ♦ Example 2. 3]. −1) and (3. and f (x) < 0 on the intervals (−1.2. we conclude that f (x) > 0 on the intervals (−∞.3. ∞) and that it is increasing on the interval (−∞.2) p(x) = x3 − 3x2 − 9x + 3 Solution: The function is deﬁned and diﬀerentiable on the real line. Find intervals on which the function f (x) = sin 2x + 2 sin x is monotonic.

2π] on which f does not change sign. 5π/3) and (5π/3. You may conﬁrm the calculation by having a look at Figure 2. We ﬁnd exceptional points where cos x = −1 (i. 2π]. (g) and (h). Checking the sign of f (at one point) in each of the intervals. We conclude that f is increasing on the interval [0. there f (x) is decreasing. wherever f (x) is positive. π/3) and x ∈ (5π/3. x = π and x = 5π ). (π. 2π]. x = π) and where cos x = 1 (i.. This provides us with the intervals [0. (π/3.e. we ﬁnd that f (x) > 0 for x ∈ [0. 5π/3]. π) and (π.. In the last two problems. Wherever f (x) is negative. restrict yourself . and f (x) < 0 for x ∈ (π/3. Find intervals on which the function f increases and intervals on which f decreases. ♦ Exercise 16. There you see the graph of the function (solid line) and the graph of its derivative (dashed line). As you see. and that f (x) = 0 at the end points of this interval.74 CHAPTER 2. π). there f (x) is increasing. Observe that f is diﬀerentiable on [0. GLOBAL THEORY To see the second equality we used that cos 2x = 2 cos2 x − 1. The function is decreasing on the interval [π/3. 2 3 3 4 3 2 1 1 -1 -2 2 3 4 5 6 Figure 2. 5π/3).3.3: A function and its derivative. π/3).e. and in this interval there are three points at which f (x) is not positive. π/3] and [5π/3. Then we solved the quadratic equation in terms of cos x. 2π].

c) and f (x) > f (c) for all x ∈ (c.3. For the local property we may chose the. Expressed informally. We address both questions in this subsection. and how this concept is related to the one of being increasing on an interval.2. for some d > 0. at least for a while. c + d). We are making a statement about the behavior of the function on some open interval which contains c. possibly rather small.2 Monotonicity at a Point It is quite natural to ask what it means that a function is increasing at a point.3. f (x) < f (c) for all x ∈ (c − d. Deﬁnition 2. Being increasing or decreasing at a point c is a local property. Suppose f is a function which is deﬁned on an open interval I. Then f is increasing (decreasing) on I if and only it it is increasing (decreasing) at each point in I. . Suppose f is a function and c is an interior point of its domain. Theorem 2.21. THE FIRST DERIVATIVE AND MONOTONICITY to the interval [0. This theorem establishes the relation between the local and the global property. We say that f is increasing at c if. but the ‘if’ part uses some deeper facts about ﬁnite closed intervals. The global property has to hold for any two points in the given interval.20. 2π]. interval. (a) f (x) = 3x2 + 5x + 7 (b) f (x) = x3 − 3x2 + 6 (c) f (x) = (x + 3)/(x − 7) (d) f (x) = x + 1/x (e) f (x) = x3 (1 + x) (f) f (x) = x/(1 + x2 ) (g) f (x) = cos 2x + 2 cos x √ (h) f (x) = sin2 x − 3 sin x 75 2. For the local property we compare f (x) to f (c) where c is ﬁxed and x is any point in an open interval around c which we may chose. Being increasing on an interval is a global property. to the left of c the function is smaller and to the right of c it is larger than at c. Our second result gives us a valuable tool to detect monotonicity of functions at a point. The ‘only if’ part is not diﬃcult to show. For the global property the interval is given to us. We say that f is decreasing at c if this statement holds with the inequalities reversed.

22. A function can also be increasing at a point x. These properties can be described globally over intervals and locally at points. Secant lines will either be required to lie above or below the graph.. 10 8 6 4 -2 -1 4 2 1 -2 -4 2 3 4 2 -6 -2 -1 1 2 -8 Figure 2. Remark 5. A function can be diﬀerentiable and increasing at a point x. but the motivated reader is encouraged to explore them. If f is diﬀerentiable at c and f (c) > 0.76 CHAPTER 2. then f is decreasing at c. Let f be a function and c an interior point of its domain. Graph the function f (x) = 2x + |x| to convince yourself of this fact. but there is not open interval which contains x such that the function is increasing on this interval. then f is increasing at c.4: Concave Up Figure 2.22 do not hold. You may use the graphs in Figures 2. GLOBAL THEORY Proposition 2. 2. The ideas of of a function being increasing or decreasing at a point may be generalized to cover domains of functions which are halfclosed or closed intervals. A function does not have to be diﬀerentiable to be increasing. even if the assumptions of Proposition 2. and where we like to make a statement about the behavior of a function at an endpoint.4 The Second Derivative and Concavity We like to capture the property of a graph being bent upwards or downwards.5 as illustrations of the discussion. and the rates of change will be either increasing of decreasing. f (x) = x3 is increasing at x = 0.e. If f (c) < 0. but if f (0) = 0. Remark 6. We have no speciﬁc needs for such statements. i.5: Concave Down .4 and 2.

We say that f (x) is twice diﬀerentiable on I. f (b)). Let f be a function which is deﬁned on an interval I. The inequality expresses that between the points a and b the secant line lies below the graph. b in I and c ∈ (a. the line segment joining the two points. We say that f is concave down on I if f (c) > l(c) for all a. f (b)) be two distinct points on its graph. b]. then f (x) is concave up on I. . then f (x) is concave up on I. More generally. resp. Here l(x) is the secant line through (a. Theorem 2. if f (x) extends to a function F (x) which is deﬁned on an open interval J which contains I. f (x) < 0.24 (2) says that q is concave 3 Strictly speaking. b in I and c ∈ (a. the function shown in Figure 2. and F (x) is twice diﬀerentiable on J. b).e. For example. Theorem 2. 1. f (a)) and (b. If f (x) > 0 for all x in I. THE SECOND DERIVATIVE AND CONCAVITY 77 2. Let f be a function which is deﬁned on an interval I. Suppose that f (x) is twice diﬀerentiable3 on I. The inequality expresses that between the points a and b the secant line lies above the graph.4. If f (x) < 0 for all x in I. Deﬁnition 2. b). Suppose that f (x) is diﬀerentiable on I. i. If f (x) is increasing on I. We will not provide a proof of the theorem. 2..11. 3.1 Concavity on Intervals Let f (x) be a function and let (a. The line through these two points is l(x) = f (a) + f (b) − f (a) (x − a). then f (x) is concave down on I. b−a If we restrict l(x) to x ∈ [a. then f (x) is concave down on I. then we get the secant line through the two point. We state a theorem which provides you with assumptions under which a function is concave up or down. f (a)) and (b.2.24. the conclusions in (2) still hold if in each ﬁnite interval J ⊂ I there are only ﬁnitely many points at which the assumption f (x) > 0. We say that f is concave up on I if f (c) < l(c) for all a. If f (x) is decreasing on I. is not satisﬁed.4. we proceed as in Section 1. More generally. The second derivative will be unique at all points in I if I is not empty and does not consist of exactly one point. Its second derivative is q (x) = 2 > 0.23.4 is q(x) = x2 − 2x + 3. so far we can consider being ‘twice diﬀerentiable’ only for functions which are deﬁned on open intervals.

GLOBAL THEORY up on (−∞. x = 1. Thus. you may note that exp (x) = exp(x) > 0 and apply Theorem 2. If f and g are inverses of each other. Theorem 2. if f is concave up. So.24 (1) to derive the desrired conclusion. In particular. You may verify that tan x = 2 sec2 x tan x.24 (2). you may use that ln (x) = −1/x2 < 0 on (0. and its second derivative is g (x) = −2 < 0. Theorem 2. 1]. It is worse. ∞) and apply Theorem 2. ∞) with only one exception.24 (3) tells us that p(x) is concave up on the interval [1. You may consider the spread of a desease. ∞). We see that p (x) > 0 for x ∈ (1. I(t) increases. You are invited to study the concavity of the other trigonometric and hyperbolic functions. The function ln x is concave down on the interval (0. Theorem 2. and p (x) < 0 for x ∈ (−∞. p (x) < 0 for x ∈ [−∞.24 (3). ∞). You may conﬁrm these statements visually by inspecting a graph of the tangent function. The exponential function exp(x) = ex is concave up on (−∞. 1) with only one exception.24 (2) says that q is concave down on (−∞. Its second derivative is p (x) = 6x−6 = 6(x−1). 4 . Remark 7. Denote the number of infected people by I(t). and vice versa.25. To see this. ∞). ∞). Study the concavity properties of the function p(x) = x3 − 3x2 − 9x + 3.78 CHAPTER 2. 1). ∞).24 (3) implies that tan x is concave down on (−π/2. ∞). 0] and concave up on [0. ♦ Consider the function tan x. Example 2. and then quote Theorem 2. i.24 (1).5 is g(x) = −x2 + 5x − 1. secant lines which are above the graph turn into secant lines below the graph. if I (t) > 0. then the graph of one of the functions is obtained from the one of the other one by reﬂection at the diagonal x = y. You may also use that exp (x) is increasing on the real line. Finally. This means that p (x) > 0 for all x ∈ [1. ln x being concave down implies that exp(x) is concave up. 0) and tan x > 0 for x ∈ (0.24 (2). π/2).2. To see this.e. In this process. One deduces that f (x) is concave down on the interval (−∞. π/2). tan x < 0 for x ∈ (−π/2. Similarly. Solution: You ﬁnd the graph of this function in Figure 2. then g is concave down.. x = 1. It may be scary if I (t) > 0. you may note that the derivative ln x = 1/x is decreasing on (0. and often true in the early stages of an epedemic. The function shown in Figure 2. ∞) and apply Theorem 2. Alternatively. you may observe that a function is concave up if its inverse is convave down4 . Let us look at examples where we apply condition Theorem 2.

then growth slows. THE SECOND DERIVATIVE AND CONCAVITY 79 This means that I (t) increases.75 and the graph of a solution of this diferential equation in Figure 1. During a televised presidential debate. For a while the population seems to explode. the number of infected people. When I (t) turns negative. 1. there has been an increase in poverty but it is a lower rate of increase than it was in the preceding years before we got here. page B6): “Some of these facts and ﬁgures just don’t add up. resp. Medical professional will not necessarily wait for the time when I(t). We called y = a/b the carrying capacity of the system. Earlier we considered the logistic equation y = ay − by 2 . Find intervals on which the following functions are concave up.” 5 . It has begun to decline. The spread or the desease slows. Use implicit diﬀerentiation to ﬁnd the second derivative: y = ay − 2byy = (a − 2by)y . 1984. The inﬂection occurs when y is half the carrying capacity.15. One may hope that eventually I (t) becomes negative. f (x) = x3 − 4x2 + 8x − 7 2. k(x) = 2 cos2 x − x2 for x ∈ [0. concave down. Yes. starts decreasing.2.4. g(x) = x4 + 2x3 − 3x2 + 5x − 2 3. You see the turning point in the graph in Figure 1. so that the actual number of sick people decreases. j(x) = x/(x2 − 1) 6. The point at which I (t) changes signs from being positive to being negative may be considered the turning point in the spread of the desease. one of the candidates said (see the New York Times from October 8th. As long as y is less than a/(2b). 2π]. If a/(2b) < y < a/b. One of the recent presidents was confused by a subtle argument of this kind5 . and the desease spreads at an increasing rate. but it is still going up. but after a while it levels oﬀ so that it does not exceed a the carrying capacity. Let us look at this phenomena in a concrete example. the population grows at an increasing rate. See Example 1.. Exercise 17. h(x) = x + 1/x 4. The ﬁrst case occurs if y = 0 or y = a/b. We see that y = 0 if y = 0 or y = a/(2b). then I (t) decreases.15. and it was the stable equilibrium point. i(x) = 2x4 − x2 5.

7: Concave down at • In other words. 4 2 3 1. GLOBAL THEORY 2. . then the function is concave down. We say that f is concave up. at least near c. f (x) < l(x). One shows a function which is concave up at the indicated point.5 1 -0. we got a picture how the function has to look like near a point. resp. Still. such that the graph lies on one side of the graph.7. You see this situation illustrated in two generic pictures in Figures 2. resp.26.80 CHAPTER 2. Our next theorem tells us how to detect concavity. such that l(c) = f (c) and f (x) > l(x). called a support line. We assume that the graph and the line agree at c.18 on page 10. 6 The idea of an interior point was deﬁned in Deﬁnition 1.. we are asking for a line l(x). and this is the behavior which we like to capture in a deﬁnition. for all x ∈ I with x = c.5 2 1 2 3 4 5 Figure 2. and it tells us how to ﬁnd the support line if the function is diﬀerentiable. at c if there exists an open interval I and a line l. Deﬁnition 2.4.6 and 2. one shows a function which is concave down.2 Concavity at a Point The notion of being concave up or down was deﬁned for functions which are deﬁned on intervals.. If the graph is above the line.5 0.5 1 1.6: Concave up at • Figure 2. if it is below. then the function is concave up. Let f be a function and c an interior point 6 of its domain.5 1 2 0. concave down.

Let f be a function and c an interior point of its domain. we can draw the tangent line easily. The sign of the second derivative of a functions tells us whether a function is concave up or down at a point. then f is concave down at c.. For example. b). the function f (x) = x5 − 7x4 + 2x3 + 2x2 − 5x + 4 is concave down at x = 2 because f (2) = −148 < 0. minimum. It is the tangent line. minimum. then f is concave up at c. of f . We just hold the ruler against the graph. Theorem 2. resp. As we like to apply local properties of the function. then the test is inconclusive. So. 1. b) if and only if f is concave up (resp. To relate concavity properties on an interval to those at each point in the interval we state. there can be many support lines at any given point. LOCAL EXTREMA AND INFLECTION POINTS 81 Theorem 2. 2. resp. we focus on interior points is the domain of the function. f (c) ≤ f (x). If f is diﬀerentiable and concave up or down at c. down) on (a. and this line is the tangent line to the graph of f at c. or neither. for all x in some open interval I around c. resp. down.2. In general. then there is only one support line. then the support line is unique. down) at each point in (a.27. without proof. b).28. If the second derivative is zero. A local extremum is a local maximum or minimum.5 Local Extrema and Inﬂection Points We are going to discuss two types of points which are particularly important in the discussion of (graphs of) functions.5.29 (Local Extrema). If f is increasing at c or if f (c) > 0. for a diﬀerentiable function which is concave up or down at a point. 3. The function can be concave up. Then f is concave up (resp. We say that f has a local maximum.. In this case we call f (c) a local maximum. at c if f (c) ≥ f (x). 2. Deﬁnition 2. but if the function is diﬀerentiable at c. Let f be a function and c an interior point in its domain7 . Let f be a function which is deﬁned on an open interval (a. . the following theorem. If f is decreasing at c or if f (c) < 0.

The vertex of a parabola is always a local extremum. . π/2).5 -1 -1.5 0. Soon we will develop tests which detect inﬂections points.5 1 Figure 2. I.e. So the concavity changes at x = 0 and that means that there is an inﬂection point at x = 0. 0] and concave up on the interval [0. a local minimum if the coeﬃcient of x2 is positive. We do not need any test to see that f (x) = |x| has a local minimum at x = 0. we have that f is concave up on the interval (a.5 1 -1 -2 -1 -1 -2 1 2 -0.82 CHAPTER 2. We call c an inﬂection point of f if the concavity of f changes at c.30 (Inﬂection Points). c] and concave down on [c. The function is concave down on the interval (−π/2. f has a local maximum of f (c) at c. if f (c) is the largest value compared the values at points near c. We will study tests which allow us ﬁnd local extrema soon. and f (x) = −(x − 1)2 has a local maximum at x = 1.5 -0. Let f be a function and c an interior point of its domain. Deﬁnition 2.9. GLOBAL THEORY 1. or vice versa.9: An Inﬂection Point In other words. You see the graph of this function in Figure 2. and a local maximum if the coeﬃcient of x2 is negative. for some numbers a and b with a < c < b. b).8 has a local minimum at x = −1.5 3 1 2 0. No test is required to see that f (x) = tan x has an inﬂection point at x = 0.. The function shown in Figure 2.18 on page 10 this means that f (x) is deﬁned for all x in some open interval around c. 7 According to Deﬁnition 1.8: A local minimum Figure 2.

We say that c is a critical point of f if f is diﬀerentiable at c and f (c) = 0.32 (Critical Points).22 on page 76 tells us that there exists some positive number d. Suppose that f is diﬀerentiable at c and f (c) > 0. and there are points x to the right of and arbitrarily close to c such that f (x) > f (c). there are very few points where local extrema can occur. The test does not give a suﬃcient condition for a local extremum. Let f be a function and c an interior point of its domain. then c is a critical point of the function. We say that c is a saddle point of f if f is diﬀerentiable at c and f (c) = 0. then the function need not have a local extremum at c. c).6. there are points x to the left of and arbitrarily close to c such that f (x) < f (c). Theorem 2. or if f is not diﬀerentiable at c. If a function has a local extremum at c. it is customary to say: Deﬁnition 2. and their derivatives exp x = exp x and ln x = 1/x are everywhere nonzero. It makes sense to introduce one more word.31. To have an abbreviation for the points which are recognized as important in this theorem. So. To see this.33 (Saddle Points).2. These functions have no critical points. c + d).31 they have no local extrema. Proposition 2.31. but f does not have a local extremum at c. If c is a critical point of the function. then either f (c) > 0 or f (c) < 0. Let f be a function and c an interior point of its domain. Neither the exponential function nor the logarithm function have local extrema. In other words.6 Detection of Local Extrema We will discuss how to detect local extrema. DETECTION OF LOCAL EXTREMA 83 2. and in neither case we have an extremum at c. and according to Theorem 2. No local extrema can occur at points which are not critical.31 provides us with a necessary condition. if f has a local extremum at c. Let f be a function and c an interior point of its domain. such that f (x) < f (c) for all x ∈ (c − d. then f does not have a local extremum at c. If f (c) = 0. If f is diﬀerentiable at c and f (c) = 0. The ﬁrst result excludes many points. Proof of Theorem 2. and f (x) > f (c) for all x ∈ (c. that f does not have a local extremum at c. This means. Theorem 2. Deﬁnition 2. . then the same argument applies with inequalities reversed. by deﬁnition. If f (x) < 0. observe that these functions are diﬀerentiable on their domain. Typically. then f is either not diﬀerentiable at c or f (c) = 0.

Solution: The function is diﬀerentiable for all real numbers x.11. is x = 1. So q (x) = 0 if x = 1.e.10. GLOBAL THEORY Example 2. If we write the function in the form q(x) = (x − 1)2 + 2. Find the local extrema of the function q(x) = x2 − 2x + 3. and its only critical point is at x = 0. which is the only zero of g (x) = 3x2 . the only critical point. the function has a saddle point at this point. You should conﬁrm this result by having a look at Figure 2. then we see that q does indeed that a local minimum at x = 1. and q (x) = 2x − 2 = 2(x − 1). The only point at which we can have a local extremum. 0). ♦ 8 8 6 6 4 2 4 -2 -1 -2 1 2 2 -4 -6 -1 1 2 3 -8 Figure 2. This saddle point is shown in Figure 2.84 CHAPTER 2. where this function is graphed. Obviously.10: A local minimum Figure 2.35. i. g(x) > 0 for all x ∈ (0. Solution: The function g(x) is everywhere diﬀerentiable. ∞) and g(x) < 0 for all x ∈ (−∞. Show that the function g(x) = x3 has a saddle point at x = 0. This means that there is no local extremum at x = 0. ♦ .11: A saddle point Example 2.34.. As g (0) = 0 and there is no local extremum at x = 0.

You can see graphs of f and f in Figures 2.37 (First Derivative Test). If f (x) > 0 for all x ∈ (c−d. If f (x) > 0 for all x ∈ (c − d. c + d) for some d > 0. If the function is decreasing on (c − d. 1 + 3/3). c] and decreasing on [c. ∞). then f has a local maximum at c. Suppose c is an interior point of the domain of a function f . Theorem 2. and suppose that for some d > 0 the function is increasing on (c−d. then f has a local minimum at c. If f (x) < 0 for all x ∈ (c − d. Let us illustrate the use of the theorem with an example. c] and increasing on [c. c) and f (x) < 0 for all x ∈ (c. then f has a saddle point at c. then f has a local minimum at c.36. Taking advantage of the information provided by the ﬁrst derivative. 1 − 3/3) and (1 + 3/3. positive and negative. c + d).13 . f (x) is positive on the intervals x √ (−∞. and c is a critical point. Then f has a local maximum at c. Find the local extrema of the function f (x) = x3 − 3x2 + 2x + 2.12 and 2. c)∪(c. and f (x) is negative on the interval √ √ (1 − 3/3. Solution: We diﬀerentiate f (x) and express f (x) as a product of linear factors: √ √ 3 3 2 f (x) = 3x − 6x + 2 = 3 x − 1 + x− 1− 3 3 It is easy to determine where the factors are zero. c)∪(c. c+d). 1. we obtain the following test.6. We √ conclude that f (x) = 0 if √ = 1 ± 3/3.38. c + d). DETECTION OF LOCAL EXTREMA 85 Let us formulate a criterion which conﬁrms that a function has a local extremum at a point c. c + d). c) and f (x) > 0 for all x ∈ (c. c+d). Suppose f is a function which is deﬁned and diﬀerentiable on (c − d. Example 2.2. This conclusion also holds if f (x) < 0 for all x ∈ (c−d. 2. 3. It gives us a suﬃcient condition for a local extremum at c. Theorem 2. c + d).

and these are the only points where a local extremum can occur. (g) and (h).5 -1 1 1. 2π].18 and 2. Find the local extrema of the following functions.2 0. Hint: We discussed the monotonicity properties of these functions in Examples 2.12: f (x) = x3 − 3x2 + 2x + 2 Figure 2.86 CHAPTER 2. 2π]. In the last two problems. Based on the sign of f (x) on intervals to the left and right √ these two critical points we see that √ has of f a local maximum at x = 1 − 3/3 and a local minimum at x = 1 + 3/3.5 2 4 3 2 1 0.5 2 Figure 2. restrict yourself to the interval [0.13: f (x) = 3x2 − 6x + 2 √ The only only critical points of f are at x = 1 ± 3/3. (a) f (x) = 3x2 + 5x + 7 (b) f (x) = x3 − 3x2 + 6 (c) f (x) = (x + 3)/(x − 7) (d) f (x) = x + 1/x (e) f (x) = x3 (1 + x) (f) f (x) = x/(1 + x2 ) (g) f (x) = cos 2x + 2 cos x √ (h) f (x) = sin2 x − 3 sin x Hint: You discussed the intervals of monotonicity for these functions in Exercise 16.19.8 1. . Exercise 19.5 1.6 1.2 1 1.4 1.4 2. Find the local extrema of the following function: (1) f (x) = x2 + 3x x−1 (2) g(x) = sin 2x + 2 sin x for x ∈ [0. GLOBAL THEORY 2. ♦ Exercise 18.

let us assume that f (c) = 0 and f (c) > 0. The sign of f at the critical points tells us whether we found a local minimum or a local maximum. It does not provide us with a necessary condition. Assume also that f (c) and f (c) exist and that f (c) = 0. If f (c) < 0. The assumption that f (c) > 0 means that f is concave up at c (see Theorem 2. we diﬀerentiate f and ﬁnd the critical points. then f has a local maximum at c. We also have a local minimum at x = 3 because at this critical point the second derivative of the function is positive. p (−1) = −12 and p (3) = 12. First. ♦ Proof of the Second Derivative Test. In particular. Furthermore. Its equation is l(x) = f (c). Theorem 2. because this is a critical point and p (−1) < 0. In this sense. If f (c) = f (c) = 0. Let f be a function and c an interior point in its domain. the function f can have a local extremum at c. the zeros of f (x). then f has a local minimum at c. Spelled out explicitly this means that f (x) > l(x) = f (c) . the test provides us with a suﬃcient condition for the existence of a local extremum at a point. see Figure 2. DETECTION OF LOCAL EXTREMA 87 We may use the second derivative to detect the change of sign of the ﬁrst derivative.2 on page 72) p(x) = x3 − 3x2 − 9x + 3.39 (Second Derivative Test). Solution: We calculated the ﬁrst derivative. The second derivative test tells us that we have a local maximum at x = −1.2. The assumption that f (c) = 0 means that the tangent line to the graph of f at (c.6.37. Example 2. f (c)) is horizontal. If f (c) > 0. Find the local extrema of the function (for a graph. There may or may not be a local extremum at c. then the test is inconclusive.27 (1)). We will show that f has a local minimum at c. p (x) = 6x − 6 = 6(x − 1).40. The critical points of the function are x = −1 and x = 3. and the assumptions of the test are not satisﬁed. p (x) = 3x2 − 6x − 9 = 3(x + 1)(x − 3). To apply the theorem to the detection of the local extrema of a diﬀerentiable function f (x). Then we diﬀerentiate f (x). as it is called for in the assumptions in Theorem 2. Furthermore.

We leave it to the reader. then we can just answer this question. c) ∪ (c. Let us start out with an example. c + d).5 2 1.5 1 2 0. f has a local minimum at c.5 -0.5 -2 0.5 1 1. (a) f (x) = 4x2 − 7x + 13 (b) f (x) = x3 − 3x2 + 6 (c) f (x) = x + 3/x (d) f (x) = x2 (1 − x) (e) f (x) = |x2 − 16| (f) f (x) = x2 /(1 + x2 ).5 1 1.5 6 4 Figure 2. Example 2. A theorem provides a necessary and a suﬃcient condition for the existence of an inﬂection point.88 CHAPTER 2. 2. Figure 2. GLOBAL THEORY for some positive number d and for all x ∈ (c − d. Exercise 20.5 -0.7 Detection of Inﬂection Points We deﬁned an inﬂection point to be a point at which the concavity of a function changes. If we know where the function is concave up and down. Find the critical points and the local extrema.5 -0.5 2 2. .14: The graph of g. In other words. Find the the inﬂection points of the function g(x) = x3 − 4x2 + 3x − 5. The proof that f has a local maximum at c if f (c) = 0 and f (c) < 0 is similar.41.5 2 2. We want to detect inﬂection points more eﬃciently. 8 2.15: The graph of g .5 0.

1. The roots are √ 3 9 ± 21 1√ t= ± 21 = . Solution: We calculate the second derivative of the function and ﬁnd f (t) = 24t2 − 36t + 10. From the formula for the second derivative we conclude that g (x) < 0 if x ∈ (−∞.15.2. If f has an inﬂection point at c.42. ♦ . we have an inﬂection point at x = 4/3. You see the inﬂection point indicated as a dot in Figure 2. 4 12 12 Now. but this is a bit cumbersome.14 and the one of g in Figure 2. f (c) exists and f (c) = 0. 2. By deﬁnition.14.7. √ We could plug t = (9 ± 21)/12 into the expression for f . Let f be a function and c an interior point of its domain. 4/3] and concave up on [4/3. This means that g is concave down on the interval (−∞. Find the inﬂection points of f (t) = 2t4 − 6t3 + 5t2 − 7t + 4. The theorem says that the inﬂection √ points of f (t) are at t = (9 ± 21)/12. According to the theorem. then f has an inﬂection point at c.43. We calculate the third derivative of f : f (t) = 48t − 36. let us check whether there are inﬂection points at either of these values for t. DETECTION OF INFLECTION POINTS 89 You see the graph of g in Figure 2. then f (c) = 0. ∞). 4/3) and that g (x) > 0 if x ∈ (4/3. ♦ Theorem 2. we have to ﬁnd the zeros of f (x) to determine where an inﬂection point can be. ∞). and √ this means that f (9 ± 21)/12) = 0. Suppose that the ﬁrst and second derivatives of f exist at c. We see right away that f (t) = 0 exactly if t = 3/4. We calculate the ﬁrst and second derivative of g: g (x) = 3x2 − 8x + 3 and g (x) = 6x − 8. You also see that g (x) has a local extremum at the same point. Example 2. If f (c) = 0.

17: The graph of f .44. Apparently. In many cases we like to ﬁnd the maximal value of a function. Discuss the relation between the inﬂection points of a function f and the local extrema of its derivative f . Zooming in on parts of the graph f will not improve this.5 8 6 4 2 -3 0 1 2 3 -2 -1 -2 1 2 3 -3 -2 -1 Figure 2. You see the graphs in Figures 2. If we are given graphical information. and it will be nearly impossible to ﬁnd the zeros of f . Any reasonable software has no problem with this.8 Absolute Extrema of Functions We said that a function f has a local maximum at c if its value at c is largest in comparison to the values at point near c. our ability it ﬁnd inﬂection points of a function is limited by our ability to ﬁnd the zeros of its second derivative.2 + x2 − 3(sin x)3 . and where it occurs. Figure 2. then this quite easy.17.16: The graph of f . At least in this example. anywhere in the domain of the function. ♦ Exercise 21. 2. it will take an eﬀort to calculate the second derivative of this function.5 2 1. the graph of f tells us much more about the concavity of the function f than its own graph. 3]. Example 2. A look at the graph of f barely reveals some of the inﬂection points.16 and 2.5 1 0. but the graph of f shows them clearly.90 CHAPTER 2. Find the inﬂection points of the function f (x) = 1. GLOBAL THEORY Apparently. This concept is captured in . 3 2. We asked the computer to graph f and f for x ∈ [−3.

Then we call f (c) the absolute maximum of f . A continuous function on a closed interval [a. Let f be a function. The argument for the absolute minimum is left to the reader. If f is diﬀerentiable at c.5 2 1 1 2 3 4 -2. b] assumes its absolute maximum and minimum either at a critical point or at an endpoint of the interval.45. 4]. the absolute extrema of the function occur either at one of the end points x = 0. the c is critical.47. and the function has a local maximum at c. 8 12.5 4 5 2. or at a critical . then we say that f has an absolute minimum at c. then it does so at some interior point c. and we call f (c) the absolute minimum of f .2. and c a point in its domain. and c is critical as well. Proof. We say that f has an absolute maximum at c if f (x) ≤ f (c) for all x in the domain of f . A diﬀerent expression is to say that the function assumes its absolute extremum at c.17 we asserted that a continuous function assumes its absolute maximum at some point in the interval. ABSOLUTE EXTREMA OF FUNCTIONS 91 Deﬁnition 2. If f is not diﬀerentiable at c. If the function does not assume its absolute maximum at an endpoint. In Theorem 1. Figure 2.8.19: 3x2 − 10x + 6 Solution: According to Theorem 2.31. Theorem 2. If f (x) ≥ f (c) for all x in the domain of f . Example 2. x = 4.46. then f (c) = 0 by Theorem 2.5 10 6 7.5 2 3 4 Figure 2.18: x3 − 5x2 + 6x + 1. Find the absolute extrema of the function f (x) = x3 − 5x2 + 6x + 1 for x ∈ [0.46.

2. we conclude that the function assumes its absolute maximum of 9√ x = 4. Example 2. we content ourselves with problems of an algebraic or geometric nature. (a) f (x) = x2 − 5x + 2 for x ∈ [0. You may also check that f (x) = 6x − 10. the zeros of 2 − 10x + 6. f (0) = 1 and f (4) = 9.5486 and .19. GLOBAL THEORY point. 5] (b) f (x) = x3 + 3x2 − 5x + 2 for x ∈ [−3. are x = (5 ± f (x) = 3x 7)/3. ♦ Exercise 22.. To avoid lenghty introductions to real-life problems.1126 and f ((5 − 7)/3) = . 5] (d) f (x) = cos 2x + 2 cos x for 0 ≤ x ≤ 2π (e) f (x) = sin x + cos x for 0 ≤ x ≤ 2π 2. In fact. so that the combined . The approximate values of the function at these points are √ √ f ((5 + 7)/3) = 3. and √ √ f ((5 + 7)/3) > 0 and f ((5 − 7)/3) < 0.18 and the one of f in Figure 2.7848. You may compare our calculation with the graphs of f in Figure 2. Comparing the values of f (x) at these four points.5] √ √ (c) f (x) = 2 + x/ 1 + x for x ∈ [0. and its absolute at minimum of approximately . The second derivative test tells us that the function has a local minimum at √ √ x = (5 + 7)/3 and a local maximum at x = (5 − 7)/3. Find the absolute extrema of the functions on the indicated intervals.9 Optimization Story Problems Many real-life problems are formulated as optimization problems.3689 at x = (5 − 7)/3. i. Approximate values of these roots are 2. How long should each piece be. Cut a string of length 50 centimeters into two pieces.3689. Use one piece as the perimeter of an equilateral triangle and the other one as the perimeter of a disk.48.√ The critical points.e. Calculus helps us to solve these optimization problems.92 CHAPTER 2. We consider a few examples and give some problems for practice.

This means that 3a + 2πr = 50 and a = √ 50 − 2πr . We note that A(r) is a parabola which is open upwards. ♦ Exercise 23. and its area is a 4 3 . 3 4 3 3 3 √ and A (r) = 0 if and only if r = 50/(2π + 6 3). √ . Answer the original question: The combined area of the disk and √ the triangle will be minimal if r = 50/(2π + 6 3).94. all string is used for the circle. Introduction of notation: There are many ways to set up the notation to solve this problem. so that the combined area of the triangle and the circle is maximal? In our solution we will go through several steps. The critical point. The combined area of the triangle and disk is √ √ a2 3 3 50 − 2πr 2 2 A= + πr 2 . we need that 0 ≤ r ≤ 25/π. which we just found. 3 2 The height of the triangle is h = a 2 3 .28 amd A(25/π) ≈ 198.9. In the latter case. The area of the disk is πr 2 . Formulate the problem mathematically: Find the absolute minimum (maximum) of the function √ 3 50 − 2πr 2 A(r) = + πr 2 . 25/π]. Express information as equations: The perimeter of the triangle will be 3a and the perimeter of the circle will be 2πr. It is also the absolute minimum of A(r) on any interval which contains the critical point.2. and it will be maximal of r = 25/π. OPTIMIZATION STORY PROBLEMS 93 area of the triangle and the circle is minimal? How long should each piece be. Repeat the previous example with 1. a disk and a square. Among them we say that the side length of the triangle is a and the radius of the circle is r. For the end points we have: A(0) ≈ 120. is where the local minimum occurs. + πr = 4 4 3 For this to make sense. Solve the mathematical problem: The derivative of A(r) is √ 3 50 − 2πr 4π −π 50 − 2πr A (r) = − · + 2πr = √ + 2πr. 4 3 for r ∈ [0.

The derivative of V is V (x) = W L − 4(W + L)x + 12x2 . Express information as equations: As we folded up a strip of width x. length L − 2x. The zeros of V are at x= 8 1 (L + W ) ± 6 L2 + W 2 − LW . 4. GLOBAL THEORY 2. On the interior of the interval the function is positive. For yourself.. 3. You could have cut out a square of size x × x at each corner. height x. V (0) = V (W/2) = 0.49. 60 and 90 degrees). Construct an open box from a rectangular piece of card board of length L and width W . two geometric shapes of your own choice. and convince yourself that any box obtained by a diﬀerent process will have smaller volume. and volume V (x) = (W − 2x)(L − 2x)x = W Lx − 2(L + W )x2 + 4x3 . and x ≤ L/2. Then we fold up a strip of width x along each side8 . What are the dimensions of the box with the largest possible volume? In our solution we will go through several steps. in fact x ≤ W/2.e. 5. i. a regular hexagon and a square. we call the longer side of the rectangle L and the shorter one W . a disk and half an equilateral triangle (the angles at 30. x ≥ 0.94 CHAPTER 2. Solve the mathematical problem: At the end points of the interval V vanishes. Formulate the problem mathematically: Find the absolute maximum of the function V (x) = W Lx − 2(L + W )x2 + 4x3 for x ∈ [0. the box will have width W − 2x. By construction. W/2]. an equilateral triangle and a square. draw a picture of this production process. To simplify matters. Clariﬁcation and introduction of notation: We construct the box by making an incision at a 45 degree angle at each corner. x ≤ W/2. . Example 2.

Modify the problem from above. The ﬁrst derivative of this function is A (x) = −2 sin x + (π − 2x) cos x. 0). As the function V (x) has only one local maximum in the interval. (x. You will get an open box whose base is an equilateral triangle. Convince yourself that the vertices of the rectangle should be (x. π/2]. Start out with an equilateral piece of card board with side length a. ♦ Its width will be W − 2x and its length L − 2x. constructing a box with a round base from a circular piece of card board. π/2]. We need to ﬁnd the absolute maximum for this function for x ∈ [0. Exercise 27. Solution: Draw a graph of sin x so that you can follow the discussion. Exercise 24. What is the largest possible volume for a right circular cone of slant height a? Example 2. the local maximum is the same as the absolute maximum. How broad should the folded up strips be. OPTIMIZATION STORY PROBLEMS 95 The function has an inﬂection point at (W + L)/6.2. We conclude that V has a local maximum at x= 1 (L + W ) − 6 L2 + W 2 − LW . (π − x. you ﬁnd that A (x) = 0 if and only if tan x = π − 2x . so that the volume of the box is maximal? Exercise 26. Determine the rectangle of maximal area which can be placed between the x-axis and the graph of the function f (x) = sin x.9. Answer the original question: The box with the largest volume will have a height of x= 1 (L + W ) − 6 L2 + W 2 − LW . and to the left of which V is concave down. Exercise 25. Repeat the previous example with speciﬁc numbers for the width and length of the piece of card board. sin x) and (π − x. so that its area is A(x) = (π − 2x) sin x. 2 . After a simple algebraic simpliﬁcation. Make incisions at the corners.50. to the right of which V is positive and V is concave up. The width of the rectangle is π − 2x and its height is sin x. sin x) for some x ∈ [0. and fold up strips along the edges. 0).

We conclude that x0 is the only critical point of A(x). x3 We ﬁnd that f (x) = 0 if x = 1.652183.51 tells us that the absolute minimum of the function is f (1) = 2. With this. (b) If f is concave down on I and has a local maximum at x0 . and that is decreasing.72066 and a height of sin x0 = . Our next result allows us to do the same even if the interval is not closed and bounded. b] we could inspect the values of the function at the critical points and at a and b. and f is concave up on (0. Apparently A(x) = 0 at the end points x = 0 and x = π/2 of the interval. ∞). Substituting x0 you will see that A (x0 ) < 0. It follows from the second derivative test that A(x) has a local maximum at x0 . This tells us that A(x) assumes its absolute maximum at x0 . You may calculate A (x). ♦ 9 π−2x 2 One possible argument is that tan x is increasing on the interval [0. It allows us to decide whether a local extremum is also an abolute one. and that f (x) > 0 for all x in (0. Convince yourself9 that this is the only zero of A (x) for x ∈ [0. GLOBAL THEORY Find an approximate solution of the equation using Newton’s method or your calculator. π/2). ∞).52. So these functions can intersect in only one point. the ﬁnal answer to our problem is: The rectangle of maximal area which can be placed between the x-axis and the graph of the sine function will have a width of approximately π − 2x0 = 1. Theorem 2. ∞).51. Its area will be about 1. So f has a local minimum at x = 1.710462. The assumptions of this theorem are satisﬁed in many applied problems. Example 2. then f assumes its absolute maximum at x0 . then f assumes its absolute minimum at x0 . ♦ To ﬁnd the absolute extrema of a continuous function on an interval of the form [a. π/2]. A fairly good approximation of the zero of A (x) is x0 = .12218. (a) If f is concave up on I and has a local minimum at x0 . Suppose f is deﬁned on an interval I.96 CHAPTER 2. . Theorem 2. Find the absolute minimum of the function f (x) = x + 1 x for x ∈ (0. Solution: We calculate the ﬁrst and second derivative of f (x): f (x) = 1 − 1 x2 and f (x) = 2 .

Two hallways. b). and that (2. (a. Minimize the cost of the material needed to make a round drum with a volume of 200 liter (i. . Exercise 36. Exercise 35. 0).00 per linear meter. and (0. bottom and sides. Exercise 34. Exercise 30. b). 0) in the plane. Suppose that a and b are positive. Find the dimensions of the cone if its volume is to be maximal..000. so that a ball of radius 10 centimeters can be inscribed. Exercise 33. Exercise 31. meet at a right angle. Find the dimensions of the cylinder if its volume is the be a maximum.9. 0). Exercise 29. the other three sides by straight fences. and one vertex on the line 3x + 5y = 15. The cost of the exterior walls is $ 1. A right circular cone is inscribed in a sphere of radius R. Find the dimensions of the warehouse that minimizes the construction cost. and the same material is used for the top. Consider a triangle in the plane with vertices (0. 0). Determine the dimensions of the meadow so that its area is maximal. 5) lies on the line through the points (a. and (0.2. Find the dimensions of a right circular cone of minimal volume. Find the largest possible area for a rectangle with base on the x-axis and upper vertices on the curve y = 4 − x2 . What should the slope of the line be. . one 8 feet wide and one 6 feet wide. A rectangular warehouse will have 5000 m2 of ﬂoor space and will be separated into two rectangular rooms by an interior wall.2 m3 ) if (a) the drum has a bottom and a top. One side of a rectangular meadow is bounded by a cliﬀ. one vertex on the positive x-axis. (b) the drum has no top (but a bottom) and the same material is used for the bottom and sides. The total length of the fence is 600 meters. What are the dimensions of a rectangle of this kind with maximal area? Exercise 32.00 per linear meter and the cost of the interior wall is $ 600. Inscribe a right circular cylinder into a right circular cone of height 25 cm and radius 6 cm. so that the area of the triangle is minimal? Exercise 37.e. one vertex on the positive y-axis. Draw a rectangle with one vertex at the origin (0. Determine the length of the longest ladder that can be carried horizontally from one hallway into the other one. OPTIMIZATION STORY PROBLEMS 97 Exercise 28.

assuming you are allowed a ﬁxed amount of material? More speciﬁcally determine the ratio of radius and height which will maximize the volume. and the left over material is recycled for half its value. Consider a box with a round base and no lid whose interior is subdivided into six wedge shaped sectors. What should be the dimensions of the banner if the area of the white area is to be maximized? Exercise 41. Going through the following . so that its volume is maximal. The point on the coast line closest to the island is 6 km from the power station. Consider the distance D(x) between a point P (x) = (x. Then the tangent line to the graph of f at x1 intersects the line joining P (x1 ) and Q perpendicularly. y0 ) not on this graph. and along the sides 10 cm.98 CHAPTER 2. Exercise 38. for all practical purposes. The width of the border at top and bottom is 15 cm. and the material for the top and bottom is twice as expensive as the material for the sides. GLOBAL THEORY (c) the drum has a bottom and a top. Design a roman window with a perimeter of 4 m which admits the largest amount of light. the same material is used for the top and bottom. but the top and the bottom are cut out of squares. Suppose D(x) has a local minimum at x1 . Exercise 39. Which shape should it have.10 Sketching Graphs The techiques which we developed so far provide us with some valuable tools for graphing functions. (d) the situation is as in the previous case. Let us make a list of data which we may determine. and. To lay the cable costs $40. The total area is 1 m2 . (A roman window has the shape of a rectangle capped by a semicircle. so that we can sketch a graph rather precisely. Find the minimal cost for laying the cable. f (x)) on the graph of a diﬀerentiable function f (x) and a point Q = (x0 . 2.000 under water.) Exercise 40. A rectangular banner has a red border and a white center.000 per kilometer under ground and $70. you may suppose that the shore line is straight. Exercise 42. A power line is needed to connect a power station on the shore line to an island 2 km oﬀ shore.

negative. Intervals on which f (x) is positive give you intervals on which f (x) is concave up. i. concave down. (a) Plot some points on the graph. then your graph will look very much like the graph of f (x). Useful information for graphing a function: We call the function f (x). If you cannot ﬁnd the zeros by analytical means. and keep track of the intervals on which the function is increasing. (d) Find the ﬁrst derivative f (x) of f (x). plot the values at its endpoints. (f) Find the second derivative f (x) of f (x). decreasing. (h) Decide at which critical points of f (x) the function has a saddle point or local extremum. resp. If the function is given on a closed interval. try it numerically (Newton’s method). Plot the inﬂection points (x and y value). and intervals on which f (x) is negative give you intervals on which f (x) is decreasing. decide on which intervals the function is positive.. . and whether it is a minimum or a maximum. resp.e. Plot the critical points (x and y value). Let us go through the program in an example. (e) Repeat (b) and (c) with f (x) in place of f (x). the graph will have all of the essential features of the graph of f (x). If you now draw a graph which exhibits all of the properties which you gathered in the course of the suggested program. and keep track of the intervals on which the function is concave up. the points where the concavity changes. Find the inﬂection points of the function. such as the y-intercept. The zeros of f (x) provide you with the critical points of f (x). More importantly..10. resp. and intervals on which f (x) is negative give you intervals on which f (x) is concave down. (g) Repeat (b) and (c) with f (x) in place of f (x). Intervals on which f (x) is positive give you intervals on which f (x) is increasing... (b) Plot the zeros of the function. (c) If possible. SKETCHING GRAPHS 99 program is also a good review of the material which we developed in this chapter.2.

x = 1. 3] and f (x) is increasing √ on [(1 + 33)/4.53. √ • f (x) is positive on the interval ((1 − 33)/4. This allows us to factor the expression for f (x). (1 − 33)/4]. In the ﬁrst step we applied the product rule. 1) and f (x) is increasing √ on [(1 − 33)/4. √ • f (x) is negative on the interval (1. We based the calculation on the description of f (x) in (2. You should verify this by multiplying out the expression for f (x) in (2.1) f (x) = (x − 1)2 (x2 − 4) = (x − 1)2 (x − 2)(x + 2).1). the function f (x) is diﬀerentiable on the given interval. −2) and (2. (1 − 33)/4) and f (x) is decreason ing on [−3. 3].100 CHAPTER 2. f (0) = −4 and f (3) = 20. Plot these x-intercepts. or x = 2. (1 − 33)/4].1). √ 1 x − [1 − 33] . (b): As a polynomial. (e): We use the quadratic formula to ﬁnd the zeros of the factor 2x2 −x−4 √ in the expression for f (x). we note that (2. (a): Plot the y intercept of the function and its values at the end points of the given interval: f (−3) = 80.1). (1+ 33)/4) and f (x) is decreasing √ on [1. and we ﬁnd: √ 1 f (x) = 4(x − 1) x − [1 + 33] 4 We conclude that: √ • f (x) is negative √ the interval [−3. 1) and (1. we see right away that f (x) = 0 if and only if x = −2. Solution: To make the discussion a little easier. Having written f (x) as in (2. Discuss the graph of the function f (x) = x4 − 2x3 − 3x2 + 8x − 4 for x ∈ [−3. we see that f (x) is positive on the intervals [−3. 2). 3]. GLOBAL THEORY Example 2. (d): We calculate the derivative of f (x): f (x) = 2(x − 1)(x2 − 4) + (x − 1)2 2x = 2(x − 1)(2x2 − x − 4). 4 . 3]. √ • f (x) is positive on the interval ((1 + 33)/4. and negative on (−2. (c): Counting the signs of the factors of f (x). They are (1 ± 33)/4. The only exceptional points are its zeros. and then we used elementary algebra. 1].

3] √ • f (x) √ inﬂection points at x = (1 − 3)/2 ≈ −.10. 3] and f (x) is concave up √ on [(1 + 3)/2. The values of the function at its inﬂection points is approximately: √ √ 1− 3 1− 3 f( ) ≈ −7.37 and at x = has (1 + 3)/2 ≈ 1. SKETCHING GRAPHS 101 √ • f (x) has a critical point and local minimum at (1 − 33)/4 ≈ −1. (g): We use the quadratic formula to ﬁnd the zeros on f (x) and factor it: √ 1 f (x) = 12 x − [1 + 3] 2 We conclude that: √ • f (x) is positive√ the interval [−3. 2 . 4 4 Plot these points. The values of the function at its three critical points are approximately: √ √ 1 − 33 1 + 33 f( ) ≈ −12. a critical point and local maximum at x = 1. 2 2 Plot these points. (h): At this point we could use the second derivative test to ﬁnd at which critical points the function has local extrema.2.37.21 & f ( ) ≈ −. (1 − 3)/2] √ √ • f (x) is negative on the interval ((1 − 3)/2. (f): We rewrite the ﬁrst derivative as f (x) = 4x3 − 3x2 − 3x + 4. (1 + 3)/2] √ • f (x) is positive on the interval ((1 + 3)/2.69. but we decided this already based on ﬁrst derivative behaviour in (e). and a critical point and √ local minimum at (1 + 33)/4 ≈ 1. and ﬁnd f (x) = 12x2 − 12x − 6. √ 1 x − [1 − 3] . (1 − 3)/2) and f (x) is concave on up on [−3. (1 + 3)/2) and f (x) is √ √ concave down on [(1 − 3)/2.39 & f (1) = 0 & f ( ) ≈ −.29.19.54.

The dots indicate the points which we suggests to plot. We consider the interval: I1 = [−3. −2] I2 = −2. as the values at the endpoint a comparetively large. 2 √ √ 1 + 3 1 + 33 I6 = . 33 √ 33 1 − 3 I3 = . that at some end points of intervals the function is zero. etc. 2 4 √ 1 + 33 I7 = . It should be understood.5]. ♦ Exercise 43. Showing all of the graph would show less clearly what happens near the intercept.1: Properties of the Graph In Figure 2. discuss the function f (x) = (x − 1)(x − 2)(x + 2) = x3 − x2 − 4x + 4 on the interval [−3. 1− 1− √ √ 4 √ 1+ 3 I5 = 1.20 you see the graph of the function. In Figure 2.21 you see the graph of f on an even smaller interval. 2. We have shown it on a slightly smaller interval. and inﬂection points.1 2 We tabulate the which properties hold on which interval. ﬁnd the absolute extrema of this function. . In addition. GLOBAL THEORY Let us gather and organize our information. and parts of the graphs of f and f . f is concave down where f is negative. You can use them to see that f is decreasing where f is negative. 3] .102 CHAPTER 2. Property Sign I1 I2 I3 I4 neg inc I5 neg dec I6 I7 I8 pos neg neg inc up neg neg pos dec up inc up inc up Monotonicity dec dec Concavity up up down down Table 2. In analogy with the previous example.2 4 I8 = [2. extrema. 4 2 √ 1− 3 I4 = .

You may have to apply Newton’s method to ﬁnd zeros of f . In addition. f Exercise 44. 2π]. SKETCHING GRAPHS 103 10 10 5 5 -3 -2 -1 -5 1 2 3 -2 -1 -5 1 2 -10 -10 Figure 2. In analogy with the previous example. discuss the function f (x) = x3 − 3x + 2 on the interval [−2. f .2.10. ﬁnd the absolute extrema of this function. . ﬁnd the absolute extrema of this function. In analogy with the previous example. and f . f . In addition.21: f . Exercise 45. 2].20: The Graph Figure 2. discuss the function f (x) = 2 sin x + cos 3x on the interval [0.

GLOBAL THEORY .104 CHAPTER 2.

Suppose that f is a function which is deﬁned and bounded on the interval [a. The indeﬁnite integral of a function f is the family (set) of all antiderivatives of f .1 Properties of Areas So far. and the x-axis.Chapter 3 Integration We will introduce the ideas of the deﬁnite and the indeﬁnite integral. then the deﬁnite integral of f over the interval [a. For important classes of functions one may utilize deﬁnite integrals to construct antiderivatives. it should have the following properties: • The area of a rectangle is the product of the lengths of its sides. 3. 105 . and the lines x = a and x = b. like rectangles. the x-axis. we only know the area of some simple regions. We will denote the area of a region Ω by Area(Ω). If it exists. and ﬁnd the area of the region Ω bounded by the graph of f (x). shown in Figure 3. Whatever concept of area we have in mind. b]. b] is a real number. The Fundamental Theorem of Calculus relates deﬁnite integrals and antiderivatives. all functions whose derivative is f . consider the function f (x) = x2 e−x . It is denoted by b f (x) dx. the lines x = 1 and x = 5.e. To be concrete.1. so that for a non-negative function it makes sense to think of the integral as the area of the region bounded by the graph of the function. i. a The deﬁnition is set up.

then the area of the union Ω1 ∪ Ω2 of Ω1 and Ω2 is deﬁned.106 CHAPTER 3. If the regions Ω 1 and Ω2 do not intersect. • Suppose that Ω1 and Ω2 are regions in the plane.2 0. and that the area of each of them is deﬁned.1 1 2 3 4 5 6 Figure 3. and Area(Ω1 ∪ Ω2 ) = Area(Ω1 ) + Area(Ω2 ).3 you see a rectangle Ru with area 2.24 which contains Ω. that the region under the graph shown in Figure 3. In Figure 3.2 you see a rectangle Rl with area .24.6 ≤ Area(Ω) ≤ Area(Ru ) = 2.1 has an area.4 0. which is contained in Ω.5 0. which occurs frequently in our upcoming constructions: . INTEGRATION 0. The ﬁrst two principles tell us that Area(Rl ) = . then Area(Ω1 ) ≤ Area(Ω2 ). In Figure 3. From above principles one may derive another one.6 0.6.3 0. If Ω1 ⊆ Ω2 .1: f (x) = x2 e−x • Suppose that Ω1 and Ω2 are regions in the plane. Suppose for a moment. and that the area of each of them is deﬁned.

2 0.3.2: A rectangle Rl contained in Ω Figure 3.2 0.5 0.3: A rectangle Ru containing Ω • Suppose the region R in the plane is the union of a ﬁnite number of rectangles R1 . . .3 0. xj ] with 1 ≤ j ≤ n. . . such that a = x0 ≤ x1 ≤ · · · ≤ xn−1 ≤ xn = b. . We do so by partitioning the interval before applying the ideas from above. .1 1 2 3 4 5 6 1 2 3 4 5 6 Figure 3. b] is of a collection is points {xj | 0 ≤ j ≤ n}. Then Area(R) is deﬁned. and it is equal to the sum of the areas of the regions R1 . .6 0.5 0.4 0.2 Partitions and Sums We like to reﬁne the approach to calculating areas of regions which we started in the previous section. .3 0. and then we add up what we get over the individual intervals.2. 3.4 0. PARTITIONS AND SUMS 107 0. . Rn and any two of them intersect at most in an edge. Rn : Area(R) = Area(R1 ) + · · · + Area(Rn ). A partition of an interval [a. b] is partitioned into n intervals [xj−1 . The interval [a.6 0.1 0.

108

CHAPTER 3. INTEGRATION

3.2.1

Upper and Lower Sums

As before, f denotes a function which is deﬁned and bounded on [a, b]. On each interval we pick numbers mj and Mj , such that mj ≤ f (x) ≤ Mj We deﬁne the lower sum to be (3.1) Sl = m1 (x1 − x0 ) + m2 (x2 − x1 ) + · · · + mn (xn − xn−1 ). for all x ∈ [xj−1 , xj ].

and the upper sum to be (3.2) Su = M1 (x1 − x0 ) + M2 (x2 − x1 ) + · · · + Mn (xn − xn−1 ).

These sums depend on the choice of partition and the choices for the mj and Mj .

0.6 0.5 0.4 0.3 0.2 0.1

0.6 0.5 0.4 0.3 0.2 0.1

1

2

3

4

5

6

1

2

3

4

5

6

Figure 3.4: A union of rectangles contained in Ω

Figure 3.5: A union of rectangles containing Ω

Let us return to the example of the function f (x) = x2 e−x on the interval [1, 4]. In the computation of the lower sum we use the partition x0 = 1 < x1 = 2 < x2 = 3 < x3 = 4 < x4 = 5 of the interval. We also pick m1 = .35, m2 = .43, m3 = .28 and m4 = .16. This leads to a lower sum Sl = 1.22. In the computation of the upper sum we use the partition x0 = 1 < x1 = 3 < x2 = 4 < x3 = 5

3.2. PARTITIONS AND SUMS

109

of the interval. We also pick M1 = .55, M2 = .45 and M3 = .3. This leads to an upper sum Su = 1.85. The mj and Mj represent the heights of the rectangles in Figures 3.4 and 3.5, and we trust these ﬁgures to show that mj ≤ f (x) and f (x) ≤ Mj on the respective interval. As before, let Ω denote the region under the graph. Then the union of the rectangles shown in Figure 3.4 is contained in Ω, and the union of the rectangles shown in Figure 3.5 contains Ω. Thus, if Ω has an area, the our principles tell us that Sl = 1.22 ≤ Area(Ω) ≤ Su = 1.85. In fact the only number greater or equal to all lower sums and smaller or equal to all upper sums is 5 − 37 , and this will be the area of the region e e5 Ω. Here e is the Euler number. Example 3.1. Let us ﬁnd upper and lower sums for the function f (x) = x3 − 7x2 + 14x − 8 for x ∈ [.5, 4.5]. In contrast to the function in the previous example, this function is not non-negative.

4 3 2 1 1 -1 -2 -3 2 3 4

4 3 2 1 1 -1 -2 -3 2 3 4

Figure 3.6: Rectangles for calculating an upper sum.

Figure 3.7: Rectangles for calculating a lower sum.

Solution: For the purpose of calculating an upper sum, we partitioned the interval [.5, 4.5] using the intermediate points x0 = .5, x1 = 1.1, x2 = 2.4, x3 = 3.8, and x4 = 4.5. As numbers Mi (so that Mi ≥ f (x) for x ∈ [xi−1 , xi ])

110

CHAPTER 3. INTEGRATION

we chose M1 = .3, M2 = .7, M3 = −.9, and M4 = 4.4. These data are shown in Figure 3.6. With these choices, the upper sum is Su = .3(1.1 − .5) + .7(2.4 − 1.1) + (−.9)(3.8 − 2.4) + 4.4(4.5 − 3.8) = 2.91. In Figure 3.6 you see four rectangles. Their areas are combined to calculate the upper sum. The areas of the ones above the x-axis are added, the ones below the axis are subtracted, in accordance with the sign of the Mi . In the calculation of the lower sum we partitioned [.5, 4.5] using x0 = .5, x1 = .8, x2 = 2.3, x3 = 4.2, and x4 = 4.5. As numbers mi (so that mi ≤ f (x) for x ∈ [xi−1 , xi ]) we chose m1 = −2.7, m2 = −.8, m3 = −2.2, and m4 = 1.3. These data are shown in Figure 3.7. With these choices we calculate a lower sum of Sl = −2.7(.8 − .5) + (−.8)(2.3 − .8) + (−2.2)(4.2 − 2.3) + 1.3(4.5 − 4.2) = −5.8. In Figure 3.7 you see four rectangles. Their areas are combined to calculate the lower sum. The areas of the ones above the x-axis are added, the ones below the axis are subtracted, in accordance with the sign of the mi . In summary, you see that we still combine areas of rectangles in the calculation of the upper and lower sum, only that, depending on the sign of the Mi or mi , these rectangles are either above or below the x-axis, and depending on this, their areas are either added or subtracted. ♦ Let us make a simple albeit important observation: Theorem 3.2. Let f be a function which is deﬁned and bounded on a closed interval [a, b]. Let Sl be any lower sum of f and Su any upper sum. Then Sl ≤ Su . Let us repeat the statement of the theorem to emphasize its meaning. Whichever partition of the interval [a, b] and whichever mi we use in the calculation of the lower sum Sl and whichever partition of the interval and whichever Mi we use in the calculation of the upper sum Su , the lower sum is always smaller or equal to the upper sum. To see this, one reﬁnes the partitions for the upper and lower sum computation so that they become the same. Then one notes that mi ≤ Mi for all i.

3] √ and x2 = π ∈ [ 3.5 0. PARTITIONS AND SUMS 111 3.8 you see the picture illustrating the computation. In each subinterval.749741.2 Riemann Sums Suppose once again that f (x) is a function which is deﬁned on the interval [a. Su any upper sum. We leave it to the reader to contemplate Proposition 3. 0. Then we deﬁne the Riemann Sum (3.8: Representing a Riemann Sum To be more concrete.3. As Riemann sum we obtain SR = f (x1 )(x1 − x0 ) + f (x2 )(x2 − x1 ) ≈ 1. and SR any Riemann sum. Let us use the partition √ x0 = 1 < x1 = 3 < x2 = 5. let us return to the example f (x) = x2 e−x on the interval [1.2.2 0. √ √ In the two interval of this subdivision we pick the points x1 = 2 ∈ [1. Let Sl be any lower sum of f .4 0.2. Let f be a function which is deﬁned and bounded on a closed interval [a. 5]. and their heights are f (x1 ) and f (x2 ). In Figure 3.3 0. . The sum of the areas of these rectangles is the Riemann sum.3) SR = f (x1 )(x1 − x0 ) + f (x2 )(x2 − x1 ) + · · · + f (xn )(xn − xn−1 ). Pick once more a partition a = x0 ≤ x1 ≤ · · · ≤ xn−1 ≤ xn = b of the interval. b]. b].3. pick a point xj ∈ [xj−1 . Then Sl ≤ SR ≤ Su .1 1 2 3 4 5 6 Figure 3. There are two rectangles.6 0. their bases are the intervals in the subdivision. 5]. xj ].

In this case. whatever choices we make in the calculation of lower and upper sums Sl and Su .3 Limits and Integrability The idea is to reﬁne the partitions in our previous construction. A crucial additional fact is stated in the next result. Call it Yl . if the function is non-negative.1 The Darboux Integral and Areas As we discussed earlier.4.112 CHAPTER 3. b]. a 1 To distinguish it from the result of a diﬀerent. Theorem 3. such that Sl ≤ Y ≤ Su for all lower sums Sl and upper sums Su of f . we always have that Sl ≤ Su . Then Y is any number such that Yl ≤ Y ≤ Yu . INTEGRATION 3. There exists a real number Y . the number Y is called the integral1 of f for x between a and b. such that S l ≤ Y ≤ Su for all lower sums Sl and all upper sums Su of f . so that in the limit our sums can be justiﬁably called the area of the region under the graph. construction we should Y the Darboux integral. . one observes that the set of all lower sums of f has a least upper bound. Idea of Proof. Yl ≤ Yu . We are now prepared to deﬁne the concept of integrability of a function. It is also denoted by b f (x) dx. Let f be a function which is deﬁned and bounded on a closed interval [a. 3. If there is exactly one number Y . Apparently. then we say that f is integrable over the interval [a. Deﬁnition 3. Let f be a function which is deﬁned and bounded on a closed interval [a. b]. The set of all upper sums of f has a greatest lower bound. To deduce the theorem from the completeness of the real numbers.3. The speciﬁcs depend on which sums we are working with.5. but typically equivalent. b]. Call it Yu .

Example 3. all subintervals have the same length 1/n. Apparently. n n n n This is an equidistant partition of the interval [0. a function over a closed interval [a.3.. xj ] because f (x) is increasing on [0. and with this an entire interval. On the other hand. between all upper and lower sums. M3 = f (x3 ) = 3 n 2 .. lower sums. For completeness sake and later use. For the upper sums we pick M1 = f (x1 ) = 1 n 2 . . m2 = f (x1 ) = 1 n 2 2 × 1 n . a function is integrable if for every positive number D there is an upper sum Su and a lower sum Sl such that Su − Sl < D. M2 = f (x2 ) = 2 2 n 2 . Mj ≥ f (x) for all x ∈ [xj−1 . Solution: Fix a natural number n and set 1 2 n−1 n x0 = 0 < x1 = < x2 = < · · · < xn−1 = < xn = = 1. we use that 12 + 22 + 32 + · · · + n2 = We calculate the upper sum n(n + 1)(2n + 1) . 1]. LIMITS AND INTEGRABILITY 113 Remark 8. Explore upper sums. b] is not integrable if and only if the exists a positive number D such that Su − Sl ≥ D for any lower sum Su and any upper sum Su . 6 Su = M1 (x1 − x0 ) + M2 (x2 − x1 ) + · · · + Mn (xn − xn−1 ) 1 2 1 2 2 1 n = × + × + ··· + n n n n n 1 2 = 1 + 22 + · · · n2 n3 n(n + 1)(2n + 1) = 6n3 1 1 1 = + + 2 3 2n 6n For the lower sums we pick m1 = f (x0 ) = 0. 1]. So. In this case there are at least two diﬀerent numbers. j and Mj = f (xj ) = n in general. and integrability for the function f (x) = x2 on the interval [0. .. m3 = f (x2 ) = 2 n 2 .3.. .6. 1]. let us explain what happens when a function is not integrable. Without proof.

25 1 0.4 0. INTEGRATION 1. the x-axis. then we say that the region Ω has an area and b Area(Ω) = a f (x) dx. .2 0. bounded. and mj = f (xj−1 ) = j−1 n in general.6 0. Let f be a function which is deﬁned. According to the deﬁnition this means. We do not only see that Sl ≤ 1 ≤ Su .8 1 1. Let Ω be the region bounded by the graph of f . that f (x) = x2 is integrable over the interval [0. Using the expressions for Su and Sl you see that Su − Sl = 1/n. and nonnegative on a closed interval [a.10.2 1. so that Sl ≤ Y ≤ Su for all natural numbers n.8 1 1.5 0.25 1 0.75 0.5 0.9: Rectangles for calculating a lower sum.75 0.2 0. and the lines x = a and x = b.2 0. b].6 0. Our answer is formulated as a Deﬁnition 3.10: Rectangles for calculating an upper sum. but also that Y = 1/3 is the only real 3 number. 2 Figure 3.2 Figure 3.9 and 3.4 0.7. The resulting lower sum is Sl = 1 1 1 − + 2 3 2n 6n For n = 5 you see the rectangles whose areas are the summands in the lower and upper sums in Figures 3.25 -0.114 CHAPTER 3. 1] and that 1 0 x2 dx = 1 .25 -0. ♦ 3 We motivated our introduction of upper and lower sums by our quest to deﬁne the concept of area. If f is integrable over this interval.2 0.

We say that L = lim SR |P|→0 if for all > 0 there exists a δ > 0. LIMITS AND INTEGRABILITY 115 The upper and lower sum were constructed such that if there is any justiﬁcation to assigning an area to Ω then Sl ≤ Area(Ω) ≤ Su . xj ]. then we say that f is Riemann integrable over [a. If the limit of the SR exists. the x-axis. For an integrable function there is exactly one real number between the lower and upper sums. because there are a lot of choices which we make to deﬁne such a sum. and write b L = lim SR = |P|→0 a f (x) dx. Consider an interval [a. and the lines x = 0 and x = 1 is 1 Area(Ω) = 0 1 x2 dx = . b]. so this is the only number which we can call the area of Ω. call L the Riemann integral of f .2 The Riemann Integral Earlier we introduced the idea of a Riemann sum.8 (Limit for Riemann Sums).3. b] up into smaller interval [xj−1 . the area of the region Ω bounded by the graph of the function f (x) = x2 . which broke [a. . Deﬁnition 3.3. xj ]. b]. We want to consider a limit Riemann sums.3. in other words. Suppose the function f (x) is deﬁned on [a. and set SR = f (x1 )(x1 − x0 ) + f (x2 )(x2 − x1 ) + · · · + f (xn )(xn − xn−1 ). For example. b] and a function f (x) deﬁned on it. This is trickier than for functions. We picked a partition P : a = x0 ≤ x1 ≤ x2 ≤ · · · xn−1 ≤ xn = b. ♦ 3 3. xj ]. We deﬁne the norm of the partition P to be |P| = max{xj − xj−1 | 1 ≤ j ≤ n}. such that |L − SR | < whenever |P| < δ. In each of the subintervals we picked a point xj ∈ [xj−1 . the norm of P is the length of the longest of the intervals [xj−1 .

116

CHAPTER 3. INTEGRATION

Thus L = lim SR if we can force SR to be close to L, as close as we like, by making the partition ﬁne, by making each subinterval no longer that some number. It is worth pointing out and not very diﬃcult to show the following proposition. Proposition 3.9. Suppose the function f is deﬁned on the interval [a, b]. Then f is Riemann integrable if and only if it is Darboux integrable. If deﬁned, the Riemann and the Darboux integral are the same.

3.4

Integrable Functions

We like to provide a supply of integrable functions. Our ﬁrst result is typically proved in an analysis course. Theorem 3.10. Suppose f is deﬁned and continuous on [a, b]. Then f is integrable over [a, b]. According to this theorem, polynomials are integrable over any interval of the form [a, b]. Rational functions (i.e., functions of the form p(x)/q(x) where p(x) and q(x) are polynomials) are integrable over intervals of the form [a, b] as long as q does not vanish anywhere on the interval. The trigonometric functions (sin, cos, tan, cot, sec, and csc) are integrable on intervals where the functions are deﬁned. Arbitrary powers of a variable, f (x) = xα , are integrable. One just needs to make sure that the function is deﬁned on the interval [a, b]. For any real number α it suﬃces to assume that a > 0. For any real α ≥ 0, it suﬃces to assume a ≥ 0. For rational numbers α = p/q, where p and q are integers and q is odd, it suﬃces to assume 0 ∈ [a, b]. For non-negative integers α no assumption needs to be made on a and b. Just making sure that the resulting functions are deﬁned everywhere on [a, b], the functions just mentioned may be added, subtracted, multiplied, divided, and composed, and one still ends up with integrable functions. Let us introduce another class of functions for which we can prove that they are integrable. Deﬁnition 3.11. Suppose f (x) is a function. We say that f (x) is nondecreasing if f (x1 ) ≤ f (x2 ) whenever x1 and x2 are in the domain of f (x) and x1 ≤ x2 . We say that f (x) is non-increasing if f (x1 ) ≥ f (x2 ) whenever x1 ≤ x2 . Proposition 3.12. Let [a, b] be a closed interval and let f be deﬁned and non-increasing or non-decreasing on [a, b]. Then f is integrable on [a, b]. In particular, monotonic (increasing or decreasing) functions are integrable.

3.4. INTEGRABLE FUNCTIONS

117

Proof. We will use Darboux integrability. Let us assume that the function f is non-decreasing on the interval. The non-increasing case is left as an exercise. Take any partition of the interval: a = x0 < x1 < · · · < xn = b. The reader may justify why we can use the same partition in the computation of the upper and lower sum. For i = 1, . . . , n we set mi = f (xi−1 ) Then, because f is non-decreasing, mi ≤ f (x) ≤ Mi for all x ∈ [xi−1 , xi ]. & Mi = f (xi ).

We use the mi and Mi to compute upper and lower sums. Let ∆ be the largest value of the xi − xi−1 . Then Su − Sl = [M1 (x1 − x0 ) + · · · + Mn (xn − xn−1 )] −[m1 (x1 − x0 ) + · · · + mn (xn − xn−1 )] = (M1 − m1 )(x1 − x0 ) + · · · + (Mn − mn )(xn − xn−1 ) ≤ [(M1 − m1 ) + (M2 − m2 ) + · · · + (Mn − mn )] ∆ = (Mn − m1 )∆ = [f (b) − f (a)]∆ The inequality in the computation follows from the choice of ∆. The second to last equality follows because Mi−1 = mi for all i = 2, . . . , n. Many terms in the computation cancel. Given any positive number D, we can make the partition ﬁne enough so that [f (b) − f (a)]∆ < D. According to our Remark 8 this means that f is integrable over the interval, as we claimed. We illustrate the steps in the proof in a concrete example. In Figure 3.11 you see the upper and lower sum. The lower sum is the sum of the areas of the darkly shaded rectangles. The upper sum is the sum of the areas of the lightly and darkly shaded rectangles. The diﬀerence between the upper and the lower sum is the sum of the lightly shaded rectangles shown in Figure 3.12. We can combine these areas by sliding the rectangles sideways so that they form one column. Its height will be f (b) − f (a). Its width may vary, but in the widest place it is no wider than ∆, the width of the largest interval in the partition of [a, b]. That means, the diﬀerence between the upper and the lower sum is at most [f (b) − f (a)]∆. As above, we conclude that the function is integrable.

118

CHAPTER 3. INTEGRATION

1.2 1.2 1 1 0.8 0.8 0.6 0.6 0.4 0.4 0.2 0.2 0.2 0.2 -0.2 0.4 0.6 0.8 1 -0.2 0.4 0.6 0.8 1

Figure 3.11: Rectangles for calculating a lower and an upper sum.

Figure 3.12: Rectangles for calculating the diﬀerence between an upper and a lower sum.

Remark 9. There are functions which are not integrable over any interval of the form [a, b] with a < b. Remark 10. Here we only discuss integrability of function over closed ﬁnite intervals, i.e., intervals of the form [a, b]. The discussion of integrability of functions over intervals which are not of this form, e.g., half-open intervals like [a, b) or unbounded closed intervals like [a, ∞), requires additional ideas and techniques which we are not ready to discuss yet.

3.5

Some elementary observations

In spite of our success calculating some integrals using upper and lower sums and the deﬁnition, this is certainly not the way to go in general. To integrate “well behaved” functions we want a theory which allows us to calculate integrals more easily. We have to develop a few basic tools. These are fairly straight forward consequences of the deﬁnition of the integral. Proposition 3.13. If the function f is deﬁned at a, then

a

(3.4)

a

f (x) dx = 0

Proof. The reader should contemplate the proposition.

and consider it over each of the intervals [xj−1 .14 we may break the problem up. b]. . and the one-sided limits (see Section 1. b]. Suppose that we can partition the interval into a ﬁnite number of intervals such that f is non-increasing or non-decreasing on each of them.3) x→x+ j−1 lim f (x) and x→x− j lim f (x).17.14 we ﬁnd Corollary 3. exist and are ﬁnite.10. SOME ELEMENTARY OBSERVATIONS 119 Proposition 3. b] be a closed interval. Use c as one of the points in the partition. We call f piecewise continuous if there is a partition a = x0 < x1 < · · · < xn−1 < xn = b such that f is continuous on the open intervals (xj−1 . On each of these smaller intervals. As an immediate consequence of Propositions 3. Then c b b (3. and f a function which is deﬁned on the interval.5. Deﬁnition 3.12 and 3. Implicitly in the formulation of the proposition is the statement that f is integrable over [a. b] if and only if it is integrable over the intervals [a. If one of the sides of Equation (3. This changes neither the integrability nor the value of the integral. Idea of Proof. Suppose that f is deﬁned on an interval [a. Idea of Proof. we can change the deﬁntion of the function at a point or two and make it continuous. xj ] separately. Then f is integrable on [a. c a point between a and b.16. So the assertion follows from Theorem 3. Let f be deﬁned on the interval [a.14.3. b]. Let [a.10. According to Proposition 3.5) a f (x) dx + c f (x) dx = a f (x) dx. then f is integrable on [a. The remaining details are left to the reader. Corollary 3. We can also extend Theorem 3.15. b]. c] and [c. If f is a piecewise contiuous function on [a. then so does the other one. b].5) exists. b]. xj ) for all 1 ≤ j ≤ n.

18.19.21. Let f be deﬁned and integrable on the interval [a.22. Proof. b]. b] be a closed interval and c a scalar. The approach to integrals via lower and upper sums b could also be generalized to include integrals a where b < a. Suppose that f and g are integrable over the interval. Proposition 3. b]. We mention a few useful estimates for integrals. b] and b b b (f (x) + g(x)) dx = a a f (x) dx + a g(x) dx and b b cf (x) dx = c a a f (x) dx. INTEGRATION Deﬁnition 3.6) a f (x) dx ≤ a b |f (x)| dx. and b (3. and g(x) ≥ h(x) for all x ∈ [a. The proof of this proposition is elementary. b]. If h and g are integrable over [a. If f is integrable over [a. b]. leading to exactly this formula. then b a f (x) dx ≥ 0. and f (x) ≥ 0 for all x ∈ [a. b]. b].120 CHAPTER 3. Proof. b]. then b a g(x) dx ≥ a b h(x) dx. Use that f (x) = g(x) − h(x) ≥ 0 for all x ∈ [a.20. b] be a closed interval and f integrable over [a. Let [a. . Let [a. Then f +g and cf are integrable over [a. Corollary 3. Proposition 3. The proof is left to the reader. b]. though a bit tricky. Then b a f (x) dx = − b a f (x) dx. This deﬁnition is convenient and consistent with what we have said so far about the integral. Using the deﬁnition of the integral it is not diﬃcult to show: Proposition 3. Then the absolute value of f is integrable over [a.

7) a f (x) dx = Area(Ω+ ) − Area(Ω− ). We decompose the region between the x-axis and the graph into the part Ω + above the x-axis and the part Ω− below it. though a bit tricky. We deﬁne two functions: f + (x) = f (x) if f (x) ≥ 0 0 if f (x) ≤ 0 and f − (x) = f (x) if f (x) ≤ 0 0 if f (x) ≥ 0 It is elementary.9) Area(Ω+ ) = a 2 If you want to be formal. We decompose Ω into the union of two sets. the x-axis. and the lines x = a and x = b. we have: Proposition 3. so that the additivity of the integral implies that b b (3.7 we have (3. Ω+ consist of those points (x.6 Areas and Integrals Let us return to the relation between areas and integrals. The question is. Speciﬁcally. b] and Ω the set of points which lie between the graph of f (x) and the x-axis for a ≤ x ≤ b. then you have to ﬂip the region Ω− to lie above the x-axis. Making use of this notation. f = f + + f − . y) in the plane for which a ≤ x ≤ b and 0 ≤ y ≤ f (x).6.8) a f (x) dx = a f + (x) dx + a b f − (x) dx. Only then have we addressed the question of it having an area. b f + (x) dx. and Ω− of those points for which a ≤ x ≤ b and f (x) ≤ y ≤ 0. Then Ω is the union of the sets Ω+ and Ω− . then the areas of the regions Ω+ and Ω− are deﬁned2 and b (3. Idea of Proof.23. what happens if f (x) is not non-negative? Let f be a function which is deﬁned and bounded on a closed interval [a. AREAS AND INTEGRALS 121 3. Suppose f (x) is a non-negative integrable function over an interval [a. then b Area(Ω) = a f (x) dx. Ω+ and Ω− . to show that the integrability of f (x) implies the integrability of f + (x) and f − (x). If f is integrable. According to Deﬁnition 3. If Ω is the area bounded by the graph of f (x). . Apparently.3. b].

and the lines x = a and x = b.24.e.9) and (3.122 CHAPTER 3. there exists a constant c. The set of all antiderivatives of f is called the indeﬁnite integral of f . we understand the the right hand side of (3. Remember that any antiderivatives F1 and F2 of a function f on an interval I diﬀer only by a constant (see Corollary 2. In this expression c stands for an arbitrary constant. i.8). we typically write (3. In Deﬁnition 2. Let f be a function which is deﬁned on an interval I.10) into (3. Deﬁnition 3. INTEGRATION Let −Ω− be the area obtained by ﬂipping Ω− up. π/2 3. Given a function f and an antiderivative F of it.11) as a set of functions. Diﬀerent values for c result in diﬀerent functions. Allowing all real numbers as possible values for c.7 and our elementary properties of the integral we have (3.6 we called a function F (x) with domain I an antiderivative of f (x) if F (x) = f (x). It is denoted by f (x) dx. and −Ω− is bounded by the graph of −f − (x). In other words. The function −f − (x) is non-negative. .11) f (x) dx = F (x) + c.10) Area(Ω− ) = Area(−Ω− ) = a b −f − (x) dx = − a b f − (x) dx. This process does not change areas.. −π/2 sin x dx = 0 because the graph bounds congruent regions above and below the x-axis. so Area(Ω− ) = Area(−Ω− ). and suppose that f has an antiderivative. we take its mirror image along the x-axis. According to Deﬁnition 3. Having an anti-derivative of a function will (typically) make it easy to integrate it over a closed interval. The constant c in the expression is referred to as integration constant. the x-axis. For example.7 Anti-derivatives Consider a function f (x) with domain I. such that F1 (x) = F2 (x) + c for all x ∈ I. Our claim follows now by substituting the results in (3.5).

Then we can write down the indeﬁnite integral of f in the form F (x) + c. we ﬁnd that cos2 x dx = 1 2 (1 + cos(2x)) dx = 1 1 x + sin(2x) + c. or pick up some new ideas as you go along. E.20)). Here are some examples. As you go through them you are expected to learn. Given a function f (x) we might know or guess a function F (x). such that F (x) = f (x). .g. an additional idea is required before we can see the antiderivative.3 on page 63 to come up with ideas for antiderivatives. E. ANTI-DERIVATIVES 123 Example 3.7. 1 dx = x + c √ x dx = 2 3/2 x +c 3 1 x dx = x2 + c 2 1 xn dx = xn+1 (n = −1) n+1 sec2 x dx = tan x + c sec x tan x dx = sec x + c √ dx = arcsin x + c 1 − x2 sin x dx = − cos x + c cos x dx = sin x + c dx = arctan x + c 1 + x2 Using the linearity of the diﬀerentiation (see the diﬀerentiation rules in (1. You may want to consult Table 1. using the trigonometric identity cos2 x = (1 + cos 2x)/2. The reader may practice ﬁnding some antiderivatives for the functions in the next exercise. 3 Occasionally. 5 5x2 − 2 cos x dx = x3 − 2 sin x + c. it is easy to produce more examples. 2 2 Using a diﬀerent trigonometric identity we ﬁnd (1 + cot2 x) dx = csc2 x dx = − cot x + c.g.. ♦ We shall explore additional ideas for ﬁnding antiderivatives at a later.25. You can check the correctness of your guess by diﬀerentiation.3.

Find the following indeﬁnite integrals: 1 dx x3 csc2 x dx (1 + tan2 x) dx csc x cot x dx sin2 x dx sec2 (3x) dx (a) (b) (c) (d) (e) (f) 3 dx (x + 4) dx (x2 − 5) dx cos 2x dx (3 + x)3 dx (3 + 2x)5 dx (g) (h) (i) (j) (k) (l) (m) (n) (o) (p) (q) (r) ex/3 dx x2 2x dx +1 (4 − 3x)5 dx cos(4 − 3x) dx 2x dx (x2 + 3)2 x sec2 (x2 + 5) dx 3. INTEGRATION Exercise 46.8 The Fundamental Theorem of Calculus Our ﬁrst result provides us with a large class of functions which have antiderivatives. . More speciﬁcally. have antiderivatives. Then b a d dx x f (t) dt a f (x) dx = F (b) − F (a). b] and that F is an antiderivative of f . and the grand conclusion of our discussion of antiderivatives is the Fundamental Theorem of Calculus. deﬁned over intervals.26. F (x) = − cos x is an antiderivative of f (x) = sin x. Theorem 3.27 (Fundamental Theorem of Calculus).124 CHAPTER 3. so that the Fundamental Theorem of Calculus tells us that π 0 sin x dx = − cos(π) − (− cos(0)) = −(−1) − (−1) = 2. suppose that a function f is deﬁned and continuous over the interval I. Continuous functions. Theorem 3. Suppose that f is a continuous function over a closed interval [a. Let a ∈ I. The major tool for calculating integrals. For example. Then f (x) = for all x ∈ I.

g. we calculate that 3 −2 5 3 = p(5) − p(3) (x2 − 2x + 5) dx = x3 − x2 + 5x 3 3 = −2 95 . This is quite convenient. One commonly uses the notation b F (x) a = F (b) − F (a). THE FUNDAMENTAL THEOREM OF CALCULUS 125 As another example. 3 Other examples are π/4 π/4 sec x tan x dx = sec x 0 0 = √ 2−1 and π/3 π/4 csc x cot x dx = − csc x π/3 π/4 = √ −2 3 3 √ √ √ 2 3 − (− 2) = 2 − ..3. Using this notation. we write π sin x 0 = sin π − sin 0.8. we also use the notation shown in the following example: x3 − 5x2 + 2x − 8 where p(x) = x3 − 5x2 + 2x − 8. . E. If there are ambiguities due to the length of the expression to which this construction is applied. Remark 11 (Notational Convention). so that the Fundamental Theorem of Calculus tells us that π/4 0 sec2 x dx = tan(π/4) − tan(0) = 1. note that F (x) = tan x is an anti-derivative of f (x) = sec2 x. 3 The reader is invited to practice a few examples.

Let F (x) be any anti-derivative x of f (x) on I. In particular. the desired result is an easy consequence of Theorem 3.26. Using this calculation of c and substituting x = b in (3. as claimed. F (x) = H (x) = f (x).8.12) H(x) = a f (t) dt = F (x) + c We can ﬁnd out the value for c by substituting x = a in this equation.26 and the Fundamental Theorem of Calculus.26.1 Some Proofs Because of their importance. Evaluate the following deﬁnite integrals: 1 π (a) 0 2 (3x + 2) dx 6−t dt t3 1 5 √ 2 x − 1 dx 2 0 (g) 0 π 1 cos x dx 2 cos(x/2) dx (b) (c) (d) (h) 0 2 (i) (j) 0 1 (t3 − t2 ) dt csc x cot x dx −2 π/2 |x2 − 1| dx cos2 x dx sin2 (2x) dx sec2 x dx π/4 π/2 (e) π/6 −1 (k) 0 π/4 (f) −1 7x dx 6 (l) 0 3.12). we obtain b a f (t) dt = F (b) − F (a). and H(x) = a f (t) dt the one provided by Theorem 3. In particular. we ﬁnd that a f (t) dt = 0 = F (a) + c a or c = −F (a). Proof of the Fundamental Theorem of Calculus. For some constant c and all x ∈ I: x (3.126 CHAPTER 3. . INTEGRATION Exercise 47. Essentially. we like to prove Theorem 3.5) tells us that F and H diﬀer by a constant. Cauchy’s Theorem (see its application in Corollary 2.

13) f (c) = m(h) ≤ f (x) ≤ f (d) = M (h) for all t between x and x + h. so that x and x + h are both in I. THE FUNDAMENTAL THEOREM OF CALCULUS 127 Proof of Theorem 3. Because we assumed continuity of f on the interval I.4) that x+h h→0 x lim f (t) dt = f (x).26.10 that x F (x) = a f (t) dt exists. So it is our task to show that F is diﬀerentiable at x. The points c and d may not be uniquely determined by h. such that (3.13) and Corollary 3. x Here we assume that x is not an endpoint of I. after adjusting the notation to ﬁt the current setting. the task becomes to show that f (x) = F (x + h) − F (x) h x+h 1 = lim f (t) dt − h→0 h a h→0 lim x f (t) dt a = 1 h→0 h lim x+h f (t) dt.21 that m(h) · h = x x+h m(h) dt ≤ x x+h f (t) dt ≤ x x+h M (h) dt = M (h) · h.3.26. According to the Extreme Value Theorem (see Theorem 1. and with this that m(h) ≤ 1 h x+h x f (t) dt ≤ M (h). but m and M are.17) there are points c and d between x and x + h. and this is exactly what we needed to show. Continuity of f (x) implies that h→0 lim m(h) = f (x) = lim M (h). and that F (x) = f (x). . It follows from (3. h→0 It follows from a pinching argument (see Proposition 1.8. it follows from Theorem 3. Using Theorem 1. We omit (leave to the reader) the modiﬁcations of the proof which are required in the case where x is an endpoint of I.

f (u) = u3 . and conclude that (3. There are no general rules what substitution must be used. rather success justiﬁes the means. (2x − 3)3 dx = 1 2 (2x − 3)3 · 2dx = 1 (2x − 3)4 + c. It will give you at least some experience which you may then rely on in similar examples. dx Assume that f and g are continuous on I. Substitution is a method which. g (x) = 2. Setting u = g(x) we write du = g (x)dx. a way to use the notation. You hope that you can ﬁnd an antiderivative for the simpliﬁed expression. . d F (g(x)) = f (g(x))g (x).. which can be applied to write down the steps in an integration using substitution eﬃciently.9 Substitution In some cases it is not that easy to ‘see’ an antiderivative of the function one likes to integrate. Your success with this method depends greatly on experience. Sometimes this method is helpful. Let F and g be functions which are deﬁned and diﬀerentiable on an interval I. 8 4 Here we used g(x) = 2x − 3. when applied correctly.e. Set F = f .128 CHAPTER 3. practice. We explain the method. and F (u) = u . will simplify the expression for the function you like to integrate. Let us give a few examples to illustrate how this method can be put to use. Then f (g(x))g (x) is continuous as well. and this means in context that u = g(x). The method is based on the chain rule for diﬀerentiation. The variable for the functions f and F is often called u. We may take antiderivatives of both sides of our previous equation. according to the chain rule. 4 There is a pattern. For example. Then. i. INTEGRATION 3. other times it is not.14) f (g(x))g (x) dx = F (g(x)) + c. Working through the examples will teach you how to apply this method in some typical situations.

The equation du = g (x)dx helps us to write down what happens when we perform the substitution as in the ﬁrst equality in (3. We may have to use a substitution and a trigonometric identity to solve We do not attach any particular meaning to the symbols dx and du in their own right. Suppose also that F is an anti-derivative of f . or du = 2xdx. but this is beyond the scope of these notes. we calculate that x x2 + 2 dx = = = = 1 x2 + 2 · 2xdx 2 1 √ u du 2 1 3/2 u +c 3 1 2 (x + 2)3/2 + c. these symbols have a meaning. In the ﬁrst step of this calculation we carry out the substitution. SUBSTITUTION 129 instead of g (x) = du/dx3 .3.9. Then du = dx and t = u − 1. For example. 3 .15). in the second one we ﬁnd the anti-derivative. Thought of as inﬁnitesimals or diﬀerentials. so F = f . 3 du dx We used the substitution u = x2 + 2. Then We calculate that t2 (t + 1)7 dt = = = = = (u − 1)2 u7 du = 2x. Then the pattern for calculating an integral via substitution is (3. (u2 − 2u + 1)u7 du (u9 − 2u8 + u7 ) du 1 10 2 9 1 8 u − u + u +c 10 9 8 1 2 1 (t + 1)10 − (t + 1)9 + (t + 1)8 + c.15) f (g(x))g (x) dx = f (u) du = F (u) + c = F (g(x)) + c. We make use of this notation in our next example. and in the third one we reverse the substitution. 10 9 8 Here we used the substitution u = t + 1.

INTEGRATION sin2 u du 1 [1 − cos 2u] du 2 1 1 u − sin 2u + c 2 2 1 1 (x2 + 5) − sin[2(x2 + 5)] + c. 2 2 We used the substitution u = x2 + 5. Then dv = cos u du. to work out an integral. 2 Sometimes we have to apply the method of substitution twice. Here is an example. . In a second substitution we set v = sin u. Then du = 3(x2 + 1) dx. so that du = 2xdx and the identity sin2 α = [1 − cos 2α]/2. and check the details: sec2 x tan x dx = = = = sec x · sec x tan x dx u du 1 2 u +c 2 1 sec2 x + c.130 an integration problem: 2x sin2 (x2 + 5) dx = = = = CHAPTER 3. Find the substitution which we used in the following computation. (x2 + 1) sin3 (x3 + 3x − 2) cos(x3 + 3x − 2) dx = = = = = 1 sin3 u cos u du 3 1 v 3 dv 3 1 4 v +c 12 1 sin4 u + c 12 sin4 (x3 + 3x − 2) +c 12 In the computation we used the substitution u = x3 + 3x − 2. or more often.

b]. 3 . In the ﬁrst example we assume that a = 0.14). which are important in the context of integrating rational functions. we calculate dx = x2 + 2x + 5 dx = (x + 1)2 + 4 du 1 = arctan u2 + 4 2 x+1 2 + c.3. Assuming as before that f and g are continuous on the interval [a. we have b g(b) (3. dx dx = x2 + a2 = = = Adding another idea. The ﬁrst identity is obtained as a combination of the Fundamental Theorem of Calculus and (3. Let us apply this formula in a few examples. The second one is obvious. 1 0 (x2 − 1)(x3 − 3x + 5)3 dx = 1 3 3 5 u3 du = 1 4 u 12 3 5 =− 136 . adu a2 u2 + a2 1 du a u2 + 1 1 arctan(u) + c a 1 x arctan +c a a We used the substitution u = x+1. and we use the subsitution x = au.9. which we again denote by F . Then b b g(b) g(b) f (g(x))g (x) dx = F (g(x)) a a = F (u) g(a) = g(a) f (u) du.1 Substitution and Deﬁnite Integrals Let us now explore how substitution is used to calculate deﬁnite integrals.16) a f (g(x))g (x) dx = g(a) f (u) du. SUBSTITUTION 131 Here are two examples. To see this. and the third one is another application of the Fundamental Theorem of Calculus. observe that f has an anti-derivative. The dx = adu. and then we proceeded as in the previous example.9. 3.

Then du = (3x2 − 3) dx. 4 We use the substitution x = sin u. When we .132 CHAPTER 3. π is the ratio of the circumference of a circle by its diameter. If x = 0. 1] means that we calculated the area under this graph in the ﬁrst quadrant. or did we assume the answer previously? By deﬁnition. then u = 9. that the area of this unit disk is π. the area of one forth of the disk of radius 1. 3 1 3 4 0 1 We use the substitution u = cos x. i. Then −du = sin x dx. 7 We use the substitution u = x + 1. and if x = 2. then u = 0. √ 8 0 x3 x2 + 1 dx = 1 2 9 1 √ 1 (u − 1) u du = 2 9 1 u3/2 − u1/2 du = 596 . and if x = 1. then u = 1. Using x ∈ [0. The graph of f (x) = 1 − x2 is the northern part of a circle. If x = 0. so that the result of the calculation is hardly surprising. In our calculation of the derivative of the sine and cosine functions we used the estimate that | sin h − h| ≤ h2 /2. Then du = dx and x = u − 1. 1 0 1 − x2 dx = π/2 0 1 − sin2 u cos u du = π/2 0 cos2 u du = π . 15 We use the substitution u = x2 + 1. Similarly.. Incorporating one of our previous techniques. Then dx = cos u du. then u = 2/2. we calculate 2 0 x(x + 1)6 dx = 1 3 (u − 1)u6 du = 3 1 u7 − u6 du = 3554 . if x = 0. For our given values of x. To obtain the limits for the integral we calculate u(0) = 5 and u(1) = 3. There is a more serious matter.e. Then 1 du = x dx and x2 = u − 1. but they will lead to the same results. there are other possible values for u. and if x = 8. and if x = π/4. You were told long time ago in school. and 1 2 3 du = (x −1) dx. the u = 3. √ Remark 12. Finally. Another example is √ √ √ π/4 2/2 2 1 3 2/2 1 2 2 cos x sin x dx = − u du = − u =− 1− . then u = π/2. INTEGRATION We used the substitution u = x3 − 3x + 5. then √ u = 1. Is the example genuine. For 2 √ the limits we calculate. If x = 0. then u = 1.

Taking the square root. A typical proof of the latter inequality starts out by ﬁrst showing that the area of the unit disk is π.13. we draw the two graphs. 2 Only the + sign occurs as x2 ≥ 0. see Figure 3.3. we solve the equation f (x) = x2 = g(x) = 1 − x2 . This idea can be generalized to the discussion of areas of regions between two graphs. This means. The graphs intersect in two points. we ﬁnd x2 = −1± 5 . π/4]. Let us look at an example. Find the following integrals: dx 2x + 1 t dt 2 + 9)2 (4t √ t(1 + t2 )3 dt 2s ds 6 − 5s2 b3 x3 √ dx 1 − a4 x4 √ 3 π π/4 (a) (b) (c) (d) (e) (f) 0 x cos x dx √ x2 x + 1 dx x+3 √ dx x+1 sin2 (3x) dx π/2 2 (k) π/6 1/2 sec(2x) tan(2x) dx dx 4 + x2 0 sec2 x √ dx 1 + tan x √ 1 + sin x cos x dx r (g) (h) (i) (j) 0 (l) (m) (n) (o) cos2 x dx 0 r 2 − x2 dx 3. Solution To get a better understanding. Example 3.10.10 Areas between Graphs Previously we related the integral to areas of a region under a graph. √ After squaring the equation and solving it for x2 . we used that |h| ≤ | tan h| for h ∈ [−π/4. AREAS BETWEEN GRAPHS 133 showed this.28. Now you see the region between the two graphs whose area we want to calculate. Exercise 48. we ﬁnd the x-coordinates of the points where the curves intersect: A=− −1 + 2 √ 5 and B = −1 + 2 √ 5 . Calculate the area of the region between the graphs of the √ functions f (x) = x2 and g(x) = 1 − x2 . we did not derive it. We call the region Ω. we assumed the result in the example. . To ﬁnd their x-coordinates.

14: Region between two graphs To get the area of the region under the graph of f (x) and g(x) over the interval [A. To get the area of the region Ω between the graphs. The region breaks up into two pieces. π/4] on which f (x) ≥ g(x).6 0.2 -1 -1 -0. we take the area of the region under the graph of g(x) and subtract the area of the region under the graph of f (x).13: Region between two graphs Figure 3.5 3 0. ♦ Some problems are a bit more subtle. INTEGRATION 1 1 0. Solution: The region Ω between the graphs is shown in Figure 3.5 2 2. B] we can calculate the appropriate integrals. the region Ω1 over the interval [0. Concretely: B Area(Ω) = A g(x) dx − B B f (x) dx = A A (g(x) − f (x)) dx ≈ 1.5 0.5 1 Figure 3.5 1 1.5 0.06651. You are invited to work out the integral with the help of the Fundamental Theorem of Calculus to verify the result. The numerical value was obtained by computer. and the region Ω2 over the interval [π/4.134 CHAPTER 3.4 0. Example 3. . π] where g(x) ≥ f (x).5 -0.8 0. We calculate the areas of the regions Ω1 and Ω2 separately.14.29. Find the area of the region between the graphs of the functions f (x) = cos x and g(x) = sin x for x between 0 and π.

When we compared integrals and areas.7 on page 114. This deﬁnition generalizes Deﬁnition 3. In all cases it is good to graph the functions before calculating the area of the region between them. Here we did not. resp. ♦ Our general deﬁnition for the area between two graphs is as follows. we proceed as in the previous example: π/4 135 Area(Ω1 ) = Area(Ω2 ) = 0 π π/4 (cos x − sin x) dx = (sin x + cos x) π/4 0 π π/4 = √ 2−1 √ 2.. An additional remark may be in place.3. we had to take into account where the function is non-negative. The area of Ω is b Area(Ω) = a |f (x) − g(x)| dx. Typically this problem gets addressed when the integral is calculated. in others you have to determine them from context. Deﬁnition 3.30.10. Having the correct picture in mind helps you to avoid mistakes. Sketch and ﬁnd the area of the region bounded by the curves: (a) y = x2 and y = x3 . (sin x − cos x) dx = −(sin x + cos x) =1+ In summary we ﬁnd: √ Area(Ω) = Area(Ω1 ) + Area(Ω2 ) = 2 2. AREAS BETWEEN GRAPHS In each case. (e) y = sin x and y = 2 sin x cos x for x between 0 and π. b]. (d) y = sin x and y = πx − x2 . and it incorporates and generalizes Proposition 3. Taking the absolute value of the diﬀerence of f (x) and g(x) allows us avoid the question where f (x) ≥ g(x) and where g(x) ≥ f (x). Let Ω be the region between the graphs of f (x) and g(x) for x between a and b. In some problems a and b are explicitly given. The deﬁnition is also consistent with the intuitive idea of the area of a region. We took care of this aspect by breaking up the interval into the part where f (x) ≥ g(x) and the part where g(x) ≥ f (x). non-positive. Exercise 49.23 on page 121. . (b) y = 8 − x2 and y = x2 (c) y = x2 and y = 3x + 5. Suppose f (x) and g(x) are integrable functions over an interval [a.

only we use the value of the function at the right endpoint instead of the left endpoint: (3.17) IL = f (x0 )(x1 − x0 ) + f (x1 )(x2 − x1 ) + · · · + f (xn−1 )(xn − xn−1 ). INTEGRATION 3. Let us take such a function.11 Numerical Integration The Fundamental Theorem of Calculus provided us with a highly eﬃcient method for calculating deﬁnite integrals. by numerical means.31. We multiply it with the length of the associated interval. In such cases we may have to rely on numerical methods for integrating. Still. Example 3. 2 Solution: Set f (x) = e−x and choose the partition: x0 = 0 < x1 = 1 3 < x2 = 1 < x3 = < x4 = 2.18) IR = f (x1 )(x1 − x0 ) + f (x2 )(x2 − x1 ) + · · · + f (xn )(xn − xn−1 ). approximate values for the integral of a function f (x) over the interval [a. Use the left and right endpoint method to ﬁnd approximate values for 2 0 e−x dx. and then add up the terms.136 CHAPTER 3. for some functions we have no good expression for its anti-derivative. we calculate (3. Explicitly. Both expressions provide us with speciﬁc examples of Riemann sums. Left and Right Endpoint Method: In the left endpoint method we ﬁnd the value of the function at each left endpoint of the intervals of the partition. and show some methods for ﬁnding an approximate value for the integral. We describe diﬀerent ways to ﬁnd. 2 2 2 . In the right endpoint method we proceed as we did on the left endpoint method. b]: b f (x) dx. a In all of the diﬀerent approaches we partition the interval into smaller ones: a = x0 < x1 < · · · < xn−1 < xn = b.

that our speciﬁc function f (x) is decreasing on the interval [0.5 1 1. In this sense. 2.15 and Figure 3.6351975438. and IR is a lower sum. NUMERICAL INTEGRATION 137 Then xk −xk−1 = 1/2 for k = 1. In the midpoint methods. and 4.6351975438 ≤ 2 0 e−x dx ≤ IL = 1. 2].16. Note also.5 2 0.4 0. so the areas of the rectangles are all added. 3. 2 Formula (3.15: Use left end points Figure 3. 2]. .126039724. 2 1 0. That should be less bias. IL and IR are calculated by combining the areas of certain rectangles.6 0.16: Use right end points Apparently. we use the value of the function at the midpoints of the intervals of the partition.126039724.11.3.8 0. Formula (3.5 2 Figure 3.2 1 0.5 1 1.6 0.17) for IL specializes to IL = f (0) + f (1/2) + f (1) + f (3/2) ≈ 1. so that IL is an upper sum for the function f (x) over the interval [0. we have IR = .4 0. ♦ Midpoint and Trapezoid Method: We may try and improve on the endpoint methods. In our case the values of f (x) are all positive and all of the rectangles are above the x axis.8 0.2 0.18) for IR specializes to IR = f (1/2) + f (1) + f (3/2) + f (2) ≈ . 2 The function and the rectangles whose areas are added to give us IL and IR are shown in Figure 3.

the formula is f (x0 ) + f (x1 ) f (xn−1 ) + f (xn ) (x1 − x0 ) + · · · + (xn − xn−1 ). INTEGRATION We use the same partition and notation as above.25) + f (. 2 Solution: We use the same partition of [0. Consider the trapezoid of width (x1 − x0 ) which has height f (x0 ) at its left and f (x1 ) at its right edge. we use appropriate secant lines above all of the intervals in the partition to deﬁne the function T (x) over the entire interval [a.32.20). 2 Let us explain the reference to the word trapezoid. 2 2 It is quite easy to see that (3. Example 3. 2 . b].19)) specializes to IM = f (. Proceeding in the fashion. Expressed diﬀerently. b]. Then (3.25) + f (1. see (3. We have such a trapezoid over each of the intervals in the partition.e. we can draw a secant line through the points (x0 . Speciﬁcally.75) ≈ . and their areas are added to give IT . suppose that f (x) is non-negative on the interval [a. This integral is easily computed by the formula in (3. Then the formula for the midpoint method is: (3. This gives us the graph of a function T (x) over the interval [x0 .21) IT = b IT = a T (x) dx. The formula for IM (see (3. Over the interval [x1 .31. For simplicity. This is the ﬁrst 2 summand in the formula for IT . f (x0 )) and (x1 . 2] as in Example 3. Use the midpoint and trapezoid method to ﬁnd approximate values for 2 0 e−x dx. In the trapezoid method we do not take the function at the average (i.75) + f (1.20). midpoint) of the end points of the intervals in the partition. f (x1 )).20) IT = IL + IR .19) IM = f x 0 + x1 2 (x1 − x0 ) + · · · + f xn + xn−1 2 (xn − xn−1 ). f (x2 )). x1 ].8827889485. but we average the values of the function at the end points. The area of this trapezoid is f (x0 )+f (x1 ) (x1 − x0 ). f (x1 )) and (x2 .138 CHAPTER 3. x2 ] the graph of T (x) is the secant line through the points (x1 .

3. IM is the combined area of certain rectangles.4 0.21) we ﬁnd IT = IL + IR ≈ .5 2 0. The speciﬁc formula for an approximate value of the integral of f (x) over [a.18: Trapezoid Method Based on our previous calculations and Formula (3.8806186341. So 2 IT = T (x) dx. and IT is the area of the region under this graph.5 2 Figure 3. 0 ♦ Simpson’s Method: In Simpson’s method we combine the endpoint and midpoint methods in a weighted fashion.2 1 0. Again.5 1 1.17.17: Use midpoints Figure 3. Their heights are the values f (xi ) at the midpoints of the intervals of the partition.8 0.18. You see the rectangles for this calculation in Figure 3. There you see the function 2 f (x) = e−x and ﬁve dots on the graph.2 0.22) 1 x 0 + x1 f (x0 ) + 4f 6 2 1 + f (xn−1 ) + 4f 6 + f (x1 ) (x1 − x0 ) + · · · xn−1 + xn 2 + f (xn ) (xn − xn−1 ) . The dots are connected by straight line segments. These line segments form the graph of a function T (x). b] is IS = (3. 1 0. 2 We illustrated this calculation in Figure 3.11.5 1 1.8 0.6 0. we use the same notation for the function and the partition as above. Their width are the lengths of the intervals of the partition. NUMERICAL INTEGRATION 139 As for the endpoint methods.6 0.4 0.

6 3 Let us explain the background to Simpson’s method. INTEGRATION IL + 4IM + IR IT + 2IM = .2 0.8 0.33. 2 . Simpson’s method is a reﬁnement of the Trapezoid method.19: Simpson’s Method Example 3. We deﬁne a function P (x) over the interval [a.5 2 Figure 3. Use Simpson’s method to ﬁnd an approximate value for 2 0 e−x dx. xk ] is chosen so that it agrees with f (x) at the end points and at the midpoint of this interval. b] by deﬁning a degree 2 polynomial on each of the intervals of the partition.140 It is quite easy to see that IS = CHAPTER 3.4 0. 1 0. With some work one can show that b IS = a P (x) dx. In one method we use two points on the graph and connect them by a straight line segment.6 0. In the other one we use three points on the graph and construct a parabola through them. The polynomial over the interval [xk−1 .5 1 1.

♦ IS = Example 3.g.881095681980474 0. They should be compared with an approximate value for the integral of 0.31. the function f (x) = e−x and the function P (x) from the discussion of Simpson’s method.7836703747 0.0366313 0. and vary n.8820809836 Table 3.8822020700 1.883063050702697 0.891895792451 0. There you see the graphs 2 of two functions. Simpson’s method with n = 4 gives a result which is better than the left and right endpoint method with n = 1000. E.11.34.882081402972833 0. NUMERICAL INTEGRATION 141 Solution: We use the same partition of [0.9800072469 0. You see the method illustrated in Figure 3.88206555104.8818388108 0.. Compare the accuracy of the various approximate values of 2 0 e−x dx. Even if you use the midpoint and trapezoid method with n = 1000.0183156 0. then the result is far less accurate that Simpson’s method with n = 100. ♦ .872262105229 0. IM and IR are as above.882078948840 0.3. 6 where IL .8299445 0. Only the thickness of the line suggests that there are two graphs of almost identical functions. We partition the interval [0.882081390762421. The formula for IS (see the special case of (3. 2] into n intervals of the same length.0000000 0.19. 2] as in Example 3.882081390762417 2. We tabulate the results.882081366341586 0. n=1 IL IR IM IT IS n = 10 n = 100 0.882081390722 n = 1000 0.7357589 0.22)) specializes to IL + 4IM + IR ≈ . 2 Solution: We compare the approximate values for the integral obtained by the diﬀerent formulas.882082611663 0.1: Approximate Values of the Integral Simpson’s method is more accurate than the other ones.

then I stands for the total distance which you traveled during the time interval [a. It is important that we keep the number n of intervals into which we partition [a.142 CHAPTER 3. b]. referred to as Beaver Pond by the locals. and these may add up. You are invited to come up with more interpretations. then I is the total amount of the drug which has been absorbed in the time interval [a. b] small. this can have a more concrete meaning. π] is 2/π. the average value of the sine function f (x) = sin x over the interval [0. INTEGRATION Remark 13.36. If f (t) stands for the rate at which a drug is absorbed. The amount of water carried by the river depends on the season.7 and Proposition 3. 0 3. b].12 Applications of the Integral In Deﬁnition 3. . Then the quantity fav := 1 b−a b f (t) dt a is called the average value of f (t) over the interval [a. b] and the integral b I= a f (t) dt. Deﬁnition 3.23 we related deﬁnite integrals to areas.34 and compare the diﬀerent methods applied to the calculation of π/2 sin x dx = 1. The fewer computations we make. Example 3. Proceed as in Example 3. it is g(t) = 2 + sin πt 180 . Suppose that f (t) is an integrable function over the interval [a. Based on the context. The river Little Brook ﬂows into a reservoir. Consider a function f (t) on an interval [a.35. As a function of time. In addition. In each computational step we expect to make a roundoﬀ error. b]. It does not only keep the number of overall computations small. Exercise 50. the smaller the cummulative round-oﬀ error will be. Let us explore the diﬀerent aspects of integration in an example. the following deﬁnition expresses the common notion of the average value of a function. If f (t) stands for the speed with which you travel. For example. b].

2 π 3 millions of liters of water in the pond. the rate of change is . APPLICATIONS OF THE INTEGRAL 143 We measure time in days. (c) At which rate does the amount of water in the reservoir change at the beginning of September? (d) On which days will there be 250 million liters of water in Beaver Pond? (e) At which amount of water will the reservoir crest? (f) On the average. after 240 days. there are 200 million liters of water in the reservoir.3. This answers (b). Find F (t). At the beginning of September. Water is released from Beaver Pond at a constant rate of 2 million liters per day. there are F (120) = 200 + 180 2π 1 − cos ≈ 238. At the beginning of the year. the total amount of water in Beaver Pond is T F (T ) = 200 + 0 f (t) dt = 200 + 180 1 − cos π πT 180 millions of liters. The net rate entering is f (t) = g(t) − 2 = sin πt 180 millions of liters per day. The units of g(t) are millions of liter of water per day. We obtain the total change of the amount of water in the reservoir by integrating f (t). The rate at which the amount of water in the pond changes is F (t) = f (t). Set T A(T ) = 0 f (t) dt. after 120 days. By the end of April. This answers (a). (a) How many liter of water are in Beaver Pond by the end of April? (b) Suppose F (t) tells how much water there is in the reservoir on day t of the year.12. On the T -th day of the year. by how much has the amount of water in Beaver Pond increased per day during the ﬁrst three months of the year? Solution: Water enters and leaves the pond. and t = 0 corresponds to New Year.

On the average. t = 0 at the time you take the medication.3 millions of liters. 18 We apply the function arccos to both sides of the last equation and ﬁnd T = 180 5π arccos 1 − π 18 ≈ 88. we like to know for which T we have F (T ) = 250. A pain reliever has been formulated such that it is absorbed at a rate of 600 sin(πt) (mg/hr) by the body. the amount of water in the reservoir increased by about 640.144 CHAPTER 3. (c) A total of 150 mg of the medication has to be absorbed before the drug is eﬀective. 000 liters per day.6 millions of liters of water.10 we introduced the exponential function exp(x) = ex and the natural logarithm function ln x. The pond crests at mid-year. and then the amount of water in it is about 314. . such that F (t) tells how much medication has been absorbed at time t. At the time we only stated that they exist because we did not have the tools to properly deﬁne them.3 millions of liters of water in Beaver Pond.000 liters per day. To ﬁnd at which amount the reservoir crests.866. Many of the routine calculations are formulated as exercises. How long does it take until this threshold is reached? 3. To answer (d). We solve the equation for T : 250 = 200 + 180 1 − cos π πT 180 or cos πT 180 =1− 5π . After three months or 90 days there are about 257. INTEGRATION f (240) ≈ −. This occurs apparently when cos(πt/180) = −1 or t = 180. On the 88-th and 272-nd day of the year there will be 250 millions of liters of water in the reservoir. This answers (e). or 272. (a) What is the total amount of the drug which is absorbed? (b) Find a function F (t). Within this time.13 The Exponential and Logarithm Functions In Section 1. Here t measures time in hours. we have to ﬁnd the maximum value of F (t). We will now ﬁll in the details. and the absorption process is complete at time t = 1. ♦ Exercise 51. The pond is losing water at a rate of 866. the amount of water has increased by 57.

Let us also verify one of the central equations for calculating with logarithms. ∞). observe that t/x = u = 1 when t = x. . y > 0. We need a short calculation. Exercise 52. Proof.11.24) ln(xy) = ln x + ln y. so that du = x dt. Here x and y are ﬁxed positive numbers. (3.39. ∞). x Proof. ∞). The natural logarithm of x is deﬁned as x (3. t Theorem 3. ∞). According to Theorem 2. u Using this calculation we deduce that xy ln(xy) = 1 dt = t x 1 dt + t xy x dt = ln x + ln y. THE EXPONENTIAL AND LOGARITHM FUNCTIONS 145 Deﬁnition 3.38.34. its derivative is ln x = and ln x is increasing on (0. and that t/x = u = y when t = xy. Proposition 3. The natural logarithm function is diﬀerentiable on its entire domain (0. ∞).37. t 1 We use the substitution u = x . (2) ln(1/y) = − ln y for all y > 0. the function is increasing because its derivative ln x > 0 for all x > 0.13. 1 . For the adjustment of the limits of integration. Theorem 3. Then xy x dt = t xy x 1 1 dt = (t/x) x y 1 du = ln y.3. For any x.10 this means that ln x is deﬁned for all x in (0. the third rule in Theorem 1. t This is exactly our claim.23) ln x = 1 dt .26 tells us that ln x = 1/x. According to Theorem 3. The function 1/x is deﬁned and continuous on (0. Show: (1) ln 1 = 0. Let x ∈ (0.

To see this. Show that ln(ar ) = r ln a for all positive numbers a and all rational numbers r. We saw that ln y is an increasing function. Show that ln 4 > 1. and ln x = 1/x. increasing function with domain (0. Taken together it means that it has a unique solution. we have seen that Corollary 3.. t For this deﬁnition to make sense. The number Euler number e is the unique number such that e ln e = 1 or. Hint: Using the partition 1 = x 0 < 2 = x1 < 3 = x2 < 4 = x3 . Observe that ln(en ) = n and ln(1/en ) = −n for all natural numbers n. observe that ln 1 = 0 < 1 < ln 4. for any given x.e. For every real number x there exists exactly one positive number y.146 (3) ln(x/y) = ln x − ln y for all x. So all integers (whole numbers) are values of the natural logarithm function. such that (3. every real number is a value of the function ln y. According to the Intermediate Value Theorem. numbers of the form r = p/q where p and q are integers and q = 0.16) that there is a number e for which ln e = 1. we have to show that there is a number e which has the property used in the deﬁnition. 4] so that Sl > 1. the equation ln y = x has at most one solution. Proposition 3. i. CHAPTER 3.42. y > 0.40. The natural logarithm function ln x is a diﬀerentiable. Exercise 54.25) ln y = x Proof. This means that. ∞) and range (−∞. it follows from the Intermediate Value Theorem (see Theorem 1. . ﬁnd a lower sum Sl for the function 1/t over the interval [1. 1 dt = 1. ∞). We can now deﬁne the Euler number: Deﬁnition 3. In summary. INTEGRATION Exercise 53. It also follows that 1 < e < 4. Every real number x lies between two integers. equivalently.41. Because ln x is diﬀerentiable.

the deﬁnition of e in Deﬁnition 3.3. and adding some observations which we have made elsewhere. ∞). THE EXPONENTIAL AND LOGARITHM FUNCTIONS We are now ready to deﬁne the exponential function. we have: Proposition 3.26). you need to show that (3. i.e. with domain (−∞.. called the exponential function. ∞).26) ln(exp(x)) = x. increasing function with domain (−∞. Summarizing this discussion.43. exp(ln(y)) = y . Show that exp(r) = er for all rational numbers r. ∞) and range (0. Exercise 56. This assignment (mapping x to exp(x)) deﬁnes a function. and that the exponential and logarithm functions are inverses of each other. 147 Deﬁnition 3. Show for all real numbers x and y that: (1) exp(0) = 1 (2) exp(1) = e (3) exp(x) exp(y) = exp(x + y) (4) 1/ exp(y) = exp(−y) (5) exp(x)/ exp(y) = exp(x − y). exp (x) = exp(x). and the exponential function is its own derivative.e. Hint: Use Exercise 54 and that the exponential and logarithm functions are inverses of each other. we deﬁne exp(x) to be the unique number for which (3. Exercise 57. ∞) and range (0. In addition to the equation in (3. ∞). The exponential function exp(x) is a diﬀerentiable.. Hint: Use the results of Exercise 52. Given any real number x.13. i.40.27) for all y ∈ (0. Show that the exponential function exp and the natural logarithm function ln are inverses of each other. Exercise 55. y = exp(x) is the unique solution of the equation ln(y) = x.44.

Suppose a > 0 and a = 1. Set (3.28) ex = exp(x). and it deﬁnes what we mean by raising e to any real power. We now expand the discussion to other bases. Deﬁnition 3. This is consistent with the meaning of the expression for rational exponents due to Exercise 57. ∞). (b) loga (expa (x)) = x for all real numbers x. (2) loga (x) and expa (x) are diﬀerentiable functions. For the exponential function expa we use the domain (−∞. Suppose a > 0 and a = 1. the speciﬁcations for the domains and ranges for the functions expa and loga and the results from Exercise 59 tell us that Corollary 3. 3.1 Other Bases So far we discussed the natural logarithm function and the exponential function with base e. Let a be a positive number.29) loga x = ln x ln a and expa (x) = exp(x ln a). Taken together.45. ∞) and range (0. Show (1) ln a > 0 if a > 1 and ln a < 0 if 0 < a < 1. If r = p/q then we raise e to the r-th power and take the q-th root of the result. . ∞). Exercise 59. ∞) and range (−∞. For an arbitrary real number we set (3.46. a = 1. (3) loga (x) and expa (x) are increasing functions if a > 1. For the function loga we use the domain (0. INTEGRATION The expression er makes sense only if r is a rational number. Show that (a) expa (loga (y)) = y for all y > 0. We call loga (x) the logarithm function with base a and expa (x) the exponential function with base a. The functions expa and loga are inverses of each other. (4) loga (x) and expa (x) are decreasing functions if 0 < a < 1. Exercise 58.13.148 CHAPTER 3.

Show the exponential laws: (1) expa (0) = 1 and expa (1) = a (2) expa (x) expa (y) = expa (x + y) (3) 1/ expa (y) = expa (−y) (4) expa (x)/ expa (y) = expa (x − y). y > 0. 149 We rephrase a convention which we made previously for e. (c) loga (1/y) = − loga y for all y > 0. Suppose a > 0 and a = 1.30) ax = expa (x). If r = p/q then we raise a to the r-th power and take the q-th root of the result. and it deﬁnes what we mean by raising a to any real power. a = 1. and r is a rational number.28 if we set a = e. We can now state an equation which is typically considered to be one of the laws of logarithms: Exercise 63. (b) loga (xy) = loga x + loga y for all x. Show loga (ar ) = r and expa (r) = ar . Suppose a > 0 and a = 1. .13. Then loga (xz ) = z loga (x). y > 0. It is also a standard convention to set 1x = 1 and 0x = 0 for any real number x. Typically 00 is set 1. Suppose a > 0.3. Suppose a > 0. Equation 3. and z is any real number. Suppose a > 0 and a = 1. This is consistent with the meaning of the expression for rational exponents due to Exercise 62.30 specializes to the one in Equation 3. Show the laws of logarithms: (a) loga 1 = 0 and loga a = 1. Exercise 61. x > 0. For an arbitrary real number we set (3. The expression ar makes sense if r is a rational number. THE EXPONENTIAL AND LOGARITHM FUNCTIONS Exercise 60. a = 1. (d) loga (x/y) = loga x − loga y for all x. Exercise 62.

and this function has all of the properties called for in the theorem. Proof.150 CHAPTER 3. We have to show the uniqueness statement. called the exponential function with base a and denoted by expa (x). Suppose f (x) is any monotonic function and f (r) = ar = expa (r) for all rational numbers r. there in only one such function. We are ready to prove Theorem 3. We have to show that f (x) = expa (x) for all real numbers x. a = 1. which is deﬁned for all real numbers x such that expa (x) = ax whenever x is a rational number.47. .e. That settles the existence statement.10. There exists exactly one monotonic function. INTEGRATION We are now ready to ﬁll in the details for one of the major statements which we made in Section 1. We leave the veriﬁcation of this assertion to the reader. Here one uses that f (x) and expa (x) are monotonic. i. In this section we constructed the function expa (x).. Let a be a positive number. and that expa (x) is continuous.

We collect some formulas relating these functions. Observe that the ratio referred to in the deﬁnition does not depend on the radius of the circle. that the circumference of a circle of radius r is 2πr. but any two radian measures of the angle diﬀer by an integer multiple of 2π. 151 . Deﬁnition 4. the radian measure of the angle α is not unique. These are the functions sine. 0) and ends at p. We like to ﬁnd the radian measure of the angle α. We may also consider arcs which wrap around the circle several times before they end at p. In this sense. Stated diﬀerently it says. The number π is the ratio between the circumference of a circle and its diameter. It requires some work to introduce the idea of the length of a curve in a mathematically rigorous fashion. Then (4. Consider an angle α between the positive x-axis and a ray which originates at the origin of the coordinate system and intersects the unit circle in the point p. The − sign is used if it proceeds clockwise. and suppose its length is s. The + sign is used if the arc goes counter clockwise around the circle. and measure how long it is. cosine. al.Chapter 4 Trigonometric Functions In this section we discuss the radian measure of angles and introduce the trigonometric functions. We take a practical approach to measuring the length of an arc on this circle. We imagine that we can straighten it out.1.1. Consider an arc on the unit circle which starts out at the point (1. It is shown in Figure 4.1) α = ±s (radians). tangent. This deﬁnition goes back to the Greeks. Arc Length and Radian Measure of Angles: Consider the unit circle (a circle with radius 1) centered at the origin in the Cartesian plane. et.

We construct the angle with radian measure t. Let α be the angle between the positive x-axis and the ray which starts at the origin and intersects the unit circle in p.1: The unit circle Conversely.2) x degrees = π x radians. 0) we travel the distance |t| along the unit circle (here |t| denotes the absolute value of t). sin t) -1 -0.5 -1 Figure 4.3) x(t) = cos t and y(t) = sin t. 0) we travel the distance |t| along the unit circle. one degree corresponds to π/180 ≈ 0.152 CHAPTER 4. and one radian corresponds to 180/π ≈ 57. .5 (cos t.017453293 radians. 180 Trigonometric Functions: Let t be once more a real number.29577951 degrees. Starting at the point (1. The measure of half a revolution (a straight angle) comprises π radians and 180 degrees. Comparison of Angles in Degrees and Radians: We suppose that you are familiar with measuring angles in degrees. In this way we reach a point p = (x(t). we travel counter clockwise if t is positive and clockwise if t is negative. So. This angle has radian measure t. By convention. and we set (4. counter clockwise if t is positive and clockwise if t is negative. y(t)) on the circle. Starting at the point (1. We have the conversion formula (4. let t be any real number.5 1 -0. TRIGONOMETRIC FUNCTIONS 1 0. In this way we reach a point p on the circle.5 0.

We may use a circle of any radius r.2 and 4. A small table with angles given in degrees and radians. This is really not necessary anymore because any scientiﬁc calculator gives those values to you with rather good accuracy. You can ﬁnd the graphs of the sine and cosine functions on the interval [0.3.9.153 This deﬁnes the functions sin t and cos t. To do this we return to Figure 4.5 1 2 3 4 5 6 1 2 3 4 5 6 -0.7.5 -0.1. cotangent (cot). You see the construction implemented in Figure 4. 0). y). (x.2: f (x) = sin x Figure 4. 1 1 0. Older calculus books may still contain tables with the values of the trigonometric functions.1.3: f (x) = cos x The other trigonometric functions.5 0. then this is indicated by ‘n/a’.8 and 4. 2π] in Figures 4. 0) and (x.4) tan x = sin x cos x cot x = cos x sin x sec x = 1 cos x csc x = 1 sin x To make sure you have some idea about the behavior of the tangent and cotangent function we provided two graphs for each of them. They are drawn over diﬀerent parts of the domain to show diﬀerent aspects. The . and cosecant (csc) are deﬁned as follows: (4. secant (sec). Trigonometric Functions deﬁned at a right triangle: Occasionally it is more convenient to use a right triangle to deﬁne the trigonometric functions. You see a right triangle with vertices (0.5 -1 -1 Figure 4. See Figure 4. tangent (tan). If the functions are not deﬁned at some point.1. as well as the associated values for the trigonometric functions is given in Table 4.4 to Figure 4. and there are books which were published for the speciﬁc purpose of providing these tables. You can see the graphs of the secant and cosecant functions in Figure 4.

π] .5 1 -20 -1 -40 -2 Figure 4.5 -3 -2 -1 -20 1 2 3 -0.5: tan x on [−1.5 0.5 -40 -1.4: tan x on [−π.1] 40 1.5 -1 1. π + 1] 2 2 20 20 10 10 -3 -2 -1 1 2 3 -3 -2 -1 1 2 3 -10 -10 -20 Figure 4. π] Figure 4.5 1 20 0.8: sec x on [−π.1. 1.9: csc x on [−π.7: cot x on [ π − 1. TRIGONOMETRIC FUNCTIONS 40 2 20 1 -3 -2 -1 1 2 3 -1 -0.5 Figure 4. π] Figure 4.6: cot x on [−π. π] Figure 4.5 2 2.154 CHAPTER 4.

so that you can look them up whenever needed.155 degrees radians sin x cos x tan x cot x 0 30 45 60 90 120 135 150 180 0 π/6 π/4 π/3 π/2 2π/3 3π/4 5π/6 π 0 1 2 √ 2 2 √ 3 2 √ √ sec x √ 2 3 3 csc x n/a 2 √ 2 1 3 2 √ 2 2 1 2 √ 0 3 3 n/a √ 3 √ 1 1 √ 3 n/a √ − 3 √ − 33 1 3 3 √ √ 2 2 1 −2 √ − 2 −1 √ 2 3 3 1 0 √ 2 2 √ − 23 0 − n/a √ 2 3 3 3 2 √ 2 2 1 2 −1 2 3 3 − −1 −1 √ √ − 3 −233 n/a √ 2 2 n/a 0 −1 0 Table 4. The following identities are obtained from elementary geometric observa- . Some of them you should know. 0) and the hypotenuse has length r. Then opposing side hypothenuse opposing side tan α = adjacent side hypothenuse sec α = adjacent side sin α = cos α = adjacent side hypothenuse adjacent side cot α = opposing side hypothenuse csc α = opposing side Trigonometric Identities: There are several important identities for the trigonometric functions. others you should be aware of. From the theorem of Pythagoras and the deﬁnitions you obtain (4. Let α be the angle at the vertex (0. sec2 x = 1 + tan2 x. In the following the words adjacent and opposing are in relation to α.1: Values of Trigonometric Functions right angle is at the vertex (x. csc2 x = 1 + cot2 x.5) sin2 x + cos2 x = 1. 0).

6) (4.11) sin(α + β) = sin α cos β + cos α sin β sin(α − β) = sin α cos β − cos α sin β cos(α + β) = cos α cos β − sin α sin β cos(α − β) = cos α cos β + sin α sin β tan α + tan β tan(α + β) = 1 − tan α tan β tan α − tan β tan(α − β) = 1 + tan α tan β These formulas specialize to the double angle formulas (4.8) (4.9) (4. or even derived.16) sin2 α = 1 [1 − cos 2α] 2 and cos2 α = 1 [1 + cos 2α] 2 .13) (4. (4.14) (4.15) sin α sin β = sin α cos β = cos α cos β = 1 [cos(α − β) − cos(α + β)] 2 1 [sin(α − β) + sin(α + β)] 2 1 [cos(α − β) + cos(α + β)] 2 which specialize to the the half-angle formulas (4.12) sin 2α = 2 sin α cos α and cos 2α = cos2 α − sin2 α From the addition formulas we can also obtain (4.7) (4.10) (4.156 CHAPTER 4. TRIGONOMETRIC FUNCTIONS tions using the unit circle. the following addition formulas in precalculus. sin x cos x cos x sin x = sin(x + 2π) = cos(x + 2π) = sin(x + π ) 2 = − cos(x + π ) 2 = sin(π − x) = − cos(π − x) = − cos(x + π) = − sin(x + π) = − sin(−x) = cos(−x) = − sin(x + 3π ) 2 = cos(x + 3π ) 2 You should have seen.

Behavior, Disease, And Health in the Twentieth-Century United States

Ethics Cheat Sheet Final

Christensen - Ragtime Instruction for Piano

Magnus - Logic

Cells - Molecules and Mechanisms

Cantey - Calculus Memory Book

Common Derivatives and Integrals

Cantey - Calculus Memory Book

Carr - A Synopsis of Elementary Results in Pure Mathematics

Dounis - Daily Dozen

Pasquali - Thorough Bass Made Easy

Rimsky-Korsakov - Manual of Melody

Rimsky-Korsakov - Manual of Orchestration

Rameau - Treatise of Harmony

Rameau - Treatise of Harmony

- Read and print without ads
- Download to keep your version
- Edit, email or read offline

Xenakis_ Iannis - Formalized Music. Thought and Mathematics in Composition

Calculus

polymorphism

Chaos&Dynamics

introduction.to.linear.algebra.D.Norman.1995

The_Thirteen_Books_of_Euclid_s_Elements

Longitudinal Data Analysis

SPSS Statistcs Base User's Guide 17.0

Solutions_Manual_-_Mathematics_for_Physicists_2ed_by_Lea(高物數解答)

Generalized Linear Models

Business_Statistics_Project2_2010

Gelfand - Lectures on Linear Algebra

ExplainingLogarithms

schaumselemalg

geometry

Fun With Mathmatics

Fields and Galois Theory

Bullinger EW - Number in Scripture

SPSS for beginner 428pages

Functions

Real-Life-Math-Everyday-Use-of-Mathematical-Concepts

Mathematics Gr12

Handbook of Time Series Analysis, Signal Processing, And Dynamics

MI lecture notes

DifferentialEquationSolvingWithDSolve

des-book-lulu

calculus

Theory of Vortex Sound (Cambridge Texts in Applied Mathematics) by M. S. Howe

curvilinear

Practicing Sabermetrics

Are you sure?

This action might not be possible to undo. Are you sure you want to continue?

CANCEL

OK

You've been reading!

NO, THANKS

OK

scribd