Professional Documents
Culture Documents
1
i f ij i il^ 1 ■T3
■ ■ IH 11 1 B m VB
E M. Wilkes
INTERNATIONAL
THOMSON
BUSINESS PRESS
London • Bonn • Boston • Johannesburg • Madrid • Melbourne • Mexico City • New York • Paris
Singapore • Tokyo • Toronto • Albany, NY • Belmont, CA • Cincinnati, OH • Detroit, MI
Mathematics for business, finance and economics
All rights reserved. No part of this work which is copyright may be reproduced
or used in any form or by any means - graphic, electronic, or mechanical,
including photocopying, recording, taping or information storage and
retrieval systems - without the written permission of the Publisher, except in
accordance with the provisions of the Copyright Designs and Patents Act
1988.
Whilst the Publisher has taken all reasonable care in the preparation of this
book the Publisher makes no representation, express or implied, with regard to
the accuracy of the information contained in this book and cannot accept any
legal responsibility or liability for any errors or omissions from the book or the
consequences thereof.
Products and services that are referred to in this book may be either
trademarks and/or registered trademarks of their respective owners. The
Publisher/s and Author/s make no claim to these trademarks.
ISBN 1-86152-241-X
http://www.itbp.com
Contents
Preface ix
Chapter 3 Matrices 77
3.1 Introduction 77
3.2 Some fundamentals 78
3.2.1 Labelling of matrices 78
3.2.2 Dimensions of matrices 78
3.2.3 The elements of matrices 79
3.2.4 Sub-matrices 79
3.2.5 The principal diagonal 80
3.2.6 Triangular matrices 80
3.2.7 Diagonal matrices 80
3.2.8 Zero matrices 81
3.2.9 Matrix notation 81
3.2.10 Equal matrices 82
3.2.11 Transposition 82
3.3 Addition, subtraction and scalar multiplication 85
3.3.1 Matrix addition 85
3.3.2 Subtraction of matrices 86
3.3.3 The identity element for matrix addition 87
3.3.4 Scalar multiplication 87
3.4 Matrix multiplication 89
3.5 Matrix inversion 97
3.5.1 The determinant 99
3.5.2 The adjugate matrix 100
3.5.3 The inverse 101
3.6 Simultaneous equations (i) 105
3.7 Rank 111
3.8 Higher-order systems: determinants and the inverse matrix 115
3.9 Simultaneous equations (ii) 119
3.10 Cramer’s rule 123
3.11 Concluding remarks 125
iv Contents
4.2.1 One linear and one quadratic equation 142
4.2.2 Two quadratic equations 143
4.2.3 Inequalities 145
4.2.4 General form 147
4.3 Cubics and quartics 149
4.3.1 Cubics 149
4.3.2 Quartics 153
4.4 Polynomials 156
4.5 Rational functions 157
4.6 Other functions 163
4.6.1 Composite functions 163
4.6.2 Inverse functions 164
4.6.3 Continuous and discontinuous functions 167
4.6.4 Discrete functions 169
4.6.5 Homogeneous functions 170
4.6.6 Monotonic functions 171
4.6.7 Odd and even functions 172
4.6.8 Periodic functions 172
4.6.9 Absolute value functions 173
VI Contents
9.5 The derivative of natural logarithmic functions 373
9.6 Elasticity 377
9.7 Natural logarithmic functions and integration 383
9.8 Integration by parts 386
9.9 Logarithmic and exponential functions to bases other than e 390
9.10 Aggregate sales curves 396
Appendix 401
A.l Numbers 401
A.2 Sets 402
A.2.1 Operations on sets 404
A. 3 Exponents 406
A.3.1 The product rule 406
A.3.2 The power rule 409
A.3.3 The quotient rule 409
A. 4 Absolute values 411
A. 5 Place value notation 412
A. 6 Precedence in arithmetic operations 415
A. 7 Scales of measurement 417
A. 8 Summation and product notation 420
A.8.1 Summation notation 420
A.8.2 Product notation 423
A.9 Sequences and series 423
Index 527
Contents VII
»lObMn >ji< 4tiaui>^ Vv ^ 'j4’I F .V 259
.y *».<* 239
m>(Ur!%a?til h<\«i C.O .241 9
AKf i
V 243
»iwd; Toflkt »«iiKt o^ 4J»iuUwi^4 h:>' i yvoa * ‘'''‘*' i»,V 247
<W»F
_ r T ^:. ^^
•,r4 r ' ■•-• 250
lOK - I t . •«. .. - . ... ..., •( KiKiw^qA 2M>
’ ^ ,1^ i* ’.v* I KA 258
^04* r.*» ■‘- ^A;r^'> v.v ; 2M
r*<)f •:Jj f.t.A 2^»7
•iKH f.i i . - Ji*.::.'-i.A., 270
iVM ‘4 17 A 272
«» ♦ T f ^ ^ . ■^*'^'A 275
4 ? ,4iu>W) •••,‘%ti / ' / 274
m 4 k ., . . r. Ht vh^ij^A. 275
*a- 7; ■ . K, -. 289 »
298 F4
UPtNWJli
I fl
.Ik
1
Preface
E M. Wilkes
June 1998
Preface
CHART E R
LEARNING OUTCOMES
By the end of the chapter you should be able to plot lines with given
equations, express the lines in the most suitable form of equation and
find the equation of any line with two known points or other known
properties. You will have covered material useful in linear
programming, the calculus and other contexts and will have laid
foundations which will be built on in later chapters. You will also be
able to manipulate and solve systems of linear simultaneous equations
in two, three or more variables. You will have seen a number of linear
models, learnt of their use in business and economics, and seen how
the models can be adapted to changing circumstances.
1.1 The simplest ideas are usually the most valuable and most widely used.
Introduction This is only partly due to the fact that simpler concepts are likely to be
more widely known. It also results from the fact that basic techniques
are likely to be of more practical value than their complex cousins. This
principle is nowhere more evident than in the case of straight lines and
linear equations, which are widely used in the social sciences and in
management science and business studies. Among the variety of
applications that we will examine are breakeven analysis, depreciation,
stock control, forecasting, investment income generation, and market
supply and demand. Linear expressions occur frequently in these and
other contexts. But we begin with the representation of algebraic
expressions in diagrammatic form. It is of great value to be able to
show equations in an understandable way and also to have the option
of using algebraic techniques on a problem originally seen in
geometrical terms.
Figure 1.1
The real line contains all the real numbers, with no gaps unfilled. There
is a one-to-one correspondence between the points on the real line and
the real numbers which are the co-ordinates of the points. The real
line is a single co-ordinate axis which can be used to make
comparisons between numbers in terms of algebraic size, the larger
numbers being to the right. For example, 3 is to the right of 2, and —2
is to the right of —3. The line can also be used to show intervals (the
numbers lying between specified limits) and other subsets of the real
numbers. All of this work relates to a single dimension.
The usefulness of co-ordinates is greatly expanded when a second
real line is introduced at right angles to the original and passing
through zero. The two lines are co-ordinate axes and the resulting
system of rectangular or Cartesian co-ordinates allows all points in
two dimensions (the plane) to be uniquely located by two co-ordinates.
All points on the plane could be labelled using any intersecting pair of
straight lines as axes, but when the lines intersect at right angles we
speak of a Cartesian co-ordinate system. The x axis is sometimes called
the abscissa and the y axis the ordinate. The co-ordinates represent
an ordered pair of numbers in which the horizontal or x co-ordinate
value is stated first and the vertical or y co-ordinate value is stated
second. These are matters of convention rather than logical necessities.
The two axes do not have to be measured to the same scale or in the
same physical or monetary units. In Figure 1.2 the rectangular co¬
ordinate system is shown and points Pi, P2, P3 and P4 are indicated,
with the following co-ordinates:
Figure 1.2
Once you have identified points in the plane, three questions arise:
d=^(4-2)'+(3-l)'
= ^8
d=^[2-(-2)]2 + (-3-l)'
= ^32
Two special cases are included in equation (1.1). In the case where the
two points are on the same vertical line the formula simplifies to give
the difference between the y co-ordinates, and where the two points are
on the same horizontal line the distance is the difference in the x co¬
ordinates. The results for distances between the points are as follows:
Points Distance
P1P2 2.828427
P1P3 4
P1P4 4
P2P3 6.324555
P2P4 2
P3P4 5.656854
All lines other than vertical lines have a defined slope. The slope of a
straight line connecting two points is the ratio of the change in y
values to the change in x values. The slope of the line joining P2 and
Pi is
, 3-1
slope = = +1
-3-1
slope = = -1
2-(-2)
If the co-ordinates of the two points are (xi, yi) and (x2, yj), then the
slope is
slope =
yi - yi
X2 - Xi
It does not matter which of the points is taken to be first - the result is
the same for distance and slope. For distance the differences are squared,
and for slope both numerator and denominator of the ratio are reversed
in sign if the order of the points is changed. In any slope calculation
there must be consistency as to which point is considered as first in
taking values for x and y. The slope of P1P3 is zero since there is no
change in the y value, while the slope of P1P4 is undefined as division by
zero would be involved (the line is vertical). Full results are as follows:
P1P3 0
P1P4 undefined
P2P3 0.333333
P2P4 3
P3P4 -1
Figure 1.3
2 1
3 4
The first quadrant, where neither variable is negative, is often called the
positive quadrant. In the first and third quadrants the two variables
have the same sign, while in the second and fourth quadrants the signs
are opposite. Points in three dimensions can be identified uniquely if a
third axis, z, is introduced at right angles to the x, y plane. This is
usually shown as in Figure 1.4.
Note that there is a right angle between each pair of axes - as at the
cbrner of a cube. The co-ordinate system divides three-dimensional
space into eight orthants. Practical wbrk is usually confined to the
non-negative orthant. Although manual sketches in three dimensions
can sometimes be worthwhile, precise geometry is not normally
attempted by hand. Wire-frame diagrams of functions in three
dimensions can be produced by software such as Derive, Maple or
Mathcad, the use of which can enhance understanding of general
is given by
= V4+16 + 36
= ^56 7.48
Exercises 1.2
1 Given the following points:
Pi: (2,5) P2:(5,2) >3: (3,-2)
P4:(-2,-2) P5:(-1,3)
find the distance between the following pairs of points:
(i) Pi and P2 (ii) P2 and P3 (iii) Pi and P5
(iv) Pi and P4 (v) P3 and P5
2 For the points Pi to P5 in question 1, find the slopes of the
1.3 The equation of a straight line can be written in several ways. The
Straight lines - most convenient of these is often the slope-intercept form:
preliminaries
y = mx + c (1-2)
Figure 1.5
y = —0.5x -I-10
y = 1.2x — 5
y = constant (1-3)
y = 10
y = -5
is a horizontal line five units below the x axis. Even this most simple
form of equation has its uses. Vertical lines have an undefined slope
(informally referred to as infinite). For such lines the x value does not
change, and in the ratio that would otherwise define the slope, division
by zero would be involved. Since this is not permitted, the slope is
undefined. A vertical line can be written as
X = constant (1-4)
Thus X = —10 defines a vertical line that cuts the x axis ten units to
the left of the origin. Equations (1.3) and (1.4) themselves have special
cases of interest. The lines defined by
y = 0 and x = 0
are the equations of the x axis and the y axis, respectively. Equation
(1.2) has a further important special case where the intercept c is zero,
so the line goes through the origin. An equivalent way of expressing
this is to observe that when c is zero x and y always remain in the same
proportion, the factor of proportionality being m - as evidenced by the
fact that equation (1.2) can now be written as
y = mx + c
y = 4:X+10
y = —7 + 4:X
y = mx + c
and
y = —x + k
m
y = 2x — 4l
and
y = 3 — 0.5x
y = —0.5x + k
y = 100 — 0.5x
and
y = —0.5x
y = 2x + c
1.4 To use straight lines effectively some basic skills are necessary. In
Identifying straight relation to an individual straight line these include:
lines and equations
• obtaining the equation of a straight line from a literal description of
the relationship between the variables:
• plotting a straight line with a given equation;
1.4.1 The starting point for building models in business applications is often a
Obtaining the equation literal description of part of the overall problem and it is important to
of a straight line from a be able to make use of information in this form. For example suppose
literal description of the that the following statements are made;
relationship between
(a) Total costs are the sum of fixed and variable costs. Variable costs
the variables
accrue at the constant rate of £6 per unit produced and there are
inescapable costs of £10 000.
(b) The maximum price that can be charged for the product is £500
less lOp for each unit sold.
(c) Sales revenue is price times quantity. The market currently
determines price at £10 regardless of the number of units that we
sell.
(a) y = 10 000 + 6x
(b) y = 500-0.lx
(c) y = lOx
y = —4x + 20
The point where the line crosses the y axis is easiest to find. This is
done by setting x = 0 in the equation and solving for y. The result is
y = 20 and the line is known to pass through the point (0, 20). Next,
the X intercept is identified by setting y = 0 and solving for x. The
result is
0 = -4x + 20
4x = 20
i.e.
X= 5
So the point (5, 0) is on the line. Note that the intercept with the x axis
is a solution of the equation
mx + c = 0
and the x intercept therefore occurs at the point x = —cjm. Once you
have identified two points the straight line passing through them can be
drawn in. For the current example this is shown in Figure 1.9.
Figure 1.9
The exceptional case is where the straight line passes through the
origin so that x and y intercepts are the same. In this case a further
point is generated by putting a value for x into the equation, and the
point with the resulting co-ordinates is joined to (0, 0). For example, if
the line is
y = —4x
y = mx + c
y = 2x + c
5 = 2(3)+ c
5 = 6 -I- c
c = —1
y = 2x — 1
1 the line with slope —3, passing through the point (3, 8);
2 the line with zero slope, passing through the point (4, 5):
3 the line with undefined slope passing through (5, —6).
y = —3x -f c
and, given that the point (3,8) must satisfy the equation, c must be
such that
8 = -3(3)-Fc
so that
c= 8 9
= 17
y — —3x +17
In example 2 the slope is zero, so the equation of the line must read
y = 0x + c
5 = 0(4) -f- c
c= 5
y = 5
X = Oy + c
5 = 0( — 6) + c
c= 5
so
X = 5
1.4.4 If two points on the straight line are given the co-ordinates of these
Finding the equation of points inserted into the straight-line equation produce simultaneous
o line through two equations in m and c - one equation corresponding to each point. The
known points equations are then solved for m and c. Methods for solving
simultaneous linear equations will be discussed later; for now we will
consider two examples solved by elimination. The problem is: what
line is determined by the points (2, 13) and (—1, 4)?
The two parameters m and c must fit the data given. Thus with
y = mx + c
13 = m(2) + c
that is
13 = 2m + c
and
4 = m( —1) + c
that is
4 = —m + c
If, term by term, the second equation is subtracted from the first c
disappears, leaving
which simplifies to
9 = 3m
m = 3
y = 3x +7
9 = 3m + c
and
-15 = -m + c
subtraction gives
24 = 4m
m = 6
Substitution of this value into either of the original equations yields the
result that c = -9, so the equation of the tine is
y = 6x — 9
1.4.5 This is important when the line represents a resource limitation - for
Establishing on which example the maximum amount of capital available - or a measure of
side of o line o given performance such as return for given risk in portfolio theory. We will
point lies usually want to know if the y co-ordinate of the point is greater or
less than that given by the equation. Thus given the point (1,8) and
the line
y = 4iX + 2
Exercises 1.4
1 Find the equations of the following lines:
(i) slope = 4, passing through the point (5, 25);
(ii) slope = —5, passing through the point (18, 10);
(iii) horizontal, passing through the point (7, 10);
(iv) vertical, passing through the point (8, 4).
2 (i) For o particular straight line passing through the origin, if
X changes, then y changes by half the amount but in the
opposite direction. What is the equation of the line?
(ii) What is the equation of the line that passes through the
points (2, 10) and (—1,1)?
3 (i) What is the equation of the straight line passing through
the points (1,7) and (2, 1 1)?
(ii) For a given line, if x changes in value, then y changes by
the same amount in the opposite direction. The line passes
through the point (4, 6). What is the equation of the line?
4 (i) What is the equation of the straight line passing through
the points (2,-1) and (4, 9)?
(ii) What is the equation of the straight line passing through
the point (1,3) and which is orthogonal to the line:
y = 4 - 0.5x?
1.5 In forecasting with simple linear regression a straight line of best fit
Two applications is calculated using observations of corresponding values of two
variables. The equation can be written as
1.5.1
Forecasting Y = a + bX
The intercept term, a, and the slope coefficient, b, are set at values
which together match the data as closely as possible. With X as the
independent variable, there should be grounds for supposing that
changes in X bring about changes in the dependent variable Y - there
should be a causative relation between the variables. For example, X
may represent income and Y expenditure, or X may be the outlay on
sales promotion, while Y represents total sales revenue. The
independent variable often represents time, either abstract time
(t = 0, 1, 2 ...) or calendar date (2000 etc.). To illustrate, suppose that
observations of the level of a company’s turnover (sales revenue) are
related to time in the following manner:
Y = 45 + 3.8X
inventory of 525 reams of paper, which are used at the rate of nine
reams per day. Due to delay between ordering and delivery - lead-time
- orders for replenishment of stock are placed when stock on hand falls
to 150 units. The relationship between inventory level, I, and the
number of working days, t, is linear, with the intercept term being the
opening inventory and where the slope is given by the rate of
withdrawal from stock. Thus
I = 525 - 9t
is the equation of stock against time. The equation can be used to find
the number of days before an order for replenishment is placed. This is
t such that:
1 = 525 - 9t = 150
so that
9t = 375
t = 41.67
So the order should be placed during the forty-second working day. The
graph of inventory against time is shown in Figure 1.12.
Note that, ignoring any practical problems involving fractions of
days, a stockout would occur if lead-time exceeded 150/9 = 16.67
days. The level of stock which triggers a reorder is known as the
reorder level or reorder point (150 here).
Exercises 1.5
1 A company's profits in year t are forecast by linear regression
to be 7r(t) where: 7r(t) = 80 + 5t and t = 0 corresponds to
1992.
(i) What profit is given by the equation for
(a) 1994 (b) 1997
(ii) Assuming that the relationship was valid at the time, what
would be the estimate of the profit figure for 1989?
2 A company has an opening stock of 1000 units of an item,
the stock level being depleted by forty units per day's trading.
An order for replenishment of stock is placed when the level of
inventory is sufficient to cover six days' demand.
(i) Write down an equation showing stock level against the
number of days' trading.
(ii) Given the current policy, on what day should a
replenishment order be placed?
y = 2x + 3 (1.7)
y-{-7) = 2[x-{-5)]
that is
So, as before.
y = 2x + 3
The point-slope form arises from the fact that given one point (xi, yi)
on a particular line, any other unspecified point (%, y) on the same line
must be such as to result in the same slope as given by
X — Xi
Figure 1.13
Ax + By + C = 0 (1-10)
2x — y + 3 = 0 (1.11a)
-2x + y-3 = 0 (1.11b)
Ax + By + Cz + D = 0 (1-12)
ax + by = c
This is often the way that a linear equation arises, for example as a
constraint. The equation written in slope-intercept form as
y = 2x + 3
2x — y = —3
—2x + y = 3
—2x + y = 3
does the point (—4, — 6) lie above or below the line.? The point is below
the line since when x = — 4 is put into the equation the resulting value
of y is —5. Since this is greater than the y co-ordinate of —6, the point
lies below the line. Equations in which the constant term and no
variable terms are on the right-hand side often occur in problems with
resource limitations. For example, suppose that x and y represent the
number of units purchased of each of two types of security, the prices of
5x + 4:ij = 2000
Exercises 1.6
1 Use the point-slope form to express the equations of the
following lines;
(i) slope 3 passing through the point (5, 8);
(ii) slope -6 passing through the point (8, 12);
(iii) slope —0.5 and y intercept at —20;
(iv) a 45° slope passing through the origin.
2 Express, in general form, the equation of the straight line that
passes through the points (5, 5) and (-5, 35).
3 A straight line with slope 4 passes through the point (5, 25).
Express the equation of the line in:
(i) point-slope form;
(ii) slope-intercept form;
(iii) general form.
4 For the line given by: —5x + 2y = 20, is the point P(8, 32)
located on, above or below the line?
1.7 For two straight lines, L and M, exactly one of the following
Straight lines and relationships holds:
simultaneous
1 L and M have all their points in common;
equations
2 L and M have none of their points in common:
3 L and M have exactly one point in common.
L: y = 7.5 - 0.25x
M: 2x + 8^ — 60 = 0
L: y = —2,x 5
M : y = -2x + 7
In Figure 1.14 (c) the lines have different slopes. This is the important
distinction between this case and cases (a) and (b). Straight lines of
different slope must cross once and once only. For example, the
equations of the lines in Figure 1.14(c) might be
L: y = -2x + 10
M: 1/ = -X + 8
The one point that the lines have in common when both equations
are satisfied is (2, 6). This point of intersection corresponds to the
simultaneous solution of the equations defining the lines. For a unique
solution of linear equations to exist, the equations have to be both
independent and consistent. Two equations are independent if it is
not possible to obtain one equation from the other by multiplication
through by a non-zero constant. Thus cases (b) and (c) represent lines
with independent equations. Consistent equations are those which can
be simultaneously satisfied. Thus cases (a) and (c) represent lines with
consistent equations. Clearly then, only case (c) corresponds to
equations which are both independent and consistent. Table 1.1
summarizes the properties of the three cases above.
Table 1.1
Case Independent Consistent No. of solutions
(a) * infinite
* 0
(b)
* * 1
(c)
1.7.1 There are several ways in which systems of linear equations can be
Solving simultaneous solved. At this point we will introduce two simple methods:
equations substitution and elimination.
3x + y = 11 (1.13)
X + 2y = 12 (1.14)
y = ll-3x (1.15)
3C-f2(ll - 3x) = 12
X -f- 22 — 6x = 12
and therefore
-5x = -10
X = 2
y = ii-3(2)
y = 5
X = 12 — 2y
36 - 6y + y = 11
36-5y = 11
-Sy = -2S
and therefore
y=5
7x + 4:y — 51 = 0
—3x + 7y + 4:8 ==0
4y = 51 - 7x
y = 12.75 - 1.75x
so:
15.25X = 137.25
X =9
6x + 2y = 22
6x + 2y = 22
X + 2y = 12
5x = 10
X = 2
3x + y = 11
3x + 6y = 36
-5y = -25
y = 5
Example I 5x — 2y = 26
—X — 4i/ = 8
X = —4i/ — 8
5{ — 4y — 8} — 2y = 26
so that
-20y - 40 - 2i/ = 26
—22y= 66
x = -4(-3)-8 = 4
Example 2 3x + 8y = 64
—2x + 2y = 16
3x + 8y = 64
—8x + 8y = 64
llx = 0
Example 3 3x + y = 11
6x + 2y = 24
6x + 2y = 22
6x + 2y = 24
0 = -2
Example 4 3x + y = 11
— 12x — 4i/ = —44
y = 11 — 3x
therefore:
0 = 0
which means that the two equations are not independent (the second is
—4 times the first). So there are an infinite number of solutions in this
case, but not anything will do in terms of % and y values; ‘most’ points
are not solutions. Only points having co-ordinates satisfying the
relationship y = 11 — 3x will fit. All other points are off the line. Now
consider a practical illustration of simultaneous equations in personal
financial management.
Financial management On retirement a manager expects to receive a pension plus a lump sum
example of £60 000. He wants to obtain an annual income of £7000 by
investing the lump sum in loan stock yielding 13% and a deposit
account yielding 9%. What amounts should be invested in each case.?
The problem produces a pair of simultaneous equations which can be
solved by elimination or substitution to give the required amounts. If %
is the sum invested in loan stock and y is the amount placed on
deposit, then, assuming nothing is held as cash,
x + y = 60 000
y = 60 000 — %
and so
X = 40 000
so
y = 20 000
x + y = 60 000
0.12x + 0.08z/ = 7000
y = 60 000 — X
from which
0.04x = 2200
4x = 220 000
X = 55 000
X + y = 60 000
0.1 lx+ 0.071/ = 7000
0.07x4-0.07y = 4200
O.llx+ 0.071/ = 7000
-0.04x = -2800
so that
4x = 280 000
X = 70 000
y = -10 000
While the algebra does not rule out this outcome, an implicit
assumption in the use of this model would be that neither x nor y is
negative. So unless the investor has an unlikely facility to borrow at
7%, the income target can no longer be achieved even if the entire
lump sum was invested in the potentially more lucrative (and higher-
risk) security.
Exercises 1.7
Solve the following simultaneous equations using elimination:
(i) 4x + 3y = 110
2x +y =:50
(ii) 5x — y =:20
3x + 2y = 12
(iii) 6x — 5y = 16
4x + 2y = 0
(iv) 0.4x + 5y = 7
2x- 0.5y = 4.4
Solve the following simultaneous equations using substitution:
(i) 2x + y =:20
7x + 3y = 66
(ii) X- 1 ly = 40
2x- 7y = 35
Solve the following systems using the method of your choice:
(i) y = 4x-35
2x + y = 25
(ii) X + 25 = 3.5y
x = 4.5y-33
4 Where possible, find solutions to the following systems of
simultaneous equations. Where there is not a unique solution,
say whether there are ony solutions and give the reason why
the situation occurs.
(i) 0.4x + 0.2y = 3.6
0.1x + 0.3y = 2.9
3x + y = 11
X + 2y = 12
3x + y = 11
7x + 4:y = 34
Here the second equation of the original system has been replaced by
the second equation plus twice the first equation. The following system
also has the solution x = 2, y = 5:
X — 3y = —13
X + 2y = 12
In this case the original first equation has been replaced by the first less
twice the second equation. As a final example, the following system:
X — 3i/ = —13
7x + 4y = 34
Figure 1.16
Note that any three or more of these equations in two variables have
the solution (2, 5). In general, for a system of three equations in two
variables to have a solution any one of the equations must be
obtainable from the other two by elementary row operations.
Elementary row operations are implicit in the simplex method of linear
programming, which is closely related to Gaussian elimination. Further
operations on equations are possible. For example, both sides of an
equation can be raised to the same power. Consider the following:
2x — 1 = x + 2
(2x- 1)^ = ix + 2f
which simplifies to
3x-30 = 0
X = 10
Here the single equation has a single solution. Now suppose that
Simplification produces
which reduces to
mx + c = 0
which solves for x = —c/m, a unique value except for the cases where
m = 0, c = 0 (where the line is the x axis) and m = 0, c ^ 0 (where no
solution exists). The value of x = —c/m represents a zero of the
following function:
y = mx + c
and is the root of the equation (an x value for which the corresponding
y value is 0).
This equation has an infinite number of solutions but this does not
mean that any pair of values satisfies the equation. Unlike an identity,
some X and y combinations are excluded. The equation imposes a
restriction that not all pairs of values satisfy. Only those pairs
corresponding to points on the line defined by the equation are
solutions. Some non-solutions can be usefully categorized: for example,
in this case there are no solutions in quadrant 3, where x and y are
both negative. In the case of lines going through the origin, apart from
the origin x and y are always the same sign for a line with positive
slope or always of opposite sign if the slope is negative. In finding
solutions to equation (1.16) either x or y may be freely chosen and a
specific value of the other variable results. There is one degree of
freedom in the setting of variables. In one linear equation in n
variables there are n — 1 degrees of freedom.
X y
0 12
2 7
4 2
Exercises 1.9
1 Find the value of x which satisfies the following equations;
(i) 5x + 3(x-8)+4(12-0.5) +16 = 100
(ii) 20 - 4x + 3(2x - 5) + 5(4x + 68) = 2(4x + 5) + 1 10
2 Comment on the following equations:
(i) 3(5x+10)-(8x + 50) = -5(8-2x)-3x
(ii) 6(4x + 2)-7(3x+1) =8(5-f 3x)-3(7x+11)-2
3 Given the equation: 8x -f 5y = 40 which of the following are
solutions, or are consistent with solutions to the equation:
(i) The point X = 2.5, y = 4
(ii) The point x = 0, y = 8
(iii) The point x = —2, y = 11.2
(iv) The point x = 5, y = 8
(v) Points on the line 0.5x + 0.3125y — 2.5 = 0
3x + 2y — z = 0
z + X +y = 34:
z— X + y = 26
z=3x4-2y
3x -\- 2y 4- X 4- y = 34
3x 4- 2y — X 4- y = 26
which simplify to
4x4- 3y = 34
2x4- 3y = 26
X = 4 y = 6
and insertion of these values into the expression for z gives the result
z = 24. The approach used above is as follows:
1 Use one equation to state one variable in terms of the other two.
2 Substitute for the chosen variable in the remaining two equations.
3 Use substitution or elimination to solve the two-variable system
that is left.
3x — 2z/ -f 4z = 110
2x -f 2i/ — 3z = —30
—2x + 6i/ -F 4z = 220
X = 3z/ + 2z — 110
7y + lOz = 440
8y + z= 190
7y + lOz = 440
8y + z= 190
z = 190 — 8y
-73y = -1460
y = 20
z = 190- 8(20) = 30
X = 10 y = 20 z = 30
2x + 3i/ + 4z = 90
4x + 6i/ + 8z = 150
5x + 2y + 3z = 100
the left-hand side of the second equation is twice that of the first, but
the right-hand side is 150 rather than 180. So the equations are
inconsistent and the system has no solution. But if the right-hand side
of the second equation had been 180 the equations would have been
consistent but not independent (the second equation now being twice
the first) and the system would have a continuum of solutions.
Similarly, a system that begins as one of two equations in three
variables will not have a unique solution. For example, if the following
equation:
z— X + y = 26
4x + 3^ = 34
and, of course,
z = 3x + 2y
Figure 1.17
The case shown in Figure 1.17 is one of three variables and two
independent and consistent equations, and shows the two planes
corresponding to the equations intersecting in the positive orthant. The
planes are not limited to the positive orthant, but only those parts of
the planes are illustrated. The points common to both planes, lying on
a straight line, are shown. If a third equation is added, and if this is
independent of the original two equations and consistent with them,
the third plane that this equation produces will intersect the line
common to the first two planes at a single point. The co-ordinates of
this point will be the values of the variables which uniquely solve the
three-equation system.
A system of three independent equations in two variables cannot be
consistent and therefore cannot have a solution. In general, a system in
2x + y = 13
x+y = 9
X + 2y = 11
Figure 1.18
(i) 2x + y = 13
(ii) x+y = 8
(iii) x + 2y = 11
X = 5 y = 3
2x + 2y + 8z = 64
(i) 5x + 1/ + 4z = 100
(ii) 2x + 2y — 8z = 64
y = 100 — 5x — 4z
2x + 200 - lOx - 82 - 8z = 64
so that:
or:
(iii) X = 17 — 2z
(iv) y = 15 + 62
X = 9 y = 39
emerge as the solution. The same unique solution results if either of the
following equations is added to the original system:
X 2y -(- 5z =11
3x — 4i/ + z = 67
Note that both of the above additional equations include the point
(9, —9, 4) and another of the family of planes associated with this
point would have done as well. The addition of any particular
equation says more about the necessary relationships between x, y
and z than does the selection of an individual value for z. The
usefulness of this arrangement, in which x and y are stated in
terms of z, is that when one of the variables, say z, is determined
outside the system the corresponding values of the in-system
variables can be worked out. Variables determined outside a system
are said to be exogenous, while variables determined within the
system are endogenous. Treating some variables as exogenous is
one way that an otherwise indeterminate system can be made to
work out. The equations in which each of the endogenous variables
is stated in terms of the exogenous variables are said to be reduced
form equations. Thus (iii) and (iv) above are reduced form
equations.
Economics provides a number of examples of systems with fewer
equations than unknowns. For example, in barter economies a system
with n goods can be solved for the n — 1 exchange ratios between n — 1
of the goods and an arbitrarily selected nth good or numeraire. If
‘accounting money’ is introduced and a specific price set for the
numeraire, equilibrium prices result for the remaining n — 1 goods.
A linear equation in three variables produces a plane in three
dimensions. But how can a straight line in three dimensions be
represented in algebraic terms.? One way is to note that a straight line
in three space satisfies two linear equations:
z = aix + biy + ci
z = ajx + b2y + C2
1.11 One of the most efficient ways to solve simultaneous linear equations is
Gaussian Gaussian elimination. This method uses the fact that the solution of
elimination the system, if there is one, is unaffected by the use of elementary row
operations. The method will be explained through numerical examples,
starting with the following system:
2x + 5i/ + z = 46
x + 3y + 2.52 = 45
2x + 4.5i/ + 42 = 74
First divide the first row by 2 in order to produce a unit coefficient for
X, giving
X + 2.5y + 0.5z = 23
0.5y + 2z = 22
Subtracting twice the new first row from the third gives
—O.Sy -I- 3z = 28
X + 2.5y + 0.5z = 23
0.5y + 2z — 22
—0.5y -t- 3z = 28
The first row will remain as it now is, and the process is repeated for the
second and subsequent rows. Multiplying the new second row by 2 gives
1/ -f 4z = 44
5z= 50
Now complete the process by dividing the third row by 5 (there are no
further rows to deal with) to give a unit coefficient for z;
z = 10
X + 2.5y + 0.5z = 23
y -|- 4z = 44
z= 10
For a system with a unique solution the value of the last variable is
given in the last row. If in eliminating y from the last equation z also
disappears, then if the right-hand side is not also zero the equations are
inconsistent. If the right-hand side value is zero, the equations are not
independent and there are infinitely many solutions.
In the current problem the values of the remaining variables are
obtained by back substitution. Putting z = 10 into the second equation
gives
y + 40 = 44
y = 4
X -F 10 -|- 5 = 23
so X = 8
x = 8 i/ = 4 z=10
1/ -f 42 = 44
X — 9.5z = —87
1/ -I- 42 = 44
5z= 50
The third equation is now divided by 5; 9.5 times the result is added to
the first equation and four times the result is subtracted from the
second equation, to give the neat result
X= 8
y = 4:
z= 10
X — 1/5-2 = 15
2x 5- 2i/ 5- 2 = 20
5x -T 4i/ — 2 = 0
X- y + z = 15
4i/ - 2 = -10
9y-ez = -75
y — 0.25z = —2.5
-3.75z = -52.5
z = 14
X — y + z = 15
y — 0.25z = —2.5
z= 14
y - 0.25(14) = -2.5
y=1
Substituting the obtained values of z and y into the first equation gives
X - 1 + 14= 15
so X=2
X = 2 y = 1 z=14
Exercises 1.11
1 Use Gaussian elimination to solve the following system:
X + 3y + 2z = 29
2x + 8y + 6z = 78
3x + 7y + 7z = 85
2 Solve the following system using Gaussian elimination:
2x + y + 2z = 19
4x + 3y + 6z = 37
2x + 2y + 3z = 26
1.12 An identity is an equation which is true for all values of its variables.
identities For this reason it is sometimes called an identical equation, as
distinct from a conditional equation (the form we have been dealing
with up to now), which is true for only certain values of its variables.
For an equation to be an identity it must be possible to cancel out all
terms involving variables leaving only the equation of two equal
constants. Where it is necessary to distinguish identities from equations
the symbol = is used in place of the equation symbol =. Identities may
involve expressions in several variables, and they arise in economics,
finance and management science. Any manipulation permitted for an
equation can also be applied to an identity. So elementary row
y= C+S
Now GDP can also be defined and measured from the expenditure side
as
Y = C+l
y= C+S
= C+I
thus
S=I
This identity states that actual savings always equal actual investment.
This does not mean that the levels of intended savings and investment
are always equal - the equality is a property of equilibria only. When
working with models of national income accounting of this kind the
identity sign is not always used. This will not lead to errors, but care is
necessary in drawing conclusions.
In production planning x and y might represent daily production
levels of two goods manufactured by a firm. A feasible production plan
(represented by a pair of values of x and y) will not call for the use of
more resources than are currently available to the firm each day. If
each unit of good x requires 20 minutes of machine time and each unit
of good y requires 13 minutes of machine time, then the total machine¬
time requirement for any production plan is given by
20x + I3y
s = 480-20x- 13i/
1.13 Here we consider two further examples of straight lines in business and
Further economics: breakeven analysis and market equilibrium. First
Applications consider the business application.
c= F+ y
V = bq
C = F + bq
R^pq
R = C
pq = F-hbq
ip -b)q = F
so that
(1.18)
For the revenue and cost lines to cross in the positive quadrant
product price must be strictly greater than unit variable cost. If this is
not the case, no breakeven output level exists. The relationship of q* to
qc is also important. If the breakeven output level is a high proportion
of plant capacity the financial viability of the enterprise will be at the
mercy of minor changes in parameter values such as unit variable cost
For a numerical example let p = £10, b = £6, F = £10 000 and
qc = 4000. Substitution in equation (1.18) yields
*
10 000
Q. = 2500
10-6
so the breakeven output level represents use of the plant at 62.5%
capacity. Note that if price had been p = 8, while this value of unit
revenue covers unit variable cost, the theoretical breakeven level,
q* = 5000, is not reached by the time that the firm runs into the
capacity limit. The breakeven formula can also be used to deduce the
consequences of changes in the parameter values, p, b and F. For
example, suppose that interest-rate changes, a wage agreement and
inflation of raw material costs result in new values of b = £6.80 and
F = £10 400. Two questions of interest would be:
*
10 000
= 3250
3.2
10 400
2500 =
p- 6.8
so that:
27 400
2 500
= £10.96
1.13.2 Analysis of the market for a good often uses linear supply and demand
Market equilibrium curves relating price to quantity supplied or demanded. A market is in
equilibrium at a price where the quantity of the good consumers wish
to buy is the same as the quantity that the price stimulates producers
to offer. In other words the market reaches equilibrium where the
supply and demand equations are solved simultaneously. Suppose that
in the market for a product supply and demand are given by the
following linear equations:
Demand: p = 1000 — 5q
Supply: p = 200 + 3q
1000- 5q = 200+ 3q
800 = 8q
q= 100
p = 500
1000-5q = 200 + 3^ + 40
760 = 8q
Therefore:
q = 95 p=525
so the market clearing price has gone up by £25 rather than the full
£40 of the tax. While consumers pay an extra £25 per unit, the firm is
receiving £15 less.
These relative magnitudes, 25 and 15, depend on the slopes of the
supply and demand curves, and give an indication of the comparative
burden of the tax born by consumers and producers. A further question
is the level of tax that would maximize tax revenue - a problem best
addressed by calculus methods.
2 Linear inequalities
LEARNING OUTCOMES:
By the end of the chapter you will have seen the usefulness of linear
inequalities in formulating management science and economic models.
You will have learnt how to simplify individual and simultaneous linear
inequalities and how to obtain the set of solutions. You will also be able
to solve small-scale linear programming problems using a graphical
method.
. W>Q (2.1)
In (2.1) the symbol > means ‘is strictly greater than’. In a society
^ where the law of scarcity did not apply the study of economics would
be reduced to an abstract exercise. Needless to say, no such society is
known to exist. In management science a frequently encountered
inequality relationship is that between the quantity of a resource
needed by a production plan and the amount of that resource available.
Linear inequalities 53
Consider an example in which x and y represent the daily output levels
of two products made by a firm. Each unit of the first product requires
20 minutes of machine time, while each unit of the second product
requires 13 minutes of machine time. There are 8 hours of machine
time available each day, and the amount of machine time called for in
any production plan should not exceed the available total. This
requirement can be stated as an inequality;
where < means ‘less than or equal to’, which has exactly the same
meaning as ‘should not exceed’ or ‘not greater than’. ‘Less than or
equal to’ is an example of a weak inequality, where the equation of
the two sides is permissible as well as the allowable inequality. Weak
inequalities arise much more commonly than strict inequalities and
they are also much more convenient. Whenever there are budgetary
constraints, limited physical resources, minimum targets or variables
which cannot be negative, a weak inequality is defined. Other inequality
symbols in common use are:
The symbol < could be read as ‘neither greater than nor equal to’,
while > could equally well be read as ‘not less than’. If any of the
symbols are struck out the opposite meaning to that of the original
symbol is taken. For example, the symbol represents ‘not greater
than’ (and is equivalent to <,) while ^ means ‘not less than or equal
to’ (i.e. greater than). Most importantly, the struck-out equality symbol
7^ has the meaning ‘not equal to’, i.e. necessarily greater than or less
than. Taken together, the equality and inequality relationships are
called relational operators, and in computer programming the
relational operators are used to compare collections of symbols
(‘strings’) as well as numbers. In total there are six possible
relationships:
3.1389 3.14
the ability to add a constant to both sides means that it is valid to write
X + 4 < 14
or, since the constant may be any real number, it will also be true that
X — 71 < 10 — TT
x + b < 10 + b
all that is required is that both sides of the inequality are varied by the
same amount.
Linear inequalities 55
or after division by 20 it will be true that
2.2.3 The validity of this rule is most clearly illustrated by the comparison of
Multiplication by a two real numbers, for example
negative constant and
6 < 7
reversal of direction
But when this inequality relationship is multiplied through by —1, the
valid statement is
-6 > -7
since of two negative numbers, the value further to the right on the
real line is the greater of the two. Thus with an initial statement
2x — 4 > X — 3
—2x 5“ 4 ^ —X “I" 3
while
2 — 2x < 4x — 10
divided by —2 becomes
X — 1 > 5 — 2x
2x — 3 < X + 5
the solution set can be found - the set of values of x for which the
relationship is true. The solution set, if not empty, will be infinite. This
is shown in the relationship above, where the addition of 3 to both
sides produces
2x < X + 8
X < 8
10-2x<22 + x
—2x < 12 + X
-3x < 12
X > —4
X—4 > X— 3
0 > 1
The solution set in this case is the empty set cj). This inequality
relationship therefore cannot be satisfied by any value of x. In contrast,
suppose the original inequality relationship had been
This simplifies to
4x + 35 > 4x + 30
5 > 0
which means that the original relationship is true for all values of x.
This is so, since it is a truthful relationship that did not really involve
f
X at all, and therefore cannot not be true for any x. So far we have
considered cases involving weak inequalities. Similar principles apply
to strict inequalities (you could rework the above examples using <
and > in place of < and >, respectively). In either case, when just one
inequality is involved, the resulting solution set, if not empty, is
defined by an upper or lower end-point for permissible values of x.
When more than one relationship must be satisfied there may be both
upper and lower bounds on x. Consider the following simultaneous
inequalities:
(a) 2x - 3 < X + 5
(b) x + 1 < 2x — 3
4 < X < 8
which is a solution set with both upper and lower end-points. There is
an important distinction between linear equations and linear
inequalities. As inequalities, (a) and (b) can be satisfied simultaneously
and produce a non-empty solution set. But if (a) and (b) were equations
they would be inconsistent - as must always be the case for
independent equations in a single variable. But inconsistency can occur
with inequalities too. For example, the system
(c) 20-3x<10-x
(d) 7x — 12 < 3 -I- 2x
requires that x > 5 (from (c)) and simultaneously that x < 3 (from (d)).
Thus the solution set is empty. To further point up the distinction
between simultaneous equations and inequalities, there is no limit to
the number of distinct linear inequalities that can simultaneously hold.
For example, suppose that
(e) 2x — 3 < X -f 5
(f) 3x — 4 < 2x -f 2
(g) X + 1 < 2x — 3
(h) 1 — 2x < —X — 4
(e) X < 8
(f) X < 6
(g) x>4
(h) X > 5
Note that (e) and (f) give upper bounds on x. The more restrictive
of the two is (f), which is the least upper bound (LUB) or
supremum. If the least upper bound, (f), is satisfied, then (e) must also
hold. In general, if the least upper bound is fulfilled all other upper
bounds will be satisfied and therefore do not need to be referred to. We
can then concentrate on a single numerical value - the LUB. Lower
bounds are given by (g) and (h), and, of these, (h) is the greatest
lower bound (GLB) or infimum. If (h) is satisfied (g) must also be
fulfilled. Of all the lower bounds, only the greatest lower bound need be
in this case
5<%<6 (2.3)
Solution sets of this kind, defined between the infimum and the
supremum, often result when the stability of the solution to a model is
being investigated in relation to the values taken by certain key
parameters. Work of this nature is called sensitivity analysis, and the
range of values of a variable for which a solution holds good (expressed
as in (2.3)) is sometimes called a tolerance interval or a range of
feasibility. A solution set for one variable can be identified as a set of
points on the real line - a section of it - which may or may not include
the end-points. A range such as (2.3) which involves weak inequalities
at both ends of the interval is said to be a closed interval on the line,
as the end-points, 5 and 6, are included in the set. This is shown in
Figure 2.1, the solid circles at the limits of the interval indicating that
the end-points are included. If the solution set (2.3) had involved strict
inequalities at both ends, i.e.
5 < X < 6
then an open interval on the real line would have been defined. This
would appear as shown in Figure 2.1(b), with the end-points as ‘hoUow’
circles - the convention to signify that they are not included in the interval.
Figure 2.1
X
(a)
-0-e-►
5 6
(b)
- 0-•-►
’ 5 6
(c)
5 < X < 6
Linear inequalities 59
which would have appeared as shown in Figure 2.1(c). Obviously when
this situation is reversed, as for
5 < X < 6
Exercises 2.2
1 Find the solution set for each of the following linear
inequalities;
(i) 4x-8 <2x+12 (ii) 10x + 8 >12x-9
(iii) 6x + 25 > 3x + 10 (iv) 4(5 + 2x) < 20 + 6x
2 Comment on the solution sets in the following coses:
(i) 6(x- 10) + 3(25-x) <7(4 + 2x)-(llx+ 14)
(ii) 7(x- 10) + 5(16-x) <6(x-8)-4(x- 15)
3 Find the solution set for the following systems:
(i) 3x + 5 < 6x — 10
X — 15 < 33 - 3x
(ii) 4x-20<3x+10
3x + 15 < 8x — 30
(iii) 6x — 5 > 5x + 5
7x — 5 < 2x + 20
4 Find the solution set for the following:
(i) 3x + 2<2x+10
3x - 4 < 4x - 8
5x + 6 < 4x + 12
6 — X < 2x — 3
(ii) X + 20 > 3x + 6
4x + 12 < 8x + 4
X + 2 > 11 - 2x
5x — 5 < 4x + 4
(iii) 2x + 6 < 4x H- 2
4-x<x-2
5x - 1 < 4x + 8
3x + 2 < X + 16
(iv) 5x + 5 < 4x + 14
3x — 2 < X + 12
X + 3 < 2x + 1
2-X < x-4
2.3 Linear inequalities usually involve more than a single variable as, for
Linear inequalities example, when the condition relates to an expenditure limit on
in two or more resources. Consider the following inequality:
variables
2x + y < 13
The solution set is all points on or below the line defined by the
equality part of the relation and is shown in Figure 2.2.
^ + 3iy < 19
The solution set is as shown in Figure 2.3.
Figure 2.3
Linear inequalities 61
in addition to the existing inequalities. In practice it is almost always
the case that sign requirements will take the form of non-negativity
requirements (rather than non-positivity requirements). These conditions
restrict the solution set to that part of the original set lying in the
positive quadrant, as shown in Figures 2.4(a) and 2.4(b).
Figure 2A
lx + Oy > 0
and:
Ox ly > 0
2x -i- y < 13
x + ^y < 19
> 0 y > 0
X -h 3y < 18
3x -I- 9y > 57
the intersection of the individual solution sets would have been empty,
since they would have been on conflicting sides of the parallel lines.
Linear inequalities 63
Figure 2.6
2x -f- ^ < 18
X + 2y > 40
X > 0 y > 0
2.4 Whatever the shape of the solution set for a system of simultaneous
Convex solution linear inequalities, it always has the important property of convexity.
sets In two dimensions, none of the interior angles of the solution set
polygon should be reflex (over 180°) and hence the boundary of the
feasible region should not be a reflex polygon. That is to say, the feasible
region will not be re-entrant - a convex set is one without
indentations. In Figure 2.7 since the angle at point A is reflex, a
feasible region of this nature could not result from a system of
simultaneous linear inequalities. Convexity means that for any two
points P and Q in the set all points on a straight line connecting P and
Q must also be in the set. So all the solution sets of Figure 2.6 are
convex. However, the set illustrated in Figure 2.7 is not convex, since
because of the inward pointing vertex it is possible to select points
within the set (for example P and Q as shown) such that some of the
points on the line segment connecting them are outside the set.
Figure 2.7
Linear inequalities 65
Convexity of the feasible region is an important property in problems
where the optimal value of a function is sought for points l5hng within
the feasible region. If the function to be optimized satisfies certain
conditions, then if the feasible region is convex we can be sure that a
point that is better than any other point in its locality will be the best
overall. An important concept in relation to convex sets is that of the
extreme point. An extreme point in a convex set does not lie on a line
segment joining any two other points in the set. Extreme points are
important in the simplex method of linear programming, an adjacent
extreme-point solution procedure, since it works towards the best
position by moving between successively better adjacent extreme points.
A non-convex set cannot be produced by linear inequalities which must
all be simultaneously satisfied. Each linear inequality produces a solution
set which is convex (a half-plane in the two-dimensional case, as we
have seen) and the intersection of convex sets must itself be a convex set.
A non-convex feasible region involving only linear inequalities could be
produced if there are alternative constraint systems. For example, suppose
that the permitted values of x and y must satisfy either (i) or (ii):
(i) 2x + y <3
x + y >2
(ii) 2x + y>3
x+y < 2
Figure 2.8
z = 2x + 5y (2-4)
4x + 3i/<48 (2.5)
X + 2y < 22 (2-6)
Linear inequalities 67
requirements of earlier acquaintance. The complete problem in
algebraic form is then to
maximize z = 2x + 5y
subject to 4x + 3iy < 48
x + 2y <22
> 0 y >0
Figure 2.9
The solution set is the interior and perimeter of the polygon OABC.
The solution set is known as the feasible set, feasible region or
opportunity set. In a production context it is also referred to as the
feasible production set, and the outer edge of the set - the line
segments connecting A, B and C - is the production possibility
frontier, representing the feasible production combinations which, with
the given resource levels, are Pareto optimal. Pareto optimality, an
important concept in economics, means that the output of one of the
products cannot be increased without that of the other being reduced.
Here we wilt refer to Figure 2.9 as the production possibility diagram.
2 Make sure that the production plan adopted makes full use of the
scarce resources.
Figure 2.10
Linear inequalities 69
The further a contour is from the origin, the greater is the
corresponding value of z. The point within OABC lying on the highest
contour is the optimal solution to the linear programming problem. The
graphical procedure is implemented by first drawing a specimen
contour through a convenient point - usually one of the intercepts.
This could be C, for example, where the labour constraint line cuts the
X axis. At this point y = 0, so that from the constraint
4lX = 48
x=12
24 = z = 2(0) + 5y
from which
i/ = 4.8
With the intercepts obtained, the sample contour can be drawn in the
diagram and used as a guide for other contours. A value of z greater
than 24 produces a contour parallel to the original but further out
from the origin. For example, the z = 40 iso-profit line has its intercepts
at (20, 0) and (0, 8). The specimen contour can now be moved
upwards and to the right until there is just one point in common with
the feasible region - in this case point A. This will be the profit-
maximizing position. In linear programming problems the optimum
point will always be at a corner of the feasible region, that is, at an
extreme point. The optimal point may not be unique, but all optima
will give the same value of the objective function. In linear
programming the optimal solutions must always include corner points.
So in two variable problems a place where two of the constraint lines
intersect will be an optimum. Note that one of the intersecting lines
may be the x or y axis, representing the sign requirements satisfied as
equalities. Once an optimum has been identified in the diagram, the
approximate values of x and y can be read from the axes. More
accurate values for the variable at the optimum can be obtained by
solving the intersecting constraints as equations (these will be the only
requirements that hold in equality form at this point). Thus at A, where
the materials and x sign requirement both hold as equalities,
X + 2y = 22
X = 0
4:X + 3y = 48
x + 2y = 22
4:X + 3y = 48
4:X + 8y = 88
-5y = -40
so that
y= 8 and x= 6
z = 4iX + 5y
4:X + 3y = 48
3y = 48 — 4x
so that
y = 16 — 1.33x
and the slope in absolute terms is 1.33. For the materials constraint we
have
x + 2y = 22
2y = 22 — X
so that
y = 11 — O.Sx
Again disregarding sign, the slope is 0.5. With the original unit profit
figures the objective function was
z = 2x + 5y
5y = z — 2x
y = 0.22 — OAx
where z takes a constant value (e.g. 24, 40) for any particular contour.
The absolute value of the slope is 0.4, which is less than either
constraint; hence A is optimal. With the constraints in explicit form,
note that the production possibility frontier (PPF) is an example of a
piece-wise defined function:
ll-O.Sx 0<x<6
16-1.33X 6<x<12
z = 8x + 6y
Figure 2.12
/A
>■
O X
Linear inequalities 73
In practice non-uniqueness at the optimum is usually a desirable
property, since the decision-maker is offered a costless choice between
the equally profitable joint optima. Consider another example:
maximize z = 12x + 2y
subject to 4iX + y < 800
2x+ 3y < 900
X < 180
^ > 0 y >0
4:x + y = 800
X = 180
from which it rapidly follows that y = 80 and that the value of the
objective function is maximized at z = 2320. At C there are 300 units
of slack in the second constraint. At B, which corresponds to full
resource utilization, the level of profit is z = 2200, revealing an
opportunity loss of £120 in comparison with the optimum. Now
consider minimization of the objective function and the following
problem:
minimize z = 3x + 5^
subject to X + 4y > 60
2x + > 50
^ > 0 y >0
Figure 2.13
X + 4y = 60
2x + y = 50
x = 20 y = 10
Figure 2.14
75
Linear inequalities
management information in deciding whether a possible increase in
resource is worthwhile. We can also find out how the optimal
production plan responds to changes in product profitability as well as
variations in resource levels. This is the realm of sensitivity analysis,
which in small problems can be carried out in a graphical context but
is more efficiently conducted using the simplex method.
Exercises 2.5
1 Use a graphical approach to find the optimum for the
following linear programming problem:
maximize F = Sx] + 10x2
subject to 3xi + 2x2 < 54
2xi + 5x2 < 69
X] > 0 X2 > 0
Indicate the location of the optimum and state the optimal
values of the decision variables and the objective function.
2 With the constraints of problem 1, indicate the optimum
position(s) and find the optimal values of xi ,X2 and F for the
following objective functions:
(i) F = 8xi + 5x2
(ii) F = 3xi + 10x2
(iii) F = 8xi + 20x2
3 Solve the following linear programming problem:
maximize F = 35xi + 25x2
subject to 4xi + 3x2 < 92
xi + X2 < 3 8
X] < 20
X2 < 20
X] > 0 X2 > 0
4 Solve the following problem using the graphical method:
minimize F = 7xi + 3x2
subject to xi + 2x2 > 55
4xi + X2 > 80
xi > 0 X2 > 0
Matrices
Sections 3.1 and 3.2 introduce basic concepts and matrix notation.
Sections 3.3-3.5 consider operations on matrices. Determinants are
introduced and linear dependence is discussed. Use of matrices in
solving linear systems is explained, and section 3.10 introduces
Cramer’s rule.
LEARNING OUTCOMES:
By the end of this chapter you will be able to add, subtract and
multiply matrices, evaluate determinants, find the inverse of a matrix,
and use matrices and determinants to solve systems of simultaneous
linear equations.
3.1 Introduction Quantitative models often use many unknowns and numerical values.
Masses of data could be handled ad hoc but this is inefficient. Error
tracking is difficult, and systematizing procedures and computer
implementation are more difficult. It is often desirable to use a
rectangular array, so conventions are needed for handling such arrays.
As an example, suppose that a firm can use two scarce resources to
produce two products, the Alpha model and the Omega model. Table 3.1
states the resource requirements per unit of each product produced.
Materials (kg) 4 1
Labour (hrs) 2 3
'4 1'
2 3
77
Matrices
additional properties). Matrices are notation. In principle there is
nothing that can be done with matrices which could not be done
without them. But in practice matrices make a big difference because, as
with all good notation, matrices:
3.2.2 The dimensions are stated as the number of rows times the number of
Dimensions of matrices columns. The dimensions give the order of the matrix. Thus R is a
matrix of order 2x2. The matrix:
6 3 10
A = 4 2 5
21 3 9
'1 5'
__ 2 6
^ 3 7
4 8
D =
E = [1 2 3 5 4]
3.2.3 The elements of (or entries in) a matrix are the entities of which it is
The elements of composed. They could be integers - as in matrices A, B, C, D, E and F
matrices - or other numbers, including irrationals, as in
6.2 -4
G = 4/3 10^
TT —e
3 —X
then
1 7
L = [6 8 -7 4]
7 2.5 4
1 -3
R M= [ttI
3 6
20 32 16 15 12
-17 44 27 13 41
15 34 26 57 12
22 19 20 35 17
79
Matrices
may conveniently be considered as being made up of sub-matrices
A B'
C D
12 3 4
5 6 7 8
9 10 n 12
13 14 15 16
3.2.6 A triangular matrix is a square matrix where only the elements on,
Triangular matrices or on one side of, the main diagonal are non-zero, an example being:
■ 1 0 0 O'
5 6 0 0
9 10 11 0
13 14 15 16
■-5 -4 -3 -2'
0-101
0 0 6 3
0 0 0 5_
3.2.7 A diagonal matrix is a square matrix in which all the elements except
Diagonal matrices those on the leading diagonal are zero. For example.
4 0 0
T = 0 4 0
0 0 4
1 0 0
1 = 0 1 0
0 0 1
3.2.8 In a zero matrix all elements are zero. But zero matrices, unlike
Zero matrices identity matrices, need not be square. So, for example.
is a 2 X 3 zero matrix.
3.2.9 Elements can be referenced by their row and column number, e.g. row
Matrix notation 1, column 4, or row i, column j. To refer to the elements in a general
way, matrix notation is employed. Lower-case letters are used —
usually fl, b or c - in conjunction with a double subscript or index to
indicate the row and column of the element, as in the m x n matrix
81
Matrices
In this notation the ranges of the subscripts are not explicitly stated
when the order of the matrix is understood. Vectors require only one
subscript to identify their elements, as in the following column and row
vectors:
bi
bm
3.2.10 Two matrices are equal when (1) the matrices have the same dimensions
Equal matrices and (2) corresponding elements in the matrices are equal. So if
3 3
A = and
6 6
the matrices are equal if and only if x = 9 and y — —1. Note that
3 3 0
A = and
6 6 0
3 7-4
then A'
2-1 6
and if
7
8
then B' = [7 8 9 10]
9
10
1 3 4
A = 3 7 8
4 8 2
The transpose is
Here the elements of the transposed matrix have the same absolute
value as the elements in corresponding positions in A but are of
opposite sign. For this to be so, aij = —aji and the matrix must square.
A matrix with this property is skew-symmetric or anti-symmetric.
The principal diagonal elements in a skew-symmetric matrix are zero.
Transposition can also be used to describe the interchange of two
rows or of two columns of a matrix. Such an operation leaves certain
essential properties unchanged and is an elementary matrix
operation.
Exercises 3.2
1 State the dimensions of the following:
3 2
B=[l 5 6 7]
6 1
-3
-2
-1 D = [0]
-2
-3
f = (o//)
where: ; = 1 to t;
/ = 1 to s.
2 Given
6 3 7
2 1 4
5 9 3
3 7'
6 7
B 1 4
2 4
9 3
2 4
6 3
5 9
83
Matrices
3 Find the trace of
10 13 26 18
16 27 13 22
14 30 35 23
20 19 21 29
(ii) 10 10 10
10 10 10
B = (b,v)
where; b,y = 10 for / = 1 to m and / = 1 to n.
6 Find the transpose of the following matrices:
4 -1
B= [2 TT 3-10]
-3 8
-8
0
V2 D = [100]
-1
4
9 4 13'
4 12 7
12 3 5
3 7 3 6
1 -3 0 9
L 6 0 -9 0.
r 0 1 2 11
1 0 3 5
C=
2 3 0 4
1 5 4 0.
r 8 8 1 7 -
8 TT 2 0
D=
1 2 0 6
7 0 6 e
6
to
3‘
A = 3 0 and B = -2 -1
-1 2_ 4 9
then
8 7
A+B = 1 -1
3 11
1_1
1_1
00 o^
(N O
'7 9‘ -3 2'
1
+ + =
'X)
3 2 5 3
1
b 4a 11' a -1 7
A = -b a 9 B = a c^ —c
a+b c 2c _-2b b b—c
Matrices 85
b+a 4fl — 1 18
A+B = a—b a + c^ 9 -c
a— b c+b c+b
and if
15 -8 2x 7 25 -1
+ 7 2y z 31
10 3x
15 + 2x = 25
10 + 7 = z
so z = 17, and from the elements in the (2, 2) positions it must also be
that
3x + 2i/ = 31
therefore;
15 + 2y = 31
so:
2y = 16
i.e.
1/ = 8
A+B=B+A
A + {B + C) = {A+B)+C={A + C)+B
A B = (djj bjj)
2 3 4 6 2-1
A = and B =
10 2 4-7 0
-4 1 5
A-B =
-3 7 2
A-B^B-A
Here B — A is
-1 -5
B
-7 -2
3.3.3 The zero matrix is the identity matrix for the operation of matrix
The identity element for addition (requirements needed to fulfill an identity role depend on the
matrix addition operation being performed): that is, for conformable matrices A and 0:
0-A = -A (3.1)
2
A = then
-4
A + (-A)=0
that is, a matrix plus its negative is a zero matrix of the same order.
3.3.4 First consider the addition of equal matrices. For example, start with a
Scalar multiplication matrix A where
Matrices 87
and add A to itself
4 1 8 2
2 3 4 6
A+A = 2A
and, in general, for c equal matrices which are added the result is
CA = (Cfly)
(i) {k ± m)A = kA ± mA
(ii) ikA±B) = kA±kB
(hi) m{kA) = {mk)A
(iv) 0A = 0
(v) -1A = -A
Exercises 3.3
1 Find the sum of the matrices in the following coses;
10 -8‘ 2 16' -3 12
+ +
4 11 - 9 14 2 -7
4 2a _ b' —a 6 0
a b 6 + 9 —c 24
9 0+ c 1 9 c —a
2 Given that
5x 6 4-
—4y —z ■ z 4 ■
1_
2x 2y 24 12
1
7
and
2
3.4 In the example of the two-product manufacturer the per unit resource
Matrix consumption data formed the following 2x2 matrix:
multiplication
Suppose we now want to find the unit costs of production for the
two goods, that materials cost £5 per kilo and that the wage rate for
labour is £6 per hour. The prices can be written as a 1 x 2 row vector
thus:
P=|5 6]
These unit costs could be written as the following 1x2 row vector
C = [32 23]
[5 6]
'4'
2
[5 = [32]
Matrices 89
product or scalar product of the two vectors. Workings for the unit
cost of the omega model - the second element of the unit cost vector -
can be laid out in a similar way:
[5 = [23]
4 1 (3.3)
5 6 32 23]
2 3
PR=C (3-4)
r
[5 [32 23]
3
Dimensions: 1x2 2x2 1x2
For this to work, the number of elements in the row vector P must be
the same as the number of elements in each column of R. This means
that the number of columns of P is the same as the number of rows of
R.
If the firm finds another supplier charging £7 per kilogram for
materials and £4 per hour for labour, what would be the unit costs of
the products.? The new prices can be included by adding a second row
to P The result is a second row in the product matrix C. First we set
out the results and then we describe them. The outcome is
■4 ■32
1
1
IT)
-
_
1' 23'
1_
.2 3. .36 19.
1
The unit cost of the Alpha model at the new prices, £36, is in the
second row, first column of the product matrix, and is the second row of
P multiplied into the first column of R. The new unit cost of Omegas,
£19, is in the second row, second column of the product matrix, and is
the product of the second row of P into the second column of R. The
workings show that a 2 x 2 matrix multiplied by a 2 x 2 matrix
produces another 2x2 matrix. Now imagine a third source for
materials and labour charging £3 per kilogram and £8 per hour. The
further expanded matrix operations are:
where the Alpha cost, £28, is element C31 of C and is the third row ofP
multiplied into the first column of R. The Omega cost, £27, is element
C32 of C and results from multiplying the third row of P into the second
column of R. Note the dimensions of the matrices;
P X R = C
3x2 X 2x2 = 3x2
and there will now be three sets of unit costs for the three products,
represented in matrix form as
The unit costs in the third column of the expanded C matrix are
successive rows of P into the last column of R. If P had dimensions
mx 2 and R was 2 xn then C would he m x n. If the number of
columns of P and the number of rows of R are both p, then the
matrices are conformable in multiplication, and the dimensions of
the product matrix will be m x n. Notice that the product of the
matrices:
A B
'5 6 2
■4 1'
7 4 4
2 3
3 8 6
Dimensions: 3x3 2x2
91
Matrices
columns it can be multiplied by itself. The result is the square of the
original matrix:
AA=A^
For example, if
4 1
A =
2 3
then
Notice that the elements oi A^ are not the squares of the elements of
A[{aij)^]. A square matrix can be raised to higher powers: for example,
AB^BA
4 1 6
and
2 3 4
{ABf = B^A^
i.e. the transpose of the product is the product of the transposes — but
in reverse order. The matrices A and B above can be used to
demonstrate this result:
B^ A^ {ABf
■5 7' ■4 2' '27 31'
.6 4. .1 3. .28 24.
5 4 [3 2"
and B= ^ ,
-4 5
22
= BA
7
AB = 0
4 6 0 0
and then AB =
2 3 0 0
93
Matrices
In the arithmetic of real numbers for given scalars a, c and d, if ac = ad
for a^O, then c must equal d. But in the case of matrices A, C and D, if
AC = AD where D^C
4 6 2 3 5 3
A = C= D
2 3 4 1 2 1
then
32 18
AC = = AD
16 9
A^ =0 for A^O
For example, if
1 2 0 0
then A^ =
-0.5 -1 0 0
(i) A B C
'2' '2'
1 4 1'
2 3] 3 = [11 8 17] 3
.3 0 5
_1
rH
1
= [63]
(ii) A B C
2 1
'3'
1 3 '4 2 [4 2:
2
0 5 10
32 16
36 18
40 20
A B C
3
'4 5'
2 [6 9]
1 7
1
A(B + C)=AB+AC
(B + C)A = BA+ CA
A B C
5 '3 7' '2 11'
+
-1 .0 5. .4 6..
A (B+C)
■ 5 4' '5 8' '41 84'
.-1 9. .4 11. .31 91.
AB + AC
■ 15 55' 26 29' '41 84'
+
-3 38 34 53. 31 91
95
Matrices
In the multiplication of real numbers 1 is the identity element for
multiplication, leaving unchanged any number by which it is
multiplied. In matrix multiplication the equivalent role is filled by the
identity matrix, I, such that for a matrix A
AI = A and lA = A
'1 0 0 o'
1 0' '1 0 o' 1 0 0
0
0 1 0 =h 0 0 1 0
0 1
0 0 1 0 0 1
0
Note that the order of the identity matrix can be shown by a
subscript and /i = [1]. To show that I is the identity element in
multiplication suppose that
1 o' ■4 1 6‘ ■4 1 6'
0 1 2 3 5 2 3 5
10 0
4 16 4 16
0 10
2 3 5 2 3 5
0 0 1
Exercises 3.4
1 Carry out the following matrix multiplications:
(i) 8 3
[3 5]
-1 4
2 1 ■ '6 9'
3 -2 1 4
(iii)
8
1
-1
(iv)
2 5
■5 4 3'
-3 4
1 -2 6
6 -1
2 1
A
-1 3
(ii) 8 7
-8 -7
3 7 -2 -4
and B=
-7 3 4 -2
2 5 -1
[4 3
-2 6 4
(ii) r3 2
1 3 3 6
4 -1
(iii) r 3 2 01 ri 31 r
1 0
1 2 -1 2 1
3 5 -1
2 5 1 0 4 -
3,5 For two square matrices A and B of the same order, if and only if
Matrix inversion
AB = BA = I
It follows that
so a matrix is the inverse of its own inverse. A matrix and its inverse
are commutative. Consider two examples of inverses. If
2 3 0.8 -0.6
A = then is A ^
1 4 -0.2 0.4
Matrices 97
As a check, the multiplications AA~^ and A~^A may be carried out to
confirm that the result in both cases is /2- In the former case the
workings are
2 1
A = then is
1 1
+ (^12^21 = 1
diibn + «i2l^22 = 0
^^211^11 + (^22^21 = 0
(i2lbl2 + «22l’22 = 1
When these requirements are solved to state the by in terms of the fly,
the results are
, —fll2
b,,
”11 —
Allfl22 — <^12^*21 flllA22 — «12fl21
, —^^21 , flu
b21 = b22 =
«11«22 — ^^12^21 flllfl22 — «12«21
4->=B =-1-[ ““
^11^22 ~ ^*12^*21 [~^21 ^*11
is written as either
1A| or det A
and is the product of the main diagonal elements less the product of the
two remaining elements. It requires more substantial calculations to
obtain determinants of matrices of order higher than 2x2, and these
will be considered later. For the moment we will develop the idea and
illustrate its uses in the context of 2 x 2 matrices. Consider some
examples:
2 3'
(i) A = A| =2(4) - 1(3) = 5
1 4
■ 1
(ii) A = |A1 = 1(3)-(-!)(-!) = 2
-1
0.5
(iii) A = detA = 0.5(8) - (7)(3) = -17
3
2 3
(iv) A = detA = 2(6)-(3)(4) = 0
4 6
9 4.5
(v) A = detA = 9(-2) - (4.5)(-4) = 0
-4 -2
ail fll2
A =
kail kai2
99
Matrices
3.5.2 Given the matrix A, where:
The ad jugate matrix
All «12
A =
an ^22
the matrix
«22 —fll2
—fl21
adj A
3' -3‘
(i) A = adj A =
4 2 .
-1 1'
(ii) A = adj A =
3 1
The role of the adjugate matrix becomes clear in forming the inverse
when we examine the products A adj A and adj A A. The order of
multiplication is immaterial as A and adj A are commutative so the
products are always the same. Take the matrices above as illustrations.
In the case of example (i) A adj A works out as
In the case of (ii), the products A adj A and adj A A work out as:
A 0
A adj A =
0 A
^ad]A=A-^ (3.5)
|A|
We can now complete the inversion process for the two examples above:
(a) If
since
4 -3
\A\ = 5 and adj A =
-1 2
then
4/5 -3/5
-1/5 2/5
(b) If
since
3 1
A| = 2 and adj A =
1 1
then
-1 1 . .
A ^ = -—r adj A =
3/2
1/2
1/2
1/2
|A|
since
3 -4
|A| = 10 and adj A =
-2 6
then
1 , . 0.3 -0.4'
\A\ ’ " -0.2 0.6
Matrices 101
(d) If
7 1
A =
9 2
since
2 -
\A\ = 5 and adj A =
-9
then
1-1
1 , 0.4 -0.2'
— adj A =
|A| -1.8 1.4
(e) If
2 1
A =
5 2
since
then
1 1 , ■-2 1'
A ^ = 7—7 adj A =
|A| 5 -2
(f) If
4 6
A =
2 3
since |A| = 0, A is singular and the inverse matrix in this case does not
exist.
Now consider two special cases. A square matrix is orthogonal if
A'A=I
that is, if the transpose of the matrix is the inverse. For example,
0.8 0.6'
- 0.6 0.8
=A
\A-^\ = {\A\)-^
(A^)-i = {A-^f
• The inverse of the product of two matrices is, in reverse order, the
product of their two inverses:
BA = I
AC=I
Both left and right inverses exist only if A is square, in which case
B = C=
Exercises 3.5
1 Establish whether or not B is the inverse of A where
B
4 .-1 1.
Matrices 103
Find I A| for the following;
(i) 9 7
A
2 3
(ii) -3 -2
-1 -4
2 5
3 4
0.4 0.3
0.3 0.2
3 tt"
27r -2.
(vi) 8 2
12 3
3 2
A
1 5
(ii) 2 -6
A
3 -7
(iii) 3 — TT
A
2 e
(iv) 5 0
A
0
4 Find the inverse, if it exists, for the following:
4 5
A
2 3
4 2
A
3 4
3 2
A
10 5
(iv) 1.5 -0.5
A
-0.4 0.2
8 -4
A
-6 2
5 Which of the following ore orthogonal or involutional?
(i) 7 -6
8 -7
0.6 -0.8
0.8 0.6
II
.1.8 -0.8.
■-0.6 0.8'
II
0.8 0.6.
■7 1' Xi ■34'
9 2 _X2_ 48
Ax = b
ax = b
b
X = -
a
that is
X = a~^b (3-6)
Ax = b
Since the product of a matrix and its inverse is the identity matrix (3.7)
becomes
Ix = A~^b
X = A^^b (3.8)
Matrices
105
In terms of our current example, (3.8) is
'xf ■7 r -1 ■34'
.^2. .9 2. .48.
0.4 -0.2' ■34‘
-1.8 1.4 .48.
~ .6 .
2xi + 3x2 = 16
Xi -I- 4x2 = 13
can be expressed as
-1
Xi '2 3' 16'
X2 1 4 13
and so
(ii) If
■3 -1' Xi '23'
4 2 X2 24
then
(iii) If
then:
Xi ■7 1 '-50' -lOO'
X2 9 2 250 50
■3 1' Xi '20'
6 2 _X2_ 40
the determinant of the coefficient matrix is zero, so the inverse does not
exist. The equations as a whole are either inconsistent or, as here,
linearly dependent. The coefficient matrix has linear dependence in
either case. If the right-hand-side elements are in the same relationship
as the rows in the A matrix the equations are consistent (but linearly
dependent). Otherwise the equations are inconsistent - in the two-
variable case producing parallel straight lines. Here the second equation
is twice the first and there are infinitely many solutions.
In the case of
(V)
Use of the inverse matrix as outlined above is not the most rapid
means of obtaining a once-only solution to a linear system, but
advantages follow if the inverse of the coefficient matrix is available. For
example, it can be useful to know the range of values of the right-hand
side for which the solution values are non-negative - a situation which
arises in linear programming. Suppose that Xi and Xj are production
levels and the right-hand side values are resources. Suppose that 48
units of the second resource are obtainable, but that availability of the
first resource is variable: for what range of values of the first resource
are both Xi and X2 non-negative? To answer this question, let the level
of the first resource be bi. The system is then
■7 r 'xi' 'h'
9 2 X2 48
and the solution values of Xi and X2 - along with the statement of sign
requirements - will be
bi > 24
Similarly, as regards X2
bi < 37.33
So, with other data at original levels, for the solution values of both
variables to be non-negative the amount of the first resource must lie in
the range
24<bi< 37.33
that is
b2 < 68
implies that
1.8(34)
b2 > 43.71
1.4
In summary:
that is:
ba < 2bi
To make sure that X2 does not fall below zero it is necessary that
or (approximately)
The permitted region is on and between the lines. The individual range
of variation for bi given a value of ba is found by taking a horizontal
line into the diagram at the given height. Points of intersection with
the rays then give the end-points of the individual range for bi.
Figure 3.1
Exercises 3.6
1 Solve, if possible, the following systems of simultaneous linear
equations using the inverse of the matrix of coefficients:
(i) 4 ^1 55
1 ^2 20
(ii) 2 120
1 3j [x2_ 10
(iii) 5 9irxi 45
24j [x2 20
(iv) 9 61 [xi 30
6 4j [x2_ 20
(v) 2 -71 rxi -18
2 -2} [X2 -8
(vi) 9 -6' xi 30
-6 4j [x2_ 20
(vii) 8 10 xi 189
4 6 X2 106
(viii) -6 0.4
0.8 -0.2j[x2.
2 For
(i) b] 67
L^2 J 46
.b2_ .109
(v) b]' 133
>2. 114
(vi) 'bi 1 -230
>2. .-340
(vii) 'b]' '220 ■
>2. 16C
■4 5‘ bi
2 3 .^2. .b2_
'2 3‘ Xi '12'
4 6 _X2. 24
we can write
,2
X2 = 4 - 2
Matrices 111
But note that in the case of
’0 r 'xi r
0 1 X2 1
Xi /Xj can have any value. The original system of equations is, in a
sense, not as big as it seems. While the matrix A is of order 2 x 2, it
has rank 1. The rank of a matrix is the largest number of its rows or
columns which are linearly independent. If the rank of a square
matrix is less than its order, there will not be a unique solution to the
corresponding linear equations. Consider some examples of rank.
(i) The matrix
'2 -1 4‘
6 5 3
_8 4 7_
is of order 3x3 but has rank = 2. The linear dependence is most easily
seen in terms of the rows; the third row is the sum of the first two. This
does not mean that it is the third row in particular which is dependent.
Any one of the rows can be expressed as a linear combination of the
other two (row 1 = row 3 - row 2) and can be dropped from the
system without losing information. There will be the same degree of
linear dependence between the columns as between the rows of a
square matrix. In case (i) the link between the columns is less obvious,
but inspection reveals that:
® 2) ■ ■ ■
Wi, W2,---Wm
then the vectors making up the matrix are linearly dependent. If no such
numbers Wj exist, then the rows (or columns) are linearly independent. A
collection of linearly independent vectors forms the basis of a set of
vectors if every vector in the set is a linear combination of vectors in the
collection. This may seem obscure, but the idea of a basis is important in
linear programming. The columns of coefficients of variables in a basic
solution can be used to generate the columns of coefficients of variables
currently zero. For example, given the three vectors:
3 1 4
2 0 2
4 15
3 and 1
2 0
4 1
form a basis for the three-vector system as a whole, since the third
vector can be formed from a linear combination of the first two (it is
their sum in this case). Of course, bases are also formed by the other
possible pairings of vectors. Thus, if the vectors
3 and 4
2 2
4 5
are selected, the second vector is the difference between the third and
the first. If the vectors selected are
1 and 4
0 2
1 5
then, from this basis, the first vector is obtainable by taking the
difference between the third and the second vectors. Consider two
further examples of matrices where the order is greater than the rank.
(ii) The matrix
■ 2 1 7 4'
6 8 4 6
-15 9 2
3 4 2 3
is of order 4 x 4 but has rank = 3. In this case the last row is half the
second row. Given consistency, it would be possible to state three of the
variables in terms of the fourth. This situation arises quite commonly in
economics, where the variable used as the parameter is determined
outside of the system (it is exogenous). The in-system (endogenous)
variables can then be given specific values.
(iii) The matrix
' 9 7 8 6‘
2 10 3
7 6 8 3
11 8 8 9_
is of rank 2. In this case any collection of more than two of the four
rows (or columns) shows linear dependence. The row relationship can
be expressed as
’2 1 4 0.5 3'
4 2 8 1 6
is of order 2x5 and rank 1. In terms of the rows, the second row is
twice the first. This is the row rank. The column rank must be the same
as the row rank, which means that in this case there is only one
independent column, of which all others are multiples. The matrix
■ 2 3 4 1.5'
-1 7 -2 3.5
4 6 8 3
has order 3x4 and rank 2, since there are only two linearly
independent rows or columns.
Exercises 3.7
1 In the following systems, state the rank of the matrix of
coefficients, A. What can be said about the relationship
between the solution values, if any, of the variables in each
case?
■
■
(i) 2 -1 'xi ■ 20'
.-1 0.5. .X2. .-10.
(ii) ■9 16" 'Xl 1r
1-
1_
CM
.X2 5.
-6 12' 'xi ■ • 24'
4 -8. .X2. .-18.
2 What is the rank of the following:
'1 2 3'
2 4 6
4 8 12
'2a a -0.5 a
2b b -0.5b for a b,c^0?
2c c -0.5c
a b c
2a 3b 4c for 0, b, c^ 0?
2a 2b 2c
6 and 1
-2 4
3 -1
6 1 7 5
A= -2 4 2 -6
3 -1 2 4
'42-1 0.5 -2
-8 -4 2-1 4
3.8 Applications may involve systems with anything from three to several
Higher-order thousand equations. Computer software is available to solve systems of
systems: linear equations and to obtain matrix inverses. While it is quicker and
determinants and less error prone to use packages such as Derive or Maple for larger
the inverse matrix matrices than to work out the inverse by hand, there is value in
knowing something about larger systems. Consider, for example:
We will find the solution to this system using the inverse of the matrix
of coefficients (A). In obtaining the inverse of A we shall first find the
determinant. There are various ways in which the determinant can be
found, including a special procedure for 3 x 3 matrices. Here we will
use Laplace expansion, which evaluates a determinant as a weighted
sum of lower-order determinants. To use this approach we will need
two new concepts. A minor is the determinant of a submatrix of A. We
will need the minors associated with the individual elements, fly, of A.
For the element fly, this is the determinant of the submatrix obtained by
striking out the ith row and the jth column of A. For example, the
minor associated with an is mu where:
mu = -1
mu = 2 and mu = = 1
Matrices 115
given by: ( - 1)''^^ Call the cofactors of elements Cij. The cofactors then
are
Cj; = ( - ly^'niij
The point of this is the following result:
so the determinant is
|A| = 2
2
m2i = so C21 (-l)^^^m2i = l
1
3
^22 = so C22 (-l)' + 'm22 = -6
2
3
^23 = so C23 ( - l)^ + ^m23 = 1
2
So:
The same value results from using elements of the third row and their
cofactors. Similarly, the value of the determinant could have been
obtained by using elements and cofactors in one of the columns. But
note that the sum of products of the elements of one row and the
cofactors of another is zero. The amount of work involved is reduced if
there are zeros in a row or column. For example, the determinant of
2 4 0
B= 5 7 0
3 6 5
\B\ = 5( - 1)^ + ^
cn C21 C31
m3i = = 1 so C31 = ( - = 1
■3
0
The inverse matrix is valuable when solutions are needed for various
right-hand-side values. Incidentally, note that for the order 2 matrix:
uii ai2
> A =
CI21 ^22
0-22 —^21
—ai2 dll _
Exercises 3.8
1 Given the matrix:
'7 5 9
A= 3 8 4
6 2 1
(i) Find the minors of the following elements:
(o) 033 (b)aii (c) 022 (d) 032 (e) 021
‘2 4 3
A = 4 5 1
9 2 4
(i) rZ 9 8‘
A= 1 0 2
5 3 4
3.9 In this section the use of determinants and inverse matrices in the
Simultaneous solution of linear equations is taken further. For a numerical example,
equations (ii) we shall use the inverse matrix approach to solve the system
4xi + 7x3 = bi
7x\ + 3X2 + 5X3 = ^2
4xi + 2X2 + 3X3 = ^3
■34'
1
'
'h'
bo
LO
00
bi 19 and
1
.^3. 35 12 -2_
4 0 7
7 3 5
4 2 3
3 5
mil = = -1 so Cii = ( - l)^^^mii = -1
2 3
7 5
^ni2 = so Cl2 = { - l)^^^mi2 = -1
4 3
7 3
mi3 = = 2 so ci3 = ( - l)^'^^mi3 = 2
4 2
Matrices 119
So the determinant is
I A| = fliiCii + ai2Ci2 + — 4( — 1) + 0( — 1) + 7(2) — 10
mi = = -14 so C21 = ( - = 14
So the inverse is
We can now use the inverse to solve the system of linear equations
for the three sets of right-hand-side values. For the first right-hand side
which implies
bi < 77
and from X2
i.e.
which implies
bi < 87
and from X3
i.e.
which implies
bi > 22
Of the two upper bounds, 77 is the least, and so gives the tightest
restriction, and in conjunction with the one lower bound the overall
requirement on bj for sign feasibility of all variables is that
22<bi<77 (3.9)
Matrices
121
The range (3.9) is the tolerance interval or range of feasibility
for bj. The tolerance interval for X2 can be found by similar means from
the second row of the inverse into the right-hand-side vector, in which
b2 is substituted for the original specific value of 58. The resulting
tolerance interval is
Note that this is a much narrower range than for bi. The feasibility of
the solution is more sensitive to variations in b2 than in bi. This is
likely to be an important concern for management if ^2 is not a control
variable - being, perhaps, the subject of random variation, external
regulation or the actions of competitors. For the right-hand-side
element bj, the tolerance interval works out as
Exercises 3.9
1 Solve the system
2 Given
(i) b] 85
62 185
b3 185
(ii) bi 70
62 50
bs 90
(iii) ibi -4
62 139
bs 78
(iv) b] 9
b2 63
bs 45
bi <^12 fll3
b2 ^22 ^23
bz ^32 fl33
Xi
1^1
flu bi fll3
fl21 b2 «23
fl31 bs ^*33
X2
1A|
flu fll2 bi
fl21 «22 b2
fl31 ^^32 bz
X3
|A|
47 0 7
58 3 5
35 2 3
7
5
= 3( - 104) - 2( - 171)
= 30
123
Matrices
so:
Xi = 30/10 = 3
and Xi will be
4 47 7
7 58 5
4 35 3
7 5 4 7 7
-47 + 58 - 35
4 3 4 3 5
= - 47(1) + 58( - 16) - 35( - 29)
= 40
so
X2 = 4
and X3 is
0 47
3 58
2 35
X3
10
Using the second column to expand, the determinant works out as
4 47 4 47
4 35 7 58
= 3(-48) 2(-97)
= 50
Xi = 3 X2 = 4 X3 = 5
Cramer’s rule is equivalent to use of the inverse matrix but does not
require the complete inverse to find values for particular variables. The
rule is a useful means of solving systems of simultaneous linear
equations. Because it finds solution values for variables individually,
Cramer’s rule is efficient when values of all unknowns are not required.
Exercises 3.10
1 Use Cramer's rule to solve the following linear equations:
\
-9 1 -
■^1 ■ ■53-
.6 2. .^2. .46.
■4 5' ■^i ■ ■26'
.1 3. .^2. 3.
2 3 r 'xi ‘ ■16‘
4 1 5 X2 = 34
2 6 3 .^3. 30
3.11 In this brief introduction to matrices the focus has been on systems of
Concluding simultaneous linear equations. In this chapter and in Chapter 2 we
remarks discussed other methods of solving such systems. No single method is
the most appropriate for all applications, but there are circumstances in
which each technique is likely to work to advantage. Less systematized
approaches, such as elimination or substitution, are quick ways to solve
2x2 systems. They can also be practicable in 3 x 3 cases, where a
convenient feature of the system can be exploited. But informal
approaches are not recommended for larger systems. The use of
determinants, as in Cramer’s rule, has advantages if the values of only
some unknowns are required. Gauss-Jordan elimination, or a variant
making use of elementary row operations, is highly efficient if a one-off,
complete solution is required. If a solution is required for more than
one set of right-hand-side values, or if tolerance intervals for right-
hand-side values are needed, then the inverse matrix is valuable.
The number of arithmetic operations needed to find the solution to a
system of n equations in n unknowns varies roughly with the cube of n.
In comparison to a system of order three, over twice as much work is
involved in solving a system of order four, and about five times as much
work is required to solve a system of order live. So it is as well that
there are software packages (such as Derive, Maple, MathCad, etc.) for
evaluating determinants, finding inverse matrices and solving systems
of linear simultaneous equations. On a modern PC inversion of a
100 X 100 matrix takes but a few seconds. Mathematics software is
well worth investigating if you do much work in this area.
125
Matrices
CHART E R
LEARNING OUTCOMES
After reading this chapter you will be able to solve quadratic equations
and mixed systems of one linear and one quadratic equation. You will
learn principles of curve sketching, and consider business and economic
applications of quadratic, cubic and rational functions.
4.1 Managers and economists deal with measurable entities - for example
Functions prices, quantities, rates of interest, inflation and growth, profits and
rates of taxation - and try to understand the relationships between
them. There are many possible objectives, including predicting the
consequences of decisions, influencing or controlling outcomes, or
ideally optimizing the values of key decision variables. Where an
optimization model is used the objective will be to achieve the best
value of some measure of performance or effectiveness. For example, a
firm may wish to maximize its profits, or a contractor may wish to
minimize cost or the completion time of a project. The concept of a
function arises in most theoretical or empirical studies of relationships
between variables whether or not the relationships are initially
expressed in symbolic terms.
A function can be seen as a rule which links together a given
number, ac, and another single number, the function of x - written as
f(x) and read as f of x’. Formally, a function is a relation between two
sets that associates a unique element of the second set with each
element (or n-tuple of elements in the case of functions of several
variables) of the first set. If the rule represented by /(x) gives more than
one value for a given x, as for example when a square root is taken, in
strict terms it does not satisfy the definition of a function - which
/(x) = 2x + 5
/(x) = x^ — 10
y = — 3x — 10 (4-1)
x^ — 3x — 10 — 1/ = 0
y = x^
This case is illustrated in Figure 4.1(a). The domain is the real numbers
for which the operation of squaring can be performed - all real
numbers. Since the square of a real number cannot be negative, the
range of the function is, by its own nature, restricted to the positive real
numbers and zero.
In the second case the function is
y — +V4: — 2x
(meaning that only the positive square root is involved in the function).
The domain and range are
y = 1.2x + 7
^ Range
0
>■
X
(b)
which is graphed in Figure 4.2. Solid disks show points in the range
where the form of the function changes. An unfilled disk indicates a
point where the value of /(x) is not included in the range. So with the
uppermost line segment the end-point at x = 15 is excluded.
Figure 4.2 fM A
>■
X
Exercises 4.1
1 In the absence of other restrictions, find the domain for the
following functions: ^
(i) y = (x+10)^ (ii) y = (40-2x)2
2 What is the range for the following functions:
(i) y = x^ (ii) y = 2x2 - 10
3 With the given domains, find the range for the following
functions:
(i) y = 3x-15 domain:-5 < X < 20
(ii) y=100 —5x domain:—10 < X < 100
(iii) y = 250 - 1 Ox domain: -200 < x < -100
4 Do the functions below have step discontinuities against the y
axis?
f/ 1 / X + 10 for X < 30
^ 1 2x — 20 for X >30
131
Functions and turning points
5x + 5 for 0 < X < 15
2x + 60 for X > 15
y = ax^ + bx + c (4.2)
y = ax^ +bx-\-c
Figure 4.4
y = ax^ + c
Here, c gives the y intercept, and the effect of varying it is to shift the
whole parabola up if c increases or down if c is reduced.
y = ax^ + bx + c
move the vertex of the parabola relative to both of the axes, although
the parabola remains upright. As b is decreased, the algebraic value of
the slope is decreased. So if at a point the slope is positive to begin
with, the curve becomes less steep. If the slope is negative, then the
curve will decrease more rapidly at the given point. While variations in
the value of c move the parabola relative to the vertical axis, and
changes in b move the parabola relative to both axes, the curve can be
shifted to the left or to the right alone by replacing x with x - k (which
produces a shift to the right) or with x + k (to produce a shift to the
left), where k > 0 in both cases. Making such a change to the
independent variable affects both b and c in such a way as to give a
horizontal movement only.
These shift effects can be confirmed with the parabola
y = ax^ + bx + c
which shows the changes to b and the constant term needed to effect a
shift of the whole curve k units to the right. Consider the simplest
parabola, y = x^. A one-unit shift to the right is represented by
= (x — 1)^ = x^ — 2x -I- 1
Figure 4.6 illustrates the shift as a two-part process. Figure 4.6(a) showing
the original function, y = x^. Figure 4.6(b) shows the effect of the
subtraction of 2x: this moves the parabola across and down. In Figure
4.6(c) the constant. 1, has been added, with the original function indicated
by the broken line. The overall effect is a lateral movement only. The effect
on the slope of the function for any given value of x can be seen in Figure
4.6(c). For any value of x less than i, the function y = x^-2x+lis
further from its vertex than is y = x^, and is steeper in absolute terms. For
any value of x greater than ^ the function y = x^ — 2x + 1 is nearer to its
vertex than is ^ = x^ and has a lesser slope.
The X intercepts are important in most applications. These are the
points where y = 0 and are the zeros of the function, and the
corresponding values of x are the roots of the equation
ax^ + bx-h c = 0
x^ - lOx + 21 = 0
factors as
(x - 3)(x — 7) = 0
X = 3 and x = 7
y — X^ — lOx + 21
-b ± (b^ - 4ac)2 ^
x =--- 14. J
2a
The quadratic formula can always be used to solve for the exact values
of the roots, and can be slightly rearranged as
—b (b^ — 4(ic)2
^ ^-
2a 2a
which shows more clearly that the two roots of the quadratic will be
equal distances of
(b^ — 4flc)2
2a
on either side of the vertex. The vertex itself, as suggested by the fact
that it lies on the axis of symmetry, is located centrally between the
roots at -~b/2a. The expression
b^ — 4ac
is the discriminant.
Here the discriminant is positive and the quadratic has two distinct real
roots. In geometric terms, the parabola crosses the x axis at two
separate points. As an example, consider the quadratic equation
y = — 3x — 10
(x + 2){x — 5) = 0
Figure 4.7
Case 2: — 4ac = 0
A zero value for the discriminant means that the two roots are equal
and, as can be seen from the quadratic formula, will be at the vertex
—b/2fl. This is the case of double roots. In an equation of higher
degree, a root occurring three times is a triple root. In general, a root
occurring n times, where n > 2, is said to be a multiple or repeated
root. In contrast, a single root is sometimes called a simple root. A
case of a quadratic with double roots is illustrated in Figure 4.8, which
is drawn for the equation
y = x^ — 6x + 9
(x — 3)(x — 3) = 0
y = + 6x + 13
Figure 4.9
that is
X = — 3 + 2i
X = —3 — 2i
1 The y intercept. This is the point where the graph cuts the
vertical axis, and is found by setting x equal to zero.
2 The X intercepts. These are the points, if any, at which the graph
cuts the horizontal axis (so that y is zero) and can be found by use
of the quadratic formula.
3 The turning points. The graph of a quadratic has a single
turning point, the vertex of the parabola at x = —b(2a. This result
is convenient for quadratics, but calculus methods are usually
required to locate the turning points of other functions.
4 Extreme x values. These are used to show the ultimate directions
in which the function is heading. The function is worked out for
very large positive and negative values of x.
5 Other x values. Used as necessary, depending on the particular
case and the accuracy required, a selection of x values not covered
by points 1-4 above is plotted.
1/ = — 4x — 21
1 When x = 0,^ = —21, so that the y intercept lies below the x axis.
The co-ordinates of this point are (0, —21).
2 When y = 0 the quadratic formula (or factoring) results in
X = —3 and x = +7
-(-4)
— +2
2(1)
X y
-2 -9
-1 -16
-|-3 -24
+5 -16
Figure 4.10
For parabolas in general much can be learnt from the value of the
leading coefficient.
139
Functions and turning points
1 If a > 0 the parabola opens upwards and the vertex is a minimum
of the function.
2 If fl < 0 the parabola opens downwards and the vertex corresponds
to a maximum.
— X — 870 = 0
X = 30 and x = —29
The interest of this equation is that its solution was first obtained 3900
years ago in Mesopotamia. It arises from a problem stated on a
Babylonian tablet dated 1900 bc. The problem was expressed as that of
finding the side of a square the area of which exceeds the side by 870.
Babylonian algebra could find the positive root or roots of a limited
number of quadratics. This amazing civilization was also able to solve
some equations of higher degree.
Consider an example of quadratic equations and parabolas in
business management. Suppose that a firm’s costs, C, are given by
C = 4q + 450
in which q represents both output level and sales. The product is sold at
a price, p, under demand conditions given by
p = 50 — O.lq
R = pq
= 50q — O.lq^
a = —0.1 b = 50 c= 0
7r = R-C
= 50q-0.1q^ -4q-450 (4.5)
q = 10 and q = 450
These are the intercepts with the horizontal axis. The lower of these
levels (with co-ordinates (10, 0)) is the breakeven output level, while the
other zero is a point of theoretical interest only, being well past the level
at which a firm would choose to operate in practice. The vertex of the
profit parabola is at q = 230, at which the maximum profit of
TT = 4840 is made. The revenue and profit parabolas are shown in
Figure 4.11.
Figure 4.11
The profit curve always peaks at a lower output than the revenue
curve (so long as costs always increase with output level). This will not
be proved here, but it is a useful exercise to see how the subtraction of
the cost expression affects the values of b and a (and hence —b/2a) in
the profit equation as compared to the turnover expression.
We have already considered the solution of systems of linear
equations. Systems which contain even one equation with a single non¬
linear term are non-linear systems and are much less tractable than
linear systems — statements relating the number of solutions to the
of independent equations and unknowns no ionger apply.
141
Functions and turning points
Non-linear systems are less widely used in applications than linear
alternatives and we will confine the presentation here to simple cases.
4.2.1 Consider the solution of a mixed system of one linear and one quadratic
One linear and one equation. For example, suppose that
quadratic equation
4:X + 2y = 20
y= - 7x + 16
y = 10 — 2x
10 — 2x = x^ — 7x + 16
— 5x -1- 6 = 0
X= 2 and y= 6
X= 3 and y = 4:
2x + y = 7
and
y — x^ +4:X + 20
The result to be obtained is not one which could occur with two linear
equations. Substitution of 7 — 2x for y into the original quadratic
equation produces
x^ -1- 6x + 13 = 0
which has no solution in real terms. The solution values (as may be
confirmed) are the complex numbers
y = 3x^ — lx-2
and
1/ = — 2x + 16
subtraction gives
0 = 2x^ - 5x- 18
X = —2 and x = 4.5
y = 24 and y = 27.25
D : p = 90 — 2q
where p = price and q = output (sales), and with market supply given by
S: p = 20 + 0.5q^
Figure 4.12
p^90-2q
= 20 + 0.5q^
0.5q^+2q-70 = 0
+4^-140 = 0
which factors as
{q-10){q+14) = 0
q — 10 and p = 70
Note that the other solution to this mixed system (which does not
correspond to a practically realizable situation) occurs at the values
D : p = 200 — 2.5q
C= 1600 + 1.5g^
Breakeven is the output level where sales revenue first covers costs. This
is where
R = 200q-2.5q^
= 1600 + 1.5g^
= C
thus
so that
iq-10)iq-40) = 0
giving
q = 10 and q = 40
> 3x + 10
Rearrangement gives
x^-3x-10>0 (4.6)
X = 5 and x = —2
The function
/(x) = x^ — 3x — 10
— oo < X < —5
2 < X < oo
Figure 4.14
4x -F 35 > 14 + x^
-x^ + 4x + 21 > 0
-3 < X < 7
where x is output level. The firm sells in a market where price is set at
p = 210. Management regards the product - one of several it makes -
in strategic rather than immediate profit-making terms, but it is not
company policy to sell anything at a loss. Given these conditions, what
range of output levels is available to management.? The requirements
mean that the range must be such that unit costs do not exceed unit
revenue: that is
so that
(x-20)(x- 140) = 0
Figure 4.15
Ax^ + Dx + Ey + F — 0 (4-7)
in which not all of A, B and C can be zero, and in which all coefficients
are real numbers. The general form of a parabola with axis parallel to
the y axis emerges when B and C are zero. Parabolas symmetric about
axes not parallel to the y axis are generated when
B^ - 4AC = 0
and where the values of A, B and C are not all zero. For example, if
A = 0, B = 0 and D = — 1 the resulting equation can be written as
X = Cy^ + Ey + F
which describes parabolas with axes parallel to the x axis, and which
open to the right or to the left. The simplest such parabola is where
E = F= 0 and C=1
in which case
x = y^ or y = ^/x
A = C and B = 0
an ellipse is produced if
B^ - 4AC < 0
’ - 4AC > 0
Ax^ = 0
and the y axis is produced - a conic section can produce a straight line
rather than a curve, but such a section is said to be degenerate. In
the event that A, B and C are all zero, the equation (4.8) reduces to the
general linear equation - which is contained as a special case.
Exercises 4.2
1 Find the roots of the following quadratic equations;
(i) x2 - 7x + 12 = 0
(ii) x^+x-30 = 0
(iii) x^ + 9x + 14 = 0
(iv) x^ - 16x + 64 = 0
(v) x^ + 1 8x + 81 = 0
(vi) x^ - 7x + 0 = 0
2 Find the roots of the following quadratic equations:
(i) 2x2-14x +24 = 0
(ii) x2-2.3x+1.32 = 0
(iii) 2x2 _ 3^ _ 9 _ Q
(iv) x2 + 99x - 100 = 0
3 Find the roots of the following equations and comment on the
results:
(i) x2 -20x+ 100 = 0
(ii) x2+9 = 0
(iii) x2 - 10x + 29 = 0
4 Find the solutions to the following systems of one linear and
one quadratic equation:
(i) 4x + y = 20
y = x2 - 25x +118
(ii) 6x + 2y = 100
2x2-y-68x +550 = 0
p = 700 — 5x
C= 150 + 40x + x2
(i) What output level maximizes the firm's revenue from sales,
and what is the maximum revenue level achievable?
(ii) What output maximizes profit, and what is the maximum
profit made?
8 If market demand is given by
p = 720-4q
p= 160 + 0.25q2
p = £10500
y = ax^
Figure 4.17(a) is drawn for a > 0, while Figure 4.17(b) shows the
appearance when a < 0. If a = 1 the resulting function is
y = x^
y = — 2x^ — 24%
Note that with no constant term the graph of this cubic passes through
the origin, which is one of the three zeros of the function.
Figure 4.18
y = x^ — 2x^ — 24x
because of the absence of the constant term the right-hand side factors
as
y = x{x^ — 2x — 24)
X = 0 or X = —4 or x = 6
The appearance of this function, with the (local) high point before the
low point, is a property of all cubics in which the leading coefficient is
positive. If a < 0, the cubic with turning points will appear as in Figure
4.19, although the actual positions of the turning points need not be as
shown. The cubic in Figure 4.19 has one real positive root (where the
curve crosses the x axis) and two complex roots.
Figure 4.19
z = x^
z^ - 125Z + 2500 = 0
which factors as
so that
z = 100 or z = 25
and therefore
= 100 or = 25
90003C^ -
for the domain x > 0. The profit function is as shown in Figure 4.22.
Substitution confirms that breakeven is at x = 30. The other zero in
the domain is at x = 90. (4.13) can be expressed as a quadratic in
z = x^. This is
-9000
z= = 4500
-2
So in terms of the original independent variable, the maximum
occurs when
X = v/4500 « 67
Exercises 4.3
1 Find the x intercepts of the following cubic functions:
(i) y = — 5x^ — 50x
(ii) y = x^ - 2x^ + X
2 Find the x values which satisfy the following quortic equations:
(i) x^ - 13x2 + 36 = 0
(ii) x^-37x2 + 36 = 0
3 A firm's total variable costs are given by:
VC = x^ - 90x2 ^ 4000x
5000x2 _ ^4
2x5-486 = 0
solves for x = 3, which is the only real root of the equation. The sixth-
degree equation:
x^ - 9x5 + 8 = 0
y = 7x^ + 3x5 5 + 2x -f 6
Exercises 4.4
1 Which of the following ore polynomials?
(i) y = - 4x^ + 2x + 1
(ii) y = x^ - TTX^ + 2x^ + \/2x
(iii) y = x^ - 5x^ + 3x“^ 4- 2x + 1
(iv) y = 4x - 1
(v) y = x^ + x5
(vi) y = x'00
2 State the degrees of the following polynomials:
(i) f(x) = 4x^ - 6x^ + 2x
(ii) fjxj = 1000-2x3-0.001 x^
(iii) y = 100 - X
(iv) y = 7
P(x)
fix) =
Q(x)
a„x'' + a„-ix''~^ +an-2x'‘ ^-h aix^ + aix + ap
(4.15)
bmX"’ + br„-lX'”-^ + br„-2X'^-^ ■■■ + b2X^ + biX + bg
where P(x) and Q(x) are polynomials and where f(x) is defined for
Q(x) 7^ 0. So, for example, the following are rational functions
— 8X3 _|_ yj
(i) f(x) =
6x4 _ 7
x^ — 1
(ii) fix) =
X — 1
(iii) fix) = ^
X
(x + l)(x - 1)
Figure 4.23
0 X
xy=l
(x + 2)(y + 3) — 1 = 0
which expands as
xy + 2y + 3x + 5 = 0
> 4AC
A = C = D = E = 0, B = +l and F=-l
7 4
3x^ -F Sx -f-
f(x) = ax + -
X
P=l {4.17)
0 <q<
{q + a){p + b) = c
n c — ab
“ =—
while the lowest price that will choke off all demand is where
n c — ab
p a
The fact that consumer expenditure varies along this curve can be
deduced from the fact that, while revenue is zero at and it is
strictly positive at points in between. Suppose that the demand curve is
given by
(g + 5)(p + 8) = 240
240 - 40
4 = = 25
8
0 240 - 40
P = 40
5
_ 1 000 000
AFC =-
<1
might represent average fixed costs for an organization, and the graph
of the branch of the hyperbola for <? > 0 is shown in Figure 4.27.
Figure 4.27
Low levels of output mean that the price of the product must reflect
a disproportionate element of overhead, particularly if variable costs are
not great. A classic example is oil production, and other examples are
found in utilities such as railways. In these instances, increasing output
level will mean that prices can be sharply reduced or high short-term
profits made if the producer is a monopolistic provider.
Exercises 4.5
1 Find the location of the asymptotes for the following
rectangular hyperbola.
xy-y-2x + 2 = 0
4.6 This section introduces other types of function that can arise in a
Other functions quantitative study of the management and social sciences and to which
we may later need to refer. These are
• composite:
• inverse:
• discontinuous:
• discrete:
• homogeneous:
• monotonic:
• odd and even:
• periodic:
• absolute value functions.
y = 2x + 1 (4.18a)
z = 3y - 5 (4.18b)
z = 3(2x + 1) — 5
= 6x — 2
y = 5x + 4:
z = 3y^
so by substitution
z= 3(5x + 4)^
z=f{g{x))
g{x) = 5x -h 4
w = 5x -t- 4
y = 2w — 6
z = 0.5y + 7
it follows that
y = lOx -h 2
z = 5x -I- 8
4.6.2
Inverse functions
The inverse of a function is the function which ‘undoes’ the
operations of the original function. For example, if the operation of the
original function is ‘doubling’, the inverse function will be ‘halving’; if
the original function is ‘squaring’, the inverse function is ‘taking the
square root’. Suppose the original function is doubling and adding 5.
The inverse function is then halving and subtracting 2.5. These
operations can be written as
C = 2q+5 (4.20)
q = 0.5c-2.5 (4.21)
q= 1000-0.25p (4.22)
p = 4000-4q (4.23)
y=f{x) (4.24)
*=r‘W (4.25)
where in (4.25) f~^ represents the inverse function. This does not mean
the reciprocal of the original function [l//(x)]. In (4.20) and (4.21), as
in (4.22) and (4.23), the same underlying relationship is being
described. However, if the roles of dependent and independent variables
are exchanged and the function is inverted as a separate step, the two
variables retain their original roles and a different functional
relationship results. For example, starting with
y = 2x + 5 (4-26)
X = 2i/ + 5 (4.27)
Now invert
Wx
y = 0.5x-2.5
/ = X
Figure 4.30
In Figure 4.30, C represents cost, and two price breaks are shown at
sales levels qo and qi. A step function is one which takes different
Figure 4.31
->■
X X
(a) (b) (c)
per week. Even when the product is mass produced and costs only
pence, the practice of packaging items for sale in quanta other than
one unit will present the customer with a discrete unit cost function.
For example, Figure 4.32(c) might represent the average cost to a
consumer of items retailed in packets of two, four or six units.
y = mx
m(px) = pmx = py
y = ax^
y = x^ — 2x
so, with the positive root of p also exclusively taken, the square root
function is homogeneous of degree one half. The degree of homogeneity
need not be a positive number. For instance, y = 1/x is homogeneous of
degree —1. In the case of functions of several variables, degree-zero
homogeneity can be a convenient property. To illustrate, consider the
breakeven formula
tF _ tF
tp — tb tip — b)
F 0
=-r = q = qt°
p— b
X2 > Xi
f{X2)>f{X\)
4.6.7 A function is an odd function if it changes sign but not its absolute
Odd and even functions value when the sign of the independent variable is reversed. So, for any
odd function
f{-x) = -fix)
y = 7x^ + 6x
/(- =fix)
and the graph of an even function is symmetrical about the y axis. An
example of an even function is the bi-quadratic
y = —— 3x^ — 10
Figure 4.33
y = \f{x)\
Absolute value functions are plotted only in quadrants one and two,
and will change abruptly when the function would otherwise have
extended below the x axis into quadrants three and four. Figure 4.34(a)
shows the graph of y = \ x\, while Figure 4.34(b) represents
y = \ x^ — c\.
» Absolute values are involved in calculations of distance and are an
important concept in computer science. In statistics, absolute values
are used in one measure of the deviation of a variable from its average
value - the mean absolute deviation (MAD). In management science,
such measures of dispersion themselves find use in adaptive
forecasting methods - as applied, for example, in some models in stock
control.
Exercises 4.6
1 Express z as a function of x where
(i) y = 5x - 6 z = 3y + 4
(ii) y=100 —4x z= 100 + 4/
2 Express z os a function of x where
(i) w = 3x + 4 y = 2w + 5 z = 4/— 8
(ii) w=10 —2x y = 3w —50 z= 100 — 4/
3 Express z os a function of x where
(i) / = 2x + 5 z = y^-4/+10
(ii) / = x-2 z =/^+6/^ + 12/ +8
4 Find the inverse function stating x as a function of / where
(i) / = 4x+12
(ii) / = 20x — 50
(iii) / = 0.025x - 0.5
(iv) y = +Vx
(v) / = 200-10/x
5 Establish whether or not the following functions are
homogeneous and, if so, state the degree of homogeneity in
each case:
(i) / = 6x
(ii) / = 6x + 5
(iii) / = 3x^
(iv) / = x(3x + 4)
(v) / = X2
(vi) / = 1 /x2
6 Are the following odd functions, even functions or neither?
(i) f(x) = 4x^ + 2x^ + 1 Ox
(ii) f(x) = x"* + 3x^ + 10
(iii) f(x) = x*^ + x^ + 10
LEARNING OUTCOMES
By the end of the chapter you should be able to find derivatives for
many polynomial and rational functions, understand their uses in
management and economic applications, and find local and global
optima in a variety of cases.
5.2 Consider the function of Figure 5.1. The expression, if any, that would
Slope and turning produce a function of this appearance would be of daunting complexity,
points but it is only the graph that is of interest here. For values of x which are
large in absolute terms the function has the x axis as an asymptote and
approaches this from above. The function has seven turning points, at
values of x from xi to X/. It is clear that Xi, X3, X5, and Xy produce
values of the function which are the largest in their own neighbourhood
- if not necessarily overall. The values of the function at these points are
said to be local maxima. The values of x = X2, X4, and xe produce the
lowest value of the function in their immediate vicinity. The values of the
function at these points are local minima. Turning points can be either
maxima or minima and are called stationary points or extreme
points. The term ‘stationary point’ highlights the property common to
aU turning points - the fact that at the precise point the function is
stationary - it has zero slope. If a function does not have zero slope at
a point it must increase for a change in x in one direction (either an
increase or a decrease) and fall as x changes in the other direction. So
the function cannot have either a maximum or a minimum at the point
itself.
Figure 5.1
The first step requires a means by which to determine all values of x for
which the function has zero slope. This could be done if
5.3 To move to finding an expression which gives the slope of the function,
An approach to the consider the most convenient non-linear function, the squaring
derivative function:
y = x^
Take any point P on this function. As shown in Figure 5.2, the slope of
the function at point P is approximately equal to the slope of the
straight line connecting P and the nearby P° (a wide separation of P
and P° is shown for illustrative purposes).
Let the value of the independent variable at P° be x + Ax, where
Ax, read as delta x, means a small change in x. The value of the
function at P is x^ and at P*^ it is (x + Ax)^. Thus the change in the
value of the function as x changes to x + Ax will be Ay, where
Ay = (x -F Ax)^ — x^
The slope of the line PP'^ (which approximates the slope of the function
at P) is
pOn
slope of PP° =-
^ PQ
Ax
This ratio will now be used to describe the slope of the line segment
PP°. From what we have already established we can write
Ay (x + Ax)^ — x^
Ax Ax
x^ + 2xAx + (Ax)^ — x^
Ax
2xAx + (Ax)^
Ax
so that
Am
— = 2x + Ax (5.1)
Figure 5.3
^ = 2x=r{x) (5.2)
dx
Each scheme has its advantages and we will use them both. The
dy/dx notation for the derivative compares most directly with the ratio
Ay/Ax, which gave the slope of the chord. This is Leibniz’s notation.
In a similar vein, the derivative of /(x) can also be indicated by
d/ d/(x)
dx dx
D/(x) or DJ{x)
y = x^ dy/dx = 3x^
y = x'^ dy/dx = (5.3)
y = x^ dy/dx = 5x^
y = x^ dy/dx = 1
n (5.4)
y — x^ dy/dx = 0
Exercises 5.3
1 Using the Ay/Ax approach above, find the derivative of
y = 5x^.
5.4 Our results so far invite the conclusion that to obtain the derivative of x
The power function raised to any power, the power is reduced by one and the ‘old’ power
rule becomes the coefficient of x in the derivative. That is, if
1 / = x"
n-l
(5.5)
y = x^
= -2x-^
dx
dy ^ -2
dx:
y=
dv
Here we have found the derivative of = i/x. Again the first step is to
convert the expression to a form to which the rule can be applied. So for
y = yjx
dx 2^Jx
f{x) =
f{x) = x^/^
= I.5y/X
fix) =
fix) = \/2x'^^~^
dx
y =
—■= 12x^
ax
and
fix) = -27.5%-^
fix) = llOx-5
Consider the special case of the power function rule where n = 0. For
y = cx°
fix) = 0
Exercises 5.4
1 Find the derivative for the following functions:
(i) y = x^
(ii) y = xioo
(iii) y = X-3
(iv) y = 1/x^
(v) y = xl/3
(vi) y = -X-2/3
(vii) f[x)
= X^3
(viii) f(x] = -l/x^5
(ix) f{x) = x2^
(x) y = 5x8
(xi) y = 0.25x^
(xii) y = -0.2/xi
(xiii) f[x) = lOxO
2 Find the derivatives for the following:
ID f|x| = x’
|ii| f|x| = X-’
(iii) f(x) = -x“®
(iv) y = 1 /x'°
(v) y = 1 /1 Ox
(vi) f(x) = x^"^
(vii) f(x] = -x~'^'^
(viii) y =-10/10x10
(ix) f(x) = x^’^
^=f\x)±g'{x) (5.6)
y = 8x^ + x^
32x^ + 3x2
dx
y = 7x^ — 9x2
35x^ - 18x
dx
= x5-4x^-5+ V2.x^^-x“^
R=50x-0.1x2 (5.7)
dR
_=50-0.2x (5.8)
dx
^ = 50 - 0.2(100) = 30 = MR
dx
Now repeat the calculations for a level of output where the unit
change is a greater proportion of the initial output level. At x = 2
equation (5.8) gives the marginal revenue as 49.6. Table 5.2 shows the
finite change calculations around the initial x = 2 position. The finite
measures are an even better approximation accurate to one-fifth of a
percent of the rate of change figure. The significance of this high
precision is that where the derivative is easy to calculate it can be used
to assess the consequences of finite changes. Where the derivative is
hard to find, finite measures may approximate well enough to the rate
of change at a point. As we have seen, to give a stationary value of the
function the derivative must be zero. In the present example the one
stationary value is a maximum taken for a value of x satisfying
dR
= 50 — 0.2x = 0
dx
so that:
X = 250
Sales of 250 units per period will produce revenue of 6250 (by
substitution of x = 250 in (5.7)). Figure 5.4 graphs the problem.
Table 5.2
X TR TR(x) - TR(2)
2 99.6 0
3 149.1 -649.5
1 49.9 -49.7
7r = 46x-0.1x^-450 (5.9)
Using the knowledge that the profits parabola opens down, a maximum
of profit occurs when the derivative of (5.9) is zero. That is
dTT
— = 46-0.2% = 0 (5.10)
from which
X = 230
n = R-C
dTT dR dC
(5.11)
d% d% dx
SO
(5.12)
dx dx
MR = MC
MR = 50-0.2(230) =4
MC = ^ = 4
MR = 2 MC = 4
so the firm could increase profit by reducing output, as the rate of loss
of revenue (2) is less than the rate of reduction of cost (4), so the excess
of revenue over cost must rise. An alternative description of marginal
cost as escapable cost comes to mind. Again, if an output level (225
units) was set such that
MR = 5 and MC = 4
-1
y = x^ -90x + 4000 + 25 OOOx (5.13)
x^ — 90x + 4000
25 000
become ever smaller. The firm wishes to know the level of output, x*,
for which unit cost is lowest. This is the output where its most
competitive price can be charged without making a loss. While this is
not likely to be the most profitable point in the short term, it can be
important in aggressive competition. In theory a minimum unit cost
position such as this should result under conditions of pure competition
in the long run. Knowing the general shape of the curve, the minimum
unit cost position is found by setting the derivative of (5.13) to zero.
That is
dy T
90 - 25 000X-2 = 0 (5.14)
— = 2x
dx
This equation has one real solution at x* = 50, the other two roots
being complex. An output of 50 units minimizes unit cost at £2500.
It is a useful exercise to compare this value of x = 50 with the
output level at which unit variable cost reaches a minimum. Given
the shape of the unit fixed cost hyperbola, unit variable cost always
reaches its minimum at a lower output level than unit cost as a
whole. In the present case unit variable cost reaches its minimum
value at X = 45.
Exercises 5.5
1 Use the sum-difference rule to find the derivative for the
following:
(i) y =
(ii) y =
(iii) y = -f x^ — x^
5.6 It is often necessary to find the derivative of expressions which are ratio
The product and or the product of two or more functions, each of which may be easier
quotient rules to differentiate than the expression as a whole. So if the derivative of
the whole can be set out in terms of the derivatives of its parts a more
convenient method will result. Here we consider two rules for
differentiation: the product rule and the quotient rule.
5.6.1 For a function which is the product of two functions, the product rule
The product rule states:
fg' + af
Begin with an expression for which the result is already known. The
cubing function y = x^ can be represented as the product of two
functions,
y = x^x
^ = x^l + x2x
ax
= + 2x2 = 3x2
y = 4x2(x2 + 2x^)
/(x) = 4x2
g{x) = (x^ + 2x^)
The unsimplified form of the derivative may well suffice. If the aim is to
find the numerical value of slope it is usually quicker to use the
unsimplified form. The product rule also applies to expressions involving
the product of more than two functions. For example,
y = f{x)g{x)h{x)
^=fgh'+fg'h+f'gh
Thus for
y = 2x(3x - l)(4x + 2)
dll
~ = 2x(3x - 1)4 + 2x3(4x + 2) + 2(3x - l)(4x + 2)
QX
5.6.2 As its name suggests, the quotient rule applies to the differentiation of
The quotient rule the ratio of two functions:
fix)
The quotient rule Where y =
aix)
dt/ _ gf -fg'
dx g^ (5.16)
dy x^5x^ — x^2x
dx
5x^ — 2x^
x"^
_ 3x^
x^
= 3x^
X^ + X
y = 3o,+ x^2
x^ — 2x^
4-x7
Here
1 .u
y= then — =-Y
m dx
1 j dy —2x —2
y = —r and — = —^ = —r
x^ dx x^ x^
which confirms the result already obtained by other means.
(5x^ + 2x7)
(ii) y (3x4 _ 2x3)
X
(iii) fN (7x3 + 5)
r \ fi \ Tlx^ - 5x + 3
H Hx =-3-
3 (i) Use the quotient rule to find the derivative of
f(x} = (x^ - 1 )x“'
Can your result be confirmed by proceeding in a different
way?
(ii) Find the derivative of f(x) = (x^ - 2x^)(x3 + 2x)(3x - 5)“^
and evaluate the slope of the function f(x) at
(a) x=l
(b) x = 2
y = (x^+x)^ (5.17)
x^ + x = g{x)
y=
So (5.17) is a function (cubing) of a function (adding its square to the
original value) and is thus a composite function. The second operation
we will refer to as f{x), and the chain rule can be stated as
(5.18)
dx dg dx
y = x^
could be expressed as
y=
for which, in terms of the chain rule,
~ = 2x and ^ = 3[g{x)]^
dx dg
so from (5.18)
dy
= 3[x2]^2x
dx
= 3x^2x
= 6x^
dy
= 3(x + x)
d^
and
dg
= 2x + 1
dx
^ = 3(x^ + x)^(2x + 1)
dx
+ (x + - x)^(2x - 1)]
where the chain rule is applied within the product rule. It may also be
necessary to use the chain and quotient rules together. As an example,
consider
(2x- 1)^
^ “ 7x2
for which
The chain rule also applies to functions where bracketed terms are
raised to non-integral powers. Consider
y = (2x + 1)3
dy
1
(2x + 1)2
As usual
d^ _ dy dz
dx dz dx
= 62
dz
and
dz
— = 43C
dx
so that
624x = 24zx
dz
24(2x^)x = 48x^
y= 3(2x^)^ = 12x^
so that
^ = 48*’
dx
The chain rule can also be applied to find the derivative dy/dx where
there are several ‘intermediate’ variables. For example, if
dy dy dz dw
dx dz dw dx
y = z = w^+w w = 5x^ + 1
dy
= 2z(2w + l)(10x)
dx
dy
= 2(w^ + w)(2w + l)(10x)
dx
which simplifies to
Exercises 5.7
1 Use the chain rule to find the derivative for the following:
(i) f(x) = (x3 + 1)^
(ii) f(x) = (x4+x2)^
(iii) f{x] = (4x^ + 3x^ - 7x2 ^ 2xf
(iv) f(x) =5(1 -x^)^
(v) y = (x2 -h x)^ -f (x^ + 2x)^
(vi) y = (6x^ — 5x'^ -K 6x2)*^ - (x® — x^)^
2 Use the chain rule with the product rule and/or the quotient
rule to find the derivatives for the following:
(i) y = (x®-1-x'^l^lx^ - x®)*^
and X =f~^iy)
%^ 1
(5.20)
dx dx/dy
where dx/dy 7^ 0.
Take an example that can be checked by other means:
X = 0.25y — 3 (5.21)
As the inverse function always exists for straight lines (with defined,
non-zero slope) we can find the derivative dx/dy = 0.25 and apply the
rule to produce
dy 1 1
= 4
dx dx/dy 0.25
y = 4x -h 12
for which
dx dx/dy
1
2y
for which
^=3y^ + 18y + 39
dy
_1_
dx 3y^ + 18z/ + 39
Before using the rule we should be satisfied that it applies. That the
slope of (5.19) is always positive follows from the fact that the roots of
dx/dy are complex - the graph does not cross the axis so the derivative
(the slope of the original function) does not change sign.
Exercises 5.8
1 Use the inverse function rule to find the derivative dy/dx of the
following:
(i) x = 4 + 0.2y
(ii) 0.25x = 0.125/-2.5
(iii) X = y^
5.9 Much work using functions of a single variable involves the form where
Implicit one variable is the subject of the equation, as is y in
differentiation
y=fix) (5.22)
y -f{x) = 0
fix, y) = 0
d(xy) dy dx
dx
dy ,
x^ + y
dx
— 5x differentiates with respect to x as —5 and the term +4 has zero as
its derivative. The derivative with respect to x of the term —by is
dy
-6
dx
So the derivative with respect to x of (5.23) as a whole is
dy , ^ dw
xj- + y 6^ = 0 (5.24)
dx dx
(5.24) is an implicit function involving dy/dx which can be rearranged
to state dy/dx explicitly as
y
(5.25)
dx X
y(x — 6) — 5x + 4 = 0
from which
5x — 4
y (5.26)
X — 6
dy (x — 6)5 — (5x — 4)
dx (x — 6)
-26 (5.27)
(»: - 6)^
At first glance, (5.27) does not look a great deal like equation (5.25),
but substitution for y from (5.26) into (5.25) results in
^ 5x — 4
dy_ ~ x-6
dx (x — 6)
-26
“ (x - 6)^
T dy
2y^
dx
The derivatives of the other terms follow and the overall result is
2x^y ^ + 2xy^ + 3 ^ — 4 = 0
dx dx
which rearranges as
dy 4 — 2xy^
dx 2x^y + 3
Exercises 5.9
1 Find the derivative dy/dx for the following implicit functions:
(i) 4x-5y+10 = 0
(ii) 1 Ox + 5xy -2y - 100 = 0
2 Rearrange function 1 (ii) above to state y as an explicit
function of x and obtain an expression for dy/dx using the
quotient rule. Confirm that the result agrees with 1 (ii) by
evaluating the expressions at
(i) x=10 y = 0
(ii) X = 1 y = 30
3 Use the implicit function rule to find the derivative dy/dx for
the following:
(i) f(x, y) = 5y - 2x + xy^ = 0
(ii) f{x, y] = 2x^y^ - 3x^ + xy - 7y = 0
y = 2x‘^-7x^+4:X^ + 8x + 20 (5.30)
o 3
— = 8x 21x^ + 8x + 8
dx
Second derivative:
d^^ 7
^ = 24x^ -42x + 8
dx^
Third derivative:
d^y
^ = 48x-42
dx^
Fourth derivative:
d^y
= 48
dx^
d^y
= 0 for n> 5
dy’^
rix)
while the third derivative is shown as
f"ix)
Although higher-order derivatives can be represented in this manner,
the notation soon becomes cumbersome. It is more economical to
write
/(")(*)
for the nth derivative of /(x) with respect to x. For example, consider
y = x-^
d y
= 2x
dx^
d^y
= 720x“^
dx^
Exercises 5.10
1 Find the second derivative for the following functions:
( ) f(x)=x5
( i) f(x)=7x^
( ii) f{x) = 5x^ + 3x2 ^ 2x
( v) f{x] = 8x5 _ 9x4 + 7^3 _ 8x2
2 Find the third and fourth derivatives for the following:
( ) X = x‘
( i) y = 2x5 _ 3^4
( ii) y = 0.01x10-0.005x20
3 Find the second derivative for the following:
( ) f|x| = x-'
( i) f(x) = 1 /x2
( ii) f[x) = xO 5
( v) f{x) = (x2 -1- sf
4 Find the second derivatives for the following:
(i) y = ox"* -f bx^ -f cx^ + dx + e
( i) y = —mx + nx "
1 If for X giving /'(x) = 0 it is also the case that /"(x) < 0, then the x
value produces a local maximum of /(x).
2 If for X giving /'(x) = 0 it is also true that f''{x) > 0, then the x
value produces a local minimum of/(x).
3 If for X giving f'(x) = 0 it is also true that /"(x) = 0, further
investigation is needed to determine the nature of the stationary
point.
for which
/'(x) = 50-0.2X
f"(x} = -0.2
y = 500-x^- 20x
-2x- 20 = 0
dx
y = 40x — x^ — 500
— = 40-2x = 0
dx
for which
du
— =3^ +6^- 24 (5.32)
and
d^y
y4= 6x + 6 (5.33)
dx^
3x^ + 6r — 24 = 0
Dividing throughout by 3
+ 2x — S = 0
so
(:x: + 4)(x — 2) = 0
= -18
d y = +18
dx2
Figure 5.7
■ Note that (1) the local maximum here does not produce the highest
value of the function - which is unlimited if x becomes arbitrarily large
and positive, and (2) that the sign of x at a turning point is usually
important in practice. These points will be considered later.
Now consider
X = 5 + 2/ and x = 5 — 2i
The absence of a solution in real numbers means that this cubic has no
turning points. It appears as graphed in Figure 5.8.
Figure 5.8
Exercises 5.11
1 Find the stationary point of the functions below and use the
second derivative test to establish if the point is a maximum or
a minimum;
(i) y=100x-2.5x2
(ii) y = 4x2 - 72x + 100
2 Find the stationary points of the following functions and
establish their character:
) f(x) = 0.5x^ - 2.25x2 - 60x
') f(x)= 1000-2x3+6x2+48x
ii) f(x) = - ] 2x^ + 28x2
, ,, , x-^ 5x^
v) f[x] = ^-— + 4x + 2
1 Using a function for which extrema occur only for realistic values
of the decision variables.
2 Placing specific limits on the domain of the function.
3 Discarding extrema associated with unrealistic values of the
decision variables.
where the —1000 term represents fixed costs and where the first-order
condition for a maximum
dTT
— = 200-2x = 0
dx
When the first derivative is set to zero, the solutions obtained are
X= 1 X = 2 and x = 4
Figure 5.9
The function itself can be used to rank local maxima, which involved
the evaluation of two possibilities in this case. But a function may
produce values greater than that for any stationary value. An obvious
case is where the function increases without limit as x moves without
for which
- 90X + 300
dx
which, when zero, solves for x = 5 and x = 10. The second derivative is
12X-90
Figure 5.10
But the value of x = 5 does not give the global maximum of the
function, and x = 10 does not correspond to the overall minimum of y
The function is unbounded in either direction. Other functions may take
unlimited values at particular points in the domain. This is so for rational
functions where the denominator is zero for certain x values. Consider
1
(5.34)
(x- 10)
for which
’ -1
dx (x - 10)^
This derivative is not zero for finite x. The function has no stationary
value and is graphed in Figure 5.11. It can be seen that y becomes
arbitrarily large and positive as x approaches 10 from above, and
arbitrarily large and negative as x approaches 10 from below.
for which
— 6x
Figure 5.13
(i) X = 0 to X = 2
(ii) X = 0 to X = 4
(iii) X = 0 to X = 12
(iv) X = 0 to X = 30
(v) X = — 10 to X =
First, we need the turning points to locate the local maxima. Thus
d^y 9
^ = 36x^ - 744x + 2760 < 0
dx^
at X = 0 y = 2000
at X = 2 1/ = 1776
x = 0 X = 10 and x=12
Exercises 5.12
1 With the specified turning points, identify the global maximum
for the following:
(i) f(x) = 1000- 3x^ + 56x3 _ 335^2 ^
(turning points at x = 2, x = 4, x = 8)
(ii) f{x] = 250 + 12x - 10.5x2 + 7^3 _ 1.5^4
(turning points at x = 0.5, x = 1, x = 2)
2 Given turning points at x = 3, x = 6 and x = 12, find the global
minimum of f(x) = x'^ - 28x3 _|_ 252x2 _ 864x + 5000.
3 Which of the following functions have finite global maxima?
(i) f(x) = 100 000 + 2500x - 0.0005x2
(ii) fix) = 0.01x2-5000x-10000000
(iii) fjx) =0.01x3 - 1 000000x2
(iv) f(x) = 75 + 20x + 30x2 _ q. 1 x3
(v) f(x)=x~’-x'°
(vi) f(x) = 4000x - x^
4 Find the global maximum of f(x) in each of the following cases:
(i) f(x) = 1500 - x3 + 39x2 - 360x
(with domain restricted to 5 < x < 25)
(ii) f(x) = 1500 - x3 1 33x2 - 360x
--
5.13 In the second derivative test we saw how the value taken by the second
Concavity, derivative allowed local maxima to be distinguished from local minima.
convexity and The test can be regarded as distinguishing those stationary values at
points of inflection which the function’s curvature (we use the term informally) makes it
5.13.1 locally ‘hollow from below’, or concave, from those points where the
Concavity and convexity function is locally ‘hollow from above’, or convex. Local concavity
Figure 5.14
f'{x) = 0 at X =
/'(x) = 0 at X = x*^
Point 2 means that in a function with several terms, if the terms each
produce a concave function the overall function is also concave.
5.13.2 Where a function changes from strict concavity to strict convexity (or
Points of inflection vice versa) over an interval, there will be a point in the interval where
the second derivative is zero. Such a point is an example of a point of
inflection. In Figure 5.14 point d is an inflection point. A distinction is
drawn between inflection points such as d where the function is
changing in value and those points - stationary points of inflection -
where the function is static, and where the first derivative is also zero.
Point d is therefore a non-stationary point of inflection. The usual
way in which a stationary point of inflection is discovered is if in the
search for a maximum or minimum of a function a value of x is found
which produces/'(x) = 0 and for which/"(x) is also zero. However, not
all points where /"(x) = 0 are points of inflection.
Consider some examples. First find the inflection point of
f'ix) = -3x+ 30 = 0
So in this case the function does change sign through x = 10, which
therefore produces a point of inflection. Finally, substitution of x = 10
into f'(x} produces a non-zero result (170), so that x = 10 gives a non¬
stationary point of inflection.
To understand the requirement that f'ix) change sign through the x
value for which f'ix) is zero, recall that on one side of the point of
inflection the function is concave if'ix) < 0), while on the other side it
is convex if'ix) > 0). If the third derivative is non-zero at the candidate
point of inflection, this is sufficient to ensure that the second derivative
changes sign. Such is the case for (5.35), for which
fix) = -3
The sign of/"'(x) does not matter - the important fact is that it is non¬
zero. However, the third derivative could also be zero, and yet the point
could still be a point of inflection. Where f'ix) is easily calculated, this
is a quicker check than working out fix) either side of the point in
question.
Now consider another example. Let
for which
and
fix) = 6x - 120
fix) = 6/0
fix) = x^ (5.37)
that is
x^ — 20x + 100 = 0
which factors as
(x- 10)^ = 0
f"\x) = 6x — 60
= 0 at X = 10
which does not resolve the issue. Trying fix) for one-unit departures
from X = 10, setting x = 9 gives
and, when x = 11
so the second derivative does not change sign as x passes through 10.
It follows that X = 10 does not produce a point of inflection. Note in
this case that
fix) = 0 at X = 10
Using this rule in the case of (5.38) the first non-zero derivative is
= 6. Thus, since the fourth (even) derivative is positive, the value
X = 10 produces a minimum. Using the rule on the quintic (5.37), the
first non-zero derivative at x = 0 is P’ = 120. Since it is the fifth (odd)
derivative that is the first to be non-zero, we conclude that there must
be a point of inflection at x = 0.
Another way of finding the nature of a point where several higher
derivatives are zero is to find out if /'(x) changes sign as x increases
through the point in question. If it does, the point will be a maximum if
the change in sign is from positive to negative, and a minimum for a
negative to positive change. If /'(x) does not change in sign a stationary
point of inflection is identified.
In the microeconomic theory of production and cost, a cubic cost
function is often used for its plausible economic properties. For example,
suppose that daily costs of a car manufacturer are given by
where C(x) is the total cost of producing x cars per day. Marginal cost is
the rate of change of total cost, C'(x), and is given by
The point of inflection of the total cost function (5.39) is the daily
output level producing minimum marginal cost. This is given by
C"(x) = 0.6x - 60 = 0
which solves for x = 100. Beyond this output marginal costs begin to
rise - even though average costs continue to fall for some time. The
example illustrates the fact that to find a point of inflection is to find
the maximum or minimum of the first derivative.
Exercises 5.13
1 Find the point of inflection for the following:
(i) f(x) =1000- 2.5x3 ^ 60x2 ^ 300x
(ii) f(x) = 4x3 _ 48x2 + 25x 4- 100
(iii) f(x) = 0.25x^ — 9x3 + 108x2 -4 20x
(iv) f(x] — 0.15x^ — 0.75x^ -I- x3 -I- 15x + 300
2 Are the points of inflection for the following functions
stationary or non-stationary?
(i) f(x) = x3 - 1 8x2 + 108x + 100
(ii) f(x) = 3x3 _ 27x2 + 1 OOx - 50
(iii) f(x) = x^ - 14x3 + 60x2 _ 50x + 100
LEARNING OUTCOMES
By the end of the chapter you will be familiar with the concept of
partial derivatives and how they’re used to identify stationary points of
functions of several variables. You should also be able to differentiate
polynomials in two or more variables and distinguish between types of
stationary value.
q = f(L. K, M) (6.2)
where: q = product output;
L = labour input;
K = capital applied:
M = materials used.
The form of the function, the way that quantities of input translate
into output, represents the technology of the manufacturer. In this and
later chapters we will consider a number of forms that the production
function might take. For the most part we will concentrate on the
relationship between one dependent variable and two independent
variables. This will illustrate most of the important and relevant
properties of functions of several variables. We will use the notation
z=f{x, y) (6.3)
z = 20 + 18 - 9 - 4 + 30 + 100
= 155
z = 40 —X —0.5i/ (6.4)
Figure 6.1 zk
lie on the plane (though Q would be outside the region shown in Figure
6.1). The point
lies below it. Contours of the surface (lines on the x, y plane above
which the surface is at a constant height) may be obtained by slicing
through the surface for given z in a plane parallel to the horizontal.
The resulting edge is then projected down to the x, y plane to form the
contour. This is illustrated for z = 15 in Figure 6.2. The figure shows
the part of the contour lying in the positive x, y quadrant.
Figure 6.2
z = 40 — X — 0.5y
z= 15
So:
0 = 25 — % — O.Sy
y = 50 — 2x
y = 80 — 2x
y = —2x
z=55-x-2y (6.5)
15 = 55 — X — 2y
so that
X + 2y = 4:0
and this equation and the equation of the contour already obtained
y = 50 — 2x
2x + y = 50
X + 2y = 40
X = 20 and y = 10
Thus the co-ordinate triple of the one point that lies on all three planes
is (20, 10, 15). The straight-line distance between two points in three
space can be found by use of Pythagoras’ theorem.
The concept of a simplex arises in linear programming where the
main method of solution is the simplex method, and in decision theory
where the idea of a simplex is useful in the theory of games. A simplex
may have any number of dimensions. In geometric terms a two-
dimensional simplex is the smallest region without indentations (a convex
figure) that contains three points which do not lie on the same straight
line (are not co-linear). So a two-dimensional simplex is a triangle. A
one-dimensional simplex is the smallest convex region containing two
Figure 63
Exercises 6.2
1 Given the plane defined by z = 200 - 5x - 2y, do the points
below lie on, above or below the plane?
(i) (20,50,0) (ii) (-10,25,210)
(iii) (-20, -10, 330) (iv) (40, 125, -240)
(v) (-80, 500, -440) (vi) (60, -60, 20)
2 Find the equations of the contours of z = 3x -I- 0.5y — 20 for
the following:
(i) z=100 (ii) z =-20 (iii) z = 50
3 Find the point in common between the three planes defined by
the following:
(i) z = 9 - 4x - y
x + y-|-z — 6 = 0
3y -F 4z - X = 17
(ii) 5x — 2y -f z = 140
3x -I- y — 0.5z = 40
0.25x -F y -F z = 30
A function of this form could well arise where a firm makes and sells
two products in amounts x and y, has a quadratic cost function and
faces linear demand equations for the two products. The value of z
would represent profit. Later in the chapter we shall show how (6.6)
might be arrived at from basic demand and cost information, and we
shall also find the best overall profit position. For the time being,
consider the general features of the function as shown in Figure 6.4.
Figure 6.4
The function has a single turning point which is a local and the
global maximum. This peak occurs in the positive quadrant -
important if x and y are physical measures such as output. An
elevation of the function is shown in Figure 6.5. Figure 6.5 shows a
vertical section, profile or ‘elevation’ of the surface when one variable is
set at a specific value. The vertical section corresponding to i/ = 5 is
shown. By substitution into (6.6) the equation of the elevation curve is
^{y=S) ~
All elevations of this function are quadratics with the same general
appearance, but with different intersection points with the axes and
corresponding to different values of z at their highest points. In
microeconomics when the function is a production function (showing
output level as a function of input levels) a vertical section of the function
shows output graphed against variations in the level of one input only.
This is a total product curve or productivity curve. Horizontal
sections through the surface z projected to the x, y plane generate
contours of the surface as in the linear case. Here, the contours resemble
those on an Ordnance Survey map - indeed ‘contour’ is interpreted in
just the same way as on a map. The contours of the function for
z = —300 and z = +500 are shown in Figure 6.6. All contours of (6.6)
have the same general shape. The equation of any one is given by:
Exercises 6.3
1 Given the production function
z = 120x + 50y — 2xy — 4x^ — 0.5y^
find the total product curves for the following:
(i) y = 10 (ii) x = 25
2 Find the equation of contours of
z = 1 Oy — 3xy — 100 + 6x — 9x^ for the following:
(i) z = 900 (ii) z = 0
3 Establish whether the following functions are homogeneous:
(i) z = y^ + 2xw — 3wy + w^
(ii) z = 3x^ + 2y^ + xyw - w^
2
y
(iii) z = wx - xy H--
w
(iv) z = wx + ^ + (x - 1
Figure 6.7
z = 0.6x -F 0.2y
|^ = 0.6=/,
OX
Similarly,
z = 2xy
dz
= 2y =fx
dx
/, = 2(5) = 10 /, = 2(3) = 6
fix, y) = xy^
fx =
fy — x2y = 2xy
In the case of the partial derivative with respect to x, the x part of the
term is linear, with the coefficient y^, which therefore ‘comes down’
into the derivative as would any constant. In differentiating partially
with respect to y, the non-y component, x, comes down into the
derivative unaltered to multiply the derivative of y^ with respect to y.
Now consider
3 2
z = x^y^
dz 2..2
3x y
dx
^2
— = x 3-7 0 3
2y = 2x y
dy
dz 2 2..2
3x y
dz
= 2x + 2xy — 2x y
dy
dz
= 100 — 2y — 4:X (6.7a)
dx
and
dz
= 60 — 2x — 2y {6.7b)
dy
dz dz
= 36 and = 20
dx dy
870 - 836 = 34
z = - [2]x^ + [4/]
~ = 4:x^[3y^] - [2]3x2
= 12x^y^ — 6x2
-=[3x^]5y^ + [4]6y^
= 15xV + 24/y2
The three first-order partial derivatives are found in just the same way
as for functions with two independent variables. The derivatives are
The product, quotient and chain rules can also be used for finding
the partial derivatives of functions of several variables. For example,
given
fix, y) = i2 + 3x + 4i/)2
^ /» = 2(2 + 3x + 4i/)(3)
= 12 + 18x + 24y
fy = 2(2 + 3x + 4^)(4)
= 16 + 24x + 32y
So
fx = 12 -T 18x -T 24i/
fy = 16 + 24x + 32i/
The product rule could also be used here, since the function can be
written as
/x = (2 + 3x + 4y)(3) + (2 + 3x + 4y)(3)
= 12-|-18x-)“ 24y
/, = (2 4- 3x + 4y)(4) + (2 + 3x + 4y)(4)
= 16 + 24x + 32y
then
for which
9xy + 2x^y^
fix, y) =
x3 + y4
then
Exercises 6.4
1 What is the slope of the plane z = 1000 — 2x + 0.5y in the
fundamental directions?
2 (i) Find the first-order partial derivatives of z = — 4xy + /•
(ii) Find the slope of the function in the fundamental directions at:
(a) (10, 2, 22) (b)(-2,7,67)
3 For f(x, y) = x^y^, find the partial derivatives and the slope of
the function in the fundamental directions at the point (5, 4,
1600).
4 Find the first-order partial derivatives of the following functions:
(i) F(x, y) = x^+ xy^+ x^y^
(ii) f(x, y) = 100x99yioi
(iii) f(x, y] = 4x^ + 2y^ - 3x^y^ + 4x^y^
(iv)
(v) q = l°k^ (for a and b as constants)
(vi) z = nx^y^~^ (for m and n as constants)
(vii) z = ^
2 = 3x^1/^ - 2x^ +
Note the two forms of notation. This derivative is found in the same
way as for the first derivative and is the second direct partial
derivative of z with respect to x (‘direct’ will be clarified shortly).
Consider the second direct derivative with respect to y. At the first order
we had
f)7
— =15xV + 24y5 (6.11)
dy
The process by which 60x^y^ was arrived at should be familiar, and the
function notation for this derivative, fxy, is that which common sense
suggests. (6.13) is a second-order cross-partial derivative or mixed
derivative. The somewhat counter-intuitive notation
d^z
dydx
may appear to have the denominator the wrong way round. But when
viewed as the derivative with respect to y of the derivative with respect
to X or:
d dz
dy dx
fy = 15x‘^y'^ + 24i/^
d^z
fyx = eOx^y'^ = (6.14)
dxdy
(f/" + 4)
— 4:xy^
- 7)
to obtain:
d^z
dxdy
and so
^=-12 2^-^
dydx ^ dxdy
We will find the first order partial derivatives, the slopes of the function
in the fundamental directions at the point (2,3, —64) and the second-
order direct and cross-partial derivatives. At the first order
dz 7 9
— = 12x — Sy -h 8xu
dx
and
dx
— = —lOxy + 4x^ — 6y^
dy
=12(2)2-5(3)^+ 8(2)(3) = 51
ox
and the slope in the fundamental y direction at this point will be
52 z
^-24x + 8y
-lOx- I2y
dy^
dh dh
— lOy -h 8x =
dydx dxdy
dh
d^xdy
cl^z
d^xd^y
Exercises 6.5
1 Find the second-order direct derivative, d^zfdx^, for the
following:
(i) z =
(ii) z = 7y'2 - 8x^y^ - 2x2y
(iii) z = 5x^y^ — 6x^y^w^ — xyw
2 For the functions in problem 1 above, find the second-order
direct derivative with respect to y.
3 Find second-order cross-partial derivatives for the functions in
problem 1 above.
d^z
(ii)
dxdy^
d^z
(iii)
dx^dy^
d^z
(iv)
dxdy^
dh
< 0 < 0
dx^ dy^
d^z
<0 (6.r6a)
dx^
and will not consider cases where the second derivatives are zero. But
there is another condition to be fulfilled involving the cross partial
derivatives. This (expressed as a strict inequality) is
= 10 + 2y — 2x = 0
ax
— = —8 -F 2x — 4i/ = 0
dy
d^z dh
-2 and -4
dx^ dy^
while
d^z
dxdy
So the discriminant has the appropriate sign and the stationary point is
a maximum. The location of the point is determined by the first-order
2 ~ 2y = 0
dz
= 1152 2y^ + 2xy — 12x^ 0
dx
dz
^ = —84 -|- 4xw -\-x^ — 24i/^ = 0
dy
d^z
= 2y — 24x < 0
dx^
d^z
= 4x — 48^ < 0
dy^
dh
4:y + 2x
dxdy
and that condition (6.16b) must also be satisfied. In this example, with
the second derivatives being functions of x and y, the conditions must
be evaluated at the point in question. Thus
d^z
-236 < 0
dx^
d^z
-56 < 0
dy^
dh
= +28
dxdy
and, as all conditions are met, the point (10, 2, 3968) is a maximum.
dh
> 0 > 0 (6.18a)
dx^ dy^
and there is the further condition on the discriminant that
dh dh ^ " 2
_ > 0 (6.18b)
dx^ dy^ \dxdy)
dz
= 2 — 0.5y + X = 0
dx
dz
= —3 — 0.5x -f 0.5y = 0
dy
d^z
^=1 ^i = 05 and = -0.5
Bx^ ■ dxdy
Exercises 6.6
1 For the following functions, does the stationary value satisfy
the conditions for a local maximum?
(i) f{x, y] = 20x + 64y -|- 8xy — x^ — 32y^
(ii) f(x, y] = 2x^ +y^ — 16x — 20y — 2xy
(iii) f(x, y) = 5x — 4y + xy — 0.25x^ — 0.5y^
6.7 We have examined maxima and minima but there is another form of
Saddle points stationary point - a possibility that does not exist for functions of one
variable - a saddle point. A saddle point is relative maximum in one
direction and a relative minimum in the orthogonal direction. Saddle
points are important in constrained optimization and the theory of
games. First-order conditions for a saddle point are the same as for
other stationary values:
dz ^ , dz
—= 0 and ^= 0
ox oy
But at the second order a sufficient condition for a saddle point is that
the discriminant is negative; That is,
dh dh fd^Y
dx^ dy^ \dxdy) ^
— 12 + 6y + 3x — 0
and
TT- = —30 + 6x — 6y — 0
dy
Since the direct partial derivatives are of opposite sign, there is no need
to evaluate the discriminant, as (6.18c) must be fulfilled. So the
stationary value of z at (2,-3) is a saddle point. The function is
graphed in Figure 6.9, where it can be seen that the minimum is in the
X direction and the maximum in the y direction.
Now consider
f(x, y) = lOxy + x^ + y^
fx = lOi/ + 2x = 0
fy = lOx + 2y = 0
so
and a saddle point occurs as shown in Figure 6.10. In this case the saddle
is oriented at 45° to the x, y axes. The fact that both direct second-order
partial derivatives are positive means that the function, while zero at the
stationary value, is increasing in both fundamental directions.
Now consider
Figure 6.10
fx = 5y — 0.5x = 0
fy = 5x- 0.5y = 0
Figure 6.11
Exercises 6.7
1 Identify the stationary point of each of the following functions
and confirm that the points are saddle points:
(i) f(x, y) = 8 x - 5y -F 2xy - 2x^ - 0.25y^
(ii) f(x, y) = 15x -F 12y — 3xy — 0.75x^ -F 1.5y^
(iii) f(x, y) = 2x2 ^ o.5y2 -f 8xy - 200
fxjyy - (Ixyf = 0
there can still be a stationary value of/(x, y). Second, in the numerical
examples we concentrated on quadratics and in all cases a stationary
value of some kind existed. But other outcomes are possible. For
example, when z is a polynomial of higher degree than two or is not a
polynomial at all, the number of stationary values may be a finite
number greater than one. Even where z is quadratic there are
possibilities other than one stationary value. Outcomes can be classified
as
Table 6.1
Discriminant
> 0 < 0
dz
— = 12x -I- 6z/ — 18 = 0
ox
dz
— = 3y + 6x = 0
dy
12x + 6^ = 18
6x + 3y = 0
0 = 18
Figure 6.12
then
— = 12x -F 6i/ — 18 = 0
ox
^=3i/ + 6x-9 = 0
dy
y = 3 — 2x
d^z , d^z
=3 and ——=6
dx^ dy^ dxdy
(12)(3) - 6^ = 0
Exercises 6.8
1 For the following functions find out if a stationary value exists.
Where a stationary value does exist, establish its nature.
(i) f{x, y) = 12x2 + 12xy + 3y2 ^ 8x - 1 Oy
(ii) f{x, y) = 60x + 64.5y - 2.5x2 + 3.5y2 0.5xy
(iii) f(x, y) = 1000 — 20x2 _ ^^2 20xy — 50x -h 30y
2 Establish whether there are many or no stationary values for:
(i) f(x, y) = 2.25x2 + 9y2 _ 9xy - 9x + 1 8y + 100
(ii) f(x, y) = 0.5x2 2y2 2xy — 5x — 8y + 100
Constrained optimization
LEARNING OUTCOMES
After reading this chapter you will see what is involved in the solution
of a range of problems and be able to tackle many directly. You will be
able to interpret key parameters and extract valuable management
information from a range of models.
7.1 In practice the values of decision variables must meet certain conditions
Introduction to if they are to be allowable. In manufacturing, production levels should
constrained be positive or zero - sign restrictions apply. Other requirements may
optimization relate to resource bottlenecks, required order sizes and other values set
externally. Internal management decisions such as the allocation of
output or investment capital between plants may need to be observed.
Consider some examples.
A firm may want output levels that maximize profits subject to the
condition that available supplies of materials for the production process
are not exceeded. Or the firm may wish to minimize the cost of
producing an order of a given size - a problem of minimizing cost
subject to an output constraint. In terms of service provision,
management may find that present waiting times for its customers are
unacceptable and may seek an economical rearrangement of its services
to give better expectations of waiting times.
In fact the structure of constrained optimization problems is
characteristic of a wide range of business, economic and financial
problems. There is much more to the subject than we will have space to
consider, and we will focus on representative problems, the properties of
optimal solutions and their interpretation. Insight can be gained from
numerical examples, and in some cases we will use complete
enumeration (essentially a ‘non-method’) to get results, enable
• tangency:
• substitution:
• Lagrange multipliers.
^ = 60 — 2i/ — 2x = 0
ox
and
dz
— = 80 — 2x — 4y = 0
oy
All conditions are fulfilled, since the direct second-order derivatives are
negative and the discriminant condition is satisfied too, since:
(_2)(-4)-(-2)2=4>0
Thus the point with x and y co-ordinates (20, 10) is a maximum, and
by putting these values into z we find the maximum value of z to be
1000. The point (20, 10, 1000) is the one turning point of the
function (there being no other solutions to the first-order conditions)
and is the global, as well as a local, maximum. The point is the
Figure 7.1
2x + 1/ = 40 (7.2)
Figure 7.2
1/ = 40 — 2x
j dz ^ dz .
dz = —dx + —di/ (7.3)
dx dy
In (7.3) the singular treatment of dz, di^ and dx differs from the use
of a term such as dy/dx as a composite symbol representing the value
towards which the ratio Ay/Ax tends. But there are occasions when
the separate interpretation of dx, dy and dz is valuable. In (7.3) dz is
the change in height of the tangent plane to the surface at a point, and
this approximates to the actual change in z. With a contour there can
be no change in z so long as any movement from the original point is
along the contour through the point, so we can write
dz dz
dz = T^dx + —dy = 0 (7.4)
dx dy
Rearrangement gives
dz dz
— dy = -^dx
dy dx
dzjdx
(7.5)
dx dz/dy
— = 60 — 2i/ — 2x
dx
and
dz
— = 80 — 2x — 4y
dy
the slope of the contours at any point is found using (7.5). Thus
and therefore
2x + 6y = 100 (7.6)
2x + 6y = 100
2x + y = 4:0
5y = 60
so
y = i2
X = 14
z = 980
Exercises 7.1
1 Use the tangency approach to find the maximum value of:
subject to X -t- 3y = 55
y = 40 — 2x
z* = 140x -
dz*
— = 140 - lOx = 0
dx
Second-order conditions are satisfied here since
y = 4:1 — 2x
z* = 146x- 82
dz*
— = 146 - lOx = 0
dx
so
35 955 5.80
36 960.8 5.40
37 966.2 5.00
38 971.2 4.60
39 975.8 4.20
40 980 3.80
41 983.8 3.40
42 987.2 3.00
43 990.2 2.60
44 992.8 2.20
45 995 1.80
46 996.8 1.40
47 998.2 1.00
48 999.2 0.60
49 999.8 0.20
50 1000 - 0.20
51 999.8 - 0.60
52 999.2 - 1.00
53 998.2 - 1.40
54 996.8 - 1.80
55 995 - 2.20
Maximize z = xy
subject to X + y — 12
Note that in this case if there was no effective constraint the function
would have no finite maximum. To solve the problem the constraint is
first rearranged to express y in terms of x. This gives
y = 12 — X
z* = x(12 — x) = 12x — x^
from which
dz*
= 12 — 2x = 0
dx
d^z*
-2
dx^
y = 13 — X
z* = x(13 — x) = 13x — x^
dz*
2x = 0
so that
z = 30x -1- 5y
subject to
+ 2iy — 20 = 0
^ = 10 — 2x^
z* = 30x+ 50 - lOx^
30 — 20x = 0
dx
d^z*
-20 < 0
dx^
Maximize z = wx + wy + xy + 6x
subject to 2x + y + w = 16
X + 3y — 2w = 23
w = 16 — 2x — y (7.7)
X + 3^ — 2(16 — 2x — I/) = 23
5x + 5y — 32 = 23
X + y = 11
that is
^ = 11 — X (7.8a)
w= 5 — X (7.8b)
When there is one more variable than there are equations we can
solve for all but one of the unknowns in terms of the remaining
variable - the numeraire - which in this case is x. Now substituting
(7.8a) and (7.8b) into z gives
dz*
— = 6-2x = 0
dx
At the second order the condition for a maximum is fulfilled since
y = 20 — 5x — 2w
for which
dz*
= 4x — 2w = 0
ow
^ = 4w + 20- 10x = 0
ox
and
In applying this principle, note the proviso ‘without altering its essential
character’, the problem-solving art being to find a modification that
achieves this end.
A typical business problem involves more than one constraint. For
example, when looking at an investment plan over the medium term a
company will usually face limited investment funds in each year. The
manufacturer will usually face, at least in the short term, limitations on
capital equipment and floor space, as well as limited availability of some
materials and components. While some multi-constraint problems can
be addressed by the method of substitution, this would not as a rule be
the technique of choice - the Lagrange multiplier method provides a
better framework for multi-constraint problems. Substitution is a useful
technique, with the advantage of being simple, but it has limitations. It
can become impractical or unusable if
Exercises 7.2
1 Find the maximum value of
-2x + y = 100
subject to 0.5x + y = 20
4 Find the maximum value of z where
subject to
x + 10y +5w = 80
-2x + 30y + 2.5w= 1 15
subject to 5x + 6y — 15w = 30
6 For the following problems, find the value at the margin of an
extra unit of the resource represented by the right-hand side:
(i) Maximize f(x, yl = xy
subject to 0.25x + 0.25y = 120
(ii) Maximize f|x, y) = 1 Ox + 0..25xy + 20y
subject to 2.5x + 4y = 100
(iii) Maximize f(x, y) = 100 + 200x - y2
subject to 2x-y = 100
(iv) Maximize fix, yl = 100x2 _ 5pxy + 200y
subject to 2x+y = 600
7.3 In the substitution method the constraint was included within a revised
Lagrange objective function to convert a problem in constrained optimization into
multipliers and a simpler unconstrained problem. The constraint was included in a way
equality that made it implicit in a modified objective function and is asymmetric
constraints in that it singles out one variable for special treatment. For it to be able
2x + ly = 40
40 — 2x — ly = 0
;(40 -2x-y)
is zero L will have the same value as z. To see how the method works,
differentiate L partially with respect to x, y and j, and set the derivatives
to zero to produce a stationary value of L. The first-order conditions are
then
dL
L;t = t;- = 60 — 2i/ — 2x — 2; = 0
ox
dL
Lu = — = 80 — 2x — 4y — j = 0
dy
dL
Li = — = 40 — 2x — y = 0
d]
i = 30 — y — X
50 — X — 3y = 0
2x + 6y = 100
2x + y = 40
5y = 60
so
y = 12, X = 14 and j= 4
Maximize f{x, y)
subject to g(x, y) — 0
should be negative. In E
_dg %
~ dy ■
dy
40 — 2x — y
or as
2x + y — 40
2x + 6y = 100
2x -\-y = 4:1
5y = 59
It follows that
^~ ^^xySxSy T-^yydx ^ ®
From (i)
2.5; = X + 4i; — 80
5; = 2x + 8^ — 160
i.e.
8/y + 2x — 80 = 0
X = 40 - 4zy
so that
5^ = 25
and therefore
y= 5
X = 20 and j = —16
so that E evaluates as
i = 200 — x^
From (iii),
y = 20 — 2x
which reduces to
x^ —25x + 100 = 0
which factors as
(x-5)(x-20) = 0
and
Qx — ~2, Qy = —1
the value of E is
y = 20 — 2x
in which the leading term is the key since x can be arbitrarily large and
positive (no sign requirement on y) or negative (no sign requirement on
x). The graph of z* is shown in Figure 7.4.
Figure 7.4
Hence
dl Of . ^
(i)
dx~ dx ~ °
dL df
(ii) ^ ^ - ;c = 0
oy ay
(iii) b — ax — cy = 0
dy
a dx
, _ 1 ^
^ c dy
The derivative df /dx is the rate at which the objective function rises
as X is increased. The ratio 1/a is the rate of change of x as the
resource level is increased (since the constraint is linear, 1/a is also the
number of units of x from an extra unit of resource, but it is the rate of
change that matters here). For example, if a = 2, then x can be
increased at the rate of ^ a unit per unit of resource transferred. The
product
1 ^
a dx
while
a dx
These rates apply in either direction at the margin, and it may help to
think in terms of a small finite change. If a unit of resource is
withdrawn from use in x, then on this account the value of the objective
function falls by approximately 5. But if this same unit of resource is
now applied to increase y, then on this account the value of the
objective function increases by about 7. The net effect is therefore an
improvement of 2, so the original point could not be optimal.
There is always scope for gains when there is a difference between
marginal rates. For optimality there must be a uniform rate - and j is
that rate. The multiplier can be seen as a mediator between competing
claims on the scarce resource. In any one equation j carries information
about the value of the resource in an alternative use. The effect of j is
to ensure that at the margin the scarce resource is allocated optimally
between possible uses. In other words, the use of resource is maximally
efficient - it is economized. (A common misuse of ‘economize’ is ‘to
minimize the use of’. The true meaning is ‘to make the most efficient
use of’.)
Exercises 7.3
1 Using the Lagrange multiplier method, find the maximum value
of
f(x, y) = xV
Maximize y=f{x)
subject to X > 0
1
fix) =
(1 - x)
7.4.1 Figure 7.5 shows a case where /(x) has a maximum for strictly positive
An interior optimum X. The optimum here is a turning point in the positive quadrant, so the
sign requirement does not affect the optimum. The maximum occurs
when X = x°. Two qualitative properties of this interior optimum should
be noted:
dw
X > 0 and —= 0
dx
Figure 7.5
Figure 7.6
7.4.2 In Figure 7.7 the function has negative slope for all permitted values of
A boundary optimum X. It follows that the maximum of the function in the domain must
occur at the (lower) boundary, i.e. at x = 0.
Figure 7,7
Figure 7.8 shows a case where the optimum is at the boundary, but
where there is also a local maximum at a turning point in the domain.
However, the end-point produces the global maximum given the
domain. Two qualitative properties of the boundary maximum case
should be noted:
X = 0 and ^ < 0
dx
When we have considered the third case we will draw these results
together.
X = 0 and ^= 0
dx
Figure 7.9
The results can now be summarized. In all cases dy/dx is either zero or
negative. The derivative is negative only if x = 0. At the sign-restricted
optimum x is either zero or positive. The positive outcome occurs only if
dy/dx is zero. So in all cases
du
(a) < 0
dx
(b) X > 0 (79)
dy „
(c) X-— = 0
dx
The properties (7.9) are necessary conditions for a sign-restricted
local maximum. Relatively simple problems can be solved using the
following four-step procedure:
Note that in any one problem both steps 2 and 3 can give results
satisfying all conditions. So there may be several candidates for the
overall optimum. In step 4 we distinguish between these points by
enumeration, but higher-order derivatives could be used to eliminate
local minima and points of inflection. We could also confirm the
behaviour of the function for extreme positive values for x. Examination
of the sign of the leading coefficient should provide the assurance
required.
Consider an example:
Maximize y = 20 + 8x — x^
subject to X > 0
First find the derivative as required by step (1). It will also help to set
out the conditions in full for ease of reference. It is required that
du
~=8-2x<0
dx
X > 0
~ = 4:2x-6x^ - 60 < 0
dx
-60 < 0
dr
r y
0 100
2 48
5 75
Figure 7.10
dz
— = -25 + 2.5x
dx
At X = 0, dz/dx = -25, sox = 0isa candidate for a sign-restricted
maximum. When dz/dx = 0, x = 10, so this is also a candidate. We
choose between x = 0 and x = 10 on the basis of the value of z:
X z
0 100
10 -25
X > 0 (7.10)
Consider an example:
X = 7 and x = —7
Figure 7.11
X = — 1, X = 2 and x = 6
X y
0 500
2 708
6 68
»
Maximize /(x, y)
subject to X > 0, y > 0
fx<0
xfx = 0
X> 0
(7.11)
fy<0
yfy = 0
y>0
Since each variable, the derivative of the function with respect to the
variable, or both must be zero, a possible procedure to find solutions to
conditions (7.11) would be to try allowable combinations in a
systematic way. We could, for example, proceed as follows:
Consider an example:
for which
/* = 15-5y-4x /y = -10-5x-8y
First try X = 0 and y = 0. If it turns out that /* < 0 and fy <0, then
this is a candidate for the sign-restricted optimum. When x = 0 and
^ z= 0, /c = 15 and fy = —10. The outcome for fx means that this point
cannot be optimal. The results for this, and the other, combinations are
shown in Table 7.3. The conditions are only fulfilled in case 3, and the
sign-restricted maximum is /(3.75, 0) = 28.125. Note that case 4
corresponds to the unrestricted maximum (for which/(x, y) = 150)
and that the function does not increase without limit for x and y in the
non-negative region. Since neither x nor y can be negative, the term
— 5xy cannot be positive. Neither can the terms —2x^ and —4y^. Since
these three are all the terms of degree two, the sign-restricted optimum
in case 3 must be the global optimum. In general the number of cases
to be examined in this approach is 2", where n is the number of sign-
restricted variables.
Table 7.3
1 X =0 y = 0 fx= 15 fy ~ -10
2 X = 0 fy = 0 fx= 21.25 y = -1.25
3 fx =0 y = 0 X = 3.75 fy — -28.75
4 fx =0 fy = 0 X = 10 y = -5
f{Xl, X2.Xn)
fxi<0
^ifxi — 0
Xi > 0
fxi = 0
is all that is required at the first order. Following the procedure of Table
7.3, if m of the n variables in the problem are sign restricted there will
be 2"^ cases to enumerate.
Consider an example. Find the maximum value of the function
fx<0
xfx = 0
X> 0
fy = 0
Trying x = 0 we have
fx = 0
fy<0
yfy = 0
y>o
Now consider case (iii). The results are shown in Table 7.4. So the
optimum is as for case (ii). In problem (iv) we have already seen that
the unrestricted optimum is at
Table 7.4
X = 0 y = 0 fx = 40 fy ~ -50
X = 0 fy = 0 fx = -10 y = -10
fx = 0 y = 0 X = 20/3 fy = -50/3
f. = o X = -10 y =
-20
w= X — 50
so that
% = vv + 50
dz
= -40 - 4w < 0
dw
w > 0
Figure 7.12
V = 30 — 3C
Clearly
V > 0
Now, since
3C = 30 — V
which simplifies to
^ = -40 - 4v < 0
dv
V> 0
dz dz dv
dx dv dx
but since
V = 30 — X
it follows that
so:
dz dz
dx dv
dz
— > 0
dx:
Now consider the requirement that
= 0
dz
that is
dz
(30 — x)^ = 0
dx
Thus the full conditions in terms of x are
dz
d^-
(30 — = 0
dx
(30 - x) > 0
Maximize z(x)
subject to X < b
it is necessary that
dz
—> 0
dx
dz (7.12)
{b-x)— = 0
dx
(b — x) > 0
Maximize z(x)
subject to X > 0
X < b
Recall that a turning point may occur at the boundary and we are
excluding functions with such properties as a vertical asymptote in the
domain. The requirement on the derivative (non-positive, non-negative
or zero) will vary between outcomes 1, 2 and 3. For a local optimum at
the lower bound
1 Find dz/dx.
2 Try x = 0. If dz/dx < 0, record x = 0 as a possible optimum.
3 Try x = b. If dz/dx > 0, record x = fc as a possible optimum.
4 Solve dz/dx = 0. Roots in the interval 0 <x <b should be recorded
as possible optima.
5 Evaluate z at all points recorded under steps 2-4. The greatest
value of z identifies the optimum.
5x + y = 90
1/ = 90 — 5x
90- 5x > 0
that is
X < 18
dz
-- = 4440 - 222x
dx
Exercises 7.4
1 (i) Maximize z = 24x - 1.5x2 ^ 120
subject to
X > 0
(ii) Maximize z= 100-21x-3.5x2
subject to X > 0
2 (i) Maximize z = 1000 + 120x + 27x2 _ ^3
subject to X > 0
(ii) Maximize z = 8000 - 1440x2 + 128x2 - 3x^
subject to X > 0
3 (i) Minimize z= 1.2x2 _ 12x + 30
subject to X > 0
(ii) Minimize z = 7x2 + 84x + 50
subject to X > 0
4 (i) Minimize z = x2 - 13.5x2 - 30x + 650
subject to X > 0
(ii) Minimize z = 0.75x^- 110x2 + 1500x2
subject to X > 0
5 For
find
(i) Tfie unrestricted maximum.
(ii) Tlie maximum subject to tine constraint 2x + y = 20.
(iii) The maximum subject to the condition 2x + y = 20 and
the sign requirements x > 0 and y > 0.
7.5 Weak inequalities are the most common form of limitation in business,
Lagrange financial and economic applications. Resource constraints are almost
multipliers and always ‘less than or equal to’ rather than ‘strictly equal to’. Sign
inequality requirements and upper bounds can be seen as a form of weak
constraints inequality. For example in a problem involving the non-negative
variables Xi and X2 the sign requirements could be written as
Conditions for optima with variables restricted both from above and
below provide a route into the conditions required for more general
inequality constraints. Consider a problem in a single variable (not sign
restricted) with a single inequality constraint. Suppose that
is to be maximized subject to
4x < 40
for which
= 72 — 6x — 4;
dx
and
^ = 40 — 4x
dj
Figure 7.13
Differentiating,
dF
-=18-6V16
18-6(48)/16 = 0=;
=72-6(12)-4(0) = 0
|^=18-6(40)/16 = 3=;
dL
— =72-6(10)-4(3) = 0
4x + s = b
72 — 6x — 4; = 0
40 — 4x — s = 0
72 — 6x — 4j = 0
b — 4x — s = 0
s; = 0
s > 0, j > 0
in which b is the value of the constraint RHS. Note that the product
term sj makes this a non-linear system, and the sign requirements on s
and i mean that linear inequalities as well as simultaneous equations
are involved. In a problem of this size a solution procedure is as follows:
1 b = 40
2 b = 56
3 b = 48
40 — 4x — 0 = 0
72-6(10)-4; = 0
72 - 6x - 4(0) = 0
40-4(12)-s = 0
56 — 4x — 0 = 0
72 - 6(14) - 4j = 0
72 - 6x - 4(0) = 0
which solves for x = 12. Putting this x value into the second constraint
produces
56-4(12)-s = 0
48 — 4x — 0 = 0
72 - 6(12) - 4; = 0
s = 0, j = 0, X = 12 and F = 1032
The reason that both s and j are zero here is that with the RHS set at
48 there is just sufficient resource to reach the unconstrained optimum.
Note that, although resource increases are of no value, any reduction
in resource would cut into F.
The procedure is summarized as follows. Given the single-variable
problem
Maximize z(x)
subject to ax + c < b
Zx — aj = 0
b — ax — c — s = 0
sj = 0
s >0, j >0
The Lagrangian is
dJ-j
(i) — = 80 - 5x-I-2i/- 4; = 0
ax
(ii) — = 40 — 8i/ -f 2x — 2; = 0
ay
dTj
(iii) — = 80 — 4x — 2z/ > 0
d]
80 — 4x — 2i/ — s = 0
sj = 0
s > 0, i > 0
y = 40 — 2x
80 - 5x + 2(40 - 2x) - 4j = 0
i.e.
160 — 9x — 4j = 0
-280 + 18;c-2; = 0
18x- 2; = 280
9x + 4; = 160
X = 16 and j = 4
5x — 2y = 80
—2x + 8i/ = 40
which solve for x = 20 and y = 10. Substituting these values into the
constraint in equality form produces
80-4(20)-2(10)-s = 0
Exercises 7.5
1 Find the maximum value taken by the function
subject to 2x + 4y<40.
7.6 The ideas of previous sections are now brought together. We will look
Inequality at the maximization of a quadratic function of two variables subject to
constraints and a linear constraint and sign restrictions. A second linear constraint will
sign requirements be added later and a solution procedures will be discussed.
First consider the following problem:
Lx = 8 — 2x — i < 0
Ly = 16 — 2y — 2j < 0
Lj = 15 — X — 2y > 0
xLx — 0, yl^y ~ 0, ]Lj — 0
X > 0, y > 0, j > 0
The inequality form for the derivatives with respect to x and y follows
from the sign restrictions on the variables. The derivative with respect
to the Lagrange multiplier restates the constraint and so is also an
inequality. The zero value for the products of the variables and their
corresponding derivatives follows from the sign restrictions. Finally, the
sign requirements themselves are restated. For solution purposes, the
addition of slack variables makes a systematic search for solutions
somewhat easier. With sign-restricted slack variables t and u
introduced, the conditions can be written as
8 — 2x—;-l-t = 0
16 — 2y — 2] + u = 0
15 — X — 2y — s = 0
tx = 0, uy = 0, s; = 0
X > 0, y > 0, j > 0, t > 0, u > 0, s > 0
Cases (b), (c), (d), (e), (f) and (h) all violate sign requirements: in fact
(b) and (f) are infeasible in terms of the resource constraint. Case (g)
requires 15 to equal 0. Case (b) gives the unconstrained optimum since
; = 0, so the constraint RHS does not restrict achievement of objective.
Moreover, since t and u are also zero, the derivatives with respect to x
and y are the derivatives of F, and their satisfaction as equalities is
necessary for the unconstrained optimum.
The only feasible solution here is the constrained optimum, case (a).
The Lagrange multiplier j is positive (s = 0 so the resource constraint is
satisfied as an equality) and gives the instantaneous rate of change of
the maximum of F as the constraint is relaxed. It is revealing to match
the cases of Table 7.5 with a graph showing the kind of point to which
the case corresponds. Figure 7.14 shows the location of the points as
each would be in a problem in which they were feasible. Also shown
are the contours that would result in the point being optimal (only one
such set of contours applies in any particular problem).
Case (a) at the tangency point marked as T1 shows the optimum in
the current problem. Case (b) shows how an interior optimum
(corresponding to an unconstrained maximum) would appear. As
shown by the negative value of s, in the current problem (b) lies outside
the feasible region above and to the right of Tl. Case (c) is a corner-
point solution since the resource constraint and the sign requirement
on y are satisfied as equations. So the corresponding straight lines
intersect and result in a corner of the feasible region. Corner-point
297
Constrained optimization
Figure 7.14
7.7 Objective functions are not always quadratic and constraints will not
The Kuhn-Tucker always be linear. But if the objective function and the left-hand side
conditions and (LHS) of the constraints are differentiable and certain regularity
mathematical conditions are satisfied, then the conditions developed below will apply.
programming The constraints should not produce singularities such as an outward¬
pointing cusp at the optimum. Constraint qualification conditions
could be introduced to provide against such situations. We will present
optimality conditions in a two-variable two-constraint model, but the
conditions generalize to larger problems. Examples will be confined to
quadratic objective functions and linear constraints.
The two-variable two-constraint problem can be expressed as
299
Constrained optimization
At the first order
dL dF . dgi . %2 . ^
—72 — < 0
dxi dxi dxi dxi
dL dF , dgi . dgj
r\ 7l r\ 72 r\ <0
dX2 dX2 dX2 dX2
dL dL ^
Xi — +X2^ = 0
uXi UXj
Xi > 0, X2 > 0
(7.14)
dJj
^ = h -giixi, X2) > 0
571
then the Kuhn-Tucker conditions are both necessary and sufficient for a
global optimum. This means that ‘all’ that is required is a feasible
solution to the system (7.14). Fortunately, in economic and business
models it is often acceptable to assume 1 and 2 above. Such problems
are called convex programming problems.
The Kuhn-Tucker conditions are necessary for a maximum of the
original function F subject to the constraints, but this does not
correspond to a maximum of the Lagrangian function in all
Maximize z{x)
subject to X > 0 and x < b
The Lagrangian is
L = z(x) +i(b - x)
dz . ^
L,=^-j<0
ax
xLx = 0
X > 0
Lj = b-x>0
jLj = 0
j>0
Note that the product of x and Lx is required to be zero, and not x and
dz/dx.
Consider a numerical example:
Maximize F = x^ — 10x-|-50
subject to X < 8
X > 0
301
Constrained optimization
With slack variables introduced in complementary slackness form it is
required that
2x — 10 —; + t = 0
:>jt = 0
X > 0, t > 0
8 — X —s= 0
js = 0
; > 0, s > 0
Table 7.6
zero group solution values (non-zero) objective
00
SO
(a) t s F= 34
II
II
(b) ti X= 5 s=3 F = 25
(c) X s no solution
(d) xj t = 10 s = 8 F= 50
Figure 7.15
F^ = 10 - 0.2x = 0
Fy = 20- OAy = 0
so
Figure 7.16
10 — 0.2x — 5] — k + t = 0
20 - OAy - j - 2k + u = 0
200 — 5x — i/ — s = 0
90-x-2y-r = 0
tx = 0, uy = 0, sj = 0, rk = 0
X, t, u, y, s,;, r, /c > 0
X = 30, y = 30, k = 4, s = 20
t = 0, u= 0, r = 0, j = 0
The Lagrangian is
120 - 2x - j - 2k + t = 0
200 -4:y-2j-k + u = 0
130 — X — 2y — s = 0
152-2x-i/-r = 0
tx + + }S + kr = 0
X, y, t, u, j, k, r, s > 0
X t y u j s k r
(a) t u s r 58 36 36 -16
(b) tusk 50 40 20 12
(c) t u j r 52 48 -18 8
(d) t uj k 60 50 -30 -18
(e) tysr no solution
(0 ty s k 130 -480 -140 -108
(g) tyjr 76 -216 54 -16
(h) tyjk 60 -200 70 32
(i) X u s r no solution
(j) X u s k -150 65 -30 87
(k) X uj r -1036 152 -174 -408
(1) X uj k -120 50 30 102
(m) X y s r no solution
(n) X y s k no solution
(o) X yj r no solution
(P) X yj k -120 -200 130 152
Figure 7.18
Exercises 7.7
1 For the following problem:
2x2 + y < 20
X > 0, y > 0
where x and y are the quantities produced, and p{x) and p{y) are the
prices. The products have independent demand functions, as shown
in (7.15) (no x term in the p{y) equation, no y term in p{x}). We will
assume a linear technology, although a quadratic programming (QP)
approach requires only the weaker condition that the products are
technically independent {y does not appear in the production
function for x and vice versa) and give quadratic or linear cost
functions. The link between production levels is through common
consumption of scarce resources.
Suppose two resources have limited supply. Table 7.8 gives
availabilities, prices and per unit requirements.
From the data in the table the unit cost of production for each
product can be found. Each unit of x requires 5 units of resource one
@ £3 per unit and 10 units of resource two @ £2 per unit. Thus
So for any x and y the production costs for the two products are
The demand conditions imply that revenues from the products, R{x)
and R{y), will be
R(x) = px = 75x — x^
Unit profits from the two products, 7r(x) and n{y), will be
F = 7r(:>:) + 7r(i/)
The Lagrangian is
40 — 23c — 5; — 10k + t = 0
60 — 4y — 4; — k + u = 0
120 — 5x — 4i^ — s = 0
140 — lOx — y — r = 0
tx = 0, uy = 0, sj = 0, rk = 0
X, t, u, y, s, j, r, k > 0
The following values meet the Kuhn-Tucker conditions and so give the
constrained optimum
88 100
— ^ 12.57, y = 14.29
7
192 632
-Ri 0.39, k = « 1.29
490 490
t = 0, u = 0, s = 0, r = 0
Each resource is fully used and the optimal solution is the extreme
point given by the two resource constraints as equalities. The optimal
value of the objective function is
38 896
F= 793.80
49
632
5 = 6.45 < 7
4^
But a proposal to secure the extra resources for £15.50 (a total
premium of £5.50) could neither be rejected nor be accepted on the
basis of the value of k. The actual changes in F following unit increases
in one resource alone are as follows:
279
resource one —>^121 AF = 0.2278
1225
1514
resource two —>141 AF = 1.2359
1225
83.2
0.17
4^
8 Integration
LEARNING OUTCOMES
By the end of the chapter you will be able to find the indefinite integral
for polynomial, selected rational and other functions. You will know
how to evaluate the definite integral in these cases, how to find areas
under curves and you will be able to solve some simpler types of
differential equation.
Figure 8.1
y= — 50
The expression
x^ +k
y 3x^dx = x^ + k (8.2)
■ /
was chosen by Leibniz as the symbol for integration. It is an archaic ‘S’,
the initial letter of ‘summa’ and reflecting the area under the curve
aspect of integration (the area can be found as the limit of a process of
summation). In (8.2) the symbol ‘dx’ means ‘with respect to x’ (as in
integration 313
differentiation), so the whole left-hand side of (8.2) reads ‘The integral
of 3x^ with respect to x’. The expression to be integrated, here Sx^, is
called the integrand. Using Leibniz’s notation (for both the derivative
and the integral), the steps can be summarized as follows. Starting with
dy = 3x^dx
y — J <iy = J 3x^dx = +k
Exercises 8.1
1 Which of the following ore antiderivatives of 4x^?
(i) x^+100 (ii) x^-TT (iii) x^-O.Olx
|iv|(x2-1)" |v| |x2-1)(x2 + 1|
2 Which of the following are antiderivatives of x^ — x?
J 7dx = 7x + k
/ x”dx
n+ 1
+k (8.4)
where n is any real number except —I. The rule can be confirmed by
differentiation. We can get a long way with this straightforward rule, so
we now give several examples. Consider the integral of the cubing
function. Here, n = 3 and the power rule gives
(i) j x^dx =
This illustrates the power rule when u is a positive integer. The rule
also applies if n is any negative integer other than —1. For example,
f X~^
(ii) / x“^dx = —
In this example the power rule is applied directly, but note that what
we have shown is that
(iii) J x^dx = + k
Again the rule is applied directly, inserting ^ for n. Note that this
example shows that
/ -y/x dx
3
+k
It is far easier to work out the integral with a rational exponent than to
try to operate with radical signs. The power rule applies to irrational
exponents. Suppose n = \/2:
r /
(iv) / x’^^dx = -h k
J V2 + 1
The power rule is a simple, important and frequently used rule for
finding indefinite integrals.
Integration 315
8.2.3 This rule is equivalent in integration to the sum-difference rule for
The sum-difference rule differentiation, and is just as easily applied. The sum-difference rule for
integration can be stated as
The rule reads as follows: the integral of the sum of, or difference
between, functions is the sum of (or difference between) the integrals.
As an example, consider
= x'^ + ki + x^ + k2
= x'^ + x^ + k
The power rule is applied to individual terms and only one constant
of integration is needed, as the sum of or difference between two
arbitrary constants {ki and kj) is an arbitrary constant {k). The sum-
difference rule greatly expands the range of the power and constant
function rules, and applies regardless of the number of terms. An
important consequence of the rule is that polynomials can be integrated
term by term. For example.
/(5x‘-2x’ + 9x^-14x+13)dx
8.2.4 This rule states that the integral of a constant multiple of a function is
The constant multiple that multiple of the integral of the function. That is.
rule
J cf(x)dx (8.6)
/(45x^ + 60x3)dx
|iii| j ^dx
(iv) f x^/'^dx
(v) f x°^dx
(vi) /x-o^dx
Iviil f±dx
(viii) Jx'^^dx
(ix) f x^^dx
3 Integrate the following expressions with respect to x:
(i) 4x^ + 5x^
(ii) 6x^ - 2.5x'^ + x^
(iii) 20x^ - 3x“'^ + 4x“^
(iv) 5xi^+x-’i
(v) ■K^X^~^ + {27T + ])x'^'^ + k
4 Find the following:
(i) J12x^dx
(ii) /(9250x^ - 125x9)dx
5 Find the following:
(i) / (ox^ + bx + c)dx
(ii) f mx^dx
(iii) /[(n + l)x" - (m - l)x'”“^]dx
(iv) /(3f2 + 2t)df
(v) -3 f x~‘^dx
8.3 Marginal cost, MC, is the rate of change of total costs as output is
Application to the varied, so MC(x) is the first derivative of cost with respect to output
marginal analysis level X. Either total variable costs, VC(x), or the total including fixed
of the firm costs, TC(x), can be used. The relationship is
dTC(x)
^ = MCW
dx
Using total variable costs, the indefinite integral of the expression for
marginal cost will be total variable costs plus the constant of integration:
I MC(x)dx = VC(x) + k
Integration 317
A value can be chosen for the constant of integration to correspond to
the level of fixed costs. Suppose that a firm finds that its marginal costs
are given by
MC(x) = 4x + 20
(8.7)
Now use one observation for total cost to find the value of k relevant
to the firm. If at an output level of x = 100 total costs are £23 000,
then k must take a value such that
So
k= 1000
J MR(x)dx = R(x) + k
MR(x) = 100 - 6x
= lOOx- 3x^
Since total revenue is the product of price, p, and volume of sales, x, the
equation of the demand curve for the product can be deduced:
R = px = lOOx — 3x^
p = — = 100 — 3x (8.8)
= 320x — 2x^
Dividing by x gives
price = 320 — 2x
= J (140 + 2x)dx
= 140x + x^ -I- k
and since TC(x) = 2000 when x = 10, the level of fixed costs, k, must
satisfy
so that
k= 500
Thus
Integration 319
Maximizing 7r(x) with respect to x:
— = 180-6x = 0
ax
p= 320-2(30) = 260
Exercises 8.3
1 A firm's marginal revenue function is
MC(q) = 210+ 5q
MR(q) = 480- lOq
TC(q) = 2780 when q=10
where
• general considerations:
• first-order linear differential equations with constant coefficients;
• higher-order differential equations (special cases):
• a first-order, second-degree example.
fix) - ^
dx
= 0
So the equation
6x^ -^ + 10 = 0 (8.9)
dx
/ dx ’ dx^ ’
(8.10)
For example.
(8.11)
Integration 321
differential equation is the power to which the highest-order derivative
is raised (after any rationalization to remove fractional powers). So
(8.11) is first degree despite the fact that the second derivative is
squared. The degree is given by the term involving the third derivative,
and this is raised only to the first power. By contrast,
7x^ + 2x + = 0
dy d^y d^y
dv’ dx2’ dx^’
but note that these are not themselves raised to a power other than
one.
The equation
5x'
Ay , -
— + 7x
dx
2
7x^ + 3
= 6^ + 10
dx
Once this rearrangement is carried out it is clear that the solution can
be obtained by taking the integral. We can write
y = 2x^ + lOx + 15
1.5x^ — 4x^ — 8x + ^ + 10 = 0
dx
given
y = 300 when x = 4
When X = 4, y = 300, so
so that
k=52
given that
y= 8 when x = 2
rearrangement gives
So
» = /(8.^+6.^-20x- 50)dx
when X = 2, y = 20
when x= 3, y = 44
= 12x — 8
dx^
fd^y , du
/ —rdx = — +constant
J dx^ dx
^=6x2-8x + ki (8.13)
dx
y = J (6x^ - 8x + ki)dx
so that
which is the general solution. The conditions can now be used to find
the particular solution. These give two simultaneous linear equations in
ki and ka- Substitution into (8.14) when x = 3 produces
44 = 54 - 36 + 3ka + ka
and when x = 2,
Integration 325
Simplification and rearrangement results in the simultaneous equations
3h+k2 = 26
2ki -j-kz = 20
ki = 6
(8.15) could have been obtained with data for just one value of x,
provided that in addition to y the value of dy/dx at this point was also
given. For example, in the present case the information
dy
y = 20 when x = 2, and — = 14
dx
d^y
12x 1.5 = 0
dx^
where when x = 1,
d y
= -1.5
dx^
^ = 0.25
dx
y = 22.25
Rearrangement gives
d^i/
12x- 1.5
dx^
—r = 6x^ — 1.5x + ki
dx^
d^y
= 6x — 1.5x — 6
dx^
^ = 2x^ — 0.75x^ — 6x + k2
ax
= — 56x -I-196
I') =4(.-7)^
^ = ±2(x- 7)
dx
(i) y = x^ — 14x -h k
(ii) y = 14x — x^ + k
are both general solutions. Context may suggest which of (i) and (ii)
best fits the application. For example, if y is total cost of production and
if X is output level, with typical values around 50 (say), then (i) is
appropriate. Variable costs (14x - x^) would unrealistically be given as
negative by (ii) for values of x over 14. On the assumption that (i) is
most suitable, then given the condition that
y = 2000 when x = 45
Integration 327
the constant of integration (representing fixed costs) emerges from (i) as
k = 325. So the appropriate particular solution is
y = — 14x + 325
Exercises 8.4
1 Give the degree and order of the following differential
equations: 3 % 2
10x3 -3x , /d y = 0
^dx^
i) 4^-7 = 0
' dx2 dx
ii) 20x
^dx^y \^dx^
VI ) 27x3
dV^dV 10
dy
0
dx^ ' dx^ ^ ' dx
2 Find the general solutions to the following differential
equations:
dy 8x3 ^ 9^2 ] Q
I 3;
dy
i) -2x3-4.5x27 0
dx
(i) 10 + 2x -3 3x-2 = 0
dx
Boundary condition: y = 35 when x = 0.2
(ii) 10.5x6-0.5^-10x-6 X + 16
dx
Boundary condition: y = 0 when x = 1
Find particular solutions to the following differential equations:
(i) 10-^-30x = 0
dx^
Boundary conditions: y = 0 when x = 3
y = —210 when x = 5
(ii) 6x-0.25^ = 3
dx^
Given that when x = 6, y = 672 and dy/dx = 362.
Find the particular solution to the following third-order
equation:
6 0.1 dV
- 0
dx^
dV
70
dx2
^ = 60.
dx
y = 50
Integration 329
Consider an example. It is possible to work out the indefinite integral
J — 7)^2xdx
by expanding the bracket and using the sum-difference rule. But the
solution is easier if an appropriate substitution — a change of variable —
is made. Suppose we let
u = x^ — 7
Ju^Ixdx
^ = 2x (8.16)
dx
Interpreting du/dx in (8.16) as a ratio, we can write
da = 2xdx (8.17)
(8.18)
/(.^+2>:^4)(.+ l)d*
a = x^ + 2x — 4
u(x + l)dx
du
= 2x + 2
dx
we can write
du = (2x + 2)dx
/ -.xj du
(x + Ijdx = —
(x^ + 2x — 4)^ ,
-i-
The integrand is simple enough for the result to be checked by
expansion and integration of the original expression in terms of x.
We now consider two examples in which the method of substitution
comes into its own. Find the indefinite integral
J 0.5x(x^ -F 10)^dx
Let
u = x^ -F 10
so that
du
du = 2xdx or 0.5xdx =
uV2
+k
(x2 + 10)^/^
Integration 331
For the second example, consider
/ (xO-^ - 1)^
^0.8
dx
u = (x°'2 - 1)
du = 0.2x“° ^dx
so that
x~°'®dx = 5du
km^
dx
(8.19)
Exercises 8.5
1 Use integration by substitution to find the following integrals:
(i) /(10 + x2)22xdx
(ii) J (2x^ + 6x -f 15)^ (2x + 3)dx
r d . d
dx = n— 1
+k
[bx + c)" b(l — n)(bx + c)
(i) ^ +^ dx
(x2 + 7x - 8)^
6x^ + 4x + 2
(ii) -dx
(x^ + x2 + X + 1)"
px = b
/ /(x)dx
Jx = a
for which
rx = h
for X = b for x = a
f(x)dx = (F(x) -I- k) - (F(x) + k) (8.20)
/
Jx = a
where F(x) + k is the indefinite integral a is the lower and b the upper
limit of integration. The constant of integration, k, cancels as the
Integration 333
integral is evaluated. Consider the definite integral oif{x) — 2x from
X = 2 to sc = 5:
Figure 8.2
= (0.25(81) - 3) - (0.25(16) - 2)
= 17.25 - 2
= 15.25
px = ‘i
(ii) / (6x^ — 4x + 4)dx = (2x^ — 2x^ + 4x)‘'‘^^
Jx=l
= 0.125(32 768)
= 4096
l'x=5
= -50-78
= -128
Figure 8.3
l'X = 4 i'X = 4
(v) / 3(-\/x)dx = / 3x^dx
Jx=l Jx=l
= 16-2
= 14
px=U
/(x)dx
Jx — a
/(x)dx
Integration 335
exists and its value is equal to D. For example
(Vi)
= 50
X = 40
Figure 8.4
R = (5-2)/(2) = 12
r = i(5-2)[/(5)-/(2)]
= 1.5(10-4)
= 9
The area between the line and the x axis above the interval is
Area = R + r= 12 + 9 = 21
The fact that, whatever the limits of integration, the definite integral is
the area under the curve can be confirmed in the case off{x) = 2x.
With limits of integration a and b the definite integral is
= b^-a^
T = l{b-a)\f{b)-M]
= j{b — a){2b — 2a)
= (b- a)^
= b^ — 2ba +
Area = R + T
= 2ba — 2a^ +b^ — 2ba +
= b^-a^
Integration 337
The next rule is also familiar from rules for indefinite integrals;
(ii) [
Jx=a
\f{x) ±g{x)]dx = f
Jx=a
f{x)dx± f
Jx=a
g{x)dx (8.22)
i.e.
12 = 4 + 8
f fix) = 0
In addition to use with improper integrals, we will see two further ways
in which this rule is useful. But first consider two examples of
integrands that could be evaluated without the rule. Given the
requirement to find
.x=2
2xdx
Jx=\
fx=l
which is
,.2\x= 3
ix^)^ = ^ - (x^)^=^ = 9-1 = 8
This outcome could be obtained by selecting (say) c = 2 and evaluating
the definite integral in two parts as follows:
rx = 2 px = 2
/ 2xdx+ / 2xdx
Jx = 1 Jx = 2
2\x = 2 r^2\x=l
+ [(^2)* = ^ 2\j: = 2i
(X^)
= (x^)
= 3+5
= 8
The rule comes into its own in two types of problem, examples of
which follow. The first case is piece-wise defined functions. The value of c
would coincide with the point where the definition of the function
changes, as in
Jx = 2
where
r/ N j 2x + 5 for X <6
\ 5x — 10 for X > 6
Figure 8.5 shows the graph of the function and the area under the
curve corresponding to the definite integral. The break is set at c = 6.
Excluding the constant of integration, the indefinite integral will be
^ f x^ + 5x for X < 6
- (2.5x2 _ 20xf=^
= 66-24 + 80- 30
= 92
»If the graph of the integrand crosses the x axis between the limits of
integration (8.24) is again useful. Consider
Integration 339
Figure 8.5
This result is correct and may be the relevant value (for example when
the surpluses and deficits of business units are to be aggregated). But as
is clear from Figure 8.6, there is some space between the integrand and
the X axis.
Figure 8.6
= -12.5-(-12)
= (-)0.5
= -12 - ( - 12.5)
= ( + )0.5
The absolute values of the two areas are added to give the total area
enclosed between the function and the x axis. This is one unit. A further
example illustrates the fact that (8.24) can be extended to involve as
many subdivisions of the range of integration as are needed. We shall find
i'X = 16
/ (3x^ — 36jc + 60)dx
Jx = Q
If one or both roots of the quadratic fall between the limits of
integration, subdivision of the interval will be required. The roots are
X = 2 and x = 10, and the absolute value of the area is found by
dividing the range of integration into the sections
So
56-0
56
|•x = 10
(x2 - 18x2 + 60x)^ = 20 _ (^3 _ 50x)^ = 2
Jx = 2
-200 - 56
-256
x= 16
I = 10
(x^ - 18x2 + 60x)*=26 _ (x^ _ 18x2 + 60x)^=^°
448 - ( - 200)
648
So the total area between the curve and the x axis regardless of sign is
If sign had been taken into account, the result would have been
'^x=l6
rX = J
Integration 341
Where it is required to find the area enclosed between two curves,
rather than the area between one curve and the x axis, the area can be
obtained with the use of definite integrals if the functions concerned are
non-negative over the relevant interval. If the two curves are those of
y = fix) and y = gix), and the limits between which the area is required
are a and b, then the definite integral needed is
fx = h
/ I \fix) - g(x)]d X
provided that f{x) > gix) between a and b. If it is more convenient, the
integral could be expressed as
rx^b px = b
0(x)dx (8.25)
/
Jx = a
fix)dx
/
Jx = a
■
and
gix) = 80 -(- 8x
Figure 8.7
Apart from the steepness of the curves, the problem graphs as shown in
Figure 8.7. We assume that the entire region is required, so the first
task is to find the points of intersection of the curves. These points give
the limits of integration, a and b. Begin by equating the values of the
functions:
fix) = gix)
I25 + 20x-x^ = 80 + 8x
- 12X-45 = 0
which factors as
a = —3 and b = +15
Exercises 8.6
Evaluate the following definite integrals:
(i)
20
(0.9x^ + X + 10)dx
0
(iv)
Integration 343
3 Find the area between the x axis and the following curves
between the given limits:
(i) f(x) = 3x2 + 2x + 5
between the limits x = 0 and x = 10.
(ii) f(x) = x^ + 1 Ox
between the limits x = — 10 and x = —2.
(iii) f(x) = 1.5x2+ 12x +30
between the limits x = —5 and x = 5.
4 Find the total area under the curves given by (i) and (ii) below,
between the given limits and regardless of sign:
(i) f(x) = X - 10
between x = 5 and x = 15.
(ii) f(x) = 3x2 _ 1 8x + 15
between x = 0 and x = 8.
5 Evaluate the following:
^40
(i) / f(x)dx
Jo
where
fi^\ = / 20 for X < 20
\X for X > 20
(ii) ^ ^(^)dx
where
f 4x — 20 for X < 10
\ X + 10 for X > 10
In each step a trapezium is formed, the base of which is the step width;
the parallel sides are the ordinates at the beginning and end-points of
the section, while the fourth side is the straight-line segment
connecting the tops of the ordinates. The area of the trapezium
>-
a w w w X
approximates to the area under the curve in each section, the accuracy
of the approximation in each step being better the smaller is the step
size. With f{x) being the value of the function (the height of the
ordinate) at any point, the areas of the trapezia are
Ti =^{fia)+f(a-\-w)]
w
^2 = ^ [/■(« + w) +f{a + 2w)]
w.
T3 = ^ [/”(« + 2w) +f{a + 3w)]
So the total area of the trapezia and the approximation to the value of
the definite integral is T, where
Trapezium rule
w
/ fix)dx ^ — \f{a) + 2f{a + w) + 2f{a + 2w) + ...+/(« + mw)]
Jn ^
where m is the number of steps. Let us use the rule to approximate the
area under the curve y = x^ between x = 1 and x = 4. We will
arbitrarily divide the interval into six steps, so the step size, w, will be
w=
Integration 345
Using the rule,
Here we can find the exact value of the integral by analytical means:
So in this case the trapezium rule is out by just under 0.6%. Given the
shape of ^ = ^2, since the area of each trapezium is slightly greater
than the true area under the curve in its section, the result must be an
overall overestimate. With other functions, the result may be either an
overestimate or an underestimate. Accuracy increases as the step size is
reduced so long as the function has a continuous second derivative, and
up to the point where rounding errors overtake the theoretical
improvement from reduced step size.
Table 8.1 gives the results of the trapezium rule for the current
example and for a range of step sizes. The increase in accuracy as the
number of subdivisions increases is evident. The error is less than 1%
for 5 steps, and less than 0.1% if 15 steps had been used. But in most
cases it will not be possible to calculate the precise percentage error.
Now use the trapezium rule with five subdivisions to estimate the
definite integral
With
w = (b — a)/5 = 1
The exact value of the integral can again be obtained in this case:
Simpson’s rule
where w is defined as for the trapezium rule and where the number of
steps, m, must be even.
We will now use Simpson’s rule to approximate the integral
'‘X= 3
x^dx
i:
lx = 0
So the error in this case is 0.025, or just over 0.05% of the true figure
- and this for a division into only six sections.
Integration 347
Now estimate the area under the same curve using the trapezium
rule. This is
an error of just over 4.6%, given the chosen number of divisions of the
interval.
Usually Simpson’s rule gives better results than the trapezium rule,
though this is not true in every case.
Exercises 8.7
1 Using m = 6 steps, estimate the value of the definite integral
(6x2 ^ 2)dx
0.5x2 + l)dx
8.8 The purpose of this chapter was to introduce the concept of an integral,
Concluding and to enable polynomial and rational expressions to be integrated
remarks using simple rules. More advanced devices for integration include the
following:
Integration 349
CHART E R
LEARNING OUTCOMES
By the end of the chapter you will be able to use calculus on simple
exponential and logarithmic functions. You will gain useful insight into
business, financial and economic concepts, and be aware of important
properties of models based on exponential and logarithmic functions.
y = a^ (9.1)
y = 2^
0 X
• positive everywhere:
• always increasing:
• increasing at an increasing rate.
Such curves are growth curves and are used to model expansion and
increase - for example of populations, atmospheric CO2 or outstanding
debt at a compound rate. As one example, for a developed economy
an arguable sustainable annual rate of growth in real gross domestic
product (GDP) is around 2.5%. Thus
GDP(1993) = GDP(1992)(1.025)
or, with reference to an initial year zero, GDP in some year, t, would be
given by
In (9.2) the base a = 1.025. The base of the exponential function is one
plus the rate of growth as a decimal. If in general the rate of growth
had been 100r%, then the relationship would have been
GDPt = GDPo(l + r)
Figure 9.2
The graph is the reflection in the y axis of the graph of 2^^ and it
typifies exponential functions with a base less than one. That is,
0 < a < 1. Properties of the graph are that it is:
• positive everywhere;
• always decreasing;
• decreasing at a decreasing rate.
Exponential functions with base less than one produce decay curves
and are used to model processes of contraction. It is a characteristic of
decay curves that, if x represents time, the time required to reduce y to
half its value is the same regardless of the initial value of y. Note also
that the process of exponential decay could just as well be modelled
using the form of function
y=
(if = i/ = 2-'
GDPt = GDPo(0.985)‘
= Vo(0.8)‘
where Vo is the original value of the car. So if the car cost £7500
initially, then after four years its value would be V4, where
Table 9.3
Principal One year Two years Three years
1 1 +0.1(1) 1.1 + 0.1(1.1) 1.21+0.1(1.21)
= 1.1 = 1.1(1 +0.1) = 1.21(1+0.1)
= (1 +0.1)2 = (1 + 0.1)^
= 1.21 = 1.331
If £75 had been invested for 8 years at 12% the result would have been
X T X
(l+-)^ = l-l-x + ->l-l-x
(1+0.12/2)^ = 1.1236
For example, £850 borrowed initially at a nominal 12% but with daily
compounding would, by the end of the year, have grown to
Table 9.4 0 1 2 10
n 5 25 100
(1 + l/«)" 1 2 2.25 2.48832 2.59374 2.66584 2.70481
n 500 1000 10 000 100 000 500 000 1 000 000
(1 + 1/n)" 2.71557 2.71692 2.71815 2.71827 2.71828 2.71828
£g2(0.12) _ £1.2712
PV(£) = Pc'"'
So, for example, a principal of £500 deposited for two years at 12%
continuously compounded would produce a future value of
Exercises 9.1
1 In a certain country the level of GDP in 1998 was £350bn.
Growth of GDP is at the rote of 3% per annum.
(i) Express this relationship as an exponential function.
(ii) Assuming that the relationship remains valid, use the
function to state GDP in (a) 2000 and (b) 2004.
Table 9.5
X e* Comment
5cm 1.48 metres Shoulder height
10cm 220 metres Height of respectable skyscraper
25cm 720 000 km Well beyond the orbit of the moon
50cm > 5000 light years 1 /6 of the way to the galactic rim
65cm > 15 bn light years To the edge of the universe
y = Ae"’* (9.7)
mx
= me (9.9)
dx
d{e’‘e^) 2x
= = 2e
ds:
Note that the exponent will not alter as higher derivatives are taken,
but the coefficient increases by a factor of m each time. Thus for the
nth derivative
^ ^ ^n^mx (9.10)
dx"
fix) =
the derivative is
f'(x) =
f"(x) =
m = e-"'
f'(x) = -2e-^‘
f"(x) = 4e-^^
fix) =
f'ix) =
fix) =
f'ix) = V2e^^'^
fix) =
fix) =
6x
90e'
dx
In general, a function may contain terms involving both x” and e^. The
simplest case of such a combination is
y = xe*
^ = xe" + g*
dx
= e’'(x + 1)
And given
S 2x
y = xe
dx
y = e^*(2x + x^)^
may be obtained by combined use of the product rule and the chain rule:
= 66e^
(*=i)
(9.11)
(9.12)
y = gmx + c
for which, since g'{x) = m, from (9.12) the derivative is
g(x) =
Now consider
y = e-,x^ + 7x + 6
^ = (2x+7)e"' + ^’'+^
dx
y = {x^ - 5x^)6"'
and, as usual, the slope of the function at any point can be found by
evaluating the derivative. So at x = 1 the slope will be
(vi) y = ?e°^^^
/ e'^^dx = — + k
m
e^^dx = ^ + k
/ 6e^’^dx = ^ + k
_p-2x
e~^^dx = —--h k
I e^^dx = — +k
TT
/aWWdx = ^W + )c
So, for example.
and
and
Jo,5xe^*"dx = 0.125e2*‘ + k
„2x 7,\x=l
I
r ^ e^^dx = (O.Se^" + kfi^^ - (0.5e^" + kfi
Jx=l
= 0.5e^ -0.5e^
= 23.605
and
^x=1.5
/ e^'-^dx = (0.25e^"-4)" = ^-^ - (0.25e^"-^)’'=^
Jx=l
= 0.25e^ - 0.25e°
1.597
= 1 - 1/e
= 0.6321
Figure 9.4
Note that, given the domain, /(£) begins at (0, p) on the vertical axis
and asymptotes to the t axis as t increases. Areas under the curve
correspond to probabilities, and we can use definite integration to
confirm that the total area under the curve is one. The total area is
given by the improper integral
1
rt = oo
/ pe-^“d£ = ( - =°
Jt = o
= (0 + k) - ( - 1 + fc)
= 1
The area under the curve between two values of t is the probability
that a customer’s service time lies between those limits. If in a
ft —CO
0.5t^t=3
Jt=i
= (0)-(-0.2231)
= 0.2231
ft = CO
/ pe-^^'dt = ( - = Vm
= 0 + e~^
= 0.3679
12x^ + ^- 3e~^* = 0
dx
dx
so that
y= (3e-3* - 12x2)dx
y = - 4x^ + k
At the value of x = 0,
y = -l + k= 11
z/ = -g-3^-4x^ + 12
y = x^ + + 5x + k2
and, since y = 6 when x = 0, the value of ^2 must be 5. The particular
solution is therefore
y = x^ + + 5x + 5
Exercises 9.3
1 Find the following indefinite integrals:
(i) J e^^dx
(ii) /-e“°-^'^^dx
(iii) f ne^^dx
(iv) /(dx^ + 3e^ ■^^)dx
2 Find the following indefinite integrals:
(i) / xe^^dx
(ii) /(5x-2)e’0^'-^^-6dx
(i)
L x=2
e°-^^dx
Jx = 0
(i) 5x + 0.5$^-2e2^ = 0
dx
9.4 An exponential relationship between variables x and y may take the form
Natural
x = ay (9.14)
logarithmic
functions which could be read either as
or as
This is written as
t/ = logflX (9.15)
and is defined for « > 0 and a ^ 1. (9.14) and (9.15) describe a single
relationship between x and y. They view different aspects of the same
connection and are inverse functions. Which form it is best to use
depends on the task in hand. In this section we will be concerned with
logarithmic functions and their relevance to business and economic
concepts.
Consider some examples of logarithmic relationships between two
variables, and their restatements in exponential form:
y = In X
Now consider the graph of y = logg x when 0 < a < 1. This is shown
in Figure 9.6. Note that the log function when 0 < a < 1, defined only
for X > 0, is always-decreasing at an always decreasing rate. The
following properties of logs are an alternative way of stating properties
of exponents. Where m and n are positive numbers,
(9.16) states that the log (to any base) of the product of two numbers is
the sum of the individual logs. So with the product of 4 and 8,
y
2^ 2^ = 2®
Equation (9.16) was the basis of calculation with logs - using the
properties of exponents to transform multiplication into addition.
YYi
(ii) log„ - = logfl m - log„ n (9.17)
Equation (9.17) states that the log of the quotient is the difference
between the logs of the two numbers and enables division to be
transformed into subtraction. For example,
243
logs = logs 243 - logs 27
= 5-3
= 2
y = x^
In 1/ = In (x") = n In X
y = x^
In = 3 In X
Figure 9.7
z = AxV (9.19)
In z = In A -F a In X -1- /3 In y
loghtn
(iv) Change of base: log^ m = (9.20)
logfcfl
N(l+0.02)^ (9.21)
N(1 +0.02)* = 2N
(1 + 0.02)* = 2
Using property (iii) and taking the logs of both sides with (say) a base
of 10,
so that
that is,
logio 2
^ logio (1.02)
0.3010
“ 0.0086
« 35
(1.03)* = 2
In 2
X =-
In 1.03
_ 0.6931
“ 0.0296
^ 23 A years
In 2
X =-
In 1.1
0.6931
“ 0.0953
7.27 years
In 2
X =-
In 1.3
0.6931
“ 0.2624
^2.64: years
gO.isx ^ 2
O.lSx = In 2
so that
0.6931
X = = 4.62 years
0.15
20(0.97)* = 10
(0.97)* = 0.5
xln0.97 = ln0.5
In 0.5
In0.97
-0.6931
^ “ -0.0305
22.7 years
g-o.oooix ^05
-0.0001x = ln0.5
so that
-0.6931
" = 300001 =
So the half-life of the radioactive material is almost 7000 years. Note
that in contrast to exponential growth, exponential decay to half of an
original quantity takes significantly longer than under a linear process,
where a fixed percentage of the original amount is subtracted each
period - as in straight-line depreciation. For example, in the above
case, if the volume of the contaminant was reduced by 0.01% of its
original quantity each year, then the linear equivalent ‘half life’ would
be given by
n = —— = 5000 years
0.0001
Exponential (true) half-life is therefore longer than the linear equivalent
by the ratio
— logg 0.5
1.3863
05
Iii]
annum?
9.5 We shall work towards the derivative oi y — logg x by making use of the
The derivative of facts that:
natural logarithmic
• We know how to differentiate the natural exponential function.
functions
• The inverse function rule applies since the logarithm function is
the inverse of the exponential function.
dx
dy ^ 1
dx dx/dy
__ 1
~ ey
d( loge X) ^ I
(9.22)
dx X
/.->dx = log,* + ^.
dx X
dx^ x^
dy m
dx X
/(x) = 171nx
fix) = 17x-^
dy m
dx X
then
dy 3
dx X
y = x\nx (9.23)
dx X
= 1 + In X
y = fix) In X
for which
y = x^ In
^y 31 T 21 I
— = X — + 3x In X
dx X
= x^(l + 3 Inx)
logfX
y = ^—
2x
Using the quotient rule,
dy 2x(l/x) — 2 logg X
dx 4x2
_ 1 — logg X
2x2
y = e('"lnx)
dx X
Iny = ln[e("’
7
= m Inx
= ln x"’
1/ = x'”
y = loge0(x)
dx dg dx
1
g (x)
g{x)
g'jx) (9.25)
g(x)
y = loge
dy 3x^ _ 3
dx x^ X
y = logg 6x
^=A
dx 6x
_ 1
X
1/ = In (x^ + 1)
dy 2x
dx x^ + 1
Exercises 9.5
1 Find the derivative dy/dx where:
(i) y = 20 log^ X (ii) y = log^ x^
(iii) y = x^ In X (iv) y = (4x — 2x^) In x
2 Find f'(x) where:
(i) f|x) = In x^ (ii) f{x) = In lOOx
(iii) f(x) = logg (5x^ — 6x^ — 7x^)
3 (i) Find the derivative of
logg X
y = (x2+x)
dV
(ii) dx2 given that y = x logg x
Ax Ai/
and
X y
elasticity = ^
elasticity = (9.26)
y/\x
X dy
E yx — (9.27)
y dx
p^
Ed = - (9.28)
q dp
p a — bq
p ^
Ed
q dp
where
-1
dp \dqj b
Thus
Ip {a-bq)
= —u-
(9.29)
b q bq
Note from (9.29) that, while slope is invariant, elasticity is not constant
along a straight-line demand curve. The range of variation is from oo
when q = 0 to 0 when p = 0. The graph of the demand curve is shown
in Figure 9.8(a) and that of demand elasticity in Figure 9.8(b).
Figure 9.8
R=pq
(9.30)
— —Q Ed + q
= q{l-Ed)
from which it is clear that where demand is elastic (Ed > 1) a price
rise will, other things being equal, result in a revenue decrease. But
when demand is inelastic (Ed < 1) the strategy of generating more
revenue by increasing price will succeed. The question arises as to what
point on the linear demand curve corresponds to unit elasticity. From
(9.29) this will be where
a — bq = bq
i.e. where
q = a/2b
p = 100 — 2q
Addressing task 1, first note that when q = 20, p — 60. Restating the
demand equation with q in terms of p gives
q = 50 — 0.5p
from which
Ed ( - 0.5) = 1.5
20
The most convenient way to address task 2 is to use equation (9.29),
from which
100 - 2q
Ed 0.25
2r~
so that
100-2q = 2q(0.25)
2.Sq= 100
q = 40
A
P =-
q
dq A
dp
so that
p A
Ed
qp^
PQ
Therefore
Ed = l
P = /(g)
we know that
and that
ding 1
dq q
ding 1
dq ^ q
dlnp dp/d(j
dq p
d\nq
d\np
p ^
q dp
_ ding
(9.31)
^ dlnp
This result is useful since there are demand curves for which it is
easier to find elasticity through (9.31) rather than using (9.28). For
example, if
p = Aq^
dIogeP ^ _2
dlog,q
Ei=i
Exercises 9.6
1 Given the demand-curve relationship p = 500 - 2.5q where
p = price, find the value of elasticity of demand where:
(i) p = 50 (ii) p= 100 (iii) p= 150
2 Given the demand relation p = 1000 — 4q:
(i) Find the level of output for which elasticity of demand is 3.
(ii) Find the price at which the elasticity of demand is 1.5.
(iii) Find the value of price for which total revenue is
maximized.
100
/ —dx = m log„ x + k
= logg x"" + k
r9
/ —dx = 9 In x + k
J X
= In x^ + k
(iii) dx = In/(x) + k
f (Sx^ - 4x + 4) 3 2 ^ ^ 7
= x(logg X — 1) + k
y = X logg X — X + k
^ = X- + (log,x) 1-1
dx X
= log, X
/log,x”d* = /„log,xdx
= „/log,xd*
J In x^dx = 7x(ln x — 1) + k
Note that
/ dx
—— = In (In x) + k
xlnx
y = In (In x) and In x = u
so that
y = In (u)
and
dy dy du
dx du dx
so therefore
dx u X
1
xlnx
[ — dx = 5 In (In x) + k
J xlnx
Now consider
8x(logg X — 1) + k
Exercises 9.7
Find the following indefinite integrals:
2 Find:
dx
(ii)
2x In
5 Evaluate
f*x = 5 (6x2 _ 5^1
I'l L
^x= 10
(ii) / log^ x^°dx
Jx = 4
9.8 In this chapter we have seen how integrals can be found for certain
Integration by exponential and logarithmic functions. In Chapter 8 we concentrated on
parts the integration of power functions. In this section we consider a technique
of integration that is useful for functions containing terms which are the
product of power and exponential components, although the usefulness of
the rule is not confined to this situation. The method is equivalent in
integration to the product rule for differentiation, and is known as
integration by parts. The product rule stated that if the dependent
variable, y, could be expressed as the product of two functions of x, as in
y = f{x)g{x)
then one way to carry out the differentiation was to use the fact that
dx dx ^ dx dx
it is clear that
fix) = e*
^= 1
dx
So, using (9.32),
dx
= xe* - e* + fc
= e*(x - 1) -I- k
Now consider
j 0.5x^e*dx
dx
It follows from this choice that
/(X) =
and with
g{x) = 0.5x^
dg
= X
dx
So, in the form of (9.32) we now have
/0.5AM. = 0,5.^.'-/..-d.
= 0.5x^e^ ~[e’‘(x-l)] + k
= e"(0.5x^ -x + l) + k
J xe^ ^^dx
for which we set
^ _ pO.Sx
dx~
As a result
fix) =
and since
g{x) =
/ xe^'^’^dx = 2xe'
,0.5x
Jle- Sx
dx
2xe,0.5x - +k
„0.5x
- 4) + /c
J 2x^e°'^^’'dx
d/ _ 0.25x
dx“
Thus
fix) =
and with
gix) = 2x^
it follows that
= - 256e®-2^*
8(x^ — 8x + 32)e°'^^* + k
/'X = 12
lOxe 0.4X-0.8 dx
Jx = 2
„0.4x-0.8
dx
then
fix) = 2.5e°-^*-°-*
and, with
gix) = lOx
dg
= 10
dx
200xe°^^-°^dx
functions to bases There are two stages in obtaining the derivative of a logarithmic
other than e function to a base other than e. These are:
Putting this process into effect, from the change of base formula
(9.20)
logfl X =
loggfl
d( loga ^ 1 1
dx loggfl X
y = log2 X
then
dy ^ 1
dx X In 2
I
“ 0.6931X
The second derivative can be found in the following way: writing the
first derivative as
(—V-i
dx \ln 2 )
dx^ \ln 2 )
_ 1
x^ In 2
Now consider
y = logio
^ loge
log, 10
so that
di/ _ 1 3x^
dx logg 10 x^
3
X logg 10
1.3
X
In the case of
so the derivative is
1 16x+7
/'(X) =
logg 8 (8x2
= 0.4343x(logg X — l) + k
i:
15 ^
log,
fx= 5 log, 2
which will be
x = 15
• 5
4x(log, X - 1) ---4x(log,x - 1)
log, 2 log, 2
/20x(log, X - 1)V
log, 2
300(log, 15 - 1) - 100(log, 5-1)
log, 2
451.4713
0.6931
651.34
y = a’'
means that
X = log« y
dx 1
dy ylog,a
(9.33)
= fl* log, a
i/ = 2^
= (0.6931)2^
d^M T
3-| = (0.6931)^2’=
= (0.4805)2’^
fix) = 10"
for which
/'(x) = 10"log,10
^ (2.3026)10"
f"ix) ^ (2.3026)^10"
(5.3019)10"
Consider also
fix) = (1)'
for which
from which it can be seen that the slope of the function is always
negative but is, in absolute terms, a decreasing function of x. Note that
where the base, a, is less than one, the higher derivatives will alternate
in sign. In the present case
and
/"'(x) = -0.3330(1)"
then
y=
logfl y = fix)
^ogeU
logfl y = = f{x)
log, a
so that
5!*!^=i^=(iofc«rw
dx y dx
so that
^ = {\og,a)f'ix)y
dx (9.34)
= (log,
So, with the exception of the coefficient logg a, the outcome is as for the
derivative of e^. Therefore, given
. r45C^ -7x + 8
y — ^
using (9.34),
^ = (logg 5)(8x- +^
dx
= (1.6094)(8x - 7)54*"-7x + 8
= (12.8755X- 11.2661)5^*'“^^ + ®
y =
for which
a*
+k (9.35)
logg a
f 10*
/ 10*=--—+fc = (0.4343)10*+ /c
J logg 10
J /'(x)(/^*^dx =
(/(*)
loggfl
+k
265C^ +8x —9
/ {I2x + + = -^-— + k
logg2
= (1.4427)2^"' + ®"-^+ k
a2x^ —x+1
+k
In 4
^(47 _ 4) ~ 11816
In 4
^ % (9.36)
dx
Exercises 9.9
1 Find the first derivative of the following functions:
(i) y = log5X (ii) y = logo2X
2 Find the required derivative in the following cases:
(i) f'(x) where f(x) = log^oX
(ii) f"(x) where f(x) = logio->f
(iii) f'(x) where f{x) = logg X 2
/ 2log,o x^dx
Jx = 4
y = r(l-fl-'’") (9.37)
^-bX ^ Q y. ^
a b 1 10 50 100 1000
Figure 9,10
y = Ta^^ (9.38)
Sensible results are produced when the value of b is near the upper
end of its range, while a is less than 0.5. The upper asymptote is at T
(representing market saturation, so T > 0) and the y intercept occurs at
y = Ta. The function has the x axis as an asymptote as the value of x
from which the derivative dy/dx could be more easily found in the
following way:
dx
=i^
y dx
so that
y = 20 000(0.25)^°”^'
Suppose that the department has narrowed its options to five possible
lengths of campaign, including no campaign at all. Aggregate sales as
given by the Gompertz function would be as shown in Table 9.7.
Table 9.7
length of campaign aggregate sales % market potential
(x) iy)
0 5 000 25.00
13 11659 58.29
26 16210 81.05
39 18429 92.15
52 19 373 96.87
While the example hasn’t been set in the wider context of profits
maximization, from Table 9.7 we can see that it is unlikely that, with
the market already at 92% of saturation level, the campaign would run
beyond 39 weeks. S-shaped patterns of aggregate demand are also
produced by the logistic curve. In this model aggregate sales, y, are
given by
T
(9.39)
1 + ae-^^
x = -^ln(l/a) (9.40)
T = 20 000 and a = 3
These values give the same upper asymptote (20 000) and y intercept
(5000) as in the Gompertz example. A value of b about 0.1 produces
similar overall results. Table 9.8 shows corresponding values from the
logistic and Gompertz curves.
Table 9.8
x = -^ln(l/3)^ll
1, 2, 3, 4, 5, 6, 7, 8, 9, ...
... - 5, - 4, - 3, - 2, - 1, 0, 1, 2, 3, 4, 5, ...
The integers comprise all positive and negative whole numbers and
zero. The negative integers are given by subtracting the counting
numbers from zero. The integers (think of integral meaning whole) can
be seen as part of a broader category - the rational numbers. A
rational number is the ratio of two integers and so may be an integer or
a fraction. When expressed in decimal form, a rational number either
terminates or repeats. For example,
17^8 = 2.125
is a rational number which repeats ad infinitum the six digits after the
decimal point. A rational number terminates only if its denominator
has no prime factors other than 2 or 5. Irrational numbers cannot
be expressed as the ratio of two integers. ‘Irrational’ is not used in the
sense of ‘unreasonable’, but rather as ‘non-ratio-nal.’ The real
numbers constitute the rational and irrational numbers taken
Appendix 401
together. The real numbers contain all decimal representations, but
there are equations which have no solution in real numbers. The
simplest case is
+ 1 =0
a + bi
a + bi and a — bi
For example,
X e A
which is read as
‘x is a member of A'
or as
‘x belongs to A’.
y /A
A = {1,2, 3,4, 5, 6, 7}
The description is the defining relation for the set. A, with a finite
number of elements, is a finite set. Sets which have the same number
of elements are said to be equivalent even though their elements may
differ. So the sets
(1 2 3}, (5 6 7} and (X Y Z}
are equivalent. Sets are equal when the membership of the two sets is
identical. So
(1 2 3} and (2 3 1}
are equal; the order in which the elements are listed does not matter. A
subset of a given set has as its members some (or all) the elements of a
set and no other elements. Thus, for example,
B C A
which reads
‘B is a subset of A’
Be A
The set
G = {2, 4, 6, 8}
Appendix 403
is not a subset of A since it contains an element which does not belong
to A. The set G not being a subset of A is indicated by
G%A
F C E and E C A, then E C A
The number of subsets for a set with n elements is 2”. The set of all
subsets of a given set is called the power set, written as 2*. So set A
with seven elements has 2^ = 128 subsets, while a set with 30
elements has a power set with 1 073 741 824 members!
A.2.1 We may wish to refer to the aggregate membership of two or more sets,
Operations on sets This is the union of the sets, and for sets E and F is written as
EUF
EUF= {3, 5, 6, 7}
As another example
BUC= {1, 2, 3, 4, 6}
Note that where an element is in both sets it appears once only in the
union. The order of elements does not matter and its immaterial
whether we write E U C or C U B. Union is an operation that can
include more than two sets. For example,
CUEUF = {2, 3, 4, 5, 6, 7}
Bnc
Bnc = {2, 4}
Note that where two sets have no elements in common, the intersection
of the sets has no members, i.e. it is the null set. So
Fnc = 0
H = {2, 3, 4, 5}
then
BncnH = {2, 4}
En{BUC}
First find B U C
BUG = {1, 2, 3, 4, 6}
En{BUC} = {6}
We may wish to refer to all items that do not belong to a set. This
concept of the complement of a set allows contradictions and it is
better to consider complementarity as relative to another set. This is
not restrictive since the other set could be the universe of discourse.
Any set and its complement are disjoint. We shall here be concerned
only with the universal complement - all elements in the universal set
that do not belong to A. The complement of A is written as
A or C{A} or A'
A = {8, 9, 10}
B-C = {1, 3}
Appendix 405
while
D-F= {1. 5, 7}
and
F- B = 0
B0C = {1.3,6}
\ JL • t\> t n\i
t\r • t\i • iX' • tX' • A, » • • 1H1 "VC
There are a number of rules for handling exponents and we will use
this simple definition to reinforce the rules in various examples.
A.3.1 When x” is multiplied by x'", the result is x raised to the sum of the
The product rule powers:
.yn m _ yH + m
tX iX - X
= (xx:x)(»:x) =
x^x^x^ = x^^
Appendix 407
Figure A.2
(a)
(b)
Figure A.3
(b)
(x")'" = x""’
(x^)^ = (xx)(xx)(xx) = X®
As with the product rule, more complicated cases can be dealt with
under the power rule. Consider the case of repeated raising to a power:
((x")"’f = x""'^
((X^)')" = X24
{xyf = x"y"
A special case here is where only one variable, x, is involved but where
X has a coefficient, a, other than one. It is the case that
(ax)" = a"x"
So, for example,
(x’^yPy = x^^yP^
= {x^y){x^y){x^y) = x^y^
A.3.3 Where a term consists of the ratio of x raised to differing powers, the
The quotient rule result is x raised to the difference of the powers, i.e.
— = x"-'"
X"'
For example:
X5 •yi'V'V'■V’
•A-A/•'V A* (A- o
_ ■'
—_
■ i\i
'ytj,
►V
/Y''Y'
yX/A/
The quotient rule is the product rule applied to the case where one
of the powers is negative. There are several interesting special cases of
the quotient rule:
(i) m = n — = x''~" = x° = 1
X”
Appendix 409
This provides a revealing demonstration of the fact that — 1
(ii) m = n+ 1 +1
(iii) n = 0 = = x-"'
x^
(xi/'")"’ = = X^ = X
36^/2 = ±6
125^/^ = 5
256^/^ = ±4
243^/5 = 3
64^/^ = ±2
Note that where m is even there are two real roots of opposite sign.
Now suppose the numerator of the exponent is a number other than
one. If X is raised to the power n/m, the result is the mth root of x", i.e.
x^!^ = ^,/x^
^ ^2v/2
Care may be required, as in this example, where the result, the sum of
the exponents, is not x^. As an example applying the power rule,
consider
= x^
In the above cases the base has been variable and the exponent
constant. In other circumstances the reverse may be true, or indeed
both base and exponent may be variables, as in x*. We will not
consider exponential functions (which result when the exponent is
variable and which are the subject of Chapter 9) at this point, other
than to point to a couple of examples. Note that in
3* = 81
-1024^ = -4
ABS(x) or |x|
Appendix 411
is the number itself. When x is negative, the modulus is minus x. Thus,
for example,
but
|6 + (-4)| = 2 < |6| + | -4|
(ii) |fl-1^1 > l|a| - |fe||
So that
|12 -9| = 3 = 1|12| - |9||
but
|-12-9| = 21>||-12|-|9|1 = 3
(iii) ABS(flb) = ABS(fl)ABS(&)
So that
ABS(( - 6)5) = ABS( - 6)ABS(5) = 30
8459067.123
0123456789
654.3
in octal represents
Appendix 413
Fixed-point notation (or an informal variant) is the most common
everyday use of the decimal system.
In floating point notation a number is shown as a multiple of a
base raised to a power. The ratio
250000
128
with four places before and after the point, has as its quotient in fixed-
point notation
1953.1250
19.53125 X 10^
0.1953125 x 10^
195312.5 X 10“2
1953.125 X 10°
1.953125 X 10^
5.960464 X 10"^
5.960464E“^
2+2=4 .
is with the operator - here addition, the symbol for which is '+’ -
placed between the addends or, in general, the arguments to which
+ 22 = 4
2 2 + = 4
A.6 Rules of precedence govern the order in which operations are carried
Precedence in out. In the absence of such rules, in infix notation the expression
arithmetic
7-3x4
operations
could have different values depending on which of the operations of
subtraction or multiplication was done first. With subtraction first,
7-3x4 = 4x4=16
7-3x4=7-12 = -5
1 Exponentiation.
2 Multiplication and division.
3 Addition and subtraction.
Thus
Infix notation alone does not always give the most economical
representation of a series of operations. For example, consider the
following set of instructions in four digits and three operators:
1 Subtract 3 from 7.
2’ Add 4 to 2.
3 Multiply the numbers produced by instructions 1 and 2 above.
(7 - 3) X (4+ 2) = 24
Appendix 415
which is more clearly an accurate statement of what is required than
the bald sequence of symbols (or string)
7x4+7x2-3x4-3x2
7 3 -4 2 + X
Read from right to left, the string reads ‘multiply the sum of 4 and 2 by
the difference between 7 and 3’. Although they are unambiguous, ease
of reading is not a notable feature of either prefix or postfix notation,
and the legibility provided by parentheses is a worthwhile net gain.
More complicated sets of instructions than 1-3 above may require
several levels of bracketing, or nested brackets. There are rules of
precedence for brackets - innermost brackets are evaluated first, then
work proceeds outwards to the highest level of bracketing. By
convention, the innermost brackets are parentheses, which are worked
out before square brackets [ ], which are evaluated before braces or
‘curly’ brackets {}. Occasional use is also made of a vinculum, a solid
line,_, placed above or below terms to be aggregated, as in
ax + by — c
5-1=4 and 49 - 32 = 17
4 + 2{5 x4-17}-10
20- 17 = 3
4+2 X 3 - 10
i.e.
4 + 6 - 10 = 0
Appendix 417
where, for example, if x is £ and y is DM, then (at the time of writing)
fl = 2.70. If X = grams and y = kg, then a = 0.001. Equilibrium
conditions derived from the model (such as marginal revenue equalling
marginal cost) are unaffected, although the numerical values at which
equilibrium is reached depend on the units chosen. Ease of use,
convention or transactions being conducted in an international unit of
account (e.g. the Ecu) or a single national currency (e.g. US dollars) are
common grounds for selecting one unit over another.
It is not always appropriate, or even meaningful, to measure a
variable against a ratio scale. An example is provided by utility
functions in consumer theory or portfolio theory, where alternatives
are ordered in terms of desirability and choices are based on individual
or collective preferences. For such purposes an ordinal scale is
appropriate. When a variable is measured against an ordinal scale no
significance is attached to ratios of numbers and differences between
numbers have no meaning apart from sign. This is because an ordinal
scale simply puts into rank order the sets of values of the arguments
of the function. Suppose that an investor prefers the combination (A)
of 20 units of return and 100 units of risk to the combination (B)
giving 30 units of return and 160 units of risk. This preference
ordering would be reflected by a function, /, which attached a larger
number to combination A than to combination B. That is, a function,
/, for which
/(20100) >/(30160)
g=f or g = ^/f
y = ax + b, where a > 0
F = 32 + 1.8C
Appendix 419
month or year is used as convenient. Optimal production plans relate
to resource availabilities and production levels expressed as rates of
flow per unit time. In contrast, a stock variable relates to the total,
number or aggregate value of items existing at some time. The
number of cars on the road, a firm’s finished goods inventory or the
nation’s currency reserves and available supply of capital goods are
examples of stock variables. Macroeconomic models use the concept of
stock variables. Where resources are limited it is important to
distinguish between two types of resource - those resources for which
unused supplies today can be carried over to tomorrow and those for
which non-use of today’s supply means that the productive potential is
lost. The former category is called a pool resource and the latter a
non-pool resource. Materials often represent a pool resource,
whereas labour is non-pool; unworked hours yesterday are not
available today.
^1' ^2» ^5
Xi, where i = 1, 2, 3, 4, 5
and where X; is the typical element. We may wish to know the sum of
the returns, S. This could be set out in full as
i=5
s=
i=l
S = ^ Xi = X2 + X3 + X4
i=2
If the upper and lower limits of summation are the same, just one
element is involved. For instance,
i=2
s = y^xi = X2
i=2
Another special case is where all terms in the summation are the same.
The result is simply the constant value multiplied by the number of
terms, as, for example, when each term takes the constant value k:
i=n
k = k + k + k + k + ... = nk
i=l
y^7 = 7 + 7 + 7+ 7+ 7 + 7 = 42
i=l
can be written as
3
yy WiXi
1=1
A special case of a weighted sum is when all weights are the same,
for example:
5
y^ = 4:xl + 4x1 + 4^4 + “^^5
i=2
i=2
Appendix 421
Note that the constant can be taken outside the summation sign. The
following are examples of the ways in which the notation is used:
3
^ ^ x=1-)-2t3 = 6
X=1
This shows the sum of specific numbers rather than variables. In this
case the terms are the index of summation itself. A convenient formula
exists for finding the value of such a sum, given only the number of
terms, the first term and the common difference between the terms.
Other expressions can be calculated directly. Consider
20
J]](0.1x^-2x^-3x+10)
j:=1
30 20 15 12
77_
YY
i=i/=i
The most convenient way to take this sum is to take the first value of
the index i and run j through all its values, then take the next value of
i and again run theindex through all its values - and so on.
Adopting this approach, the sum set out in full is
A.8.2 If the product of terms is needed (as when finding discount factors if
Product notation year-to-year interest rates differ) product notation uses the symbol H
to indicate the product of terms in the same way that ^ means the
sum. Thus
k=n
JJ = Xi>;2^3 • --Xn-lXn
k=l
represents the product of n terms x^. The product may involve values of
the index itself as in the factorial of a number. This is
k=n
JJ/f=lx2x3x4x5x...xn = n!
k=l
k^b
Y[k= 720 = 6\
k=l
Y[
k=l
Appendix 423
is a sequence. The symbol Rt, indicating the return in year t, represents
the typical member of the sequence and is called the general term.
Where successive terms of a sequence are related in the same way, the
sequence is called a progression. The way in which pairs of terms are
related defines a particular kind of progression. Because of their
importance in finance and economics, we will be concerned with
arithmetic progressions and geometric progressions. In an
arithmetic progression, consecutive terms differ by a fixed amount.
Thus, for example, the sequence
a = 50, d = 10, n = 6
and
a = 13, d = —3, n = oo
Using the first example to check the value of the last term as given by
(A.l), this should be
which is correct.
A financial example of arithmetic progression is simple interest,
where a fixed sum is paid in each period regardless of the amount
to which the original deposit has built up. So if £100 is banked on
1 January 1999 at 12.5% simple interest per annum the amount on
deposit increases by £12.50 each year, and a running record of
these amounts makes up an arithmetic progression shown as
follows:
The value of the investment after n years can be obtained using (A.l),
but care is required. The value after one year is the second term in the
progression and in general the value after n years is given by the
The nth term of this progression gives the book value after (n — 1)
years. Thus, since a = 50 000 and d = —5000, the book value after
seven years (corresponding to term eight in the series) will be
t=l
This is a power series. Power series are sometimes used to give a more
tractable representation of a function - a particularly important
example is the Taylor series. The sum, of an arithmetic progression
of n terms could be written in full as
A formula for this sum can be found by noting the result of adding
terms in a particular way. If the first and last terms are added, the
result is
Appendix 425
Terms can be paired off so that the sum of pairs is 2fl + (n — l)d. Since
there are n/2 pairs all told, the total is
note that a = 10, d = 5 and n = 100. So using the formula, the sum is
For
S„=|[-400 + (n-l)10] = 0
i.e.
^[-410 + lOn] =0
So, disregarding the trivial case in which n itself is zero, the sum is zero
when
-410 + lOn = 0
t= 0 5 5 5 5 5
t= 1 5 5 5 5
t= 2 5 5 5
t= 4 5 5
t= 5 5
15 20 25
E 5 10
d = 1 - 0.2 = 0.8
Appendix
427
and the value of the car after n years is given by the {n+ l)th term of
the progression. So after three years the value of a car bought for
£9000 would be
Note that
dSfj — ud -|- ad^ -I- ad^ -I- ad^ 4-... -I- ad" ^ 4- ad"
so S„ and dS„ have all but their first and last terms in common. That is
S„ — dS„ = a — ad^
so
so that
_ a(l - d")
(A.3)
“ (1-d)
111 1
5^(l + r)-‘ (A.4)
(1 + r) ^ (1 + r)2 ^ (1 4- r)^ ^ ^ (1 + r)" £=1
The constant difference in the annuity sum is 1/(1 -|- r) and the first
term, a, is also 1/(1 4- r). Use of this information in (A.3) produces
(l + r)-^[l-(l + r)-"]
1 -(1 + r)-^
(A.5)
1 - (1 4- r)-"
£282.24
0.1
428
Mathematics for business, finance and economics
An important property of (A. 3) is that if d < 1 the series has a
bound on its value even if the number of terms is infinite - it is
convergent. In a sum to infinity, the index of summation may be
unlimited from above, from below or in respect of either extreme, as
with
oo t=b 00
Xt or Xt or ^ Xt
t=a —oo —00
so
as M —> oo Sn ^ (^-6)
(1 - d)
500
3.5
PV = £63.64
0.055
Chapter 1
(ii) -1
(iii) -1
(iv) -1/6
(v) 7/4
3 (i) zero (horizontal) (ii) undefined (vertical)
(iii) slope is +1
In case (i) the slope formula would involve division by zero.
4 (i) cl=[{9-5f + (5-3f + (S~4ff
= (16 + 4+ 16)5
= 365
=6
(ii) d=[(3-8)2 + (6-6)2 + (12-6)2]5
= (25 + 0 + 36)5
= 615
«7.81
(iii) c/=[(7-8)V(-2-1)2 + (_8-1)2]2
= (1 +9 + 81)5
= 915
^9.54
4 (i) y = 5x - 11 (ii) y = 2x + 1
5 (i) Above the line, (ii) On the line,
(iii) Above. (iv) Below.
Exercises 1.5 1 (i) (a) Here f = 2, so n(t) = 90 (b) Here f = 5, so n(f) = 105.
(ii) In this case t = -3, so that, from the equation, n(t) = 65.
2 (i) / = 1000 — 40f, where / is the level of inventory and f is the
number of days'trading.
(ii) Stock falls to the predetermined reorder level on completion of t
days trading, such that / = 1000 — 40f = 240, so f = 19.
(i)
(iii) x= 1 y = -2 (iv) x = 2.5 y=1.2
2 (i) X = 6 y= 8 (ii) x = 7 y = -3
3 (i) x= 10 5 y= (ii) X = 3
y=8
Substitution is more convenient in these cases since rearrangement is
not needed.
4 (i) Unique solution: x = 5, y = 8.
(ii) Infinite number of solutions (which must, however, satisfy
y = 9.5 + 0.5x). The 'two' equations describe the same line.
(iii) Unique solution: x = 2, y = 7.
(iv) No solutions. The equations are inconsistent; they generate
parallel lines.
Exercises 1.11 1 Using the first equation to eliminate x from the second and third
equations
X + 3y + 2z = 29
2y + 2z = 20
-2y + z = -2
from which the second equation is used to eliminate y from the third
equation, which is then divided through by 3. The result is
X + 3y + 2z = 29
y+ z = 10
z = 6
x= 5 y = 4 z = 6
2 Using the first equation to eliminate x from the second and third
equations
X + 0.5y + z = 9.5
y + 2z = -1
y+z= 7
x + 0.5x + z = 9.5
/ + 2z=-l
z = -8
x= 10 x= 15 z = -8
4800
= 2400 units
5-3
5400
3086 units
^ 5-3.25
5400
2400
3.25
so p = 5.5
(i) Breakeven output level is
13 500
= 3375 units
12-8
(ii) 50% of plant capacity.
(iii) (a) Breakeven output level is unchanged,
(b) New breakeven output level is
13 500
q= = 3000
13.5-9
Solutions to exercises
chapter 2
Exercises 2.1 1 (i) < (ii) > (iii) > (iv) 7^ or <>
2 (i) > (ii)< (iii) > (iv) <
Exercises 2.2 1 (i) x<10 (ii) X < 8.5 (iii) x >-5 (iv) x < 0
2 (i) The solution set is empty. The identity simplifies to the statement
1 <0.
(ii) All values of x satisfy the inequality. The terms in x cancel,
leaving the statement 10 < 12.
3 (i) 5<x<12
(ii) 9<x<30
(iii) The solution set is empty. The inequalities require that,
simultaneously, x > 10 and x <5.
4 (i) 4 < X < 6 (ii) 3 < x < 7 (iii) 3 < x < 7 (iv) 3 < x < 7
Exercises 2.5. 1 For this problem the optimal solution is located at the intersection of
the two constraints. This is shown as point B in Figure S2.1. Optimal
values of the decision variables and objective function are
xi = 12 X2 = 9 F = 186
Figure S2.1
Figure S2.2
Chapter 3
6 C^ = [-8 0 V2 4]
TT
3 = [100]
-1
0
7 A is not symmetric; B is skew-symmetric; C is symmetric; D is
symmetric.
(ii) 5 20'
15 18
436
Mathematics for business, finance and economics
5 3
3 A+B
2 -1
(i) Thus when k = 2
10 6
k[A + B] =
4 -2
-5 -3
k{A+B)--
-2 1
12 22
13 -3
7 -7
(iv) 15 -2 36
11 -20 15
29 26 12
3 5
-5 8
1 18
-18 19
16 55
(iii) A^ =
55 39
2 0
.0 2
so / A and the matrix A is therefore not idempotent.
(ii) In this case:
8 7
-8 -7
22 -26
AB
26 22
22 -26
BA
26 22
' ']r4 [3 6]
4
221
19 [3 6]
11 J
66 132
57 114
33 66
3 2 0 '1 3'
1 0 1 •
1 2 -1 2 1
-
3 5 -1
2 5 1 0 4
7 11
1 0
5 1
3 5 -1
12 15
36 40 55 -4
12 8 5 4
54 57 75 -3
CD
o
5 3' 1 O'
1
5 4 -1 1 0 1
5 5 -1 0.2 0 -1
3 (i) Ad/A=
(ii) AdjA= t
(iii) Adj A =
(iv) AdjA =
■1.5 -2.5'
4 (i)
-1 2
0.4 -0.2
(ii) -0.3 0.4
■-1 0.4'
(iii)
2 - 0.6
2 5
(iv)
4 15
(v) No inverse: determinant is zero.
5 (i) Involutional: A“^ =A
(ii) Orthogonal: A^ = A“^
(iii) Involutional: A“^ =A
(iv) Involutional: A~^ =A
1-1
1_1
Oi o
1-1
CN ■—
o o
c> c>
Xl 8
2 (i)
L
5
Xl ■ -3'
.^2. .
7
Xl 9"
.^2. L
-4
Xl 9.5
.^2. .
15
’xi ^ O'
(v)
.^2. L
19
Xl ■ 100
(vi)
.^2. _
-90
Xl 20
(vii)
.^2. L
20
Xl ■ 1.5 -2.5' '90' '10'
3 (i) 2 50 10
.^2. 1
(ii) Using the inverse matrix, it is required that
Xl -2.5 b] 0
>
.^2. 2 bi 0
Exercises 3.7 1 (i) As revealed by the fact that |>l| = 0, the rank of A = 1, the
second row being the first row multiplied by -0.5. Since the
same relationship holds between the right-hand-side elements,
the equations are consistent, and any pair of values that satisfies
2xi - X2 = 20, that is X2 = 2xi - 20, will be a solution to the
system.
(ii) Here, the rows (columns) are linearly independent and the rank
of the A matrix is 2. The equations have the unique solution
given by
1 -1.6' ’1 r ■ 3'
-0.5 0.9 5 -1
(iii) Here, |A| = 0 and the rows (columns) of the A matrix are
linearly dependent (row 2 is —2/3 of row 1). But in this case
the right-hand-side elements do not stand in the same
relationship as that between the rows of A. So the equations are
inconsistent, and there is no solution to the system.
2 (i) The rank of A = 1. The second and third rows are twice and
four times the first row, respectively.
(ii) The rank of A = 1. Here, the linear dependency is more easily
seen in the relationship between the columns. Column 1 is twice
column 2, while column 3 is — ^ column 2.
(iii) The rank of A is 2. The third row is twice the first row so the
rank cannot be greater than 2. But the rank cannot be less than
2, as the second row is not a multiple of the first.
3 (i) Yes. The third column of the matrix is the sum of the two vectors
making up the basis, and the fourth column of the matrix is the
difference between the two basic vectors.
(ii) The rank of A is 2.
4 The rank of the matrix is 1. All columns can be seen as constant
multiples of any one given column.
(b) _ 8 4
2 1
= 8-8
= 0
(d) _ 7 9
m3 2
“34
= 28-27
= 1
(e) 5 9
= 2 1
= 5-18
= -13
now
5 1
mil ^ 20-2 = 18
2 4
so
cii (-l)'^'mii = 18
4 1
mi2 16-9 = 7
9 4
so
C12 (-l)'^'mi2 = -7
4 5
mi3 = 8-45 = -37
9 2
so
So the determinant is
where
4 5
mi3 = 8-45 = -37
9 2
2 4
m23 = 4-35 = -32
9 2
2 4
^33 = 10- 16 =-6
4 5
so that
A1 = 3(-37)-(-32)+4(-6)
= 111 +32-24 = -103
where
9 8
m2i = = 36-24 = 12
3 4
and
7 9
m23 = 21 -45 = -24
5 3
|Al = -12-(2)(-24) = 36
where
4 2
m3 2 32-16= 16
8 8
and
4 3
0133 = 20 - 24 = -4
8 5
\A\ = 16 + (4)(-4) =0
so
where
4 9
mil = = 12 - 18 = -6
2 3
5 9
mi2 = = 15-27 = -12
3 3
5 4
mi3 = = 10- 12 = -2
1 2
so
1 2
m2i = = 3-4 = -l
2 3
1 2
m22 = = 3-6 =-3
3 3
1 1
m23 = = 2 -3 = -1
3 2
1 2
^31=. o =9-8 = 1
4 9
1 2
^32 = ^ ^ =9-10 = -l
5 9
1 1
0133 = 4-5 = -l
5 4
J_ cn C21 C31
1-1
Cl 2 C22 C32
JA\
.Cl3 C23 C33
so
-6 1 r -3 0.5 0.5
1
12 -3 1 6 -1.5 0.5
2 1 -1 -1 0.5 -0.5
-2
7 2
= 10.5- 10 = 0.5
5 1.5
445
Solutions to exercises
So the third row cofactors ore
The inverse is
Exercises 3.9 1 The determinant of the coefficient matrix is -1 and its inverse is
■-6 5 -4
A ’ 13 -10 8
-2 1.5 -1
->^2 = -2 -1 2 139 25
.^3. -0.2 -0.4 0.6 78 -8
'xi ■
-
IV ' 3.6 2.2 -3.8' 9 'O'
->^2 -2 -1 2 63 9
.^3. -0.2 -0.4 0.6 45 0
From the X] line: 3.6bi + 2.2(1 85) — 3.8(1 85) > 0, which implies
that b] > 82.22. From the X2 line: —2b] — 185 + 2(1 85) > 0,
which implies that b] < 92.5. From the X3 line: —0.2b] — 0.4(1 85)
+ 0.6(1 85) > 0, which implies that b] < 1 85. So the tolerance
interval for this parameter is 82.22 < b] < 92.5.
For 62 h is required that
^2 = -2 -1 2 82 > 0
From the xi line: 3.6(85) + 2.2b2 — 3.8(1 85) > 0, which implies
that b2 > 1 80.45. From the X2 line: —2(85) — 62 + 2(1 85) > 0,
which implies that 62 < 200. From the X3 line: -0.2(85) — 0.4^2
+ 0.6(185) ^ 0, ^/hich implies that 182 ^ 235. So the tolerance
interval for this parameter is 180.45 < 82 < 200.
For 83 it is required that
-2 -1 2 185 > 0
3
^ . -0.2 -0.4 0.6 . ^3 0
From the xi line: 3.6(85) + 2.2(1 85) — 8.883 > 0, which implies
that 83 < 1 87.63. From the X2 line: —2(85) — 185 + 283 > 0,
which implies that 83 > 177.5. From the X3 line:
—0.2(85) — 0.4(1 85) + O.683 > 0, which implies that 83 > 151.67.
So the tolerance interval for 83 is 177.5 < 83 < 187.63.
Exercises 3.10 53 1
46 2
6 2
106-46
18-6
So the solution is
xi = 5 and X2 = 8
(ii) 26 5
3 3
xi
4 5
1 3
78- 15
12-5
= 9
4 26
1 3
1 3
12-26
12-5
where
1 5
mn = = 3 - 30 = -27
6 3
4 5
mi2 = = 12-10 = 2
2 3
4 1
mi3 = = 24-2 = 22
2 6
So
A| = 2(-27)-3(2) + 22 = -38
16 3 1
34 1 5
30 6 3
CO
16 1 5 34 5 + 1 34 1
1
6 3 30 3 30 6
So
-114
><1 = 3
-38
For X2:
2 16 1
4 34 5
2 30 3
34 5 -16 4 5 +1 4 34
30 3 2 3 2 30
So
-38
And for X3:
2 3 16
4 1 34
2 6 30
X3 =-
-38
in which the numerator determinant is
1 34 -3 4 34 + 16 4 1
6 30 2 30 2 6
So
X3
X] = 3 X2 = 2 X3 = 4
■ 1 -22 7‘
A= 1 29 2
-2 12 4
1 107 7
1 61 8
-2 -92 4
X2 =-
1 -22 7
1 29 8
-2 12 4
where
1 8
mi2 = = 4+16 = 20
-2 4
1 7
0122 = = 4+14= 18
-2 4
1 7
0132 = =8-7=1
1 8
SO the determinant is
- 107(20)+ 61 (18)+92(1)
= -2140+ 1098 + 92 = -950
X2 = -950/950 = -1
-700
X = = 70
-10
R = 700(70)-5{70f = 24500
tt=R-C
= 700x - 5x2 - 150 - 40x _ ^^2
= 660x — 6x2 _ 1
560-4q-0.25q2 =0
that is
q2 + 16q-2240 = 0
which factors as
(q-40)(q + 56) = 0
so that
x2 - 800X + 70000 = 0
which factors as
(x- 100)(x-700) = 0
x^ — 5x — 50 = 0
solves for
X = —5 and x = 10
vr
_= - 90x + 4000
X
the parabola for which opens upwards and has its vertex, a
minimum, at
-(-90)
= 45
TT = -Z 2 +5000z- 1 250000
-5000
2500
-2
so that, discarding the negative root, x = 50, at which point
Exercises 4.4 1 (i), (ii), (iv) and (vi) are polynomials. Recall that only non-negative
integers are allowed as exponents. So the term 3x~2 disqualifies (iii)
and the term x^ disqualifies (v).
xy — y — 2x + 2 = 0
can be re-expressed as
(x-l)(y-2) = 0
,0 = ^0°°-200 = 240
0 5000 - 200
480
10
3 Per unit fixed costs are given by the rational function
500000
AC =-
q
so an output level of at least q = 200 000 would be needed to
ensure that per unit fixed costs do not exceed £2.50.
10
(v)
(200 - y)
5 ( Homogeneous of degree one.
( Not homogeneous.
( ii) Homogeneous of degree three.
(iv) Not homogeneous.
(v) Homogeneous of degree one-half.
(vi) Homogeneous of degree minus two.
Chapter 5
Exercises 5.3 1 The slope of the straight-line segment connecting tv/o points on
y = 5x^ for which the x co-ordinate differs by Ax is given by
1 Ox Ax -I- 5(Ax)^
= 1 Ox + 5 Ax
^=10x
dx
i) ^ = 100x^9
dx
-3
ii) -3x“ i.e.
dx
1
(v) ^=(l/3)x-2/3 i.e.
dx 3x^/^
2
H ^ = 12/3)x-/^ i.e. 3^3,3
(x) ^ = 40x^
dx
2
(xii) ^ = 2x-’' i.e. ^
(xiii) f'[x] = 0
(i) f'(x) = 9x8
dy -1
i.e. -O.lx-2
'' dx"10x2
I ■■■] dy 10 -11
I.e. lOx
0.01
(xii) ^=-0.01x-' ’ I.e.
dx „i.i
3 (i) ^'(x) = 1
(ii) P(x) = mx'^-’
(iii) f'{x] = -px~P~^
(iv) f'(x) = (n - 1 )x“"
1
(v) I.e. nx,-n—
dx X■n-i-1
(viii) ^ = >x>-^
(ix) — n2
f'{x) = n^x - 1
-« ... X
X^
‘I
(ii) ^ = 5x^-3x2
dx
(iii) $^ = 6x^+5x^-4x2
dx
3 (i) $^=12x-156
dx
So a stationary value occurs at x-= 13 where the derivative is zero.
Solutions to exercises
dy (3x^ - 2x^)(20x^ + \Ax^] - (5x^ + 2x^)(12x^ - 6x^)
(3x4-2x3)2
(7x3+5)-x(21x^)
(iii) V[x]
(7x3 + 5)2
x2 + 1
This result could have been obtained without the use of the quotient
rule, since f(x) simplifies to
(ii) Using both the product and quotient rules, with the first two
brackets representing the numerator, the derivative is:
f'(x)= [(3x-5)[(x3-2x2)(2x + 2)
+ (x2 + 2x)(3x2 - 4x)] - (x^ - 2x2)(x2 + 2x)3]
^(3x-5)2
i.e.
2 (i) (x3+xY6(x^-x3)^(5x‘^-3x2)
dy (x2-1)3(12x2+6)-(4x3+6x)3(x2-1)22x
dx- (x2_1)6
5 (i) ^=15z2(16x3)
dx
= 240z2x3
= 240(4x^)^x3
= 240(16)(x3)x3
= 3840x’^
(ii) ^=(2z-1||4x-2|
= |4w + 2)(2l(3x^-1)
= (8w + 4)(3x2- 1)
= 4(3x2 _ i)(2x3 -2x+ 1)
which expands as
24x^ -32x3 + 12x2 + 8x-4
(ii) When x = 1 the derivative dy/dx evaluates as 8.
so that
^ = -L = 5
dx 0.2
^ = 0.5
dy
so that
±-±-7
dx 0.5
(Nil 1 = 5/
SO that
dy_ 1
dx 5/^
so that
dy _ 1
dx 8y2 - 9y2 + 8y
dy _ 1
2 (i)
d^ " (y^ + y2)(6y2 + 1) + (2y3 + y)(4y3 + 2y)
_1_
dx“3(5y3 + 12y- 10)^(15y2 + 12)
3 ^ = 5x^ + 6x^ + 5
dx
dx_ 1
dy 5x^ + 6x2 ^ 5
and when x = 1
dx 1 ^
j- = TT = 0.0625
dy 16
dy _ 4
Exercises 5.9 1
dx 5
dy _ 5y + 10
(ii)
dx 2 - 5x
dy _ -480
^ ” (2 - 5x)2
3 (i) jX-
^ dx”(2xy + 5)
dy 6x - y - 6xV^
dx 4x2y + X - 7
2 (i) ^=120x=
= 360x2
= 120x2 - 72x
^ = 240x - 72
dx^
j4
^ = 50.4x^-581.4x1^
dx^
Solutions to exercises
(iv) Using the chain rule
To obtain the second derivative, the product and chain rules should
be used. The result is
$^=100-5x = 0
dx
(ii) ^=8x-72 = 0
dx
4x(x-2)(x-7) = 0
(x+l)(x-l)(x + 2)(x-2) = 0
for which
at X = — 1, f''(x] = 6
so that X = — 1 is a minimum;
at X = 1, f "(x) = —6
so that X = 1 is a maximum;
so that X = —2 is a maximum;
at X = 2, f"(x) = 12
so that X = 2 is a minimum.
dV = 12x2 - 4Q^ ^ 44
dx2
at X = 1,
= 8
dx2
= 8
at X = -4,
dV
^ = -7560
dx^
= 5040
^ = -9450
dx-^
= 25 200
X = 1 + 2/ and x = 1 - 2/
Since the leading coefficient of f(x) is negative and the power of the
leading term is even, the function decreases without limit at either
extreme and the global maximum will be at either x = 2 or x = 8.
Evaluating f{x] gives
x = 2, f{x] = 1592
X = 8, fix] = 2024
is satisfied for all x, then x = 10 is the single local maximum and the
global maximum.
4 (i) First establish where the turning points are, and whether a local
maximum lies within the permitted domain:
f"(x] = -6x + 66
is negative atx = 20, showing that the local maximum lies in the
restricted domain. The procedure calls for the function to be
evaluated for the local maximum and the end-points of the domain.
The results are
solves for X = — 10 and x = —2, both of which points lie outside the
domain. The function is then evaluated at end-points only, giving
X = 0 f(x] = 1500
x=3 f(x)=1131
which solves for: x = 10 and x = 20, which are both within the
domain. The second derivative
f "(x) = 6x - 90
f'"(x) = -15
f "(x) = 24x - 90
f'"(x) = 24
f'"{x] = 6x - 54
3x(x^ — 3x + 2) = 0
so
3x(x — 1 )(x — 2) = 0
from which it is clear that the other two roots are x = 1 and x = 2.
The third derivative
P"(x) = 9x2 - 1 8x + 6
P'(x) = 6x - 36 = 0
f"{x) = 1 8x - 54 = 0
P"(x) = 6x - 30
f"(x) = 6x2 - 1 8x - 60 = 0
f "'(x) = 12x - 1 8
Chapter 6
Exercises 6.2 1 (i) On the plane. The values of x, y and z satisfy the equation.
(ii) Above the plane. The z value in the co-ordinate triple is above
the value (200) given by the function using the x and y co¬
ordinates.
(iii) Above the plane.
(iv) Above the plane.
(v) Below the plane.
(vi) On the plane.
(ii) z = 500-0.5/2
2 (i) With / stated as an explicit function of x, the equation of the
contour is
1000 - 6x + 9x2
(10-3x)
Exercises 6.4 1 The slope is —2 in the fundamental x direction and 0.5 in the y
direction.
2 (i) | = 2x-4y
|^=-4x+l
dy
fx = 2xy2
fy = 3x2/2
fx = 640
fy = 1200
(ii) = 9900x9
fy = 10100x99yioo
(iv) ^ = 0.7l~^-^k°^
ol
^ = 0.4P-^k-°‘'
^=bn>-'
ok
Bt
(vi) — =mnx^“V"~^
ax
^=n[n- l)xV^
dz
(ii) rAL^-^K^M^
OL
dQ
= sAL^K^-^M*
W
dQ
= tAL^K^M^-^
m
6 The marginal profits on x and y are the first-order partial derivatives
with respect to these variables. So marginal profit on x is
500 + 4y — 6x; marginal profit on y is 400 + 4x — 4y.
(x - y)'
r x(x - y) + xy
' (x - yf
x2
~ (x - yf
^ (y^ - 5xy)(4xy2 - 3x2) _ (2x^2 _ ^3)( _ 5^,
(y2 - 5xy)2
Exercises 6.5 (i) The first order direct derivative with respect to x is
dx ^
d^z
- 20x3y2
dx^
= — 160x3 y3 _ 84x^y
dx'^
2 « Sl = 2x=
(ii) 0 = 5O4y7-48xV
d^z d^z
3 (i) lOxV
dxdy dydx
= -120x^y2 - 14x6 =
dxdy dydx
d^z a2;
= 45x^y^ - 24xyw^ - w =
dxdy dydx
dz
4 (i) ^ ~ 1 Oxy + 2x + 7y - 3
d^z
(iv) 12x+10y + 2 = 34 at x=l,y = 2
dx^
d^z
(v) = 6y - 8x - 6 = -2 at x = 1, y = 2
5y2
di^z
VI = 12
dx^
d^ z
fx = 20 + 8y — 2x = 0
= 64 + 8x — 64/ = 0
(-2)(-64)-82 >0
= 4x — 16 — 2/ = 0
fy = 2/ - 20 - 2x = 0
^XX = 4 fyy = 2
fx = 5+y — 0.5x = 0
fy = -4 + x- x = 0
fxx='-0.5 fyy = -l
Thus the conditions for a maximum are not fulfilled in this case.
fx = -4 + X - 2x = 0
fy = 5+ x — y = 0
(-2)(-l|-l2>0
= 5 + 2y — 0.5x = 0
= -16 + 2x- 16y = 0
d^z
^ = 2y- 12x = -14<0
ox^
d^z
2x + 2x= 14
dxdy
f^ = -14-2.5y + 3x = 0
fy = 4 - 2.5x-f 4y = 0
^ = 6-3y + x = 0
fy = —54 — 3x -f 1 8y = 0
4x = 1 = 18 fxy = -3
(11(18)-1-3)^ >0
fxx = 6x - y = 59 > 0
fyy = -X - 4x + 96y = 46 > 0
4y = -X - 4/ = -14
Exercises 6.7 1 (i) At the first order, for any stationary value of f(x, y)
fx = 8 + 2/ - 4x = 0
fy = —5 + 2x — 0.5/ = 0
(-4)(-0.5)-22 = -2<0
fx = 15 — 3/ — 1.5x = 0
fy= 12 — 3x + 3/ = 0
fx = 4x + 8/ = 0
fy = / + 8x = 0
(4)(1)-82 = -60<0
= -220-14/+ 10x = 0
fy = 310-14x + 20x = 0
fxy = -14
15-7x-6x = 0
fy = 17 - 7x - 8/ = 0
f^ = -7
|-6)|^8|-|- + = -1 <0
which identifies the point as a saddle point.
(iii) At the first order
fx = 34 — X — 5x = 0
fy = —10 — x + 4x = 0
Since the direct derivatives are of opposite sign, the stationary value
in this case must take the form of a saddle point.
3 (i) Saddle point at X =-4, X = 6.
(ii) Saddle point at x = 3, / = 1 •
(iii) Minimum at x = 2, / = 8.
(iv) Saddle point at x = 2, / = 3.
(v) Maximum at the origin.
(vi) Saddle point at x = 2, / = -12.
= 24x + 12/ + 8 = 0
fj. = 12x + 6/ - 10 = 0
fx = 60 — 5x + 0.5/ = 0
fy = 64.5 + 7y + 0.5x = 0
= -40x + 20/ - 50 = 0
fy = — 10/ + 20x + 30 = 0
fx = 4.5x — 9/ — 9 = 0
fy = 1 8/ - 9x + 18 = 0
/ = 0.5x — 1
fx = x + 2/-5 = 0
fy = 4/ + 2x - 8 = 0
^ X + 2/ = 5
X + 2/ = 4
Exercises 7.1 1 The slope of the constraint line is —1/3, os may be seen from
rearranging the constraint as:
55 1
fx = 22 + 3y - 3x
fy = 33 + 3x — 4.5/
_4=_1
fy 3
SO
(22 + 3/ - 3x) _ 1
(33 + 3x - 4.5/) “ 3
Hence
66 + 9/ — 9x = 33 + 3x — 4.5/
which rearranges as
12x- 13.5/= 33
38
y — X = 17
fx = 20 - 0.2x = 0
fy = 16-0.1/ = 0
so
20 - 0.2x 5
16-0.1x~2
x= 1.25/- 100
325 2/ .
X= ^ ' =65 0.4/
so
Note that if the function f(x, y) represents profit, then £742.5 is the
maximum price worth paying in order to remove the constraint.
dz*
,=120 8x = 0
dx
d^z*
= 2000+ 40x-0.1x2
dz*
= 40-0.2x = 0
dx
-0.2 < 0
dx^
z* = 50x2 000
dz*
1 OOx - 4x3 = 0
dx
d^z*
= 100 - 12x2 < 0
dx2
x(100-4x2) = 0
so
X = 0, X = 5 and x = —5
At the second order the condition for a maximum is satisfied for the
values X = 5 and x = —5, both of which give a maximum value of
z, subject to the constraint, of 1625.
4 From the first constraint:
X = 80 — 1 Oy — 5w (1)
w = 22 — 4y
X = 1 Oy - 30
z* = 320y - 40y2
320 - 80/ = 0
-80 < 0
X 2y
w = —h — 2
3^5
which substituted into the objective function gives
fx* = -^ + 4x+10 = 0
fy" = 4x - 6y = 0
(- 10/3)(-6)-(4)2=4 >0
6 (i) With the right-hand side = 120, the problem solves for
' With the right-hand side = 121, the problem solves for
58564-57600 = 964
(iii) With the right-hand side = 100, the problem solves for
Lx = 20 - X - 2x - / = 0
Ly = 15 — X — X — 2/ = 0
L, = 20 - X - 2x = 0
From Lx,
/ = 20 - X - 2x
15 - x-x-2(20-x-2x) = 0
6x + 2x = 50 (Ij
X + 2x = 20
5x = 30
2 2
E = f-xx(Sfy) ~ '7-^xyQxQy + f-xy(Sfx) < 0
So E evaluates as
£ = -2(-2p-2(-1)(-l||-2) + (-1|(-l|^
= -8 + 4-1 =-5<0
so
Ix = 44 + 9/ + 1 Ox - 3/ = 0
Ly = 101 +9x- 16/-2/ = 0
Lj = 29 - 3x - 2/ = 0
so
5 The Lagrangian is
L = x^x + j(20 - X - x)
Lx = 3x'^y - / = 0
Ly = x^ - / = 0
L, = 20 - X - X = 0
x^ - 3x^x — ^ (1)
-3x2(20-x) = 0
i.e.
4x2 _ ^0x2 = 0
which factors as
x2(4x-60) = 0
also
gx = -1 and gy = -1
So for a maximum
E=|6xy|(-1|^-2(3x2l(-l)(-1)<0
£ = 6|151|5)|-I|"-2|3)(15)^|-1||-11
= 450- 1350 =-900 <0
^ = 24 - 3x
dx
^ = -21 -7x
dx
-2880x +384x2 _
dx
At X = 0, the derivative dz/dx = 0, so that in this case a turning
point occurs at the boundary and x = 0 could produce the sign-
restricted optimum. The other values of x which satisfy dz/dx = 0
are found by solving
-2880+ 384x-12x2 = 0
^ = 2.4x- 12
dx
^ = 3x2 - 27^ _ 30
dx
i.e.
x2 - n Ox - 1000 = 0
X z
0 0
10 47500
100 -20000000
f.<0
xfx = 0
X > 0
fy=0
^x = 0
fy<0
yfy = 0
X>0
1 x = 0 y = 0 fx = —300 fy = 40
2 x = 0 fy = 0 fx = -100 x = 20
3 fx = 0 y = 0 X = —5 fy = —10
4 f^^O fy = 0 x = -10 x = --30
f^ = 0 fy = 0
dz
|50-x|^ = 0
dx
(50 - x) > 0
The derivative is
150-2x
dx
X = 50 when z = 5500
^ = 2000 - 16x
dx
^ = 1600 - 4x
dx
X z
0 2500
8 644
20 1940
X z
0 0
5 -2187.5
50 73 750
^ = 6x2-210x+ 1500
dx
At X = 0, dz/dx = 1500, so this is not a possible optimum. At
X = 30, dz/dx = 600, so this position is a candidate. When
dz/dx = 0, X = 10 or X = 25. Evaluating z at the three possibilities:
X z
10 5500
25 2125
30 3500
4 =-2x-2.5x+10
fy = -2.5x - 4/ + 2
= -2x-2.5y+ 10-2/ = 0
Ly = —2.5x — 4y + 2 — j = 0
9x = -2, gy = -]
X = 20 - 2x
^ = 116 - 8x
dx
492
Mathematics for business, finance and economics
Trying x = 0, dz/dx = 116 > 0, so this cannot be the optimum.
Trying dz/dx = 0, the equation solves for x = 14.5, which is (now)
infeasible. Trying x = 10, dz/dx = 36 > 0, as required for the
upper bound to be a possible optimum. Since this is the only
possibility, the solution is
X z
12 2008
5 562.5
-10 2250
40 — 16x -f 4/ — 2/ = 0
4x + 80 - 10/ - 4/ = 0
40 - 2x - 4/ - s = 0
sj = 0
s > 0, / > 0
X = 4, y = 8, / = 4, z = 480
16x — 4y = 40
-4x + 1 Oy = 80
X = 4, y = 8, / = 4, z = 480
3 The Lagrangian is
120 - 5x + 3y - 3/ = 0
3x — 7y — 60 — 4/ = 0
35 — 3x — 7y — s = 0
5/ = 0
s > 0, / > 0
x = 330/13, y = 30/13
4 The Lagrangian is
100 — 6x + 4y — 2/ = 0
4x — 5y — 55 — 5/ = 0
90 — 2x — 5y — s = 0
sj = 0
s > 0, j > 0
X = 20, y = 5 and s = 25
32-2x-2j + t = 0
16 — 2/ — /+u = 0
30 — 2x — / — 5 = 0
fx = 0, uy = 0, sj = 0
■X > 0, / > 0, / > 0, f > 0, u > 0, 5 > 0
X t X u s /
(a) tu s 12 6 4
(b) fuj 16 8 -10
(c) fys 15 -15 1
(d) fyi 16 -16 -2
(e) XUS -120 30 -44
(f) X u 1 -32 8 22
(g) Xys No solution
(h) xyi -32 -16 30
63 - 1.6x + X - / + f = 0
—35 — 0.8x + X — 0.8/ + u = 0
35 — X — 0.8x — s = 0
fx = 0, uy = 0, sj = 0
X > 0, y > 0, j > 0, t > 0, u > 0, s > 0
(ii) Without the requirement to meet the constraint the best position
is (b), where the sign requirements happen still to be met and
where
fix, y] ^ 1229.58
Li = 0, 4 = 0
4 = 40 - 2x = 0
4 = 60 - 4/ = 0
38 896
X) ^ 793.80
49
100-4x-2/-0.5k + f = 0
60 — 2y — / — /c + u = 0
65 — 2x — y — s = 0
38 — 0.5x — y — r = 0
tx + uy + js + kr = 0
X, y, f, u, /, k, r, s>0
(a) tu s r 18 29 18 -16
(b) tusk 20 25 10 3
(c) tu j r 24 26 -9 8
(d) fujk 25 30 -15 -4.5
(e) fysr no solution
(f) tysk 32.5 -75 -15 21.75
76 -468 -87 -408
(g) f X / »■
(h) tyjk 25 -60 150 25.5
(i) X u s r no solution
(j) X u s k -280 65 -70 -27
(k) X u j r -108 38 27 -16
(1) X u j k -100 30 35 8
(m) X y s r no solution
(n) X y s k no solution
(0) xyjr no solution
(p) X / / k -100 -60 65 38
so
S0-x-j-k+f = 0
60 — / — / — 2/c + u = 0
100 — X — / — s = 0
140-x-2/-r = 0
tx + uy + js + kr = 0
X, y, f, u, j, k, r, s>0
X f y u / s k r
(a) fu s r 60 40 20 0
(b) tusk 60 40 20 0
(c) tu j r 68 36 -4 12
(d) tu j k 80 60 -40 -60
(e) ty s r no solution
(f) tys k 100 -80 -20 40
(g) fy/'' 140 -180 -40 -60
(h) fyjk 80 -60 20 60
(i) X u s r no solution
(j) X u s k -120 100 -40 -60
(k) X u jr -85 70 30 -5
(1) xujk -80 60 40 20
(m) Xys r no solution
(n) Xys k no solution
(o) xyjr no solution
(p) xyjk -80 -60 100 140
Cases (a) and (b) produce the same solution to the conditions.
This simply means that the corner point at the intersection of
constraints one and two is also a point of tangency (with the first
constraint). The optimal production levels for the two products are
therefore:
X = 60 and / = 40
Exercises 8.1 1 (i) (ii) and (iii) and (v) are antiderivatives of 4x^ since each is x"‘±
constant and would give 4x^ on differentiation. Cases (iii) and (iv)
have a term in x with a non-zero coefficient which would not vanish
on differentiation.
2 Cases (i), (ii) and (iv) are antiderivatives of x^ - x. Case (iii) is not
an antiderivative of x^ — x, since its derivative is 2x^ — 2x.
2 (i) +k
10
,-4
(iii) - k, i.e. + k
4x'^
4
(iv) ^x^/^-hk
D
.1.2
(v) +k
1.2
, 0.6
(Vi) +k
0.6
(vii) 2x5 + k, i.e. 2^yx + k
yV3+l
(viii) + k
\/3 + l
j.2n + i
H 2iTTT + ‘^
3 (i) x"* + x^ +
(ii) X* - 0.5x^ -f 0.25x^ + k
= 50x5 _ i2.5x’°
(iii) ^ ^ +k
(iv) P + + k
(v) x“^ + k
2000x - 5x2
price = = 2000 - 5x
X
J (] 00 + X + 0.3x2)cIx
so that
so
3000 + k = 4000
= 1000
480- 10q = 0
q = 48
, =480q-5q2
TR = pq
p = 480 - 5q
TT = TR - TC
TC = y MC(q)dq
= I (210 + 5q)dq
= 210q + 2.5q2 + i^
when q = 10,
so
1^ = 430
= 270q-7.5q2 -430
and
drr
= 270- 15q = 0
d^
p = 480-5(18) = 390
y — J - 2x + 1 )dx
= x'^ - x4 + x3 - x2 + X + /c
300 = 3^ - 34 + 33 - 32 + 3 + it
y = x^ - x4 + x3 - x2 + X + 117
y = 0.25x4 _ q + 2x2 + 1 Ox + it
90 = 84 + it
that is
in 1 3 ,
y=10x-^ + -+ k
X^ X
35 = 20 -L+A+fc
0.4 ^0.2
i.e.
35 = 20-2.5+ 15 +it
0 = 4 + 3-1-32+ A:
5 (i) Since
dV
^= 10-30X
dx^
and
3k^ +k2=90
-2k = -290
^=24x- 12
dx^
so
so that k] = 2 and
^ = 12x2 - 12x + 2
dx
X = 4x^ - 6x^ + 2x + 12
d X
= 60
dx^
and, integrating,
60x + k]
^ = 30x2 + 1 Ox + i^2
dx
7 Reorganization gives
2
= 9x2 - 1 80 + 900
so that
^ = ±3(x- 10)
dx
X = 1.5x2 _ 20x + k
X = — 1.5x2 ^ 20x + k
X = 1.5x2 _ 20x + k
u = 10 + x2
j u^6u = 0.25u'^ + k
u = 2x2 ^ + 15
it follows that
du = (4x + 6)dx
that is.
du
(2x + 3)dx =
i3 u4
= 0.125(2x2+6x+ 15)
u = (x2 — 15)
it follows that
du = 2xdx
so that
4xdx = 2du
/■ i , 4u2/2
/ 2u2du = —2-h k
u = (6x2 _ ^ 20)
it follows that
du = (12x — 8)dx
so that
which in terms of x is
2 (i) Setting
u = 5x2 - 19
it follows that
du = 1Oxdx
so that
3xdx = 0.3du
(ii) Let
u= 5 +X
Letting
u=X—5
Solutions to exercises
then du = dx and the integral is
J + k
= -(x-5r^ +k
= -1-^ + k
(x-5)
(iv) Let
u = 3x + 4
/ -2u ^du = 2u ^ + k
= 2(3x + 4)“’ +k
2
+ k
(3x + 4)
3 Let
u = bx + c
so
du
dx
td/b}u,-n+ 1
+ k
(1 -n)
+ k
b{] - n](bx + c)"'
4 (i) Let
u = x^ + 7x - 8
/ ^=-0-25u-^ + i^
=---^-j+k
(x2 + 7x - 8)^
u= +X+ 1
J 2u~^6u = -u~^ + k
=-^-2 + *^
(x^ + x^ + X + 1
= 200
= 0-(-20000- 1000-200)
= 21 200
= 1000+100 + 50-0
= 1150
= 24 - 3000
= -2976
= 425
(i) In this case the definite integral is split into two parts:
10 /-IS
[ (x - 10)dx + [ (x - 10)dx
Js J]0
the absolute value being taken for each part. So between 5 and 10
the integral evaluates as
x=5
(0.5x2- 10x)^=^°-(0.5x2 lOx)
= 50- 100-(12.5-50)
= - 12.5
and the area regardless of sign in this interval is 12.5. In the second
part the curve is above the axis and the area is given by
f (x) = 3x2 _ ] 3^ ]6
to find the points where the function crosses the x axis. f(x) = 0 for
X = 1 and X = 5. So the interval should be divided at 1 and 5; the
total area is the sum of the absolute values of the definite integrals:
7 + 32 + 81 = 120
fx = 20 /.x = 40
/ 20dx + / xdx
Jx = 0 Jx = 20
Exercises 8.7 1 (i) Using the trapezium rule with m = 5 (and therefore with
w = 0.5), the definite integral is evaluated as
= (250+10) - (16 + 4)
= 240
m=4
and so
w = (3 - 1)4 = 0.5
and
Chapter 9
(iii ^=-5e--
dy e-o.ix
(iv)
dx
dy 3e^3x
(v)
dx
dy gO.snx
(Vi)
dx
CNX
2 (i) f"(x)
0)
II
(ii) f"(x) = -2e-0'^
(iii) P'(x) = 16(e^^ —
3 (i) P"(x) = 4eV'2>‘
(ii) p'(x) = -e-^
Note that —e“^ is its own second derivative.
6.25s-»='<
5 (i) Both the product and chain rules are involved. The derivative is
6 |i)
dx
iii) ^ = 4xe
dx
(iv) ^=(20x-30)e^°^'-2°^-^°
dx
M ^ = £^1
' ' dx 2v/x
2e° +k
^529705
(_ = e-' + 1 ^ 0.6321
^ = 4e2^ - 1 Ox
dx
so
/ = — 5x^ + k
and, since y = 2e^ when x = 1,
2e^ = 2e^ — 5 + k
X = 26^^ -5x2+5
-20e-°2^ + 0.18x2
dx
X = 100e-° 2^ + 0.06x2 + k
X = 100e“2 + 60
X = 1 OOe-O-2^ + 0.06x2 40
In2 0.6931
(i) X = = 17.67 years
In 1.04 “0.0392
In 2 0.6931
(ii) X = = 9.01 years
In 1.08 “ 0.0770
In 2 0.6931
X = 3.80 years
(iii)
hrr2 “0.1893 ■
In 2 0.6931
(i) x= = 6.93 years
0.1 0.1
In 2 0.6931
II x= = 3.47 years
0.2 0.2
The 'halving time' is x, where:
In 0.5 -0.6931
X = = 34.31 years
In 0.98 - 0.0202
In 0.5 -0.6931
(ii) x = = 13.51 years
In 0.95 -0.0513
In 0.5 -0.6931
X = : 6.58 years
In 0.9 -0.1054
In 0.5
69 315 years
- 0.00001
^ 4
(ii)
dx X
^ ,, ,,, , 4x^ 4
2 (i| = X
100 1
|ii| f'M =
lOOx X
20x^ - 18x- 14
5x3 _ 5x2 _ 7^
3 (i) Using the quotient rule:
dy log^x-x(l/x)
dx
(loge^)
1 1
logeW
4 (i) ^-1
dx X
so that
dx2
SO that
dV^_
dx^
= 1 + ln X
dx
so
^-1
dx2 X
f'(x) = 2(loggX)^ 1
so
f'M =
^ = -!-=-04
dp -2.5
pdq 0.4p
E,= ,=-^1-0.4) =
qdp q q
0.4(375)
Ed = = 3
50
. 0.4(250) _ ,
0.4(125)
= 0.33
150
2 Here
-4
dq
so that
^= -0.25
dp
and
E« = "|-025l = ^
0.25(1000-4q)
q
250 - q
q
(i) when
q
250 — q = 3q
250 = 4q
q = 65
(ii) when
q
250-q= 1.5q
250 = 2.5q
250-q ^
q
so q = 125 and p = 500.
In p = In 100 — 2.5 In q
and
c/lnp
= -2.5
c/ln q
4 Rearrangement gives
1000 - 1 Op
P
dq _ 1000
dp p2
^ p/ 1000 \ 1000
pq
Therefore
(i) when q = 2.5, p = 80 and
^
1000
2.5(80)
.
(ii) when q = 10, p = 50 and
_ 1000
^ 10(80)
^
c 1000
I _ 19^
^ ~ 40(20) ~
1 Ox(ln X — 1) + /c
then
f(x) = e"
With
g(x) = 5x
then
r
dx
/ 5xe^dx = 5xe^
-/
5xe^ - 5e^ +
5e^dx
k
5(x- l)e^ + it
(ii) With
then
fix} = e"
With
g(x) = 2x2
= 2x2e^-4(x- ])e’^ + k
= 2(x^ - 2x + 2)e^ + k
2 (i) With
—-e^'
dx“
then
f(x) = 0.5e^^
With
g(x) = X
then
= 0.5xe2^ - 0.25e2^ + k
= 0.25(2x- l)e2^ + i^
(ii) With
dx
then
f{x) = 2e°^^
With
g(x) = 5x^
then
40xe° - 80e° +k
^ _ .0.5X-0.5
dx'
so
f(x) = 2e05^-°'^
and with
g(x) = 200x
and hence
^ = 200
dx
Thus the indefinite integral is
rt ^ ^ ^
dx X In 0.2 -1.6094X
1 1
2 (i) f'{x)
xln60 4.0943x
1
f"{x)
“x2|n 10
(iii) f'N 2x
5
3 (i) f'M
xlogg8 X
0.5 0.72
(li| f'(x)
xloggO.5 X
1 (10x2-3x+10)
In 2 (2.5x3 - x2 + 5x)
1
In 2
J Inxdx 1.4427x(lnx — ]] + k
(ii) The indefinite integral is
^ ^ In, 5
6 The indefinite integral is
2 .^^Sxflnx - 1) + /c
/ fii 2.6058x(ln x — 1) + /c
8 (i) ^=(lnl0)(2x)10^'
dx
« (4.6x)10^'
’ I'l
+k
1 q5x^ - 8x
(iii) +k
In 10
j (4x-11122''’-'‘dx = I^^+(r
Chapter 5: 4, 6, 16, 19
Chapter 9: 3, 6, 10, 19
528 Index
reduced form 41 dominant term 139
vector 41 exercises 131-3, 174
see also differential: quadratic; exponential 350-65
simultaneous; straight-line homogenous 170-1
equilibrium price 49 inverse 164-6
equipment utilization rule 69 logarithmic 366-77
equivalent sets 403 monotonic 130, 171-2
escapable cost 187-8 odd and even 172
even function 172 periodic 172-3
excess demand 49 piecewise defined 130-1
excess supply 49 quadratic 132-42
excise duty 50 exercises 148-9
exogenous variables 41, 113 one linear and one quadratic
expansion paths 255 142
explicit form of equation 127 two quadratic equations 143-8
explicit function 198 quartics 153-5
exponential function 350-5, 356-7 exercises 155
and derivatives 357-60 rational 157-62
exercises 355-6, 361 exercises 162-3
integration of 361-5 several variables 220
to bases other than e 390-5 fundamental directions 222
exponential notation 414 future value
exponential probability distribution 363
exponents 406 game theory 224
irrational 411 Gauss-Jordan elimination 44, 125
power rule 409, 411 Gaussian elimination 42-5
product rule 406-8, 411 exercises 45
quotient rule 409-11 general antiderivative 313
extrapolation 18 general form of straight line equation
extreme points 66, 176 22
general form of quadratic equation
factoring 134 147
family of lines 8 general solution to differential
feasible production set 68 equations 323
feasible region 68 general term of sequence 424
financial management, and geometric progression 427-9
simultaneous equations 29-31 global maxima/minima 176-7,
finite series 425 207-13, 262
finite set 403 exercises 213
fixed-point notation 413-14 Gompertz function 398-400, 411
floating point notation 414 gradient form 21
flow variables 419 greatest lower bound 58
forecasting, straight line application growth curves 351, 353
18
free maximum 252 half-life 372
function of a function rule 192 half space 61
function notation 180 hexadecimal numbers 413
functions 126-7 higher-order derivatives 200-2
absolute value 173-4 exercises 202
composite 163-4 higher-order partial derivatives 236-8
continuous/discontinuous 167-9 higher-order systems 115-17
cubic 149-53 homogenous functions 170-1
exercises 155 horizontal lines 8
discrete 169-70 hyperbolas 147, 158
domain 127-30 hyperbolic demand curves 160-2
idempotent matrices 92 inverse function 164-5
identities 34, 45-7 exercises 198
identity matrix 81, 96 rule 196-8, 366, 373
imaginary numbers 138, 402 inverse matrix 97, 116-17
implicit differentiation 198-200 involutional matrix 102-3
exercises 200 irrational numbers 401
implicit form of equation 127 iso-profit lines 69, 227
implicit function rule 254 isoquands 227
improper integral 335-7 isotonic function 171-2
income 140 iterations 68
inconsistent equations 29
independent equations 25, 38 joint analysis 109
independent variable 127 Jordan elimination 44
indeterminate systems 38
index of summation 420 Karush-Kuhn-Tucker conditions 300
inequality constraints 289-95, 296-8 Kuhn-Tucker conditions 299-307
exercises 299 exercises 307-8
inequality relationships
linear 53-76 Lagrange multipliers 264-7, 289-95,
quadratic 145 296-8
infimum 58-9 exercises 272, 295-6
infinite product 423 interpretation of 270-1
infix notation 415 minimization problems 267-70
inflection points 151, 215 Laplace expansion 115
initial conditions 323 law of scarcity 5 3
inner product 89 leading coefficient 132
input variables 221 leading diagonal 80
instruments 127 lead time 19
integer programming 35 least upper bound 58
integers 401 left inverse 103
integrand 314 Leibniz’s notation 179
integration 312-14 limits of summation 420
applied to marginal analysis 317-20 linear dependence 99, 107, 111
by parts 386-9 linear differential equation 322
by substitution 329-32 with constant coefficients 322-4
definite 333-43 linear equations
exercises 317, 332-3, 348 exercises 35-6, 42
limits of 333 in more than two variables 36-41
of natural exponential functions one equation in one unknown 34
361-5 one equation in two unknowns 35
of logarithmic functions 383-5 whole-number requirements 35
numerical 344-8 linear function of several variables
rules 222-5
constant function 314 exercises 225
constant multiple 316 linear inequalities
power 3 1 5 exercises 55
sum-difference 316 in one variable 55
interest rates 353-4, 371 addition of a constant 55
interpolation 18 multiplication by negative
intersection of sets 405 constant and reversal of
interval scale 418 direction 56-60
inventory control multiplication by a positive
exercises 20 constant 55-6
reorder level/point 19 simultaneous 58-60
straight-line application 18-20 strict 53-4
530 Index
in two or more variables 60-5 multiplication 89-9
weak 54 associative property 95
linear programming 47, 67-76, 224, conformability in 91-2
252 identity element for 9 6
exercises 76 postmultiplication 93
simplex method 66, 68 premultiplication 93
linear regression 18 negation 87
local maxima/minima 176, 202-6 notation 81-2
exercises 242-3 principle diagonal 80
first-order conditions 239-42 rank 111-14
logarithmic derivative 377 scalar multiplication 87-8
logarithmic functions 366-72 and simultaneous equations
exercises 373, 377, 385-6. 395-6 105-10, 118-21
and integration 383-5 submatrices 79-80
to bases other than e 390-5 subtraction 86-7
logical operators 417 transposition 82-3
logistic curve 399-400 triangular 80
log-linear functions 369 zero 81
lower bounds 58 maxima and minima
lower triangular matrix 80 global 176-7, 207-13
local 176, 202-6
main diagonal 80 membership of set 402
marginal cost 186-8, 318, 337 method of substitution 256
exercises 320 minors 115
and integration 317-20 mixed derivative 236-8
marginal profit 186, 231-2 mixed expression 159-60
marginal revenue 184-8, 319 model-building 55
equilibrium condition 185-6 modified growth curve 396-7
finite approximation 184-5 monetary variables 417-18
marginal values 75, 257, 259 monotonic functions 130, 171
market clearing price 49 multi-constraint problems 262
market equilibrium multinomial equation 156
exercises 51-2 multiple roots 135
and linear demand/supply curves multi-product monopoly 308-10
49-50 exercises 311
market potential 397
market saturation 161, 397 natural exponential functions see
matrices 77-8, 124-5 exponential functions
addition 85-6 natural logarithmic functions see
associative property 86 logarithmic functions
commutative property 86 natural numbers 355, 401
conformability in 85-6 negative exponential distribution 363
identity element for 8 7 nested brackets 416
applications 115-17 nominal scale 419
Cramer’s rule 122-4 non-linear differential equation 327-8
determinants 115-17 non-linear programming 299
diagonal 80-1 non-linear systems 141-2
difference 87 non-negativity requirements see sign
dimensions 78-9 requirements
elements 79 non-pool resource 420
equal 82 non-unique optima 73, 177
exercises 83-5, 88-9, 96-7, 103-5, normal lines 9
110-11, 114, 118, 122, 124 null set 404
inversion 97-9 numbers
labelling 78 complex 402
counting 401 piecewise defined functions 73, 130-1,
imaginary 402 210, 339
irrational 401 place-value notation 412
natural 401 point elasticity 378
rational 401 point of inflection 215-18
real 401-2 exercises 218-19
numeraire 41, 261 stationary/non-stationary 215
numerals 401 point-slope form 20-1
numerical methods for integration polish notation 415
344-5 pol3momials 156-7
Simpson’s rule 347-8 differentiating 183-8
trapezium rule 345-7 exercises 157
pool resource 420
objective function 67. 127 portfolio theory 255, 418
octal numbers 413 positive quadrant 5
odd function 172 postfix notation 415
off-diagonal elements 80 postmultiplication of matrices 93
one to one function 166 power function rule 180-1
open interval 59 power rule for exponents 92, 409
operators 414-15, 417 power series 425
binary 417 power set 404
logical 417 prefix notation 415
relational 417 premultiplication of matrices 93
opportunity loss 71 present value of annuity 428
opportunity set 68 price breaks 168
optimization under constraint principal 353
250-310 principal diagonal 80
order of matrix 78 production functions 221, 369
order-preserving transformations 418 production plan 67
ordinal scale 418 production possibility frontier 68, 73
ordinary differential equation 321 production theory 218, 221
ordinate 2 and integration 337
orthants 5-6 productivity curve 227
orthogonal lines 9 product notation 423
orthogonal matrices 102 product rule for differentiation 189-90
output variables 221 product rule for exponents 92, 406-8
overdeterminate system 38-40 progressions 424
proper subset 403
parabolas 132 Pythagoras formula 3, 6
parameters 8
parametric equations 41 quadrants 5
parentheses 415 quadratic equation
Pareto optimality 68 exercises 148-9
partial derivatives 229-35 general 147-8
cross 236 and one linear equation 142
direct 236 quadratic inequality 145-6
exercises 235, 238-9 two equations 143-5
higher-order 236-8 quadratic formula, 135
mixed 236 quadratic functions 132-5, 226-8
partial sum 422 discriminant 136-7
particular solution 323 exercises 228
partitioned matrix 79-80 parabola 132-5
periodic function 172-3 points on the graph 138-40
perpendicular lines 9-11 quadratic inequality 145
perpetuities 428 quadratic programming 252, 259, 306
532 Index
quartics 153-5 several variables 236
queuing systems 363-5 test 203-6
quotient rule for differentiation 190-1 sensitivity analysis 58-9, 76, 109
in partial differentiation 234 sequences and series 423-4
quotient rule for exponents 409-11 set
complement 404
radix 412 difference 404-5
range of feasibility 59-60, 108, 121 disjoint 404
range of function 127 elements or members 402-3
rank 111-14 intersection 404
rate of substitution 71, 253 null 403
rational functions 1 5 7-62 power 403
exercises 162-3 subset 403
rational numbers 401 superset 403
ratio scale 417 union 403-4
real line 2 universal 403
real numbers 401 Venn diagrams 405
reciprocal function 158 sexagesimal system 413
reciprocal rule 191 shadow prices 75
rectangular array 77 sigma notation 420-3
rectangular hyperbolas 158-9 index 420
reduced form equations 41 lower limit 420
re-entrant region 65 terms 420
reflex polygon 65 upper limit 420
regression, linear 18 sign-restricted variables 61, 250, 258,
relational operators 54, 417 272-3, 296
re-order level 19 simple interest 424
repeated root 135 simple root 135
resource constraints 289 simplex 224-5
reverse polish notation 415 method 68
right inverse 103 Simpson’s rule 347-8
roots of equation 34-6, 134, 465 simultaneous inequalties 58
roots of numbers 410 simultaneous linear equations 23-5,
row vector 79 58-60
rules of precedence 415 and breakeven analysis 47-9
elimination 27-9
saddle points 243-6, 301 exercises 31-2
sales 140 and matrices 105-10, 118-21
saturation curves 399 substitution 25-7
saturation level 397 simultaneous quadratic equations 143
scalar matrix 81 singular matrix 94, 99
scalar multiplication 87-8 skew-symmetric matrix 83
scalar product 90, 101 slack variables 46, 71, 291, 297
scalars 79 slope 4-6, 229-31
scales of measurement 417 slope-intercept form 7-10
interval 418-19 solution sets 41, 56-60
nominal 419 convex 65-6
ordinal 418 exercises 60
ratio 417-18 feasible 68
units 419-20 solutions to differential equations
scarce resources 67 general 323
scientific notation 414 particular 323
S-curve demand function 397-8 specific sales tax 50
second derivatives 200 square of matrix 92
exercises 206 square matrix 78
stationary points 176, 239-40, 247-9 Taylor series 425-6
stationary point of inflection 215 technical independence 308
step discontinuity 131 third derivative 200
marginal revenue function 167-8 tolerance interval 59-60, 108, 121
price breaks 168 total product curve 227
welfare traps 167 total revenue 140
step function 168-9 total variable cost 337
stock control see inventory control trace 80
stockouts 19 trade-off 71
stock variables 419, 420 transitivity 403
straight-line depreciation 42 5 transposition 82-3, 83
straight-line equation trapezium rule 345-7
exercises 17-18 triangular matrix 80
finding through a given point 14-15 turning points 138, 176
finding through two known points exercises 207
15-16 turnover 140-1
formula 7-10
identifying 11-12 undefined slope 8
obtaining 12 unique solution 36
plotting 12-13 union of sets 404
straight lines unit costs of production 89-90
applications 18-20 universal set 404
distance formula 3-4 upper bounds 58
establishing on which side a given upper triangular matrices 80
point lies 16 utility functions 418
exercises 11
and simultaneous equations 23-32 variables
slope 4-6 decision/control 127
strict inequality 53 dependent/independent 127
strip 63 endogenous/exogenous 41
subject of equation 7, 127 explicit/implicit 127
submatrices 79-80 in matrices 79
sub-problems 66 slack 71, 291, 297
subsets 403 vector equations 41
substitution method vectors 79
constrained optimization 256-63 Venn diagrams 405
integration 329-32 vertex of parabola 132, 140
linear equations 25-7 vertical lines 8
sum(-difference) rule 183-4 vinculum 416
exercises 188
integration 316 Wallis's product 423
sigma notation 421 War loan stock 428
summation notation see sigma notation weak inequality 54
sum to infinity 422 whole-number requirements 35
supersets 404
supply see demand and supply X intercept 138
supremum 58-9 X values 138
symmetric difference 406
symmetric matrix 82 1/ intercept 138
534 Index
Mathematics for Business,
Finance and Economics Second edition
Michael Wilkes
-—fpt-
ISB -86152-241-X
THOMSON
BUSINESS PRESS