Professional Documents
Culture Documents
CHAPTER 1
INTRODUCTION
The purpose of this initial chapter is to revise and introduce a range of basic
mathematical definitions, concepts and terminology that will be encountered regularly
throughout this text. Often much of this material is labelled basic algebra. For many
students, numerous concepts covered should be familiar from their experience of
secondary school mathematics. However some concepts, such as inverse functions and
summation notation, may not be so familiar.
The material presented in this chapter should not be difficult for someone who has
studied mathematics till the end of high school. Because of this, there may be a
tendency not to put sufficient effort into completely mastering the material. This
tendency should be avoided, as all too often difficulties in understanding more advanced
material result from an insufficient grasp of preliminary concepts and techniques, rather
than the degree of difficulty of the advanced material itself.
1
The author is an employee at Western Sydney University (email address:
J.Ablett@westernsydney.edu.au). Other than for any use permitted under the Copyright Act 1968, all
other rights are reserved and permission should be sought through the author prior to any reproduction of
this work.
2
Definitions (Sets)
The set of all elements of interest in a particular problem or context is called the
universal set, which can be denoted by, say, S. Other basic definitions relating to sets
are as follows:
An implication of definition (v) above is that A for any set A: the nil elements in
are also in any set A. This also permits us to say that the intersection of mutually
exclusive subsets of a universal set S, i.e. , is also in S.
Example 1.1.1
Consider the set S of the first 10 letters in the English alphabet. This set can be
described in the following way
S a, b, c, d, e, f, g, h, i, j
X a, e, i
Y b, c, d, f, g, h, j
Z a, d, e, g, h, j
Clearly the sets X, Y and Z are also subsets of S. Therefore we can write
X S, Y S, Z S
eX
Z X a , e
XY
XZ
X Y a, b, c, d, e, f, g, h, i, j S
X Z a, c, d, e, g, h, i, j
3
Y a, e, i X
Z b, c, f, i
Venn diagrams are often a convenient symbolic way of portraying sets and the
relationship between them. An example is the following diagram.
Fig. 1.1
AB
A B
In Fig. 1.1, the universal set S includes all elements inside the rectangle. Set A includes
all points inside the circle marked A (analogously for B). The set of elements common
to both A and B, A B , is given by points in the hatched area. In the Fig. 1.2 below,
the hatched area represents the set of elements in A and/or B, i.e. A B .
Fig. 1.2
AB
A B
The sets A and B are disjoint (or mutually exclusive) in the Venn diagram shown
below.
Fig. 1.3
A B
S
4
Example 1.1.2
An appropriate Venn diagram depicting the sets defined in Example 1.1.1 is shown
below.
Z Y
a d g b
e h j c
X
i f
A rational number is formed as the ratio (or quotient) of an integer over a non-zero
integer. Rational numbers allow us to talk in terms of fractions or proportions. Thus
the fraction 1 2 (or 0.5 when expressed as a decimal) is the ratio of the integer 1 (the
numerator) to the integer 2 (the denominator). Integers themselves are rational
numbers since they can be written as the ratio of themselves to a denominator of 1. The
set of all rational numbers is commonly labelled Q.
Some numbers, the irrational numbers, cannot be written precisely as ratios of integers;
rather, they can be represented as infinite decimals, i.e. numbers with an infinite number
of non-zero digits to the right of the decimal point.2 Common examples of irrational
numbers are: 2 1.4142 , the positive square root of 2; the number 3.1416 ( is
the Greek letter ‘pi’), the area of a circle with a radius of 1; and the base of natural
logarithms e 2.7183 , which is formally defined in Section 1.12.3 Note that the
notation ‘ ’ means ‘approximately equals’.
2
Of course many rational numbers can also be represented as infinite decimals, for example 1/3=0.333....
3
Readers should recall that the square root of a number ‘x’ is a number that can be multiplied by itself,
i.e. ‘squared’, to give the number ‘x’. Roots of a number will be considered in Section 1.7.
5
Normally we will be concerned with real numbers, which include both the rational and
irrational numbers. Any real number can be thought of as a point on the real number
line that is depicted below.
Fig. 1.4
-2 -1 0 1 2 3
In this text, unless otherwise indicated, the term ‘numbers’ will refer to real numbers.
The notation R is used to denote the set of all real numbers. It is assumed that readers
are familiar with the definitions of the basic arithmetic operations of addition (+),
subtraction ( ), multiplication ( ) and division ( ) that can be performed on real
numbers. Readers may also recall the following simple points about arithmetic
operations in the context of real numbers:
(ii) Dividing a number ‘x’ by the non-zero number ‘y’ can also be described as
taking the ratio of ‘x’ to ‘y’. Hence we have
x
xy (or x y )
y
3
For example: 3 2 1 .5
2
(iii) The reciprocal of a non-zero number ‘x’ is given by 1 divided by the number,
i.e. 1 x or x1 .
In this sub-section the basic properties of real numbers with respect to arithmetic
operations are given. Subsequently basic properties related to inequality expressions are
considered.
6
Property 1 simply says that the order in which the addition or sum of two numbers is
performed is irrelevant.
(x y) z x ( y z)
Round brackets are used to clarify operations, in particular the order of operations.
Thus the left-hand side of the above equation simply represents the addition of ‘z’ to the
sum of ‘x’ and ‘y’, whilst the right-hand side of the equation represents the addition of
the sum of ‘y’ and ‘z’ to ‘x’. In more complex expressions, we will sometimes also
have recourse to square brackets and parentheses to delineate particular operations.
Property 2 allows us to break down the calculation of the sum of any finite number of
real numbers into a sequence of sums of two numbers. Thus for any five real numbers
‘v’, ‘w’, ‘x’, ‘y’ and ‘z’, we can write, for example
v w x y z (v w) (x y) z (v w) ((x y) z)
y0 y (e.g. 1 0 1 )
y ( y) 0 (e.g. 2 (2) 0 )
x y y x (e.g. 3 2 2 3 6 )
(x y) z x ( y z)
The associative law for multiplication essentially means that when we multiply more
than two numbers together, the order in which the individual operations are performed
is irrelevant.
y 1 y
1
y y y 1 1 for y 0
y
In words, the multiplication of a non-zero number by its reciprocal gives the number 1.
(i) For any numbers ‘y’ and ‘z’, and a non-zero number ‘x’, if x y x z , then
yz.
xy xz
(x 1 x) y (x 1 x) z
1 y 1 z
yz
xy 0
(x 1 x) y x 1 0
1 y 0
y0
(Similarly for x y 0 and y 0 )
x ( y z) (x y) (x z)
For example
3 (2 1) (3 2) (3 1) 6 3 9
( y z) x (x y) (x z) ( y x) (z x)
Strict and non-strict inequality notation allows us to describe precisely the relative
positions of two numbers on the real number line as well as intervals on the real number
line. Before stating some basic properties of real numbers related to inequalities,
familiarity with a number of definitions is required.
Definitions (Inequalities)
Suppose ‘x’ and ‘y’ are real numbers. Then we have the following definitions:
(i) If ‘x’ is to the left of ‘y’ on the real number line, denoted x y or y x , ‘x’ is
said to be (strictly) less than ‘y’, and ‘y’ is said to be (strictly) greater than ‘x’.
(ii) A positive number ‘x’ is such that ‘x’ is strictly greater than 0, i.e. x 0 ,
whilst a negative number ‘y’ is such that y 0 .
(iii) y x 0 is an equivalent statement to x y .
(iv) If ‘x’ is to the right of ‘y’ on the real number line, ‘x’ is said to be (strictly)
greater than ‘y’, denoted x y or, equivalently, y x 0 .
(v) x y means ‘x’ is less than or equal to ‘y’ (a non-strict inequality).
(vi) x y means ‘x’ is greater than or equal to ‘y’ (a non-strict inequality).
(vii) x y and y x are equivalent statements.
(viii) A non-negative number ‘x’ is such that x 0 , whilst a non-positive number ‘y’
is such that y 0 .
The subtraction of a number ‘y’ from a number ‘x’ can be thought of as the addition of
( y) , the negative of ‘y’, to ‘x’, that is
x y x (y)
For example
2 (3) 2 (3) 1
x (y) x y
2 (3) 2 3 5
We can now state the final basic properties of real numbers, which concern inequalities.
x 0 (‘x’ is positive)
x0
x0 (‘x’ is negative)
(i) Suppose real numbers ‘x’ and ‘y’ such that x y , and any real number ‘z’. Then
xz yz
y x ( y z) (x z) 0
yzxz
xz yz
(ii) For any real numbers ‘x’ and ‘y’, one (only) of the following three statements
must be true.
xy
xy
xy
This is implied by Property 10 since, considering the real number (x y) , one of the
following statements must be true
x y 0 , i.e. x y
x y 0 , i.e. x y
x y 0 , i.e. x y
(iii) If real numbers ‘x’, ‘y’ and ‘z’ are such that x y and y z , then it must be the
case that x z , i.e. x y z .
y x 0 and z y 0
(y x) (z y) 0
zx 0
xz
(iv) If real numbers ‘x’, ‘y’ and ‘z’ are such that x y and z 0 , then xz yz .
xy0 , z0
11
(x y) (z) xz y(z) xz yz 0
yz xz
xz yz
(v) If ‘x’ and ‘y’ are both negative real numbers, then x y 0 .
x 0 implies 0 x x 0
y 0 implies 0 y y 0
Then
(vi) If real numbers ‘x’, ‘y’ and ‘z’ are such that x y and z 0 , then xz yz .
(x y) z xz yz 0
xz yz
Thus multiplying through an inequality by a positive number does not change the
direction of the inequality.
(vii) If ‘x’ is negative real number and ‘y’ is a positive real number, then x y 0 .
x 0 implies 0 x x 0
Therefore
xy 0
xy 0
This is just a generalization of the observation earlier in this subsection that multiplying
a real number by 1 (i.e. taking the negative of the number) changes the sign of the
number.
12
Suppose we consider the set of real numbers between ‘a’ and ‘b’, but not including ‘a’
and ‘b’, where a b . Such a set is called an open interval as it does not contain its end
points. A short but precise way of describing this set or interval is given by
x : a x b (1)
The notation in parentheses in (1) is translated as the set of points (numbers) ‘x’ such
that ‘x’ is between ‘a’ and ‘b’. A graphical representation of this set is given below.
Fig. 1.5
a b
In the above diagram, empty circles centred at the points ‘a’ and ‘b’ indicate that the
numbers ‘a’ and ‘b’ are not included in the set.
The interval from ‘a’ to ‘b’ including both these points is called the closed interval from
‘a’ to ‘b’. The convenient notation for this interval and its graphical representation are
as follows.
x : a x b (2)
Fig. 1.6
a b
x : a x b (3)
x : a x b (4)
x : x a (5)
x : x a (6)
x : x a (7)
x : x a (8)
13
The sets given by (3) and (4) above are sometimes called, for obvious reasons, half-
closed or half-open intervals. The sets given by (5) to (8) are called infinite intervals as
they extend indefinitely to one side of ‘a’. Sometimes the set of real numbers itself is
denoted by the infinite interval written as (, ) .
Example 1.4.1
Graph the following sets/intervals:
x : 0 x 1
x : 0 x 1
x : x 1
x : x 1
x : 0 x 1 :
0 1
x : 0 x 1 :
0 1
x : x 1:
0 1
x : x 1:
0 1
x if x 0
x
x if x 0
14
Hence a positive number is its own absolute value whilst the absolute value of a
negative number is simply the number without its minus sign. As in the above
definition, absolute value is normally represented by two vertical bars, one on either
side of the number. As examples, we have
2 2
3 3
(i) Given the definition of the reciprocal of a number, division of the number ‘x’ by
the number ‘y’ is equivalent to multiplying ‘x’ by the reciprocal of ‘y’, and vice
versa.
This means the results in Section 1.4 concerning the sign of a product can also be
applied directly to division. In particular, dividing a positive number by a negative
number, and vice versa, results in a negative number, and dividing a negative number
by a negative number yields a positive number. For example
2 1
2 0.4
5 5
2 1
2 0.4
5 5
(ii) Dividing a number ‘x’ by the ratio of two numbers is equivalent to multiplying ‘x’
by the reciprocal of the ratio. Thus, given two non-zero numbers ‘y’ and ‘z’, we
can write
x z xz
x for y 0 , z 0
y z y y
5 4 20
5
34 3 3
3 8 24
3 8
3 8 3 3
2 ( 7 ) 2 6 12
( 5) 6 7 5 35
15
(iii) A fraction or ratio of two numbers can sometimes be simplified by dividing the
numerator (‘top’) and denominator (‘bottom’) by a common factor. This
operation, often described as cancelling the common factor, of course makes no
difference to the number the ratio represents. In general we have for any number
‘y’ and non-zero numbers ‘x’ and ‘z’
xy y
for x 0 , z 0
xz z
20 2 (10) 2
30 3 (10) 3
(iv) When adding or subtracting fractions it often useful to apply the following simple
procedures, which essentially amount to combining the two operations of, firstly,
expressing the fractions as ratios with the same (common) denominator and,
secondly, performing the required arithmetic operation on the numerators.
Suppose any numbers ‘w’ and ‘y’, and non-zero numbers ‘x’ and ‘z’. Then
w y zw xy
for x 0 , z 0
x z xz
w y zw xy
for x 0 , z 0
x z xz
2 4 3 (2) 5 (4) 6 20 26
5 3 5 (3) 15 15
10 5 3 (10) 11(5) 30 55 25
11 3 11(3) 33 33
Since any number can be thought of as the ratio of itself to 1, these procedures are also
applicable to cases where a fraction is added to or subtracted from a non-fractional
number. For example:
1 45
(i)
2 92
3
(ii) 2
20
35
(iii) 4
46
10 4
(iv) 10 ( 2)
56 ( 3)
(i) If ‘n’ is a positive integer, the expression b n is defined to represent the number
obtained by multiplying ‘n’ b’s together, or ‘b’ raised to the power ‘n’.
Given definition (i) above, clearly b1 b . The ‘n’ in the expression b n is often called
the exponent, whilst ‘b’ is called the base. Common examples are b 2 and b3 , or ‘b-
squared’ and ‘b-cubed’, respectively. We have, for example
23 2 2 2 8
4
1 1 1 1 1 1
2 2 2 2 2 16
4
The advent of pocket calculators in the 1970’s can be considered one major reason why mental
arithmetic is no longer considered as important a life skill as it once was. This is somewhat unfortunate,
as the clarity of thought processes developed in the practice of mental arithmetic can conceivably be
applied to other problem solving situations.
17
The second and third roots of a number ‘y’ (if they exist, cf. below) are commonly
referred to as the square and cube roots of ‘y’, respectively.
It should be clear that an ‘n’th root of ‘y’ will not be defined if ‘y’ is a negative number
and ‘n’ is an even integer, since raising any number to an even power ‘n’ must result in
a positive number: this follows from the properties of real numbers. Thus the square
root of 2 is not defined. Furthermore, the ‘n’th root of ‘y’ will not be unique if ‘y’ is
positive and ‘n’ is an even integer, since raising any number to an even power gives the
same result as raising the negative of the number to the same even power. For example
the square roots of 4 are 2 (positive square root) and 2 (negative square root).
Stated more succinctly, Theorem 1.1 says that all non-negative real numbers possess a
unique non-negative ‘n’th root.
When there are positive and negative ‘n’th roots, the positive ‘n’th root is referred to as
the principal ‘n’th root. When ‘y’ is negative and ‘n’ is an odd number, the sole root of
‘y’, which in this case must be a negative number, is also called the principal ‘n’th root
of ‘y’. When referring to the principal ‘n’th root of a number ‘y’, the notation n y is
regularly used, although y1 n is an alternative. In circumstances where ‘y’ has both
positive and negative ‘n’th roots, the negative root can be denoted n y or y1 n :
shorthand ways of denoting both roots are n y or y1 n ( means ‘plus or minus’).
The principal square root of ‘y’ is normally denoted simply by y . Examples of the
use of this notation include:
4 41 2 2
4
256 4
3
27 3 (since (3)(3)(3) 27 )
So far b n has only been defined where ‘n’ is a non-negative integer. The next
definition extends the definition of b n to include any rational exponent.
5
A theorem is a proposition or result that can be deduced logically from one or more assumptions or
givens.
18
1 1
(ii) b q q
mn ( q 0 , the case of a negative exponent)
b b
43 2 (43 )1 2 641 2 8
8 2 3 (8 2 )1 3 641 3 4
1 1
10 3 3
10 1,000
1 1 1
8 2 3 2 3 2 1 3
8 (8 ) 4
There are a number of results that are useful when manipulating expressions involving
powers and roots. These are collected together in Theorem 1.2 below.
(i) a q a r a q r
aq
(ii) r
a q r
a
(iii) (a q )r a q r
(iv) (ab)q a q bq
aq
(v) (a b ) q
bq
The results in Theorem 1.2 are also applicable for negative ‘a’ and/or ‘b’ provided they
do not involve trying to take an even numbered root of a negative number, as mentioned
above.
19
Example 1.7.1
Evaluate or simplify the following, making use of the definitions and results concerning
exponents wherever possible.
(i) (a 1 4 ) 3 where a 0
(a 1 4 ) 3 a 3 4
54
(iii)
52
54
2
5 4 2 5 2 25
5
(iv) (32 ) 2 3
23 2
(v)
43 4
32
23 2 23 2 2
1
2
34 12 32
4 (4 )
3
x 2 3y4 3
(vi) 43
where x 0
x
3
x 2 3y4 3 (x 2 3 y 4 3 )3 (x 2 3 )3 ( y 4 3 )3 x 2 y 4 y 4
43
4 2
x (x 4 3 )3 (x 4 3 )3 x x
Irrational Exponents
At this point, we are not in a position to define what is meant by b x , where ‘b’ is a
positive number and ‘x’ is any real number, that is, including irrational numbers. Such
a definition does indeed exist, but requires knowledge of integral calculus. It is
perfectly consistent with our definitions and results for expressions involving rational
exponents. Moreover, the results of Theorem 1.2 also hold for ‘q’ and ‘r’ taking any
real values. Readers are simply asked to accept that b x with ‘x’ an irrational number
does have a meaning, and its value can be calculated using any scientific calculator.
However, we will not encounter irrational exponents in this text.
20
Calculator Tip: Calculations involving exponents are readily performed with a scientific
calculator.
In general to calculate x n , input the number ‘x’, press the x y (or y x ) function
key, input the number ‘n’, then press the ‘=’ key.
Most scientific calculators also have specific function keys for calculating x ,
x 2 and x 1 (i.e. 1 x ). Depending on the calculator, one simply inputs the
number ‘x’ then presses the appropriate function key, or presses the appropriate
function key then inputs the number ‘x’.
The bracket keys ‘(’ and ‘)’ can facilitate calculations involving fractional
exponents.
For example, to calculate 34 3 the following sequence of keystrokes can be
used:
i. Input the number 3
ii. Press the x y (or y x ) function key
iii. Use the sequence of keystrokes
( 4 3 )
(the answer obtained should be 0.231120…)
(i) 36 1 2
(ii) 85 381 3
(iii) (3 64 ) 4
3
x2 3
(iv) 2 3
x
( x1 2 y1 2 ) 3
(v)
x3 4
(b) Use a calculator to evaluate the following, rounding answers to 6 decimal places
as needed:
(i) 210
(ii) 5
2 3
(iii) 6
(iv) 5 2.56
(v) 0.5 0.6
21
Let us start by considering the solution of a single equation involving a single variable.
The process of finding the solution set (if any) to such an equation usually involves the
equation being transformed using a number of operations until any solution(s) can be
readily seen or determined. These operations must be such that the transformed
equation is equivalent to the initial equation in the sense that it has the same solution
set. The following operations will always result in an equivalent equation.8
(i) Adding or subtracting the same expression to or from both sides of the
equation.
(ii) Dividing or multiplying both sides of the equation by the same non-zero
constant (number).
(iii) Swapping left and right hand sides of the equation.
(iv) Replacing one side of the equation by an equivalent expression.
x 4x
An example of (iv) above would be to replace 2 by .
2 2
The following operations can also be used in some circumstances, however they do not
guarantee a transformed equation that will be equivalent to the initial equation.
(v) Multiplying both sides of the equation by the same expression involving the
variable.
(vi) Dividing both sides of the equation by the same expression involving the
variable.
(vii) Raising both sides of the equation to the same power.
6
A mathematical variable is simply some element that can represent various values or numbers. Of
course, if a variable is to satisfy some equation, it may be that the variable can take on only one or some
other restricted number of values in the equation.
7
More generally, a solution to an equation involving ‘n’ variables will be a set of ‘n’ values, one for each
variable, which satisfies the equation.
8
The operations mentioned here can also be applied to equations involving more than one variable.
Moreover the list is not exhaustive. For example, once we have reviewed the concept of a logarithm, the
operation of taking logarithms of both sides of an equation may also result in an equivalent equation in
certain circumstances.
22
Unless one is sure that operations (v) to (vii) above yield an equivalent equation, it is
important to check whether the solutions obtained for the transformed equation are in
indeed solutions to the original equation. The only complication is that if operation (vi)
is performed, the set of solutions to the transformed equation may not include all the
solutions to the original equation. Subsequent examples illustrate the use of operations
(i) to (vii).
An equation in which all variables appear only to the power ‘1’ (one) is called a linear
equation. All other equations can be referred to as nonlinear equations. The first
example below illustrates the solution of a linear equation in one variable.
Example 1.8.1
Solve 21 3x 4x
21 3x 4x
21 7 x
7 x 21
21
x 3
7
Example 1.8.2
4 2 x
Solve
7 3 5
4 2 x
7 3 5
4 x 2
7 5 3
x 2 4 7 (2) 3 (4)
5 3 7 3 (7 )
x 2
5 21
10
x
21
Example 1.8.3
3 x x 2
Solve 1
2 3
For this example, we start by bringing all the left hand side of the equation to the same
common denominator, and then we multiply by this denominator to obtain an equivalent
linear equation.
23
3 x x 2
1
2 3
3 (3 x ) 2 ( x 2)
1
2 (3)
3 (3) 3x 2 x 2(2)
1
6
5x
1
6
5x 6
56x
6x 5
x 1
Example 1.8.4
Solve x 2 4
In this case it is simply a matter of taking the square root of both sides.
x2 4
x 4 2
There is nothing in this example that restrains us to only consider the positive square
root.
Example 1.8.5
Solve x1 3 2
x1 3 2
( x1 3 )3 (2)3
x 8
Example 1.8.6
2
Solve 10
x2
2
10
x2
2 10 ( x 2)
2 10x 20
22 10x
10x 22
x 2.2
24
We easily verify that this solution is indeed a solution of the original equation.
Example 1.8.7
2x 4
Find a solution, if any, to the equation x 2
x 2
2x 4
x 2
x 2
2x 4 ( x 2 ) ( x 2 )
2x 4 x ( x 2 ) 2 ( x 2 )
2x 4 x x 2 2 x 2
2x 4 x 2
x 4 2
x2
It can be seen that the original equation is not defined for x 2 , since this would imply
dividing by 0 on the left hand side of the equation. Therefore there is no solution to the
initial equation.
(b) Cross-Multiplication
w y
(x 0, z 0) (9)
x z
wz xy
2 5
3 x
Cross-multiplying:
2x 3 (5)
25
Then, dividing by 2
15
x 7 .5
2
ax 2 bx c 0 (10)
In (10), ‘a’, ‘b’ and ‘c’ are known constants, whilst ‘x’ is a variable. There are
essentially two ways of solving a quadratic equation written as in (10). The first way
involves writing the left hand side as the product of two components or factors, each
involving the variable ‘x’; this process is called factorization. If this can be achieved,
then the solution(s) to the quadratic equation will be given by the value(s) of ‘x’ for
which either or both of the factors are equal to zero, since if any element of a product
equals zero the product will equal zero. The next example illustrates the factoring
method of solution.
Example 1.8.8
Solve the equation x 2 2x 3 0 .
(x 3)( x 1) 0
( x 3) 0
x 3
and/or
( x 1) 0
x 1
Fig. 1.7
A C
B D
The aim is to find ‘A’, ‘B’, ‘C’ and ‘D’ such that
AB ax 2 (11)
CD c (12)
AD BC bx (13)
Provided ‘A’, ‘B’, ‘C’ and ‘D’ satisfy (11) to (13), we will have that
ax 2 bx c (A C)(B D)
Normally when using such a cross diagram, one first finds a combination of ‘A’ and ‘B’
that satisfies (11), then an attempt is made to find ‘C’ and ‘D’ that satisfy (12) and (13).
Example 1.8.9
Draw the appropriate cross diagram for factorizing x 2 2x 3 .
We have
x 3
x 1
That is, x 2 2x 3 (x 3)(x 1) , as in Example 1.8.8. Note that for this example, the
following cross diagram is not suitable, since it does not satisfy requirement (13) (i.e.
x 3x 2x 2x )
x 3
x 1
Example 1.8.10
Draw a suitable cross diagram for factoring 6 x 2 5x 4 . Subsequently solve the
equation 6x 2 5x 4 0 .
27
We have
3x 4
2x 1
3x 4 0
3x 4
4
x
3
and/or
2x 1 0
2x 1
1
x
2
b b 2 4ac
x (14)
2a
Theorem 1.3 makes it clear that there are three possibilities when attempting to solve a
quadratic equation:
If b2 4ac 0 , there are two distinct solutions
If b2 4ac 0 , there is only one solution (or a repeated root)
If b2 4ac 0 , (14) will involve the square root of a negative number, and
therefore there is no (real) solution
Example 1.8.11
Find the solutions, if any, to the following quadratic equations using formula (14):
(i) x 2 2x 3 0
a 1, b 2, c 3
(ii) 2x 2 x 5 0
a 2, b 1, c 5
(iii) x 2 2x 1
x 2 2x 1 0
a 1, b 2, c 1
(iv) x 2 2x 2 0
a 1, b 2, c 2
Techniques for solving quadratic equations may also be applied to an equation that is
transformable into a quadratic equation by the appropriate substitution of a variable.
The next example illustrates this.
Example 1.8.12
Solve 2q q1 2 1 0 .
2p 2 p 1 0
Then
Therefore p 1 , and/or
2p 1 0
1
p
2
It is easy to verify that the above values for ‘q’ satisfy the initial equation.
(i) 4x 1 2 x
2 0 .5 x x 4
(ii)
3 2
x
(iii) 1 2x
5
(iv) 10x 2 3x
(v) x2 5
8 5
(vi) 0
x 2 (2 x )
(vii) x 2 3x 10 0
(viii) 4x 2 6x 3 0
(x) 2x 2 3 14x1 3 16 0
30
ax 2 bx c 0
Our aim is to solve this equation for ‘x’. Since a 0 (otherwise we would not have a
quadratic equation), we can divide by ‘a’ to give
b c
x2 x 0
a a
b c
x2 x (15)
a a
The next step is to ‘complete the square’ on the left hand side of the equation, so as to
obtain a squared expression in which ‘x’ appears only to the power 1. To achieve this,
half the constant (coefficient) multiplying ‘x’ in equation (15), i.e. b (2a ) , is squared
and added to both sides of the equation, yielding
c ab 2 4a 2 c
2 2
b b b c b
2
x2 x 2
a 2a 2a a 4a a 4a 3
This simplifies to
b 2 4ac
2
b
x
2a 4a 2
Taking the square root of both sides of this equation and solving for ‘x’ gives the
required result:
b b 2 4ac b 2 4ac
x
2a 4a 2 2a
b b 2 4ac
x
2a
31
Sometimes the identification of input and output variables is quite natural, as in the case
of quantity sold and sales revenue, where it is normal to think of the former as the input
variable that gives rise to a value of sales revenue. In other cases the distinction may be
arbitrary, such as in the relationship between the quantity sold of a good and unit price
charged for the good by a firm. Other commonly used names for the input variable are
independent or exogenous variable, whilst the output variable is also called the
dependent or endogenous variable.
In many instances, the values taken by input and output variables come from the set of
real numbers, or particular subsets of the set of real numbers. However in mathematics
and statistics it is useful to think more generally in terms of the relationship between
elements from two sets. We have the following basic definition.
Definition (Function)
A function f from one set A to another set B is a rule that associates to each element ‘x’
in A a unique element ‘y’ in B, called the image of ‘x’. Regularly such a rule is
denoted y f (x) (expressed verbally as ‘f-of-x’).9
f: A B
Hence if a function f associates a real number with real number, we can write f: R R.
A simple example is the function f that associates to each real input number an output
number that is three times the input number. This function can be represented by
f (x) 3x
Thus when ‘x’ takes the value 2, the output variable takes the value f (2) 3(2) 6 .
A function from any set A to the set of real numbers R is called a real valued function.
9
The almost universal custom is to use the notation f ( x ) to denote both the function (rule) and the
particular element ‘y’ associated with a particular element ‘x’. Which interpretation is intended is
normally obvious from the context.
32
A key aspect of the definition of a function f: A B is that each element ‘x’ in A has a
unique image in B. Any rule that associates to at least one element in set A more than
one element in set B is not considered a function from A to B by the above definition.
In such cases we can simply say there is a relation between the elements in A and the
elements in B.
Thus the domain of a function f from set A to set B is the set of all elements for which
the function is defined, whilst the range of the function need not include all the elements
in B (although it may).
For some functions the domain may be all real numbers, however the domain may be
restricted either because of purely mathematical reasons or the nature of the variables
involved. Common mathematical reasons why the domain of a function may be
restricted include the need to exclude division by zero, for example for the function
f ( x ) 1 x , and to avoid square roots of negative numbers. In the context of applied
problems, the variables involved may, for example, not take on negative values for
physical reasons. For example, quantity produced cannot be negative.
Example 1.10.1
State the domain of the following functions:
(i) f ( x ) 5 4x
3
(ii) f (x) 2
x
(iii) f (x) 2 x
Domain: x : x 2
1
(iv) f (x)
( x 1)( x 3)
Domain: x : x 1, x 3
33
Where ‘n’ is a non-negative integer, and a n , a n 1 ,……, a 0 are constants that can be any
real numbers.
{a i : i 0,1,..., n}
In (16), the constants a i (i 1,..., n) , i.e. excluding a 0 , are commonly called the
coefficients of the terms in ‘x’.
The highest power to which ‘x’ is raised in the polynomial function is called the degree
of the polynomial function, which is ‘n’ in (16) if a n does not equal 0. The domain of a
polynomial function is all real numbers. A constant function, such as f (x) 8 , can
be thought of as a polynomial function of degree 0, since by definition x0 1 . The
value of a constant function does not change, no matter what the value of the input
variable.
10
Apart from constant functions, all functions that are not linear functions are generally described as non-
linear functions.
11
Mathematicians sometimes call the function f ( x ) a bx an affine function, reserving the label linear
function (or linear transformation) for functions of the form f ( x ) bx . This distinction is important in
some advanced areas of linear algebra, but we will follow the more common usage of calling
f ( x ) a bx a linear function.
34
x 2 4x 2
f (x) ( x 1)
x 1
It should come as no surprise that functions can be combined to form new functions
using the arithmetic operations of addition, subtraction, multiplication and division.
Thus for the functions f ( x ) 2 x 1 2 and g (x) x 4 , we can use the arithmetic
operations to form the following functions:
Composition of functions involves using the output number from one function as the
argument (input number) of another function. Formally we have the following
definition.
( f g )(x) f ( g (x))
The domain of the composite function ( f g )( x ) is all ‘x’ such that ‘x’ is in the domain
of g ( x ) and g ( x ) is in the domain of f (x) . As an example, the composition of the
functions f ( x ) 2 x 1 2 and g (x) x 4 is given by
x , if x 0
f ( x ) x1 2 , if 0 x 4
2 x , if x 4
By the above definition, if y f (x) is such that a different value of ‘x’ yields a
different value of ‘y’, i.e. f (x) is a one-to-one ‘mapping’, then an inverse function
denoted x f 1 ( y) will exist. In this case a given value of ‘y’ will yield a unique
value of ‘x’. Not all functions possess an inverse. If for a given function y f (x) an
inverse x f 1 ( y) exists, it can normally be found by isolating ‘x’ by manipulation of
the equation y f (x) .
Example 1.10.2
Find the inverse of y f (x) 5x 25
5x y 25
y
x f 1 ( y) 5
5
Any function that either increases or decreases over its entire domain (monotonic
functions), as in the above example, will have an inverse. A simple example of inverse
functions is with respect to demand functions in basic microeconomics. The direct
market demand function for a particular good or service expresses the quantity
demanded as a function of price, whilst the inverse demand function expresses price as a
function of quantity demanded.
36
Fig. 1.8
y
3
2 (1,2)
(0,1) 1
-3 -2 -1 1 2 3 x
-1
(-1,-2) -2
-3
The set of all points in the rectangular coordinate system is usually denoted R2 (‘2-
space’).
Graphical representations of functions are used widely in this text. The easiest
functions to graph are constant and linear functions. The graph of the constant function
f ( x ) a is simply a horizontal line that goes through the point (0, a) on the vertical
axis, as illustrated below for the function y f (x) 2 .12 Obviously the line
representing the function extends indefinitely to the left and right.
12
The variable ‘y’ is appended to indicate that the value of the function is to be measured along the
vertical (‘y’) axis.
37
Fig. 1.9
y
3
y f (x) 2
-3 -2 -1 1 2 3 x
-1
-2
-3
Example 1.11.1
Graph the following linear functions:
(i) y f (x) 2 x
When x 0 , y 2
When y 0 , x 2
y
3 y f (x) 2 x
-3 -2 -1 1 2 3 x
-1
-2
-3
38
(ii) y g (x) 2 2x
When x 0 , y 2
When y 0 , x 1
y
y f ( x ) 2 2x 3
-3 -2 -1 1 2 3 x
-1
-2
-3
Example 1.11.2
A step function is a compound function for which the domain is divided into two or
more intervals, over each of which the function takes on a particular constant value.
Graph the following step function:
1, if x 1
y g ( x ) 2, if 1 x 3
3 if x 3
-2 -1 1 2 3 4 5 x
-1
39
Example 1.11.3
Graph the following compound function.
x , if x 0
12
y f ( x ) x 1, if 0 x 4
x 2 if x 4
-3 -2 -1 1 2 3 4 5
x
-1
The shape of the middle section of the graph of this function can be determined by
plotting several points on the curve.
Example 1.11.4
Consider the quadratic function y f (x) ax 2 bx c , where ‘a’, ‘b’ and ‘c’ are
constants. If we set f (x) 0 , we obtain the quadratic equation
y f (x) ax 2 bx c 0
In Section 1.8(c) it was stated that the solutions (roots) of this equation, if any, are given
by
b b 2 4ac
x
2a
From the above result it was concluded that the equation would have two distinct
solutions, a unique solution or no solution. Draw example graphs of
y f (x) ax bx c to illustrate these three possibilities.
2
0 root root x
If there is only one (repeated) root, the graph of f (x) must simply touch the horizontal
axis one point, as in the following diagram.
0 x
root
Finally if there is no solution to f (x) 0 , f (x) does not touch the horizontal axis at
any point, as illustrated in the next diagram.
0 x
(No root)
In the above diagrams it is assumed that ‘a’ is greater than 0. If ‘a’ were less than 0, the
graphs of f (x) would be flipped vertically (turned upside down) around the ‘x’ axis.
(i) y f (x) 2 x
(ii) y f (x) 3x 1
(iii) y f (x) 0.5x 2
41
Exponential functions differ from polynomial functions in that the variable appears as
an exponent. In accordance with the previous discussion of powers of a number in
Section 1.7, we have by definition that f (0) b0 1. Exponential functions are also
subject to the rules concerning exponents given in Theorem 1.2 of Section 1.7.
Fig. 1.10
y bx (b 1)
13
If ‘b’ were allowed to be negative, it would be necessary to restrict the domain of the function to
exclude the possibility of, for example, the function representing the square root of a negative number, i.e.
if ‘x’ took the value of 0.5.
42
Fig. 1.11
1
y bx (0 b 1)
A base that is encountered regularly is the irrational base e 2.7183 , and the function
f ( x ) e x is usually referred to as the natural exponential function. The natural base
number ‘e’ occurs in the context of calculus and plays a key role in the description of
continuous growth. To understand the definition of the natural base number ‘e’, let us
start by considering the function y f ( x ) (1 x )1 x , which is depicted below. This
function is not defined for x 0 , as the exponent 1 x is not defined at this point.14
Fig. 1.12
2 y (1 x)1 x
The following table gives some indication of how the value of (1 x )1 x changes as ‘x’
tends (gets closer and closer) to 0 from both the left and the right.
14
This function will also not be defined for various values of ‘x’ that at are less than or equal to 1 .
However, here we are only interested in what happens near x 0 .
43
The numbers in the second and fourth columns of the above table are rounded to four
decimal places: most of these numbers cannot be written exactly to a finite number of
decimal places. What should be clear is that the closer ‘x’ gets to 0 from both the left
and right, the closer together will be the values of the function f ( x ) (1 x )1 x for
negative and positive ‘x’. In fact as ‘x’ tends to 0, the value of this function tends to a
particular limit which is our definition of the number ‘e’. Formally we have the
following definition.
e lim (1 x)1 x
x 0
The notion of a limit of a function will be covered in more detail in Chapter 2. In the
meantime, the above definition of ‘e’ simply states it is the limit to which the function
f ( x ) (1 x )1 x tends as ‘x’ tends to 0.
The above definition indicates that the exponential function x f 1 (y) b y is the
inverse of the logarithmic function y f (x) Log b x , and vice versa. In words, it is
commonly stated that the logarithm to the base ‘b’ of a number ‘x’ is the power to
which ‘b’ must be raised to give ‘x’. This power is denoted Log b x . For example, since
10 3 1,000 , we can say that the logarithm to the base 10 of 1,000 is 3, or
Log 10 1,000 3 .
When defining a logarithmic function, the possibility that ‘b’ equals 1 is excluded, as
the number 1 raised to any positive power will always be 1. There is also a simple
explanation why the logarithm of a non-positive number is not defined. Given that the
44
base ‘b’ is restricted to being positive (and not equal to 1), Log b x will not be defined
for x 0 , since when b 0 , raising ‘b’ to any power, positive or negative, will always
give a positive number.
Logarithms to the base 10 are called common logarithms, whilst logarithms to the
irrational base e 2.7183 are called natural logarithms. The logarithm of a number ‘x’
to the natural base ‘e’, i.e. Log e x , is also commonly denoted ln x . If ‘x’ is a variable,
Log e x (or ln x ) is referred to as the natural logarithmic function. Typical graphs of
y Log b x for b 1 and 0 b 1 are shown below.
Fig. 1.13
y Logb x (b 1)
1 x
Fig. 1.14
y Logb x (0 b 1)
1 x
Some basic properties of logarithmic functions are listed in Theorem 1.4 below.
45
Example 1.12.1
2
xy
Express ln in terms of ln x , ln y and ln z .
z
2
xy xy
ln 2 ln 2 (ln xy ln z) 2 (ln x ln y ln z)
z z
Example 1.12.2
Write the following equation in an equivalent form free of logarithms.
ln a ln b 0.2y
ln (a b) 0.2 y
e ln ( a b ) e 0.2 y
a b e 0.2 y
3 2
Log b 4 Log b 8 Log b 2 Log b x
2 3
46
Suppose we have ‘n’ numbers. By labelling the numbers as, say, a1 , a 2 ,..., a n , we can
represent the numbers by
ai , i 1,..., n
a
i 1
i a1 a 2 .... a n
n
The notation a
i 1
i is thus a shorthand way of representing the sum. In words it means
the sum of all the a i from ( i 1 ) to ( i n ). The symbol is the upper case version of
the Greek letter ‘sigma’. Obviously the labels we give to the numbers (‘a’ above) and
the index (‘i’ above) are arbitrary.
Example 1.13.1
Suppose we wish to represent the sum of the numbers 3, -4, 1, 1, 7 and 2 using
summation notation. Let us label the six numbers arbitrarily as
b1 3 , b2 4 , b3 1 , b4 1 , b5 7 , b6 2
Then
6
3 (4) 1 1 7 2 bi 10
i 1
n n
(i) ca
i 1
i c a i
i 1
n
n
(ii)
i 1
( a i c ) a i nc
i 1
n
n 2 n
(iii)
i 1
( a i c) 2
a
i 1
i 2c
i 1
a i nc 2
n
n n
(iv) (a
i 1
i c) 2 a i2 2c a i nc 2
i 1 i 1
Proof:
To prove property (i) we note that
n n
ca
i 1
i ca1 ca 2 ... ca n c (a1 a 2 ... a n ) c a i
i 1
n
n
i 1
( a i c ) ( a 1 c) ( a 2 c ) ... ( a n c) ( a 1 ... a n ) ( c ... c ) a i nc
i 1
n n
(a i c)2 (a i2 2ca i c2 )
i 1 i 1
Example 1.13.2
Again consider the following six labelled numbers.
b1 3 , b2 4 , b3 1 , b4 1 , b5 7 , b6 2
6
Use property (iii) in Theorem 1.4 to calculate (b
i 1
i 2) 2 .
6
6 2 6
i 1
( b i 2 ) 2
b
i 1
i 2 (2)
i 1
bi 6(2)2
[3 (4) 1 12 7 2 22 ] 4(3 4 1 1 7 2) 24
2 2 2
80 40 24
64
(b
i 1
i 2) 2 (3 2) 2 (4 2) 2 (1 2) 2 (1 2) 2 (7 2) 2 (2 2) 2
64
a1 , a 2 ,..., a n
And
b1 , b 2 ,..., b m
Now suppose that we wish to represent the sum of all the possible products of one ‘a’
element with one ‘b’ element. This can be represented by
n m n
a i b j (a i b1 a i b2 .... a i bm )
i 1 j 1 i 1
As the order in which the summations are performed is immaterial, we could also write
49
n m m
a i b j (a1b j a 2b j .... a n b j )
i 1 j 1 j 1
Example 1.13.3
Consider the two sets of numbers given by
a1 3 , a 2 4 , a3 1
And
b1 1 , b 2 2
3 2
Calculate a b
i 1 j 1
i j .
In this case
3 2 3 3
a b
i 1 j 1
i j (a i b1 a i b 2 ) [a i (1) a i (2)]
i 1 i 1
a1 4 , a 2 2 , a3 5 , a 4 10
Calculate:
4
(i) a i 1
i
4
(ii) a i 1
2
i
4
(ii) (ai 1
i 3) 2