Professional Documents
Culture Documents
Exam Mathematics
2023
Version 0.7
1
Contents
1 Sets 3
1.1 Important sets of numbers . . . . . . . . . . . . . . . . . . . . 3
1.1.1 Real numbers . . . . . . . . . . . . . . . . . . . . . . . 5
1.1.2 Complex numbers . . . . . . . . . . . . . . . . . . . . . 6
1.2 A more abstract notion of sets and set operations . . . . . . . 12
1.2.1 Definition of sets . . . . . . . . . . . . . . . . . . . . . 13
1.2.2 Subsets and supersets . . . . . . . . . . . . . . . . . . . 15
1.2.3 Set operations . . . . . . . . . . . . . . . . . . . . . . . 16
3 Elementary combinatorics 30
3.1 Permutations . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
3.1.1 Permutation without repetition . . . . . . . . . . . . . 32
3.1.2 Permutation with repetition . . . . . . . . . . . . . . . 33
3.2 Variation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36
3.2.1 Variation without repetition . . . . . . . . . . . . . . . 36
3.2.2 Variation with repetition . . . . . . . . . . . . . . . . . 41
3.3 Combination . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42
3.3.1 Combination without repetition . . . . . . . . . . . . . 43
3.3.2 Combination with repetition . . . . . . . . . . . . . . . 45
3.4 Exercises about combinatorics . . . . . . . . . . . . . . . . . . 47
2
4.1.1 Linear equations in one variable . . . . . . . . . . . . . 49
4.1.2 Quadratic equations in one variable . . . . . . . . . . . 51
4.1.3 Radical equations in one variable . . . . . . . . . . . . 62
4.1.4 Rational equations in one variable . . . . . . . . . . . . 64
4.1.5 Exponential and logarithmic equations in one variable . 66
4.2 Inequalities in one variable . . . . . . . . . . . . . . . . . . . . 70
4.2.1 Linear inequalities in one variable . . . . . . . . . . . . 72
4.2.2 Quadratic inequalities in one variable . . . . . . . . . . 74
4.2.3 Rational inequalities in one variable . . . . . . . . . . . 78
4.3 Systems of linear equations . . . . . . . . . . . . . . . . . . . . 80
4.3.1 Substitution method . . . . . . . . . . . . . . . . . . . 81
4.3.1.1 2-D . . . . . . . . . . . . . . . . . . . . . . . 81
4.3.1.2 3-D . . . . . . . . . . . . . . . . . . . . . . . 86
4.3.1.3 n-D . . . . . . . . . . . . . . . . . . . . . . . 89
4.3.2 Elimination method . . . . . . . . . . . . . . . . . . . . 90
4.3.2.1 2-D . . . . . . . . . . . . . . . . . . . . . . . 91
4.3.2.2 3-D . . . . . . . . . . . . . . . . . . . . . . . 93
4.3.2.3 n-D . . . . . . . . . . . . . . . . . . . . . . . 97
4.3.3 Under- and overdetermined systems . . . . . . . . . . . 102
1 Sets
We first introduce well-known but important sets of numbers. We will omit
for this section mentioning the well-known basic arithmetical rules, such as
“a negative times a positive is a negative” etc.
Then, we will consider sets in a more abstract sense and set operations.
N := {0, 1, 2, 3, 4, . . . }.
3
For the set of natural numbers, the operations “Addition”, “Subtraction”,
“Multiplication” and “Division” are defined, but we have to be careful with
subtraction and division. It may happen that for natural numbers n and
m, their difference n − m is not a natural number; this is precisely the case,
if m > n. Furthermore, if n is not a multiple of m, then division is only
possible with remainder.
The integers solve parts of this problem. The set of integers is defined as
Z := {. . . , −3, −2, −1, 0, 1, 2, 3, . . . } = N ∪ {0} ∪ {−n | n ∈ N}.
Now, in contrast to the natural numbers, not only addition and multiplication
are always defined, but also subtraction. Furthermore, we notice, N is a
subset of Z,
N ⊆ Z.
In order to be able to perform the four basic arithmetic operations (al-
most) without any restrictions, we consider the set of rational numbers. The
rational numbers are defined to be the set of fractions, where numerator and
denominator are integer numbers, but the denominator is not allowed to be
equal to 0.
a
Q := { | a ∈ Z and b ∈ Z, b ̸= 0}.
b
As for any a ∈ Z we have a = a1 , it holds that Z is a subset of Q, the set
of integers is a subset of the set of fractions, hence
N ⊆ Z ⊆ Q.
The decimal representation of a rational number is either finite, e.g. 27 =
3.5, −18
8
= −2.25, or infinite but with eventually periodic digits, e.g. 31 =
0.33333 . . . = 0.3̇, −10
7
= −1.428571428571 . . . = −1.428571.
One may think that the rational numbers contain already sufficiently
many elements to represent every number on the number line, i.e., this would
mean that every number that can be expressed via its decimal representation
ultimately “ends” in a sequence of periodic digits. For example, we can
describe the number π = 3.14159265 . . . as precisely as we want considering
the sequence of fractions
3 31 314 3141 31415
, , , , ,...,
1 10 100 1000 10000
√
however, it can be shown that there are numbers, √ such as π, e, or even 2,
which are not contained in Q. The result that 2 ∈ / Q is usually one of the
prime examples for proofs by contradiction in mathematical textbooks.
4
1.1.1 Real numbers
Now, for many applications, it is not enough to consider only those numbers,
which can be represented by fractions. E.g. consider a right triangle with
sides a = 1 and b = 1, then by the Pythagorean theorem, the hypotenuse
√ c
can be determined by solving c2 = a2 + b2 = 12 + b2 . Thus, c = 2.
The set of irrational numbers is the collection of √
the numbers whose deci-
mal representation is neither finite nor periodic, e.g. 2 = 1.41421 . . . , −π =
−3.14159 . . . .
The set we typically work with in many applications, the set of real num-
bers, is defined to be the union of the rational numbers and the irrational
numbers:
R := Q ∪ {r | r is an irrational number}.
Hence, every point on the number line represents a real number.
18 10 1 √ 7
−π - 8 - 7 3 2 2
−5 −4 −3 −2 −1 0 1 2 3 4 5
As Q ⊆ R, we have
N ⊆ Z ⊆ Q ⊆ R.
R Q Z N
Within the set of real numbers, we can, as with the rational numbers,
perform all the arithmetic operations - of course, we are again not allowed to
divide by 0. Although the real numbers contain sufficiently many numbers
already for us to solve most problems, which stem from real life applications,
we will see that there are still some “simple” problems, which do not have a
solution within the real numbers.
5
1.1.2 Complex numbers
Consider the quadratic polynomial f (x) = x2 − 1. There are many problems,
which can essentially be solved by finding so-called roots or zeros of a poly-
nomial p, i.e., finding those values x0 in the domain, for which p(x0 ) = 0. For
the example we are considering, solving f (x0 ) = 0 means solving x20 − 1 = 0.
It can be easily checked that the real numbers x1 = 1 and x2 = −1 solve this
problem.
If we consider now the very similar polynomial g(x) = x2 + 1, we find out
that there are no real numbers, which solve this problem. This motivates the
definition of the so-called complex numbers. We will first define the set and
the corresponding arithmetic operations, and then consider the motivating
example again. A complex number z can be expressed using two real numbers
x and y and the so-called imaginary unit, which “separates” those numbers:
z = x + iy.
If we would not use the imaginary unit i, then the expression x+y would just
produce a real number. After some further considerations, we will explore
some further properties of the imaginary unit i.
Writing a complex number z ∈ C in the form z = x + iy is called the
canonical representation or Cartesian form. Hereby the real number x is
called the real part, and y is called the imaginary part of the corresponding
complex number. Hence, we define the set of complex numbers as
C = { x + iy : x, y ∈ R}.
6
Example 1.1. Given z1 = 5 + i3 and z2 = 1 + i2, we get
z1 + z2 = (x1 + x2 ) + i(y1 + y2 ).
This means, if we add two complex numbers, for the real part of the
sum, we simply add the real parts of the summands, and for the imag-
inary part of the sum, we simply add the imaginary parts of the sum-
mands.
Example 1.2. Given z1 = 5 + i3 and z2 = 1 + i2, we get
z1 + z2 = (5 + 1) + i(3 + 2) = 6 + i5.
z1 + z2 = (1 + 2) + i(20 + 8) = 3 + i28.
z1 − z2 = (x1 − x2 ) + i(y1 − y2 ).
7
Example 1.3. Given z1 = 5 + i3 and z2 = 1 + i2, we get
z1 − z2 = (5 − 1) + i(3 − 2) = 4 + i(−1).
z1 − z2 = (1 − 2) + i(20 − 8) = −1 + i12.
z1 · z2 = (x1 x2 − y1 y2 ) + i(x1 y2 + x2 y1 ).
Important, this means we do not simply multiply real parts and imag-
inary parts, but compute a more complicated expression. The reason
will be explained later.
z1 · z2 = (5 · 1 − 3 · 2) + i(5 · 2 + 3 · 1) = −1 + i13.
8
4. Division: Let us define this arithmetic operation in two steps, first let
us define what it means to divide a complex number by a real number,
then let us define what it means to divide a complex number by a
complex number. As before, division by 0 is not defined, hence not
allowed.
(b) For the quotient of two complex numbers, the complex conjugate
will be helpful, as. we will expand the corresponding fraction with
the complex conjugate. Let z1 , z2 ∈ C and z2 ̸= 0. Then,
z1 z1 z2 z1 z2
= · = .
z2 z2 z2 z2 z2
As we just found out, when considering multiplication, z2 z2 will
result in a real number, hence we can perform the division.
Example 1.7. Given z1 = 5 + i3 and z2 = 1 + i2, we get
z1 5 + i3 5 + i3 (5 + i3)(1 − i2) 11 − i7 11 7
= = · = = −i .
z2 1 + i2 1 + i2 (1 + i2)(1 − i2) 5 5 5
9
complex numbers. We saw, that in fact most operations, except for multipli-
cation, “carry over” from the real numbers. Recall that for z1 , z2 ∈ C, their
product is defined as
x1 x2 + i(x1 y2 + x2 y1 ) + i2 y1 y2 .
i2 = −1.
This is often presented as the defining property of the imaginary unit, which
then implies that multiplication of complex numbers produces the result
we stated above. But we saw that defining multiplication the way we did,
ultimately lead to this property as well.
How does this help with our motivating problem of finding solutions of
the equation x2 + 1 = 0? If instead of searching for a real number solution,
we could now try to find a complex number, which solves this problem. For
notational convenience, let us restate the problem as finding z ∈ C, so that
z 2 + 1 = 0. Using the canonical representation, and subtracting 1 from both
sides of the equation, we get
10
We multiply out and write −1 = −1 + i0,
We know, in order to be equal, complex numbers must have the same real
part and the same imaginary part. This leads to the following equation,
which we must solve for x, y ∈ R(!).
2
x − y 2 = −1,
2xy = 0.
z12 + 1 = i2 + 1 = −1 + 1 = 0,
z22 + 1 = (−i)2 + 1 = i2 (−1)2 + 1 = i2 1 + 1 = −1 + 1 = 0.
11
if 9 − 13 ≥ 0. But 9 − 13 = −4 < 0. Hence, this problem does not have a
solution.
Now, we solve z 2 − 6z + 13 for z ∈ C. Again, we consider
6 √
z1,2 = + −4,
2
√
but now, we get the following: solving −4 is equivalent to solving v 2 = −4
for v ∈ C, which yields v1 = 2i and v2 = −2i. Hence, the set of solutions is
given by
{3 + 2i, 3 − 2i} ⊂ C.
N ⊆ Z ⊆ Q ⊆ R ⊆ C.
12
Sets contain objects, for a set M and an object x, the notation
x∈M
• The set of federal provinces in Austria: {V, T, Sbg, St, K, B, W, Nö, Oö}.
• The set of natural numbers between 5 and 100: {5, 6, 7, . . . , 99, 100}.
13
• The set of integers: Z = {. . . , −3, −2, −1, 0, 1, 2, 3, . . . }.
The most powerful notion to define sets is defining them by the properties
of the elements, which are contained.
This way of defining sets follows a certain pattern, consisting of two main
parts, let us consider {x ∈ R : −3 ≤ x ≤ 2}:
1. We have opening parenthesis “{”, then we start with the first part.
4. We write a colon “:” to separate the first part and the second part.
14
• |{a, e, i, o, u}| = 5.
• |{5, 6, 7, 8, 9, 10}| = 6.
• |∅| = 0.
• (−∞, c] = {x ∈ R : x ≤ c},
15
• [c, ∞) = {x ∈ R : c ≤ x}.
We notice, using the set builder notation via the properties of the elements
(see p. 14), we always get subsets of the corresponding superset.
As a set is defined by the elements, which are contained, two sets are
equal, if they contain precisely the same elements. Hence, if A and B are
sets, then A = B iff A ⊆ B and B ⊆ A.
Example 1.19. Given A = {1, 2} and B = {2, 1} then A ⊆ B and B ⊆ A,
as every element from A is also contained in B and vice versa. Hence,
A = B.
A B
L1 = x, y ∈ R2 : x − y = 2 ,
and we can define the set of 2D points, which fulfil the first equation,
L2 = x, y ∈ R2 : x + y = 0 .
16
Now, the intersection L1 ∩ L2 corresponds to all those 2D points, which
fulfil e1 and e2 , i.e. the point(s) of intersection of both lines.
2. Union: If A and B are sets, then A ∪ B is the set that contains all the
elements, which are contained in A or in B (or both).
A B
A ∪ B = {1, 2, 3, 4, 5, 6}.
3. Set Difference: We will see that this operation allows us to write the
result of the previous example in another way. If A and B are sets,
then A \ B is the set, which contains all the elements of A that are not
contained in B.
A B
17
Example 1.22. We first list a simple example. If A = {1, 2, 3, 4} and
B = {5, 6, 1, 2}, then
A \ B = {3, 4}.
Considering again the very simple inequality x2 > 4, we already know
the set of solutions is given by (−∞, 2) ∪ (2, ∞) this can be expressed
as
R \ [−2, 2].
4. Set complement: Let A be any set, and let Ω be the universal set. The
complement of A, denoted by AC or Ω \ A, is defined as the set of all
elements (in Ω), which are not an element of A.
A AC
AC = {0, 5, 6, 7, 8, 9, 10}.
5. Power Set: If A is a set, then P(A) is the set that contains all subsets
of A.
18
Set Corresponding event
∅ “The coin does not land on Heads, but also not on Tails.”
{H} “The coin lands on Heads.”
{T } “The coin lands on Tails.”
{H, T } “The coin lands on Heads or Tails.”
The word “tuple” is commonly used for 2-tuples, 3-tuples are often
called triples. Two tuples are only equal, if they have the same length,
and if they coincide in each position.
x
Example 1.26. A 2D vector can be represented as (x, y) or ,
y
where x is the x-coordinate and y is the y-coordinate. The tuple (vector)
(1, 2) describes a different point than the
tuple
(vector) (2, 1). A 3D
x
vector can be represented as (x, y, z) or y , where the additional z-
z
coordinate is represented by z. The tuple (vector) (0, 1, 2) describes a
different point than the tuple (vector) (0, 2, 1).
From any two non-empty sets, we can generate the so-called Cartesian
product, which is a set of tuples of elements of this set.
19
A × B := {(a, b) : a ∈ A, b ∈ B}.
| ×A×
Remark 1.28. For A {z· · · × A}, i.e., if we compute a Cartesian
d−times
product of a set with itself, we use shorthand Ad .
If A and B are finite, i.e., |A| = k and |B| = ℓ for some k, ℓ ∈ N, then
also A × B is finite and
|A × B| = k · ℓ.
20
2.1 Unit conversion
2.1.1 Prefixes
In many applications, prefixes allows us to easily express and communicate
measurements that are much larger or smaller than the base unit in a uni-
form way. E.g. in the case of distances, it may be easier to understand that
“Linz and Vienna are around 184 km (kilometre) apart” than to state that
“Linz and Vienna are around 184000 m (metre) apart”, although technically
both statements share of course the same information. Similarly, one might
understand the statement “An adult fruit fly’s mass is around 100 µg (mi-
crogram)” better than the equivalent statement “An adult fruit fly weighs
around 0.0000001 kg (kilogram)”.
The most common prefixes are given in the following table.
tera- T 1012 deci- d 10−1
giga- G 109 centi- c 10−2
mega- M 106 milli- m 10−3
kilo- k 103 micro- µ 10−6
hecto- h 102 nano- n 10−9
deca- da 101 pico- p 10−12
21
3. Area: The standard unit for area is thus square metres, with symbol
[m2 ].
Example 2.3. As a metre consists of 10 dm, 1 square metre has 10 · 10
dm2 . As 1 decimetre has 10 cm 1 square metre has 100 · (10 · 10) =
10000 cm2 . Moreover, as 1 centimetre has 10 mm, 1 square metre has
10000 · (10 · 10) = 1000000 m2 .
Other commonly used units for area are ar [a], with 1 ar is equal to
100 m2 , and hectare [ha] (houndred-ar), with 1 ha is equal to 100 a or
10000 m2 .
4. Volume: The standard unit for volume, as the one before, stems from
the unit for length and is m3 . One interesting fact is that 1 litre [l],
which is often used to measure the volume of liquids, is equal to 1 dm3 .
You may want to verify as an exercise that 1 cm3 = 1 ml (millilitre).
5. Mass: The SI unit for mass is the kilogram [kg]. The modern definition
of the kilogram is based on the definition of second and metre, histor-
ically it was defined as the mass of 1 l of water. As the substances,
which are dissolved in water, and thus the mass of water varies a lot,
it may be obvious that the latter definition does not give a uniform or
universal definition of mass.
Example 2.4. It holds that 1 kg is 1000000000 µg.
6. Time: The standard unit for time is the second [s]. For smaller amounts
of time, the above prefixes are used, but for larger amounts of time,
usually different units, like minutes [min] or hours [h] etc. are used.
Example 2.5. If a computer is able to perform a floating point addi-
tion (“addition of two decimal numbers”) on average in 10 ns (nanosec-
onds), then it can perform 100000000 such operations per second.
One (1) minute [min] consists of 60 s, 60 min equal 1 hour [h], 24 h
are 1 day [d]. Thus, 1 d is equal to 24 · 60 · 60 = 86400 s.
Example 2.6. In the simple case of a unit circle, i.e., a circle with
radius 1, the angle that corresponds to a full rotation is equal to 2π rad.
For a half rotation, 180 rad are required.
m · 10n ,
with |m| ∈ [1, 10) and n ∈ N. In this form, m is called the mantissa and n
is called the exponent.
23
6. 602214.076 · 1018 is not in scientific notation, but 6.02214076 · 1023 is.
• The mass of the earth is around 5.9 · 1024 kg. The mass of the sun is
around 3.3 · 105 times the mass of the earth. As 3.3 · 5.9 = 19.47 =
1.947 · 101 , the mass of the sun is around 1.947 · 1030 kg.
Example 2.10. Assume that the outside temperature is 14 ◦C. How much
would that be in ◦F? You could use the formula
9
c + 32 = f, (1)
5
where c represents the temperature in ◦C and f represents the temperature in
◦
F.
Thus, as 95 · 14 + 32 = 57.2, an outside temperature of 14 ◦C corresponds
to 57.2 ◦F.
But what if you measured the outside temperature in ◦F and you want to
convert it to ◦C? You would need to modify formula (1) first.
24
In formula (1), we see that the variable f , which corresponds to temper-
ature in ◦ F, is isolated. In order to solve the above-mentioned problem, we
want to transform the equation in an equivalent equation, where the vari-
able c, which corresponds to temperature in ◦ C, is isolated. We will call this
“solving” equation (1) for the variable c.
If not specified otherwise, the domain for all the variables, which occur is
always the set of real numbers R.
We omit mentioning all the required steps for any equation in full gener-
ality and detail, instead we will present some examples.
Example 2.11. For solving equation (1) for c, in order to isolate the desired
variable by adding/subtracting, we get the following equivalent equation in a
first step (subtraction of 32):
9
· c = f − 32 (2)
5
Now, in order to further isolate the variable c, we perform multiplication/division.
Attention: Multiplication/Division is only allowed by terms that are not equal
5
to 0. By multiplication with ̸= 0, we arrive at the equivalent equation
9
5
c = (f − 32) , (3)
9
which can be further simplified to
5f − 160
c= . (4)
9
Now, we can easily answer questions like “What does a temperature of 5
◦
F correspond to in ◦ C?” .
5 · 5 − 160 135
=− = −15.
9 9
Hence, a temperature of 5 ◦ F corresponds to −15 ◦ C.
25
In this particular example, from a “mathematical” point of view, there
was no further restricion on the domain needed. From the viewpoint of a
physicist, absolute zero is at −273.15 ◦ C, which corresponds to
9
(−273.15) + 32 = −459.67 ◦ F.
5
Thus, the restriction c ∈ [−273.15, ∞) translates to f ∈ [−459.67, ∞).
Follwing the next example, we will see that sometimes, we have to be
more careful with restrictions:
Example 2.12. Newtons second law:
F = ma.
Again, we assume F, m, a ∈ R. If we want to solve this equation for m, we
would need to divide by a. But, as division by 0 is not possible, we would
need to restrict the domain of a to R \ {0}, i.e. a ̸= 0. The solution is then
F
m= .
a
We note: each time we divide by an expression, we have to restrict the
corresponding domains to not include 0. This is related to the fact that the
1
natural domain of the function f (x) = is R \ {0}.
x
Example
√ 2.14. Consider as a very simple example 22 = 4 and (−2)2 = 4,
but 4 = 2. Hence, if we want to solve x2 = 4, if we just use the square
root, we would loose the solution x = −2.
√ In order to avoid this, we always
first consider both solutions e.g. x = ± 4 = ±2.
27
A=4
Assuming the area of the square is 4 square units, what is the length of the
side a? Using the formula for area of a square, A = a2 , we solve a2 = 4,
which yields a = 2 or a = −2. In this context, we can safely say that a = 2
is the only solution.
28
we want to square both sides of an equation, we might introduce solutions,
which were not present in the beginning. Thus, as before, by considering
the appropriate context and by substituting back the intermediate solutions,
which were found, into the starting equation, we may decide to disregard
some of the intermediate solutions.
y = r sin(α).
29
3 Elementary combinatorics
Combinatorics is often referred to as the mathematics of counting and ar-
rangement. Hence, in this chapter, we will explore the basics of systemati-
cally counting and arranging objects. Being able to determine the number of
elements in certain sets is very important in discrete probability theory, but
of course, the applications of combinatorics are by no means just limited to
probability theory.
We already know that if A, B are finite sets and |A| = n and |B| = k
then |A × B| = n · k, thus, there are n · k possibilities to choose one element
from each set.
Example 3.1. If for a menu the options for a meal are A := {soup, rice, noodles},
and the options for a drink are B = {water, juice}, then we have 3 · 2 = 6
possibilities to choose a menu, which consists of a meal and a drink.
30
Example 3.6. For n = 2 and k ∈ {0, 1, 2}, we get
• 20 = 0!·(2−0)!
2!
= 22 = 1,
• 21 = 1!·(2−1)!
2!
= 21 = 2,
• 22 = 2!·(2−2)!
2!
= 22 = 1.
• 31 = 1!·(3−1)!
3!
= 62 = 3,
• 32 = 2!·(3−2)!
3!
= 62 = 3,
• 33 = 3!·(3−3)!
3!
= 66 = 1.
The binomial coefficients, nk “n choose k”, are called like this, because
they represent the positive integers, which occur as coefficients in the so-
called binomial theorem:
n
n
X n n−i i
(a + b) = a b.
i=0
i
Let us verify the above statement for the cases n = 2 and n = 3 presented
in the last example. We know
(a + b)2 = a2 + 2ab + b2 = 1 · a2 + 2 · ab + 1 · b2 ,
and
2 2 2
= 1, = 2, = 1.
0 1 2
Furthermore
and
3 3 3 3
= 1, = 3, = 3, = 1, .
0 1 2 3
We now examine further properties of the binomial coefficient:
31
• n n
0
= = 1 for all n ∈ N0 , this is easy to verify. Try it yourself.
n
• nk = n−k n
for all n, k ∈ N0 , this is called the symmetry of binomial
coefficients. Assume n, k ∈ N and k < n. Then
n n! n! n! n
= = = = .
k k! · (n − k)! (n − k)! · k! (n − k)! · (n-(n-k))! n−k
• n n−1 n−1
k
= k
+ for all n, k ∈ N \ {0}.
k−1
3.1 Permutations
3.1.1 Permutation without repetition
With permutations, we can count in how many ways the distinguishable
objects of a finite set can be ordered. Assume that we are looking at a finite
set A with n elements. We count, in how many different ways we can order
(sort) them. We will have to put one object in the first place, for this we can
choose any of the n objects in A. Assume that we have already chosen the
first object, we then have n − 1 objects left to put in order. For the second
place, we can thus choose n − 1 objects. In other words, for each of the n
choices for the first element, we still have n−1 choices for the second element,
in total we thus have n · (n − 1) choices for the first two objects. It is easy
to see that this pattern will continue, for the third element we can choose
(n − 2) objects (2 are already chosen), so we have a total of n · (n − 1) · (n − 2)
choices for the first three elements of the sequence. In total, we will have
n · (n − 1) · (n − 2) · · · · · 2 · 1 = n! choices, each corresponds to a different
sequence. Thus, the number of permutations of a set with n elements is n!.
Example 3.8. Assume, we want to sort 3 balls coloured in red r, green g
and blue b. Then for the first element, we can choose one of the three.
Options for choice 1 r g b
Now, assume that we have chosen the first element, in each of the cases,
there are still 3 − 1 = 2 elements left to choose for the second element. E.g.,
32
if we chose r for the first element, we can still choose g or b for the second
element.
Choice 1 r r g g b b
Options for choice 2 g b r b r g
Now, if we have chosen already two elements, then there is 3 − 2 = 1 option
left for the third (and last) element of the sequence. E.g., if we chose r and
then g, we can only choose b.
Choice 1 r r g g b b
Choice 2 g b r b r g
Options for choice 3 b g b r g r
We see that there are 6 = 3! different columns in the above table, thus there
are 6 permutations of a set of three elements.
Often, combinatorial questions are “hidden” in word problems.
Example 3.9. Consider the following set of Books: A := {Math, History, German}.
In how many different ways could we sort them into a bookshelf ?
Answer: There are 3! = 6 different ways to sort those 3 books into a bookshelf.
Written down explicitly, they are
• Maths, History, German,
33
assumption may be that we have again 4! = 24 possibilities. If we make the
beads indistinguishable, e.g. by labelling them with the numbers 1, 2, 3, 4,
we would in fact get 24 different permutations.
But, it turns out that some of the permutations, e.g.
2, 3, 1, 4
and
3, 2, 1, 4
are indistinguishable, if we remove the labels. After all, both start with a
green bead, then another green bead, and then red and blue beads.
For each of the unique ways of sorting the beads, there are 2! = 2 copies
4!
counted, if we just compute 4!. Hence, there should be only = 12 different
2!
ways to create such a necklace. The reason for the 2! copies is that no matter
how we permute the 2 green beads, we will get the same necklace, as soon
as we remove the labels.
This observation can be generalized: Assume that there are n objects
which can be sorted into i different categories, where we assume that all ob-
jects that fall into the same category are indistinguishable. For k = 1, 2, . . . , i,
with “nk ”, we will denote the number of elements in the k-th category. E.g.
n1 is the number of elements in the first category, n2 is the number of ele-
ments in the second category, etc. and ni is the number of elements in the
i-th (which is the last) category.
34
Example 3.11. Question: In how many ways can the letters of the word
“REFERENCE” be arranged? We also want to count arrangements that do
not give a word, which makes sense, e.g. we also count “RREEFEECN” as
a possible arrangement.
Answer: First we notice, the word “reference” has n = 9 letters in total.
There are i = 5 different letters, thus there are five different categories n1 ,
n2 , n3 , n4 and n5 .
Letter Category Number of elements in that category
R n1 2
E n2 4
F n3 1
N n4 1
C n5 1
We note n1 + n2 + n3 + n4 + n5 = 2 + 4 + 1 + 1 + 1 = 9 = n.
As n! = 9! = 362880, n1 ! = 2! = 2, n2 ! = 4! = 24, n3 ! = 1! = 1,
n4 ! = 1! = 1 and n5 ! = 1! = 1, we have that the total number of arrangements
is
362880
= 7560.
2 · 24 · 1 · 1 · 1
(This would already take some time to write down all possible arrangements
explicitly.)
Example 3.12. Question: In kindergarten, a child wants to decorate a rect-
angle with 11 · 10 pins. It uses 20 blue, red and green iron-on beads each;
18 purple and orange beads each; and 14 yellow beads. How many possible
arrangements are there?
We provide one example arrangement:
35
Answer: First, we notice, the rectangle has n = 11 · 10 = 110 pins. There
are i = 6 different colours, thus there are six different categories: there are
n1 = 20 blue beads, n2 = 20 red beads, n3 = 20 green beads, n4 = 18 purple
beads, n5 = 18 orange beads and n6 = 14 yellow beads.
Hence, the total number of different arrangements is
110!
.
20! · 20! · 20! · 18! · 18! · 14!
This number is equal to
308640846655952322940599636727733117453663086496971340636955780648256830396000000,
which is approximately 3.1 · 1080 . As some authors estimate that the number
of atoms in the observable universe ranges between 1 · 1078 and 1 · 1082 ,
it is impossible that you can write down all arrangements and count them
explicitly.
As the above example for an arrangement was chosen “randomly”, this
also implies that we can be almost certain that never before in history, some-
one presented the exact same arrangements of beads in any textbook.
3.2 Variation
Variations again describe how we can order/arrange objects, or create se-
quences of objects.The concept of variation is thus closely related to per-
mutation, in fact we will see that permutation without repetition is just a
special case of a variation without repetition. In many applications, if the
order in which elements are chosen, put or drawn etc. matters, it is highly
likely that variation is the key for solving the underlying problem.
36
balls we have drawn back in the urn, so we cannot re-draw them. The order
in which balls are drawn is important.
We can derive a formula for this number in a very similar way to what
was presented in section 3.1.1: Again, we compile a list of length k, choosing
some of the n objects. For the first element in the list, we can put any of the
n objects. For each of those choices, it holds that now we can only choose
one of the n − 1 remaining objects for the second entry in the list, etc.
Thus, there are
n!
n · (n − 1) · (n − 1) · · · · · (n − k + 1) =
(n − k)!
Example 3.13. Question: You are looking at 4 different pictures, and you
should tell, what is your favourite and what is your least favourite of the
pictures. In how many ways could you do that?
Answer: Let us, for simplicity, say that the four pictures are as follows:
Now, we say that we will first choose the picture, which we like the most
and label it with +, and we will then choose the picture, which we like the
least and label it with −. The result will then have the format:
+ −
37
+ −
If we have chosen the red picture as our favourite picture, then for the second
picture, we have the following 3 options for the other picture, we must select:
+ −
If we have chosen the green picture as our favourite picture, then for the
second picture, we have the following 3 options for the other picture, we
must select:
38
+ −
If we have chosen the blue picture as our favourite picture, then for the second
picture, we have the following 3 options for the other picture, we must select:
+ −
If we have chosen the orange picture as our favourite picture, then for the
second picture, we have the following 3 options for the other picture, we must
select:
39
+ −
40
Example 3.16. Question: For the running competition mentioned in a pre-
vious example, results are also recorded. The most important part of this
record is of course in what order the 150 runners finished. How many possi-
bilities are there?
Answer: 150!, which is a number so large, the authors’ calculator, a
TI-34II, gave up with an error. With a CAS (computer algebra system)
the author was able to compute the result, which is approximately 5.71 · 10262 ,
thus a number too large to put here explicitly.
We mentioned in a previous example that the number of atoms in the
observable universe is around 1 · 1080 , if we imagine that each of these atoms
is its own universe, with again 1 · 1080 “(sub)-atoms” and each of those is
again a whole universe with another 1 · 1080 “(sub-sub)-atoms”. Then the
total number of possible final results of the race would still be 10 trilliards
more than the total number of those sub-sub-atoms.
For the last exercise, it is impossible to write down all the possible out-
comes explicitly and count them. Still, using the formula we developed, we
are (theoretically) able to determine the number of arrangements.
41
total number of k-element variations of n objects with repetition is
· · · · · n} = nk .
| · n {z
n
k−times
366 = 2176782336
3.3 Combination
As in the previous sections, we still count in how many ways we can choose
some objects. In contrast to variation, sometimes it just matters, if a certain
object was chosen or not, not in what order the objects were chosen. The
prime example for this is “Lotto”, a certain game of change.
Example 3.19. In the Austrian version of Lotto, “6 aus 45”, everyone who
participates fills their lottery ticket by marking 6 from the 45 possible num-
bers, submits their ticket and gets a receipt. During the drawing, 45 balls of
the same size, which are labelled with the numbers 1 − 45, are then placed in
a transparent urn. A mechanism randomly chooses 6 from them for the main
prize. In order to win the main prize, the numbers that were selected on the
lottery ticked have to precisely match the ones that were drawn.
42
It is obvious that order is not important here, e.g. it does not matter if
you decide to mark the number 1 on the lottery ticket as your first or your
final choice, it only matters, if you have marked it or not. Similar with the
drawing procedure, it does not matter, if the ball with the number 1 was drawn
in the beginning, or at the end, just if it was drawn or not.
1. let us first assume that the order was important, in the example of
Lotto drawings, this makes perfect sense, as the Lotto drawings are
sometimes broadcasted in television and there we are able to see in
which order the balls are drawn by the machine,
2. let us then consider, in how many ways the chosen elements could have
been ordered, and let us divide by that number, after all, in the end
it should not matter, in which order the elements were chosen. For
the Lotto results, only the sorted list of drawn numbers (in ascending
order) is published, i.e. if 1 was drawn it will always be the first number
that is shown in the list of results, no matter when it was drawn, if 45
is among the winning numbers, it will always be the last element of the
list of results.
We already know (see section 3.2.1) that the total number of choosing k el-
ements from n objects if order is important (and repeats are not allowed) is
n!
. Now we have k elements that are chosen, and as order was impor-
(n − k)!
tant so far, we know in what order they were chosen. From section 3.1.1, we
know there are k! possible arrangements for those k numbers. Thus, in the
n!
list of possible results (when order matters), there are k! lists, that
(n − k)!
43
contain the same k elements, just in different order. Hence, if we just divide,
n!
(n−k)! n!
= ,
k! k!(n − k)!
we get the desired number. Note that
n! n
= ,
k!(n − k)! k
hence the binomial coefficient nk is often spelled out as “n choose k”.
Example 3.20. Question: How many subsets that contain precisely 2 ele-
ments does the set {a, b, c} have?
Answer: We see that this is combination without repetition, we should
select k = 2 elements from the set {a, b, c}, which has n = 3 elements. It
does not matter in which order we do it (this is a property of sets we studied
in section 1.2). It is easy to see that there are 3 possibilities:
1. either we do not choose a, which gives us {b, c},
2. or either we do not choose b, which gives us {a, c},
3. or either we do not choose c, which gives us {a, b}.
And in fact,
3 3!
= = 3.
2 2!(3 − 1)!
Reflecting what we did in the derivation of the formula, we could also
solve this problem like this. First we consider all tuples with non-repeating
elements we can make from the set {a, b, c}:
44
3.3.2 Combination with repetition
In this chapter, we present k-element combination of n objects with repeti-
tion, which represents choosing k elements from n objects, when the order
in which elements are chosen is not important, and when we are allowed to
choose the same element more than once. Similar as in section 3.2.2, we
allow k > n, as any object can be chosen multiple times. The corresponding
urn model is to draw balls with replacement, order does not matter.
In order to derive the formula for k-element combination of n objects
with repetition, we imagine n different objects x1 , x2 , . . . , xn . We create a
table where the columns are labelled with the xi , i = 1, 2, . . . , n, and put one
“+” in the corresponding column, if we want to select it once, two “+”, if we
want to select it twice, etc. and we simply leave the corresponding column
entry empty, if we do not want to select it. The total number of “+” has to
be k of course - after all, we want to choose k elements.
x1 x2 x3 ... xn
++ + +
In this example table, x1 would be chosen twice, x2 would not have been
chosen, . . . and xn would be chosen once.
Now, we could also represent this choice in another way. We could write
down a list of length n + k − 1 the following way. We start at x1 and write
down as many “+” as it was chosen. Then we write down a vertical bar “|”
to indicate that now we list a “+” for each choice of x2 . Then after adding as
many “+” as x2 was chosen, we write down another vertical bar, indicating
that we now consider how often x3 was chosen, etc. As there are n different
objects (x1 - xn ), we have to write down n − 1 vertical bars “|”; and as we
choose k elements, we have to write down k times a “+”. Hence, in total, we
really have a list containing n − 1 + k = n + k − 1 entries.
Example 3.22. The above table would correspond to the list
No. 1. 2. 3. 4. 5. 6. . . . n+k-2. n+k-1.
Symbol + + | | + | ... | +
or short: + + || + | . . . |+.
Example 3.23. Gummy bears are a very popular fruit gum candy, one brand
sells them in 6 different flavours, which are indicated by different colours:
pineapple (white), apple (green), lemon (yellow), orange (orange), strawberry
45
(pink) & raspberry (red). You offer a bag of gummy bears to some of your
friends and allow them to choose any 3 of them. We provide some examples
for possible choices (those are certainly not list all possible choices):
Friend # white # green # yellow # orange # pink # red
A 1 1 1
B 2 1
C 3
D 1 2
E 1 1 1
The corresponding lists in short form would be
A: +| + ||| + |,
B: | + +| + |||,
C: |||| + + + |,
D: +||| + +||,
E: +| + ||||+.
If we are ale to compute how many such lists there are, then we are
able to determine the number of k-element combinations of n objects with
repetition. The key insight is now: from the n + k − 1 positions in the list,
we choose k where we will put “+”, and we just put “|” in the remaining
n − 1 entries. It clearly does not matter in which order we choose positions,
and as we can only place one of the two symbols in each position, we have to
choose without repetition. Thus, we have k-element (the positions where we
put “+”) combination of n + k − 1 objects (the numbers from 1-n + k − 1,
which correspond to entries in the list), and the total number is given by (see
section 3.3.1)
n+k−1
.
k
Alternatively, we could also have chosen the positions,where we put the
n+k−1
n − 1 vertical bars “|”, which would result in possibilities. This
n−1
is not a contradiction, as in section 3 we already discussed the symmetry of
binomial coefficients,
n+k−1 n+k−1 n+k−1
= = .
k (n + k − 1) − k (n − 1
46
Example 3.24. Let us continue the example with choosing 3 gummy bears,
when there are 6 different flavours. We have that the total number of possible
choices is
6+3−1 8
= = 56.
3 3
47
different colours) and 4 smaller holes for the “key pegs” (black and white).
Additionally, at the end of the board, there is a row with 4 more holes for
code pegs and a shield to hide this row from one player. The first player, the
“code maker”, chooses a pattern of 4 coloured code pegs, which is then called
the “code”, and puts it into 4 holes at the end of the board and hides it from
the second player. There are 6 different colours to choose from, and a code
may contain multiple pegs of the same colour.
The other player, the “codebreaker”, should now try to find the code. To
this end, the codebreaker chooses a pattern as well and places it into the first
row of 4 holes. Then the code maker has to give feedback: for each peg of
the correct colour that was also placed in the correct position, the code maker
places a black key peg into a small hole next to the first row. For each peg in
the correct colour, but in an incorrect position, the code maker places a white
key peg.
Using this information, the codebreaker will now attempt to guess the
pattern using the second row of the board, etc.
10
1
In the above example, we see that the codebreaker already used 3 rows and is
close to breaking the code. How many possible codes are there?
Answer: We see that order (the position where the pegs are placed) mat-
ters, and we are allowed to use multiple pegs of the same colour. Thus, we
can answer the question by using 4-variation of 6 objects with repetition,
64 = 1296.
48
Answer: This is 5-combination of 15 objects without repetition. Hence,
the total number of options is
15
= 3003.
5
49
for a, b, c ∈ R and a ̸= 0. The set of solutions is then given by
c−b
L= .
a
If a = 0, then the above equation would simplify to b = c, which either has
no solution for x, i.e., L = ∅, this is the case when b ̸= c, or it has infinitely
many solutions, i.e., L = R, this is the case when b = c.
Example 4.1. Solve in R: x − 3 = 4.
Solution: We have U = R and D = R. We transform
4 − (−3)
x= ,
1
thus L = {7}.
Example 4.2. Solve in Z: x · 2 = 7.
Solution: We have U = Z and D = Z. We transform
7
x= ∈
/ Z,
2
hence L = ∅.
Example 4.3. Solve: 2x + 3 = 4 + x − 1 + x.
Solution: We have U = R and D = R. We isolate x and get
2x − x − x = 4 − 1 − 3,
50
4.1.2 Quadratic equations in one variable
Quadratic equations in one variable (often x) in canonical form are equations
of the form
ax2 + bx + c = 0,
for a ∈ R \ {0} and b, c ∈ R. If a = 0, then the equation simplifies
to a linear equation, which we already considered in section 4.1.1. By
adding/subtracting terms, any quadratic equation can be brought into the
canonical form. We hereby assumed again that, if not specified explicitly
otherwise, the underlying set is the set of real numbers.
Example 4.5. If we consider the quadratic equation
x2 − 9 = −8x,
by adding 8x to both sides of the equation, we get
x2 + 8x − 9 = 0.
Here a = 1, b = 8, c = −9.
A fact, which is particularly useful to determine the solution set, is that
the product of any real numbers r, s ∈ R is equal to 0, if and only if (at least)
one of the factors is equal to 0:
r · s = 0, iff r = 0 or s = 0.
Example 4.6. For solving x2 + 8x − 9 = 0, we may notice
x2 + 8x − 9 = (x − 1)(x + 9),
therefore x − 1 = 0 or x + 9 = 0. The solutions of those linear equations are
1 and −9 respectively, therefore L = {−9, 1}.
There are various special cases, but in the general case, we solve the
quadratic equation by the following transformations, which yield equivalent
equations:
ax2 + bx + c = 0 | · 4a
4a2 x2 + 4abx + 4ac = 0 | + b2 − 4ac
2 2
| {zx} + 4abx
4a b2 = b2 − 4ac
|{z} + |{z}
(2ax)2 2·(2ax)·(b) (b)2
51
Then, we introduce an auxillary variable y = 2ax + b and solve
y 2 = b2 − 4ac.
Thus, we reduced the problem to a special case, which yields the intermediate
solutions
√
y = ± b2 − 4ac,
if b2 − 4ac ≥ 0. In order to solve the initial equation, we solve the two (very
similar) linear equations
√
2ax + b = ± b2 − 4ac.
52
Given a quadratic equation in canonical form, by dividing by a, which is
assumed to be non-zero, we can represent the equation equivalently as
b c
x2 + x + = 0.
a a
b
Using the variables p = a
and q = ac , we have the equivalent equation
x2 + px + q = 0.
x2 + 8x − 9 = 0,
we have p = 8, q = −9.
b c
Using a
= p and= q, the solution formula can be transformed to:
a
√ √
−b ± b2 − 4ac −b b2 − 4ac
x= = ±
r2a 2a r 2a
p 2
b − 4ac p b2 4ac
=− ± 2
= − ± 2
− 2
2 r 4a 2 4a 4a
p p 2
=− ± −q
2 2
Example 4.9. For the quadratic equation
x2 + 8x − 9 = 0,
53
Assume that x1 and x2 are the solutions for the quadric equation
x2 + px + q = 0, (6)
i.e. x21 + px1 + q = 0 and x22 + px2 + q = 0. Clearly, x1 and x2 are also the
solutions of
(x − x1 )(x − x2 ) = 0.
Expanding the above expression yields
x2 − xx1 − xx2 + x1 x2 = 0,
x2 − (x1 + x2 )x + x1 x2 = 0. (7)
p = −(x1 + x2 ) and q = x1 x2 .
x2 + 8x − 9 = 0,
12 + 8 · 1 − 9 = 0.
Thus, we already found one solution, let us denote this one by x1 . As x1 ·x2 =
−9, we have x2 = −9 , and we found the two solutions.
54
If we tried e.g. x = (−1), we would get
x
−9 1
55
g(x) = x2 − 2
x
− 13 2
f (x) = 5x − 2x2
5x − 2x2 = x2 − 2
−3x2 + 5x + 2 = 0. (8)
hence L = − 31 , 2 .
56
x
− 31 2
57
Example 4.13. Solve x3 − x = 0. We factor x and solve x(x2 − 1) = 0. We
recognize x2 − 1 = (x + 1)(x − 1) and solve x(x + 1)(x − 1) = 0, hence the
three solutions are x1 = 0, x2 = 1, x3 = −1.
a(x2 )2 + b(x2 ) + c = 0,
ay 2 = by + c = 0,
58
As described above, we introduce the auxiliary variable y = x2 , and first solve
the following equation of degree 2
41 81
y2 − y+ = 0,
8 256
which yields
s 2
41 41 81
y= ± −
16 16 256
r
41 1681 81
= ± −
16 256 256
r
41 1600
= ±
16 256
41 40
= ± .
16 16
1
Hence we get the intermediate solutions y1 = 16 and y2 = 81
16
. Now, as we
2 2 1 2 81
set x = y, we solve both x = 16 = 0 and x = 16 . From the first equation,
we get the solutions x1 = 14 and x2 = − 14 , the second equation yields x3 = 49
and x4 = − 94 . Hence, for this equation, |L| = 4.
The geometric interpretation of this equation is
59
x
− 94 1
4
60
We use the pq-formula and get
2 √ √
x= ± 1 − 5 = 1 ± −4.
2
√ √
As −4 = i 4 = i2, we get
x1 = 1 + i2 and x2 = 1 − i2.
z 2 + pz + q = 0,
then p = −((x + iy) + (x − iy)) and q = (x + iy)(x − iy) and vice versa.
61
4.1.3 Radical equations in one variable
In this section, we briefly mention how to solve equations in one variable,
which contain square roots (radicals).
√
Example 4.19. We solve for x ∈ R: x + 2 = 3.
First, we determine the domain of definition, which is D = [−2, ∞).
Then, by squaring both sides of the equation, we get
x + 2 = 9,
which yields x = 7.
It can happen that such an equation has no solution, which can be easily
seen by considering the next example.
√
Example 4.20. We solve for x ∈ R: x + 2 = −3. The square root is
always a non-negative number, thus there cannot be a solution.
x + 2 = (−3)2 = 9,
62
√
Example 4.22. Solve for x ∈ R: x + 2 = x. As before, the domain of
definition is D = [−2, ∞) Squaring both sides of the equation yields
x + 2 = x2 ,
x − 1 = x2 − 2x + 1,
x2 − 3x + 2 = 0,
If we did not isolate the radical expression, using the binomial theorem,
squaring would yield an equation that contains a product of the root, i.e. the
radical expression would not vanish in general.
63
√
Example 4.24. Solving the equation x − 1 + 1 = x for x ∈ R, by squaring,
we get √
(x − 1) + 2 x + 2 · 1 = x2 ,
which is more complicated to solve than the initial equation.
Clearly, if there is more than one radical expression, isolating the root
expressions is not always possible. We present two even more involved ex-
amples.
√ √
Example 4.25. Solve for x ∈ R: 4x − 3 − 3x + 2 = 0. We determine
the domain of definition D = [ 43 , ∞) ∩ [ 32 , ∞) = [ 34 , ∞). Then, we see that
although there are two expressions√containing roots, we still are able to isolate
the radical expressions by adding 3x + 2 to both sides of the equation. Thus,
we have √ √
4x − 3 = 3x + 2,
and squaring yields
4x − 3 = 3x + 2,
which gives only one intermediate solution x = 5. Testing this solution shows
that the set of solutions L = {5}.
√ √
20 − 3 − 15 + 2 = 0.
√ √
Example 4.26. Solve for x ∈ R : x + 1 = x + 9. The domain of
definition is given by D = R≥0 . We cannot isolate both radical expressions
here. By squaring, we get
√
x + 2 x + 1 = x + 9,
which simplifies to √
x = 4,
√
which,
√ by squaring,
√ yields the intermediate solution x = 16. As 16 + 1 =
5 = 25 = 16 + 9, the set of solutions is indeed L = {16}.
64
well-defined, the expressions in the denominator of a fraction are not allowed
to be equal to 0.
The central technique is to clear the denominators, usually by multiplying
with the least common multiple (LCM) of all denominators. We call this
factor the least common denominator (LCD).
Example 4.27. Solve for x ∈ R: x1 = 4. As indicated, we note D = R \ {0}.
The LCD is x, thus we solve 1 = 4x, which yields x = 14 . Checking, if this
solution candidate really solves the equation confirms L = { 41 } ⊂ D.
We present more examples.
1 2
Example 4.28. Solve for x ∈ R: x−1 = 3x . We determine D = R \ {0, 1}
and the LCD (x − 1) · 3x. Hence, we solve
3x = 2(x − 1),
2(x − 2) + 4(x + 2) = x − 1,
x2 + 4x + 12 = 12,
which simplifies to
x(x − 4) = 0.
Hence, we consider the solution candidates x = 0 or x = −4. As 0 ∈
/ D, we
checkx = −4,
4 1 1
− + =− ,
12 12 4
hence L = {−4}.
65
4.1.5 Exponential and logarithmic equations in one variable
The topic of this section is solving equations, which contain exponentials or
logarithms, which are equations that contain expressions of the form bp or
logb (c), usually for b > 0, b ̸= 1. For such an expression bp , the number b is
called the base and p is called the exponent or power.
In high school, exponents are usually introduced as a shorthand notation
for repeated multiplication.
xn = x
| · x {z
· · · · · x} .
n-times
Extending this notion first to exponents that are integers, then to rational
numbers and finally to all real numbers, we get the following computation
rules for exponents.
p
• x1 = x and x0 = 1, • xy = xyp .
p
• for p ∈ R, log(xp ) = p,
66
• logx (c · d) = logx (c) + logx (d),
• logx ( dc ) = logx (c) − logx (d),
• logx (cd ) = d logx (c).
Now, using the computation rules we presented, we will solve some ex-
ponential and logarithmic equations. The central technique to solve those
equations is by taking corresponding logarithms on both sides of the equation
and writing both sides of the equation as the exponent of a corresponding
base respectively.
Example 4.34. We solve for x ∈ R: 3x+2 = 92x−3 . We notice 9 = 32 , hence
92x−3 = 34x−6 . We solve 3x+2 = 34x−6 by taking log3 on both sides, which
yields the linear equation
x + 2 = 4x − 6,
This equation can be solved using the methods from section 4.1.1, and has
the solution x = 83 . Checking this solution yields
8 14
3 3 +2 = 3 3 ,
and
8 7 14
92· 3 −3 = 9 3 = 3 3 .
Example 4.35. We solve 7 · 9x + 4x = 8 · 6x . As 6x ̸= 0 for all x ∈ R, we
can divide both sides by 6x . Hence, we solve
x x
9 4
7· + = 8.
6 6
9
As simplifies to 32 and 64 simplifies to 23 = 13 , we introduce the auxiliary
6
x 2
variable t = 32 , substitute this variable and solve the rational equation
1
7t + = 8,
t
68
x
for t. The LCD is t, we remark t ̸= 0 (see section 4.1.4), but as 32 > 0 for
all x ∈ R, this does not restrict the set of solutions. We multiply both sides
of the equation with t and get the quadratic equation
7t2 + 1 = 8t,
7t2 − 8t + 1 = 0.
Using the abc-formula (see section 4.1.2), we get the intermediate solutions
1
t1 = 1 and t2 = .
7
x
Now, we solve t = 32 for each intermediate solution. By taking log 3 on
x x 2
both sides, 1 = 23 yields x1 = 0. Similarly, from solving 17 = 32 we get
1 ln(7)
x2 = log 3 = log 3 (1) − log 3 (7) = − .
2 7 | {z2
} 2 ln(3) − ln(2)
=0
· 9}0 + |{z}
While checking the solution x1 = 0, is quite easy, we have 7| {z 40 =
=7 =1
· 6}0 , checking the solution x2 = − log 3 (7) is non-trivial.
8| {z
2
=8
Example 4.36. We solve for x ∈ R: log5 (x + 2) + log5 (3) = log5 (27). First,
we determine D = (−2, ∞), then we transform the equation to
which simplifies to
5log5 (x+2) · 5log5 (3) = 27
and
(x + 2) · 3 = 27.
This is a linear equation (see section 4.1.1) and has the solution x = 7 > −2.
Checking the solution candidate confirms we have found a solution
69
Example 4.37. We solve for x ∈ R: logπ (x) + logπ (x − 2) = logπ (x + 10).
First, we determine D = (0, ∞) ∩ (2, ∞) ∩ (−10, ∞) = (2, ∞), then we
transform the equation to
which simplifies to
π logπ (x) · π logπ (x−2) = x + 10
and
x · (x − 2) = x + 10.
This is a quadratic equation (see section 4.1.2) and has the solutions x1 = 5
and x2 = −2. As x2 ∈ / D, we only check the solution candidate x1 .
the other. In all four cases, the numbers or expressions are not equal to
one another.
70
Note, that a < b is equivalent to b > a and that a ≤ b is equivalent to
b ≥ a.
Sometimes, inequalities are only used as statements which can be true or
false. For example, 3 < 4 is a true statement and 5 ≥ 7 is a false statement.
But, most of the time, inequalities contain an unknown variable, mostly
denoted by x, for example, 4x − 1 ≥ 32 or − 14 x2 < 3x + 7. Then, we want
to determine which value(s) of x satisfy the inequality. This is called solving
an inequality. When not otherwise specified, as in previous sections, R is the
base set for the solution of an inequality. Usually, but not necessarily, the
solution of an inequality is a set, not a number. Therefore, the solution of
an inequality is (again) called the solution set, we will again denote it by L.
Similar to working with equations, there are four types of equivalent trans-
formations which can be used to solve inequalities: addition, subtraction,
multiplication, and division.
• Addition: Adding the same number to each side of an inequality leads
to an equivalent inequality. The inequality sign does not change.
• Subtraction: Subtracting the same number from each side of an in-
equality leads to an equivalent inequality. The inequality sign does not
change.
• Multiplication: There are two cases.
1. Multiplying both sides of an inequality with the same positive
number yields an equivalent inequality. The inequality sign does
not change.
2. Multiplying both sides of an inequality with the same negative
number does not yield an equivalent inequality; for this, the di-
rection of the inequality sign has to be reversed as well.
Example 4.38. Let us have a look at the inequality 5 < 8. We want
to multiply it with −3. In order to get an equivalent inequality, we not
only have to multiply 5 and 8 with −3, but we also need to reverse the
inequality sign. We get −15 > −24, which is equivalent to 5 < 8.
71
2. Dividing both sides of an inequality by the same negative number
does not yield an equivalent inequality; for this, the direction of
the inequality sign has to be reversed as well.
• 5x + 3
7
≤0
• −7x + 1 > 0
• 4
5
x −3≥4
• 1 > −2x − 91
• x < − 12
34
• 52 ≤ −4x
72
Example 4.41. Solve the linear inequality 4x − 3 > 21 for x ∈ R. State the
solution set and visualize it on the number line.
We look at the inequality 4x − 3 > 21. Since we want to isolate x,
we begin with adding 3 on both sides and obtain the equivalent inequality
4x > 24. Now we need to divide both sides of the inequality by 4. Since 4 is
a positive number, we do not need to reverse the inequality sign. We obtain
the equivalent inequality x > 6 and have already isolated x.
The solution set of the linear inequality 4x − 3 > 21 (for x ∈ R) is
the set of all real numbers which are greater than 6, i.e., the solution set is
L = {x ∈ R : x > 6} = (6, ∞). We visualize the solution set on the number
line:
5 6 7 8 9 10 11 12 13 14 15 16
−3 −2 −1 0 1 2 3 4 5 6 7 8
Example 4.43. You are saving money to buy a new smartphone. You cur-
rently have 200 e, and you want to save enough money to buy a smartphone
that costs 700 e within the next 9 months. Let m represent the amount of
money you have to save each month. Write a linear inequality to represent
this situation, and find the range of values for m that would allow you to buy
the smartphone.
This problem can be modelled by the following linear inequality. Solve for
m ∈ R: 9 · m + 200 ≥ 700. We subtract 200 from both sides and get
9 · m ≥ 500.
The interpretation for this inequality is “How much money do we need to
save each of the nine months, to get the 500 e we still need?”. Dividing both
sides by 9, the number of months yields
500
m≥ = 55.5̇.
9
Considering this is a “real world problem”, we have to save at least 55.56 e
per month.
73
4.2.2 Quadratic inequalities in one variable
After studying linear inequalities, as for equalities, we consider the quadratic
case. In fact, solving quadratic inequalities heavily depends on being able to
solve quadratic equalities (see section4.1.2).
We will only study quadratic inequalities that are of the form x2 +px+q >
0, x2 + px + q < 0, x2 + px + q ≥ 0 or x2 + px + q ≤ 0, for p, q ∈ R. Using
the equivalent transformations presented on page 71 it should be clear that
any quadratic inequality can always be brought in any of these forms.
We discuss the case that the inequality is of the form x2 + px + q > 0, the
other cases work analogously, the differences will be discussed further below.
The solution algorithm in this case can be summarized as:
1. Solve x2 + px + q = 0. Assume there are 2 distinct solutions x1 and x2 .
2. Rewrite the inequality as a product of the linear factors (x − x1 )(x −
x2 ) > 0.
3. In order that the above inequality is fulfilled, either both x − x1 > 0
and x − x2 > 0, or both x − x1 < 0 and x − x2 < 0. Recall, for a
product of two factors to be positive, either both are positive or both
are negative, they need to have the same “sign”.
4. Let L1,1 be the solution of x − x1 > 0 and L1,2 be the solution of
x − x2 > 0, then we compute L1 = L1,1 ∩ L1,2 . Furthermore, let L2,1 be
the solution of x − x1 < 0 and L2,2 be the solution of x − x2 > 0, then
we compute L2 = L2,1 ∩ L2,2 .
5. The solution is given by L = L1 ∪ L2 .
Example 4.44. We solve for x ∈ R: 2x2 + x − 3 > 0. First, by division by
2, we bring it into the required form: x2 + 12 x − 32 > 0.
1. We solve x2 + 21 x − 32 = 0, which has the solutions x1 = − 23 and x2 = 1.
2. We rewrite the inequality to (x + 32 )(x − 1) > 0.
3 3
3. We inspect x + 2
> 0 and x − 1 > 0, and we inspect x + 2
< 0 and
x − 1 < 0.
4. We determine L1,1 = (− 23 , ∞) and L1,2 = (1, ∞), hence L1 = (1, ∞).
Furthermore, we have L2,1 = (−∞, − 32 ) and L2,2 = (−∞, −1), thus
L2 = (−∞, − 32 ).
74
5. The solution is therefore L = (−∞, − 23 ) ∪ (1, ∞) = R \ [− 32 , 1].
−3 −2 −1 −1 1 2
−2
−3
75
9
−3 −2 −1 −1 1 2
−2
−3
We remark that for strict inequalities, i.e. inequalities of the forms “>” or
“<”, we always have to exclude the solutions of the corresponding quadratic
equation. In the cases “≥” and “≤”, we include them.
Let us now deal with the special cases, when in step 1 we did not find
two distinct solutions.
• If the corresponding quadratic equation has no solution, either the
inequality under consideration is always true, then L = R, or the in-
equality under consideration is always false, then L = ∅.
In order to find out, which of those two cases we have for a given
inequality, we only need to consider what happens when we set x =
0, which means the inequality simplifies to q > 0, q < 0, q ≥ 0 or
q ≤ 0 respectively. As described in the introduction to this section,
this statement can be either true or false, if the statement is true, then
L = R, otherwise L = ∅.
Example 4.46. We consider possibly the simplest examples for those
cases.
– Solve for x ∈ R: x2 +1 > 0. The corresponding equation x2 +1 = 0
has no real solutions, hence we inspect 1 > 0. As this statement
is true, the solution set is L = R.
– Solve for x ∈ R: −x2 − 1 > 0. The corresponding equation −x2 −
1 = 0 has no real solutions, hence we inspect−1 > 0. As this
statement is false, the solution set is L = ∅.
76
10
2
x +1>0 8
7 L=R
6
−3 −2 −1 −1 1 2 3
−2
−3
−4
−5
−6
−7 L=∅
2
−x − 1 > 0−8
−9
−10
77
– If we solve x2 − 2x + 1 > 0, we have L = R \ {1}.
– If we solve x2 − 2x + 1 ≤ 0, we have L = {1}.
– If we solve x2 − 2x + 1 < 0, we have L = ∅.
6 6 6 6
5 5 5 5
2 2 2
4 (x − 1) ≥ 0 4 (x − 1) > 0 4 (x − 1) ≤ 0 4 (x2 − 1) < 0
3 3 3 3
2 2 2 2
1 1 1 1
1 > 2x.
1 < 2x.
78
This linear inequality has the solution set L2 = (0, ∞) ∩ ( 12 , ∞) =
( 12 , ∞).
Hence, the result is L = L1 ∪ L2 = R \ [0, 21 ].
−3 −2 −1 1 2 3
−1
−2
−3
79
√ √
which holds for x ∈ R \ (−
√ √5, 5). Hence, the intermediate solution
set L1 = (−1, 1) ∩ R \ (− 5, 5) = ∅.
−3 −2 −1 1 2 3
−1
−2
−3
80
assignment of numbers to the variables, s.t. all the equations are satisfied
simultaneously. We present two solution methods, solving by substitution
method and solving by elimination method.
A system of equations is called consistent, if it has at least one solution,
it is called inconsistent, if it has no solution.
5x −2y =1
3x +3y =9
We decide that we want to solve the first equation for x, i.e. to isolate
the variable x (we could have also chosen differently). This yields (see sec-
tion 2.2).
1 + 2y
x= , (9)
5
and replacing x in the second equation with this expression we get
1 + 2y
3· + 3y = 9.
5
81
Thus, after simplification, we have y = 2. We substitute this back into
equation
(9), which then simplifies
to x = 1. We have a unique solution
x 1 1
= , therefore L = { }.
y 2 2
The geometric interpretation of solving a system with 2 linear equations
in 2 variables is to compute all points of intersection of the corresponding
affine linear functions.
Example 4.51. For the example above, the geometric interpretation is
3x + 3y = 9 y 5x − 2y = 1
4
2 (1, 2)
1
x
−4 −3 −2 −1 1 2 3 4
−1
−2
−3
−4
In the previous case, we found one unique solution, and the geometric in-
terpretation is that we compute the point of intersection of the corresponding
affine linear functions. However, we could easily think of two affine linear
functions, where the corresponding graphs are parallel lines. How would
solving a corresponding system of linear equations look like?
Example 4.52. Solve for x, y ∈ R:
5x −2y =1
10x −4y = −12
82
We decide, as before, that we want to solve the first equation for x, i.e. to
isolate the variable x (again, we could have also chosen differently). We have
1 + 2y
x= , (10)
5
and replacing x in the second equation with this expression we get
1 + 2y
10 · − 4y = −12,
5
which simplifies to
2 = −12.
This statement is a contradiction, no values for x or y satisfy this equation.
Hence, we have no solution, i.e. L = ∅. As two parallel lines do not intersect,
we expected to find no intersection.
y
4 10x − 4y = 12
1
x
−4 −3 −2 −1 1 2 3 4
−1
−2
−3
−4
5x − 2y = 1
One case that we did not yet consider is the case, when for a given system
of linear equations, the graphs of the corresponding affine linear functions
coincide, i.e. when the geometric interpretation is twice the same line. Then,
we expect infinitely many solutions, as the two lines then have infinitely many
points of intersection.
83
Example 4.53. Solve for x, y ∈ R:
5x −2y =1
10x −4y =2
We decide, as before, that we want to solve the first equation for x, i.e. to
isolate the variable x (again, we could have also chosen differently). We have
1 + 2y
x= . (11)
5
Replacing x in the second equation with this expression we get
1 + 2y
10 · − 4y = 2,
5
which simplifies to
2 = 2.
As this is a tautology, the second equation does not give an additional con-
straint. I.e. the solution set is the set of solutions
of equation (11). We have
x
infinitely many solutions, as every point ∈ R2 , with x = 52 y + 15 is a
y
1
solution. Choosing y = 0, leads to the solution 5 , choosing y = 5 leads
11 0
−1
to the solution 5 , choosing y = −3 leads to the solution etc. We
5 −3
can denote the solution set by
x 2 2 1 x 2 5 1
L= ∈R : x= ·y+ = ∈R : y = ·x− ,
y 5 5 y 2 2
or equivalently
1 2
0 1
L= 5 +λ· 5 : λ∈R = +λ· 5 : λ∈R .
0 1 − 12 2
The second way of representing the solution set, which corresponds to all
points on the corresponding line, will be easier to understand after studying
vectors in section ??.
84
y
4 10x − 4y = 2
1
x
−4 −3 −2 −1 1 2 3 4
−1
−2
−3
−4
5x − 2y = 1
Example 4.54. At his lemonade stand, Chris sells two types of drinks:
lemonade soda and strawberry soda. On the first day, 9 lemonade and 12
strawberry sodas for a total of 54 e are sold. Chris took in 36 e on the sec-
ond day, by selling 12 lemonade sodas and 4 strawberry sodas. What is the
price each of one lemonade and one strawberry soda?
We set x . . . price of one lemonade soda and y . . . price of one strawberry
soda. Therefore, we solve for x, y ∈ R+ 0:
9x +12y = 54
12x +4y = 36
This time, we choose to consider the second equation and solve for y (we
could have also chosen differently). This yields
36 − 12x
y= = 9 − 3x. (12)
4
85
Substituting this expression into the first equation gives
2. no solution, this can happen for example when at least 2 planes are
parallel,
86
Now, as discussed in the previous section, we solve this 2D system. We
isolate y in the first equation of this 2D system of linear equations, which
gives y = −2z +3. Substituting this into the other equation, we get a solution
for z,
z = 2.
Now, we are able to solve for y,
y = −2 · 2 + 3 = −1,
−3.8 = −5,
87
We isolate the variable x in the third equation, which yields x = 2y − 7z + 7.
Substituting this into the remaining equations and simplifying gives
2y −19z = −30
2y −19z = −30
As these two equations are the same, it is obvious that we will get infinitely
many solutions. We could choose for any value for the variable z, which
would then first yield a value for y, and then a corresponding value for x.
(In this situation, we could also choose any value for y, which would then
yield corresponding values for z and x.)
In order to explicitly indicate that the solution depends on some choice,
we introduce a parameter λ. We then set z = λ, which yields y = −15 + 19 2
λ.
Therefore, x = 2 −15 + 19 2
λ − 7λ + 7 = −23 + 12λ.
Hence, using z = λ = 0 + 1 · λ, the set of solutions is
−23 12
19
L= −15 + λ 2 : λ ∈ R .
0 1
88
Those statements are tautologies, if we choose a value for the variable z and
choose a value for y, from x = −y − z + 1, we get the corresponding value
for x. (Again, we could have chosen e.g. values for x and y and get the
corresponding value for z.)
In order to explicitly indicate that the solution depends on some choice,
we again first introduce a parameter λ. We then set z = λ = 0 + λ, which
yields x = −y − λ + 1. As we still have some choice, we introduce another
parameter µ and set y = µ = 0 + µ. Then, the value for x is determined by
x = −µ − λ + 1.
Hence, the set of solutions is
1 −1 −1
L = 0 + λ 0 + µ 1 : λ, µ ∈ R .
0 1 0
89
−81x3 + 131
2
x4 + 111
2
x5 =0
−148x3 + 263
2
x4 + 173
2
x5 =0
−91x3 +73x4 +48x5 =0
which simplifies to
45885 890475 5780
− − x5 = − x5 = 0.
62046 62046 383
90
one, as all corresponding coefficients will be equal to 0. Then, by just con-
sidering the system of all linear equations except the one we fixed, we again
reduced the number of variables and the number of equations by 1, and we
may repeat the procedure.
We illustrate how the algorithm works again by first considering a few
2-D examples.
3x −2y =1 (I)
6x +3y =9 (II)
Note that we have labelled the equations, which makes it easier to illustrate
what we are doing. We are now choosing to eliminate the variable x, which
occurs in both equations. We have again some freedom of choice, but we
choose equation (I). The goal is now to multiply the equation (1) by a number,
s.t. the coefficients for x in this equation and equation (II) match. I.e., we
solve c1 · 3 = 6.
3x −2y =1 (I)
6x +3y =9 (II)
91
Now, we could, similar as in the section about the substitution method, sub-
stitute this expression for y into equation (I) and get corresponding solutions
for x. We present an alternative, we use equation (II 1 ) to eliminate the
variable y in equation (I). Hence, we solve c2 · 1 = −2, which yields c2 = −2.
3x -2y =1 (I)
0x +1y =1 (II 1 )
x +0y =1 (I 1 )
0x +y =1 (II 1 )
From this very simplesystem
of equations, we can immediately read off the
1
set of solutions L = { }.
1
Now, let us see how the elimination method works for an example of a
system of linear equations that has no solution.
x
Example 4.61. We solve for ∈ R2 :
y
2x −2y =1 (I)
6x −6y =9 (II)
We are now choosing to eliminate the variable x, which occurs in both equa-
tions. We have again some freedom of choice, but we choose equation (I).
The goal is now to multiply equation (1) by a number, s.t. the coefficients for
x in this equation and equation (II) match. I.e., we solve c1 · 2 = 6. We have
that c1 = 3. Thus, we compute a new equation (II 1 ) : (II) − 3 · (I). This
yields the following system of linear equations, which has the same solution
set as the original problem:
2x −2y =1 (I)
0x +0y =6 (II 1 )
92
Equation (II 1 ), 0 = 6, is a contradiction, no choice for x, y ∈ R can fulfil
this equation. Thus, L = ∅.
Finally, let us see how the elimination method works for a 2-D example
of a system of linear equations, which has infinitely many solutions.
x
Example 4.62. We solve for ∈ R2 :
y
2x −2y =1 (I)
6x −6y =3 (II)
The system of equations is almost the same as before, hence we apply the
same solution strategy. We are now choosing to eliminate the variable x,
which occurs in both equations, and we choose equation (I). We compute a
new equation (II 1 ) : (II) − 3 · (I). This yields the following system of linear
equations, which has the same solution set as the original problem:
2x −2y =1 (I)
0x +0y =0 (II 1 )
Equation (II 1 ), 0 = 0, is a tautology, every choice for x, y ∈ R fulfils this
equation. Thus, as equation (II 1 ) does not present any restriction, we just
consider the linear equation 2x − 2y = 1. In the previous section, we already
noticed that now for every choice of y, we get a corresponding solution for the
variable x and vice versa. We parametrize y = λ = 0+λ and get 2x−2λ = 1,
which is equivalent to x = 21 + λ. Thus,
1
1
L={ 2 +λ· : λ ∈ R}.
0 1
93
We choose to eliminate the variable x from equations (II) and (III) by using
equation (I). We determine (II 1 ) : (II) + 1 · (I) and (III 1 ) : (III) − 2 · (I).
We choose to eliminate the variable z from equations (I) and (II 1 ) by using
equation (III 2 ). We determine (I 1 ) : (I) − 3 · (III 2 ) and (II 2 ) : (III 2 ) − 2 ·
(III 2 ).
x −2y +0z =3 (I 1 )
0x +y +0z = −1 (II 2 )
0x +0y +z =2 (III 2 )
Finally, before we are able to read off the solution very easily, we eliminate
the variable y from equation (I 1 ) by using equation (II 2 ). We determine
(I 2 ) : (I 1 ) + 2 · (II 2 ).
x +0y +0z =1 (I 2 )
0x +y +0z = −1 (II 2 )
0x 0y +z =2 (III 2 )
94
of linear equations in one less variable and one less equation. (Here: equa-
tions (II 1 ) and (III 1 ) can be treated as equations that only depend on the
variables y and z.) We then eliminated the next variable (y) from all but
one equation of the “reduced” system.
Now, we inspect again example 4.57, to see how this would be solved
using the elimination method.
Example 4.64. We solve for x, y, z ∈ R:
3x −4y +2z = −9 (I)
−4x +4y +10z = 32 (II)
−x +2y −7z = −7 (III)
We choose the variable x and use equation (III) to eliminate the variable x
from the other equations, hence, we compute (I 1 ) : (I) + 3(III) and (II 1 ) :
(II) − 4(III).
0x +2y −19z = −30 (I 1 )
0x −4y +38z = 60 (II 1 )
−x +2y −7z = −7 (III)
For now, we leave equation (III), and use equation (I 1 ) to eliminate the
variable y in equation (II 1 ). We determine (II 2 ) : (II 1 ) + 2(I 1 ).
0x +2y −19z = −30 (I 1 )
0x +0y +0z =0 (II 2 )
−x +2y −7z = −7 (III)
As equation (II 2 ) is a tautology, it does not represent an additional con-
straint, we see that we will have infinitely many solutions. We now eliminate
the variable y also from equation (III).
0x +2y −19z = −30 (I 1 )
0x +0y +0z =0 (II 2 )
−x +0y +12z = 23 (III 1 )
It is now maybe even more obvious that, if we choose a value for z, by
equation (I 1 ) we will find a corresponding value for the variable y, and by
equation (III 1 ) we will find a corresponding value for the variable x. We
parametrize z = 0 + λ, and we get y = −15 + 19 2
λ and x = −23 + 12λ. Thus,
−23 12
L = −15 + λ 19 2
: λ∈R .
0 1
95
Many other sources, and also the author, find it more convenient, to al-
ways write the equation, which is used to eliminate the first, second, third,
etc. variable, as the first, second, third, etc. equation, if possible. It should
be clear that just presenting the equations in different order does not change
the solution set. (After all, the solution set is the intersection of the solu-
tion sets for the first, second, third, etc. equation and set intersection is
commutative and associative.)
Example 4.65. The solution set L1 ⊂ R3 of the system of linear equations
3x −4y +2z = −9 (I)
−4x +4y +10z = 32 (II)
−x +2y −7z = −7 (III)
and the solution set L2 ⊂ R3 of the system of linear equations
−x +2y −7z = −7 (III)
3x −4y +2z = −9 (I)
−4x +4y +10z = 32 (II)
are the same. We just swapped equations (I) and (III).
Moreover, when we chose an equation that is used to eliminate a variable,
often (but not always) it is slightly transformed in such a way that the
coefficient of the variable that gets eliminated is equal to 1. As we are only
able to eliminate variables with non-zero coefficient, this can always be done
by dividing the left-hand side and the right-hand side of the equation by this
coefficient. This certainly does not change the solution set.
Example 4.66. The solution set L1 ⊂ R3 of the system of linear equations
−x +2y −7z = −7 (I)
3x −4y +2z = −9 (II)
−4x +4y +10z = 32 (III)
and the solution set L2 ⊂ R3 of the system of linear equations
x −2y +7z =7 (I 1 )
3x −4y +2z = −9 (II)
−4x +4y +10z = 32 (III)
are the same. We just divided equation (I) by (−1) to get the equivalent
equation (I 1 ).
96
4.3.2.3 n-D We illustrate the power of clever use of the elimination
method, given when we have some experience in using this method, by con-
sidering example 4.67 again. We remember that for solving this system using
the substitution method, we had to consider a lot of fractions, which is cer-
tainly a great source of errors, when computing by hand. And even, when
we use a calculator, it may be difficult to not lose track, which decimal rep-
resentation of which fraction is stored in which part of the memory - not to
speak of round errors. We cannot fully avoid the problem of large numbers
here, but using the greatest common divisor sometimes, we are able to avoid
fractions.
Using this equation to eliminate the variable x1 in all other equations, we get
x1 −5x2 +8x3 −10x4 +0x5 =5 (I 1 )
0x1 +13x2 −20x3 +25x4 +6x5 =0 (II 1 )
0x1 +25x2 −26x3 +38x4 +3x5 =0 (III 1 )
0x1 +21x2 −37x3 +56x4 −2x5 =0 (IV 1 )
0x1 +32x2 −53x3 +64x4 +3x5 =0 (V 1 )
Now, in order to get an equation with coefficient 1 for the variable x2 , we
determine (II 2 ) : 2(II) − (III).
97
x1 −5x2 +8x3 −10x4 +0x5 =5 (I 1 )
0x1 +x2 −14x3 +12x4 +9x5 =0 (II 2 )
0x1 +0x2 +324x3 −262x4 −222x5 =0 (III 2 )
0x1 +0x2 +257x3 −196x4 −191x5 =0 (IV 2 )
0x1 +0x2 +395x3 −320x4 −285x5 =0 (V 2 )
Finding a combination of equations that produces a coefficient 1 for the vari-
able x3 is not so easy, but computing the gcd (greatest common divisor) of 324
and 257 yields gcd(324, 257) = 1 = −233̇24 + 29 · 257. Hence, we determine
(III 3 ) : −23(III 2 ) + 29(IV 2 ):
Thus, we consider
x1 −5x2 +8x3 −10x4 +0x5 =5 (I 1 )
0x1 +x2 −14x3 +12x4 +9x5 =0 (II 2 )
0x1 +0x2 +x3 +342x4 −433x5 = 0 (III 3 )
0x1 +0x2 +0x3 −88090x4 +111090x5 = 0 (IV 3 )
0x1 +0x2 +0x3 −135410x4 +170750x5 =0 (V 3 )
We divide equations (IV 3 ) and (V 3 ) by the common factor 10, and then, in
order to eliminate the variable x4 , we determine (IV 4 ) : −6264 · (IV 3 ) +
4075 · (V 3 ), which gives
In the second phase, using this equation, from which we already see x5 = 0,
to eliminate the variable x5 from every other equation.
98
x1 −5x2 +8x3 −10x4 +0x5 =5 (I 2 )
0x1 +x2 −14x3 +12x4 +0x5 =0 (II 3 )
0x1 +0x2 +x3 +342x4 +0x5 =0 (III 4 )
0x1 +0x2 +0x3 +x4 +0x5 =0 (IV 5 )
0x1 +0x2 +0x3 +0x4 +1x5 =0 (V 5 )
(Note: Technically, equation (I 1 ) did not change.) We repeat this procedure
for the other variables, which yields the systems
x1 −5x2 +8x3 +0x4 +0x5 =5 (I 3 )
0x1 +x2 −14x3 +0x4 +0x5 =0 (II 4 )
0x1 +0x2 +x3 +0x4 +0x5 = 0 (III 5 )
0x1 +0x2 +0x3 +x4 +0x5 = 0 (IV 5 )
0x1 +0x2 +0x3 +0x4 +1x5 =0 (V 5 )
and
x1 −5x2 +0x3 +0x4 +0x5 =5 (I 4 )
0x1 +x2 +0x3 +0x4 +0x5 =0 (II 5 )
0x1 +0x2 +x3 +0x4 +0x5 = 0 (III 5 )
0x1 +0x2 +0x3 +x4 +0x5 = 0 (IV 5 )
0x1 +0x2 +0x3 +0x4 +1x5 =0 (V 5 )
and
x1 +0x2 +0x3 +0x4 +0x5 =5 (I 5 )
0x1 +x2 +0x3 +0x4 +0x5 =0 (II 5 )
0x1 +0x2 +x3 +0x4 +0x5 = 0 (III 5 )
0x1 +0x2 +0x3 +x4 +0x5 = 0 (IV 5 )
0x1 +0x2 +0x3 +0x4 +1x5 =0 (V 5 )
From this last system, we are able to read off the solution x1 = 5, x2 = x3 =
x4 = x5 = 0.
Another thing we may have noticed is that we had to write the “names”
of the variables x1 , x2 , . . . again and again and again, which consumes a lot
of paper. We could avoid this by using a table and the following convention:
In the first column, we always write the coefficient of the first variable, in the
second column, we always write the coefficients of the second variable and
so on. The last column represents the coefficient on the right-hand side of
the equation. If we transform a system of linear equations into an equivalent
one, we separate them using two horizontal lines in the table. We now use
the very first column to label the equation.
99
Example 4.68. We solve for x1 , x2 , x3 , x4 ∈ R:
2x1 −4x2 +8x3 +4x4 = −40 (I)
6x1 −3x2 −2x3 −3x4 = 51 (II)
6x1 +3x2 −10x3 −10x4 = 122 (III)
−6x1 +6x2 +8x3 = −48 (IV )
The corresponding table would look like this.
x1 x2 x3 x4
(I) 2 −4 8 4 −40
(II) 6 −3 −2 −3 51
(III) 6 3 −10 −10 122
(IV ) −6 6 8 0 −48
We now solve this example using the elimination method. You may want to
check the computation by writing out each row of the table explicitly.
100
x1 x2 x3 x4
(I) 2 −4 8 4 −40
(II) 6 −3 −2 −3 51
(III) 6 3 −10 −10 122
(IV ) −6 6 8 0 −48
(I 1 ) 1 −2 4 2 −20
(II 1 ) 0 −9 −26 −15 171
(III 1 ) 0 15 −34 −22 242
(IV 1 ) 0 −6 32 12 −168
(I 1 ) 1 −2 4 2 −20
(IV 2 ) 0 1 − 32
6
−2 28
(II 2 ) 0 0 22 3 −81
(III 2 ) 0 0 46 8 −178
(I 1 ) 1 −2 4 2 −20
(IV 2 ) 0 1 − 32
6
−2 28
3
(II 3 ) 0 0 1 22
− 81
22
38
(III 3 ) 0 0 0 22
− 190
22
(I 1 ) 1 −2 4 2 −20
(IV 2 ) 0 1 − 32
6
−2 28
3
(II 3 ) 0 0 1 22
− 81
22
(III 4 ) 0 0 0 1 −5
(I 2 ) 1 −2 4 0 −10
(IV 3 ) 0 1 − 32
6
0 18
(II 4 ) 0 0 1 0 −3
(III 4 ) 0 0 0 1 −5
(I 3 ) 1 −2 0 0 2
(IV 4 ) 0 1 0 0 2
(II 4 ) 0 0 1 0 −3
(III 4 ) 0 0 0 1 −5
(I 4 ) 1 0 0 0 6
(IV 4 ) 0 1 0 0 2
(II 4 ) 0 0 1 0 −3
(III 4 ) 0 0 0 1 −5
101
Hence, the solution set is
6
2
L = {
−3}.
−5
We see that this way of writing down the equivalent systems of linear
equations is much more efficient than repeating all the variable names for
each equation each time - all in all, it is just the coefficients that matter.
In section ??, an even more compact way of writing down the equivalent
systems which appear when solving this system with the elimination method
will be presented.
Exercise 4.69. Solve all systems of linear equations, which were presented
during this section (section 4.3), using the “table”-notation.
Comparing the last example with example 4.64, we immediately see that
sometimes undetermined systems occur naturally: whenever we study a sys-
tem of linear equations with equal number of variables and equations, and
102
it turns out at least one of the equations does not represent an additional
constraint, i.e., using the other equations, we can transform (or reduce) it
to the form “0 = 0”, then, by omitting this equation, we are left with an
underdetermined system.
The next simple example shows that underdetermined systems also may
have no solutions.
Example 4.71. We determine the solution set L ⊆ R3 of
x +y +z =0 (I)
x +y +z =1 (II)
Choosing to eliminate the variable x by using equation (I), we get the equa-
tion (II 1 ) : (II) − (I),
0 +0y +0z =1 (II 1 )
which is a contradiction. The system is hence inconsistent, we have no solu-
tion.
L = ∅.
It may be obvious, but the next example shows that trivially, overdeter-
mined systems also may be inconsistent.
Example 4.72. We solve for x ∈ R:
x =0 (I)
x =1 (II)
This is a system of two linear equations and one variable. By computing
(II) − (I), we get the contradiction 0 = 1. Hence, L = ∅.
However, when we are able to transform sufficiently many equations into
the form “0 = 0”, we may find that also an overdetermined system of equa-
tions can be consistent, i.e., it may have either one unique solution or even
infinitely many solutions.
x
Example 4.73. We solve for ∈ R2 :
y
x +y =2 (I)
x −y =0 (II)
3x +y = 4 (III)
−2x +8y = 6 (IV )
103
We use the “table”-notation (by now you should have enough practise to read
this):
x y
(I) 1 1 2
(II) 1 −1 0
(III) 3 1 4
(IV ) −2 8 6
(I) 1 1 2
(II 1 ) 0 −2 −2
(III 1 ) 0 −2 −2
(IV 1 ) 0 10 10
(I) 1 1 2
(II 1 ) 0 1 1
(III 2 ) 0 0 0
(IV 2 ) 0 0 0
(I 1 ) 1 0 1
(II 1 ) 0 1 1
(III 2 ) 0 0 0
(IV 2 ) 0 0 0
Thus, we have a unique solution
1
L={ }.
1
104