Professional Documents
Culture Documents
Undergraduate Mathematics
Amber Habib
Email: amber.habib@snu.edu.in
c 2019 Amber Habib
Version 1.0 January 2019.
http://creativecommons.org/licenses/by-nc-sa/4.0/
or send a letter to Creative Commons, PO Box 1866, Mountain View, CA 94042, USA.
About this Book
Modern mathematics is an immensely successful enterprise. Over the last
century the discipline has expanded exponentially in its depth and scope.
At the start of this period its serious applications were almost entirely in
physics and engineering. Now they extend to computer science, chemistry,
biology, medicine, economics, nance, management, and even the ne arts.
This growing popularity appears, somewhat paradoxically, to be due to an
increased abstraction. By removing material interpretations from the core of
mathematics, this process of abstraction has made the subject universal in
its scope. This book presents a course on the reformulation of mathematics
that has made this possible. It is suitable for being taught during the rst
year of an undergraduate mathematics program.
An early goal is to get students to realize that very often what we say or
write does not convey what we feel and think, and that clarity of expression
is benecial to both parties in a conversation. The basic topics from logic and
set theory are ideal for illuminating these aspects of doing mathematics. Of
course they also lay the foundation for further studies in mathematics, but
the immediate impact that the student can see relates to clarity of thought
and to self-criticism. This is the task of the rst chapter.
i
ii
The desire to touch upon these aspects has inuenced the choice of topics
in the fourth and nal chapter. To appreciate Set theory as a theory, and not
just as a convenient tool for expressing mathematical statements, we must
consider what it has to say about innite sets. The simplest of questions,
such as Is there a smallest innite set?, then force us to face the Axiom of
Choice and its consequences, the useful as well as the bizarre.
Readers will prot more from the formal approach of this book if they
have prior exposure to the basic notions regarding sets and and operations
with them, at the level of the Class XI NCERT textbook [19], or the books
[1, 2] in our References.
The rst three chapters already oer enough material for a typical 45
lecture course. If another 15 lectures are available, the fourth chapter could
also be covered. Alternately, one could skip some of the material of Chapter
3 in order to include the basics of innite sets from Chapter 4 in a 45 lecture
course. One such possibility is to cover all of the rst two chapters, the rst
two sections of the third chapter, and the rst four sections of the fourth
chapter.
Acknowledgments
Signage
End of an example.
End of a proof.
2 The main MTTS activity consists of month-long summer camps for undergraduate
students. MTTS implements a discussion-based pedagogy that provides to most of its
students their rst experience of genuine mathematical activity. Visit the MTTS website
http://mtts.org.in to view their latest activities.
3 http://www.winshell.de.
Contents
1 Sets and Logic 1
1.1 Statements and Operations . . . . . . . . . . . . . . . . . . . 3
1.2 Implication . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
1.3 Statement Forms and Equivalence . . . . . . . . . . . . . . . 15
1.4 Open Sentences and Quantiers . . . . . . . . . . . . . . . . . 21
1.5 Axioms for Sets . . . . . . . . . . . . . . . . . . . . . . . . . . 29
1.6 Sets, Quantiers and Numbers . . . . . . . . . . . . . . . . . 40
4 Innity 165
4.1 Constructing the Natural Numbers . . . . . . . . . . . . . . . 166
4.2 Finite and Innite Sets . . . . . . . . . . . . . . . . . . . . . . 170
4.3 The Axiom of Choice . . . . . . . . . . . . . . . . . . . . . . . 176
4.4 Countable and Uncountable Sets . . . . . . . . . . . . . . . . 181
4.5 Zorn's Lemma and Well-Ordering Theorem . . . . . . . . . . 186
A Supplements 193
v
vi CONTENTS
B Solutions 203
References 245
Index 249
1. 2 + 2 = 4.
2. 0/1 = 0.
3. There is no real number whose square is −1.
4. If the diagonals of a parallelogram are equal then it is a rectangle.
1 George Boole, The Mathematical Analysis of Logic, Macmillan, London, 1847. (http:
//www.gutenberg.org/ebooks/36884)
2 Idries Shah, The Pleasantries of the Incredible Mulla Nasrudin, The Octagon Press,
1995.
1
2 CHAPTER 1. SETS AND LOGIC
These are all assertions of facts, and one goal of a discussion involving them
would be to gure out whether they are correct or not. The rst two are
assertions about mathematical processes, about the result of carrying out an
operation with known numbers. The third is an assertion about the possible
existence of a number with a special property. The fourth claims a rela-
tionship between the property of a parallelogram being rectangular and the
lengths of its diagonals. It is in fact a combination of two simpler assertions
about parallelograms (being rectangular and having equal diagonals). At
the simplest level, we will learn how to correctly understand and express
mathematical assertions such as these. An important aim will be to remove
the ambiguities that are present in our common speech.
Now let us look at a few more sentences that might crop up in a school
mathematics class.
2. We have two collections: the real numbers and the rational members,
and we assert that there is a real number whose square is 2 and this
real number is not a rational number.
The steps we will take to bring more clarity to mathematics are modeled
on Euclid's formulation of geometry over two thousand years ago. Consider
the problem of dening points and lines in a plane. We may consider at-
tempts like A point occupies no space or A line has no width but we soon
realize these are all insucient. In fact, points and lines are fundamental
concepts of geometry and can't be dened in terms of simpler ones. What
matters is not the attempted denition of points and lines but the relation-
ships between them, such as There is a unique line joining any two points.
Euclid called these fundamental relationships postulates (the modern term
for such fundamental relationships is `axioms'). With them as a starting
point, he proceeded to derive all the theorems of geometry. In the same way
we will seek to set down some fundamental notions and relationships from
3
which all of mathematics is to be obtained.
3 Will we succeed? See A.1 for some unexpected twists in this story.
4 CHAPTER 1. SETS AND LOGIC
n 0 1 2
n2 + n + 41 41 43 47
Now I say to you, Look, this expression always produces a prime number
when we substitute a whole number for n! Are you convinced? Of course
not. You feel that three pieces of data are too little for rm conclusions and
you demand more evidence. I provide the following:
n 3 4 5 6 7 8 9 10 11
2
n + n + 41 53 61 71 83 97 113 131 151 173
These are all prime numbers too. Perhaps now you are wavering surely
something more than coincidence is involved here? Suppose I keep showing
you more and more instances of prime numbers being produced by this ex-
pression. Would there be a point when you would say, Please don't show
me any more instances! Now I believe you?
Simple Statements
We wish to set down precise rules for working out the truth or falsity of
mathematical assertions. First, we have the notion of a simple statement,
which has the form Object A has property P. For example, the following are
simple statements:
We may attach a truth value to each simple statement. The truth value is
either true (T) or false (F). If the object actually has the stated property
the statement is termed true, else it is termed false. We cannot allot both
F and T to the same simple statement. It may also happen that we do not
know which one to allot, for we simply may not know enough to make the
4
determination.
The purpose of logic is not so much to gure out whether given statements
are individually true or false, but rather to delineate the consequences of their
being true or false.
Compound Statements
The rst two statements are combinations of Two plus two equals four with
Two plus two equals ve. The third is an alteration of All rectangles are
squares.
We have to be careful while using the words and, or, not. In common
English usage there are ambiguities which we must remove. For instance,
the word or can be used in dierent ways as illustrated below:
In the rst, the usage of or is inclusive: it allows for the possibility that
a student may have taken both analysis and algebra. In the second, it is
exclusive: the possibility of getting 60% and failing is excluded. The dierent
usage is sensed from one's experience with the language and from the context
it is not detected from the structure of the sentence. This is obviously a
possible cause of confusion.
4 See A.1.
6 CHAPTER 1. SETS AND LOGIC
Did the writer mean to convey that no apple is black, or that some apples
are not black? It is dicult to be certain.
To avoid such problems, we shall specify the exact usage of each connective
within the subject of logic.
P Q P ∧Q P Q P ∨Q
T T T T T T
T F F T F T P ¬P
F T F F T T T F
F F F F F F F T
Let us take up the truth table for conjunction. In each row, the rst
entry is a possible truth value of P while the second entry is a possible truth
value of Q. The third entry gives the truth value of P ∧Q corresponding
to the given truth values of P , Q. We need four rows to account for all the
possible combinations of the truth values of P and Q. We see that P ∧Q is
dened to be true exactly when both P, Q are true. If one or both of them
are false then P ∧Q is false.
(d) It is false that two plus two does not equal ve.
(d) It is false that two plus two does not equal ve.
1.2 Implication
Another kind of compound statement is the P implies Q form. The deni-
tion of this form requires more careful thought than we needed for conjuction
and its companions. The reason is that words like `if ' and `implies' have
subtle connotations which we must untangle before we can settle on their
appropriate use in logic. Consider the statement
You can surely see how the divisibility of 36 by 4 follows from the divisibility
of 6 by 2. Therefore, you would accept this as a correct statement. Let's
consider a similar statement:
This may throw you for a second but not longer. From 5 being odd you
deduce that 6 is even and so you again obtain the divisibility of 36 by 4. It
seems that you, and anyone else, would be comfortable saying P implies Q
if the truth of Q could be seen as being caused by the truth of P, perhaps
through a chain of deductions.
√
The real number 2 is irrational implies that the number 36 is divisible
by 4.
This statement involves two true statements, but does it seem reasonable to
√
claim that the irrationality of 2 is responsible for the divisibility of 36 by
4? It does not. Yet, can we show that there is no chain of deductions that
√
moves from the irrationality of 2 to the divisibility of 36 by 4? This task
also seems daunting.
P Q P =⇒ Q
T T T
T F F
F T
F F
Here are two statements, corresponding to the last two rows of this truth
table:
√
If the real number
√2 is rational then the number 36 is divisible by 4.
If the real number 2 is rational then the number 36 is divisible by 5.
In each case, you may again feel uneasy about accepting the implication.
Can a false statement be allowed to imply anything? We argue our way out
of this as follows. The statement P =⇒ Q is concerned with what happens
when P is true. It has nothing to say about what happens when P is false.
A statement that has nothing to say has to be accepted as true rather than
false! (Think about 1 = 1) Hence we will accept both these statements as
true.
any other, such as starting from 2 + 2 = 5 and showing that a certain person
5
M is the Pope:
P Q P =⇒ Q
T T T
T F F
F T T
F F T
This truth table gives the nal and complete denition of =⇒ . We have
tried to motivate why we have made these particular choices. From now
« on you must use this denition alone and not be confused by expectations
arising from other usages of if or implies in ordinary language.
If P then Q, Q whenever P,
Q if P, Q follows from P.
Proof: Let us set up a truth-table for these three statements. Since three
statement variables are involved, the table would have eight rows.
P Q R P =⇒ Q Q =⇒ R P =⇒ R
T T T T T T
T T F T F F
T F T F T T
T F F F T F
F T T T T T
F T F T F T
F F T T T T
F F F T T T
Truth-table proofs are easy to carry out in the sense that they are mechan-
ical and do not strain our creative ability. But, as the last proof illustrates,
they can be tedious and that creates its own possibility of error. They also
do not give the sense of gaining insight into why something is true. A little
later, we will see other ways of proving this result.
x = x2 =⇒ x2 − x = 0 =⇒ x(x − 1) = 0
=⇒ x = 0 or x − 1 = 0 =⇒ x = 0 or x = 1.
(x = x2 =⇒ x2 − x = 0)
2
∧ (x − x = 0 =⇒ x(x − 1) = 0)
∧ (x(x − 1) = 0 =⇒ x = 0 or x − 1 = 0)
∧ (x = 0 or x − 1 = 0 =⇒ x = 0 or x = 1).
Biconditional Statements
P Q P =⇒ Q Q =⇒ P
T T T T
T F F T
F T T T
F F T F
P Q P =⇒ Q Q =⇒ P P ⇐⇒ Q
T T T T T
T F F T F
F T T F F
F F T T T
14 CHAPTER 1. SETS AND LOGIC
(a) P =⇒ (P =⇒ P ), (b) (P =⇒ P ) =⇒ P ,
(e) If the gure is a rectangle but not a rhombus then it is not a square.
Some statement forms always take on the value T, no matter what actual
statements are substituted for the statement variables. These are called
tautologies. Others, called contradictions, only take on the value F.
Example 1.3.1 The statement form P ∨(¬P ) is a tautology, while P ∧(¬P )
is a contradiction, as seen from their truth tables:
P ¬P P ∨ (¬P ) P ¬P P ∧ (¬P )
T F T T F F
F T T F T F
16 CHAPTER 1. SETS AND LOGIC
T ∨ P ≡ T, T ∧ P ≡ P, C ∨ P ≡ P, C ∧ P ≡ C.
P Q P =⇒ Q P Q ¬P ¬P ∨ Q
T T T T T F T
T F F T F F F
F T T F T T T
F F T F F T T
You may have noticed that the equivalence of statement forms appears
similar to biconditionality. Let A and B be two statement forms. They
are equivalent if A and B always have the same truth values. That is, if
A ⇐⇒ B is a tautology.
1. (Idempotence) A ∨ A ≡ A ≡ A ∧ A.
2. (Commutativity) A ∨ B ≡ B ∨ A, A ∧ B ≡ B ∧ A.
3. (Associativity) A ∨ (B ∨ C) ≡ (A ∨ B) ∨ C , A ∧ (B ∧ C) ≡ (A ∧ B) ∧ C .
4. (Distributivity) A ∧ (B ∨ C) ≡ (A ∧ B) ∨ (A ∧ C), A ∨ (B ∧ C) ≡
(A ∨ B) ∧ (A ∨ C).
5. (Absorption) A ∧ (A ∨ B) ≡ A ∨ (A ∧ B) ≡ A.
Negations
Task 1.3.7 Use a truth table to conrm that for any statement form A we
have
¬(¬A) ≡ A.
18 CHAPTER 1. SETS AND LOGIC
Theorem 1.3.8 (de Morgan's Laws) For any two statement forms A and
B we have:
Proof: We work out the truth tables for ¬(A ∧ B) ≡ ¬A ∨ ¬B . The other
law can be derived in a similar fashion.
A B A∧B ¬(A ∧ B) A B ¬A ¬B ¬A ∨ ¬B
T T T F T T F F F
T F F T T F F T T
F T F T F T T F T
F F F T F F T T T
The nal columns of each table have the same truth-values. This estab-
lishes the desired equivalence.
Task 1.3.9 Are any of the following statement forms logically equivalent to
each other?
(a) P =⇒ (Q =⇒ R),
(b) Q =⇒ (P =⇒ R),
(c) (P ∧ Q) =⇒ R.
Task 1.3.10 Check that under any assignation of truth values to P and
Q, at least one of P =⇒ Q and its converse is true. Hence, (P =⇒
Q) ∨ (Q =⇒ P ) is a tautology.
(P =⇒ Q) ∧ (Q =⇒ R) ≡ (¬P ∨ Q) ∧ (¬Q ∨ R)
≡ (¬P ∧ ¬Q) ∨ (¬P ∧ R) ∨ (Q ∧ ¬Q) ∨ (Q ∧ R)
≡ (¬P ∧ ¬Q) ∨ (¬P ∧ R) ∨ (Q ∧ R).
At least one of the three bracket terms in the last expression is true. Let's
do the three cases:
P Q P ∗Q P Q P ∗Q
T T T T T T
(a) T F F (b) T F T
F T T F T T
F F F F F F
3 Is (P ∧ Q) =⇒ (P ∨ Q) a tautology?
(b) The laws of logic (including de Morgan's laws), stating each law or
previously established consequence where it is used.
(b) (P ∧ Q) =⇒ R] ≡ (P =⇒ R) ∧ (Q =⇒ R) .
8 For each of the six statement forms you have written in the previous
exercise, nd out whether it is a tautology or a contradiction (or neither).
11 Show that any statement form is logically equivalent to one that only
uses the implication and negation operators.
We cannot assign this a truth value because we don't know the value of n.
On the other hand, if we knew that n had a certain value then this sentence
would become a statement. For example, if we knew that n = 99 then the
sentence would become a true statement:
Or, consider the open sentence Q(x, y): x + y = 0. Then Q(2, 3) is the (false)
statement 2 + 3 = 0.
Let R(x) denote the open sentence The gure x is a rhombus and S(x)
denote The gure x is a square. Then the given sentence is expressed by
R(x) =⇒ S(x).
Task 1.4.3 Let P (n) be the open sentence The integer n is an odd number.
Express the following symbolically: If 5 and 7 are odd then 35 is odd.
To cover these examples we introduce the symbols ∀ ( for all) and ∃ (there
exists): 6
The symbols ∀ and ∃ are called quantiers. The symbol ∀ is called the
universal quantier while ∃ is called the existential quantier.
6 Did you notice that these symbols are just A (`All') and E (`Exists') rotated by 180◦ ?
Example 1.4.4 The statement All squares are rectangles can be written
as
∀x (S(x) =⇒ R(x)),
where S(x) is the statement The object x is a square and R(x) is The
object x is a rectangle. The symbolic expression ∀x (S(x) =⇒ R(x)) is
read as Every object x which is a square is also a rectangle.
The previous tasks illustrate that ∃x (P (x) =⇒ Q(x)) does not have
the meaning we may expect it to have. While it is allowed to write such
« a statement, it is unlikely to come up in any meaningful discussion. For
it becomes true as soon as there is a value of x which makes P (x) false.
Implication goes naturally with ∀.
First, we identify the open sentences that are used in this statement. Let
N (x) stand for x is a natural number and E(x) stand for x is an even
number. Also, let F (x, y) stand for y is a factor of x.
The statement asserts the existence of a natural number with certain
properties. So it will have the form
∃x [N (x) ∧ · · · ],
24 CHAPTER 1. SETS AND LOGIC
where the dots ··· indicate the property that we have to include. The prop-
erty is about every natural number which is a factor of x. So our statement
becomes
∃x [N (x) ∧ ∀y [(N (y) ∧ F (x, y)) · · · ]].
We have brought in all the factors y of x. Now we just have to assert that if
something is such a factor then it is even. So the nal form of the statement
is
∃x [N (x) ∧ ∀y [(N (y) ∧ F (x, y)) =⇒ E(y)]].
When a statement has two or more quantiers, the order in which they
appear is critical to the meaning of the statement. The consequences of
« writing ∀x ∃y (· · · ) can be quite dierent from those of ∃y ∀x (· · · ). For
an example, see Exercise 6 below.
Example 1.4.9 Let us negate the statement All squares are rectangles.
Recall that it can be written as
∀x (S(x) =⇒ R(x)),
∃x [S(x) ∧ ¬R(x)],
which is read as There is a square that is not a rectangle.
Task 1.4.11 For each statement in Task 1.4.8 express the negation symbol-
ically and then in English.
Task 1.4.13 Can you imagine circumstances under which No roses are red
would be the negation of All roses are red? Or is that impossible?
For example, the existence and uniqueness of zero among the integers
can be expressed as
Task 1.4.15 Is it true that either ∀x (P (x) =⇒ Q(x)) or its converse must
be true?
Here, I(n) is n is an integer. The two variables x and n have very dierent
roles. The value of x is imposed from outside the sentence and determines
whether it will become true or false. If we write
∃n I(n) ∧ (y = 2n) ,
∃m I(m) ∧ (x = 2m) ,
the truth value of the sentence would not be aected. We call n a bound
variable.
1 Let N (x) be the open sentence x is a natural number. Using only this
open sentence, with a suitable choice of variables and quantiers, express the
following symbolically:
2 Let P (n) be an open sentence in which n varies over the natural numbers.
Will the truth of the following imply the truth of P (99)?
3 Does the truth of Every dragon is red imply the truth of There is a
red dragon? (Here, we take a break from our decision to only talk about
collections of mathematical objects.)
(a) For any integer x there is at least one integer y such that x + y = 1.
(b) For any integer x there is exactly one integer y such that x + y = 1.
28 CHAPTER 1. SETS AND LOGIC
(a) Given any two distinct points in the plane, there is a unique line
through them.
(d) Given a line m and a point x not on m, there is a unique line through
x m.
that is parallel to
(a) ∀x P (x) =⇒ (Q(x) ∨ R(x)) ⇐⇒ ∀x (P (x) =⇒ Q(x))∨
∀x (P (x) =⇒ R(x))
(b) ∀x (P (x) ∨ Q(x)) =⇒ R(x)] ⇐⇒ ∀x (P (x) =⇒ R(x))∨
∀x (Q(x) =⇒ R(x))
(c) ∀x P (x) =⇒ (Q(x) ∧ R(x)) ⇐⇒ ∀x (P (x) =⇒ Q(x))∧
∀x (P (x) =⇒ R(x))
10 Let x take values over real numbers and let f vary over the real functions
(functions whose domain and codomain consist of real numbers). Consider
the following open statements:
C(f, x) : f is continuous at x
D(f, x) : f is dierentiable at x
(a) ∃x (M = 2x),
(b) ∀x ∀y [(M = xy) =⇒ (x = 1 ∨ x = M )],
(c) (∀x (x = x)) ∧ (∃y (y 6= x)).
You have probably seen a set being described in one or more of the
following ways:
None of these is very satisfactory. What does well-dened mean, and does
it matter whether it qualies the collection or the objects? The usual inter-
pretation is that well-dened refers to clarity about whether any particular
object belongs to the candidate set or not. Given a set and an object we
should be able to say, without ambiguity or doubt, whether the object is a
member of the set.
30 CHAPTER 1. SETS AND LOGIC
Let C be the collection of all sets that are not members of themselves.
Although such collections are useful for organizing our thoughts they are
not, strictly speaking, part of mathematics. For the objects in these sets
are not mathematical objects. Cars, dogs, tables, countries and citizens
cannot be dened purely by mathematical means. On the other hand, the
set of all even integers can be conceived purely within mathematics. Set
theory is concerned only with such sets. You may well ask here, How am
I to recognise a mathematical object? The answer we adopt, the standard
answer for about a century now, is that sets themselves are the mathematical
objects. All the objects and operations of mathematics can be viewed as sets!
This may seem unreasonable at the moment but as you read this book the
details will become clear.
We shall often use lower case letters such as x and y for sets that occur
as members of other sets in our discussion. This is merely a convention
that provides visual cues which are useful for following an argument. The
notation x∈A is also read as x is an element of A or x belongs to A or
x is in A.
We consider two sets to be equal if they can substitute for each other
32 CHAPTER 1. SETS AND LOGIC
in any statement without altering its truth value. Our rst axiom describes
just when two sets can be considered to be equal in this manner.
ZF1. Axiom of Extension: Two sets are equal if and only if they have the
same members.
Given the work we have put in to learn how to express statements symboli-
cally, we should try our hand at expressing the ZF axioms symbolically. For
that, we need to include the quantiers ∀ and ∃, with the understanding
that they apply to our universe B . That is, ∀A stands for for every set A
and ∃A stands for there is a set A. The Axiom of Extension can then be
expressed as:
∀A ∀B [A = B ⇐⇒ ∀x (x ∈ A ⇐⇒ x ∈ B)].
The next axiom gives us a set which we can use as a basic building block,
and the following two axioms give us ways to combine known sets to get new
sets:
ZF2. Axiom of the Empty Set: There is a set that has no members. It is
called the empty set.
This axiom can be expressed symbolically as follows: ∃A ∀x (x ∈
/ A).
ZF3. Axiom of Pairs: Given sets x, y there is a set whose members are x
and y alone.
ZF4. Axiom of Union: If F is a set then there is a set called the union of
F whose members are exactly the members of the members of
S S F. The
union of F is denoted by F or A∈F A.
S
• Consider sets A and B . Using ZF3 we obtain F = {A, B}. Then F
is also denoted by A ∪ B and is called the union of A and B .
Our method of using {x} to represent the set whose only member is x,
and {x, y} to represent the set whose only members are x and y , is called the
roster notation for describing a set. One can use this for sets with more
elements as well. For example if the set has only a, b, c, d, e as members
then we can represent the set by {a, b, c, d, e}. In the other direction, the
empty set is represented by {}.
The group of three dots is called ellipsis and stands for and so on or and
so on, until.
Starting with the empty set ∅ we can use ZF3 and ZF4 to create many
more sets:
E=∅ (ZF2)
0
E = {∅} (ZF3)
00 0 0
E = E ∪ {E } = {∅, {∅}} (ZF3, ZF4)
.
.
.
34 CHAPTER 1. SETS AND LOGIC
These axioms allow us to build larger sets from smaller ones. Another
possibility is to extract smaller sets from a given set by selecting some of
its elements. To some extent this can be achieved by the axioms we already
have. For example, suppose we are given that A = {α, β, γ} is a set. We can
use Axiom ZF3 to create the sets {α}, {β}, {γ}, {α, β}, {α, γ}, {β, γ}.
A ⊆ B to mean A is a subset of B , i.e. every member of
We shall use
A B . Note that this allows a set to be a subset of itself.
is also a member of
When A is a subset of B we also say that B is a superset of A and denote
this by B ⊇ A. The usage is akin to that of ≤ and ≥ for numbers.
A related warning is that the truth of x∈A and A∈B does not imply
«
x ∈ B.
A ⊆ B : ∀x (x ∈ A =⇒ x ∈ B),
A B : ∀x (x ∈ A =⇒ x ∈ B) ∧ ∃x (x ∈ B ∧ x ∈
/ A),
A * B : ∃x (x ∈ A ∧ x ∈
/ B).
∀x (x ∈ ∅ =⇒ x ∈ A).
7 It is common to use the simpler notation A ⊂ B to denote subsets. Unfortunately,
some authors use ⊂ to denote only proper subsets, while others allow it to include equality.
For this reason, we shall not use this notation in this book.
Let P (x) be the open sentence corresponding to a property that a set x may
8
have. Then the Axiom of Separation can be stated as follows:
∀A ∃B ∀x [x ∈ B ⇐⇒ (x ∈ A ∧ P (x))].
It is worth noting what ZF5 does not allow. It does not allow applying a
property to all objects of the universal domain B and then collecting the
ones that satisfy that property into a set. A property can only be used to
separate out subsets from something that is already known to be a set, and
we shall see a little later (Theorem 1.5.15) that B cannot be a set.
{ x ∈ A | ∃k (k ∈ A ∧ x = 2k) }.
Example 1.5.9 Here are some sets expressed in both roster and set-builder
notations, with A being the set of natural numbers from 1 to 100.
Roster Set-builder
{} {x ∈ A | x2 = 2 }
{1} {x ∈ A | x2 = 1 }
{1, 2} {x ∈ A | x2 − 3x + 2 = 0 }
{1, 2, 3, . . . , 10} {x ∈ A | 1 ≤ x ≤ 10 }
The requirement that the property should not involve the subset we are
trying to dene, is needed to avoid circular reasoning. For example, it would
be silly to dene a subset B of A by the expression B = {x ∈ A | x ∈ B }
and so our axioms must explicitly ban it.
8 We have a separate statement for each property. So this axiom is actually a collection
of axioms, one for each property, and is called an axiom schema.
36 CHAPTER 1. SETS AND LOGIC
Theorem 1.5.13 If F is a set then there is a set whose members are exactly
those sets that are members of every member of
T F. (This set is called the
intersection
T
of F and is denoted by F or A∈F A.)
Proof: We use the Axioms of Union and Separation to dene the desired set
as follows:
\ [
F = {x ∈ F | ∀A (A ∈ F =⇒ x ∈ A) }.
The sixth axiom allows us to collect the subsets of a set into a set.
ZF6. Axiom of the Power Set: If A is a set then there is a set whose members
are precisely the subsets of A. This set is called the power set of A
and is denoted by P(A).
As an example, let A = {α, β, γ} be a set. Then
P(A) = {∅, {α}, {β}, {γ}, {α, β}, {α, γ}, {β, γ}, A}.
All the axioms so far are oriented towards allowing us to construct various
sets. Zermelo claimed that if we use only these axioms for constructing
9
sets, then we would not run into paradoxes. For example, he showed that
Russell's Paradox would be avoided by rst proving the following result:
Does M0 belong to itself ? If yes, then by its denition, it must not belong
to itself, a contradiction. If no, then by its denition, it must belong to itself,
also a contradiction. So M0 ∈
/ M.
This shows that the universal domain B cannot be taken to be a set in
this system of axioms. If it were a set, then every set would be a member
of the set B, contradicting the last theorem. One consequence is that we
cannot take the C of Russell's Paradox to be a set, since the property A∈
/A
is not being applied to select from a set.
The following tasks are meant to hone your skill in comprehending and
manipulating sets. Let us start with an example.
A = B ⇐⇒ (A ⊆ B and A ⊇ B).
First, suppose that A⊆B and A⊇B are both true. To prove A=B we
need to check that these two sets have the same members (by the Axiom of
Extension). Consider any member x of A. Since A ⊆ B , x is also a member
ofB . So every member of A is also a member of B . Now consider a member
y of B . Since A ⊇ B , y is a member of A. So every member of B is also
a member of A. Hence A and B have exactly the same members and we
obtain A = B .
Conversely, suppose that A = B is true. Then the two sets have the same
members. Since members of A are members of B , we have A ⊆ B . And since
members of B are members of A, we have A ⊇ B .
9 And so far, he has not been proved wrong!
38 CHAPTER 1. SETS AND LOGIC
Task 1.5.17 Let A be a set. Use the ZF axioms to show the following:
(a) A ∪ ∅ = A, (b) A ∩ ∅ = ∅.
Task 1.5.18 Let A, B be sets. Show that each of the following implies the
others:
1. (Idempotence) A ∪ A = A ∩ A = A.
2. (Commutativity) A ∪ B = B ∪ A, A ∩ B = B ∩ A.
3. (Associativity) A ∪ (B ∪ C) = (A ∪ B) ∪ C , A ∩ (B ∩ C) = (A ∩ B) ∩ C .
A ∩ (B ∪ C) = (A ∩ B) ∪ (A ∩ C),
4. (Distributivity) A ∪ (B ∩ C) =
(A ∪ B) ∩ (A ∪ C).
5. (Absorption) A ∩ (A ∪ B) = A ∪ (A ∩ B) = A.
6. (Double complement) (Ac )c = A.
7. (de Morgan's Laws) (A ∪ B)c = Ac ∩ B c , (A ∩ B)c = Ac ∪ B c .
x ∈ A ∩ (B ∪ C) ⇐⇒ (x ∈ A) ∧ (x ∈ B ∪ C)
⇐⇒ (x ∈ A) ∧ [(x ∈ B) ∨ (x ∈ C)]
⇐⇒ [(x ∈ A) ∧ (x ∈ B)] ∨ [(x ∈ A) ∧ (x ∈ C)]
⇐⇒ [x ∈ A ∩ B] ∨ [x ∈ A ∩ C]
⇐⇒ x ∈ (A ∩ B) ∪ (A ∩ C).
x ∈ (A ∪ B)c ⇐⇒ (x ∈ U ) ∧ ¬(x ∈ A ∪ B)
⇐⇒ (x ∈ U ) ∧ ¬[(x ∈ A) ∨ (x ∈ B)]
⇐⇒ (x ∈ U ) ∧ [(x ∈
/ A) ∧ (x ∈
/ B)]
⇐⇒ [(x ∈ U ) ∧ (x ∈
/ A)] ∧ [(x ∈ U ) ∧ (x ∈
/ B)]
⇐⇒ (x ∈ Ac ) ∧ (x ∈ B c )
⇐⇒ x ∈ (Ac ∩ B c ).
(a) The lines parallel to a given line m in the plane (Use the notation of
Example 1.6.1),
3 Let A, B and C be sets. Explain, in detail, how you would use the set
theory axioms to dene:
(a) A ∪ B ∪ C, (b) A ∩ B ∩ C.
4 Let A, B and C be given sets. Write the converse of each of the following:
What can you say about the truth values of the above statements and their
converses?
(a) A ∪ C ⊆ B ∪ C, (b) A ∩ C ⊆ B ∩ C.
(a) A ⊆ B, (b) B ⊆ A.
8 Show that the following equalities fail for some sets A, B . Further, modify
the right hand side so that equality becomes correct for all sets.
(a) A \ (A \ B) = B , (b) A \ (B \ A) = A \ B .
(a) B ⊆ A =⇒ A \ (A \ B) = B ,
(b) A ∩ B = ∅ =⇒ A \ (B \ A) = A \ B .
Task 1.6.2 Using the notation and approach of Example 1.6.1, express the
following axioms symbolically:
How will negation work in this notation? Consider the statement that every
member of A has property P . It will be false if and only if there is a member
ofA that does not have property P . Hence, the negation of (∀x ∈ A) P (x) is
(∃x ∈ A) ¬P (x). Similarly, the negation of (∃x ∈ A) Q(x) is (∀x ∈ A) ¬Q(x).
We dene the converse and contrapositive of (∀x ∈ A) [P (x) =⇒ Q(x)]
as follows:
42 CHAPTER 1. SETS AND LOGIC
Task 1.6.4 Prove that (∀x ∈ A) [P (x) =⇒ Q(x)] and its contrapositive
are equivalent.
Sets of Numbers
Let us undertake a brief review of some basic sets that we use all the time
in mathematics, and which we have already been referring to in this book
without much comment. These are the various sets of numbers, and their
subsets. For the time being we shall take it for granted that these are sets
and that they can be constructed on the basis of the axioms of set theory.
We'll return to this issue in Chapter 3. Our desire right now is to just collect
the basic notation and facts so that we can use them as examples to illustrate
and explore the various concepts of set theory.
: Z = {. . . , −2, −1, 0, 1, 2, . . . }
Integers
p
Rational Numbers : Q = p ∈ Z, q ∈ Z, q 6= 0
q
One can see that the natural numbers are the fundamental objects. Every
other system of numbers is eventually dependent on them. In school, the
natural numbers are introduced as a means of counting, and the integers
and rational numbers help us to frame and answer more involved questions
related to counting. For example, the question If two loaves of bread are
completely and equally shared among ve girls, how much will each girl
get? has no sensible answer if we only know natural or whole numbers, but
is easily answered if we understand rational numbers.
10 Eudoxus of Cnidus (c. 390 to c. 340 BC), Euclid of Alexandria (born c. 300 BC),
Archimedes of Syracuse (c. 287-212 BC). The dates are approximate.
11 A construction of real numbers from rational numbers is described in A.3.
44 CHAPTER 1. SETS AND LOGIC
of the form a + ib where a and b are real numbers and the symbol i is dened
to have the property i2 = −1.
Now we quickly summarize some denitions and notation involving num-
bers and sets of numbers. We will use these in our examples and exercises.
However, we shall not use these sets in any theorems until we have described
them formally in Chapter 3.
• The notation introduced above for R can also be adapted for Q. For
example, Q∗ = { x ∈ Q | x 6= 0 } and Q+ = { x ∈ Q | x > 0 }.
• Given two real numbers a and b, the greater one is called their max-
imum and is denoted by max{a, b}. The smaller one is called their
minimum and is denoted by min{a, b}. If a and b are equal their
common value serves as both maximum and minimum.
Task 1.6.5 Express the sets R>0 and R≥0 in interval notation.
In this context, ∞ and −∞ are just symbols that help to identify particular
« unbounded intervals. They do not stand for any actual objects called
innity or minus innity.
(a) { x ∈ N | x | 100 },
(b) { x ∈ R | x2 + x + 1 = 0 },
(c) { x ∈ R | x2 − 1 > 0 } .
5 Express the following in symbols, then write out the negation in symbols,
and nally write the negation as a sentence in English:
(a) For every real number x there is a natural number n that is greater
than x.
(b) If m, a and b are natural numbers such that m divides ab then m
divides either a or b.
(c) If two lines meet at two distinct points then they are the same line.
6 Express the following in symbols, then write out the converse in symbols,
and nally write the converse as a sentence in English:
(b) The product of an even natural number with any natural number is
even.
(b) The product of an even natural number with any natural number is
even.
9 Let X be the set of intervals (0, x) with x ∈ (0, ∞). Justify the following
statements:
S T
(a) X = (0, ∞), (b) X = ∅.
10 Let Y be the set of intervals (−x, x) with x ∈ (0, ∞). Justify the following
statements:
S T
(a) Y = R, (b) Y = {0}.
13 Let p and q be rational numbers with p < q. Dene the interval (p, q)Q
by
(p, q)Q = { x ∈ Q | p < x < q }.
(a) Let F be the set whose members are the intervals
T (0, 1/n)Q with
n ∈ N. Show that F = ∅.
(b) Let G be the set whose
S members are the intervals (0, 1 − 1/n)Q with
n ∈ N. Show that G = (0, 1)Q .
2 | Relations and Functions
Good mathematicians see analogies between theorems or theories;
the very best ones see analogies between analogies.
1
Stefan Banach
For example, even at age four, many children can share sweets
fairly between two recipients by using correspondences: they share
giving one-for-you, one-for-me, until there are no sweets left.
They do sometimes make mistakes but they know that, when the
sharing is done fairly, the two people will have the same amount
of sweets at the end.
2
Terezinha Nunes and Peter Bryant
49
50 CHAPTER 2. RELATIONS AND FUNCTIONS
For our next step forward, we again look at some mathematical state-
ments:
You may not understand all the terms used in these theorems. Do not worry
about that. We are interested in the structure of these theorems, and we
can appreciate the main features even without a detailed knowledge of the
terms. And the essentials will be reviewed below.
First of all, we note that these are existence theorems. The Fundamental
Theorem of Algebra asserts the existence of a complex number that is a root
of the given complex polynomial. The Fundamental Theorem of Calculus
asserts the existence of a function whose derivative is the given continuous
function. The Fundamental Theorem of Arithmetic asserts the existence of
some primes that can be multiplied with each other to get the given natural
number.
A B
FT of Arithmetic Natural Numbers Prime Numbers
FT of Algebra Complex Polynomials Complex Numbers
FT of Calculus Continuous functions Dierentiable functions
There is a common pattern involving these two sets. Each theorem considers
the elements of A and asserts the existence of corresponding elements of B
which are in a certain relationship to the elements of A.
In this chapter we will learn how to work systematically in such frame-
works. First, we need a language for talking about pairs of objects, where
each object is chosen from a particular set. This language, of cartesian prod-
ucts and ordered pairs, is developed in the rst section of the chapter. In the
remaining sections we carry out a description of relations and functions, and
put them to work. When you have completed this chapter you will possess
We could hope to do this for any two sets A and B . Their cartesian
product would consist of all ordered pairs (a, b) with a ∈ A and b ∈ B .
Such pairs would be useful as a means of discussing connections between the
members of A and B . The rst task is to show that the collection of ordered
pairs constitutes a set in the sense of Zermelo's axioms.
Theorem 2.1.1 Two ordered pairs are equal if and only if the corresponding
elements are equal:
Proof: Suppose (a, b) = (c, d). Then {{a}, {a, b}} = {{c}, {c, d}}. Equality
of these two sets means their elements must match.
One is tempted to argue that since the elements match, we have {a} = {c}
7 and {a, b} = {c, d}, due to the size of the sets. But then we are making
the error of assuming, for instance, that a 6= b.
Since the elements must match we have {a} = {c} or {a} = {c, d}. In both
cases, we conclude that a = c, and replacing c by a gives us
{{a}, {a, b}} = {{a}, {a, d}}.
• Ifb 6= a we have {a, b} =
6 {a} and hence {a, b} = {a, d}. It follows that
b = d.
52 CHAPTER 2. RELATIONS AND FUNCTIONS
Thus all the ordered pairs are present in the set P(P(X)). This opens
the prospect of using the Axiom of Separation to show that the collection of
ordered pairs constitutes a set:
A × B = { (x, y) ∈ X × Y | x ∈ A ∧ y ∈ B }.
y -axis
b (a, b)
x-axis
O a
y -axis
x-axis
−1 O 1 2 3
Z × R = { (x, y) ∈ R2 | x ∈ Z ∧ y ∈ R } = { (x, y) ∈ R2 | x ∈ Z }.
For a xed x∈Z our set includes all the points (x, y) with y varying over
R, and these points constitute a vertical line.
y -axis
x-axis
−1 0 1 2 3
We have been working with subsets of R2 that have the form A×B . Does
2
every subset of R have this form?
Example 2.1.4 2 2 2
Consider the subset S = { (x, y) ∈ R | x + y = 1 } of
R2 . Can it be of the form A × B ? If yes, then we would have S =
{ (x, y) ∈ R2 | x ∈ A ∧ y ∈ B }. This enables the following implications:
02 + 12 = 1 =⇒ (0, 1) ∈ S =⇒ 0 ∈ A,
12 + 02 = 1 =⇒ (1, 0) ∈ S =⇒ 0 ∈ B,
0 ∈ A ∧ 0 ∈ B =⇒ (0, 0) ∈ A × B = S.
Task 2.1.5 For each of the following subsets of R2 , check if it is of the form
A × B:
(a) { (x, y) ∈ R2 | 0 ≤ x ≤ 1, 0 ≤ y ≤ 1 },
(b) { (x, y) ∈ R2 | y ≥ 0 },
(c) { (x, y) ∈ R2 | x2 + y 2 ≤ 1 },
(d) { (x, y) ∈ R2 | |x| + |y| ≤ 1 }.
B
3×
2×
1×
× × × × A
a b c d
B
3×
2×
1×
× × × × A
a b c d
Let us consider the rules relating cartesian product with the set operations
of union, intersection and complement.
1. A × (B ∪ C) = (A × B) ∪ (A × C),
2. A × (B ∩ C) = (A × B) ∩ (A × C).
Proof: We shall prove the rst of these `distributive laws' and leave the
second to the reader:
(x, y) ∈ A × (B ∪ C) ⇐⇒ x ∈ A ∧ y ∈ B ∪ C
⇐⇒ x ∈ A ∧ (y ∈ B ∨ y ∈ C)
⇐⇒ (x ∈ A ∧ y ∈ B) ∨ (x ∈ A ∧ y ∈ C)
⇐⇒ (x, y) ∈ A × B ∨ (x, y) ∈ A × C
⇐⇒ (x, y) ∈ (A × B) ∪ (A × C) .
1 Show that the following can also be used as a denition of ordered pair:
2 Consider the denition P = {{x}, {x, y}} of the ordered pair (x, y). Show
that the coordinates x and y are the unique objects satisfying the following
properties:
(a) For x: x ∈ ∩P ,
(b) For y : (y ∈ ∪P ) ∧ [(∃S1 ∈ P )(∃S2 ∈ P )(S1 6= S2 ) =⇒ (y ∈
/ ∩P )].
4 Let A = {−2, −1, 0, 1, 2}. Express the given subsets of A×A using the
set-builder notation:
2 × 2 × 2 ×
1 × 1 × 1 ×
0 × 0 × 0 ×
−1 × −1 × −1 ×
−2 × −2 × −2 ×
× × × × × × × × × × × × × × ×
−2 −1 0 1 2 −2 −1 0 1 2 −2 −1 0 1 2
(a) (b) (c)
(a) (A × B) ∩ (C × D) = (A ∩ C) × (B ∩ D),
(b) (A × B) ∪ (C × D) = (A ∪ C) × (B ∪ D).
10 (a) Extend the denition (a, b) = {{a}, {a, b}} of an ordered pair to an
ordered triple (a, b, c). Your denition can be counted as successful if
you succeed in proving that (a, b, c) = (a0 , b0 , c0 ) if and only if a = a0
0 0
and b=b and c=c.
(b) Use part (a) to dene the Cartesian product A×B ×C of three sets.
2.2 Relations
Ordered pairs and cartesian products give us a way to talk about pairs of
elements where each member is from a particular set. Now consider the
Fundamental Theorem of Algebra (page 50). Let C be the set of complex
numbers and let C[z] be the set of complex polynomials in the variable z.
The Fundamental Theorem of Algebra can be seen as a statement about
certain ordered pairs in C[z] × C. It says that if a polynomial p(z) ∈ C[z]
is non-constant then there is an ordered pair (p(z), w) ∈ C[z] × C such that
p(w) = 0.
The pairs (p(z), w) such that p(w) = 0, give a complete description of
the relationship of a complex number being a root of a complex polynomial.
B
3×
2×
1×
× × × × A
a b c d
We see that:
Equivalence Relations
In mathematics we often ask whether two objects are of the same type in
some sense. For example, in plane geometry, we may consider two triangles
to be of the same type if they are congruent. Or, we may consider them to
be of the same type if they are similar. Among integers, we may consider
two numbers to be of the same type if they have the same remainder when
divided by 2. This splits the integers into the even and odd ones.
Reexivity: ∀x ∈ X , xRx.
Transitivity:
[(x1 , y1 )] = { (x, y) ∈ R2 | x = x1 }.
(x1 , y1 )
x = x1
Note that two distinct equivalence classes are parallel lines and have no
common points.
Task 2.2.4 2 2 2
Let (xj , yj ) ∈ R , j = 1, 2. We say (x1 , y1 ) ∼ (x2 , y2 ) if x1 +y1 =
2 2 2
x2 + y2 . Show that this denes an equivalence relation on R . Identify the
equivalence classes.
1. [x] = [y] ⇐⇒ x ∼ y .
2. [x] 6= [y] =⇒ [x] ∩ [y] = ∅.
Proof:
0 = {. . . , −6, −3, 0, 3, 6, . . . },
1 = {. . . , −3, 1, 1, 4, 7, . . . },
2 = {. . . , −4, −1, 2, 5, 8, . . . },
3 = {. . . , −3, 0, 3, 6, 9, . . . } = 0.
We see that the equivalence classes 0, 1, and 2, are distinct but 3 = 0. In fact
we can see that 0 consists of integers of the form 3k , 1 consists of integers of
the form 3k + 1, and 2 consists of integers of the form 3k + 2. Since every
integer is one of these forms, these three are all the equivalence classes.
Ordered Sets
Often we want to rank objects in some way. If two integers are not equal,
we wish to know which is greater. If two sets are dierent, we wonder if
one is contained in the other. We may not always have a clear answer. For
example, given two sets it may happen that neither is a subset of the other.
Reexivity: ∀x ∈ X , x ≤ x.
Anti-Symmetry: x ≤ y ∧ y ≤ x =⇒ x = y .
Transitivity: x ≤ y ∧ y ≤ z =⇒ x ≤ z.
We read x≤y as x is less than or equal to y or y is greater than or equal
to x. We also use y≥x as equivalent to x ≤ y.
{1, 2, 3}
This ordering really is partial, in the sense that there are pairs of elements
A, B that are unrelated: Neither A⊆B nor B ⊆ A.
8 12
4 6 9 10
2 3 5 7 11
1
Task 2.2.11 Let ≤ be a partial order on X . Show that the associated strict
order < has the following properties:
(a) (Irreexivity) ∀x ∈ X , x ≮ x,
(b) (Transitivity) x < y ∧ y < z =⇒ x < z .
Task 2.2.12 Let < be a strict order on X, i.e., < is a relation on X that
is irreexive and transitive. Dene a relation ≤ on X by x≤y if x < y or
x = y. Show that ≤ is a partial order.
Task 2.2.13 Show that the greatest and least elements are unique, if they
exist.
{1, 2, 3} {1, 2, 3}
00
• S has neither a greatest element nor a least element.
Can you see why {2} is not the least element of S0 and S 00 ?
This example illustrates that the greatest element may not exist. Some-
times all we need is a weaker condition the existence of an element that
has nothing above it, even though it may not itself be greater than all other
elements. This leads to the following denition:
Task 2.2.15 Show that if a set has a greatest element then that element is
also maximal. And a least element is also minimal.
Note that the formal denition allows the upper and lower bounds to be
outside of S.
In general, given several upper bounds for a set A, the lower ones give
more information, and so we wonder whether there is a least upper bound.
This would be the minimum element of the collection of all upper bounds of
A, if it exists. The least upper bound of A is also called the supremum of
A and is denoted by sup(A).
1 2
a b
Let A = {a, b}. The upper bounds of A form the set {1, 2}. This set has no
minimum so sup(A) does not exist.
α β γ
a b c
66 CHAPTER 2. RELATIONS AND FUNCTIONS
Let A = {a, b, c}. The upper bounds of A form the set {0, α, β, γ}. This set
has 0 as its minimum, so sup(A) = 0.
Totality: x, y ∈ X =⇒ x ≤ y or y ≤ x.
Of the orders we have considered so far the only total order is the one dened
on Z by n≤m if m − n ∈ W. We shall call this the standard order on Z.
⇐⇒ (x, y) = (x0 , y 0 ).
The last equivalence is obtained by applying the distributive law and
dropping every term that is a contradiction.
Translation: The author worked it out in his head, but realized typing it
out would be drudgery. He asks you to spare him and work it out yourself.
7 To give him his due, he honestly believes this will be good exercise and
will strengthen your mathematical muscles.
The dictionary order is also a total order. If (x, y) 6= (a, b) the two ordered
pairs dier in at least one coordinate and that will ensure that one is greater
than the other. In the diagram drawn below, the white circles mark the
ordered pairs that are greater than a certain (a, b), and the grey circles mark
the ones that are lesser.
y=b
x=a
Theorem 2.2.21 Let ≤ be a total order on X, and let < be the associated
strict order. Then < has the following properties:
Irreexivity: ∀x ∈ X , x ≮ x.
Trichotomy: ∀x, y ∈ X , exactly one of the following holds: x < y , y < x,
x = y.
Transitivity: x < y ∧ y < z =⇒ x < z .
Proof: The property of being irreexive is part of the denition of <. (Put
y=x in the denition.)
1 Let R be a relation on X. What can you say about its domain and range
if
68 CHAPTER 2. RELATIONS AND FUNCTIONS
(b) x̄ 6= ȳ =⇒ x̄ ∩ ȳ = ∅.
Prove that R is an equivalence relation.
(a) If A and B are bounded above then A∪B will be bounded above.
(b) If A and B have greatest elements then A∪B will have a greatest
element.
(c) If A and B have maximal elements then A∪B will have a maximal
element.
2.3 Functions
Consider a device that takes in some kind of input and converts it into an
output. What kind of abstract framework would be useful for describing such
devices? First, we need to know what are the allowed inputs into the device.
In the language of mathematics, we need to know the set from which the
inputs can be taken. Similarly, we need to know the set of possible outputs.
And we also need to know what happens for each possible input what is
the corresponding output?
Input Output
BP
BP
BP
X Y
(Just for fun: What do you think this `device' is doing to the people passing
through it?)
Let X be the set of allowed inputs and Y be the set of possible outputs.
Clearly, we are again looking for a relation from X to Y. What special
70 CHAPTER 2. RELATIONS AND FUNCTIONS
properties should the relation have? We must know the set of allowed inputs
exactly. We don't want to feed in something that the device cannot handle!
This means the domain of the relation must be all of X. On the other hand,
no harm may be done if we have extra elements in what we think is the
set of outputs. Those elements will just never show up as outputs, and so
what? The implication is that the range of the relation need not be all of
Y. Finally, we usually expect a device to produce one output per input. So
each element of X should be related to only one element of Y.
You can imagine devices which don't t this narrow framework. Maybe
they have a random element. Maybe their allowed inputs change with time.
Still, this simple type of device of ours is common enough and it has been
very fruitful to set up a corresponding formal denition, as done below:
(A) (B)
(C) (D)
(A) does not represent a function because there is a point in the domain that
has no image. (B) also does not represent a function, since there is a point in
the domain that has two images. (C) and (D) do represent functions, since
it is permitted for points in the codomain of a function to have no pre-image
as well as to have multiple pre-images.
(a) F = { (m, n) ∈ Z × N | m = n },
(b) G = { (m, n) ∈ N × Z | m = n }.
The relation F has dom(F ) = N 6= Z, since (m, n) ∈ F ⇐⇒ m = n =⇒
m∈N and m ∈ N =⇒ (m, m) ∈ F . Hence F is not a function.
For G, we see that the same reasoning gives dom(G) = N. Also (m, n) ∈
G =⇒ m = n and hence each point of the domain is associated to only one
point of the codomain. Hence G is a function.
We have been using the name f for a function. This is simply the most
commonly used notation for a function (as x is for a variable), but we are
free to use any other letter, symbol, or word. Other popular choices are g,
h, u, v , F , G, H , η , θ and so on. Functions that are particularly important
have their own names such as sin, cos, exp and log.
A function may also be called a map or mapping. If f is a function and
f (x) = y we say that f maps x to y.
Further, suppose f: X → Y is given by a clear rule, such as f (x) = x2 .
2
Then we represent the rule by a special arrow: x 7→ x .
1-1 Functions
Task 2.3.8 Show that f: X → Y is 1-1 if and only if f (a) = f (b) implies
a = b.
a
b
(A) (B)
The function in (A) is 1-1 because distinct points in the domain are mapped
to distinct points in the codomain. The function in (B) is not 1-1 because
the points a and b are mapped to the same value.
To prove that a function is 1-1 we can choose between the usual ap-
proaches:
a3 = b3 =⇒ a3 − b3 = 0
=⇒ (a − b)(a2 + ab + b2 ) = 0
=⇒ a − b = 0 or a2 + ab + b2 = 0.
74 CHAPTER 2. RELATIONS AND FUNCTIONS
On the other hand, to prove that a function f is not 1-1 we need to nd
some specic a and b such that a 6= b and f (a) = f (b). This usually calls
for more than manipulating the rule for the function. We need to recognize
some key characteristic that may lead to repeated values.
Task 2.3.15 Let X be any nonempty set and let P(X) be its power set.
Give a 1-1 map f : X → P(X).
Onto Functions
(A) (B)
The function in (A) is not onto because the point z in the codomain has no
pre-image. The function in (B) is onto.
We are going to use the fundamental fact that every non-negative real
√
number y has a non-negative square root y. You would not have seen
7 the proof of this fact in school! We also just assume it for now. Proofs are
indicated in Exercises 5 and 6 of 3.6.
Task 2.3.19 Find out whether the following functions are 1-1 or onto.
Task 2.3.20 Let 2N be the set of all even natural numbers. Give examples
of functions from 2N to N that are
76 CHAPTER 2. RELATIONS AND FUNCTIONS
(a) 1-1 but not onto, (b) Onto but not 1-1,
(c) Neither 1-1 nor onto, (d) Both 1-1 and onto.
A f (A)
X Y
1. A ⊆ B =⇒ f (A) ⊆ f (B),
2. f (A ∪ B) = f (A) ∪ f (B),
3. f (A ∩ B) ⊆ f (A) ∩ f (B) and equality may fail.
Proof:
3. A ∩ B ⊆ A, B =⇒ f (A ∩ B) ⊆ f (A), f (B) =⇒ f (A ∩ B) ⊆
f (A) ∩ f (B).
Here is an example of failure of equality. Consider the constant function
f : Z → Z, n 7→ 0, A = Z+ , B = Z− . Then A∩B = ∅ =⇒ f (A∩B) =
∅. But, f (A) ∩ f (B) = {0} ∩ {0} = {0}.
f −1 (C) = { x ∈ X | f (x) ∈ C }.
f −1 (C)
C
X Y
Let us consider the relationship between inverse images and the set op-
erations. As we see below, the situation is much neater than it was for
images!
1. C ⊆ D =⇒ f −1 (C) ⊆ f −1 (D),
2. f −1 (C ∪ D) = f −1 (C) ∪ f −1 (D),
3. f −1 (C ∩ D) = f −1 (C) ∩ f −1 (D),
4. f −1 (C c ) = [f −1 (C)]c .
Proof:
1. A ⊆ f −1 (f (A)),
2. f (f −1 (C)) ⊆ C .
Proof:
C
f −1 (f (A)) f −1 (C)
f (f −1 (C))
A f (A)
X Y X Y
In the rst diagram, the fact that f is not 1-1 allows an extra point to creep
into f −1 (f (A)). In the second diagram, the fact that f is not onto makes
−1
f (f (C)) lose a point.
Task 2.3.34 Let f: X →Y. Prove the converses of the results in the above
theorem:
Composition of Functions
X Y Z
g◦f
Proof:
=⇒ a = b. (∵ f is 1-1)
(a) g is not 1-1 but g◦f is 1-1, (b) f is not onto but g◦f is onto.
Proof:
1. f (a) = f (b) =⇒ g(f (a)) = g(f (b)) =⇒ g◦f (a) = g◦f (b) =⇒ a = b.
2. c ∈ Z =⇒ ∃a ∈ X, g ◦ f (a) = c =⇒ g(f (a)) = c =⇒ g(b) =
c, where b = f (a).
82 CHAPTER 2. RELATIONS AND FUNCTIONS
Finally, let us see the interaction of composition with images and inverse
images of sets.
2 × 2 × 2 ×
1 × 1 × 1 ×
0 × 0 × 0 ×
−1 × −1 × −1 ×
−2 × −2 × −2 ×
× × × × × × × × × × × × × × ×
−2 −1 0 1 2 −2 −1 0 1 2 −2 −1 0 1 2
(a) (b) (c)
(a) { (x, y) ∈ I × I | x2 + y 2 = 1 },
(b) { (x, y) ∈ I × I | x2 + y 2 = 1 ∧ x ≥ 0 },
(c) { (x, y) ∈ I × I | x2 + y 2 = 1 ∧ y ≥ 0 }.
x2
1/x if x 6= 0 if x≥0
(a) f (x) = , (b) g(x) = .
0 if x=0 −x2 if x≤0
7 Let f, g, h : R → R be dened by
x x2 x3
f (x) = , g(x) = , h(x) = .
1 + x2 1 + x2 1 + x2
(a) Determine which of them are injective.
11 f: N → A
Suppose and g: N → B are onto. Show that there is an onto
map h N → A ∪ B.
:
13 2 2
Let f : R → R be given by f (x, y) = x + y .
2
What are f −1 ({t}) and
f −1 ([a, b])? Draw pictures of these inverse images.
Task 2.4.1 Carry out an indexing of the set O of odd integers by the index
set Z.
3 The `if ' direction is also true, provided we assume the Axiom of Choice. See 4.3.
[ [ \ \
X = Xi and X = Xi .
i∈I i∈I
[ [ \ \
X = X and X = X.
X∈X X∈X
Task 2.4.2
[
Let X = { Xn | n ∈ N } with Xn = {1, 2, . . . , n}. Find Xn
n∈N
\
and Xn .
n∈N
The laws for union and intersection, which we stated earlier for combinations
of two or three sets at a time, can be generalized to arbitrary collections of
sets.
[ [ \ \
(a) U ∩( Xi ) = (U ∩ Xi ), (b) U ∪( Xi ) = (U ∪ Xi ).
i∈I i∈I i∈I i∈I
4 Galileo noted that the set {1, 2, 3 . . . } can be used to label, without repetition, the
visibly smaller collection {1, 4, 9, . . . }. He appears to have considered this paradoxical and
counted it as a reason to not consider innite collections as mathematical entities.
86 CHAPTER 2. RELATIONS AND FUNCTIONS
[ \ \ [
(a) ( Xi )c = Xic , (b) ( Xi )c = Xic .
i∈I i∈I i∈I i∈I
Cartesian Product
(a, a, a), (a, a, b), (a, a, c), (a, b, a), (a, b, b), (a, b, c).
Q
i∈I Xi
n
Y
I = {1, . . . , n} Xi or X1 × · · · × Xn
i=1
Y∞
I=N Xi or X1 × X2 × · · ·
i=1
I = {1, . . . , n} and X = X1 = · · · = Xn Xn
I = N and X = X1 = X2 = · · · X∞
5 It has been a matter of intense debate whether it is reasonable to allow arbitrary
amounts of simultaneous selections in a proof in mathematics. For more on this, see 4.3
on the Axiom of Choice.
3
[ \
Find Xn and Xn :
n∈N n∈N
(a) Xn = nN = { nk | k ∈ N }, (b) Xn = nZ = { nk | k ∈ Z }.
4
[ \
Find Xr and Xr :
r∈[0,1] r∈[0,1]
[ [ \ \
(a) A ∪ ( Xi ) = (A ∪ Xi ), (b) A∩( Xi ) = (A ∩ Xi ).
i∈I i∈I i∈I i∈I
(a) If
[Xi ⊆ A for each i, then (b) If A\⊆ Xi for each i, then
Xi ⊆ A. A⊆ Xi .
i∈I i∈I
[ [
(a) f( Xi ) = f (Xi ),
i∈I i∈I
\ \
(b) If f is 1-1 then f( Xi ) = f (Xi ).
i∈I i∈I
[ [ \ \
(a) f −1 ( Yi ) = f −1 (Yi ), (b) f −1 ( Yi ) = f −1 (Yi ).
i∈I i∈I i∈I i∈I
[ [ \ \
(a) U ×( Xi ) = (U × Xi ), (b) U ×( Xi ) = (U × Xi ).
i∈I i∈I i∈I i∈I
the other set, the one it was paired with. Thus our relation is actually a 1-1
function.
If the two groups have the same amount of people, there will be no
leftover members, and the function will be onto. Thus, a perfect pairing o
is represented by a function that is 1-1 and onto.
BP
BP
These tasks illustrate how the notion of bijection helps capture the in-
tuition of same amount. However, this simple idea has consequences that
seem unnatural at rst glance. For instance, we have seen earlier that there
is a bijection from N to Z, and so we should think of them as having the
same amount of points, even though Z has many points that are not in N!
We have also seen that all closed intervals [a, b] with a < b are in bijection
with each other, even though their widths vary indenitely. This is perhaps
to be expected as the numbers associated with an interval are due to our
choice of unit and hence are quite arbitrary. The next example is much more
surprising.
90 CHAPTER 2. RELATIONS AND FUNCTIONS
y -axis
( ) x-axis
−1 x 1 f (x)
The point x on the horizontal axis is rst lifted vertically to the point P on
the semicircle and then sent out radially until it hits the horizontal axis at
f (x).
The semicircle has equationx2√+ (y − 1)2 = 1. Hence the point P corre-
sponds to the ordered pair (x, 1 − 1 − x2 ). By considering similar triangles
we see that f (x) is given by
f (x) f (x) − x x
= √ or f (x) = √ .
1 1 − 1 − x2 1 − x2
If you visualize sliding the point x along the interval (−1, 1) you should be
able to see that f (x) will range over the whole real line. We can check this
by calculations.
a b a2 b2
f (a) = f (b) =⇒ √ =√ =⇒ 2
=
1 − a2 1 − b2 1−a 1 − b2
=⇒ a2 (1 − b2 ) = b2 (1 − a2 ) =⇒ a2 = b2 =⇒ b = ±a.
Note that f (−a) = −f (a). If a 6= 0 we have f (a) 6= 0 and hence f (−a) =
−f (a) 6= f (a). So b 6= −a, and hence b = a. If a = 0 we have b = ±0 = 0 = a.
Next we check that f is onto. Take any y∈R and try to solve y = f (x)
for x. We have:
x y2
y=√ =⇒ y 2 (1 − x2 ) = x2 =⇒ y 2 = (1 + y 2 )x2 =⇒ x2 = .
1 − x2 1 + y2
y
By substituting, you can check that x = p works.
1 + y2
Task 2.5.3 Investigate the one-one, onto, or bijective nature of the following
functions. Also, consider what shrinking of the domain or the range or both.
is needed to make the function one-one or onto or bijective.
Inverse Functions
g(y) = x ⇐⇒ f (x) = y.
f f −1
X Y X Y
Proof: The proof is very similar to the previous theorem, hence is left as an
exercise.
The notion of inverse function can also be expressed neatly in terms of
composition:
f ◦ g = 1Y =⇒ f −1 ◦ (f ◦ g) = f −1 ◦ 1Y
=⇒ (f −1 ◦ f ) ◦ g = f −1
=⇒ 1X ◦ g = f −1
=⇒ g = f −1 .
−1
A common error is to forget the rst, and main, interpretation of f (A).
−1
Often, we come across f (A) and conclude that this indicates the avail-
« −1
ability of the inverse function f . It does not, unless we have separate
information that f is a bijection.
The other common error is to confuse the notation for an inverse function
with that of the reciprocal of a real number. This confusion leads to
calculations like the following:
«
f (x) = x =⇒ f −1 (x) = x−1 = 1/x.
Cardinality
We say that two sets A and B have the same cardinality if there is a
bijection f : A → B, and we denote this by |A| = |B|.
We have already seen many examples of sets with the same cardinality,
for instance:
BP
BP
X Y
An example of |X| ≤ |Y |.
In the diagram drawn just above, there is a function that is 1-1 but not
« onto. The fact that this function is not onto is not sucient for claiming
that |X| < |Y |, as there may be another function that is both 1-1 and
onto.
Theorem 2.5.11 (Cantor's Theorem) For any set X , |X| < |P(X)|.
C = {x ∈ X | x ∈
/ g(x) }.
Since C ∈ P(X), there is a point a∈X such that g(a) = C . Does a belong
to C? We have the following:
a ∈ C ⇐⇒ a ∈ g(a) ⇐⇒ a ∈
/ C.
y -axis
( ) x-axis
−1 x 1 f (x)
9 Give a bijection between the set P of planes passing through the origin of
R3 and the set L of lines passing through the origin of R3 .
10 Suppose |A| = |C| and |B| = |D|. Which of the following are always
true?
(c) |A × B| = |C × D|.
11 Suppose A and B are subsets of X, with |A| = |B|. Can we assert the
following?
13 Give a bijection between the plane R2 and the open unit ball D =
2 2 2
{ (x, y) ∈ R | x + y < 1 }.
X a
a∗b
b X
You are already familiar with many binary operations, even if you did
not know that term earlier! For example:
Example 2.6.2 If you have studied vectors in high school, you may recall
the cross product for three-dimensional vectors. It is a binary operation on
R3 .
x ∗ a = y ∗ a =⇒ x = y and a ∗ x = a ∗ y =⇒ x = y.
x ∗ a = y ∗ a =⇒ (x ∗ a) ∗ b = (y ∗ a) ∗ b =⇒ x ∗ (a ∗ b) = y ∗ (a ∗ b)
=⇒ x ∗ e = y ∗ e =⇒ x = y.
Example 2.6.6 Consider a power set P(X) with the binary operations ∪
and ∩. Then:
2. The empty set ∅ is the identity for ∪, while X is the identity for ∩.
3. Only ∅ has an inverse relative to ∪, and only X has an inverse relative
to ∩.
4. The two operations distribute over each other:
A ∪ (B ∩ C) = (A ∪ B) ∩ (A ∪ C) and A ∩ (B ∪ C) = (A ∩ B) ∪ (A ∩ C)
Example 2.6.8 Let F(X) be the set of functions from X to X, with com-
position of functions as the binary operation.
1. Bijections that switch two numbers and leave the third one unchanged,
for example 1 7→ 2, 2 7→ 1, 3 7→ 3.
1 1 1
2 3 2 3 2 3
R1 R2 R3
1 1 1
2 3 2 3 2 3
T0 T120 T240
4. Every element has an inverse. Each reection is its own inverse, while
T120 and T240 are the inverses of each other.
0 1 2
0 0 0 0
1 0 1 2
2 0 2 1
(a) 1 is the identity as the row and column for 1 just replicate the list of
members of X in the same order.
(b) 0 has no inverse since the row (and column) for 0 has no entry of 1.
(d) The subset {1, 2} is closed since the entries for combinations of 1 and
2 only consist of 1 and 2.
102 CHAPTER 2. RELATIONS AND FUNCTIONS
(e) The operation is commutative since the table is symmetric about the
diagonal running from the top left to the bottom right.
(b) Give an example of ∗ where some elements have multiple left inverses.
(c) Show that if an element has both a left inverse and a right inverse
then they are equal, and hence are the inverse of that element. (In
particular, an element can have at most one inverse)
9 Draw the multiplication table for S3 with composition as the binary op-
eration.
13 Let D4 denote the set of symmetries of a square with its vertices labeled
1, 2, 3, 4. Which bijections of {1, 2, 3, 4} correspond to members of D4 ?
Which are reections and which are rotations?
0 1 2
0 0 1 2
1 1 2 0
2 2 0 1
(b) Suppose G has exactly two elements that are their own inverse. Then
there is a pair of elements which are the inverses of each other. Name
one of them 1. Also dene 2 = 1 + 1, and name the last member 3.
With these renamings, show that the multiplication table becomes
0 1 2 3
0 0 1 2 3
1 1 2 3 0
2 2 3 0 1
3 3 0 1 2
(c) Now consider the other case, when every member of G is its own
inverse. Show that the multiplication table for G is the following:
0 1 2 3
0 0 1 2 3
1 1 0 3 2
2 2 3 0 1
3 3 2 1 0
It came to appear that, between two truths of the real domain, the
easiest and shortest path quite often passes through the complex
domain.
2
Paul Painlevé
It is time that all these numbers were also brought within our house
of set theory and logic. In the rst section of this chapter we shall give a
1 From his text Brahmasphutasiddhanta composed in 628 AD. Translation by Henry
Thomas Colebrooke, 1817. https://archive.org/details/algebrawitharith00brahuoft
2 See http://homepage.math.uiowa.edu/~jorgen/hadamardquotesource.html.
105
106 CHAPTER 3. NUMBER SYSTEMS
formal description of the set of natural numbers via the Peano axioms. In
later sections we shall use the natural numbers and set theory methods such
as cartesian product and equivalence relation to build the other kinds of
numbers.
The most essential set of numbers is that of natural numbers. The formal ap-
proach to natural numbers originates with Charles Pierce, Richard Dedekind
and Giuseppe Peano in the late 19th century. The following axioms for the
set N of natural numbers are called the Peano Axioms:
P1. (Successor Function) There is a 1-1 function S : N → N.
The rst axiom represents the intuition that each natural number has a
`next' number. (For example, S(1) = 2, S(2) = 3 and so on) The second,
that there is an initial natural number which does not succeed any other.
The third, that all natural numbers can be obtained by taking consecutive
successors starting with 1. The following theorem expands on the last point:
Theorem 3.1.1 The Peano axioms for N imply that if y ∈ N \ {1} then
∃!x ∈ N such that S(x) = y .
Peano showed that all the basic properties of arithmetic with natural
numbers could be deduced from these axioms, and that is our topic of study
in this section. An obvious question at this stage concerns the actual exis-
tence of a set with a successor function that satises the Peano axioms, and
hence can serve as N. We leave that for the next chapter.
Recursion
Proof: To construct f we rst consider all relations that have the same
essential properties. Thus, we let M be the set of all R⊆N×X such that
(a) (1, c) ∈ R,
(b) (n, x) ∈ R =⇒ (S(n), G(x)) ∈ R.
For example, N × X ∈ M. Now take the intersection of all these relations:
\
f= M.
A = { n ∈ N | (n, x) ∈ f ∧ (n, x0 ) ∈ f =⇒ x = x0 }.
R = { (n, x) ∈ N × X | n 6= 1 ∨ x = c }.
if not, then consider the relation R = f \ {(S(n), x)}. Our assumption gives
R ∈ M, which contradicts the minimality of f.
Now we shall recover the arithmetic and order properties of N from the
Peano Axioms. So we begin by forgetting everything about N except that it
is a set with a successor function S: N → N which obeys the Peano Axioms.
We have no addition, no order, and we can't make statements like take the
rst n elements.
Addition
We can do a recursion along each row, using its rst point as the base point.
This means that we begin by setting the value cn of the binary operation for
each (n, 1) pair.
Next, we need a rule that nds the value for any (n, S(m)) by using the value
for (n, m). This requires functions Gn : N → N for each n.
To begin the recursion for dening addition, we rst dene n + 1 = S(n).
Then to obtain n + (m + 1) from n + m, we set n + (m + 1) = (n + m) + 1.
+ : N × N → N, (n, m) 7→ n + m,
such that n + 1 = S(n) and n + S(m) = S(n + m) for all natural numbers x
and y.
Multiplication
• : N × N → N, (n, m) 7→ n • m,
such that n•1 = n and n • S(m) = (n • m) + n for all natural numbers n and
m.
110 CHAPTER 3. NUMBER SYSTEMS
(a) fn (1) = n,
(b) fn (S(m)) = fn (m) + n.
Dene the binary operation of multiplication by n • m = fn (m).
Proof: Now that the operations have been established, their properties can
be veried by the usual induction approach. From our brief experience of
binary operations, we know that associativity is very helpful in proving other
facts. So we begin with the associative law for addition. Let
A = { n ∈ N | (` + m) + n = ` + (m + n) ∀`, m ∈ N }.
= ((` + (m + n)) + 1 (n ∈ A)
= ` + ((m + n) + 1) (Theorem 3.1.3)
m + (n + 1) = (m + n) + 1 (Theorem 3.1.3)
= 1 + (m + n) (1 ∈ C)
= 1 + (n + m) (n ∈ C)
= (1 + n) + m (Associativity)
= (n + 1) + m (1 ∈ C)
We leave the other laws to be worked out by you. Exercise 2 of this section
provides one sequence for doing so.
Zero
Task 3.1.7 Check that addition and multiplication of whole numbers have
the following properties:
(b) 0 is the identity for addition and 1 is the identity for multiplication.
Proof:
Proof:
Exponentiation
a0 = Ea (0) = 1,
a1 = Ea (0) ∗ a = 1 ∗ a = a,
a2 = Ea (1) ∗ a = a ∗ a,
.
.
.
an+1 = E(n) ∗ a = an ∗ a.
m+(n+1) (m+n)+1
a =a = am+n ∗ a = (am ∗ an ) ∗ a = am ∗ (an ∗ a) = am ∗ an+1 .
Hence n+1∈A and the proof is complete.
(a ∗ b)n = an ∗ bn .
(a ∗ b)n+1 = (a ∗ b)n ∗ (a ∗ b)
= (an ∗ bn ) ∗ (a ∗ b)
= ((an ∗ bn ) ∗ a) ∗ b
= (a ∗ (an ∗ bn )) ∗ b
= ((a ∗ an ) ∗ bn ) ∗ b
= an+1 ∗ (bn ∗ b)
= an+1 ∗ bn+1 .
Therefore n + 1 ∈ A. Hence, A = W.
Order
A = { n ∈ N | (m ≤ n) ∨ (n ≤ m) }.
(a) n 6= 1 =⇒ 1 < n,
(b) n < n + `,
(c) m < n ⇐⇒ m + ` < n + `,
(d) m < n ⇐⇒ m • ` < n • `,
Proof:
(c) The =⇒ direction is obvious. For the converse, apply the cancella-
tion law for addition.
From here on, we shall revert to our familiar notation of writing the
product of a and b as simply ab, rather than a • b.
2 The following tasks provide one path for completing the proof of Theorem
3.1.5:
10 (Fibonacci Sequence) F : N → N
Show there is a unique function
such that F (1) = F (2) = 1 and for each n ∈ N, F (n + 2) = F (n) + F (n + 1).
(Hint: Consider the function G(m, n) = (n, m + n) on X = N × N)
Suppose P (1) is true, and (∀n ∈ N)(P (n) =⇒ P (n + 1)) is true. Then
P (n) is true for every n ∈ N.
Example 3.2.1 We'll prove by induction that the sum of the rst n natural
numbers is n(n + 1)/2. We set
n
X n(n + 1)
P (n) : k= .
2
k=1
P (1) is the statement 1 = 1(1 + 1)/2, which is true. Now we assume the
truth of P (n) and use it to show the truth of P (n + 1):
n+1 n
X X n(n + 1) (n + 1)(n + 2)
k= k + (n + 1) = +n+1= .
2 2
k=1 k=1
Task 3.2.2 Use induction to prove that the sum of the squares of the rst
n natural numbers is n(n + 1)(2n + 1)/6.
Example 3.2.3 (Binomial formula) We'll use induction to prove that for
any real numbers a and b, and any natural number n,
n
X n n n!
(a + b)n = ak bn−k , where = .
k k k!(n − k)!
k=0
We denote this formula by P (n). 3 While our formal description of the real
numbers happens later, in 3.6, in this example we only use their familiar
properties like commmutativity and associativity. We should also note that
the binomial formula assumes 00 = 1.
Task 3.2.4 Suppose that P (k) is true for some k ∈ N, and (P (n) =⇒
P (n + 1)) is true for every n ≥ k . Show that P (n) is true for every n ≥ k.
Proof: Let P (n) be a property that may be associated ton. Suppose P (1)
is true, and for every n ∈ N ((∀k ≤ n) P (k)) =⇒ P (n + 1)) is true.
Dene a new property Q(n) = (∀k ≤ n) P (k). We apply the Principle of
Mathematical Induction to Q(n).
First, we see that Q(1) = P (1) is true.
Next, we assume that Q(n) is true. Then we have (∀k ≤ n) P (k). By the
assumptions about P, we nd that P (n + 1) is true. Therefore Q(n + 1) =
(∀k ≤ n + 1) P (k) is true.
Since Q(n) implies P (n), we see that P (n) is also true for all n ∈ N.
The advantage of the Principle of Strong Induction is greater exibility
in carrying out proofs. It allows us to prove P (n + 1) from any convenient
prior case, or by their combinations, and not from P (n) alone. The very
next theorem illustrates this gain.
120 CHAPTER 3. NUMBER SYSTEMS
Decimal Representation
Let us see how our abstract description of N matches with our everyday
use of these numbers, when we work with concrete numbers. How do we
represent an arbitrary member of N in the decimal notation? The key is the
Well-Ordering Principle.
11 Use the last exercise to create a process for calculating the gcd of a and
b without nding their factors.
14 Show that the sum of two consecutive prime numbers (like 7 and 11)
cannot be twice of a prime number.
3.3 Integers
While discussing natural numbers in the previous section, we had a few oc-
casions when we incorporated zero into our discussion. This added exibility
4
and allowed some arguments to run in a smoother fashion with fewer cases.
Now we will take the next step, of incorporating the negative integers as well.
How were you taught this in school? You already knew the natural numbers
and zero, and were told to just put a `minus sign' in front of these numbers
to create the new `negative numbers'. The positive and negative integers
were visualized as being laid out along a line, extending in both directions
from 0.
−3 −2 −1 0 1 2 3
• • • • • • •
Z = ({+, −} × N) ∪ {0}.
The diculty with dening integers as being made of two (in fact three)
intrinsically dierent types of objects is that all denitions and proofs take a
case-by-case form. And this is where it gets confusing. For a reminder, just
read the description of subtraction of integers given over a thousand years
ago by the Indian mathematician and astronomer Brahmagupta, and quoted
at the start of this chapter.
4 For this reason, many mathematicians prefer to include zero among the natural num-
bers.
124 CHAPTER 3. NUMBER SYSTEMS
We'll take a dierent tack. One motivation for creating negative numbers
is the desire to be able to carry out subtraction of arbitrary natural numbers
(If I have Rs 8 and I owe Rs 10, how much do I really have? Well, I need to
subtract 10 from 8, and so . . . ) To begin with, we can simply represent any
dierence by an ordered pair: for example (8, 10) can represent 8 − 10. Next
we observe that certain dierences should have the same result: for example
10 − 8 and 9−7 should be viewed as having the same value. Overall, this
gives an approach of viewing integers as ordered pairs with an equivalence
relation.
We'll use the notation (m, n) to represent the equivalence class of (m, n).
For example,
Task 3.3.1 Let (m, n) ∼ (m0 , n0 ) and (i, j) ∼ (i0 , j 0 ). Show that
(m + i, n + j) ∼ (m0 + i0 , n0 + j 0 ).
Task 3.3.3 Let (m, n) ∼ (m0 , n0 ) and (i, j) ∼ (i0 , j 0 ). Show that
Proof: These are straightforward calculations which you can carry out. We
have done the third one as an illustration.
Order
The next step is to supply Z with an order. First we identify integers that
we will call positive:
Z+ = { x ∈ Z | x = (m, n) such that n < m }.
The additive inverses of positive integers are called negative and their col-
lection is denoted by Z− . So,
Z Enlarges N
Let us again look at the equivalence classes that constitute Z. In the gure
below, the ones in Z+ are coloured dark gray while the ones in Z− are
coloured light gray. The zero class 0Z is coloured black.
3Z 2Z 1Z 0Z
The classes have been labeled as follows. We already know 0Z = (1, 1) and
1Z = (2, 1). We further dene
1. It is a bijection,
2. ϕ(1) = 1Z ,
3. ϕ(m + n) = ϕ(m) ⊕ ϕ(n),
4. ϕ(m · n) = ϕ(m) ⊗ ϕ(n).
128 CHAPTER 3. NUMBER SYSTEMS
+
Thus Z can be thought of as a copy of N inside Z and we shall do so. In
particular, the Principle of Mathematical Induction applies to Z+ as follows:
+
Let A ⊆ Z such that 1Z ∈ A and if x ∈ A then x ⊕ 1 Z ∈ A. Then
A = Z+ .
The correspondence between N and Z+ allows us to change our notation
to the familiar one. n ∈ N, we allow ourselves to also say n ∈ Z by
If
identifying n with (n + 1, 1). Similarly, we create the symbol −n and identify
it with (1, n + 1). 0 is identied with 0Z = (1, 1). And we replace ⊕, and
⊗ with +, − and · respectively.
With these identications, our calculations generate the familiar results.
For example,
We will now extend the concept of division and related results to integers.
Theorem 3.3.8 Let b ∈ Z and a ∈ N. Then there are unique integers q and
r such that b = qa + r and 0 ≤ r < a.
Proof: We have already proved this result for b > 0. If b = 0 we just take
q = r = 0. If b < 0, we rst apply the established case to −b: −b = q 0 a + r0
0
with 0 ≤ r < a.
Theorem 3.3.11 Let a, b ∈ Z. Then they have a unique gcd and it can be
expressed in the form ma + nb with m, n ∈ Z.
Absolute Value
n n≥0
The absolute value of an integer n is dened by |n| = −n
if
if n<0
.
2 2
(a) x ≥ 0, and x = 0 ⇐⇒ x = 0.
2 2
(b) x ⊕ y ≥ 0, and x2 ⊕ y 2 = 0 ⇐⇒ x = y = 0.
(c) x ∈ Z+ and y ∈ Z− implies x ⊗ y ∈ Z− .
(d) x, y ∈ Z− implies x ⊗ y ∈ Z+ .
(a) a, b ∈ I =⇒ a + b ∈ I , (b) a ∈ I =⇒ −a ∈ I .
11 Consider Zn , the integers modulo n. (See page 103) For any number
k ∈ {1, 2, . . . , n − 1} dene hki = { mk | m ∈ N }. Show that hki = Zn if and
only if gcd(k, n) = 1.
(0, 1) ∼ (0, 2), (2, 1) ∼ (6, 3), (3, −2) ∼ (−3, 2), (−3, −2) ∼ (3, 2).
0 0 2 6 3 −3 −3 3
= , = , = , = .
1 2 1 3 −2 2 −2 2
Task 3.4.1 Show that every rational number can be expressed in the form
m/n where m∈Z and n ∈ N.
Task 3.4.2 Show that every rational number can be expressed in the form
m/n where gcd(m, n) = 1. (This is called a reduced form of the rational
number. How many reduced forms can a rational have?)
Given two rational numbers m/n and i/j we dene their sum by
m i mj + ni
+ = .
n j nj
As we are working with equivalence classes we have to make sure our deni-
tion is independent of the choice of representatives.
m −m
Proof: We leave the proofs to you. We just note that if x= then y=
n n
m −m m
serves as its additive inverse. That is, − = = .
n n −n
on Q by
m i mi
• = .
n j nj
If x, y ∈ Q then x•y is called the product of x and y.
Task 3.4.5 Let (m, n) ∼ (m0 , n0 ) and (i, j) ∼ (i0 , j 0 ) in Z × Z∗ . Show that
∗
3. Let x∈Q . Then ∃y ∈ Q∗ such that x • y = 1Q . The rational number
y is called the multiplicative inverse of x and is denoted by x−1 or
1/x.
4. Multiplication distributes over addition: If x, y, z ∈ Q then x • (y + z) =
(x • y) + (x • z).
Proof: You must have predicted that this proof would be left to you! We
∗
just mention, for the record, a fact already known to you: If x = m/n ∈ Q
then x−1 = n/m.
134 CHAPTER 3. NUMBER SYSTEMS
(a) 0Q · x = 0,
(b) x • y = 0Q if and only if x = 0Q or y = 0Q ,
(c) (−1Q ) • x = −x.
m i y in
Task 3.4.8 Let x= ∈ Q∗ and y= ∈ Q. Show that = .
n j x jm
Q Enlarges Z
1. ψ is 1-1,
Proof:
Therefore we can think of ψ(Z) as a copy of Z inside Q, and thus Q
becomes an enlargement of Z. We start directly combining integers and
rationals through the following conventions, which are based on ψ:
k n k k n k
For n, k ∈ Z, ` ∈ Z∗ , n+ = + and n• = • .
` 1 ` ` 1 `
Exponentiation
On page 112 we saw how to dene an , for n ∈ W, with respect to any binary
operation with identity. In particular, this leads to a denition of an for
multiplication of rational numbers, with a ∈ Q and n ∈ W. Since every
non-zero rational number has a multiplicative inverse, we can extend this
denition to negative powers of non-zero rational numbers:
Task 3.4.10 Let a ∈ Q∗ and n ∈ Z. Show that a−n = (a−1 )n = (an )−1 .
Proof:
(a) We note that we already have the result when m, n ∈ W. The other
cases are:
• m, n ∈ Z− :
• m ∈ W, n ∈ Z− , m + n ∈ W:
• m ∈ W, n ∈ Z− , m + n ∈ Z− :
(b)
(c)
136 CHAPTER 3. NUMBER SYSTEMS
Order
Theorem 3.4.12 The positive and negative rationals have the following
properties:
+
2. Q is closed under addition and multiplication.
Proof:
Given x, y ∈ Q we say y < x (or x > y) if x − y ∈ Q+ . And we say y≤x
(or x ≥ y ) if y < x or y = x.
(a) x > 0 ⇐⇒ x ∈ Q+ ,
(b) x < 0 ⇐⇒ x ∈ Q− ,
(c) ≤ is a total order on Q.
m i
If x= and y= with m, i ∈ Z and n, j ∈ Z∗ then y < x ⇐⇒ ni < mj .
n j
10 Let p and q be rational numbers with p < q. Dene the interval (p, q)Q
by
(p, q)Q = { x ∈ Q | p < x < q }.
(a) Let F be the set whose members are the intervals
T (0, 1/n)Q with
n ∈ N. Show that F = ∅.
(b) Let G be the set whose
S members are the intervals (0, 1 − 1/n)Q with
n ∈ N. Show that G = (0, 1)Q .
Let us rst take the example of Z2 = {0, 1}. The only non-zero member
is the multiplicative identity 1. Now 1 • 1 = 1 shows that Z2 is an integral
domain.
From the original denition of a eld, we can deduce many other prop-
erties that it automatically possesses:
Proof:
3. a + b = a + c =⇒ (−a) + a + b = (−a) + a + c =⇒ 0 + b = 0 + c =⇒
b = c.
4. Similar.
5. 0a = (0 + 0)a = 0a + 0a.
6. Suppose a 6= 0. Then a−1 ab = a−1 0 = 0 =⇒ 1b = 0 =⇒ b = 0.
7. a + (−a) = 0 implies a is additive inverse of −a, implies a = −(−a).
8. ab + (−a)b = (a + (−a))b = 0b = 0.
9. Apply previous result.
10. If 0b = 1 then 0 = 1.
Since Q is a eld, it has all the properties listed above. But there is more
to Q. Not only do we have rules for combining rational numbers, we also
have a notion of comparing the size of two rational numbers. We know that
with respect to this notion of order, Q splits into three types of numbers:
0, positive and negative. We know that the sum and product of positive
numbers are positive, and that x is positive if and only if −x is negative.
Members of F+ are called positive and members of −F+ are called nega-
tive. The
+ −
set −F is also denoted by F .
Task 3.5.7 Verify that there is no way to make Z2 into an ordered eld.
Theorem 3.5.8 Let F be an ordered eld with order relation <. Then we
have
Proof:
z − x = (z − y) + (y − x) ∈ F+ .
x>y if y<x
x≤y if x<y or x=y
x≥y if x>y or x=y
The four basic properties of < can be used to prove other important prop-
erties:
Theorem 3.5.9 Let F be an ordered eld with order relation <. Then we
have
Proof:
This contradicts Trichotomy, which says that x<y and y<x cannot
be true simultaneously. Hence we must have x = y.
2. If x ∈ F+ then x2 ∈ F+ by denition of ordered eld. If x ∈ −F+ then
−x ∈ F+ and x2 = (−x)2 ∈ F+ .
3. 1 = 1 2 > 0, by previous result since 1 6= 0..
4. If y−x>0 then (−y) − (−x) = x − y < 0.
5. Apply previous result.
7. Suppose x>0 and y < 0. then −y > 0. Hence −xy = x(−y) > 0, and
so xy < 0.
8. Sum of positives is positive.
Task 3.5.10 Let F be an ordered eld, x, y ∈ F and 0 < x < y. Show that:
a + b = a + c =⇒ b = c, ab = ac and a 6= 0 =⇒ b = c.
3 Prove that every nite integral domain is a eld. (Hint: Can the powers an
all be distinct for a xed non-zero a and with n varying over N?)
Before trying out Exercises 8 to 12, you may want to review the material on bounds
in pages 6366.
cA = { y ∈ F | y = cx for some x ∈ A }.
Suppose sup(A) and sup(B) exist. Will sup(AB) = sup(A) sup(B) always
hold? Will it hold if A, B ⊆ F+ ?
This amounts to the value 62, 832/20, 000 = 3.1416. The precision to 4
decimal places is enabled by the large diameter of 20,000.
1+2 3
a1 = 1 =⇒ b1 = 2 =⇒ a2 = = ,
2 2
3 4 3/2 + 4/3 17
a2 = =⇒ b2 = =⇒ a3 = = ,
2 3 2 12
17 24 17/12 + 24/17 577
a3 = =⇒ b3 = =⇒ a4 = = .
12 17 2 408
The value 577/408 ≈ 1.414216 is accurate to ve decimal places! It is found
in a clay tablet from Mesopotamia from about 1800 BC. It is also found in
the Indian Sulvasutra texts from 800BC or later, where it is expressed in a
particularly convenient form:
1 1 1
1+ + + .
3 3 × 4 3 × 4 × 34
(And the rst three terms give the previous approximation of 17/12)
From the modern viewpoint, we are seeking an ordered eld (which au-
tomatically contains Q) that also suces for geometry. If a quantity can
The next example provides insight into what it means for some quantities
to be beyond Q and the additional requirement that would bring them into
the fold. This step was rst taken by the German mathematician Richard
Dedekind.
1 2 1 c2
d2 = (b + c2 + 4) = 2 + + 1 > 2.
4 c 4
Hence d is an upper bound of A. But then c is not the least upper bound, a
contradiction.
4 1 4 1
e2 = c2 + + 2 ≤ 2 + + 2 = 2.
9 c2 9 2
Hence e∈A and c is not an upper bound of A. Since c2 > 2 and c2 < 2
2
have been ruled out, the only possibility is c = 2.
148 CHAPTER 3. NUMBER SYSTEMS
So far, the only ordered eld we have seen is Q and it is not order com-
plete, since the set {x ∈ Q : x2 < 2} is bounded above by 2 but has no
supremum.
The unique order complete eld described above is called the eld of
real numbers and is denoted by R. We will take this result for granted .
Assuming it, we will try to prove all properties of real numbers without any
7
further assumptions.
Task 3.6.4 Show that R contains a unique positive number c such that
c2 = 2.
Task 3.6.5 Let A ⊆ R be non-empty and bounded below. Show that inf(A)
exists.
Archimedean Property
1
Proof: Since y − x > 0, there exists N ∈ N< y − x.
such that 0 <
N
Then 1 < N y − N x, hence N x + 1 < N y . Dene M = [N x] + 1. Then,
N x < [N x] + 1 ≤ N x + 1 < N y , i.e. N x < M < N y . So we can take
p = M/N .
Members of Qc = R \ Q are called irrational numbers.
Theorem 3.6.12 (Denseness of Qc in R) Let x, y ∈ R and y > x. Then
there is t ∈ Qc such that x < t < y.
√ √
Proof: By the denseness of
√ Q, we have p∈Q such 2x < p < 2 y. Take
t = p/ 2.
150 CHAPTER 3. NUMBER SYSTEMS
Absolute Values
1. |x| ≥ 0,
2. |x| = 0 ⇐⇒ x = 0,
3. |xy| = |x||y|.
We view |x| as giving the distance between x and 0. If we have two real
numbers x, y then the distance between them is |x − y|.
Decimal Representations
a1 = [10r1 ], r2 = 10r1 − a1 ,
a2 = [10r2 ], r3 = 10r2 − a2 ,
. .
. .
. .
Note that for each n, 0 ≤ rn < 1 and so an ∈ {0, 1, . . . , 9}. For each n,
consider
n
X
xn = [x] + ai 10−i .
i=1
n
X
x = sup{ d + ai 10−i | n ∈ N }.
i=1
What we have just done is to describe one process for generating a decimal
representation of a given number. Could there be others, and could they lead
to a dierent decimal representation? Is the decimal representation of a real
number unique? It surprises most people to learn that the answer to the last
question is not always. For an example, consider the decimal representation
0.999 . . . of some real number x. Let x1 = 0.9, x2 = 0.99, and in general
152 CHAPTER 3. NUMBER SYSTEMS
Pn −i
xn = i=1 9 · 10 . Dene A = { xn | n ∈ N }. Then x = sup(A). But we
can easily compute the supremum in another way. First we note that 1 is
−n
an upper bound of A. Second, 1 − xn = 10 . Hence 1 = sup(A) = x.
Therefore 0.999 . . . can also be expressed as 1.000 . . . !
On completing the next Task you will have established that this is the
only kind of double representation of a real number in decimal notation, and
so we almost have a bijection between the real numbers and their decimal
representations.
Task 3.6.17 Suppose a real number has two dierent decimal representa-
tions. Then one representation eventually consists of only 9's and the other
eventually consists of only 0's, and both become eventually constant at the
same stage.
8
The fact that we mostly count in powers of 10 is a historical accident.
Mathematically, we may equally well count in powers of 2 or any higher
natural number. A representation of a real number as a sum of powers of 2
is called a binary representation. For example,
0 1 0 1
2. We can express one-third as 0+ + 2 + 3 + 4 + ··· and hence
21 2 2 2
we say that its binary representation is 0.0101 . . .
Here too, one can have two distinct representations of the same number,
such as 0.111 . . . = 1.000 . . . .
Task 3.6.18 Suppose a real number has two dierent binary representa-
tions. Then one representation eventually consists of only 1's and the other
eventually consists of only 0's, and both become eventually constant at the
same stage.
8 The rst fully developed place value system came about in Mesopotamia around
1800BC, and it used powers of sixty!
(b) For existence, it is enough to do the a > 1 case. (So we assume a>1
in the subsequent steps)
c
9 Let a ∈ R, c ∈ R+ , and 0≤a≤ for every n ∈ N. Show that a = 0.
n
1
10 Let x, y ∈ R such that |x − y| ≤ for every n ∈ N. Prove that x = y.
n
n n
11
X X
Let x 1 , . . . , x n ∈ R. Show that
x i
≤
|xi |.
i=1 i=1
is subject to the same rules as any real number. We tell this story later in
this section.
We use C to denote the cartesian plane with these operations and call it the
complex plane.
Task 3.7.1 Consider the map ϕ : R → C, x 7→ (x, 0). Verify that it is
1-1, that its image is the x-axis, and that for every x, y ∈ R we have
ϕ(x + y) = ϕ(x) + ϕ(y) and ϕ(xy) = ϕ(x)ϕ(y).
Task 3.7.2 Check that (0, 1)2 = −(1, 0). Thus (0, 1) is the formal version
of i.
156 CHAPTER 3. NUMBER SYSTEMS
Now that we have seen how complex numbers t in our scheme of set
theoretic constructions, let us return to using the x + iy notation.
Imaginary Axis
z = x + iy
y
2
y
2 +
p x
| = Real Axis
−x |z x
−y
−z = −x − iy z = x − iy
plotted along the vertical axis, which is called the imaginary axis. The
distance
p of z from the origin is called the magnitude of z and is given by
|z| = x2 + y 2 . The number −z corresponds to reection with respect to
the origin. The complex conjugate z is obtained by reection in the real axis.
y z y z
Real Axis Real Axis
u x u x x+u
This pair of diagrams depicts the addition of complex numbers. The left
diagram shows two complex numbers z = x + iy
w = u + iv . The second
and
diagram shows their sum z + w . We see that adding w to z shifts z by w .
Geometrically, if we treat z as origin and then plot w , we get z +w . Addition
in real numbers has the same eect, if we add 2 to a real number it shifts by
2 units.
Imaginary Axis
−y + ix x
y x + iy
Real Axis
−y x
One can use the Pythagoras theorem to check that the points x + iy , −y + ix
and 0 form a right angled triangle.
Thus, multiplication by i causes each complex number to undergo a ro-
tation of 90 degrees around the origin. Multiplying by i twice gives a total
rotation of 180 degrees and hence sends z to −z . This is the geometric de-
scription of what i does and how it can be interpreted as a square root of
−1.
158 CHAPTER 3. NUMBER SYSTEMS
Imaginary Axis
y = |z| sin θ z = x + iy
|z |
θ
Real Axis
x = |z| cos θ
p
x = |z| cos θ, |z| = x2 + y 2 ,
y = |z| sin θ, tan θ = y/x.
zw
z z
|
|w
φ
|z |
| |
|z |z
θ w θ w
|w | |w |
φ φ
Real Axis Real Axis
yz = p and y 3 + z 3 = 2q.
Now substitute the rst into the second:
y 3 + (p/y)3 − 2q = 0
=⇒ y 6 − 2qy 3 + p3 = 0.
h i1/3 h i1/3
x = y + z = q + (q 2 − p3 )1/2 + q − (q 2 − p3 )1/2 .
We see that the choice of square root of q 2 − p3 does not matter, as both
roots nally appear in the formula for x.
Let us see what Cardano's formula produces in some simple situations.
Currently we are looking only for real solutions.
Nothing surprising here. There was only one solution, and the formula found
it.
h i1/3 h i1/3
x = 3 + (32 − 23 )1/2 + 3 − (32 − 23 )1/2
= [3 + 1]1/3 + [3 − 1]1/3 = 41/3 + 21/3 .
One drawback of Cardano's formula is that it only produces one real root,
whereas there may be up to three. This is not a serious issue because once
we know one real root a, we can divide the cubic by x−a and reduce it to
a quadratic.
Task 3.7.9 Let x=a be a real solution of x3 = 3px + 2q . Show that the
other two solutions are obtained from the quadratic equation x2 + ax + a2 =
3p.
We can't proceed further if we use only real numbers because we can't make
√ √
sense of −27. But, if −27 did make sense, we would have a cancellation
and we would get the x = 0 solution. This is an example of complex numbers
being forced upon us in a problem whose terms and solutions are all real.
Now we shall explore the benets accruing from using complex numbers
in these calculations.
12
The rst is that the q 2 − p3 < 0 case is no longer a
12 The application of complex numbers to this context was rst carried out by Rafael
Bombelli (1526-1572).
162 CHAPTER 3. NUMBER SYSTEMS
dead end. For instance, in Example 3.7.10, we can continue the calculation
as follows:
√ √
1/3 √ √
x= + (− −27)1/3 = ( 27i)1/3 + (− 27i)1/3
−27
√ √
= 3 i1/3 + 3(−i)1/3 .
| {z } | {z }
y z
√ !2 √
−1 + 3 i −1 − 3 i
Task 3.7.11 Verify that = . Hence, if we set
2 2
√
−1 + 3 i 2
w = , then the cube roots of 1 are 1, w and w . Further, show
2
2
that if a is a cube root of z , then the list of cube roots of z is a, wa and w a.
y z x
√ √
− √ 3i 3√i 0
3 + 3i 3 − 3i
3
2√ 2√
−3 + 3i −3 − 3i
−3
2 2
Thus, in this example, the use of complex numbers has paid o twice. First,
we found a way past the diculty of having to compute the square root of a
Now that we have had a glimpse of the benets of using complex numbers,
let us reconsider the cubic equation x3 = 3px + 2q . Setting x = y + z, we
once more reach the conditions
yz = p,
y + z 3 = 2q.
3
p p
2 3
Let q 2 − p3 q − p
be a square root of
p , y be a cube root of q + q 2 − p3 ,
and z be a cube root of q − 2 3
q − p . Then the choice of square root does
3 3
not matter. Also, each combination of y and z satises y + z = 2q . Let us
consider the product yz :
p p
(yz)3 = (q + q 2 − p3 )(q − q 2 − p3 ) = p3 =⇒ yz = p or wp or w2 p.
Hence, for a given y there is exactly one choice of z such that yz = p is also
satised and x = y + z solves the cubic.
Task 3.7.12
p p
y 3 = q + q 2 − p3 and z 3 = q − q 2 − p3 such that
Let
x = y + z is a solution of x3 = 3px + 2q . Show that the other solutions are
x = wy + w2 z and x = w2 y + wz .
(a) z + w = z + w, (b) −z = −z ,
4 Sketch the subsets of the complex plane whose elements z satisfy the given
condition:
164 CHAPTER 3. NUMBER SYSTEMS
(a) z = z, (b) z = −z ,
(b) Show that the equation P (x) = 0 has at most n distinct complex
solutions.
10 Show that:
(a) The unity 1 has exactly n distinct nth roots and they have the form
1, w, w2 , ..., wn−1 .
(b) The nth roots of unity satisfy 1 + w + w2 + · · · + wn−1 = 0.
11 Assume that every positive real number a has a unique positive nth root.
Show that every non-zero complex number has exactly n distinct complex
nth roots.
At the same time we have been careful in not allowing the words nite
and innite into our theorems. It is time we come to grips with these matters.
First, we need axioms that allow us to speak of innite sets, by providing
for the existence of at least one such set.
1 Idries Shah, The Pleasantries of the Incredible Mulla Nasrudin, The Octagon Press,
1995.
2 Quoted in H. Herrlich, Axiom of Choice. Springer, 2006.
165
166 CHAPTER 4. INFINITY
Task 4.1.1 Prove that the Axiom of Regularity rules out the possibility of
A ∈ A. (Hint: Consider the set {A})
(a) ∅ ∈ I,
(b) If x∈I then x ∪ {x} ∈ I .
It may also, conceivably, have members that are not obtained in this fashion.
We have no control over these members, so we would like to eliminate them.
We do this as follows:
1. For each x∈I dene its successor S(x) = x ∪ {x}. Since S(x) ∈ I ,
we obtain a function S: I → I called the successor function.
T
4. Intersect all the inductive subsets: N = J. (Then x ∈ N ⇐⇒
(∀A ∈ J ) (x ∈ A))
5. Verify that N is inductive.
6. Thus N is the smallest inductive subset of I and hence the best candi-
date for not having any extraneous elements.
3
We name the members of N as follows:
1=∅={}
2 = S(1) = 1 ∪ {1} = ∅ ∪ {∅} = {∅} = {1}
3 = S(2) = 2 ∪ {2} = {1} ∪ {2} = {1, 2}
4 = S(3) = 3 ∪ {3} = {1, 2} ∪ {3} = {1, 2, 3}
and so on
1. S is a 1-1 function.
Our earlier results about N, since they are based only on the Peano
axioms, hold for all Peano systems. In particular, Theorems 3.1.1 and 3.1.2
do so. For convenience, we restate them in our new terminology:
With the help of these results we can show that there is essentially only
one Peano system.
2 Show that the Axiom of Regularity implies that there cannot be sets A
and B such that A∈B and B ∈ A.
3 Show that the Axiom of Regularity implies that there cannot be sets
A0 , A1 , . . . , An with A0 ∈ A1 ∈ · · · ∈ An ∈ A0 .
7 Let X be an inductive set. Show that the following are also inductive sets:
170 CHAPTER 4. INFINITY
(b) N is transitive.
9 Show that X is transitive if and only if its power set P(X) is transitive.
Dene
Theorem 4.2.4 A nite set is not in bijection with any proper subset of
itself.
172 CHAPTER 4. INFINITY
Proof: Suppose there are nitely many primes. Since their set is in bi-
jection with some Fn we can label them as p1 , p2 , . . . , pn . Now consider
q = p1 p2 · · · pn + 1. Since q ≥ 2 it has a prime factor p. But none of the pi
divide q (they all leave remainder 1). So this p is not present in our list of
all primes, a contradiction!
If Cantor's Theorem was our rst striking theorem, the theorem which
conrms our suspicion has our rst really clever proof !
Theorem 4.2.8 Let A⊆B⊆C and |A| = |C|. Then |A| = |B| = |C|.
S∞\ f (f (B)), and so on. Dene Xn+1 = f (Xn ) for each n ∈ N and set
f (f (C))
X = n=1 Xn . Observe that
S∞
1. f (X) = n=2 Xn ⊂ B .
2. C \ X ⊆ B, since x∈ / X =⇒ x ∈/ X1 =⇒ x ∈ B .
f (x) if x ∈ X
Dene g: C → B by g(x) = .
x if x ∈
/X
From the two observations it follows that g indeed maps C into B. Fur-
ther, it maps X into X and Xc into X c. Let us check that g is a bijection:
2 Prove that:
(a) If x∈
/A then |A ∪ {x}| = |A| + 1.
(b) |A ∪ B| ≤ |A| + |B|.
(c) If A∩B =∅ then |A ∪ B| = |A| + |B|.
(d) |A \ B| = |A| − |A ∩ B|.
(e) |A ∪ B| = |A| + |B| − |A ∩ B|.
5 Suppose |A| = n.
Qn
(a) Give a bijection between P(Fn ) and k=1 F2 .
n
(b) Show that |P(A)| = 2 .
9 Let A be a set.
(a) Prove that if there is a 1-1 map gn : Fn → A for every n∈N then A
is innite.
(b) Use (a) to prove that there are innitely many rational numbers be-
tween any two rational numbers.
13 Show that
176 CHAPTER 4. INFINITY
Our dilemma begins with a simple act, the act of selecting a member of
a set. One can hardly write any signicant amount of mathematics without
including a phrase of the form let x be a member of A. In formal work this
is accomplished by the existentially quantied statement ∃x ∈ A. Diculties
arise when we wish to make innitely many such selections during a proof.
If we express them one by one, we will have a proof that never ends. Can
we express them all at once? Let us see what that entails.
First, suppose we have a set A whose members are non-empty sets. Let
choice function on A is a
S
A be the union of all the members of
S A. A
function c: A → A such that c(S) ∈ S for every S ∈ A. A choice function
therefore selects a member of each set S from the collection A.
If we want to use choice functions to make selections, we should have
some guarantee that they exist. One way to accomplish this is through an
axiom:
Z9. Axiom of Choice: Let A be a set whose members are non-empty sets.
Then there is a a choice function on A.
Let's consider the situation before formulating the proof. For f to reverse
g, g needs to map each point of Y
to one of its pre-images under f . In
7 other words, we need to select a member of f
−1
({y}) for each y ∈ Y . This
requires a choice function.
Recall that we dened |A| ≤ |B| to mean the existence of a 1-1 function
f : A → B.
Task 4.3.2 Show that |A| ≤ |B| if and only if there is an onto function
g : B → A.
Task 4.3.3 Show that if B = N then the proof of Task 4.3.2 does not require
the Axiom of Choice. (Hint: Use the Well-Ordering Principle instead of
Axiom of Choice)
Now we shall see that the Axiom of Choice helps us complete our earlier
descriptions of nite and innite sets.
(a) It may appear cyclical to use f in its own denition. However, what
happens here is that we use the values of f (k) for k < n to dene
f (n). We use f (1) to x f (2). Then we use f (1) and f (2) to x f (3),
and we proceed in this manner. You should try to justify it formally
using the Recursion Theorem (Theorem 3.1.2).
(c) Suppose f (n) = f (m) with n < m. Then f (n) ∈ f (Fm−1 ) and
f (m) ∈ A \ f (Fm−1 ). Hence f (n) 6= f (m). Therefore f is 1-1.
Proof: We already know that such a bijection implies the set is innite. For
the converse, we need the Axiom of Choice, specically the implication that
if a set is innite then there is an injection from N into that set.
Theorem 4.3.7 A set is nite if and only if it has no bijection with a proper
subset of itself.
Proof: This is the contrapositive of the previous theorem. Note that the
implication in one direction was already established and did not need the
Axiom of Choice.
These are the only implications of the Axiom of Choice that we will
develop for now. You may wonder at this stage why we made a fuss before
presenting the Axiom and these few consequences. It may seem quite like
others for, on the face of it, all it does is formalize the act of selection.
The Axiom of Choice has been shown to be a powerful tool for proving
results about all sets, or all sets satisfying some general conditions. For
example, in the study of cardinality, it further implies that any two sets A
and B will be comparable: either |A| ≤ |B| or |B| ≤ |A|. Applications like
4
this make it popular with most mathematicians.
The objections to the Axiom also relate to its power: that it is too pow-
erful, that it allows the use of actions (unrestricted selections) that we can
never come close to performing in practice. This power has led to results that
are so startling that some deem them unacceptable. Consider the Banach-
Tarski Paradox. This is a proof that, using the Axiom of Choice, one can
divide a solid sphere in R3 into a few parts and then reassemble them, us-
ing only transformations that do not change volume, into two spheres of
the same radii as the original! In other words, we can double volume, even
though we are using transformations that do not change volume. This para-
dox is possible because the Axiom of Choice allows us to construct strange
sets whose volume is not dened. Since their volume is not dened, the fact
that we use transformations that preserve volume becomes irrelevant. Thus
the Banach-Tarski Paradox does not cause any logical issues within mathe-
matics. But it does cause mathematicians to wonder if the Axiom of Choice
is too much of a good thing.
4 Various collections of non-empty sets are given below, and we wish to have
a choice function for each. For which collections is it necessary to invoke the
Axiom of Choice and for which can we avoid its use?
(e) An innite collection of pairs of socks. (Each pair has two identical
socks)
10 Read the graphic novel Logicomix: an epic search for truth by Apostolos
Doxiadis and Christos H. Papadimitriou [3].
Task 4.4.1 Show that a set is countable if and only if it is nite or countably
innite.
Task 4.4.2 Show that a set A is countable if and only if there is an onto
function f : N → A.
We have already mentioned the notation |A| = n for a nite set A which is
in bijection with Fn . IfA is countably innite we use the notation |A| = ℵ0 . 5
So far, we know that |N| = |Z| = |Q| = ℵ0 .
5ℵ is a Hebrew letter called `aleph' and so ℵ0 is read as `aleph-nought' or `aleph-null'.
182 CHAPTER 4. INFINITY
Proof: Let A be a countable set whose members are also countable sets. We
can assume that each member is non-empty.
g(m, n) = ψm (n).
We claim that g is onto, leaving the justication for you to work out. Com-
posing
S g with a bijection from
S N to N × N, we get an onto function from N
to A. Hence A is countable.
Task 4.4.4 The proof of Theorem 4.4.3 uses the Axiom of Choice in one
place. Locate it.
Therefore X is innite.
We dene a new member g∈X by using the diagonal values (circled in the
diagram) and changing them. A value 1 is changed to 0 and a value 0 is
changed to 1. This is done by setting
g(n) = 1 − ϕn (n).
Now the function g does not equal any of the functions ϕn , since g(n) 6=
ϕn (n). Therefore the function ϕ: N → X is not onto. Hence there is no onto
function from N to X. So there is no bijection from N to X. Hence X is not
countably innite. Therefore it is uncountable.
Then g is a bijection.
Proof: The case of nite B is already done (if you have completed Task
4.4.6). So we assume that B is countably innite.
184 CHAPTER 4. INFINITY
Then g is a bijection.
Cardinality of R
Theorem 4.4.8 R has the same cardinality as P(N).
Proof: We have seen earlier that R is in bijection with the interval [0, 1).
Also, we can identify P(N) with the set X of functions from N to{0, 1}.
Therefore it is enough to show that [0, 1) has the same cardinality as X .
Take any real number x ∈ [0, 1). It has a unique binary representation
x = 0.x1 x2 . . . xi is 0 or 1 and the xi 's are not eventually 1. (That
where each
is, there is no stage after which all the xi equal 1) We dene a corresponding
function fx ∈ X by fx (i) = xi .
Thus we have a 1-1 map from [0, 1) into X. Is this map onto? No, for
the constant function f (i) = 1 is not in its range. The functions that are
missing from the range are exactly those which are eventually 1, i.e. there
is some N such that for n≥N the function takes only the value 1.
Hence, we dene:
3 The dyadic rationals have the form m/2n , m ∈ Z and n ∈ N. Show that
the set of dyadic rationals is countably innite.
6 Prove that the set of all nite subsets of a countable set is countable.
7 Consider the set of sequences of 0's and 1's that are constant after some
entry. Show this set is countably innite.
(a) The set of polynomials with integer coecients and degree n or less.
(c) The set of real numbers which are roots of polynomials with integer
coecients. (Such numbers are called algebraic numbers )
{1, 2, 3}
(a) For any partially ordered set, the empty set is a chain.
(d) If C and D are chains in X and every element in C is less than every
element in D then C ∪ D is a chain.
We now state and prove a result known as Zorn's Lemma. The proof
requires the Axiom of Choice. Zorn's Lemma is often the means through
which the Axiom of Choice is applied to a proof. It is suitable for showing
the existence of objects whose construction can be visualized in a piece-by-
piece manner.
6
Proof: Let us assume that X has no maximal element and then strive to
reach a contradiction.
The rst step is to show that, given two f -chains, their union is a chain.
Thus, let K and K 0 be f -chains with K K 0 . We have K∪K 0 = K∪(K 0 \K).
Let x ∈ K 0 \ K . Dene
0
C = {x ∈ K ∩ K 0 : x < x0 }.
6 This proof is due to J. D. Weston, A short proof of Zorn's Lemma, Archiv der Math-
ematik, October 1957.
188 CHAPTER 4. INFINITY
K0 = { K ∈ P(X) | K is an f -chain } is a
S
It follows immediately that
0 0
chain. Further, if x, x ∈ K0 with x < x, then for any f -chain K , x ∈ K
0 0 0 0 0 0
implies x ∈ K . (Let x belong to an f -chain K . If x ∈ K \ K then x < x ,
a contradiction)
We have found an upper bound for every chain. Hence, by Zorn's Lemma,
X has a maximal element (M, fM ). We need to show that M = A. If not,
S a ∈ A \ M.
there is a set Pick an element ta of a. Dene N = M ∪ {a} and
fN : N → N by
fM (x) x ∈ M
fN (x) = .
ta x=a
Then (M, fM ) < (N, fN ), which is a contradiction. Hence M =A and fM
is a choice function on A.
Proof: We'll show that if |A| ≤ |B| is false then |B| ≤ |A| is true. Suppose
|A| ≤ |B| is false. Then there is no 1-1 map from A to B . Consequently
there is no onto map from B to A (This requires Axiom of Choice).
Task 4.5.5 Show that the rule for comparing cardinality follows trichotomy.
That is, for any two sets A and B, exactly one of the following holds: |A| =
|B|, |A| < |B|, |B| < |A|.
Well-Ordering Theorem
Proof: Let A be a set. We shall show that it has a total order that makes it
well-ordered.
(a) B ⊆ C,
(b) For any x, y ∈ B we have x ≤B y ⇐⇒ x ≤C y , and
(c) If x ∈ B , y ∈ C , y ≤C x, then y ∈ B .
{ B ∈ P(A) | (∃ ≤B )((B, ≤B ) ∈ C) }.
S
3. If C is a chain in X , dene U =
Note that if x, y ∈ U then there is a (B, ≤B ) ∈ C such that x, y ∈ B .
Dene x ≤U y if x ≤B y .
Like Zorn's Lemma, the Well-Ordering Theorem too implies the Axiom
of Choice. Thus these three form a single package once the ZF axioms are
assumed.
We have selected one partial order out of possibly many that qualify as
7 well-orderings of
S
A. However, as it is just a single selection, the Axiom
of Choice is not required.
S
Dene c: A → A by letting c(A) be the least element of A with respect
to ≤.
6 Provide a well-ordering of Q.
7 X be a well-ordered set
Prove the Principle of Transnite Induction : Let
and let S ⊆ X have the property that whenever { x ∈ X | x < s } is a subset
of S then s ∈ S . Then S = X .
193
194 APPENDIX A. SUPPLEMENTS
This raises the question of whether a statement can fall through the
cracks. Can a statement be such that neither it nor its negation is derivable?
choice whether to consider it true or not, in the sense that logic places no
restrictions on it.
As we have seen over the course of this book, the natural numbers N are
fundamental to mathematics. So we would very much want an axiomatiza-
tion of N that has the following properties:
A very remarkable achievement was the proof given by Gödel in 1930 that
this is impossible! His First Incompleteness Theorem implies that any con-
sistent axiomatization that can prove the standard properties of N would be
incomplete.
These very brief remarks on the subtle topic of provability should not be
taken as being complete, in any sense. If the issues mentioned here pique
your interest you can nd more details in books on symbolic or mathematical
logic, such as Goldrei [4], Leary and Kristensen [18] or Wilder [24].
rules out the mention of such things as `the set of all sets'. Many other
conceivable collections are also ruled out as sets. For example, we cannot
have `the set of all sets with exactly two elements'. For if this was a set
then the union over its members would be a set, and that union is the entire
universal domain B. Since any set with two members can be viewed as a
eld (set one member as `0' and the other as `1') we also cannot have `the
set of all elds'.
While this approach does eliminate the known paradoxes, it also causes
inconvenience since we often wish to make statements about arbitrary sets
of a common type. For example, consider the statement that the additive
identity of any eld is unique. If we were allowed to use the collection F of
all elds then we could easily express this symbolically using quantiers as
The scientist John von Neumann proposed a larger system that encom-
passes both sets as well as certain useful collections that do not qualify as
2
sets and whose use is correspondingly restricted. His system was recast and
simplied by Paul Bernays and Kurt Gödel, and the resulting collection of
axioms is known as the von Neumann-Bernays-Gödel or NBG system. In
the NBG system, our starting point is a universe of things called classes
together with a relationship called membership. A class is called a set if
it is a member of some class. A class which is not a set is called a proper
class.
We'll not give a systematic development of the NBG system. If you wish
to see one, we recommend Pinter [9]. For our part, we merely present some
of the more striking facts. Some of these are axioms, others are theorems. If
they whet your appetite or arouse your suspicions, do consult Pinter for the
full story.
First, the sets of NBG are the same as the sets of ZF, in that they follow
the same axioms. If a theorem about sets is true in ZF then it is also true
in NBG and vice-versa. Thus NBG can be seen as an enlargement of ZF.
Just as the ZF axioms allow the construction of sets in various ways, the
NBG axioms allow various constructions of classes in general. The rst NBG
axiom for classes describes equality:
• Every class C has a complement whose members are the sets that do
not belong to C.
Together with the existence of the empty set, this implies that we have a
class whose members are all the sets. So, while we do not have a `set of all
sets' we do have the `class of all sets'. Since the class of all sets is not a set,
it is a proper class and cannot be a member of a class. Hence there is no
`class of all classes'.
• Let P (x) be a property of sets. Then there is a class whose members are
the sets that satisfy this property. We denote this class by { x | P (x) }.
In the NBG context one may choose to deploy a stronger version of the
Axiom of Choice:
Task A.3.1 Show that a cut has the form hqi if and only if it has a supre-
mum (in Q).
Task A.3.3 A be a
Let cut and ε ∈ Q+ . Show that ∃a ∈ A such that
x > a + ε implies x ∈
/ A.
Let R be the set of all cuts. We equip it with binary operations + (addition)
and • (multiplication):
A + B = {a + b | a ∈ A and b∈B}
A B = { ab | a ∈ A
• and b∈B}
Let us check that A + B is indeed a cut. First, since A and B are non-empty,
we have an element a ∈ A and an element b ∈ B . Then a + b ∈ A + B and so
A + B is non-empty. Second, if a is an upper bound of A and b is an upper
bound of B then a+b is an upper bound of A+B . Third, if x = max(A+B),
let x = a + b with a ∈ A and b ∈ B . Then
a0 ∈ A =⇒ a0 + b ∈ A + B =⇒ x = a + b ≥ a0 + b
=⇒ a ≥ a0 =⇒ a = max(A)
ax bx ax bx
<a and < b =⇒ ∈ A and ∈B
a+b a+b a+b a+b
ax bx
=⇒ x = + ∈ A + B.
a+b a+b
We can similarly check that A•B is a cut.
Task A.3.4 Show that if p, q ∈ Q then hpi+hqi = hp+qi and hpi • hqi = hpqi.
DC10. A + B = A + C =⇒ B = C .
DC11. A • B = A • C =⇒ B • C .
Finally, we have the existence of supremum and inmum:
(A, B) + (C, D) = (A + C, B + D)
(A, B) • (C, D) = (A • C + B • D, A • D + B • C)
R3. The element (h1i, h1i) is the additive identity. (We call it zero and
denote it by 0.)
R4. The additive inverse of (A, B) is (B, A). We denote the additive inverse
by −(A, B).
R5. Multiplication is commutative and associative.
R7. The element (h2i, h1i) is the multiplicative identity. (We call it unity
or one and denote it by 1.)
R8. If (A, B) 6= 0 then it has a multiplicative inverse.
In comparison with the facts for Z, R8 is new and so we should prove it.
First we note that if (A, B) 6= 0 then A 6= B . If A > B then A = B + C
for some C ∈ R . Let C be the multiplicative inverse of C in R . Then
−1
R+ = { (A, B) | B ≤ A }.
The additive inverses of the positive real numbers are called negative and
their set is denoted by R− .
R9. R = R+ ∪ {0} ∪ R− is a disjoint union.
φ(A + B) = φ(A) + φ(B), φ(A • B) = φ(A) • φ(B), A < B =⇒ φ(A) < φ(B).
202 APPENDIX A. SUPPLEMENTS
φ(x+y) = φ(x)+φ(y), φ(x • y) = φ(x) • φ(y), and x < y =⇒ φ(x) < φ(y).
Proof: We give a very brief sketch of the proof, as readers who have persisted
this far can likely ll in the details themselves! If help is needed, you can
consult Mendelson [8].
One should not just translate into symbols as one reads. That may result
« in 2 + 2 = 4 ∧ 5, which would not make sense as `5' is not a statement.
This is the negation of `Two plus two equals ve'. So its symbolic
expression is ¬(2 + 2 = 5).
(d) It is false that two plus two does not equal ve.
This is the negation of `Two plus two does not equal ve', which in
turn is the negation of `Two plus two equals ve'. Hence it can be
expressed as ¬(2 + 2 6= 5) or as ¬(¬(2 + 2 = 5)).
3 If the given statement had been simply `One of P , Q is true', the interpre-
tation would have been that at least one of them is true, and the symbolic
version would be P ∨ Q. The use of `Exactly one' means we have to exclude
the case when both are true. This can be done by incorporating the negation
203
204 APPENDIX B. SOLUTIONS
of P ∧Q as follows:
(P ∨ Q) ∧ ¬(P ∧ Q)
Another way to think about the statement is that it allows for two possibil-
ities: P is true and Q is false, or else P is false and Q is true. Thus we can
also express it as:
(P ∧ ¬Q) ∨ (¬P ∧ Q)
5 Suppose the two statement variables are P and Q. A truth table for some
combination P ∗Q will have the following format:
P Q P ∗Q
T T
T F
F T
F F
Each position in the third column has two possible entries (T and F), hence
the number of possible tables is 24 = 16. We can study how to generate
them by considering various cases.
Before we begin, let us also note that in this context `but' is to be taken
as `and'.
We can see if the two versions always have the same meaning by considering
various possibilities for the given gure:
(b) If the gure is a polygon but not a square, both are true.
1.2
P Q P =⇒ Q
T T T
T F F
F T T
F F T
206 APPENDIX B. SOLUTIONS
Yes. Only in the rst row are both P =⇒ Q and P true. And in
that row Q is true.
Yes.
3 We have to express the following symbolically using ∧, ∨, ¬, =⇒ :
(a) If not P or not Q, then it is not the case that P or Q.
As a rst step let us put some brackets to help understand the overall
structure:
P Q ¬P (¬P ) ∨ Q
T T F T
T F F F
F T T T
F F T T
P Q ¬Q ¬P ¬Q =⇒ ¬P
T T F F T
T F T F F
F T F T T
F F T T T
1.3
P Q P ∧Q P ∨Q (P ∧ Q) =⇒ (P ∨ Q)
T T T T T
T F F T T
F T F T T
F F F F T
P Q R Q∨R P P P (P =⇒ Q)
=⇒ =⇒ =⇒ ∨
(Q ∨ R) Q R (P =⇒ R)
T T T T T T T T
T F T T T T T T
F T T T T T T T
F F T T T T T T
T T F T T T F T
T F F F F F F F
F T F T T T T T
F F F F T T T T
The two gray columns match exactly and so they are equivalent.
1.4
3 If there were any dragons at all, then this implication would hold. But if
there are actually no dragons, then Every dragon is red is true but There
is a red dragon is false.
7 If any pair among the following consists of equivalent statements they will
have the same value no matter what variable values x, y or open statement
P we consider. We also have to remember that the quantiers are applied
from left to right. This is important for (c).
9
(a) ∀x P (x) =⇒ (Q(x) ∨ R(x)) ⇐⇒ ∀x (P (x) =⇒ Q(x)) ∨
∀x (P (x) =⇒ R(x)) .
Suppose x varies over the integers. Let P (x) be the statement x 6= 0,
Q(x) be x > 0, and R(x) be x < 0. Then ∀x P (x) =⇒ (Q(x) ∨
R(x)) is true but ∀x (P (x) =⇒ Q(x)) ∨ ∀x (P (x) =⇒ R(x)) is
false.
(b) ∀x (P (x) ∨ Q(x)) =⇒ R(x)] ⇐⇒ ∀x (P (x) =⇒ R(x)) ∨
∀x (Q(x) =⇒ R(x)) .
Suppose x varies over the integers. Let P (x) be the statement x 6= 0,
be x > 0. Then (P (−1) ∨ Q(−1)) =⇒ R(−1) is false,
Q(x) and R(x)
and so ∀x (P (x) ∨ Q(x)) =⇒ R(x)] is false. But ∀x (Q(x) =⇒
R(x)) is true and so ∀x (P (x) =⇒ R(x)) ∨ ∀x (Q(x) =⇒ R(x)) is
true.
(c) ∀x P (x) =⇒ (Q(x) ∧ R(x)) ⇐⇒ ∀x (P (x) =⇒ Q(x)) ∧
∀x (P (x) =⇒ R(x)) .
This is always true.
11 (a) M is free and x is bound (by ∃x).
1.5
1 (a) Integer solutions of (x2 − 1)(x2 − 4) = 0: {1, −1, 2, −2} or {±1, ±2}.
(b) Subsets of {a, b}: {∅, {a}, {b}, {a, b}}.
(c) Prime factors of 60: {2, 3, 5}.
(d) Odd natural numbers: {1, 3, 5, 7, . . . }.
3 (a) First, use the Axiom of Pairs to create {A, B} and {C}. Use the
Axiom of Pairs again to createX = {{A, B}, {C}}. Apply the Axiom
of Union to X to obtain Y = {A, B, C}. Finally, apply the Axiom of
Union to Y to get A ∪ B ∪ C .
7 (a) (A ∪ B) ∩ C ⊆ A ∪ (B ∩ C):
Consider any x ∈ (A ∪ B) ∩ C . Then x ∈ A ∪ B and x ∈ C . Consider
both cases ofx ∈ A ∪ B . If x ∈ A then x ∈ A ∪ (B ∩ C). If x ∈ B
then x ∈ B ∩ C , and so again x ∈ A ∪ (B ∩ C). Hence in both cases
we have x ∈ A ∪ (B ∩ C).
(b) (A ∪ B) ∩ (A ∪ C) ∩ (B ∪ C) = (A ∩ B) ∪ (A ∩ C) ∪ (B ∩ C):
212 APPENDIX B. SOLUTIONS
(A ∪ B) ∩ (A ∪ C) ∩ (B ∪ C) = [(A ∪ B) ∩ (A ∪ C)] ∩ (B ∪ C)
= [A ∪ (B ∩ C)] ∩ (B ∪ C)
= [A ∩ (B ∪ C)] ∪ [(B ∩ C) ∩ (B ∪ C)]
= [(A ∩ B) ∪ (A ∩ C)] ∪ [B ∩ C]
= (A ∩ B) ∪ (A ∩ C) ∪ (B ∩ C)
9 (a) B ⊆ A =⇒ A \ (A \ B) = B :
Since B ⊆ A, A\B is just the complement of B in A. Hence A\(A\B)
is the complement of the complement of B , i.e., B itself.
(b) A ∩ B = ∅ =⇒ A \ (B \ A) = A \ B :
Let us consider A and B as subsets of some universal set U (for
example, we could take U = A ∪ B ). Then, taking complements with
respect to U, we calculate as follows:
A \ (B \ A) = A ∩ (B ∩ Ac )c = A ∩ (B c ∪ A).
A \ (B \ A) = A ∩ (B c ∪ A) = A ∩ B c = A \ B.
1.6
(∀x ∈ Q)[x > 0 =⇒ (∃y ∈ Q)(y 2 = x)]. False, as 2 does not have a
rational square root.
5 (a) For every real number x there is a natural number n that is greater
than x.
(∀x ∈ R)(∃n ∈ N)(n > x).
Negation: (∃x ∈ R)(∀n ∈ N)(n ≤ x).
Negation in words: There is a real number that is greater than or
equal to every natural number.
(c) If two lines meet at two distinct points then they are the same line.
Let P be the set of points and L be the set of lines. Then the given
statement is expressed symbolically as:
13 (a)
T
F =T{ (0, 1/n)Q | n ∈ N }. We have to show that F = ∅. Let
x ∈ F . Then x ∈ (0, 1)Q and so x is a positive rational. Hence
x = p/q with p, q ∈ N. But then x ∈ / (0, 1/q), a contradiction.
S
(b) G = { (0, 1 − 1/n)Q | n ∈ N }. We have to show that S G = (0, 1)Q .
First, since each (0, 1 − 1/n)Q ⊆ (0, 1)Q , we have G ⊆ (0, 1)Q . In
the other direction, consider any x ∈ (0, 1)Q . It has the form x = p/q
with p, q ∈ N and p < q . Hence x ∈ (0, 1 − 1/(2q)).
2.1
1 We have to show that (x, y) = {{x, ∅}, {{∅}, y}} can also dene an ordered
pair. That is, we have to show that (x, y) = (a, b) if and only if x = a and
y = b. The `if ' direction is trivial. So let us assume that (x, y) = (a, b). Then
{{x, ∅}, {{∅}, y}} = {{a, ∅}, {{∅}, b}}
Case I: {x, ∅} = {a, ∅} and {{∅}, y} = {{∅}, b}. It is immediate that
x=a and y = b.
Case II:{x, ∅} = {{∅}, b} and {{∅}, y} = {a, ∅}. Here, x = {∅} = a
and b = ∅ = y.
3 A, B, C are any sets.
(a) A × ∅ = ∅ × A = ∅:
If (x, y) ∈ A × ∅ then y ∈ ∅, a contradiction.
(b) A × B = B × A ⇐⇒ A = B or A=∅ or B = ∅:
The reverse implication is trivial. For the forward implication, sup-
pose that A × B = B × A, A 6= ∅ and B 6= ∅. We have to show
A = B. Leta ∈ A. Since B 6= ∅, we can choose some b ∈ B . Then
(a, b) ∈ A × B = B × A =⇒ a ∈ B.
Hence A ⊆ B. Similarly, B ⊆ A.
(c) Suppose A 6= ∅. Then A × B = A × C ⇐⇒ B = C :
Similar to (b).
5 Are the given sets of the form A × B ?
(a) X = { (x, y) ∈ R2 | xy = 0 }:
X = A × B . Then (1, 0) ∈ X =⇒ 1 ∈ A and (0, 1) ∈
Suppose
X =⇒ 1 ∈ B . Hence (1, 1) ∈ A × B = X , a contradiction.
(b) Y = { (x, y) ∈ R2 | |x + y| + |x − y| ≤ 2 }:
By trying out various cases of the signs of x and y we can establish
that |x + y| + |x − y| always equals the greater among 2|x| and 2|y|.
For example, suppose x ≥ y ≥ 0. Then x±y ≥ 0 and |x+y|+|x−y| =
(x + y) + (x − y) = 2x = 2|x| = max(2|x|, 2|y|).
Hence, Y = { (x, y) ∈ R2 | |x| ≤ 1 ∧ |y| ≤ 1 } = [−1, 1] × [−1, 1].
7 We are given A ⊆ X and are asked whether this implies that (A × Y )c =
c
A ×Y. Let us consider a point (x, y) ∈ X × Y . Then,
(x, y) ∈ (A × Y )c ⇐⇒ (x, y) ∈
/ A × Y ⇐⇒ x ∈
/A
⇐⇒ x ∈ Ac ⇐⇒ (x, y) ∈ Ac × Y.
(x, y) ∈ (A × B)c ⇐⇒ x ∈
/ A∨y ∈
/B
⇐⇒ (x, y) ∈ Ac × Y ∨ (x, y) ∈ X × B c
⇐⇒ (x, y) ∈ (Ac × Y ) ∪ (X × B c ).
(d)
2.2
1 R is a relation on X .
(a) R is reexive: Then x ∈ X =⇒ xRx =⇒ x ∈ dom(R)∧x ∈ ran(R).
Hence dom(R) = ran(R) = X .
216 APPENDIX B. SOLUTIONS
Since integers that dier by multiples of k are in the same equivalence class,
the distinct equivalence classes are [0], [1], . . . , [k − 1].
7 We are given a relation R on X such that the sets x̄ = { y ∈ X | xRy }
are pairwise disjoint and x ∈ x̄ for each x ∈ X. We shall prove that R is an
equivalence relation:
Reexive: x ∈ x̄ =⇒ xRx.
Symmetric: xRy =⇒ y ∈ x̄ =⇒ x̄ = ȳ =⇒ x ∈ ȳ =⇒ yRx.
Transitive: xRy ∧ yRz =⇒ xRy ∧ zRy =⇒ y ∈ x̄ ∩ z̄ =⇒ x̄ = z̄ =⇒ z ∈
x̄ =⇒ xRz .
9 We are asked whether the following is a correct argument that symmetry
and transitivity together imply reexivity: aRb =⇒ bRa =⇒ (aRb ∧
bRa) =⇒ aRa. In order to conclude aRa, the argument requires the
existence of some b such that aRb holds. There may not be such a b! As an
extreme example, consider the empty relation.
(a) If A and B are bounded above then A∪B will be bounded above.
2.3
1 2 × 2 × 2 ×
1 × 1 × 1 ×
0 × 0 × 0 ×
−1 × −1 × −1 ×
−2 × −2 × −2 ×
× × × × × × × × × × × × × × ×
−2 −1 0 1 2 −2 −1 0 1 2 −2 −1 0 1 2
(a) (b) (c)
(a) Not a function. The points (0, 0) and (0, 1) violate the requirement
(xRy ∧ xRz) =⇒ y = z .
(b) This is a function.
(c) Not a function. There is no y such that 0Ry and so the domain is
not all of X.
3 Equivalence classes for f (x, y) = x + y : Parallel straight lines with slope
−1.
Equivalence classes for g(x, y) = x2 + y 2 : Concentric circles with centre
at origin.
x y
f (x) = f (y) =⇒ =
1 + x2 1 + y2
=⇒ x + xy 2 = y + x2 y =⇒ x − y = xy(x − y).
It is easy to see that g is not injective either, since its values are the
same for ±x. For example, g(1) = g(−1) = 1/2.
For h, we rst note that h(x) has the same sign as x. Hence h(x) =
h(y) implies xy ≥ 0. So,
x3 y3
h(x) = h(y) =⇒ 2
= =⇒ x3 + x3 y 2 = y 3 + x2 y 3
1+x 1 + y2
=⇒ (x − y)(x2 + xy + y 2 + x2 y 2 ) = 0 =⇒ x = y.
x 1
0 ≤ (x − 1)2 = 1 + x2 − 2x =⇒ 2x ≤ 1 + x2 =⇒ ≤ .
1 + x2 2
Hence f (x) has no values greater than 1/2 and is not surjective.
Then h is onto.
13 Let f : R2 → R be given by f (x, y) = x2 + y2 . Pictures of f −1 ({t}) and
−1
f ([a, b]) are shown below:
15 Composition is associative:
(h ◦ (g ◦ f ))(x) = h((g ◦ f )(x)) = h(g(f (x))) = (h ◦ g)(f (x)) = ((h ◦ g) ◦ f )(x)
17 Consider a function f : X → Y .
(a) ∃g : Y → X such that f ◦ g = 1Y only if f is onto.
2.4
[
x∈A∪( Xi ) ⇐⇒ x ∈ A ∨ (∃i ∈ I)(x ∈ Xi )
i∈I
⇐⇒ (∃i ∈ I)(x ∈ A ∨ x ∈ Xi )
[
⇐⇒ (∃i ∈ I)(x ∈ A ∪ Xi ) ⇐⇒ x ∈ (A ∪ Xi )
i∈I
\ \
(b) A∩( Xi ) = (A ∩ Xi ).
i∈I i∈I
\ \
(b) f −1 ( Yi ) = f −1 (Yi ).
i∈I i∈I
2.5
Then g ◦ f = 1N and f ◦ g 6= 1N .
222 APPENDIX B. SOLUTIONS
5 Let a, b ∈ R with a < b. Consider the equation of the line joining (a, 0) to
(b, 1):
y−a
f (x) =
b−a
This denes a bijection f : [a, b] → [0, 1].
7 We are asked to use the following diagram to derive bijections f : (−1, 1) →
R and g : R → (−1, 1).
y -axis
1
P
−1 ( x
)
1 f (x) x-axis
|x| − 0 1−0 x
= =⇒ f (x) = .
x − f (x) 0 − f (x) 1 − |x|
y
The inverse function is g(y) = .
1 + |y|
9 Each plane passing through the origin is mapped to its normal line at the
origin. Specically, the plane given by Ax + By + Cz = 0 is mapped to the
x y z
line given by = = .
A B C
11 We have A, B ⊆ X , with |A| = |B|. Can we assert the following?
√
13 For any x = (a, b) ∈ R2 dene its length by kxk = a2 + b2 . Consider
2
the map f: D →R dened by
1
f (x) = x.
1 − kxk
2.6
3 For ease of calculation, view all the sets involved as lying in a common
`universal set' U.
(a) Symmetric dierence is commutative and associative.
A4B = (A \ B) ∪ (B \ A) = B4A.
A4(B4C) =A4[(B ∩ C c ) ∪ (C ∩ B c )]
=[A ∩ (B c ∪ C) ∩ (B ∪ C c )] ∪ [(B ∩ C c ) ∪ (C ∩ B c )] ∩ Ac
=(A ∩ B c ∩ C c ) ∪ (A ∩ B ∩ C) ∪ (Ac ∩ B ∩ C c )
∪ (Ac ∩ B c ∩ C)
224 APPENDIX B. SOLUTIONS
(b) We already know, from the `laws of logic' in the rst chapter, that ∧
is associative and commutative. Since T ∧F =F and T ∧ T = T, we
see that T is the identity for ∧.
(c)
9 The multiplication table for S3 :
T0 T120 T240 R1 R2 R3
T0 T0 T120 T240 R1 R2 R3
T120 T120 T240 T0 R3 R1 R2
T240 T240 T0 T120 R2 R3 R1
R1 R1 R2 R3 T0 T120 T240
R2 R2 R3 R1 T240 T0 T120
R3 R3 R1 R2 T120 T240 T0
3 4
Let us count how many members D4 will have. We have 4 choices for
σ(1). Once σ(1) has been xed, we have two choices for σ(2). And then σ(3)
has to be opposite σ(1) while σ(4) has to be opposite σ(2). So the number
of possibilities is 4 × 2 = 8. (While the total number of bijections is 4! = 24)
The possible symmetries are listed below. Each is given as a grid of two
rows. Every number in the lower row is the image of the number directly
226 APPENDIX B. SOLUTIONS
above it.
1 2 3 4 1 2 3 4
1 7→ 1 :
1 2 3 4 1 4 3 2
Identity Reection
1 2 3 4 1 2 3 4
1 7→ 2 :
2 3 4 1 2 1 4 3
Rotation Reection
1 2 3 4 1 2 3 4
1 7→ 3 :
3 4 1 2 3 2 1 4
Rotation Reection
1 2 3 4 1 2 3 4
1 7→ 4 :
4 1 2 3 4 3 2 1
Rotation Reection
0 1 2
0 0 1 2
1 1
2 2
Note that under the given conditions, the cancellation law holds.
0 1 2
0 0 1 2
1 1 0
2 2 0
0 1 2
0 0 1 2
1 1 2 0
2 2 0 1
The symmetry of the table about the diagonal shows the commutativity of
∗.
3.1
3 4=3+1 (denition of 4)
= (2 + 1) + 1 (denition of 3)
= 2 + (1 + 1) (associativity of +)
=2+2 (denition of 2)
One can use mathematical induction to prove that the rst coordinate of
Φ(n) is n. Hence Φ(n) = (n, F (n)) for each n ∈ W. Then Φ(n + 1) =
G(n, F (n)) = (n + 1, (n + 1)F (n)) implies that F (n + 1) = (n + 1)F (n).
The function F is called the factorial function, and we commonly write
n! for F (n).
3.2
A = {n ∈ N | 1 < n ≤ a and n | a }.
13 There are pk natural numbers that are less than or equal to pk . The ones
which are not coprime to pk are the ones that have p as a factor. These have
the form pm with 1 ≤ m ≤ pk−1 , hence are pk−1 in number.
15 Proof by induction:
Dene k = T (1). Then T (1) = k · 1 is true. Assume for some n ∈ N that
T (n) = k · n is true. Then, T (n + 1) = T (n) + T (1) = k · n + k = k(n + 1).
3.3
7 Apply the usual division algorithm to b and |a| to get b = p|a| + r with
0 ≤ r < |a|. Then b = qa + r, where q = p|a|/a.
9 Since gcd(m, a) = 1 we have xm + ya = 1 for some integers x and y. Then
xmb + yab = b. If m | ab then m | xmb + yab = b.
11 hki = Zn ⇐⇒ 1 ∈ hki ⇐⇒ (∃m ∈ N) 1 = mk
⇐⇒ (∃m ∈ N) n | (mk − 1)
⇐⇒ (∃m, m0 ∈ N) mk − m0 n = 1 ⇐⇒ gcd(k, n) = 1.
3.4
1 You must have seen the standard proof in school. It goes like this. Suppose
there is a rational number x x2 = 2. We can assume it is positive.
such that
Then it has the reduced form p/q where p, q ∈ N and gcd(p, q) = 1. Now we
2 2 2 2 2
have p /q = 2, hence p = 2q . Therefore p is even. Therefore p is even.
2 2 2 2 2
Hence p = 2k for some k ∈ N. Then 4k = 2q implies 2k = q , hence q is
even, hence q is even.
1 1 1 1 1
k = T (1) = T ( + · · · + ) = T ( ) + · · · + T ( ) = nT ( )
n n n n n
1 k
=⇒ T (x) = T ( ) = = kx.
n n
Now consider any x = m/n with m, n ∈ N. Then
m 1 1 1 1 1
T (x) = T () = T ( + · · · + ) = T ( ) + · · · + T ( ) = mT ( )
n n n n n n
k
=⇒ T (x) = m • = kx.
n
Finally, for any x ∈ Q+ :
β1 βl
n = q1 · · · ql be the factorizations into powers of distinct primes. Then the
pi and the qj are all distinct from each other.
From E(1/n)n = E(1) it follows that each αi and βj is divisible by every
n. This is impossible.
3.5
a + b = a + c =⇒ (−a) + (a + b) = (−a) + (a + c)
=⇒ ((−a) + a) + b = ((−a) + a) + c
=⇒ 0 + b = 0 + c =⇒ b = c.
a−x 1 1
ax ≥ 1 and a ≥ x =⇒ a − x ≥ =⇒ a − x ≥ −
ax x a
1 1
=⇒ x + ≤ a + .
x a
9 Let F be an ordered eld and A ⊆ F.
(a) A has an upper bound if and only if −A has a lower bound :
Let us check that ψ preserves addition, i.e., ψ(m + n) = ψ(m) + ψ(n) for
all m, n ∈ Z. This is trivial if either number is zero, or if both have the same
sign. The remaining case is when m>0 and n < 0. Then:
3.6
3 Let X ⊆ F be non-empty and bounded above. Let B be the set of all strict
upper bounds of X : B = { b ∈ F | (∀x ∈ X)(x < b) }. Let A be the set of all
strict lower bounds of B : A = { a ∈ F | (∀b ∈ B)(a < b) }. Then:
1. X ⊆ A =⇒ A 6= ∅.
2. If b is an upper bound of A then b+1 is a strict upper bound. Hence
6 ∅.
B=
3. a ∈ A, b ∈ B =⇒ a < b.
4. Suppose a∈
/ B. If there is b ∈ B such that b ≤ a, then a is a strict
upper bound of A and hence a ∈ B . So for every b ∈ B we have a < b.
Therefore a ∈ A. Hence A ∪ B = F.
Applying the given property of F we obtain a unique r ∈ F such that a≤
r≤b for every a∈A and b ∈ B . We claim that r = sup(X).
5 Fix a ∈ R+ and dene A = { x ∈ R+ | x2 ≤ a }.
(a) From Task 3.5.10 we know that 0 < x < y =⇒ 0 < x2 < y 2 . Hence
a can have at most one positive square root.
(b) Suppose every a > 1 has a positive square root. Now consider b < 1.
Then 1/b > 1 has a positive square root y , and 1/y is a positive
square root of b.
1 c2
d< c+ = c,
2 c
which contradicts c being the least upper bound of A.
236 APPENDIX B. SOLUTIONS
a 1
(f) Suppose c2 < a. Let e= c+ . Then, using Exercise 7 of
a+1 c
3.5,
a 2 1 a 2 1
e2 = c2 + 2 + 2 < a + + 2 = a,
a+1 c a+1 a
hence e ∈ A. Also,
a 1 a
e= c+ ,
a+1 a+1c
implies that e lies strictly between c and a/c. Now c2 < a implies
c < a/c and hence c < e. Since e ∈ A, this contradicts c being an
upper bound of A.
Combining (e) and (f ) with trichotomy gives c2 = a.
7 Imitate the steps outlined for the previous exercise.
c
9 We are given a ∈ R and c ∈ R+ such that 0≤a≤ for every n ∈ N. If
n
+
a 6= 0 then a ∈ R and, by the Archimedean property, there is n∈N such
that na > c. This contradicts the given information. Hence a = 0.
11 We can prove this by mathematical induction. The n=1 case is trivial.
The n=2 case has been proved earlier. Now consider the induction step:
n+1 n X n n n+1
X X X X
xi = xi +xn+1 ≤ xi +|xn+1 | ≤ |xi |+|xn+1 | = |xi |.
i=1 i=1 i=1 i=1 i=1
3.7
5 We have to nd the images of the given subsets of C under the function
f (z) = z 2 . Note that if z = x + iy then z 2 = (x2 − y 2 ) + i(2xy).
(a) Real Axis: y = 0 =⇒ z 2 = x2 and so the image is the positive real
axis.
(c) Lines passing through origin: Squaring a number doubles its argu-
ment, so a line passing through origin and making an angle θ with
the positive real axis will be sent to the ray starting at origin and
making an angle 2θ with the positive real axis.
P (α) = 0 ⇐⇒ P (α) = 0
⇐⇒ a0 + a1 α + a2 α2 + · · · + an αn = 0
⇐⇒ P (α) = 0.
cos 0 + i sin 0 = 1,
√
2π 2π −1 + i 3
cos + i sin = ,
3 3 2 √
4π 4π −1 − i 3
cos + i sin = .
3 3 2
We can show the existence of the n roots with the help of the `roots of
unity' 1, w, . . . , wn−1 of Exercise 10. Let α = r(cos θ + i sin θ) with r = |α| >
0 and 0 ≤ θ < 2π . Then β = r1/n (cos θ/n + i sin θ/n) satises β n = α and
n−1
is an nth root of α. Further, all the numbers β, wβ, . . . , w β are distinct
and are nth roots of α.
238 APPENDIX B. SOLUTIONS
4.1
1 In the following symbolic expressions the quantiers range over the uni-
versal domain B of all sets:
1. ∅ ∈ NI ∩ NJ .
2. x ∈ NI ∩ NJ =⇒ S(x) ∈ NI ∩ NJ .
ThereforeNI ∩ NJ is an inductive set. Hence NI ⊆ NI ∩ NJ , and so
NI = NI ∩ NJ . Similarly, NJ = NI ∩ NJ .
7 X is an inductive set. We have to show that the following are also inductive:
x ∈ A =⇒ x ∈ X =⇒ x ⊆ X =⇒ x ∈ P(X).
Therefore A ⊆ P(X). This shows P(X) is transitive.
4.2
5 Let |A| = n.
Qn
(a) Dene g : P(Fn ) → k=1 F2 by
1 if k∈A
g(A)k = .
0 if k∈
/A
(u + v − 1)(u + v − 2)
f (u, v) = v + .
2
4. This f is a bijection.
13 (a) + +
N is in bijection with Q : Each x ∈ Q has a unique reduced form
mx /nx with mx , nx ∈ N and gcd(mx , nx ) = 1. Mapping x to (mx , nx )
+
creates a 1-1 map from Q to N × N. Composing this with the
+
bijection from N × N to N creates a 1-1 map from Q to N. There is
+
also an obvious 1-1 map from N to Q . Apply the Schröder-Bernstein
Theorem.
4.3
(a) =⇒ (c)
(c) =⇒ (a)
9 R is a relation X with dom(R) = X . For each x ∈ X let Rx =
on
{ y ∈ X | xRy }. A = { Rx | x ∈ X } is a collection of non-empty sets.
Then
By Axiom of Choice, there is a choice function c : A → X . Dene G : X → X
by G(x) = c(Rx ). Apply the Recursion Theorem to G.
4.4
7 Let X be the set of sequences of 0's and 1's that are constant after some
entry. Then X = A∪B is a disjoint union, where A consists of the sequences
that terminate in 0's and B consists of the sequences that terminate in 1's.
It suces to show that A and B are countably innite. In fact, by symmetry,
it suces to show A is countably innite.
1 1 1 1 1 − 1/2k 1 1
σ≤ + 2 + ··· + k = < = 1.
2 2 2 2 1 − 1/2 2 1 − 1/2
√
For the second example, consider A = { 1/ n | n ∈ N }. Further, let Fn =
√ √
{1, 1/ 2, . . . , 1/ n}. Then,
1 1 1 1 1 √
1 + √ + · · · + √ > √ + √ + · · · + √ = n.
2 n n n n
4.5
1 Consider the disjoint union X = N ∪ {α} where N has the usual order
while α is not related to any other element of X. Then α is maximal while
N is a chain that is not bounded above.
W Usual Ë
{0, 1} × N Dictionary Ë
N×N Dictionary Ë
(0, 1) × (0, 1) Dictionary é
More advanced treatments of Set Theory can be found in [5, 9]. The
books [8, 11] have a similar aim to this book of moving from basic logic and
sets to number systems, while the classic [7] takes the natural numbers as
given and then constructs all the number systems up to the real and complex
numbers. There is little `chat' in [7] but then no awkward details are left for
the reader to sort out.
The text [12] has a chapter on Sets, Logic and Computation which tells
the story of the attempted axiomatization of mathematics in the 19th and
20th centuries, with connections to current developments. Finally, [3] has an
engrossing depiction of the human side of logic and mathematics.
245
246 REFERENCES
Books
[1] Shobha Bagai, Amber Habib, and Geetha Venkataraman. A Bridge to Math-
ematics. New Delhi: Sage Publications India, 2017. isbn: 978-9386446121.
[2] Keith Devlin. Introduction to Mathematical Thinking. Lightning Source Inc.,
2011. isbn: 978-0615653631.
[3] A. Doxiadis and C. H. Papadimitriou. Logicomix: an epic search for truth.
Bloomsbury, 2009. isbn: 978-0747597209.
[4] Derek Goldrei. Propositional and Predicate Calculus: A Model of Argument.
London: Springer, 2005.
[10] George Polya. How To Solve It. Reprint edition. First published in 1945.
Princeton University Press, 2014. isbn: 978-0691164076.
[11] Inder K. Rana. From Numbers to Analysis. Singapore: World Scientic, 1998.
isbn: 978-9810233044.
[12] John Stillwell. Mathematics and Its History. Third edition. Undergraduate
Texts in Mathematics. New York: Springer, 2010. isbn: 978-1441960528.
[13] Daniel J. Velleman. How to Prove it. Second edition. New York: Cambridge
University Press, 2006. isbn: 978-0521675994.
[14] Franco Vivaldi. Mathematical Writing. Springer Undergraduate Mathematics
Series. London: Springer, 2014. isbn: 978-1447165262.
[16] Richard Hammack. Book of Proof. url: http : / / www . people . vcu . edu /
~rhammack/BookOfProof/.
[17] Patrick Keef and David Guichard. An Introduction to Higher Mathematics.
url: https://www.whitman.edu/mathematics/higher_math_online/.
[18] A Friendly Introduction to Math-
Christopher C. Leary and Lars Kristensen.
ematical Logic. url: https://minerva.geneseo.edu/a-friendly-introdu
ction-to-mathematical-logic.
[19] Mathematics Text-
National Council of Educational Research and Training.
book for Class XI. url: http://ncert.nic.in/textbook/textbook.htm?
kemh1=0-16.
[20] Ted Sundstrom. Mathematical Reasoning: Writing and Proof. url: https:
//scholarworks.gvsu.edu/books/9/.
[21] Proof Designer. url: https://app.cs.amherst.edu/
Daniel J. Velleman.
~djvelleman/pd/pd.html.
[22] Finite Mathematics: introduction to
Stefan Waner and Steven R. Costenoble.
logic. url: https://www.zweigmedia.com/RealWorld/logic/logicintro.
html.
[23] Douglas West.The Grammar According to West. url: https://faculty.
math.illinois.edu/~west/grammar.html.
[24] Raymond L. Wilder.Introduction to the Foundations of Mathematics. url:
https://archive.org/details/IntroductionToTheFoundationsOfMathem
atics.
Index
ℵ0 , 181 consistent, 195
1-1 correspondence, see bijection
Banach-Tarski Paradox, 179
absolute value, 130, 150, 156 bijection, 89, 91
absorption, 17, 38 binary operation, 97
addition binomial formula, 117
of complex numbers, 155 bound
of integers, 124 lower, 64
of natural numbers, 109 upper, 64
of rational numbers, 132 bound variable, 26
of real numbers, 200 bounded, 64
and, 5 above, 64
anti-symmetry, 61 below, 64
Archimedean Property, 137, 148149
associative, 17, 38, 98, 110 cancellation, 98, 110, 115, 226
axiom, 2, 3, 193 Cantor's diagonal method, 182
Axiom of Cantor's Theorem, 94, 182
Choice, 84, 86, 176, 179, 182, Cardano's formula, 160
187, 188, 191, 194, 197 cardinality, 93, 172
Empty Set, 32 cartesian product, 51, 52, 56, 86
Extension, 32 chain, 186
Global Choice, 197 choice function, 176
Innity, 166, 169 class, 196
Pairs, 32, 166, 211 proper, 196
Power Set, 36, 166 closure, 101
Regularity, 166, 169, 194 codomain, 70
Replacement, 197 commutative, 17, 38, 98, 110
Separation, 35, 52, 166, 188, Comparability of Cardinality, 189
193, 211 complement, 36, 197
Union, 32, 211 complex numbers, 43, 154
axiom schema, 35 composition, 80, 98, 99
axiomatization, 194 condition
complete, 195 necessary, 12
249
250 INDEX
S3 , 100, 103
Schröder-Bernstein Theorem, 174, well-ordered set, 190
statement, 5
biconditional, 13 zero, 111, 125, 133
compound, 5 to the power of zero, 113
conditional, 10 ZF system of axioms, 31, 38
derivable, 194 Zn , 103, 138
simple, 4 Zorn's Lemma, 187, 191