Professional Documents
Culture Documents
Thinking Mathematically Notes
Thinking Mathematically Notes
Attila Máté
Brooklyn College of the City University of New York
March 12, 2020
Contents
Contents 1
1 Paradoxes in mathematics 2
1.1 Reading . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.2 Cardinality of the power set . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.3 Cantor’s paradox . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.4 Russell’s paradox . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.5 Epimenides paradox . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.6 Berry’s paradox . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.7 Axiomatic set theory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
1.8 The axiom of replacement . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
1.9 Other axiom systems of set theory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.10 Hilbert’s program and incompleteness . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.11 The continuum hypothesis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.12 Independence of the continuum hypothesis . . . . . . . . . . . . . . . . . . . . . . . . . 7
1.13 Computers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
1.14 Reading . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
3 Geometric cardinalities 12
3.1 Reading . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
3.2 Mapping a line segment to one of different length in a one-to-one way . . . . . . . . . . 12
3.3 Mapping a finite line segment onto the whole line in a one-to-one way . . . . . . . . . . 13
3.4 More on the stereographic projection . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
3.5 Mapping a square onto a line segment in a one-to-one way . . . . . . . . . . . . . . . . 14
3.5.1 An idea that fails . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
3.5.2 An idea that succeeds . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1
4.1 Reading . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
6 Platonic solids 17
6.1 Reading . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
6.2 The five Platonic Solids . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
6.3 The duals of Platonic solids . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
References 28
1 Paradoxes in mathematics
1.1 Reading
The material in this section is to provid background to the section Travels toward the Stratosphere
of Infinities: The Power Set and the Question of an Infinite Galaxy of Infinities [1, §3.4, p. 174]. If
you have a different edition of the book, you will still find the same material, perhaps with minor
changes. What is most likely to change material at the end of the section, called Mindscapes. The
material there is of real interest, but I largle skipped the Mindscapes segment of the book, since
there is really no time in the course for deep study of them, and I would like to present a variety
of material. This material is interesting, but the Midscapes are not important for exams. It gives
more insight for those whose interest goest further than what is presented in the main text.
2
1.2 Cardinality of the power set
For a set S, the power set P(S) of the power set is defined as the set of all subsets of S:
def
P(S) = {X : X ⊂ S}.
We will show that for any set, P(S) has larger cardinality (in other words, it has more elements)
than S. More precisely, we have
Theorem 1.1. Given an arbitrary set S, there is no mapping f : S → P(S) that is onto P(S).
The theorem does not require that f be one-to-one.
Proof. Assume there f is a mapping on S into P (S). Then
X = {x ∈ S : x 6= f (x)}
is a subset of S, and yet there is no y ∈ S for which f (y) = X. Indeed, assume on the contrary that
y ∈ S is such that f (y) = X. Recall that two sets are equal if they have the same elements. On
the other hand, the definition of X implies that y is an element of one, but not the other, of X and
f (y). so we have X 6= f (y). Indeed, if y ∈ f (y), then y ∈
/ X, and if y ∈
/ f (y), then y ∈ X.
The result and the proof applies to any set, even to finite sets, and even to the empty set itself.
One would, however, barely be interested in using this proof for finite sets (except for the interesting
point that the proof works even then), since other arguments show the much stronger result that a
finite set of n elements has 2n subsets. For a set S denoting by |S| the cardinality of S, by extension
of this statement, one defines the cardinality 2|S| as the cardinality |P(S)|.
|P(V )| > |V |.
since every element of P(V ) is already an element of V . This statement, showing that Theorem 1.1,
even though proved apparently rigorously is called Cantor’s paradox; a paradox is a self-contradictory
statement.
The set theory invented by Cantor is often called naive set theory (though I prefer the name
intuitive set theory, i.e., a set theory based on intuition, rather than on a rigorous foundation).
The discovery of paradoxes in set theory was very disconcerting to mathematician, since set theory
became an important part of mathematics. One of the greatest mathematicians at the time, David
Hilbert wrote that “Aus dem Paradies, das Cantor uns geschaffen, soll uns niemand vertreiben
können” (From the paradise that Cantor created for us, no-one shall be able to chase us out).
The resolution of this paradox was given by aximatic set theory, in which large collections are
called (real) classes, and small collections are sets; we will say more about axiomatic set theory later.
Here large and small are relative term, so we will give a more precise explanation: all collections are
considered classes, but some collections will be considered sets. Classes (and sets) can only have sets
as elements. Real classes (classes that are not sets) cannot be elements of classes. This will avoid
Cantor’s paradox, but it is not at all clear whether paradoxes will re-appear in different forms. ‘
3
1.4 Russell’s paradox
One of the most famous paradoxes is due to Bertrand Russell. It starts out with saying that for a
set x, we may or may not have x ∈ x.1.1 Now, consider the set R defined as
R = {x : x ∈
/ x},
where x runs over all sets. That is, R is the set of all sets x for which x ∈ / x. Then one asks the
question whether or not R ∈ R. This question has no answer, since if the answer is yes, that is, if
R ∈ R, then the definition of R with x = R says that R ∈ / R and if the answer is no, that is, if
R∈ / R, then the definition of R with x = R says that R ∈ R. Russel was so upset with his own
paradox that he decided to write a monumental three-volume work with Alfred North Whitehead1.2
in which, as detractors of the work like to say, it took 1500 pages to prove that 1 + 1 = 2. Their aim
was to put mathematics on a solid foundation; but, as Kurt Gödel showed in 1931, the goal they
tried to accomplish is not attainable.
where the variables p, u, and v run over integers. The literal translation of this fomral statement
is the following: p is a prime (expressed formally as prime(p)) if and only if (expressed as ↔) for
all u (expressed formally as ∀u) and for all v (expressed as ∀v)1.4 ), if 1 < u < p and (expressed as
1.1 An axiom called the Axiom of Foundation, invented by John von Neumann, explicitly disallows the possibility
used in the logic sense, as in “the sun shines and it is warm,” but in the sense of enumeration as in “ham and eggs.”
4
&) 1 < v < p then (the “if . . . then” is expressed as →) p 6= uv). Note that with the placements of
parentheses in the formula one can express more clearly what is meant than is possible in a natural
language without sometimes complicated circumlocutions.
visible for the reader without looking at the source file): this website is copyrighted by Paul Cox, all rights reserved.
The use of this material for commercial purposes without permission, including the posting of advertisements on this
site, is a violation of that copyright and may result in civil prosecution. Unfortunately, the last time I found this file
on the internet was in 2012, and it has disappeared before; so I felt I need to download it, so it will not be lost. I am
posting the website here for non-commercial use, since I feel it has great educational value, and without my posting
it it would disappear from public view. See the site for more.
1.6 In set theory, by integers one means nonnegative integers. As we mentioned, 0 = ∅, 1 = {0} = {∅}, 2 = {0, 1} =
{∅, {0}}, 3 = {0, 1, 2} = {∅, {0}, {∅, {0}}}, . . .; these are called the von Neumann ordinals; ordinals, and not integers,
because the construction continues after all integers have been described this way.
1.7 In set theory, functions themselves are considered sets, namely sets of pairs. For example, the function f (x) = x2
on the set R of all real numbers is considered the set f = {(x, x2 ) : x ∈ R}.
1.8 Latin for “to infinity,” meaning that the list is infinitely long.
5
distinguished variables x and y 1.9 such that, given a set z, for each x ∈ z there is exactly one y then
the collection
{y : x ∈ z and φ(x, y)}
is a set.
In the formal framework, the axiom of replacement is not a sincle axiom; it is a group of infinitely
many axioms, called an axiom scheme, one for each formula φ(x, y). The axiom scheme of replace-
ment greatly increases the power of Zermelo’s set theory. Zermelo’s set theory with the axiom of
choice and with the axiom of choice and with the axiom scheme of replacement is usually called ZFC
(Z for Zermelo, F for Fraenkel, and C for Choice); it is the most frequently used axiom system for
set theory.
6
there is a a one-to-one mapping g : B → A (in which case, |B| ≥ |A|). Without the axiom of choice, one can
prove that if there is a one-to-one mapping f : A → B and also there is a one-to-one mapping g : B → A,
then the is a one-to-one function h : A → B that is onto B. (That is, if |B| ≤ |A| and |A| ≤ |B| then
|A| = |B|.
In 1938, Gödel proved that if there are no contradictions in ZF (Zermelo–Fraenkel set theory
without the axiom of choice) then there are no contradictions in ZFC (Zermelo–Fraenkel set theory
with the axiom choice) even with the continuum hypohesis added. This is what we meant by saying
that the axiom of choice is a harmless assumption.
1.13 Computers
In 1928, David Hilbert and Wilhelm Ackermann formulated the Entsheidungsproblem1.12 in that
they asked whether one can design a systematic method of calculation (algorithm) that can decide
whether a mathematical statement is true.1.13 In 1936, Alonzo Church and Alan Turing solved
negatively, by showing that there is no such method. They gave radically different solutions: Church
created λ-calculus, on which the LISP programming language is based. LISP played a very important
role in the development of artificial intelligence.
Turing created a theoretical machine called the Turing machine that was very important in the
development of computers. Turing showed that it is possible to create a universal machine that can
do any calculation doable on other Turing machine. This led to the idea that it is in fact possible
to build an actual universal machine machine that can do all calculations that can be done at all
(except for resource constraints, such as time and memory; Turing’s theoretical machine did not
have such constraints).
Turing designed a theoretical machine called Turing machine, which played a great role in the
development of actual computers,
Building on Turing’s ideas, John von Neumann worked out the design principles of the a stored
program computer, and such a computer was built under his direction from 1945 to 1951 at the
Institute for Advanced Study in Princeton, New Jersey. The first electronic compter, the ENIAC
(Electronic Numerical Integrator and Computer) was built built earlier at the University of Penn-
sylvania; it was completed in late 1945. The ENIAC was not a stored program computer; it was
1.11 NBG is a conservative extension of ZFC. That is, any statement that can be formulated in the language of ZFC
and provable in NBG is also provable in ZFC. Note that the language of NBG is richer in that it can formulate
statements that cannot be formulated in the language of ZFC.
1.12 German for decision problem; the German word is used even in the English language literature.
1.13 The statement they used is a formal mathematical language similar to the one we used to define prime num-
bers above. Instead of true, they asked if the statement is universally valid, that is, whether it is true under any
interpretation of the symbols in it – we want to avoid a rigorous explanation for the sake of simplicity.
7
programmed with plugboards, and it took about eight hours to set up a plugboard for a new com-
putation. In 1948, under John von Neumann’s guidance, modificaions were made to the ENIAC
to make it function as a stored program computer. The design of nearly all computers today are
based on von Neumann’s ideas, and von Neumann architecture refers to the conceptual model of
this design.
1.14 Reading
[1, §3.4, pp. 174–185]
people cannot realy make sense of results in binary without laborious calculations.
2.2 The theorem says, informally, that there are results in any axiomatic theory of the nonnegative integers that are
meaning unpublished writings left behind after the author’s death. Perhaps there is a good English word the replace
it, but in Gödel’s case it is common to use the word Nachlass. Gödel published very little in his life time, but the
material in the Nachlass is very rich. A significant portion of it was written in Gabelsberger shorthand, and at the
time Gödel’s writing were prepared for publication by John W. Dawson Jr., it was not easy to find an alder Austrian
or German person who was able to read this now obsolete shorthand.
8
Gödel was an Austrian, but in 1938 Austria was absorbed by Germany in the Anschluss. It was impossible
to to travel to America at the time via the Atlantic because of the war. Gödel traveled through the Soviet
Union via the Trans-SiberianRailway, a trip made possible by the Molotov-Ribbentrop Pact.
Why mathematics is so useful in the sciences is not quite understood by the scientists themselves.
The Nobel laureate physicist Eugene Wigner wrote about the his puzzlement in the article The Un-
reasonable Effectiveness of Mathematics in the Natural Sciences. This is quite interesting read in
original, as is von Neumann’s First Draft. Wigner made one of his first contribution to physics by
using group theory to explain certain mulptiple spectral line of the hydrogen atom (which spectral
lines split into several lines in electromagnetic fields). Wigner learned about the subject of group rep-
resentation (a field of mathematics required for the explanation) from John von Neumann. Wigner
was a professor at Princeton University. Princeton University was also trying to hire the famous
physicist Erwin Schrödinger, but they did not succeed in view of Schrödinger’s unusual lifestyle. He
lived with his wife and his mistress in the same household.
At about Christmas, after hearing a lecture about the theory of Louis de Broglie in which he
reversed Einstein’s theory that light is not only waves but also particles to explain the photoelectic
effect, at ETH,2.4 Peter Debye commented, “this is not the way to do physics; if you want to talk
about waves, you need to write a wave equation.” So Schrödinger took de Broglie’s dissertation
together with his girlfriend (he was married at the time) to a Swiss chalet around Christmas time
in 1925 and worked out his famous equation that changed physics.
The equations of quantum mechanics worked out by in the summer of 1925 by Werner Heisenberg,
Max Born,2.5 and Pascual Jordan. Heisenberg re-invented matrix multiplication, a mathematical
tool that had been around for about 50 years at his time, but Heisenberg did not know about it
(Max Born did). The theory was based on matrix calculations that physicists found difficult to work
with. Wolfgang Pauli worked out a description of the spectrum of the hydrogen atom using matrix
mechanic, as the theory was called at the time. “Heisenberg was not smart enough to do it” in the
description of Steven Weinberg.2.6 With Schrödinger’s equation, the calculation of the spectrum
became quite easy, relatively speaking (so, it is not really easy, but physicists were familiar with how
to work with partial differential equations).
The wave equation is a mathematical tool to discuss waves; Isaac Newton used the wave equation
to mathematically calculate the speed of sound, relying on how air can be compressed. Later, James
Clerk Maxwell used the wave equation to show that there must be electromagnetic waves, and
calculated their speed. As the speed agreed with the experimentally measured speed of light, it
was accepted that light itself was an electromagnetic wave. Electromagnetic waves are what make
your cell phones work, and without quantum mechanics, the small electronic circuits inside your
cellphone could not be made.
The equations of electromagnetic field were found experimentally before Maxwell, except that
they were inconsistent with the conservation of energy. Maxwell added the missing term, and then
the equations satisfied the requirements for a wave equation. The missing term was difficult to
find experimentally. The equations, with the term added, became to be called Maxwell’s equations.
Maxwell also invented color photography in his life of 49 years.
Maxwell’s equations were inconsitent with the equations of Newtonian physics. Einstein decided
in favor of Maxwell, hence the theory of special relativity. However, special relativity was incon-
sistent with Newton’s theory of gravitation, since in Newton’s theory the gravitational force spread
2.4 Eidgenössische Technische Hochschule in Zürich, Switzerland
2.5 The grandfather of the Australian singer Olivia Newton-John.
2.6 In a television lecture on Book TV. Weinberg must have not had great like for Heisenberg, since Heisenberg
worked on the German atomic bomb poject; Heisenberg probably did not put in an all-out effort, since it is doubtful
that he believed that the bomb could be made with the resources Germany had at her disposal.
9
with infinite speed, something that is not allowed in special relativity; this was quite a serious incon-
sistency, and a geometry of curved spaces was needed to resolve it. Such a geometry was invented by
Bernhard Riemann.2.7 Einstein learned about differential geometry from Marcel Grossmann, who
gave Einstein quite a bit of help. Finally, David Hilbert invited Einstein to Göttingen to give a
series of lectures, after which they both worked out the equations of General relativity, apparently
independently. Hilbert certainly did not receive any help from Einstein after Einstein’s lectures;
there are some doubts about whether Einstein used Hilbert’s ideas without acknowledgement, and
Hilbert did not assert his priority later.2.8
To this day, nobody knows how to make quantum mechanics and general relavivity be com-
patible. This may not be a problem with an acute need for a solution, since quantum mechanics
is mostly concerned with small things (not quite, since it helps in explaining what happens inside
stars), and general relativity is concerned with very large distances. But one would like to what
really understand what is going on. Meanwhile, nobody really understands what is going on. The
mathematics is quite will understood, and one can draw practically useful conclusions, but what is
actually going on is a mystery. Quantum theory splits into two parts; the mathematics, no issue
with that, but the interpretation? There are several of them, but none of them make sense (that is
why you can make money by writing popular expositions of quantum mechanics), but the nonsense
explanations are good enough to make decisions in practical situations (such as designing integrated
circuits or designing drugs by computational chemistry).
For nearly fifty years it was thought that Einstein’s general relativity is of no practical use. To be
sure, it explains the anomaly of the motions of the planet Mercury (measurements showed for more
than perhaps 150 years in a way that Mercury did not quite move according to the requirements
of Newton’s equations), but so what. However, the equations also showed that gravitation slowed
down time. The Global Positioning System that shows your location on your cellphone (so that
drivers no longer have to ask for directions, their cellphone directly shows their position on the map)
relies on accurate measurements of the time it takes for the radio signal to travel between a number
of satellites and your cell phone. In order to keep the clocks on the satellites in sync with the clocks
on the ground, one needs to take Einstein’s equations into account, because the clocks tick faster
on the satellites (the gravitation of the earth becomes weaker higher up), without this the satellites
would become quite useless in about a day. I assume the public is not quite aware of this.2.9
Schrödinger initially wanted to include special relativity in his equation, but he could not do this
in a way that agreed experimental results, so he gave up on this, and his equation, while give quite a
good agreement with experimens, are not compatible with special relativity. Paul Dirac, also known
as PAM after his full name Paul Adrien Maurice Dirac, succeeded in this a few later, mathematically
explaining the electron spin, first suggested by Wolfgang Pauli’s exclusion principle. Wigner referred
to Dirac as “my famous brother in law” (Dirac was married to Wigner’s sister. Wigner later used
the requirement of compatibility between special relativity and quantum mechanic to describe what
kind of elementary particles were possible. Many of these particles were discovered lates. See
Wigner’s classification, though the description at the target of this link is quite technical, and I am
not suggesting more than a cursory reading. Some other links above are also technical.
The following story was told to me by Atle Selberg, the discoverer, together with Paul Erdős of the first
2.7 Riemann, when asked what his geometry was good for, he answered physics. This was about 50 years before general
relativity, though, to be fair, Riemannian geometry has also applications in physics other than general relativity.
2.8 Hilbert was quite famous, and did not need any boost to his fame. Among his students were Emanuel Lasker,
who was chess world champion for 27 years (the longest) in history, quite a respectable mathematician in his own
right.
2.9 As the joke goes, who needs weather satellites when you have the weather channel on television.
10
elementary proof of he Prime Number Theorem.2.10 Dirac was famous for hardly ever engaging in small
talk. When Dirac visited the Institute for Advanced Study, they took a walk in the woods at the Institute,
the two wives in front, and Dirac and Selberg following behind. The two wives were talking animatedly
all the way, Dirac, totally silent. When they finished their walk, Dirac said, “I wonder how they do it.” I
walked that walkway many times myself. On one of my walks, I was attacked by a rabid fox. You are quite
helpless in such a situation. I noticed the fox, and I realized the danger, so I picked up a big stick. It was
quite useless, I was bitten on both arms. Finally, I picked up the fox by its hind legs, and flung it as far away
as I coud, and ran back to the building. With both arms bleeding in manyy places, I immediately drove to
the emergency room, an employee of the Institute following in another car, just in case. If this ever happens
to you, you need to read further: The mistake I made, as I was told at the emergency room, that I should
have washed out the wounds with warm water before driving to the emergency room. Doing so improves
your chances of survival, which may not be quite 100%. I think I am the only person in history to whom
this happened on that walkway.
Meanwhile, another famous mathematician, looking back on his life, wondered in the book [2]
whether is life was spent on something useful, and expressed the hope that his mathematics would
never be used for war (it might be). Hardy tried to separate mathematics into mathematics of
practical use and pure mathematics. While the book is still an interesting read, Hardy was quite
wrong in his conclusions. With computers, all bets are off.
Function. Until the proof by Erdős, and Selberg, nobody though that such a proof was possible.
11
kernel has been written in C, and what has been proved is that the C implementation does exactly
the same thing as the one written in Haskell.
Computers of course do not understand the language C. So, before they can do anything they have to have
something to translate the C program into their own machine language. There is a small program, perhaps
a thousand lines long, that translate a small part of the C compiler into the computer’s own language. This
C compiler can translate the rest of the C compiler from C to machine language, and when the compiler has
been translated, the rest of the operating system into machine language. Thus the expression “pull oneself
up by one’s bootstraps” was applied to computers, as in “one bootsraps a computer.” Today, one uses the
expression to “boot a computer.”an expression quite common today.
Thus, to port (implement on another computer) the operating system one only has to rewrite the
thousand or so lines on a different computer, rather than rewriting the whole operating system.
3 Geometric cardinalities
3.1 Reading
The material in this section is to highlight the material in the section Steightening up the Circle:
Exploring the Infinite Within Geometrical Objects [1, §3.5, p. 190]. If you have a different edition of
the book, you will still find the same material, perhaps with minor changes. What is most likely to
change material at the end of the section, called Mindscapes. The material there is of real interest,
but I largle skipped the Mindscapes segment of the book, since there is really no time in the course
for deep study of them, and I would like to present a variety of material. Of course, you can spend
time on reading them, some of you might even enjoy them, but I definitely do not want to emphasize
the material there for exams. So you have the best of both worlds. You can learn a lot, you may
enjoy doing so, but the exams will not be terribly demanding. You still have to work though, since
nothing is free in life. The textbook is excellently readable, at least the main part, the Mindscape
segments may touch on more complicated matters.
P ......................
..... ........
..... .......
..... .......
..... .......
0 1
..... .......
. ...
.......................................................................
w .....
.....
.....
.....
.......
.......
.......
.......
..... .......
..... .......
..... .......
..... .......
..... .......
..... .......
..... .......
..... .......
..... .......
..... .......
..... .......
... .......
..
0 1 h 2 3
12
3.3 Mapping a finite line segment onto the whole line in a one-to-one way
Take the finite line segment (with the endpoints not included), and roll it up into a circle; the circle
will be missing its “North Pole” (its highest point). Then use the stereographic projection‘ to map
this circle onto the line. The stereographic projection maps a point of the circle to the line on which
the circle is sitting by connecting the point with the North Pole of the circle; the this line will hit
the horizontal line at the image of the point on the circle. In the picture on the right, the point u on
the circle is mapped to the point t on the horizontal line. The stereographic projection maps every
point of the circle (except the North Pole) onto the horizontal line. See Figure 3.2.
Navigation on the seas in the sixteenth century demanded a different kind of projection for map
making. The Mercator projection produces maps that shows as a straight line the path that a
navigator follows when sailing in a constant direction given by the compass. This was a great help
in planning shipping routes by drawing straight lines on a map.
.
.................. ...................
...... ..... .....
..... ...... .....
..... ...... ...
... .........
u ..
.
.
..........
.....
...
...
...
..
........
. ...
..
..... ...
. ...
..
.....
. ...
. ..
..
.....
. ...
...
.
..
.....
. ...
. ...
... ... ...
..... .....
.....
..... ......
..... ......
..... .......
..................................
.....
t 0
Figure 3.2: The stereographic projection: mapping a line segment onto a cirle
3.1 Half of a sphere; hemi is Greek, as is sphere. One does not use the Latin semi, which also means half, so as not
13
3.5 Mapping a square onto a line segment in a one-to-one way
3.5.1 An idea that fails
Consider the unit square, represented by pairs of numbers between 0 and 1, and the unit line segment,
represented by single numbers between 0 and 1. One need to ensure that each number only has one
representation; for example, never represent a number by digits that are all 0 from some point on.3.2
Then there is no pair of numbers (x, y) that corresponds to the number
t = 0.420909090909090909 . . .
(the number is continued by repeating the groups 09 of digits). Indeed, in order to get t, we would
have to have
x = 0.400000 . . . and y = 0.299999 . . . ,
but x = 0.400000 . . . is not allowed, since a number is not allowed to end in all zeros.3.3
should be written with its alternate representation as 0.72999 . . . (with infinitely many 9s).
3.5 Since we represent a number in such a way that no number ends in all 0s, it is guaranteed that each group of
digits is finite.
14
In [1, §p. 211], a way of rearranging pieces of a puzzle are shown are shown that amounts to a
proof of the Pythagorean theorem. However, in addition to rearranging the pieces, some rigorous
consideration is needed to show that the pieces, when rearranged, properly fit together. This is what
is shown in the calculation next.
Consider the two figures. Figure 4.1 geatures the triangle ABC with sides a = BC, b = AC, and
c = AB. The two squares, ADHE and F HIJ cover an area of size a2 + b2 . This area is made up
by four triangles of congruent to the triangle ABC and the square ECGF of side EC = a − b. The
four triangles and this latter square are rearranged to form the square KLM N in Figure 4.2 of side
KL = c. (Note that the small square in the middle, P RQO has side P O − P L = a − b, that is, it is
congruent to the square ECGF in Figure 4.1.) The area of the square KLM N is the same as the
area in Figure 4.1 covered by the two squares ADHE and F HIJ. That is, c2 = a2 + b2 .
b
J ................................................................... I
... .... ...
...
... .... ...
... ... ...
... .... ...
... ... ...
... ...
F ... ...
... ... ...
E H
.
. .
...............................................................................................................................................................
... . ...
... .
.
. .
... ....
... .
. ... ...
... .... ... ...
...
... ... ... ...
... ... ... ...
... ... ...
... . ... ...
. ... ...
... ..
. ... ...
... ..
. .
... ...
... .
. ...
... .
. ...
... .
. .
... ...
... .... ... ....
... ... ... ...
... ... ... ...
... ... ... ...
... ... ... ...
... ... ..
...
G
... . ......
... . ......
C B
.
.
........................................................................................................................................................................
.
... ....... .. . ...
... .. ... ... . ...
..
... .......... ...
.... .
....... ........... ....
... ................ ...
......
c
... ....
..... ...
... ................ ...
... ................ ...
....
... .................... ..
A ............................................................................................................................................................
a D
N .............................................................................................................................................................................................................. M
. .......... . ..
.... ...........
. ... ...
... ... .................... ... ..
..
... .
...
.. Q ..........
..........
............ ..
.. ..
.. .....
.
... ... ..........
.......... ... ....
... ... .......... ..
.......... ....
...
... ... ... ...
R
... ... . ...
... ..
. ..
. ...
... .. .. ...
. . ...
... ... ... ...
... ..
. ... ...
... ... ..
. ...
... ..
. ..
. ...
... ... ... ...
... ..
. ... ...
... ... ... ...
... .. .. ...
O
. . ...
... ... ..
. ...
... .......... ... ...
... .. ................
. ... ...
... .... .......... .
.. ...
........... .
... ... .......... .
. ...
P
... ... .......... ..
. ...
............
... ... .......... ... ...
... ... .......... .. ...
. .
......... ...
... ... ..........
..........
...... .......... ....
K ............................................................................................................................................................................
L
15
5 How many cameras need to watch an art gallery?
5.1 Reading
In [1, §4.2, p. 218], in the section entitled A View of an Art Gallery: Using Computational Geometry
to Place Security Cameras in Museums, a fairly recent mathematical problem is discussed.
L...............
. ...
... ...
... ... ...
... .....
.... ...
... ... ...
..
.
.. .. ..... ....
. ... ... ...
...
...
...
...
F . . ..
... ...
..
...
...
...
. . . .
S .................. . ...
................ ...
................ ...
................
................ ....
..............
L................................
.. . .......
... . .......
...
... .....
.
.... .. .....
.... . ........
. ... . . .....
.. ...
... . ...
... . ...
...
S
. ...
...... ...
F
..... . ..
..... . . ......
..... . .
...
...
...
... .
..... .....................
...
16
vertices are in common with the triangle next to it, and that triangle has already been colored), we
can choose the third color to paint the third vertex.
Once the vertices of the art gallery have been colored, if we place a camera at vertices of one
given color (say red), then these cameras will survey the whole art gallery. This is because there
will be a camera in each triangle, and from any vertex of a triangle, the whole triangle can be seen.
The only question that remains is, which at color should the the cameras be places. The answer
is: at the color which paints the smallest number of vertices. This color will paint at most n/3
vertices; otherwise, each color would paint more than n/3 vertices, but then the number of vertices
would be more than
n n n
+ + = n.
3 3 3
6 Platonic solids
6.1 Reading
The subject discussed here expands on [1, §4.5, p. 270] The Platonic Solids Turn Amorous: Dixcov-
ering the Symmetry and Interconnections Among the Platonic Solids.
17
World throug Graphs, and [1, §6.1, p. 386] Circuit Training: From Königsberg Bridge Puzzle to
Graphs, but the main material is in [1, §6.2, p. 401] Feeling Edgy: Exploring Relationships Among
Vertices, Edges, and Faces.
18
..... ..
..... .....
..... .....
..... .....
....
A B
..... ..
.
..... .
..... .....
....
....................................
..... .....
...
...... .....
.....
..... .....
..
....... .....
... ..
...............
................ .... .................
... C D
.............
... ...... ..................
. .......
.......................................
If in the graph in Figure 7.2, one removes the vertex D, the number of vertices V decreases by
one: Vnew = V − 1; with the vertex D, one also have to remove the edge CD, since, with the vertex
D removed, one end of the edge CD would not be attached to a vertex; so the number of edges
also decreases by one: Enew = E − 1. On the other hand, the number of faces does not change, so
Fnew = F . Hence
That is, the quantity V + F − E does not change by the removal of a vertex as described.
H
.........................................................................
............................
... ... ......
... ....... ......
..
.. ... . ............
....................................................................
If in the graph in Figure 7.3, one removes the edge GH, the number of vertices does not change,
that is Vnew = V . The number of faces decreases by one, that is Fnew = F − 1. The number of edges
decreases by one, that is Enew = E − 1. Hence,
That is, the quantity V + F − E does not change by the removal of an edge as described.
Thus, starting with any planar graph, one can build down this graph step by step, by removing
an edges and vertices (one at a time), without changing in the quantity V + F − E. In a finite
19
number of steps, one arrives at the graph containing one vertex and no edges. Since, as we saw
above, in this graph we have V + F − E = 2, we must have had V + F − E = 2 in the original graph.
Thus, Euler’s formula is established.7.3
V + F − E = 2,
where V is the number of vertices, F is the number of faces, and E is the number of edges.
For example, for the icosahedron F = 20 and V = 12.8.2 Thus,
E = V + F − 2 = 30.
Another way is to observe that if each face has s sides, then the number of edges is
F ·s
E= .
2
This is because each face is next to s edges, and so F faces give rise to F · s edges. However, each
edge is shared between two faces, so this number needs to be divided by 2. Thus, for the octahedron,
F = 8 and s = 3; so
8·3
E= = 12.
2
7.3 If one removes the last vertex, one ends up with the empty graph (a vertex with no edges and vertices). One can
reasonably say that V = 0, E = 0, but F = 1 in this case, since the whole plane can be considered a single region.
In this case, Euler’s formula is not valid. If one wants to be overly precise, one should say that Euler’s formula holds
form nonempty connected planar graphs.
8.1 V , F , and E for this planar graph is clearly the same as V , F , and E for the solid we started with.
8.2 It may be too much to ask to remember how many vertices the icosahedron has. However, since the icosahedron
is the dual of the dodecahedron, the number of vertices of the icosahedron is the same as the number of faces of the
dodecahedron; that is, V = 12 for the icosahedron.
20
The third way is to note that if at each vertex c edges come together then
V ·c
E= .
2
This is because each vertex sits on c edges. Hence the V vertices give rise to V · c edges. However,
each edge is shared between two vertices (its two endpoints), so this number needs to be divided by
2. Thus, for the dodecahedron, V = 20 and c = 3,8.3 and so
20 · 3
E= = 30.
2
V + F − E = 2,
we obtain
2E 2E
+ − E = 2,
c s
that is,
2 2
Å ã
E· + − 1 = 2.
c s
The first factor on the left (that is, E) is positive. For the product to be a positive number (namely,
the number 2 on the right-hand side), the second factor must also be positive. That is, we must
have
2 2
+ > 1.
c s
This inequality can hold only under very narrow circumstances. Namely, we must have c ≥ 3
(we cannot have only two edges coming into a vertex)8.5 and s ≥ 3 (there is no polygon with only
two sides).
As c ≥ 3, we have 2/3 ≥ 2/c, so the above inequality gives
2 2 2 2
+ ≥ + > 1,
3 s c s
8.3 It may be too much to ask to remember how many vertices the dodecahedron has. However, the dodecahedron,
being the dual of the icosahedron, the number of its vertices is the same as the number of faces of the icosahedron;
that is, 20. Similarly, the number of edges coming together at a vertex of the dodecahedron is the same as the number
of sides of each face of the icosahedron; that is, 3.
8.4 To count the number of edges, each face gives rise to s edges adjacent to it, so F faces give rise to F s edges. Since
each edge is shared between the two faces adjacent to it, so this number needs to be divided by 2; that is, E = F s/2.
Another way to count the number of edges is to note that if each vertex gives rise to the c edges coming into it, then
V vertices give rise to V c edges. Since each edge is shared between two vertices (its endpoints), this number needs to
be divided by 2; that is, E = V s/2.
8.5 It should be clear that the number of edges coming into a vertex is the same as the number of faces adjacent to
a vertex. A vertex sits at the tip of a pyramid (you can obtain this pyramid by slicing off a small part of the solid
containing the vertex), and a pyramid cannot have only two sides.
21
i.e.,
2 2
+ > 1.
3 s
This gives 2/s > 1 − 2/3 = 1/3, i.e., 6 > s. Hence the only possible values for s are 3, 4, and 5.
Similarly, As s ≥ 3, we have 2/3 ≥ 2/s, and a calculation along the same lines shows that the
only possible values for c are 3, 4, and 5.
Hence the only possibilities are s = 3 and c = 3, which gives the tetrahedron;8.6 s = 3 and c = 4,
which gives the octahedron; s = 3 and c = 5, which gives the icosahedron; s = 4 and c = 3, which
gives the cube; s = 4 and c = 4, but it fact this is not possible, since 2/c + 2/s = 2/4 + 2/4 = 1 in
this case, whereas the quantity on the left must be greater than one according to our key inequality
above. s = 4 and c = 5 is impossible a fortiori;8.7 s = 5 and c = 3 gives the dodecahedron. The
remaining cases, s = 5 and c = 4, and s = 5 and c = 5 are impossible, since 2/c + 2/s ≤ 1 in these
cases.
.
.........
....... ....
.......
................
.
...
...
.. .
....... ... ..
3
...
............ ..... .....
. ...
..... . ...
int
.
. .
. .. ... .. ...
..... .
2
....
... ... ...
.. .
.......... ....
. ...
. . ... ...... .... ....
.
..... ... .
. ...... ... ...
1 4
....
.. ... .. ....... .. .
...
. ... ..
..
...... . .
.
.......
.. ... ......... . ... ......
..................................................................................................................................
9.3 A supertile
Consider the figures below: The picture on the right shows a five pinwheel triangles (the tiles),
forming a larger pinwheel triangle (the supertile). The tiles are labeled 1, 2, 3, 4, and int; the
last label refers to the interior triangle, and it plays a special role in the considerations involving
the pinwheel tiling. It is useful to remember these labels when analyzing the pinwheel tiling (see
Figure 9.1).
8.6 Once s and c are given, the last displayed equation above determines E, and then the other equations given earlier
determine F and V . So, given s and c, all the other quantities are determined.
8.7 Even more so (because, in this case 2/c + 2/s is even smaller, that is 2/c + 2/s < 1).
9.1 A leg is a side adjacent to the right angle.
22
9.4 An interior triangle cannot be any other tile
Consider Figures 9.1–9.5. Figure 9.1 shows the tiling of a supertile, with the interior triangle labeled
as int. In Figure 9.2, superimposed on this, drawn with thin lines, is a supertile that uses this interior
triangle as triangle 1. As can be seen, some tiles drawn with thin lines cut some tiles drawn with
thick lines, showing that the addition of the tiling drawn with thin lines is not allowed. Figure 9.3
shows what happens if the interior triangle is used as triangle 2 to build a new supertile. Again,
tiles are cut. Figure 9.4 shows what happens if the interior triangle is used as triangle 3 to build a
new supertile. Again, tiles are cut. Figure 9.5 shows what happens if the interior triangle is used as
triangle 4 to build a new supertile. Again, tiles are cut.
..........
....... ........
....... ... ...
..
........... ... ...
...............................................................................................
..
....... . ..
....................................................... ....
......... .. ........ .. ..
. ..
..
.
..
.
...
...................... .... ...
...
..... ... ...........
.......... ....... int . . ....
.....
. ...
.. .......
....... ... ....... . ...
........................................ ..
....... ....... ....
..
.. .
.
. ...
..... . . ... . .
..........................................................................................................................................
......... . .
......... ....
...
.............
...... ..............
... .. .......
.. .... .......
...................................................................................................
. .......
... ... .......
......... . ....... . .
..
.... ..........................................................................
.. ...... ... ............
........... ........ int .. .. .......
. .
....
... ....
... ... .......
.......
.... ..............
...
...................................... .
.... .......
......
.. . ..... .
..........................................................................................................................................
......... .. ..
......... ....
..
.....................................................................................................................
......... .. ........................ ....... ..
.................................... int ...... .
........................................................................................................... ..
... .......
............................... ..
... .......
... .....
...
...
... .
... ....
... ... .... ...
. . ..
... .. .. ...
... ... ... ...
... .. ..
........ ..
...............................
... ...
...
... ...
... ...
...
... ...
.
... ...
... ...
... ..
... ..
......
.....
A part of the tiling is shown in Figure 9.6. A larger part, with smaller tiles, is shown in Figure 9.7.
23
..............................................................................................................................
......... . ........................ ...... ...... .
.................................... int ....... ........
...
....
... ....
.........
.....
...
..
...
..
.....
...
..
.....
.....
...
..
...
..
...
.......
... ....... ...
............ ..............
...
....... .. ... ...
...
....... .... .
.....
...
...
. .
...
...
...
......
...
...
...
.......
....... . . .
.................................................................
....................................................................................................
.. ... ..
.....
. . .
. . .
....... ..
24
Figure 9.6: A part of the tiling
Theorem 10.1. Let x be a real number and n a positive integer. Then there are integers k and l
25
Figure 9.7: A larger part of the tiling
26
Noting that l 6= 0, the inequality here can also be written as
x − k ≤ 1
.
l l(n + 1)
In other words, given any positive integer n, a real number x can always be approximated by a
common fraction whose denominator l is ≤ n such that the error of the approximation is less than
or equal to
1
l(n + 1)
Since 1 ≤ l ≤ n, this error is less than 1/l2 .
Proof. For a real number y, denote by [y] the integer part of y. That is, [y] is the largest integer
def
m ≤ y. Further, denote by {y} the fractional part of y; that is, {y} = y − [y]. 10.1 Clearly,
0 ≤ {y} < 1.
Assume that no k and l satisfying the assertion of Theorem 10.1 exist. Then for any integer l
with 1 ≤ l ≤ n we must have
1 n
(10.2) < {lx} < .
n+1 n+1
Indeed, writing s = [lx], we have {lx} = lx − s. Thus, if the first inequality fails then, noting that
lx ≥ s, we have
1
0 ≤ lx − s ≤ ,
n+1
the inequality (10.1) claimed in Theorem 10.1 is satisfied with k = s. If the second inequality fails
then, noting that lx − s = {lx} < 1, we have
n
≤ lx − s < 1.
n+1
10.2
Subtracting 1 from all the members of the inequality, we obtain
1
− ≤ lx − (s + 1) < 0.
n+1
In this case, inequality (10.1) claimed in Theorem 10.1 is satisfied with k = s + 1.
Given that (10.2) is satisfied, each of the n numbers {1x}, {2x}, {3x}, . . ., {nx} belongs to at
least 10.3 one of the n − 1 intervals
1 2 2 3 3 4 n−2 n−1 n−1 n
ï ò ï ò ï ò ï ò ï ò
, , , , , , ... , , , .
n+1 n+1 n+1 n+1 n+1 n+1 n+1 n+1 n+1 n+1
10.1 The def
symbol = describes an equation where the left-hand side is defined by means of the expression on the
right-hand side.
10.2 The members of the inequality are the expressions on the left-hand side, the middle, and on the right-hand side,
1 2 3 n−2
, , , ..., ,
n+1 n+1 n+1 n+1
then {px} belongs to each of the two intervals whose one endpoint is {px}.
27
Since there are n numbers and n − 1 intervals here, there must be (at least) one among these
intervals to which (at least) two of these numbers belong. That is, there are integers p and q with
1 ≤ p < q ≤ n such that {px} and {qx} belong to the same one among these intervals. 10.4 Since
the length of each of these intervals is 1/(n + 1), we then must have
1
{qx} − {px}| ≤ ;
n+1
Writing r = [px] and s = [qx], we have {px} = px − r and {qx} = qx − s; hence the above inequality
becomes
1
|(q − p)x − (r − s)| ≤ ;
n+1
here p, q, r, s are integers. Writing l = q − p and k = r − s, this inequality becomes identical
to (10.1), the inequality we wanted to show. As 0 ≤ p < q ≤ n, the inequality 1 ≤ l ≤ n also
follows.
References
[1] Edward B. Burger and Michael Starbird. The Heart of Mathemaics: An invitation to effective
Thinking. Wiley, Hoboken, NJ, fourth edition, 2012.
10.4 Saying that 1 ≤ p < q ≤ n amount to saying that there are two distinct integers p and q with 1 ≤ p ≤ n and
1 ≤ q ≤ n such that {px} and {qx} belong to the same interval, and the notation is so chosen that the smaller integer
is denoted by p and the larger one by q.
28