Professional Documents
Culture Documents
Guerino Mazzola
Maria Mannone
Yan Pang
Cool Math
for Hot Music
A First Introduction to Mathematics
for Music Theorists
Computational Music Science
Series Editors
Guerino Mazzola
Moreno Andreatta
Yan Pang
School of Music
University of Minnesota
Minneapolis, Minnesota, USA
(Novalis)
Preface
Fig. -1.1. Maria Mannone, Guerino Mazzola, and Yan Pang. Photo and © 2015 by
A.J. Wattamaniuk.
The idea for this book came from Yan Pang, a PhD student taking the
course “Mathematics for Music Theorists” at the School of Music of the Uni-
versity of Minnesota. She was not in love with mathematics at all—bad ex-
periences, bad teachers, the usual story. Fortunately, Maria Mannone, another
PhD student taking that course who had studied theoretical physics, helped
Yan get acquainted with mathematical rigor and beauty. Soon, Guerino Maz-
zola, the teacher, learned how to teach math using thorough musical motivation
vii
viii Preface
xi
xii Contents
5 Universal Properties . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61
5.1 Final and Initial Sets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61
5.2 The Cartesian Product . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62
5.3 The Coproduct . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63
5.4 Exponentials . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
5.5 Subobject Classifier . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
5.6 Cartesian Product of a Family of Sets . . . . . . . . . . . . . . . . . . . . . . . 65
6 Natural Numbers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71
6.1 Ordinal Numbers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72
6.2 Natural Numbers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73
6.3 Finite Sets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75
7 Recursion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77
8 Natural Arithmetic . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83
10 Integers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89
10.1 Arithmetic of Integers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 90
11 Rationals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93
11.1 Arithmetic of Rationals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96
12 Real Numbers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99
16 Nerves . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 129
16.1 A Nervous Sonata Construction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 133
16.1.1 Infinity of Nervous Interpretations . . . . . . . . . . . . . . . . . . . . 136
16.1.2 Nerves and Musical Complexity . . . . . . . . . . . . . . . . . . . . . . 137
17 Monoids . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 143
18 Groups . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 147
25 Primes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 213
26 Matrices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 217
26.1 Generalities on Matrices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 218
26.2 Determinants . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 222
26.3 Linear Equations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 223
27 Modules . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 225
27.1 Affine Homomorphisms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 230
27.2 Free Modules and Vector Spaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . 232
27.3 Sonification and Visualization in Modules . . . . . . . . . . . . . . . . . . . 234
27.3.1 Creative Ideas from Math:
A Mapping Between Images and Sounds . . . . . . . . . . . . . . . 235
29 Categories . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 249
29.1 The Yoneda Philosophy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 253
30 Continuity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 257
30.1 Generators for Topologies . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 260
30.2 Euler’s Substitution Theory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 262
31 Differentiability . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 263
32 Performance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 267
32.1 Mathematical and Musical Precision . . . . . . . . . . . . . . . . . . . . . . . . 268
32.2 Musical Notation for Performance . . . . . . . . . . . . . . . . . . . . . . . . . . 268
32.3 Structure Theory of Performance . . . . . . . . . . . . . . . . . . . . . . . . . . . 270
32.4 Expressive Performance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 272
33 Gestures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 273
33.1 Western Notation and Gestures . . . . . . . . . . . . . . . . . . . . . . . . . . . . 273
33.2 Chinese Gestural Music Notation . . . . . . . . . . . . . . . . . . . . . . . . . . . 275
33.3 Some Remarks on Gestural Performance . . . . . . . . . . . . . . . . . . . . 276
33.4 Philosophy of Gestures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 279
33.5 Mathematical Theory of Gestures in Music . . . . . . . . . . . . . . . . . . 281
33.6 Hypergestures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 283
33.7 Hypergestures in Complex Time . . . . . . . . . . . . . . . . . . . . . . . . . . . . 285
References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 297
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 303
Part I
First: the history of mathematics and music proves that these partners are dif-
ferent in methodology, language, and existential position, so identifying them
would no longer be possible now (more about the Pythagorean position in Sec-
tion 2.1). Second: the difference in their perspectives is a major force for creative
interaction, and this is a major reason for the historically important mutual
inspiration mathematics and music have given to each other. The recent inspi-
ration of the great mathematician Alexander Grothendieck (1928-2014) for his
highest intellectual challenge, the idea of a theory of motives, was understood
as a musical idea of fundamental mathematical structures that act like musical
motives in the great symphony of mathematics.
For these reasons we want to propose a picture of “mathemusical” interac-
tion that expresses the general atmosphere of their interaction. This picture is
that of a counterpoint of two voices, the cantus firmus of mathematics and the
discantus of music, which interact in a consonant harmony but move in a cre-
ative fashion on the axis of time through history and unfold in a contrapuntal
tension of autonomous but deeply connected voices.
The famous music theorist Eduard Hanslick in his book Vom Musikalisch-
Schönen [49] defines musical content as “tönend bewegte Formen,” not just
forms, but “forms that are moved in sound.” In fact, the formal aspect—the
formula—of a cadence, for example, is not sufficient to generate content. The
form(ula) needs to be moved, and so it is deployed in a gestural dynamics. And
Hanslick illustrates his idea with the kaleidoscope, a dynamic arrangement of
forms that receive their aesthetic value in a self-referential internal relationship.
–Σ–
Some technical terms will be used in this historical chapter. All terms will
be thoroughly explained in later chapters. Please use the book’s index to find
references to these terms if needed.
2.1 Pythagoras
A more reliable fact is that his school had to make an oath on the tetrak-
tys, the cosmological symbol for which Pythagorean philosophy and cosmology
is known, see Figure 2.2.
Fig. 2.2. The tetractys, the cosmological symbol for which Pythagorean philosophy
and cosmology.
The tetractys is a triangular symbol built from ten points, ten being a
sacred number in ancient Greece. The points are piled in decreasing groups
of 4,3,2,1 points. This generates a sequence of fractions 2 : 1, 3 : 2, 4 : 3,
which were considered as basic consonances when played on the Pythagorean
experimental device, the monochord, see Figure 2.3.
Fig. 2.3. The monochord has one string. Its pitch is doubled by one octave when the
string length is halved, 1/2, plays a fifth higher when its length is taken 2/3, and it
plays a fourth higher when taken 3/4.
The musical aspect of the Pythagorean approach was not to make any
compositions in the modern sense of the word. They would try to hear the hid-
den harmony of the universe that was thought be represented by the tetractys
symbol. This symbol played the role of what contemporary physics would call
a “world formula.” The tetractys was the Pythagorean world formula that ulti-
mately described the universe in a mathematical shape, the numerical tetractys
triangle. In this sense, Pythagorean music was an experimental science, and the
monochord was the experimental instrument/machine/testing ground (like the
role of the Large Hadron Collider (LHC) particle accelerator for the Centre
European de Recherche Nucléaire (CERN) today).
2.2 Artes Liberales 7
Artes liberales, the liberal arts, were a medieval canon of education for
the free persons (as opposed to slaves and bondservants). They were seven in
8 2 Short History of the Relationship Between Mathematics and Music
number, divided into two groups: the quadrivium, the “fourfold path,” compris-
ing music, arithmetic, geometry, and astronomy (called astrology at that time),
and the trivium, the “threefold path,” comprising grammar, logic (called dialec-
tic), and rhetoric, see Figure 2.4. The remarkable point here is that music and
the mathematical sciences, arithmetic and geometry, were grouped together.
This is due to the Pythagorean tradition to view music as a mathematical sci-
ence. The humanities, as grouped in the trivium, were separated from music.
The tradition to group music with the humanities was introduced much later,
essentially due to Decartes’ psychological interpretation of music. In the 19th
and 20th centuries, music has been redirected to the mathematical sciences,
mainly due to the development of acoustics and the mathematical nature of
modern physics.
2.3 Zarlino
Gioseffo Zarlino was one of the Renaissance’s most
important music theorists, composers, and musicians
(singer and organist). He was born in Chioggia near
Venice. He was educated by Franciscans and later joined
their ranks. He was maestro di cappella of St. Mark’s, a
most prestigious position in Italy. His theoretical work Le
istitutioni harmoniche [118] and later Dimostrationi har-
moniche [119] established new insights into meantone,
tempered, and just tunings. He established a harmony
that emphasized C-major (the ionian mode) and recog-
Fig. 2.5. Zarlino
nized the basic role of the major and minor triad, re-
(1517-1590). lating them by a symmetry in the just-tuning space. He
also developed just tuning from the Pythagorean tuning,
adding the major third interval because the Pythagorean tuning created diffi-
cult ratios for certain intervals. This was a logical extension also of the tetractys
construction, adding a fifth row with five points to the Greek construction, see
Figure 2.6. He also sought, as one of the first theorists, an explanation for the
forbidden parallels of fifths and octaves in counterpoint.
Despite the difference between Chinese and Western music development, the
basis of the equal-temperament scale (twelve equal semitone intervals per oc-
tave, we call this the 12-tempered in Section 13.1) was first mathematically
calculated by the Chinese mathematician and musician Zaiyu Zhu (1536-1611)
in 1584 (Figure 2.7). But this was never widely used in composing the indige-
nous music of China until 1685. Before Zhu’s tuning system was conceived,
2.4 Zaiyu Zhu 9
Fig. 2.6. The extension of Pythagorean tetractys by amplification with a fifth row,
yielding the new ratio 5 : 4 of the just-tuned major third interval.
there wasn’t a standard tuning system between different instrument types such
as strings, winds, and keyboards. The formula he discovered allowed pitch-pipes
in a equal-tempered scale of twelve equal-ratio semitones per octave, which led
to a revolution of music and physics.
The problem of how to modulate between
different keys bothered many theorists, until Zhu
solved it. This solution shows that music should
not be separated from mathematics. Independently,
only 150 years after Zhu’s theory, in the 18th cen-
tury, the equal-temperament became the basis of
Western composition. Equal-tempered tuning is
widely used today because it is the best tuning
system for modulation in performance. Zhu com- Fig. 2.7. Zaiyu Zhu (1536-
mented on his theory of equal-temperament tun- 1611).
ing that scholars would have to be well acquainted both with acoustics and
with mathematical calculation. Fifty-two years after Zhu published this equal-
tempered tuning system, theorists Père Mersenne discovered exactly the same
principles, using knowledge of mathematics to solve music problems.
Zhu inherited his peerage “Prince Zheng” in 1553 as the first son of Zheng,
when Emperor Zheng died. Although he could have had power, instead, he
decided to live in self-imposed retirement to focus on his research. Zhu’s famous
books include (乐律全书) On the Equal Temperament, 1584, [120], (律吕精义)
A Clear Explanation of That which Concerns the Equal Temperament, 1595/96
[121], and (算学新说) Reflection on Mathematics, 1603, [122]. Theorist Fritz A.
Kuttner describes him as “one of the most important historians of his nation’s
music” [57].
This system is a great invention, specifically for tonal modulation. How-
ever, in our contemporary practice it is still used in combinations of different
tuning systems depending on the instrumental setup.
10 2 Short History of the Relationship Between Mathematics and Music
Fig. 2.8. Johann Sebastian Bach (1685-1750) with the score of canon triplex in six
voices, written using only three of them.
Let us now consider an example (taken from [30]). The Italian composer Luigi
Cherubini wrote a double choir with perfect imitation inverse and contrario
motu, where the first choir is mirrored into the second choir, as shown in Figure
2.9.
Fig. 2.9. The first four measures of a double choir with perfect imitation inverse and
contrario motu, composed by Luigi Cherubini. The sound example is Cherubini.
12 2 Short History of the Relationship Between Mathematics and Music
Soprano (I)
6 7 2 1 7 1
Contralto (I)
5 6 6
Tenore (I)
6 5 3
Basso (I)
1 7 1 3 6
Risposta inversa contraria.
Soprano (II)
6 7 6 4
Contralto (II)
1 2 4
Tenore (II)
2 1
Basso (II)
1 5 6 7
Fig. 2.11. The first two (treble clefs) specify g above middle c, the next ones central
c, except the last three (bass clefs), which specify f below central c.
7
16
Fig. 2.12. A retrograde canon in Bach’s A Musical Offering. The sound example is
Bach. The sound example contains also the solution.
Fig. 2.13. Kircher’s tabula mirifica omnia contrapunctisticae artis arcana revelans.
1 2 3 4 5
1 2 3 4 5 4 3
1 2 3 4 5 4 3 2 1
the first to observe microbes on a microscope and set forth the now valid theory
that plagues were caused by infectious microorganisms.
For our musical concerns, his work in knowledge science is most important
[36], also, because music to Kircher was the model of any order. In the tradi-
16 2 Short History of the Relationship Between Mathematics and Music
Fig. 2.15. An original example of another similar game is the palindrome. The sound
example is Palindromo.
Fig. 2.17. A retrograde canon together with its retrograde solution. The sound ex-
ample is retrograde, first the retrograde canon, then the canon with its solution.
ware for composition, notation, and analysis. Figure 2.20 shows Kircher’s arca
musarythmica, which he described in his work Musurgia Universalis [53]. Recall
that Wolfgang Amadeus Mozart with his “musikalisches Würfelspiel” (musical
dice game) also conceived a combinatorial construction of musical composi-
tions from elementary components (see Sectiondicegame. Kircher’s approach
to music follows the Pythagorean tradition that music is strongly related to the
technology of its physical realization and also demonstrates that in music the
progress of technology is always reflected in the musical realm.
Leonhard Euler is one of the greatest mathematicians of all times. His formula
eiπ + 1 = 0 is considered to be the most beautiful formula in mathematics
as it combines the basic numbers e, π, 0, 1 as well as equality, addition, multi-
plication, and exponentiation. His work comprises seventy-six quadro format
(12-inch high) volumes.
Euler was born in Basel, Switzerland, and first studied theology, Greek,
and Hebrew because his father wanted him to become a priest. But the famous
mathematician Johann Bernoulli, who taught Euler mathematics, persuaded
father Euler to let Leonhard study mathematics. He completed his studies
in Basel with a dissertation about sound, De sono, in 1726. He applied for
a position as professor of physics at the University of Basel but did not get
the position and became professor of mathematics at the Imperial Russian
Academy of Sciences in Saint Petersburg in 1727. He moved to Berlin in 1741
18 2 Short History of the Relationship Between Mathematics and Music
Fig. 2.20. Kircher’s arca musarythmica. Its purpose was to develop a large number
of musical compositions by combining elementary components.
and was professor of mathematics at the Berlin Academy until 1766, when he
returned to Saint Petersburg, where he spent the last part of his live.
Euler’s work is very broad, reaching from theo-
retical to applied mathematics, from celestial to fluid
mechanics, from formal logic to mathematical music
theory. The latter field is why we are particularly in-
terested in Euler’s work. He wrote three treaties on
music [31, 32, 33]. The second deals with consonances,
and we shall see that Euler has made an important
contribution to the definition of a musical consonance
with his prime-number-based gradus suavitatis func-
tion (see Musical Example 64). But the most impor-
tant contribution to music theory is Euler’s definition
of the geometric space of pitch classes in his third
Fig. 2.21. Leonhard Eu- work, where a two-dimensional space is spanned by
ler (1707-1783).
the axis of fifths and the axis of major thirds. This
space is now known as the Euler space (see Musical Example 63). Figure 2.22
shows Euler’s geometry of pitch classes.
Euler’s success with his mathematical music theories was limited at his
time because it apparently was too mathematical for musicians and too musical
for mathematicians—that sounds familiar to us, doesn’t it?
2.8 Joseph Fourier 19
Fig. 2.22. Leonhard Euler’s two-dimensional space of pitch classes that is spanned
by the axes of fifths (horizontal) and major thirds (vertical).
amplitudes during the sound’s process). Music theorists have tried to view the
Fourier partials as being intrinsic components of a sound, but this is erroneous.
There are infinitely many decompositions in the form Fourier described, and
this is a mathematical field called functional analysis. The Fourier decomposi-
tion was also thought to be the analysis that the cochlear inner ear performs.
This is also not the case, but see [75, Appendix B] for details. The digital
technology has also found algorithms for fast calculation of partials, the Fast
Fourier Transform (FFT). Without FFT, modern sound technology would be
infinitely slow. Therefore, Fourier’s theorem has become a fundamental tool for
the present sound technology.
Fig. 2.24. Helmholtz constructed his resonator (i) to measure partials of complex
sounds, here within his instrumentation to mimic vowel sounds.
study medicine, but the son was more interested in natural science. He became
associate professor of physiology at the Prussian University of Königsberg in
1849, was full professor of anatomy and physiology at the University of Bonn
in 1855, moved to the University of Heidelberg in 1858 and was professor of
physics at the University of Berlin from 1871 through the rest of his life.
2.10 Wolfgang Graeser 21
between his abstract symbolic investigations and these symbols’ real dancing
embodiment [123].
In a nutshell, Graeser’s legacy testifies the
immense tension that causes music to vibrate be-
tween facticity of formulaic compression and un-
folding gesturality in the making. And it asserts
the belief and insight of an ingenious talent in
the utopia of coherence between these ontologi-
cal poles. Despite his profiled discoveries, Graeser
was never really understood by musicians, music
theorists, or music philosophers. His literally sui-
cidal tension somehow hindered academic minds
to approach this body of knowledge, much as it
has taken nearly a century for philosophers to se-
riously deal with Nietzsche’s “philosophy with the Fig. 2.26. Wolfgang
hammer.” Graeser (1906-1928).
should do with no other student, for I think one should study harmony
and counterpoint. But this was a man so much out of the ordinary that
I said (...) No, you are almost thirty, you have the good fortune of being
Greek, of being an architect and having studied special mathematics.
Take advantage of these things. Do them in your music.
In 1954, Xenakis was accepted into Pierre Schaeffer and Pierre Henry’s
Groupe de Recherches de Musique Concrète. In 1957, Xenakis received his first
composition award. In 1959 he left Le Corbusier’s studio and started making
a living as a composer and teacher. He founded the CEMAMU in Paris and in
1963 wrote his main work of mathematical music theory: Musiques formelles
[117], later extended and translated to English as Formalized Music: Thought
and Mathematics in Composition (1971) [116]. In this work he explained the
mathematical techniques he used for music composition: set theory, stochastics,
game theory, probability theory, and group theory of permutations.
With his computer-aided composition tool, the UPIC, he was able in 1979
to design compositions on the basis of a two-dimensional (plane) graphical
input (Figure 27.2). It is interesting that despite his strongly spatial approach
to musical composition, Xenakis never applied linear algebra, matrix calculus,
or even differential geometry. It is reported that his problem with not being
able to apply affine transformations to his configurations was solved only by the
graphical composition software presto invented by one of the authors (Guerino
Mazzola), but see Section 2.15.
Weil die mathematische Methode die Wissenschaft ist, die zur Zeit die
am weitesten entwickelte Methodologie besitzt, war mir daran gelegen,
sie zum Vorbild zu nehmen, das uns helfen kann, unsere gegenwärtigen
Schwachstellen zu beheben.
(Because the mathematical method is the science that presently has
the most developed methodology, I was interested to take it as a model
that can help us eliminate our present weaknesses.)
2.13.1 Genealogy
the domain of contemporary music theory. Perhaps one of the most fruitful
approaches is based on the underlying dichotomy between an apparently more
compositional attitude (Babbitt) and a radically analytical perspective (Allen
Forte) toward music theory. Both of these approaches are divided into two for-
mal parts: meta-theory and methodology/compositional theory, and historical
and theoretical essays/analytical studies, respectively. The former work points
to “the engagement by composers in fundamental music-theoretical explica-
tions” [21, p.vii], while the latter suggests that “the very fact that Forte is not
himself a composer has changed the field of theory considerably” [15, p.50].
We may also suggest here that this distinction not only is relevant for
an historical discussion on pitch-class set theory, but it also helps in under-
standing how this theory successively enlarged its field of applicability thanks
to important works by John Rahn [95], Robert Morris [89], and David Lewin
[60, 61].
As pointed out by George Perle in his comprehensive study on serial and
atonal music, “the most important influence of Arnold Schönberg’s method is
not the 12-note idea in itself, but along with it the individual concept of per-
mutation, inversional symmetry and complementation, invariance under trans-
formation, aggregate construction, closed systems, properties of adjacency as
compositional determinants (...)” [94, p.x]. This ‘Babbittian’ presentation of
12-tone problematics constitutes perhaps the most appropriate introduction to
Babbitt. It is widely accepted that he first provided “twelve-tone theory with
a consistent technical vocabulary” [94, p.xiv] and suggested that the relevance
and “the force of any ‘musical system’ was not as universal constraints for
all music but as alternative theoretical constructs, rooted in a commonality of
shared empirical principles and assumptions validated by tradition, experience,
and experiment” [21, p.ix].
One cannot emphasize enough that, for Babbitt, a “set” is an ordered
collection of pitch classes, and it is used as a perfect synonym for row and
series. In contrast, the very predecessor of Forte’s “pitch-class set” is Babbitt’s
“source set,” a set “considered only in terms of the content of its hexachords, and
whose combinatorial characteristics are independent of the ordering imposed
on this content” [11, p.57]. A synonym for it is “collection,” first introduced by
Lewin in [62] and widely discussed for its analytical pertinence in [63].
Subsequently, a vast body of American literature was devoted to the study
of the specific properties of sets and collections, particularly combinatoriality
and partitioning [69, 51, 43, 16, 106, 107]. Partition problems connected with
Babbitt’s original idea have also largely proved their relevance to mathemat-
ics with their natural embedding into the theory of groups [60, 89]. But even
the idea of applying the mathematical concept of group for modeling musical
systems can be regarded as one of Babbitt’s most fruitful intuitions4 , provided
that “the rules of formation and transformation of the twelve-tone system are
4
However anticipated in 1924 by Wolfgang Graeser in his study on Bach’s “Kunst
der Fuge” [46].
2.13 American Set Theory 27
2.13.2 Comments
(e.g., in John Amuedo’s work). But there remains a big lack of models in the
sense that beyond descriptive tasks, the AST language has very seldom led to
musicological modeling. Most theorems of AST are of strictly combinatorial
nature.
So we could summarize the AST achievement as a necessary but far from
sufficient attempt to escape decadent and impotent European musicology.
particular with Georg Rainer Hofmann, at that time a PhD student of professor
José Encarnaçao.
From 1987 to 1989, upon recommendation by conductor Herbert von
Karajan, Mazzola and Hofmann undertook a joint project at the Fraunhofer-
IGD in Darmstadt. The project, sponsored by private investors, produced a
composition software for ATARI called MDZ71. Later, several hundred copies
were commercially distributed. The software also was presented to von Karajan
in 1988.
As these efforts matured, the idea of a “Big Science in Music,” paired with
its realization in the form of a corresponding institute, became virulent. During
their countryside walk, Hofmann and Mazzola pleaded for the creation of an
association for the advancement of an Institute for Fundamental Research in
Music (IFM).
Meanwhile, the first systematic treatment of Mathematical Music The-
ory (MaMuTh), Geometrie der Töne, was published in 1990 [73]. This gave a
theoretical background to the ideas promoted thus far.
In 1992 an interest group was formed with the goal of finding people who
support the idea and the foundation of an IFM association.
In summer 1993, such an association, named “Verein zur Gründung eines
Instituts für Grundlagenforschung in der Musik,” was established. Among the
initial members were Mazzola and his wife, Christina, the statistician and com-
poser Jan Beran, and the mathematician and musicologist Daniel Muzzulini.
The next step was to win over members of a prominent patronage: (Wolf-
gang Auhagen, Valentin Braitenberg, Manfred Eigen, Heinz Götze, Walther
von Hahn, Michael Leyton, Ernst Lichtenhahn, Helga de la Motte, Hellmuth
Petsche, Roland Posner, Peter Stucki, Ernst Terhardt, Walter Thirring, and
Heinz-Gregor Wieser).
The association’s board succeeded to manage the periodic information
of the members of the association and the members of the patronage about
the association’s achievements, to organize annual meetings and symposia with
scientific and artistic musical themes. The last such event took place in 2002 in
Zürich during the Third Seminar on Mathematical and Computer-Aided Music
Theory, where two pianists, a music performer, and an interactive multimedia
environment were engaged.
a hard thing to sell. Well, the next section offers a solution to this challenge:
the Society for Mathematics and Computation in Music.
Last but not least, this book, being co-authored by Mazzola and two excel-
lent PhD students at the School of Music of the University of Minnesota, Maria
Mannone and Yan Pang, is a marvelous proof of the viability of Mathematical
Music Theory for a creative and scientifically based future of music.
Fig. 2.32. The 2015 MCM conference participants at Queen Mary University in
London.
34 2 Short History of the Relationship Between Mathematics and Music
This germinal power eventually helped create the Society for Mathematics
and Computation in Music (MCM). Music theorist Robert Peck from Louisiana
State University in Baton Rouge formally brought this society to life in 2006.
In June 2007, the Society’s first biannual conference took place in Berlin at the
National Institute for Music Research, followed by the 2009 MCM conference
at Yale University, the 2011 MCM conference at the IRCAM in Paris, the 2013
MCM conference in Montreal at McGill University Schulich School of Music
& CIRMMT, and the 2015 MCM conference at the Queen Mary University in
London, see Figure 2.32 for a group photograph. The conference in 2017 will
be held at the National Autonomous University of Mexico (UNAM) in Mexico
City. The proceedings of all these conferences are documented by Springer
International Publishing. The Society also has its own journal, the Journal of
Mathematics and Music, published by Taylor & Francis, and the Springer book
series Computational Music Science (editors: Andreatta and Mazzola). Mazzola
is the MCM’s founding president, see [86] for more information.
Summarizing, the long historical process of the counterpoint of mathemat-
ics and music has now led to a synthesis of art and science that rewards many
passionate and often desperate efforts to reunite beauty and truth.
Part II
ˇ “*
Example 2 Musical scores are containers of notes, see Figure 3.1. In music,
there are also containers of musical parts: Song form A-B-A, sonata form:
{exposition, development, recapitulation, coda}, etc.
Fig. 3.1. A score is a container of note objects. It is constructed from the empty
score container by adding a number of notes, rests, etc. The empty score to the
right, representing a real composition, is a pure container. This marche funèbre was
composed by Alphonse Allais in 1897 for the funeral of a deaf friend.
ferring to themselves in the definition, but in this book, only the set concept
will be introduced in a circular way. When introducing a new mathemati-
cal concept N ew, its definition, if it is given by an equation, is written as
N ew := Old, or Old =: N ew, where the Old part contains already defined
concepts.
• A sorite is a collection of easy statements that follow directly from given
definitions.
Here is the list of the eight ZFC axioms:
Axiom 1 (Axiom of Empty Set) There exists the empty set, denoted by ∅,
which has no elements, i.e., for all sets x, x ∈ ∅, or ∅ = {}.
ˇ “*
Example 3 The empty set is the formal concept that represents the empty
container. Figure 3.3 shows a famous composition by John Cage that—similar
to the above example—is an empty container. Only the duration, 4 minutes
and 33 seconds, is determined for Cage’s composition [25].
Fig. 3.3. John Cage’s composition 4’33", an empty set of notes. Only the container
is set—this is the musical analogue of the empty set ∅. The score is restricted to the
command to be quiet, tacet in Latin, for 4’33”. It was composed in 1952 [25].
Fig. 3.4. The left hand score (part below) is a typical subset of the total score for
both hands (upper part). We write the subset symbol vertically, so do not confuse it
with the union symbol. The sound example is Yan_set.
Fig. 3.5. The total score of a sonata is the union of the set a = {A, B, C, D} of its
subscores A, B, C, D.
3.2 Pure Sets 43
Axiom 4 (Axiom of Pairs) If a and b are two sets, then there is the pair
set {a, b}.
ˇ “*
Example 5 If a is a score, i.e., a set of notes, rests, etc., then taking Φ(x)
for “x = C# OR x = G” extracts the subset a|Φ = {C#, G} if both, x = C#
and x = G are elements of the score a. If Φ(x) stands for “duration of x=”,
then a|Φ is the subscore of all eighth notes in a.
ˇ “*
Example 6 Think of a rehearsal situation. If a is a score, the conductor
may choose to hear any subscore of a—strings, strings and winds, winds and
brass, etc. The set that contains all the possible subscores is the powerset 2a
of that score.
Lemma 1 For any set a, there is the set a+ = a∪{a}, it is called the successor
of a.
Fig. 3.6. In a score with two sets, a for the cantus firmus (CF) voice and b for the
discantus (D) voice, we have the intersection a ∩ b of the common notes.
This reminds us of the Chinese proverb, “One creates two, two creates
three, three creates everything.” The axiom of infinity guarantees that there is
one set container that for every element it also contains its successor.
Axiom 8 (Axiom of Choice) Let d be a set whose elements are all non-
empty, and such that
any two different elements x, y ∈ d are disjoint, then
there is a subset c ⊂ d such that for every x ∈ d there is exactly one element
in c ∩ x.
Fig. 3.7. The axiom of choice guarantees that we may create a set c that has exactly
one element in each of the elements of a set a. Here we take the first note of each of
the three measures in the set a.
With these constructors of sets, one can define a calculus on 2a called Boolean
algebra of a. We first present the theory and then discuss a composition, Herma
by Xenakis, that was composed using this theory. The Boolean algebra of 2a is
defined as follows.
(vi) (involution) x + x = ∅.
(vii) (solutions of equations) the equation x + y = z has exactly one solution w,
i.e., there is exactly one set w ⊂ a such that w + y = z.
Remark 2 This structure will later be discussed as the crucial algebraic struc-
ture of a commutative ring, see Chapter 24.
√
Exercise 2 Let a = {r, s, t, w} with four different elements r, s, t, w. In
the Boolean algebra 2a , calculate the solution x of x + y = z within 2a for
y = {r, s, w}, z = {s, w}.
ˇ “*
Exercise 1 Let a = {♩, , }. In the Boolean algebra 2a , calculate the solu-
tion w of w + y = z within 2a for y = {, }, z = {♩, }.
√
Exercise 3 Let a = {∅}. Calculate the complete table of sums x + y and
products x.y for x, y ∈ 2a , using the symbols 0 = ∅, 1 = a.
The following example has been discussed in [4]. Here, we give a short summary
of that discussion.
Xenakis was one of the first composers to use advanced mathematical
procedures to compose music. His method was described in his book Formal-
ized Music [116]. The composition Herma for piano uses Boolean algebra to
create its detailed structure. Xenakis calls his method “symbolic music.” In
[4], the authors also provide us with a computer-aided implementation of the
composition, using their music software OpenMusic.
The Boolean algebra on 2a that Xenakis uses starts from a frame set a =
R, which, according to the composer, is the set R “of all the sounds of a piano.”
That means all the pitches of the piano (usually 88 in number). He then selects
three subsets A, B, C of R, as shown in Figure 3.8, and creates new subsets
using the Boolean operations of union, intersection, and complementation. We
have to learn why these operations are all Boolean. The intersection is the
Boolean product. For the union x ∪ y of x, y ∈ 2a , observe that a − x ∪ y =
(a − x) ∩ (a − y) = (a − x).(a − y). But the complementation a − x is the solution
z of the equation z + x = a. Therefore, after taking differences a − x, a − y, we
can construct a − x ∪ y and then its difference set x ∪ y = a − (a − x ∪ y).
The core set needed by Xenakis is denoted by F . It is achieved by a flow
chart of preliminary operations as shown in Figure 3.9.
This flow chart shows two construction modes of F : plane (1), associated
with dynamics signs f and ff , and plane (2), associated with dynamics signs
ff and ppp. This flow chart is the basis of the display of the note sets as-
sociated with the different operations in musical time. It has been observed
in [4] that Xenakis here works on two different time levels: the symbolic or
3.2 Pure Sets 47
Fig. 3.8. Xenakis selects three subsets A, B, C of the set R of all piano sounds.
logical “outside-of time” realm of imaginary time of thoughts, and the musical
“in-time” of musical events in real time. We shall come back to this distinction
when discussing performance theory in Chapter 32.
48 3 The Architecture of Sets
Fig. 3.9. Two construction modes define F : plane (1), associated with dynamics signs
f and ff , and plane (2), associated with dynamics signs ff and ppp.
4
Functions and Relations
Summary. The elements of a set can be written in any order, and we don’t
yet have a way to define ordered structures. So we now develop the theory of
ordered structures. It also allows us to define the central concept of a function.
–Σ–
So far, the elements of a set can be displayed in any order, e.g., {a, b, c} =
{c, b, a}. But the order of a collection may be relevant for the description of a
mathematical situation. We will take care of this problem in this chapter.
ˇ “* Example 7 In music theory, they often speak about “ordered sets” in the
sense that an “ordered set” {c, e}, representing an interval, is different from an
“ordered set” {e, c}. But this is an abuse of language. In mathematics, sets are
always unordered collections of their elements. An “ordered set” {c, e} would
then be represented as an ordered pair (c, e), which is different from the ordered
pair (e, c).
Fig. 4.1. A drawing of Antoni Gaudí’s Sagrada Família church in Barcelona (left) as
a graph in the cartesian product space of height and width can be represented also as
a graph of notes, a composition living in the cartesian product space of onset times
and pitches. We come back to questions of visualization and sonification for music in
Section 27.3.
√
Example 1 The diagonal is the graph Δa ⊂ a × a of all ordered pairs
(x, x), x ∈ a (Figure 4.2). For a graph g ⊂ a × b, the inverse graph g −1 ⊂ b × a
is the graph of all ordered pairs (y, x) such that (x, y) ∈ g.
Fig. 4.3. The composition graph g in the cartesian product of onsets and pitches,
together with its two projections, which show the onsets and pitches that are involved
in this composition.
ˇ “*
Exercise 2 Consider the graph g shown in Figure 4.3. Show that the com-
position g −1 ◦ g is the graph in the cartesian product Onsets × Onsets whose
pairs (s, t) are the pairs of times that show repeated pitches.
52 4 Functions and Relations
4.2 Functions
Fig. 4.4. A melody is a typical functional graph in the onset-pitch space. For every
onset, there is at most one associated pitch in the graph. The flute and the monochord
are instruments on which one plays only melodies, never two sounds at the same time.
Fig. 4.5. If the onset times are eight multiples of one eighth (), then this score
shows a melodic function: It is a functional graph that has values for all available
onset times.
√
Example 2 For every set a we have the identity function Ida = (a, a, Δa).
Denote 1 = {∅}. Then, for every set a, there is a unique function ! : a → 1. And
4.2 Functions 53
Definition 11 If we have pr2 (g) = b for a function f = (a, b, g), we say that
f is surjective (epi) and also write f : a b. If f (x) = f (y) implies x = y for
any two arguments x, y ∈ a, we say that f is injective (mono) and also write
f : a b. If f is epi and mono, we say that f is bijective (iso) and also write
∼
f : a → b.
Fig. 4.6. Illustration of three function types: injective, surjective, and bijective.
ˇ “*
Exercise 3 In musical composition, there is a simple method to create pro-
totypes of melodies: dodecaphonism, which was invented by composer and theo-
∼
rist Arnold Schoenberg around 1921. He conceived 12-tone series s : O12 → P12 ,
where O12 is a set of 12 onset times
Fig. 4.7. Note names for keyboard keys and note and rest durations.
and where
P12 = {p0 , p1 , p2 , p3 , p4 , p5 , p6 , p7 , p8 , p9 , p10 , p11 }
is the set of 12 pitch names on the piano, typically named by
p0 = C, p1 = C, p2 = D, p3 = D, p4 = E, p5 = F,
p6 = F , p7 = G, p8 = G, p9 = A, p10 = A, p11 = B,
where the symbol denotes the black key to the right of the white key it follows,
e.g., C is the black key to the right of the white key C. See also Figure 4.7,
where we show the note names corresponding to keys, as well as durations of
notes and the corresponding rests.
Fig. 4.8. 12-tone series for Anton von Webern’s compositions op. 17.2 and op. 30.
∼
For any two such 12-tone series g, h : O12 → P12 , one may consider their
∼
composition h−1 ◦ g : O12 → O12 . Explain the meaning of pairs of times in this
4.2 Functions 55
composed bijection on the time set. Calculate the resulting bijection for the
two series composed by Anton von Webern in his op. 17.2 and op. 30 as shown
in Figure 4.8.
ˇ “*
Example 8 In serial music, Schönberg’s idea to consider a bijection s :
∼
O12 → P12 was generalized to other parameters beyond pitch. For example,
serial composers also consider 12 durations in a set
ˇ “*
Example 9 In music theory, there is a theory of harmony that was proposed
by Hugo Riemann, the (musical) function theory. It is a funny coincidence that
there is also a mathematical function theory (same name!) that was developed
by Bernhard Riemann. But beyond these similarities, the two theories have
nothing in common. Hugo Riemann’s musical function theory was the program
for attributing to every possible chord one of three possible harmonic functions,
Tonic (T ), Dominant (D), and Subdominant (S) [29]. This idea should have
defined a harmonic syntagmatics, i.e., a procedure to attribute to compositions
sequences of harmonic functions to represent the meaning of the harmonic
movement through time.
A classical syntagm in this spirit is the sequence IC = {c, e, g}, IVC =
{f, a, c}, VC = {g, b, d}, IC = {c, e, g} of triadic degree chords (here in C major),
the standard tonality of white keyboard keys when starting at the tonic key
C. This candential sequence is understood as a shorthand for the harmonic
identification of the given tonality (here, in fact, one octave of C-tonality is
identified as the union IC ∪ IVC ∪ VC ). In such a cadence, degree I was thought
to have the function value T , degree V value D, and degree IV value S.
Riemann’s idea was to attribute such function values to all chords, effec-
tively defining a tonality by such a function. So tonality C major would be
defined as a function C − T onality : Ch → T DS = {T, D, S} whose domain
Ch is the set of all chords, which are by definition all the finite sets in 2P itches ,
where P itches represents the set of all pitches. We shall define “finiteness” of a
56 4 Functions and Relations
set in Section 6.3, but here, just take it as the property that one can count a
chord’s elements and end up after a number of steps.
This means that Riemann wanted to define all possible tonalities X as
functions
X − T onality : Ch → T DS.
He however didn’t intend to allow general tonality functions; he intended to
extend functions
4.2.1 Equipollence
Fig. 4.9. The powerset of set a is larger than a, as is seen here for subsets of a that
are not hit by the map x → {x}.
4.3 Relations
Relations among elements of a given set are essential for the description of
musical configurations.
(vii) The relation is a partial relation if it satisfies (i), (iii), and (iv).
(viii) The relation is a linear relation if it is partial and satisfies (v).
(ix) The linear relation is a well-ordering1 if every non-empty set b ⊂ a has a
minimal element, i.e., m ∈ b with m ≤ y for all y ∈ b.
ˇ “*
Example 10 If we take the set F in(a) ⊂ 2a of finite subsets of the set
a of pitches, these subsets are called chords. With the relation of inclusion
c ≤ d iff c ⊂ d, we have a partial ordering among chords. The chords having
two elements are usually called intervals. The chords that are singletons are
identified with their single elements. In music theory, the case c ≤ d for an
interval c is an important property of a chord d, meaning that the interval c is
part of chord d. In harmony, especially in modulation theory (which we shall
discuss in Chapter 23) the important case is when a chord c is ≤ two larger
chords r, s.
The cartesian product of two relations R ⊂ a × a, S ⊂ b × b is the relation
R × S ⊂ (a × b) × (a × b) defined by (r, s)R × S(u, v) iff rRu and sSv. Observe
that the cartesian product R × S need not be linear if both R and S are so,
see the Musical Example 11.
1
A famous theorem of Ernst Zermelo states that the axiom of choice is equivalent
to the statement that every set has a well-ordering.
4.3 Relations 59
We have already encountered the final set 1 as well as the initial set ∅ in
Example 2. Let us denote ∅ = 0 to ease notation. So we have ! : 0 → a and
! : a → 1. It is easy to see that any final element must be a singleton, and that
any non-empty set cannot be initial. Therefore there is only one initial set 0
and the final sets are those whose cardinality is 1, the singletons.
For the next universal property we denote by pra : a × b → a : (x, y) → x
the first and by prb : a × b → b : (x, y) → y the second projection.
f
◦f
pr
?
-
a - b
pra prb
a×b
√
Exercise 8 Suppose a set q is given, together with two functions fa : q →
a, fb : q → b such that we have a bijection
∼
Set(c, q) → Set(c, a) × Set(c, b) : f → (fa ◦ f, fb ◦ f )
for all sets c. Then the universal property of the cartesian product a × b implies
that there a unique f : q → a × b corresponding to the two functions fa , fb .
Prove that f is a bijection. In particular, the universal property of the cartesian
product determines it up to equipollence, i.e., it determines its equipollence
class.
ˇ “*
Example 12 Take the two-element set 2 = {0, 1} and the cartesian product
a × b of the onset and pitch sets a, b. Then an interval in a × b can be defined as
a set function i : 2 → a × b. It determines a start note i(0) and an end note i(1)
of the interval. By the universal property of cartesian products, i corresponds
biunivoquely, or one-to-one, to a pair ia = pra ◦ i : 2 → a, ib = prb ◦ i : 2 → b.
This means that an interval i is the same as giving two intervals ia , ib , one
of onsets and one of pitches, see Figure 5.1. This idea can also be applied to
series in several musical parameters, such as pitch, duration, and loudness, as
is customary in serial composition.
5.3 The Coproduct 63
Fig. 5.1. An interval i in onset-pitch space is the same as giving two intervals ia , ib ,
one of onsets and one of pitches.
The next universal property is “dual” to the preceding one in the following
sense.
Definition 16 Given two sets a, b, their coproduct or disjoint sum is the set
a b = {0} × a ∪ {1} × b,
6
f◦
a
in
in
f
f◦
a
ina
- a b inb b
64 5 Universal Properties
5.4 Exponentials
For the next universal property, we denote Set(b, c) by cb for reasons that will
become clear soon, but also to stress that we view Set(b, c) as a set rather then
a special collection of functions.
ˇ “*
Example 13 Let c = {0, 1} and a the set of onsets, b the set of pitches.
Then a musical composition can be described (in a simplified manner) in two
ways: First as a function f : a × b → c that associates with each pair (o, p)
of onset and pitch the value f (o, p) = 0 if there is a note in this position, and
f (o, p) = 1 else. We can also see this data as a function δ(f ) on the set a of
onsets that associates with every o ∈ a the chord δ(f )(o) : b → c at that onset
time, i.e., the function that specifies the pitches that are present at that onset
o.
χ(b)(α) = 0 if α ∈ b, and χ(α) = 1 else. The function χ(b) is called the charac-
teristic function of b. The inverse of χ is the zero fiber, i.e., χ−1 (x) = x−1 (0).
Observe that now, the subsets of a, elements of 2a , are identified to func-
tions a → 2, i.e., elements of Set(a, 2) which we also denote by 2a . This is now
perfectly legitimate by the above theorem.
ˇ “*
Example 14 The subobject classifier is extremely important for the classi-
fication of musical structures. Let us give a basic example. Consider chords as
finite subsets of a domain of pitches, or even pitch names, such as the domain
Definition 19 Suppose that we are given a family (fx )a of sets such that each
fx bears a linear ordering <x , and such that the index set a is well-ordered by
the relation ≺. Then, for two different families (tx )a , (sx )a , the relation
ˇ “*
Example 15 Here is a first musical application of a lexicographic ordering.
2 represents chords (of pitch names). The set 2P12
P12
We had seen that the set
is a cartesian product x∈P12 fx = 2P12 for the constant family fx = 2. With
the relation 0 < 1, the set 2 becomes well-ordered. The index set P12 can be
given a well-ordering by the linear ordering
p0 < p1 < p2 < p3 < p4 < p5 < p6 < p7 < p8 < p9 < p10 < p11 .
The corresponding well-ordering means that for two different chords (ci )i , (di )i ,
we have (ci )i < (di )i iff the smallest index j where they differ has cj < dj ,
i.e., cj = 0, dj = 1. For example, the minor triad {c, d, g} precedes the ma-
jor triad {c, e, g} since we have this relation (0, 1, 1, 0, 1, 1, 1, 0, 1, 1, 1, 1) <
(0, 1, 1, 1, 0, 1, 1, 0, 1, 1, 1, 1) for their representation as sequences of zeros or
ones.
Given this well-ordering among chords, one may then choose the first in
any equivalence class of chords and declare it the representative of that class.
Probably for the lack of deeper knowledge about lexicographic orderings,
American music theory has chosen a well-ordering that is called “most packed
to the left”, see [108]. It is defined as follows.
We take the well-ordering of the index set P12 that is opposed to the
above, i.e.,
p0 > p1 > p2 > p3 > p4 > p5 > p6 > p7 > p8 > p9 > p10 > p11 .
On the value set 2 the ordering 0 < 1 is conserved. Then, a chord (ci ) precedes
a chord (di ) if the first index j from the right where they differ has cj < dj .
This means that chords with longer sequences of zeros from the right precede
others. For example,
ˇ “*
Example 16 A practical example of lexicographic ordering is the ordering
among notes in a score. Suppose that notes of a score are given as elements of a
fourfold cartesian product O × P × L × D, where O, P, L, D are linearly ordered
sets of onset, pitch, loudness, and duration. Then the lexicographic ordering
68 5 Universal Properties
Numbers
6
Natural Numbers
Summary. Natural numbers are the first topic studied by all students in first
years of elementary school. Here, the classic definitions of ordinal and natural
numbers are entirely derived from set theory. The well-known five Peano axioms
that define natural numbers are now presented as a theorem.
–Σ–
Proposition 5 (i) There is a set N which consists exactly of all natural num-
bers, i.e., n ∈ N iff n is natural.
(ii) The relation n ∈ m between natural numbers defines a well-ordering on N.
Proof 2 To prove (i), take the set w that is guaranteed by ZFC axiom 7. Take
the subset N ⊂ w of the natural numbers in w. We claim that every natural
number n is in N. In fact, by definition of w, 0 ∈ w, and if a natural number
n is in w, then, again by the definition of w, n+ ∈ w. Therefore, by Peano’s
fifth axiom, all natural numbers must be in w, and we are done. Statement (ii)
follows directly from the general ordinal number Theorem 5.
√
Example 6 Let us denote the first natural numbers to get a small number
of standard symbols. We show a sequence of numbers where each subsequent
number in the list is the successor n+ of the preceding number n in the sense
of set theory: 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12.
ˇ “*
Example 17 Observe that in mathematics, physics, and computer science,
natural numbers start at 0, not at 1. Natural numbers are omnipresent in music,
too. Here are some examples.
• We count measures, usually starting at measure 1. But as the initial onset
should be 0, starting at measure 0 would not be a bad idea. However, if
the initial measure is incomplete, we number the first complete measure as
one, so the incomplete measure would be number zero.
• Within a given score, we count the beats, e.g., the number of eighth notes
(also called quavers) at a determined position in the score. Pitch is also
often given a natural number value, for example in the digital MIDI code1 ,
where middle C on a keyboard has number 60.
1
MIDI is the acronym for Musical Instrument Digital Interface, a standard code
for the exchange of digital performance data between computers and electronic
musical instruments. See http://www.midi.org and [78] for more information.
6.3 Finite Sets 75
• For a musical interval, one usually starts with the pitch of the lower note
and counts the number of semitone steps to reach the upper note. The
prime interval means to count 0, the minor second has 1, the major second
2, the minor third 3, the major third 4, the fourth 5, the tritone 6, the
fifth 7, the minor sixth 8, the major sixth 9, the minor seventh 10, and the
major seventh 11 pitch semitone steps.
• For strings, one counts the open string and the first or second position for
fingering instructions.
• The works of a composer are counted by natural numbers.
• The number of instruments in an orchestra is counted by natural numbers,
e.g., a quartet is an orchestra with four string instruments (first and second
violin, viola, violoncello).
ˇ “*
Example 18 Referring to Musical Example 3, our pitch name set P12 is
equipollent to the set 12 = {0, 1, 2, . . . 11} of natural numbers, and therefore
is finite. The same is valid for the onset time set O12 . A dodecaphonic series
∼ ∼
s : O12 → P12 is therefore represented as a bijection s : 12 → 12.
The first important application of this theory relates to permutations, i.e.,
∼
bijections F → F of finite sets F . We denote by Sn the set of permutations of
the natural number n. For example, we can look at a dodecaphonic series
0 1 2 3 4 5 6 7 8 9 10 11
s=
3 11 2 0 4 6 7 5 8 9 10 1
that sends every number in the top row to the number in the bottom row at
the same column, e.g., 3 → 0. This set has an additional structure that later
will turn out to be a central structure in group theory. But here we just use
what’s immediately feasible. To begin with, any two permutations π, ψ ∈ Sn
are functions that can be composed to a new permutation π ◦ ψ ∈ Sn . This
76 6 Natural Numbers
ˇ “*
Example 19 If X = P12 , a sequence t : 3 P12 defines a three-element
chord, called a triad in music theory. Of course, the order of pitches, e.g., t =
(c, g, e), isn’t important when choosing a set {c, e, g} for illustrative purposes.
But the order in the sequence is relevant if we care about the set’s members, for
example, if we want to conceptualize chord inversions. Starting at e in {c, e, g},
then taking g, then c, we get the first inversion of {c, e, g}, or starting at g,
then taking c, then e, defines the second inversion of {c, e, g}. This formalism
of sequences was introduced in 1981 by one of the authors (Mazzola) in his
university course on mathematical music theory and then in [72, Section I.2]
for the classification of general musical structures; it has recently been applied
under the catchword “orbifold” to topological considerations about voice leading
[110].
We now define an equivalence relation on X n . Let g, h ∈ X n . We set g ∼ h
iff there is π ∈ Sn such that g ◦ π = h. Check that it is an equivalence relation.
Denote by X n /Sn the set of equivalence classes.
∼
Proposition 8 The sets Xn = {Y |Y ⊂ X AND Y → n} and X n /Sn are
equipollent.
∼
Proof 3 If Y ∈ Xn , then we have a bijection g : n → Y that extends to
an injection in X n , and then its equivalence class, by composing it with the
∼
inclusion Y ⊂ X. If we take another bijection h : n → Y , then they differ by the
−1
permutation π = h ◦ g ∈ Sn , therefore they define the same equivalence class
[g] = [h]. Conversely, if [h] ∈ X n /Sn , then its image Y = Im(h) is in Xn , and
it is independent of the representative h since the difference of representatives
is only a permutation of n.
ˇ “*
Exercise 4 Given the sequence t = (c, e, g), find the permutation π ∈ S3
such that t ◦ π = (e, g, c), the first inversion of t.
7
Recursion
XN
|m
|n
-
|n
Xn m - Xm
r|n
X n −−−−→ Xn
⏐ ⏐
⏐ ⏐ |n
m
|n m
r|m
X m −−−−→ X m
We now want to show by induction on n that each of the restrictions r|n
has a unique fixpoint, i.e. F ix(r|n) ∈ X n with r|n(F ix(r|n)) = F ix(r|n). If this
is proved, the theorem follows immediately since then by the commutativity of
diagram (7), the fixpoint F ix(r|m) is the restriction of the fixpoint F ix(r|n).
This allows us to define F ixr as the unique function in X N that restricts to
F ix(r|n) for every n ∈ N.
For n = 0, X 0 is a singleton, so everything is clear. Suppose n = m+ ,
and the claim is proved for m. So we have a unique fixpoint Fm of r|m:
r|m(Fm ) = Fm . Take any element G ∈ X m+ with G|m = Fm . Take then
Qm+ := r|m+ (G). We claim that this element is the fixpoint for r|n. We have
Qm+ |m = r|m(G|m) = r|m(Fm ) = Fm = G|m. Therefore by the above lemma,
r|m+ (Qm+ ) = r|m+ (G) = Qm+ , and we are done.
ˇ “*
Example 20 Fibonacci numbers (invented by Italian mathematician Leo-
nardo da Pisa, known as Fibonacci) are a classic example of a recursive con-
struction.
We take X = N and define
This produces the fixpoint F ixrF ibonacci = (1, 1, 2, 3, 5, 8, 13, 21, . . .). Fibonacci
numbers have been used by many composers. For example, Béla Bartók has
implemented Fibonacci numbers in many compositions [59], e.g., in the number
of measures of his composition Allegro Barbaro. The ostinato F -minor chords
occur in groups of exactly 3, 5, 8, and 13 measures [59]. Also, the Chinese
composer Mingzhu Song has created a composition The Scene of Sichuan Opera
in which the note-group lengths are defined from Fibonacci numbers [105], see
Figure 7.1.
One should add that Fibonacci numbers are important also because
of the fractions F ixrF ibonacci ,i+1 /F ixrF ibonacci ,i which tend to a real number
1.6180339887 . . . that is known as the golden ratio in the arts. We come back
to this number when discussing real numbers in Chapter 12.
ˇ “*
Example 21 This musical example an illustration of the recursive method,
but it also adds a general new method of musical composition. It was discovered
while one of the authors (Mazzola) presented recursion in his course about
mathematics for music theorists. A very talented student, Ben Klein, asked
whether one could make sounds using recursion. And yes, it works. Here is the
result.
80 7 Recursion
Fig. 7.1. In the composition The Scene of Sichuan Opera, Mingzhu Song con-
structs note-group lengths according to Fibonacci numbers. The sound example is
Yan_Fibonacci.
of a’s, in our case F ixr13 = (0, 3, 6, 9, . . .), a sequence of minor thirds. The
second recursion function r2 is the composition r14 ◦ r13 of r13 and r14 . Therefore
it converges to its fixpoint with double speed—we only need two steps to and
from its fixpoint. The fixpoint here is a sequence (0, 4, 7, 11, 14, . . .) of major and
minor thirds. If we had taken the composition r13 ◦ r14 , we would have obtained
a sequence (0, 3, 7, 10, 14, . . .) of minor and major thirds. Figure 7.3 illustrates
this compositional construction.
Fig. 7.3. A composition (One for Ben) generated according to the recursive method
described above. The sound example is oneforben.
8
Natural Arithmetic
(s0 , s1 , . . . sn , sn+1 , . . .)
0 n+1
is a sequence of natural numbers, we define s = s0 and i=0 si =
n 0i=0 i n+1
( i=0 si ) + sn+1 , similarly for the product: i=0 si = s0 and i=0 si =
n
( i=0 si ) · sn+1 .
a = q · b + r.
This means that one can now define uniquely that remainder r < b after
division by b.
ˇ “*
Example 23 When dealing with pitch in semitone units, as typically real-
ized by the keys of a piano, and which is quantified by natural numbers, one
considers octaves between pitches, which means that one adds multiples of 12
semitones. In music theory, one often doesn’t deal with pitch, but with pitch up
to octaves. This corresponds to the situation in the Division Theorem, where
b = 12. For a given pitch a one then wants to know the non-negative remain-
der after adding up octaves. This means that one has to solve the equation
a = q · 12 + r in the Division Theorem. The remainder r is the called the pitch
class (number) of pitch a. For example, if a = 27, one gets 27 = 2 · 12 + 3,
yielding pitch class 3. The Division Theorem guarantees that this class number
is uniquely defined. The ability to define pitch classes is a consequence of this
theorem. Our previous usage of a 12-element pitch-class set P12 is based on
this technique.
Theorem 9 Let a, b ∈ N with b > 1 and a = 0. Then there are unique natural
numbers c, s, r with 0 < s < b and r < bc such that
a = s · bc + r.
The proof is similar to the preceding one, so we omit it. The next theorem
is the well-known representation of natural numbers using a “basis” number,
usually known for decimal numbers, i.e., b = 10. Let us first state the theorem:
Proof 6 The proof runs as follows. Suppose there are only finitely many prime
∼
numbers. Then we may enumerate their set P by p : n → P ⊂ N. We consider
the number N = i<n p(i) + 1. This one must be a product of prime numbers,
and let p(i0 ) be one of them. Then we have the representation N = q·p(i0 )+1 =
q · p(i0 ) = q · p(i0 ) + 0. We have 0, 1 < p(i0 ). But this is impossible, because
the remainder r in the division theorem cannot be 0 and 1 at the same time,
and we are done.
10
Integers
ˇ “* Example 24 Musical intervals are often not natural numbers. For example,
one may want to know the interval between pitch 9 and pitch 7. This leads to
the equation 9 + x = 7. One may also want to know the interval between pitch
11 and pitch 9. According to the above equivalence relation we calculate the
cross sums 9 + 9 and 7 + 11. Both add to 18, which means that the equations
have one and the same non-natural solution x. Integers are invented to deal
with exactly this new type of solution. We shall see in a moment, that this
solution is denoted by −2, a “negative” interval number.
Let us see how we may represent integers by pairs (a, b) of natural num-
bers. If [a, b] ∈ Z with b ≤ a, then we have a solution x of b + x = a, and we
have [a, b] = [x, 0], and no other [x , 0] does the job. If a ≤ b, then we have a
solution x of a + x = b, and we have [a, b] = [0, x], and no other [0, x ] does the
job. Further, [x, 0] = [0, y] iff x = y = 0. Therefore the integers [a, b] ∈ Z with
b ≤ a are represented by natural numbers x, via [x, 0]. This defines injection
N Z : a → [a, 0]. We write −b for the integer [0, b], b = 0, and call such an
integer a negative integer, while the images of N are called natural numbers
or positive or (when including the zero number) non-negative integers. For a
negative integer z = −b, the positive integer b is called its absolute value, which
is denoted by |z|. For a non-negative integer z the absolute value is that same
number |z| = z. For any integer z = [a, b], we more generally define its negative
by −z = [b, a].
Fig. 10.1. The set of integers Z extends the set of natural numbers N and can be
represented on a line that extends infinitely to the left and to the right.
Definition 27 Given two integers [a, b], [c, d], their product is defined by [a, b] ·
[c, d] = [ac + bd, ad + bc].
Check again that this function is well defined.
Sorite 5 Let a, b, c be three integers. We have these rules for their multiplica-
tion.
(i) (Associativity) (a · b) · c = a · (b · c) = a · b · c.
(ii) (Commutativity) a · b = b · a.
(iii) (Multiplicative neutral element) The element 1 = [1, 0] is neutral for mul-
tiplication, a · 1 = a.
(iv) (Zero and negative multiplication) a · 0 = 0, a · (−b) = −(a · b).
(v) (Distributivity) a · (b + c) = a · b + a · c.
(vi) (Integrity) If a, b = 0, then a · b = 0.
(vii) (Additive monotony) If a < b, then a + c < b + c.
(viii) (Multiplicative monotony) If a < b and 0 < c, then a · c < b · c.
(ix) (Extension of natural arithmetic) For two natural numbers a, b, we have
[a · b, 0] = [a, 0] · [b, 0]. This allows complete identification of naturals as a
subdomain of the integers, if we look at addition and multiplication.
(x) (Triangular inequality) We have |a + b| ≤ |a| + |b|.
The concept of a prime number is extended to integers by the definition
that a ∈ Z is prime iff |a| is so in the set of natural numbers.
92 10 Integers
a·x = b
c·x = d
c·a·x = c·b
a·c·x = a·d
Q = Z × Z∗ / ∼
where Z∗ = Z − {0}. We denote the equivalence class [b, a] by b/a or ab and call
b the numerator and a the denominator of the fraction ab . Observe that for any
s = 0, we have ab = s·a
s·b
.
The integers can be embedded in the rationals by the injection
a
Z Q : a → .
1
iff b < d.
ˇ “*
Example 26 In music, rationals are very important. Let us look at some
basic examples of the use of rationals in the musical domain. For the classical
score notation, the horizontal axis represents onset time and durations of notes.
This musical time is not the physical dimension, but it is a symbolic time. It
is only interpreted in physical units when one adds rules for the shaping of
tempo. We come back to the topic of tempo in Chapter 32. For the time being
we only want to look at the symbolic time that is denoted on the score. In this
environment, onset time is divided into equal portions, called measures. The
duration of such measures is indicated at their beginning with a time signature.
Typical time signatures are shown in Figure 11.1.
Time signatures look like rational numbers: They have a numerator and
a denominator. For example, the left time signature in Figure 11.1 resembles
the rational 44 . The middle time signature resembles rational 34 , while the right
signature resembles rational 68 . However, these symbols are not rationals but
are representatives for rationals.
We know that 34 = 68 mathematically, but their musical meaning is more
than this. See again Figure 11.1. You can see that the first measure consists
of two half notes, while the second one consists of two quarter notes and two
quarter rests. Each of these durations add up to 44 , as shown by the small
cross symbols denoting four “beats” in each measure. Look at the other two
time signatures in Figure 11.1. The denominators 4 and 8 designate the "beat"
durations in their respective measures. Notice the pattern of cross symbols
(eighths) above the measures. While the 34 signature is divided into three groups
6
of two cross symbols, the signature 8
shows a division into two three-cross
3
groups. Simple meters such as divide their beats into two equal parts as
4
shown in Figure 11.1, left and middle examples. In compound meters, as shown
in Figure 11.1, right example, the dotted quarter beat is not divided in two, but
into three notes. The time signature 68 indicates a total number of six quavers.
There are two dotted quarter beats, each one comprising three quavers, for a
11 Rationals 95
total of six. Figure 11.2 shows three examples of 34 as compared with 68 measures
in the literature.
Fig. 11.2. Examples of 6/8 against 3/4 measures: (a) Mi votu e mi rivotu, Sicilian
traditional song, (b) America, from West Side Story, by Leonard Bernstein (in the
original score, the time signature was a mixed one, showing both signatures at the
beginning of this two-measure unit), (c) An der schönen blauen Donau, by Johann
Strauss. The sound example is 3_4.
Fig. 11.3. Complex time signature construction from Stravinsky’s The Rite of Spring.
The sound example is Stravinsky.
defining tuplets in the score notation. Figure 11.4 shows a 4/4 measure that is
4 4
divided into 7 equal durations, each of which is 4·7 = 28 . Musical score notation
uses note durations from the standard repertory of half, quarter, eighth notes,
etc. (their durations are powers of 12 ), but indicates by the tuplet number (7
here) how their duration should be interpreted.
Fig. 11.4. The measure in time signature 4/4 is divided into a septuplet (seven
equal durations). The second measure is not complete. The sound example is
rhythm_Rationals.
Definition 28 Let b d
a, c ∈ Q. Then their sum is defined by
b d bc + ad
+ = ,
a c ac
while their product is defined by
bd bd
= .
ac ac
√
Exercise 13 Verify that these operations are well defined (independent
of the representatives) and that this arithmetic extends the arithmetic of the
integers under the above injection Z Q.
Let us denote this by the formula (Z, +, ·) (Q, +, ·), which will be
explained in detail in Chapter 24. We define the absolute value of a rational
number ab by
b
= |b|
a |a|
11.1 Arithmetic of Rationals 97
Fig. 11.5. Score example for complex time structures from Brian Ferneyhough’s
Third String Quartet.
a c e
Sorite 6 Let b , d, f be rational numbers. Then these rules hold.
(Additive associativity) ( ab + dc ) + fe = ab + ( dc + fe ) = ab + dc + fe .
(i)
(Additive commutativity) ab + dc = dc + ab .
(ii)
(Additive neutral element) ab + 01 = ab .
(iii)
(iv)(Additive inverse element) ab + −a b = 1.
0
(v)(Multiplicative associativity) ( b · d ) · f = ab · ( dc · fe ) = ab · dc · fe .
a c e
(vi)(Multiplicative commutativity) ab · dc = dc · ab .
(vii)(Multiplicative neutral element) ab · 11 = ab .
(viii)(Multiplicative inverse element) If b = 0, then ab · ab = 11 .
(ix)(Distributivity) ab · ( dc + fe ) = ab · dc + ab · fe .
(x)(Linear ordering) The relation < among rational numbers is a linear or-
dering. Its restriction to the integers a1 induces the given linear ordering
among integers.
(xi) (Additive monotony) If ab < dc , then ab + fe < dc + fe .
98 11 Rationals
Summary. We have used the philosophy of the problem being the solution to
construct integer and rational numbers when dealing with equations of type
a + x = b or a · x = b. But there are many other equations, especially dealing
with approximations in music theory, that cannot be solved with Z or Q. In
this chapter we apply the above philosophy to find solutions of such problems,
namely the real numbers.
–Σ–
The geometric problem of finding the length l of the diagonal of a square
with side length 1 leads us to the Pythagorean equation l2 = 12 + 12 = 2.
Suppose that l = pq ∈ Q, and suppose that p, q have no common prime factor.
2
Then we have l2 = pq2 = 2, hence the equation 2q 2 = p2 of integers. But (this is
the theorem about uniqueness of prime factorization, to be proved in Chapter
25) the factor 2 on the left side implies that p = 2p . But then 2q 2 = p2 = 4p2 ,
so q 2 = 2p2 . Therefore, for the same reason, q = 2q , which contradicts the
absence of common prime factors of p, q. Hence the diagonal equation has no
rational solution. A fortiori the equation s12 = 2 has no rational solution.
Otherwise s6 would solve the diagonal equation.
This latter equation is crucial in music theory: s is the frequency ratio
between successive semitone steps of the 12-tempered octave tuning. In fact, if
the frequency ratio from a pitch x to pitch y is r, and the ratio from y to z is
w, then the frequency ratio from x to z is r · w. Therefore, if all semitone steps
have equal frequency ratio s, the octave, having frequency ratio 2, and being
built from 12 equal semitone steps, must have frequency ratio 2 = s12 . Refer
to our Chapter 2 on the history of mathematics in music, where the invention
of 12-tempered octave tuning by Zaiyu Zhu is described in Section 2.4.
The situation is not hopeless, however, since despite the non-existence of
solutions in Q, we can still approximate solutions by rational numbers. Here
is the procedure for the diagonal equation l2 = 2. Take the largest integer
s0 = 1 such that s20 < 2, and then the smallest integer S0 = 2 such that
ˇ “*
Example 28 The above approximation of a solution of l2 = 2 should be
interpreted in musical terms. It is an open question whether one can invent
a practical musical realization, if possible with lengths of strings, of this ap-
proximation. We have the following calculations: Given two frequencies x, y.
These correspond to string lengths lx = 1/x, ly = 1/y. The middle frequency
m = x+y2 corresponds to the length
lx ly
lm =
(lx + ly )/2
which is not really a simple construction out of lx , ly in terms of mechanical
manipulations, except for the denominator.
ˇ “*
Example 29 Another example of real numbers in music is given by glis-
sando. A glissando is a very fast performance of the notes between a starting
and an ending point. n the score, usually only the first and last notes are in-
dicated, together with a connecting line. While a glissando on piano implies
discrete frequencies, a glissando on a violin, for example, is “continuous” be-
cause on strings continuous frequency values can be performed. The movement
of a glissando is thought to glide through all real numbers between initial and
final pitch. The idea of a continuity is delicate, however, since the rational num-
bers are also “dense”—between any two rational numbers there is an infinity of
rational numbers1 . We come back to these now mysterious concepts of density
and continuity when discussing questions of tuning systems in Chapter 28 and
continuity in Chapter 30.
We now see that we have identified the problematic objects. They are
sequences (xi )I∈N of rational numbers that are in some sense limited. The
philosophy now applies: We shall use such sequences to construct the set R of
real numbers.
a natural number N such that whenever m, n > N , we have |an − am | < 1/L.
The set of Cauchy sequences is denoted by C.
√
Example 7 Constant sequences of rational numbers are Cauchy. The se-
1
quence ( i+1 )i is Cauchy. If (ai )i is Cauchy, then so is (a+ai )i for every rational
number a.
of a recursive function and were defined in Musical Example 20. The frac-
tions F (i) = F ixrF ibonacci ,i+1 /F ixrF ibonacci ,i of rational numbers are a Cauchy
sequence. They are used to define a real number g = 1.6180339887 . . . (real
numbers are to be defined below), known as √the golden ratio. It can also be
represented as an explicit real number g = 1+2 5 (the existence of nth roots is
discussed in Chapter 13). This number is the solution of the geometric problem
of constructing a rectangle with sides a, a + b such that
a a+b
= ,
b a
see Figure 12.1. This means that the ratio of the longer side a + b to the shorter
a is the same as the ratio of the shorter a to the remainder b = (a + b) − a.
The ratio turns out to be g. This problem of ratios was first discussed by the
mathematician and cosmologist Johannes Kepler in 1597. Apart from being an
artistic principle of constructing aesthetically pleasing ratios, the golden ratio
is present in many biological and physical contexts. The golden ratio has also
been used in compositions of Karlheinz Stockhausen and Gérard Grisey. See
also [64] for more information about Fibonacci numbers and the golden ratio.
Fig. 12.1. The golden ratio is the solution of the geometric problem of constructing
a rectangle with sides a, a + b such that ab = a+b
a
.
102 12 Real Numbers
Definition 32 The set of real numbers is defined using the above equivalence
relation R:
R = C/R.
And here is a more concrete description of the equivalence classes that
define real numbers:
which means that we use the arithmetic on Cauchy sequences and simply
“project” it to the O-cosets. This is a standard procedure in algebra, which
we shall study in Chapter 24.
√
Exercise 14 Give a proof of fact that these arithmetic operations are well
defined.
|a + b| ≤ |a| + |b|.
Proof 8 The proof uses Corollary 2. Consider the set A = {q|q ∈ R AND q n <
a}. It is limited from above since we can show that (a + 1)n > a. For the
supremum sup(A), which exists by Corollary 2, it is easily seen that sup(A)n =
a.
√ √
n √ √
Exercise 16 Show that for real numbers a, b ≥ 0, we have ab = n
a n b.
ˇ “* Example 31 The existence of nth roots is the basis of many tuning systems
in music. We have already discussed the case of the 12-tempered tuning, where
the √ octave frequency ratio 2 is divided into 12 equal frequency ratios of size
12
2 ≈ 1.059463094359295 . . . For microtonal tunings—for example, quarter-
√
tone or 24-tempered tuning, one needs the quarter-tone frequency ratio 24 2 ≈
1.029302236643492 . . . The quarter-tone piano of Czech composer and theorist
Alois Hába (Figure 13.2), the archicembalo (Figures 13.3, 13.4) or the Chinese
erhu (Figure 13.1) string instrument have realized such microtonal tunings.
13.1 Roots, and Logarithms 109
Hába has also written interesting string quartets for quarter-tone, fifth-tone,
and sixth-tone temperaments. Many composers from different cultures have
written and played compositions for microtonal tunings.
This set is bounded from above. Its supremum is defined to be the logarithm
of x for basis a, denoted by loga (x), see Figure 13.5. It is the supremum of all
ˇ “*
Example 32 The selection of admissible musical pitches (also known as
tunings) is a major topic in the construction of musical instruments and in
music theory. For string instruments, any conceivable pitch can be played within
110 13 Roots, Logarithms, and Normal Forms
the range of the instrument, but for keyboards, only a discrete subset of pitches
is available (see also Figure 30.1). In music theory, the totality of possible
pitches is not conceived.
But what is pitch? In physics, sounds with a determined pitch are gener-
ated by a variation of the air pressure p(t) (in pascals, where one pascal (Pa)
is the force of one Newton per square meter N/m2 ) as a function of time t (in
seconds (sec), say) that shows periodicity, i.e., it repeats its shape after a time
period P . Figure 13.6 shows a sinusoidal function of pressure variation around
the average air pressure 101325 P a. The frequency of a pressure function is
defined to be f = 1/P if P is the time period, and the frequency unit is Hertz,
Hz = 1/sec. For example, the chamber a in music is frequently (but not always,
some regions have slightly different standards) associated with 440 Hz.
However, humans don’t perceive frequency as such. It is the logarithm
P itch(f ) = log(f ) that our brain perceives as pitch1 . This law is called the
Weber-Fechner law. For example, if we are given a pitch P itch(fc ) = log(fc ), say
of middle c on a piano, then the octave c of this pitch has the double frequency
2fc . This translates to the logarithmic equation P itch(2fc ) = log(2) + log(fc ).
In other words, going up one octave means adding the constant log(2) to the
given pitch. This is the reason why the distance between keys an octave apart
1
In psychoacoustics, the pitch number is defined by a slightly different formula,
namely P itch(f ) = log1200(2) log10 (f ) + v. The factor log1200(2) is chosen such that the
10 10
octave is divided into 1200 units. In fact, log1200(2) log10 (2) = 1200. The pitch unit
10
that is defined by this formula is called Cent (Ct). So the octave is divided into 12
times 100 Cents, which means that each semitone is divided into 100 Cents—hence
the name “Cent”.
13.2 Adic Representations 111
f = f0 · 2o 3q 5t , o, q, t ∈ Q
Proof 9 The proof essentially consists of an estimation of the speed with which
the sum converges. The point can be made for b = 2 andassuming that all
coefficients are ai = 1. Then we have to consider the sums i=0,1,...j 2−j . But
this is the so-called classical geometric series: sj (x) = i=0,1,...j xj for x = 1/2.
j+1
One easily verifies that sj (x)x − xj+1 + 1 = sj (x), so sj (x) = 1−x
1−x . Taking
x = 1/2, we get sj (1/2) = 2(1 − (1/2)j+1 ), and this clearly converges to 2.
The number zero is denoted by 0 or 0.0. If aj = 0 eventually, we also write
an . . . a0 .a−1 . . . a−m
and, if the number is negative, we prepend a − and write
−an an−1 . . . a0 .a−1 . . .
Theorem 14 Every real number can be represented in this b-adic form for
any given basis b > 1. The rational numbers are precisely of the following type:
There is a sequence am , am−1 , . . . am−k such that their b-adic representation is
as follows:
an . . . a0 .a−1 . . . am+1 am am−1 . . . am−k
which means that the overlined sequence acts as a period that is repeated ad
infinitum to the right.
For example, if we have the period 2, 3 for the decimal representation, we
get for example such a number
1.23232323 . . .
This adic representation is nearly unique, but for every basis b, there is a
situation where some rational numbers have two representations. This happens
each time where we have a period b − 1 of length one, for example in the
decimal representation 0.1239. This number is the same real number as the
number 0.124.
Let us see why, and look at the simplest example of a decimal represen-
tation. Take x = 0.999999 . . . with period 9 of length one. This number x is
in fact equal to 1.0. Let us see their differences when we consider the defining
Cauchy sequences. For 1.0, its sequence is constant (1)i . For 0.999999 . . . we
have the sequence 0.9, 0.99, 0.999, . . . The differences of the members of these
sequences are 0.1, 0.01, 0.001, . . . which is a zero sequence, hence our claim.
14
Complex Numbers
Summary. Square roots of negative real numbers are not defined yet. We
introduce complex numbers to solve this problem. Essentially, we introduce an
imaginary number i, the square root of −1, and thereby add a new dimension
to the real numbers.
–Σ–
We can now solve equations of type ax + b = c for all a = 0, and we can
find points of convergence of all Cauchy sequences in R, but we cannot solve
all equations yet. It can be shown that one can also solve any equation of type
x3 + bx2 + cx + d = 0, but equations of type x2 + bx + c = 0 cannot be solved
in general. For example, the simple equation x2 + 1 = 0 has no solution in R
since x2 ≥ 0 for all real number x.
The problem now is types of equations with higher powers of the unknown
x. We shall see later in Chapter 24 that in fact, here again, the problem is
the solution. But this requires more structures than we have yet. Therefore,
we present a solution with less theory. The method we use now goes back to
the German mathematician Carl Friedrich Gauss (1777-1855). He invented the
valid theory of complex numbers. Mathematicians had worked with solutions
of equations such as x2 + 2 = 0, but nobody figured out how to conceive such
strange numbers that would solve those equations. This is one reason they are
called “complex numbers”. Here is Gauss’ construction:
The set of complex numbers C is identical to the cartesian product R × R.
A complex number is a pair z = (x, y) of real numbers, where x is called the
real part x = Re(z) and y is called the imaginary part y = Im(z) of z. The
interesting new structure on C is its arithmetic, the addition and multiplication
of complex numbers. Let z = (x, y), w = (u, v) be two complex numbers. We
set
z + w = (x, y) + (u, v) = (x + u, y + v),
z · w = (x, y) · (u, v) = (xu − yv, xv + yu).
Here is the sorite for this arithmetic structure:
Sorite 8 Let x, y, z be complex numbers, and denote 0 = (0, 0), 1 = (1, 0).
Then
(i) (Additive associativity) We have (x + y) + z = x + (y + z) and denote this
number by x + y + z.
(ii) (Multiplicative associativity) We have (x · y) · z = x · (y · z) and denote this
number by x · y · z, or xyz, if no confusion is likely.
(iii) (Commutativity) We have x + y = y + x and x · y = y · x.
(iv) (Distributivity) We have x · (y + z) = x · y + x · z.
(v) (Additive and multiplicative neutral elements) We have 0 + x = x and
1 · x = x.
(vi) (Solution of equations) If a = 0, then every equation a · x = b has a unique
solution; in particular, the solution of a · x = 1, the multiplicative inverse
of a, is denoted by a−1 . The solution of a + x = 0, the additive inverse (or
negative) of a, is denoted by −a. We shall see below how to calculate the
inverse explicitly.
Fig. 14.1. The complex numbers are points in the Gauss plane R2 , together with
arithmetic operations. Every complex number is determined by its real and imaginary
components.
x = (Re(x), Im(x)).
We then have this crucial result, which justifies the geometric point of view:
x = Re(x) + i · Im(x),
The complex numbers have a rich inner structure that is related to con-
jugation.
Fig. 14.2. The five dimensional space-time with complex time. The “physical” space
R3 × R with the real axis R for physical time, and the “mental” space R3 × iR with
the “imaginary” time axis iR.
positioned in the mental component, and its physical performance lives in the
physical component. This implies that our mental activity while thinking about
the score or creating it as a set of symbols occurs in imaginary time, while per-
formance has to switch time to its real component. In our model [68], we have
developed a theory of transition from imaginary to real time, using ideas from
physical string theory. In this model, not only are there imaginary and physical
14 Complex Numbers 117
states, but we also designed an entire family of intermediate states that share
real and imaginary time. We shall come back to this model in Chapter 33 where
gestural aspects of performance are discussed.
ˇ “*
Example 34 In sound technology, complex numbers are indispensable.
When describing sounds that have a determined frequency, there is a classical
mathematical theory that meets the needs for a complete description, namely
the formalism discovered by Joseph Fourier around 1800. His theory allows for
a decomposition of a sound function as a sum of sinusoidal functions, called
partials or overtones. To perform calculations with Fourier’s theory, one works
with complex numbers. Fourier’s theory has also been applied to create fast
algorithms for the calculation of numerical data associated with partials. The
most famous is called Fast Fourier Transform (FFT). It is the basis of the
global Internet-based sound technology. Fast transmission of sound data would
be impossible without FFT.
Part IV
Summary. Up to now, we have been able to construct all basic number do-
mains N, Z, Q, R, C. But we have not considered geometric objects. This chap-
ter begins to fill that gap. It introduces the most elementary geometric objects:
graphs—systems of points and arrows connected by directed or undirected lines.
We shall conclude part IV with the introduction of higher-dimension graphical
objects that relate to coverings of sets by a system of subsets.
–Σ–
In music theory, the systematic use of graphs was introduced by one of
the authors (Guerino Mazzola) in 1980, see [72], who used category theory
where arrows are the natural language. His idea of replacing sets of notes with
directed graphs was motivated by the need for a method to define chords in
12-tempered tuning without reference to overtone arguments, which had never
been a good logic for the justification of harmonic arguments. Let us give a
single example of the graph-theoretical method for the construction of chords.
Consider the major triad c, e, g that we place in the set P12 = {0, 1, 2, 3 . . . 11}
of pitch classes. We apply the function F = T 7 3 : P12 → P12 : x → 3x + 7 to
Fig. 15.1. The major triad as a kind of “orbit” of c under one single function T 7 3.
Fig. 15.2. In network theory, as shown from this original graphic, the triad {f, c, g} is
interpreted using transposition or inversion relations (I5 and I7) among its elements.
ˇ “* Example 35 There are many musical situations where arrows that connect
musical objects are adequate. The nature of such arrows can vary considerably,
and we shall see some general examples when we discuss category theory in
Chapter 29. A simple illustration of the graphical approach to music is shown
in the following example. If we consider the black key f = g on a keyboard,
we may view it as either a sharpened version f of the white key f , or as
the flattened version g of the white g key. It is not an independent key; it is
thought of as a key that results from two possible movements. The graphical
representation f - f = g g represents this idea.
√
Example 8 Here is a digraph with one point t and one arrow T , a loop,
the so-called final digraph 1 = t b T . The following digraph has two vertices
u
and two parallel arrows u, v that connect them • %9 • . The following digraph
v
15.1 Directed Graphs 123
For every natural number n, we have the chain digraph [n]. It has V = n + 1 =
{0, 1, 2 . . . n} as vertex set and the set A = {(i, i + 1)|i = 0, 1, . . . n} as arrow
set:
(0,1) (1,2) (n−1,n)
[n] = 0 - 1 - 2 . . . n − 1 - n.
For n = 0 we have the trivial digraph with one vertex 0 and no arrow. The
number n is called the length of [n].
Similar to sets and functions, digraphs must also be related to each other
by “digraph functions.” Here is their definition:
ˇ “*
Example 36 Melodies within compositions are typical examples of paths
in music. Consider a composition K as shown at the top of Figure 15.3. Take
the digraph Γ (K) defined to have the notes of K as the vertex set and the
subset A ⊂ K 2 of all pairs of notes (n, m) such that Onset(n) < Onset(m).
124 15 Directed and Undirected Graphs
Fig. 15.3. Three melodic paths in the digraph Γ (K) of composition K, Song of Yi
II—A Se by Mingzhu Song. The sound example is Yi.
This is sketched with some of the arrows in the middle of Figure 15.3. Now,
we look at paths [n] → Γ (K). These are by construction connected sequences
of notes that follow each other by increasing onset times. This is what could
be defined as melodies in K. We have selected three such melodies, f1 , f2 , f3 ,
which are defined on the chain digraphs [7], [13], [6], respectively (Figure 15.3).
15.3 Cycles
We have seen that we can compose paths or walks. Now we can address cycles,
special paths or walks that terminate on the same vertex whence they started.
Cycles of length one are called loops. Graphs are said to be connected iff any two
vertices can be the extremal values of a walk. Digraphs Γ are called connected
iff their undirected images |Γ | are so. Two important types of cycles are Euler
and Hamilton cycles.
ˇ “*
Example 37 The harmonic sequence I − IV − V − V I − II − V − I can be
seen as a Euler cycle as shown in Figure 15.4.
ˇ “*
Example 38 Dodecaphonic series can be interpreted as Hamilton cycles.
We work in the complete graph K12 that has 12 vertices of P12 and all possible
unordered pairs {x, y}, x = y as edges, see the left graph in Figure 15.5. In this
representation, the set P12 is shown as a circular arrangement of twelve points
similar to the twelve hours on a clock. Later, in Section 19.2, when dealing with
group theory, we shall understand why this is a good representation. In section
∼
6.3, a series was interpreted as a permutation s : P12 → P12 . We now interpret
this as a walk that starts at s(0), goes to s(1), etc., and when arrived at s(11)
closes to s(0). This defines a Hamilton cycle. This one for Webern’s op. 17.2
(shown in Figure 4.8) is drawn to the right in Figure 15.5.
Connected graphs without cycles are called trees. A spanning tree of a
graph is a subgraph that is a tree and contains all vertices. Every graph has a
spanning tree.
15.3 Cycles 127
Fig. 15.5. The complete graph K12 of P12 (left) and a Hamilton cycle (right) asso-
ciated with Webern’s series of op. 17.2.
16
Nerves
Definition 43 Given
a set X, a covering of X is a set C ⊂ 2X of non-empty
subsets such that C = X. The nerve of C is the subset N (C) ⊂ 2C consisting
of all finite non-empty subsets s ⊂ C such that s = ∅. If card(s) = n + 1,
we say that s is an n-simplex.The subset of n-simplices of N (C) is denoted
by Nn (C), therefore N (C) = n≥0 Nn (C). In particular, N0 (C) = C if we
identify 0-simplices with their single elements. If s ∈ N (C), then any non-
empty subset t ⊂ s is in Nn (C). It is called a face of s.
ˇ “*
Example 39 A classical example of a nerve in music is given by the covering
C (3) ⊂ 2C of the diatonic scale X, here X = C-major = {c, d, e, f, g, a, b}, by
the seven standard triadic degrees I, II, . . . V II, as shown in Figure 16.1. The
seven note names of C-major are shown as full points. This nerve is called the
harmonic band of the given scale [72].
Fig. 16.1. The covering of the C-major scale by the seven standard degree chords.
The nerve N (C (3) ) has the seven degrees as 0-simplices, and the 14 lines
for all pairs of degrees that intersect, e.g., I ∩ III = ∅, as 1-simplices. It has
7 filled triangles for triples of degrees that intersect, e.g., I ∩ III ∩ V = ∅, as
2-simplices. The overall geometry of this nerve is shown in Figure 16.2.
The geometry of the harmonic band is the reason for the failure of Hugo
Riemann’s function theory program. Refer to Musical Example 9, where we
have explained Riemann’s ideas. Recall that Riemann wanted to define tonality
functions
X − T onality : Ch → T DS,
starting from three values X − T onality(IX ) = T, X − T onality(VX ) =
D, X − T onality(IVX ) = S. He then imposed conditions of function values
for the remaining degrees IIX , IIIX , V IX , V IIX . These conditions were in fact
geometric if one works with the harmonic band. Riemann’s first condition was
that successive degrees on the band’s boundary, i.e., I → V → II → V I →
III → V II → IV → I, should have different function values, this succession
16 Nerves 131
Fig. 16.2. The nerve N (C (3) ) of a diatonic scale is a Moebius band. Its lack of
orientation is a reason for problems in Riemann harmony.
D = X − T onality(V ) = X − T onality(πV ) =
X − T onality(III) = X − T onality(πIII) = X − T onality(I) = T,
which contradicts the first condition of different values for successive degrees
on the band’s boundary. This contradiction is due to the lack of orientation on
a Moebius band. If you stand on one side of the band and walk on the band’s
surface, you end up standing upside down, see Figure 16.3.
√
Example 12 For X = n + 1 = {0, 1, . . . n}, the covering C1 (n) = {{i, i +
1}|i = 0, . . . n − 1} defines a nerve N (C1 (n)), where N0 (C1 (n)) is the edge
∼
set of the chain graph |n|, and N1 (C1 (n)) → {{1}, {2}, . . . {n − 1}} the vertex
singleton set of |n|, except the first and the last vertices 0, n. The bijection is
defined by taking the intersection of successive members {i, i+1}∩{i+1, i+2} =
{i + 1}.
132 16 Nerves
Fig. 16.3. The harmonic band has no orientation: If you walk around you end up
upside down.
More generally, one may take the covering Ck (n) of n + 1 that has the
charts i, k := {i, i + 1, i + 2, . . . i + k} for i = 0, . . . n − k. We now have not
only 1-simplices as above, but also Nj (C(n)) = ∅ up to j = k, as {i, i + 1, . . . i +
k} ∩ {i + 1, i + 2, . . . i + 1 + k} ∩ . . . {i + k, i + 1 + k + 1, . . . i + 2k} = {i + k},
see Figure 16.4 for the nerve N (C3 (16)).
Fig. 16.4. The nerve of the covering C3 (16) is visualized; it is a chain of tetrahedra
(3-simplices) that are connected on one side with each other.
ˇ “*
Example 40 Compare the nerve C3 (16) to the nerve of the harmonic band.
Can you see similarities? Can you guess how a harmonic band could be defined
so it would resemble the nerve C3 (n)? Try first try to think about a harmonic
band that resembles C1 (n).
Similar to morphisms between graphs, we have morphisms between nerves:
In this section, we want to show how the geometry of nerves of coverings can be
used to compose motivic structures in a sonata. Our example is the sonata Al-
legro movement op. 3 by one of the authors (Mazzola), published and recorded
under the title L’essence du bleu [76]. We only describe the motivic construc-
tion of a specific part of the composition and don’t discuss the harmonic and
rhythmic aspects.
This sonata was composed in the spirit of Pierre Boulez’s “creative analy-
sis” [20], which means that the composition was analyzed and this analysis was
used to create a new composition by changing some analytical parameters. In
our analysis of Beethoven’s “Hammerklavier” Sonata, op. 106, we exhibited the
central role of the diminished seventh chord C−7 = {c, e, g, b}, see Figure
16.5. Apart from determining the sonata’s harmonic modulations (changes of
tonalities, see Chapter 23 for details), this chord also determines Beethoven’s
motivic work. As the chord C−7 is reproduced after transposition T 3 of its
pitch classes, it is reasonable to consider an elementary motivic “zigzag” of pe-
riod 3. This is shown in Figure 16.5 to the left, where a chromatic ascending and
descending melodic movement is visible. In op. 106, this zigzag is a dominating
motivic structure [72].
In our creative part of the analysis, we replaced the minor third transposi-
tion T 3 by a major third transposition T 4 , since 12 = 3 · 4 is the decomposition
134 16 Nerves
Fig. 16.5. The motivic zigzags in Beethoven’s op. 106 and Mazzola’s op. 3.
of 12 into different prime number powers. The corresponding chord is the aug-
mented triad1 C+ = {c, f, a} that is reproduced after a transposition T 4 of
pitch classes. And the corresponding motivic zigzag of period 4 is shown below
the chord C+ to the right in Figure 16.5.
Inspired by the harmonic band of triads, the idea was to construct a
covering of the unit of zigzag, shown on top of Figure 16.6, that would have a
nerve in the shape of a Moebius band, too. Figure 16.6 shows a solution that
consists of nine three-element motives.
Fig. 16.6. The covering of the motivic unit (top) by nine three-element motives has
a Moebius band nerve.
Fig. 16.8. Distribution of the three-element motives in the pitch-onset plane. Group
A, B, C, D, E, F yields Dr (r for right hand), while Group A , B , C , D , E , F yields
Dl (l for left hand).
Fig. 16.10. The score part of measures 33-38 corresponding to the distribution Dr ,
Dl . The sound example is sonata.
Nerves are a precise and powerful tool to discuss the difficult concept of musical
complexity in analysis and composition. The question is here what makes a
composition complex. There are two extremal positions when considering a
score: One can say that this object is just a set of notes. This would reduce
the score to an atomic perspective. The corresponding covering would be the
one which has exactly one note in each chart. The nerve of this covering would
be trivial, consisting only of zero simplices. And all these charts would be
isomorphic by shifting around one note to produce all the others. This would
amount to simply counting notes, a totally destructive classification: to reduce
a composition to the cardinality of its note set.
138 16 Nerves
2 230 741 522 540 743 033 415 296 821 609 381 912 ∼ 2.2336
Fig. 16.11. A harmonic band. Copy it to a separate paper, cut it out there, fold and
glue it using the small purple tab.
Part V
Summary. Monoids are the simplest type of algebraic structure, and for this
reason they are omnipresent in mathematics. This situation is parallel to the
hierarchy of numbers. The monoids will be extended structurally (not as sets!)
to groups, rings, and modules later.
–Σ–
single vertex and A the set of arrows. Here, Cyc(A) consists of sequences
of arrows (all being loops!), i.e., words built from letters in A.
ˇ “*
Example 41 In music theory, scales play an important role. In the pitch-
class set,P12 , the D-major scale is represented by d, e, f , g, a, b, c, d, when
starting from d. This information is often encoded by examining the intervals
(in multiples of semitone steps) between consecutive pitch classes. In our ex-
ample, this would be the sequence (2, 2, 1, 2, 2, 2, 1). This information is then
encoded as a word in the monoid W ord(A), where A = {a, b} is the alphabet
whose letters encode the two intervals, a stands for 1, and b stands for 2. This
encoding represents the scale by the word bbabbba. If we had taken the melodic
D-minor scale d, e, f, g, a, b, c, d, the word representation would be babbbba, ex-
changing the second and third letters in the major scale word. Of course, this
representation is not reliable since it does not define the pitch class where the
intervals start. We shall come back to the mathematical nature of this abstrac-
tion in Chapter 19. In recent times, this formalism of word monodies has been a
strong research direction in mathematical music theory, see the corresponding
papers in [5].
The implicit idea of such a scale word is that the sum of the intervals
corresponding to the letters a, b is 12, the octave interval. But the formalism is
indeed more general. One may define any word w ∈ W ord(A) for any alphabet
A with the interpretation of letters as intervals being given by a set function
int : A → P12 . Then the scale scale(w) would be the sequence of pitch classes,
starting from 0, say, and defined by the succession of intervals associated to
letters by the function int. It is understood that after the last interval of w,
the word starts over.
For example, the above major scale of word w = bbabbba yields the mul-
tiple concatenation of intervals, for example the triple one
For a non-octave scale word w = b, the word wwwwww yields the scale
c, d, e, f , g, a, c, the first Messiaen scale, also known as the whole-tone scale.
For the non-octave scale word w = ba, on gets wwww = babababa, which
corresponds to the scale c, d, d, f, f , g, a, b, c, which is known as the second
Messiaen scale. For w = aab, the iterated word www = aabaabaab yields the
scale c, c, d, e, f, f , g, a, a, c, the third Messiaen scale. But one may also get
non-octave compatible words such as w = bba, whose iteration eventually fills
every pitch class. However, one may also work in the pitch set Z and then gen-
erate a selection of pitches that don’t repeat themselves after octaves. In our
example, w = bba, one would get this scale: c, d, e, f, g, a, a, c, d, d, f, g, g, . . ..
ˇ “*
Example 42 Another use of the word monoid W ord(A) in music is the
formal representation of harmonic syntax, in the Riemann formalism, say. Here
we have an alphabet (we use only a simple model of harmonic functions here)
17 Monoids 145
ˇ “*
Example 43 In the theory of rhythm of music and language, word monoids
are also the standard representation. For example, in poetry, a rhyme scheme,
such as ABAB, represents the occurrence of a (end)rhyme word in a succession
of a poem’s lines.
Within a line of a poem, the distribution of short (symbolized by “letter”
∪) and long (symbolized by “letter” —) syllables defines a metrical line. Typical
“elementary” words of meters are the dactyl —∪∪ or the spondee — —. Here is
a classical hexameter, a product of dactyls and spondees, from Virgil’s Aeneid:
Arma vi(∗)rumque ca(∗)no, Tro(∗)iae qui(∗) primus ab(∗) oris
(—∪ ∪ ∗ —∪ ∪ ∗ — —∗ — —∗ —∪ ∪ ∗ — —)
These structures have been used for the rhythm of musical compositions
with lyrics. A wonderful example being Franz Schubert’s composition op. 72
for Leopold Stolberg’s poem Lied of dem Wasser zu singen. See [75, Section
11.6.2] for a thorough discussion of the poetic rhythm of musical three-note
motives in Schubert’s composition.
In music theory, certain large forms, such as the song form ABA or the
sonata form ABAD represent the succession of parts and their iteration. For
the sonata form, it is A for the exposition, B for the development, again A for
the recapitulation, and D for the coda.
Definition 48 Let (M, ∗) be a monoid and let S be any subset of N . Then the
intersection
S := N
N ⊂M submonoid with S⊂N
Summary. Monoids that have only invertible elements are called groups.
Groups are the most important single structure in algebra and have enormous
applications in physics as well.
–Σ–
Fig. 18.1. The symmetries of the square build a group under composition.
ˇ “*
Example 45 Let us consider the cartesian product space OP = R × R of
onsets and pitches, for which we allow real number values, just to view the
situation in a geometric way. Within the group Sym(OP ) of bijections on OP ,
there are some bijections that are classical in music: retrograde R, indexinver-
sioninversion I, and retrograde inversion RI. The operation R is defined by
R(x, y) = (−x, y), inversion is I(x, y) = (x, −y), and retrograde inversion is
RI(x, y) = (−x, −y), see Figure 18.2. (We should not care about getting neg-
ative pitch values here, we can always re-calibrate pitch to get positive values
by shifting the zero down.)
Fig. 18.2. The Klein 4-group of retrograde R, inversion I, and retrograde inversion
IR = RI.
Retrograde R and inversion I are reflections at the vertical axis and the
horizontal axis, respectively. But RI is not a reflection, it is a 180o rotation
around the origin (0, 0) of the plane P O. In music theory, RI is not understood
as a rotation, but as the composition R ◦ I = I ◦ R of retrograde and inversion.
In terms of group theory, we have a small group R, I that is generated by R
and I, with four elements IdOP , R, I, RI = IR. You can easily check that this
is in fact a group. It is called the Klein 4-group K4 . This group is generated by
retrograde and inversion, and the product RI is not understood geometrically
as a rotation, but as a composition of two musically understandable operations.
It has two non-trivial subgroups R = {IdOP , R} and I = {IdOP , I}.
The understanding of RI as a rotation has, however, been realized in a
very practical way by Ludwig van Beethoven. At his time there were public
piano competitions. Several pianists would perform whatever they liked and
then be judged by the audience. In such a competition, Beethoven was fighting
against the quite famous pianist Daniel Steibelt. After Steibelt had played his
150 18 Groups
Summary. In musical creativity, actions are very important. They always deal
with two components: the agent who acts in a determined way, and the object
on which this action is being performed. Of course, not just any action can be
performed on any object, so one has to specify agent/action as well as domain
of objects that are suitable for a determined action. In this chapter we develop
the formalism of group actions and provide introductory examples.
–Σ–
Figure 19.1 shows actions taken while building musical instruments or
composing a score. Of course, these intuitive situations are far from precise
mathematical formalism.
Fig. 19.1. Left: Building instruments is an action that combines parts to produce
a whole. But this is not always possible. Right: The musical composition is also an
action that (usually) generates a score. The neutral action could be the creation of
an empty score (recall John Cage’s empty composition in Figure 3.3), while analysis
could be seen as the inverse action.
19.1 Actions
Definition 52 A group homomorphism f : G → Sym(X) for a set X is called
an action of G on X. Equivalently, a group action is a function G × X → X :
(g, x) → g · x such that
(i) e · x = x for all x ∈ X and
(ii) (gh) · x = g · (h · x) for all x ∈ X.
The correspondence being g · x = f (g)(x).
ˇ “*
Example 46 In his seminal book [60, Definition 2.3.1], David Lewin intro-
duces a core concept of his transformational theory, the Generalized Interval
System. Here is his original text:
DEFINITION: A Generalized Interval System (GIS) is an ordered triple
(S, IV LS, int), where S, the space of the GIS, is a family of elements,
IV LS, the group of intervals for the GIS, is a mathematical group,
and int is a function mapping S × S into IV LS, all subject to the two
conditions (A) and (B) following.
(A): For all r, s, and t in S, int(r, s)int(s, t) = int(r, t).
(B): For every s in S and every i in IV LS, there is a unique t in S
which lies in the interval i from s, that is a unique t which satisfies the
equation int(s, t) = i.
We want to show that this concept is that of a simply transitive action of a
group. We have the group IV LS opp (see Example 17) that acts on the space S in
the following way. Call · the composition in IV LS and · the composition of the
opposite group. For any ordered pair (i, s) ∈ IV LS×S we have the action i∗s =
t for that unique element t ∈ S such that in Lewin’s approach i = int(s, t).
We can also state this by the equation int(s, t) ∗ s = t. Let us show that this
is indeed an action. The equation in Lewin’s point (A) shows that the neutral
element e ∈ IV LS acts with e ∗ s = s. In fact, int(s, s)int(s, s) = int(s, s).
Multiplying this equation with int(s, s)−1 yields int(s, s) = e. Moreover, if
i = int(t, r), j = int(s, t) are two group elements, and if j ∗ s = int(s, t) ∗ s,
i ∗ (j ∗ s) = i ∗ (int(s, t) ∗ s) = i ∗ t = int(t, r) ∗ t = r, while (i · j) ∗ s =
(int(t, r) · int(s, t)) ∗ s = (int(s, t) · int(t, r)) ∗ s = int(s, r) ∗ s = r. The action
19.1 Actions 153
ˇ “*
Example 47 Group actions are very frequent in music theory. Take G =
(Z, +), the additive group of integers and S = Z the set of pitches. Then G
acts on S by transposition, i.e., g ∗ s = T g (s) = g + s. This action is obviously
simply transitive. If we take the subset F in(Z) ⊂ 2Z of finite subsets of Z,
which can be interpreted as chords, we have the action Z × F in(Z) → F in(Z) :
(g, c) → g ∗ c = {T g (x)|x ∈ c}. This action is not simply transitive because it
is not transitive, although it is free, that is, g ∗ c = h ∗ c iff g = h, which means
that the map G → S : g → g ∗ s is injective.
The orbits of chords under this action are called transposition classes of
chords. For example, the chords in the orbit [{0, 3, 6}] are called diminished
triads, while the chords in the orbit [{0, 4, 7}] are called major triads.
ˇ “*
Example 48 We may also consider a larger group T I that acts on the pitch
set Z and consists of the transpositions T+t = T t and inversions T−t of Z. This
group acts transitively on Z, but its action is not free because for every s ∈ Z,
we have T−2s (s) = 2s − s = s, together with T+0 (s) = s. This group acts on
F in(Z) in the same way (Z, +) acted by transpositions. For a given chord c,
the subgroup F ix(c) of transpositions and inversions g such that g ∗ s = s is
called the fixpoint group of s. It describes the inner symmetries of chord c. The
diminished triad c = {0, 3, 6} has F ix(c) = T−6 , while for c = {0, 4, 7}, the
major triad, the fixpoint group is trivial. The T I-orbit [{0, 4, 7}] is the disjoint
union of the T -orbit of {0, 4, 7} and the T -orbit of {0, 3, 7}, major and minor
triads.
The most important primary application of group actions are actions of
subgroups H ⊂ G by cosets. These actions are defined by right multiplications
within the given group, i.e., G × H → G : (g, h) → gh. The orbits of such
an action are called left cosets of H. They are the sets gH = {gh|h ∈ H}.
In particular, two left cosets gH, g H are equal iff g −1 g ∈ H. The set of left
cosets is denoted by G/H. If we choose one representative g for each left coset
154 19 Group Actions, Subgroups, Quotients, and Products
gH, we have G = representative g gH. Moreover the maps H → gH : h → gh
are bijections. Therefore we have a bijection
∼
G → G/H × H.
One may also define right cosets Hg that correspond on an action from the
left (corresponding to a function H × G → G : (h, g) → hg). The orbits of this
action are the right
cosets Hg. They also define a partition by disjoint right
cosets, i.e., G = representative g Hg. The set of these equivalence classes is
denoted by H\G, and we have a bijection
∼
G → H\G × H.
∼
Left and right cosets correspond to each other by the bijection ?−1 : G → G :
g → g −1 . This implies that left cosets are mapped one-to-one to right cosets.
∼
Therefore G/H → H\G. The shared cardinality of G/H and H\G is called the
index of H in G and is denoted by (G : H). We therefore have
ord(G) = (G : H) × ord(H).
If G is finite, we have the famous Lagrange equation
ord(G) = (G : H)ord(H)
of natural numbers. In particular, the order of a subgroup H ⊂ G always divides
the order of the group G. For example, if G = S6 , we have ord(S6 ) = 6!, which
implies that there is no subgroup H whose order does not divide 6!—e.g., there
is no subgroup of order 7.
ˇ “*
Exercise 6 Recall the Klein 4-group discussed in Musical Example 45. Are
there subgroups of this group of order 3? Interpreting a dodecaphonic series as
a permutation in S12 , can its order be 12, 13, or 23?
If x ∈ G is a group element, its order is by definition the order of the group
it generates, i.e., ord(x) = ord(x). For a finite group G, the order of an ele-
ment must be a divisor of the group’s order: ord(x)|ord(G) by Lagrange’s equa-
tion. How do we find this order for x? One looks at all the powers e, x, x2 , x3 , . . .
of x. Since G is finite, they cannot all be different from each other. Let xm be
the first power of x that equals a preceding power, xm = xn , n < m. Then, after
dividing by xn , we see that e = xm−n . Since m was the first power of x that
equals a preceding power, this means that n = 0, and the group x consists
of the different powers e, x, . . . xm−1 . For example, we have x−1 = xm−1 . In
particular, m = ord(x).
Fig. 19.2. The cyclic groups Z3 , Z6 , Z9 , Z12 viewed as subgroups of the unitary group
U of complex numbers.
√
Example 19 The simplest example of a quotient group is given by taking
for every positive n ∈ N the subgroups n = nZ ⊂ Z generated by n. It
156 19 Group Actions, Subgroups, Quotients, and Products
Fig. 19.3. The quotient group Z12 , where the coset representatives 0, 1, 2 . . . 11 are
shown together with their standard interpretation as pitch classes of pitches in Z.
ˇ “*
Example 49 The case n = 12 is central for the quotient group Z12 . See
Figure 19.3, where the coset representatives 0, 1, 2 . . . 11 are shown together
with their standard interpretation as pitch classes of pitches in Z modulo the
octave of 12 semitones.
G
f
- K
6
f
proj inj
?
?
--
∼f 6
G/H → Im(f )
where proj is the canonical surjection, inj is the embedding of Im(f ), and f
is the isomorphism of groups that sends the coset gKer(f ) to f (g).
The proof is straightforward, and the only point is the definition of f . This
function is well defined since gKer(f ) = g Ker(f ) means g −1 g ∈ Ker(f ), so
f (g) = f (g ).
√
Example 20 The subgroup T ⊂ T I is normal, the quotient group is
∼
T I/T → {±1} = Z∗ .
We have seen that for chords ch ∈ F in(Z), the action of the group of transpo-
sitions T or the group T I of transpositions and inversions defines orbits that
are musically significant, for example defining a major triad as an element of
the T -orbit [{0, 4, 7}]. We can transfer this type of group action to pitch classes
if we deem irrelevant multiple appearances of a pitch name or the absolute
position in Z. We therefore consider the set of chords of pitch classes that is
defined by P CChords = 2Z12 . We include the empty or one-element chords for
completeness, although they are not considered chords in common terminology.
On P CChords = 2Z12 , we have the action of group T IZ12 of permutations
of Z12 that acts as before: T±t (x) = t±x, where t ∈ Z12 . The formal rules on this
group are as explained for T I in Example 20. We also have the normal subgroup
T Z12 ⊂ T IZ12 of transpositions T+t . The action of T IZ12 on P CChords follows
the same scheme as described in in Example 20—a chord is transformed by
f ∈ T IZ12 via the transformation of all of its elements by f . See Figure 19.4
for the action of an inversion on a chord.
The action of T Z12 and of T IZ12 define two classifications of chords:
T Z12 -orbits are called transposition classes, while T IZ12 -orbits are called
transposition-inversion classes. These classes are redundant, however, since we
also have the action of the two-element group Z2 on P CChords by complemen-
tation, i.e., −ch = 12 − ch is compatible with the T IZ12 -action. It is therefore
sufficient to classify chords of cardinality ≤ 6, and those with cardinality > 6
can be recovered from the classification of their complements.
The complete list of representatives of such classes is folklore in mathemat-
ical music theory for half a century. We refer to a complete list in [75, Appendix
L.1]. This list is far from random. The representatives of chord classes are cho-
sen according to the following criteria: We take the lexicographic ordering of
158 19 Group Actions, Subgroups, Quotients, and Products
Fig. 19.4. The inversion T−5 applied to chord ch = {c, e, g, b}.
19.3 Products
Cartesian products of groups are defined as follows.
ˇ “*
Example 50 In music theory, cartesian products of groups are important.
We shall see in Chapter 21 that the product Z3 × Z4 , together with its two pro-
jections pr3 : Z3 ×Z4 → Z3 and pr4 : Z3 ×Z4 → Z4 is the single most important
mathematical structure needed to understand the structure of intervals, and in
particular core properties of counterpoint theory.
The universal property of cartesian products of sets holds for groups mu-
tatis mutandis:
is a bijection.
Co
diminished
triad
15 • ◦ ◦ • ◦ ◦ • ◦ ◦ ◦ ◦◦
C+
augmented
triad
16 • ◦ ◦ ◦ • ◦ ◦ ◦ • ◦ ◦◦
C
major triad
10.1 • ◦ ◦ ◦ • ◦ ◦ • ◦ ◦ ◦◦
Cm
minor
triad
10.1 • ◦ ◦ • ◦ ◦ ◦ • ◦ ◦ ◦◦
C M 7 or
C maj7
major 7th
28.1 • ◦ ◦ ◦ • ◦ ◦ • ◦ ◦ ◦•
C7
dominant
7th
29 • ◦ ◦ ◦ • ◦ ◦ • ◦ ◦ •◦
CmM 7
minor
major 7th
30 • ◦ ◦ • ◦ ◦ ◦ • ◦ ◦ ◦•
Cm7
minor 7th
22.1 • ◦ ◦ • ◦ ◦ ◦ • ◦ ◦ •◦
C o7
dimin.
7th
37 • ◦ ◦ • ◦ ◦ • ◦ ◦ • ◦◦
Summary. This chapter deals with the study of the permutation groups Sn .
We also give examples of compositional methods using permutation groups.
–Σ–
Let us now investigate minimal sets of generators of Sn , and recall that
ord(Sn ) = n!
There are different ways to represent permutations p ∈ Sn . A simple basic
representation is to write the full table of ordered pairs (i, pi = p(i)), i = 1, . . . n.
This is usually done by a 2 × n-matrix, a rectangular system with two rows and
n columns:
1 2 ... n
p1 p 2 . . . p n
A more economical way is to use cycles. A cycle is an ordered k-tuple
C = (c1 , c2 , . . . ck ) of pairwise different numbers 1 ≤ ci ≤ n. This represents a
permutation that sends ci to ci+1 for i = 1, . . . k − 1 and then ck to c1 . The
number k is the length l(C) of C. The k-element set c1 , c2 , . . . ck of C is de-
noted by |C|, so card(|C|) = l(C). Cycles C = (c1 , c2 ) of length 2 are called
transpositions. We have already encountered transpositions in the calculation
of ord(Sn ). Cycles are standard generators of Sn .
Fig. 20.2. Mozart’s first dice number table. Rows correspond to dice numbers,
columns to measure numbers. The coefficients within the table are measure num-
bers Mozart defined.
Mozart’s musical dice game is entitled “Walzer und Schleifer mit zwei Würfeln
zu componieren ohne Musikalisch zu seyn, noch von der Composition etwas
zu verstehen.” (To compose a waltz with two dices without being musical and
20.1 Two Composition Methods Using Permutations 167
knowing anything about composition.) The idea is that the user can define a
sequence of eight measures I, II, III, . . . V III of music by throwing two dice
and then using a table (Figure 20.2) to look up a corresponding musical mea-
sure. The table has eleven rows that correspond to the result of adding the
two dice rolls (2 = 1 + 1 on top to 12 = 6 + 6 at the bottom). For measure
I, the user finds the dice result under column I. For example, we get 119 if
the dice roll yields 9, corresponding to measure 119 on the list of 176 possible
musical measures Mozart provided. The first thirty measures (all in waltz time
signature 3/8) are shown in Figure 20.3. This system allows creation of 118 =
214,358,881 possible compositions of eight measures each.
A well-known game that uses the concept of group is the Rubik’s CubeT M . It is
the world’s best-sold toy. Invented by Hungarian professor and architect Ernö
Rubik, the cube was first shown to his students as a teaching tool. Years later,
under the initial name of Magic Cube (perhaps in analogy with magic square
of letters and numbers), the cube was produced for sale [97].
The classic version is a 3 × 3 × 3 cube, where each face has a different
color. Sides can be moved. Among all possible combinations, there is only one
solution that completes the six faces with correct colors. Solving techniques are
studied by mathematicians. There are other versions of the cube, for example
2×2×2 and 4×4×4 (Pocket and Rubik’s RevengeT M ). The latter was invented
by Péter Sebestény [70].
One of the authors (Maria Mannone) used a 4 × 4 × 4 cube to create a
harmonic game, the Cubharmonic, shown in Figure 20.4. The idea is to exper-
iment with creating new harmonic sequences. On each face of the cube we can
write a harmonic sequence of a four-part harmony. For example, the cadence
I − IV − V − I in C-major can be written as
c c b c
g a g g
e f d e
c f g c
We can write these letters on one face of the cube. If we write a different
harmonic sequence on each face, by rotating and twisting sides we get a huge
number of different harmonic combinations. The number of combinations is
greater than for the classic 4 × 4 × 4 cube. The reason is that little squares
of the same color, having been marked with different names of notes, are not
undistinguishable. The scope of the original game was to recover the initial
configuration of one color on each face, which means from chaos to cosmos.
Here the main interest is to mix sequences, from cosmos to chaos. We can ex-
change chords, and we can also modify horizontally voices in the same harmonic
sequence.
168 20 Permutation Groups
Fig. 20.3. The first thirty measures, in waltz time, out of a total of 176 on Mozart’s
list.
The cube of Figure 20.4 presents a choice of six different harmonic se-
quences, shown in Figure 20.5. Three different results, obtained via simple
permutations, are given in Figure 20.6.
20.1 Two Composition Methods Using Permutations 169
Fig. 20.5. A choice of six different cadential sequences (one for each side of the cube)
as a starting position. The sound example is cubharmonic_1.
Fig. 20.6. Three different results obtained via simple permutations, i.e., scrambling
Summary. This chapter deals with the mathematics of the third torus group
Z3 × Z4 and its symmetries and then applies these structures to music theory,
in particular to counterpoint.
–Σ–
The third torus is a group that has a rich structure despite its simple mathe-
matical shape. It is also a central object of mathematical music theory, which
will be discussed in Section 21.2.
The third torus is the finite abelian group T3×4 = Z3 × Z4 . Let us look at
the finite cyclic groups Zn that define the torus’ background structure. If we
consider elements z ∈ Zn , they are cosets z = x + nZ, and we denote these by
xn = x + nZ. The group structure on Zn yields (x + y)n = xn + yn . We also
write x ≡ y(mod n) and say that x is congruent to y modulo n for xn = yn .
If n|m, we have a canonical surjective group homomorphism Zm → Zn :
xm → xn . It is well defined since mZ ⊂ nZ, therefore xm = ym implies xn = yn .
In particular, we have canonical surjections pr3 : Z12 → Z3 and pr4 : Z12 → Z4 .
The kernels are the four-element subgroup Ker(pr3 ) = 312 and the three-
∼ ∼
element subgroup Ker(pr4 ) = 412 . I.e., Z12 /3 → Z3 and Z12 /4 → Z4 .
√
Exercise 26 Verify these facts.
Using the universal property of cartesian products of groups, this data can be
combined to define a group homomorphism (pr3 , pr4 ) : Z12 → Z3 × Z4 . This
homomorphism is inserted in our commutative diagram
?
? -
-
∼ (pr3 ,pr4 ) 6
Z12 /Ker((pr3 , pr4 ))→ Im((pr3 , pr4 ))
The kernel Ker((pr3 , pr4 )) is the intersection Ker(pr3 ) ∩ Ker(pr4 ) of the
two kernels Ker(pr3 ), Ker(pr4 ), but this is trivial. Therefore, in our diagram,
the left projection is an isomorphism, and so is the homomorphism onto the
image Im((pr3 , pr4 )). This image of Z12 has 12 elements, and Z3 × Z4 also has
12 elements, so the image is all of Z3 × Z4 , which means that (pr3 , pr4 ) is an
isomorphism of groups. It is advantageous for music theory to compose this
Fig. 21.1. The third torus. We see four vertical circles—copies of Z3 —attached to
the four pitch-class points 0, 3, 6, 9.
On the third torus T3×4 we can define a metrical distance function by d(z, w)
being the minimal number of minor or major third steps to reach w from z.
The third steps are addition or subtraction of (13 , 04 ) or (03 , 14 ).
This metrical geometry on T3×4 is important because it is invariant under
all symmetries of T3×4 . Let us now explain what symmetries of T3×4 are. On
Z we had considered action of the group T I whose elements are the function
t
T±1 : z → t ± z. This means that we considered the invertible elements ±1 of
∗
Z and used the multiplication with such elements, together with a shifting by
t: T±1t
(z) = t + (±1)z.
We want to carry over this idea to Z12 . The problem is that so far we
have no multiplication of elements of Z12 . This is what we shall introduce now.
Later, in Chapter 24, we will see that the present procedure is a very simple
special case of a general method to introduce multiplication in quotient groups.
Multiplication on Zn is defined by xn · yn = (x · y)n . If this works, the
arithmetic on Z can be carried over “literally” to Zn as addition and mul-
tiplication are both defined by going back to Z, doing it there, and then
taking cosets; also refer to Musical Example 19. Take two different repre-
sentatives of our cosets, x = x + nz, y = y + nw instead of x, y. Then
x · y = (x + nz) · (y + nw) = xy + n(xw + zy + nwz) = xy(mod n), and
we are done.
For Z12 this multiplicative structure defines four elements that are invert-
ible, namely Z∗12 = {112 , 512 , 712 , 1112 }. We have 1212 = 5212 = 7212 = 11212 = 112 .
All other elements are not invertible, for example 312 412 = 012 .
With this multiplicative structure we now generalize the above construc-
tion of T I for Z. Rewrite T I = T Z∗ . We now define T Z∗12 as the group of
permutations of Z12 defined by the elements T t .s, t ∈ Z12 , s ∈ Z∗12 defined by
T t .s(z) = t + sz for z ∈ Z12 . This is a group under usual composition of func-
tions. In fact T t .s ◦ T u .v = T t+su .sv, and (T t .s)−1 = T −st .s, while T 0 .112 is
neutral. Symmetries on Z12 are by definition the permutations in the group
T Z∗12 .
The next step consists of proving that all symmetries are in fact geometri-
cally reasonable if we transport them to the third torus using the isomorphism
t. Here is the geometric interpretation, which we specify for generators of the
symmetry group T Z∗12 . We usually omit the factor 1 if the symmetry is T t .1,
see Figure 21.2.
174 21 The Third Torus and Counterpoint
Fig. 21.2. The symmetries of the torus all conserve the third distances.
Theorem 19 The group of symmetries T Z∗12 defines on the torus T3×4 a group
of symmetries that conserve metrical distances, i.e., all symmetries are isome-
tries on the third torus.
This is a remarkable result since on Z12 , multiplication with 5 or 7 does
not conserve distances in the circle representation of Z12 .
We have seen that chord classes under the action of T Z∗ are standard in music
theory. But we may as well look at classification of chords by orbits of the larger
21.2 Music Theory 175
group T Z∗12 . This is the classification that has been achieved in [75, Appendix
L.1]. There, two T Z∗ classes XX and XX.1 are representatives of the same
T Z∗12 class.
ˇ “*
Exercise 8 Try to find the number of key alterations ( or ) for the trans-
positions T 5 X starting from a scale X0 of seven white keys if the piano has the
following white keys: {c, d, e, f, g, a, b}. You will see that the above rule does
not hold.
The explanation comes from a different representation of the diatonic ma-
jor scale C. We use the symmetry T 0 .5 of Z12 and see that the scale C is mapped
to 5C = {7, 8, 9, 10, 11, 0, 1}, see Figure 21.3. This image is an uninterrupted
sequence of fourths. In this representation, we also see the distinguished roles
of f and b. They mark the boundaries of the fourth sequence of the scale. They
are known in music theory as the leading notes. We shall come back to this
observation in Chapter 23.
What is important here is that the transposition by 5, one fourth, moves
the sequence of fifths to a new sequence of fifths by one unit clockwise. One
black key (b) is added while one white key (b) is lost. This transposition by
one fourth now clearly always added one more black key, namely the succes-
sion e, a, d, g, c = b to the transposed scales and simultaneously removes
the keys e, a, d, g, c. Therefore the 5-symmetry explains this key signature phe-
nomenon for the major scales.
21.2.3 Counterpoint
In counterpoint, one starts with the construction of a composition from two
voices: cantus firmus (CF) and discantus (D). The rules that determine admis-
176 21 The Third Torus and Counterpoint
Fig. 21.3. The representation of the C scale in Z12 after multiplication by 5. The
key T 5 C = F is the rotation of 5C by one unit in clockwise direction.
sible constructions began developing in the 9th century in Europe and were
quite stabilized in the 16th century, typically as used by the great composer
Giovanni Pietro Aloisio Sante da Palestrina. These rules (or what they thought
were such rules) were written down in the small book Gradus ad parnassum by
Johann Joseph Fux in 1725 [42].
Despite the simplicity of Fux’s rules, there are a number of unsolved prob-
lems with this theory. It commences with the first species counterpoint. It is the
basis of all subsequent situations, namely second species, third species, fourth
species, and fifth species (florid counterpoint). We focus on the first species
here. It is defined by the CF melody and defines rules to add a second voice
(D), where for each note of CF one defines one note of D, of same duration
and onset. See Figure 21.4 for an example. The example shows the two voices
and a central feature: CF and D notes of same onset define consonant intervals.
These intervals (differences of pitch classes) must be of six types (if we work
in Z12 ), prime (0=12), minor third (3), major third (4), fifth (7), minor sixth
(8), and major sixth (9). The other six interval numbers—minor second (1),
major second (2), fourth (5), tritone (6), minor seventh (10), and major seventh
(11)—are dissonant. In our example, we see that the consonances 0, 4, 4, 9, 9,
0 are played.
This definition of consonances is the first big problem in counterpoint: The
selection of these intervals is not justified by acoustical arguments. The critical
interval is the dissonant fourth. In fact, recall that in the Pythagorean tuning
tradition, the fifth frequency ratio 3/2 was consonant, but also the fourth ratio
4/3. Because the common basis of music theory is the just tuning system, this
constitutes a serious problem, which has been recognized by leading scholars
such as Carl Dahlhaus (see [79, Chapitre 13] for a discussion). He argues that
21.2 Music Theory 177
Of course, for any symmetry f ∈ T Z∗12 , the dichotomies f (X), f (Y ) are also of
this type if (X, Y ) are so. But there are no other such dichotomies. It is remark-
able that the dichotomy #64 has as first half {2, 4, 5, 7, 9, 11} the set of proper
intervals of a major scale when counted from its tonic. It is therefore called the
major dichotomy. The question now is to exhibit a property that distinguishes
the consonance-dissonance dichotomy (K, D) (#82) from the others. The an-
swer is found when we consider these dichotomies within the third torus. We
want to learn how the interval numbers are distributed on the torus. To this
end we define two numbers, diameter and span.
178 21 The Third Torus and Counterpoint
It is obvious by the invariance of distance on the torus that diameter and span
are invariant within a class of dichotomies that is generated by the symmetries
f ∈ T Z∗12 .
Figure 21.5 shows these numbers (a), and the configuration for the Fux di-
chotomy (K, D) (b).
σ(X)
16 82 68
10 75 71, 78 64
24 25 28 29 δ(X)
a)
4 K
7 8
0
3 11
b)
1
10 2 5
9
6
D
Fig. 21.5. The diameters and spans of the six strong dichotomies are shown (a), and
the geometric configuration for Fux dichotomy (K, D) is shown in (b).
From this result we learn that the Fux dichotomy (K, D)—#82 in Fig-
ure 21.5 a)—has the smallest diameter and the largest span. In other words,
21.2 Music Theory 179
its consonant elements are separated in an optimal way from its dissonant el-
ements. This distinguishes this dichotomy from the other five possibilities. It
is interesting that the major dichotomy #64 plays a polar role, its members
are “mixed” in an optimal way. The role of strong dichotomies is far from in-
vestigated in music theory. Let us just give one more remarkable example.
The Russian composers Alexander Scriabin is known for his quite intriguing
approach to harmony (among other far out approaches). His most famous dis-
covery is the so-called mystic chord. Figure 21.6 shows the chord in its common
representation (left) and in its representation on Z12 (right). The chord has six
notes, and we recognize immediately that it is isomorphic to the dichotomy
of class #78. The chord also is covered by the four prominent triads: major,
minor, augmented, and diminished (middle of Figure 21.6).
Fig. 21.6. Scriabin’s mystic chord (left) is one half of the strong dichotomy #78
(right). Four prominent triads—major, minor, augmented, and diminished—cover
the chord.
ˇ “* Exercise 9 Calculate the nerve of the covering of the mystic chord by ma-
jor, minor, augmented, and diminished triads. Is it a tetrahedron?
known for its fast harmonic changes. Several pianists, such as Tommy Flanagan
and Cedar Walton, experienced difficulties in performing it.
Fig. 22.2. The lead sheet of John Coltrane’s composition Giant Steps.
22.1 The Analysis 183
In the first two measures, every melodic note has another chord, and
not just slightly different ones: Coltrane jumps around in the entire harmonic
spectrum.
We want to not only present an analysis, but also sketch how the analytical
part can be used to generate a new composition that incorporates the analytical
insights. This new composition by one of the authors (Mazzola) is entitled
Giant’s Steps and has been performed live and also recorded for a music DVD
[44] in fall 2014 at the Airegin Club in Yokohama, Japan. This procedure of
analyzing and then recreating a composition has been described by composer
and theorist Pierre Boulez as analyse créatrice (creative analysis) in [22], see
also [20]. It is also discussed in [79, Chapter 7] and [84, Chapter 25].
The second symmetric group comprises the next eleven chords, as shown
in Figure 22.4. The symmetry is the same, a harmonic retrograde inversion
184 22 Coltrane’s Giant Steps
Fig. 22.3. These nine consecutive chords are arranged in a perfect symmetry around
d.
Fig. 22.4. These eleven consecutive chords are arranged in a perfect symmetry
around d.
for the inversion Id . Again, the symmetry is expressed not only by the inner
symmetry of the central minor chord Am7 , it is also visible for the first and
last two chords of this group.
Fig. 22.5. The two melodies, the initial and the second on measure five, are arpeggios
of chords in those positions, and the notes following these melodies are Id -symmetric
to notes within these melodies.
22.1 The Analysis 185
Moreover, the lead sheet’s first and last chord, B maj7 and F maj7 (because
we’re treating F 7 as F maj7 ), are symmetric under Id . Coltrane’s deeply har-
monic style of thought1 is visible in his melodic approach, see Figure 22.5.
The initial melody is an arpeggio of Gmaj7 , the first chord in the second mea-
sure. The subsequent b is the Id symmetric note to the initial f . The second
melodic part of measure five is an arpeggio of Emaj7 , the first chord of measure
six. Again, the subsequent note f is Id -symmetric to the second melodic note
b. And these two symmetric note groups are also symmetric with each other
(b, f ) → (f , b)!
Fig. 22.6. The totality of chords. We have connected any two chords that are Id -
symmetric with each other. The two groups of nine and eleven chords, respectively,
are shown. We recognize that there are nine chord names, all except the three names
C, E, G.
(right). Whereas the left path is not very regular, the right one is a perfectly
Fig. 22.7. The chord paths for the nine-element group (left) and the eleven-element
group (right).
Given these analytical results, we have recreated the composition in the vein of
Boulez’s creative analysis. The overall insight was that there are two levels in
Coltrane’s harmonic construction: the one to the left in Figure 22.8, rotating
in that circle of four positions, and the one to the right, “descending” to the
symmetric pair F maj7 and B maj7 . In Coltrane’s construction, these two levels
are taken in alterations: circle goes to the symmetric pair goes to circle goes to
22.2 The Composition 187
Fig. 22.8. The total chord change system parametrized by Id symmetry and fifth
transposition T 7 chord relations.
the symmetric pair. We have therefore added a dynamic down and up trajectory
as shown in Figure 22.9.
The given harmonic units, such as the circle, have been reshaped by mo-
tivic structures that are arpeggios of harmonies given by Coltrane’s architec-
ture. Figure 22.10 shows such constructions for the circle part. The circle is
shown to the bottom right together with four arpeggios and motives that repre-
sent the harmonic positions as well as the symmetry Id . The harmonic positions
are made evident by circle of fifths representations of G-major and D-major
tonalities. This enables us to represent Coltrane’s short spots in a more explicit
melodic rendition.
The overall structure of our Giant’s Steps composition alternates between
a relatively fast upper part around the circle of four positions shown in Figure
22.10, on the one hand, and around the symmetric pair F maj7 and B maj7 on
the other. This lower part is associated with a more bluesy mood and interest-
ingly is easily associated with Coltrane’s A Love Supreme main motive. Our
score of Giant’s Steps as it is used in performances is shown in Figure 22.11. It
shows the two-level scheme in terms of a sequence of chords and motivic cells.
188 22 Coltrane’s Giant Steps
Fig. 22.9. The dynamics between the upper “speed” level and the lower “blues” level.
The trajectory alternates between them and shows a circular up-down path.
Fig. 22.10. Motivic structures that are arpeggios of harmonic structures given by
Coltrane’s architecture.
22.2 The Composition 189
Fig. 22.11. The two-level scheme in terms of a sequence of chords and motivic cells.
23
Modulation Theory
Fig. 23.1. Schönberg’s tripartite modulation, comprising the old tonality neutraliza-
tion step, then the fundamental degrees in the new tonality, and ending up with a
cadence in the new tonality.
This scheme is logical: We check out from the old hotel, and then we move
to the new one, and then we check in there. In fact, Schönberg stresses that a
modulation is more or less an involved trajectory. Here is his original text (in
an English translation):
There is, for example, a very popular harmony treatise in which mod-
ulations are almost exclusively made using the dominant seventh or
diminished seventh chord. And the author merely demonstrates that
after each major or minor triad, any of those two chords can be played,
23.1 The Concept of a Tonal Modulation 193
tonic c : ionian
tonic d : dorian
tonic e : phrygian
tonic f : lydian
tonic g : mixolydian
tonic a : aeolian
tonic b : locrian
This means that in our modulation theory, we may also change the mode
without changing the theoretical model, since the tonic is just an additional
specification without deeper consequences. Figure 23.2 shows the system
Dia(3) of all tonalities, which we also call triadic interpretations of the
major scales.
3. Cadence Let T ria be the subset of 2Z12 consisting of all major, minor, or
diminished triads. There are five minimal subsets of degrees of a tonality
X (3) such that only this tonality contains these degrees. The types are as
follows:
that maps a tonality into one of the five cadence types. For example,
cadi (X (3) ) = {IIIX , IVX }.
4. Mechanism We are given two different tonalities X (3) , Y (3) . We model
the modulation mechanism using an idea from particle physics. In physics,
there are four basic forces: electromagnetic, strong, weak, and gravitational
force. Every force is supposed to materialize in bosons or force quanta.
Electromagnetism interacts via photons, weak force interacts via W + , W − ,
and Z bosons, strong force interacts via gluons, and gravitation interacts
via gravitons. In modulation theory we interpret the forces that “transform”
X (3) into Y (3) as symmetries g ∈ T Z12 that map the scale X into the scale
Y . Observe that such a symmetry automatically maps the degrees of X (3)
into the degrees of Y (3) . This yields our concept:
Fig. 23.3. The inner symmetry A of X together with the transposition T t from X
to Y .
find enough quantized modulations and that their fundamental degrees are
those Schönberg found in his harmony.
Fig. 23.4. The nerves of a modulation C (3) → E(3) . On top is the nerve Q(3) of the
modulation quantum.
198 23 Modulation Theory
modulation table in Section 23.5, we hear note g as an octave interval: pedal and
stationary voice in the first part. The pitches of the first part, when transposed
into the octave spanned by the two g notes, show a regular melodic structure,
see Figure 23.6, bottom.
This structure has two parts: the first in measures 124-125, and the sec-
ond in measures 126-127. They are related to each other by the inversion at
d, which is the same as the inversion at g in pitch classes. This first part of
the modulation makes evident the inversion Id before we see the fundamental
degrees in the second part of the modulation. But why this preliminary inver-
sion? It is the modulator for the modulation in our model, Id (G(3) ) = E(3) .
This strategy is a beautiful compositional realization of what our model speci-
fies. The model does predict fundamental degrees, and it does so on the basis
of modulation forces that are provided by modulator symmetries. Beethoven
not only writes down the fundamental degrees, but also makes evident the
modulator in the first part of the modulation.
Our interpretation in this analysis does not assume that Beethoven has
performed his construction using the ideas of our model. But he might have
done so instinctively; one cannot know such hidden layers of creativity. This
situation is parallel to what happens in physics. We discover physical laws, but
we cannot know whether a divine creator (if this is the underlying cosmolog-
ical hypothesis) has constructed the universe according to these laws, which
are our way to understand nature. Nevertheless, the laws hold, and so does
our modulation module for the critical system of modulations in Beethoven’s
composition.
Concluding this chapter, we should add that our model also holds for
other compositions by Beethoven, for example for modulations in the Cavatina
movement of String Quartet op. 130.
23.5 Quanta and Fundamental Degrees 201
1
C -> F C -> B C -> Es
8 X XX X X XX XXX D X X XX b XX X b XX D XX XX X b
b XX b X XXX D
& 4 XX X X X XX XX X X X X X XX X X
8 X bX X X X X
D X X X XX X bX
D X X
bX b X X b XX
D
?4
IC VIIF IF IVF VF IF IC V VIIB VII I IC VC IIEs VEs VEs IEs
C IIIB B B
8
IIF VB VIIEs VIIEs
4 A B C A B C A B C
& XX XXX bX bX X X X
X XX b XX X X bX bX
X X bX bX bX
b X b X b X D X X XX X b XX XX b X D
? X bX D X
VIIAs IV V II VI VII VIIGes VGes I
IC IVC As As IAs IC IIIDes Des IIDesVDesIDes IC C C Ges
IIAs VIIDes IVC VIC IIGes IIGes VIIGes
A B C A B C A B C
XX XX X X #X X
X # X # XX
C -> H C -> E
XX # XX # XX # XX D
7
& #X XX XX # XX XX
X
X
X
X
XX XX # XX XX # # XX X X X X X X
#X
X
XD
? X X X #X
IIH VE
IC VC IIC IVH IVH VH IH IC VI IVE VE IE
VIIH C VIIE
A B C A B C
C -> D C -> G
# XX XX XX # XXX D XX XXX XX
C -> A
X XX X X XX # XX X D
10
& XX XX XX XX # XX D X X X X XX
X X X X X X XX # XX XX XX X D
? X X XX # XX XX XX D XX X X #X X D XX
X
X X
IVA IID
IC VC IC VI
IVA V I
VIIA A A C VC VII VD ID IC VC IIIG IIG VG IG
D
A B C A B C A B C
Fig. 23.7. Examples of modulations C (3) → Y (3) according to our model. The three
Schönberg steps are notated with A, B, C. The modulation to B (3) is notated by
the German letter H for B, while the symbol B stands for English B. The sound
example is modulation.
Part VI
Summary. Rings are the basic structures for algebra. We already have many
examples of rings: the integers, real and complex numbers, and the structure
of addition and multiplication that was defined on Zn in the chapter about the
third torus and its geometry.
–Σ–
again the required type. The 0R
M element here is the zero function f (m) = 0
for all m ∈ M . The 1R
M element is the function f (eM ) = 1R and f (m) = 0R
for all m = eM . There is a natural commutative subring of RM , namely the
set of functions fr (eM ) = r and fr (m) = 0R for m = eM . We denote these
functions fr by r if no confusion of notation results. Therefore we may identify
R with the subring of these special functions within RM . The monoid M is
also embedded in RM by fm (m) = 1R and fm (n) = 0R for n = m. Check
that the monoid multiplication carries over to the multiplication among these
special functions, and we may identify M with this multiplicative submonoid
of RM , see the following diagram.
R - RM
6
M
A core example of such an algebra is provided when taking the word
monoid M = P ath(A) of an alphabet A, denoted by RA. For our purposes,
the most important case is the simple one-letter alphabet A = {X}, where X
is usually called the indeterminate of this algebra. The path monoid consists of
the natural powers X n , n ∈ N of X. This algebra is called polynomial algebra
and is denoted by R[X]. The algebra’s elements are represented as follows: For
an element a ∈ R and a natural number n, we can consider the function aX n
defined by aX n (X n ) = a and aX n (X m ) = 0 for m = n. Then if the natural
powers of X, where the function values f (X n ) = an might not vanish, are ≤ N ,
we can write
f= an X n .
n≤N
24.1 Monoid Algebras and Polynomials 207
And this representation is unique. Such functions are called polynomials in the
indeterminate X and coefficients in R. In particular, the 1 element is 1 = 1R X 0 ,
which we identify with eP ath(A) and the zero element is 0 = 0R eP ath(A) = 0R 1.
The ring elements an in a polynomial are called its coefficients of a determined
power of the indeterminate. The addition of two polynomials is as follows:
an X n + bn X n = (an + bn )X n
n≤N n≤N n≤N
ˇ “*
Example 54 The map (pr3 , pr4 ) : Z12 → Z3 × Z4 : x12 → (x3 , x4 ) defined
in Section 21.1 is a ring homomorphism, but the map (also defined there)
Id × (−1) × (pr3 , pr4 ) is not a ring homomorphism because the 112 does not
map to the unit (13 , 14 ).
where Ringsf (RA, S) denotes the set of ring homomorphisms that restrict to
f when restricted to R.
In particular, if A = {X}, the polynomial ring homomorphisms g that
extend a given ring homomorphism f : R → S are in bijection with the elements
a ∈ S by the association g → g(X). Such homomorphisms are called polynomial
functions. For example, if f : R → C is the canonical embedding of the reals in
the complex numbers, the polynomial functions g : R[X] → C nare given by the
value x =
f (X), and the functions map a polynomial n an X to the complex
number n an xn .
ˇ “*
Example 55 Polynomial functions play a crucial role in the theory of mu-
sical performance. We shall deal with that theory in Chapter 32. But let us
preview the role of polynomials in this theory. Often, musicians have to perform
changes of musical parameters that are not explicitly notated. A basic example
is glissando, where the score notation shows only the initial pitch p1 and the
final pitch p2 of glissando. The musician then has to move from the beginning
to the end in a continuous curve of intermediate pitches. Whenever this move-
ment has to be defined precisely, be it for a software that implements glissandi
or for theoretical reasons, such a glissando curve must be defined in explicit
terms. Then we have to define a function gliss : [a, b] → R from a time interval
[a, b] to the real-number-valued pitch domain R. Typically such a function gliss
is defined by a polynomial function of shape P [X] = rX 3 + sX 2 + tX + d. But
we also want that P (a) = p1 , P (b) = p2 and that the slope of the function is
horizontal in the two limit times. Slopes will be discussed in Chapter 31. Such
a function is shown in Figure 24.1.
For other parameters, such as time changes with tempo, ritardando or
accelerando, and dynamics changes such as crescendo or diminuendo, similar
polynomials functions are used, but see Chapter 32 for details.
rI ⊂ I
Proof 14 The proof is a construction that we need to spell out. First, suppose
that I = Ker(f ), f : R → S. Then for i ∈ I and r ∈ R, we have f (ri) =
f (r)f (i) = f (r)0S = 0S , and thus the property rI ⊂ I. Conversely, suppose
this property holds for I. Then, as R is a commutative additive group, we may
apply the theorem in Section 19.2 with its commutative diagram of groups:
G
f
- K
6
f
proj inj
?
?
--
∼f 6
G/H → Im(f )
to our situation for G = R, K = S, and H = I, getting a group diagram
R
f
- S
6
f
proj inj
?
?
--
∼f 6
R/I → Im(f )
210 24 Rings and Fields
The missing part here is that R/I is only a group, not a ring. The ring
structure on R/I is defined as follows: For r + I, s + I in R/I, we define
r + I ∗ s + I = rs + I.
We call the quotient group R/I with this ring structure the quotient ring
of R modulo the ideal I.
√
Example 23 For any subgroup nZ ⊂ Z, n ∈ N, we have the quotient ring
Zn := Z/nZ that was defined in Section 21.1.1. This example is a special case
of a principal ideal, which by definition is an ideal of shape I = rR, which one
denotes by (r).
The other example that also played a major role in our earlier theory is
the ideal O ⊂ C of zero sequences in the ring C of Cauchy sequences used to
define real numbers as a quotient ring R = C/O in Chapter 12.
defined by sending X to the imaginary unit i, and by the identity on the real
coefficients. We now that f is surjective because the images of aX + b are all
imaginary numbers ai + b. The kernel of f contains the principal ideal (X 2 + 1)
since f (X 2 + 1) = i2 + 1 = 0.
We now show that Ker(f ) = (X 2 + 1). This will imply that R[X]/(X 2 +
∼
1) → C, i.e., the complex numbers can be constructed from the problem set
R[X] and a quotient construction! The claimed equation follows from the fact
that any polynomial P ∈ R[X] can be written in the form P = H(X 2 + 1) +
aX + b. We shall prove this fact in the next section. If we use this formula,
then P ∈ Ker(f ) implies 0 = f (P ) = f (H)f (X 2 + 1) + ai + b = ai + b, and so
a = b = 0, i.e., P = H(X 2 + 1) ∈ (X 2 + 1), and we are done.
24.2 Fields 211
24.2 Fields
Definition 63 A non-zero commutative ring R is called a field iff every non-
zero element x ∈ R is invertible, i.e., R∗ = R − {0}.
Corollary 3 The ring of integers Z and the polynomial algebra K[X] for a
field K are principal ideal rings, i.e., every ideal is principal.
Summary. Prime numbers play a crucial role in music theory, and in par-
ticular in the theory of tuning. In this chapter, we prove uniqueness of prime
decomposition for the integers and polynomial rings.
–Σ–
In a commutative ring R, if x = yz we write also y|x and say that y is a
divisor of x.
and we have already shown in Section 9.1 that there are infinitely many primes
here. For R = R[X], all linear polynomials aX + b, a = 0 are irreducible as their
In fact, if (p) is maximal, then p = qr implies (p) ⊂ (q), so either (q) = R, hence
q ∈ R∗ , or (p) = (q), hence p = qr, q = ps, i.e., p = rsp, (1 − rs)p = 0, hence
1 = rs as R is an integral domain. Therefore r is invertible and p is prime. If
(p) is not maximal, there is a strictly larger intermediate ideal (q), therefore
p = rq with both r and q not invertible, i.e., p is not prime.
ˇ “*
Example 57 The quotient rings Zp for primes p are fields, a fact already
mentioned in Section 24.1. This applies to the musical case Z3 . Also, it is im-
portant to note that both Z12 and Z4 are not fields, because there are sequences
0, x, 2x, . . . for x = 0 that don’t cover the entire rings, e.g., 0, 2, 4, 6, 8, 10 in Z12 .
The next result deals with unicity in the prime factorization.
Summary. Matrices are a very classical tabular form to represent data, for
example in accounting. They are built from columns that are juxtaposed and
can be split horizontally into a stack of rows. The novelty in mathematics is
that matrices that are built from numbers can be used to perform calculations
that are of general benefit to mathematics.
–Σ–
Fig. 26.1. A matrix built from columns in Greek temples. For every column j, we
have its building blocks aij , referring to row i. The third dimension of depth is not
dealt with in this book, but there are also three-dimensional matrices in mathematics.
Matrices are the backbone of algebra. They are indispensable for most
concrete calculations, but they also share the structure of a category (we shall
discuss categories in Chapter 29) in a particularly transparent way. Moreover,
matrices provide us with examples of important algebraic structures of modules
(we shall discuss modules in Chapter 27). Finally, matrices also provide us with
core examples of non-commutative rings.
This turns Mm,n (R) into a commutative group, and we have the isomorphism
∼
of groups Mm,n (R) → Rmn , the cartesian product of mn copies of the additive
group of R.
The product of matrices is slightly more involved: If M ∈ Mm,n (R) and
N ∈ Mn,l (R), then we define their product M N ∈ Mm,l (R) by
26.1 Generalities on Matrices 219
(M N )ik = Mij Njk .
j
This means that the coefficient (M N )ik at row i and column k is the sum of
the products Mij Njk of the ith-row coefficients of M with the corresponding
kth column coefficients of N .
To show the number of rows and columns of a matrix M ∈ Mm,n (R), we
also write it as a symbol of a function (and you will see soon that this has good
reasons): M : En → Em .
ˇ “*
Example 59 If we want to list the number of instruments of an orchestra
with n instrument types (violins, violoncellos, bassoons, etc.) with respect to
the m movements, we can describe this by a m × n matrix M , where Mi,j
denotes the number of instruments of type j in movement i.
and
λ.(M + N ) = λ.M + λ.N.
(iii) Scalar multiplication and transposition commute: (λ.M )t = λ.M t .
(iv) (Associativity) (C.B).A = C.(B.A), which we therefore denote by C.B.A.
(v) (Distributivity) If C : El → Ek and B : Em → El are two matrixes over
R, then (C + C ).B = C.B + C .B and C.(B + B ) = C.B + C.B .
(vi) (Homogeneity) If λ ∈ R is a scalar, then λ.(C.B) = (λ.C).B = C.(λ.B),
which we therefore denote by λ.C.B.
(vii) (Neutrality of identity matrixes) We have A.En = Em .A = A.
(viii) (C.B)t = B t .C t .
−1 0
M1 = horizontal reflection
0 1
1 0
M2 = vertical reflection
0 −1
−1 0
M3 = = M1 M2 180o rotation
0 −1
11
M4 = horizontal shearing
01
0 −1
M5 = 90o rotation
1 0
⎛ ⎞
√1 √−1
M6 = ⎝ 2 2⎠ 45o rotation
√1 √1
2 2
26.2 Determinants
We shall not discuss the full theory of determinants, but restrict our attention to
determinants in the case of the above example, i.e., M ∈ M2,2 (R). Determinants
26.3 Linear Equations 223
−
→
are numbers that help determine when the associated map M : R2 → R2 is a
bijection.
ab
Definition 68 If M = ∈ M2,2 (R), then we define det(M ) = ad − cb.
cd
√
Example 29
−1 −1
0 −1 0 1 11 1 −1
= , = .
1 0 −1 0 01 0 1
These matrices also have their determinants in Z∗ , which is essential for the
existence of the inverse with integer coefficients.
3 = 5x1 − 2x2
−2 = 3x1 + 6x2
We have
−1
6 2 1 1
5 −2 1 6 2 6 18
= = 36 36 = .
3 6 36 −3 5 −3 5 1 5
36 36 −12 36
Therefore we get
1 1 7
x1 6 18 3
= 1 5
. = 18
−19
.
x2 −12 36 −2 36
27
Modules
Summary. Many core structures in algebra are richer than groups but poorer
than rings. For example, an ideal I ⊂ R in a commutative ring is an additive
subgroup, but not a ring because it has no 1 in general. However, one may
multiply elements of I with any ring elements. Also, the set Mm,n (R) is an
additive group, but not a ring for n = m. Its structure as a cartesian product
ring Rmn is rarely considered. But again, one may multiply a matrix by a
“scalar” from R. These structures remind us of vector calculus in high school.
This is what we now want to investigate for the sake of music theory. The
structure of this type is called a “module”, and we want to give a short and
very incomplete account of the theory of modules, which plays a major role in
mathematical music theory.
–Σ–
ˇ “*
Example 61 A common module in music theory is REHLD , the R-module
of functions f : {E, H, L, D} → R, where E means symbolic onset, H means
symbolic pitch, L means symbolic loudness, and D means symbolic dura-
tion1 . This module is isomorphic to R4 , and its vectors are the note events
f = (fE , fH , fL , fD ) with onset fE in units of quarter notes ♩, pitch fH in
units of semitones, loudness fL in units of cents Ct, and duration fD in units
of quarter notes ♩. For performance, one accordingly uses the module Rehld of
1
E stands for Einsatzzeit, H for Höhe, L for Lautstärke, D for Dauer, all German
words.
27 Modules 227
ˇ “*
Example 63 This example is crucial for all tuning investigations. We shall
use it in Chapter 28. We saw in Musical Example 58 that Western tuning
systems use the frequency formula f (o, q, t) = f0 2o 3q 5t , where o, q, t ∈ Q. We
learned in that example that the map Q3 → R : (o, q, t) → f (o, q, t) is injective.
Since the logarithm is also an injective function, the map Euler : Q3 → R :
(o, q, t) → o log10 (2) + q log10 (3) + t log10 (5) = log10 (f (o, q, t)) − log10 (f0 ) is
also injective (the basis 10 is irrelevant, it works with any basis). But this map
is also Q-linear if we take the Q-module structure Q R of R. This means that
we have an identification of pitch (modulo a basic pitch f0 which we ignore for
the time being) with triples of rational numbers. It is crucial here to recognize
that the same function with real numbers
Euler : Q3 → QR
that we call the Euler embedding in honor of the great mathematician Leonhard
Euler, who was the first to define tuning systems in terms of logarithms. The
image Q-module Im(Euler) =: EulerSpace is called the Euler space. We shall
discuss it in Chapter 28.
The set ModR (M, N ) of R-linear homomorphisms f : M → N is an R-
module by these operations: For f, g ∈ ModR (M, N ), f + g : M → N : m →
f (m) + g(m), and for r ∈ R, rf : M → N : m → r(f (m)).
228 27 Modules
Proof 16 We use the universal property of direct sums for the situation Rn =
i R. A homomorphism f is equivalent to the family (fj ) ∈ M odR (R, R ).
m
But the factors fj are precisely the functions f (ej ), as the homomorphisms
g : R → Rm are given by their values on 1R . In particular, if n = m = 1, we
∼
have the isomorphism ModR (R, R) → R that is defined by f → f (1R ).
ˇ “*
Example 64 If we work in EulerSpace, we may consider Q-linear func-
tions f : EulerSpace → Q from the Q-module to the coefficient ring Q (in-
terpreted as Q-module too). The general context of this situation is the space
ModR (M, R) of so-called linear forms on an R-module M . A very important
such form on ModQ (EulerSpace, Q) is the form Γ0 (o, q, t) = 1.o + 2.q + 4.t =
(2 − 1).o + (3 − 1).q + (5 − 1).t. It is this form that essentially defines what Euler
had called gradus suavitatis, his quantitative measure for the degree of conso-
nance of an interval in EulerSpace. We see that he uses the three prime num-
bers 2, 3, 5 to define his linear form. Euler proposed that the function 1+Γ01(o,q,t)
would measure the consonance of the interval o = o1 −o2 , q = q1 −q2 , t = t1 −t2
between pitches (o1 , q1 , t1 ), (o2 , q2 , t2 ). Figure 27.1 shows the gradus function
values (the 10-fold) for frequency ratios within the just-tuned octave.
is a central structure in music theory of just tuning. We shall deal with this
one in Chapter 28.
27 Modules 229
10 / Γ
10
0
interval
1 16 9 6 5 4 45 3 8 5 16 15
1 15 8 5 4 3 32 2 5 3 9 8
Fig. 27.1. The gradus suavitatis function values (the 10-fold) for frequency ratios
within the just-tuned octave.
M
f
- L
6
f
proj inj
?
?
--
∼f 6
M/L → Im(f )
230 27 Modules
where proj is the canonical surjection, inj is the embedding of Im(f ), and f
is the isomorphism of R-modules that sends the coset m + Ker(f ) to f (m).
ˇ “*
Example 66 In Musical Example 20, we explored the group T I of functions
on Z of the form T t ◦ ±1. These are examples of affine homomorphisms of the
Z-module Z. In Section 21.1.1, we also discussed the group T Z∗12 , which is a
group of affine automorphisms of the Z12 -module Z12 , and an example of a
group of affine automorphisms of the R-module R defined by the invertible
factors r ∈ R∗ for the affine homomorphisms T t ◦ r.
We often identify the matrix group Mm,n (R) with the corresponding group
of R-linear homomorphisms M → N and then write T Mm,n (R) for the set of
−
→
affine homomorphisms f : Rn → Rm defined by f = T t ◦ M , M ∈ Mm,n (R),
and we also write f = T t ◦ M or even f = T t M if no confusion is likely.
√
Exercise 29 Prove that for an R-affine homomorphism f = T t ◦ f0 , its
translation element t and its R-linear homomorphisms f0 are uniquely deter-
mined by f .
ˇ “*
Example 67 In music theory, most morphisms are not linear, but affine.
For example, the pitch function P itch(o, q, t) = log(f0 ) + o log(2) + q log(3) +
27.1 Affine Homomorphisms 231
t log(5) is affine, with transposition log(f0 ) and linear part o log(2) + q log(3) +
t log(5). Also, Euler’s gradus suavitatis function is Γ (o, q, t) = 1 + Γ0 (o, q, t),
with transposition part 1.
within the pitch class module Z12 . But he also wants to apply these transforma-
tions to the other parameter series. This causes a serious conceptual problem
of serialism since, different from pitch classes, there are no naturally selected
classes of duration, attack, or loudness, let alone natural transformations on
these spaces. Boulez applies a very mathematical trick to solve the problem of
carrying over pitch-class transformations to the other parameters.
Denote by ei = (0, 0, . . . 1, 0 . . . 0), i = 1, 2, . . . 11 the vector in Z11 that has
zeros except for the ith coordinate, which is 1. To begin with, the pitch series
SP itch is represented by a Z-affine homomorphism S P itch : Z11 → Z12 which
sends the vector ei to SP itch,i for i = 1, . . . 11 and 0 to SP itch,0 . This can be done
by the homomorphism S P itch = T SP itch,0 Q, where Q(ei ) = SP itch,i − SP itch,0
is the linear part. If we apply a permutation p of the pitch classes, we get a
new series p ◦ S P itch : Z11 → Z12 . But this series can also be obtained by first
permuting the vectors ei , i = 1, 2, . . . 11 and 0 and the applying the original
∼
series S P itch , i.e., by an Z-affine isomorphism P : Z11 → Z11 defined by the
permutation of ei , i = 1, 2, . . . 11 and 0, such that
p ◦ S P itch = S P itch ◦ P.
theorists had understood musically was confirmed many centuries later on the
level of mathematics.
Proof 17 The proof goes by induction. Suppose that (xi )i are linearly inde-
pendent, then we are done. Otherwise, there is an equation 0 = i ri xi with
one coefficient rj = 0. We may supposethis is the first one (after a permuta-
tion of indices). Then we have x1 = −1
r1 i>1 ri xi , and the shorter subsequence
(xi )1<i generates M , so induction on its length proves the claim.
The next lemma proves that the size of a basis is uniquely determined and
thereby (as observed earlier in this chapter) defines the dimension dim(M ) of
a K-vector space.
Fig. 27.2. Xenakis’ UPIC. The drawing shows a graphical score of Xenakis’ compo-
sition Mycenae Alpha.
Fig. 27.3. Mazzola’s presto music software with a drawing of Beethoven on the
plane of onset and pitch.
Fig. 27.4. A small example of a color drawing on the RUBATO software’s BigBang
module.
and Sergei Prokofiev (see Figures 27.5 and 27.6). To compose, we can apply
onset
6
4
2
0
20
pitch
0
20
50
100
loudness
Fig. 27.5. First bars of Mozart’s Eine kleine Nachtmusik. The colors indicate the
instruments: first and second violin, viola, violoncello, and double bass.
onset 4
20
pitch 0
20
30
40
50
loudness
60
70
Fig. 27.6. The first three measures of a Classical Symphony by Prokofiev. The colors
distinguish between orchestral sections.
238 27 Modules
Fig. 27.7. Snail and conchoid. Top: The snail’s shell is mathematically described
by a spiral, using the equation of a conchoid. A selection of points is transformed to
music. Bottom: an extract of the score that is derived from the conchoid equation.
The sound example is snail.
via Mathematica software2 . The first experiments have been published under
the title Musica Tridimensionale [66]. Images turned into music include Bicycle
wheel by Marcel Duchamp, Concetto spaziale by Lucio Fontana, the shape of
Sagrada Família by Antoni Gaudí (image and score, respectively: Figures 27.8
and 27.9), a snail shell (mathematically described by a conchoid3 , Figure 27.7),
and the Eiffel Tower. Choice of timbres is left to composer. It is also possible
to return to an image from the created music; however, the result will be much
simpler than the original image, due to the choice of discretization.
2
Mathematical parameters: Pitches from 0 (middle C) with semitone distances up
and down; intensity on a scale with 60 = f, 50 = mf and so on; time was measured
in seconds.
3
Equation of a conchoid:
⎡ ⎤ ⎡ ⎤
x ku (1 + cos ν) cos u
⎢ ⎥ ⎢ ⎥
⎢y ⎥ = ⎢ ku (1 + cos ν) sin u ⎥ , u = 0, ... 6π, ν = 0, ... 2π, k = 1.2, k2 = 1.2, a = 1
⎣ ⎦ ⎣ ⎦
z ku sin ν − ak2u
240 27 Modules
Fig. 27.9. The beginning of the score derived from the drawing of Sagrada Família.
The sound example is sagrada_extract.
28
Just Tuning
Just tuning is the set of pitches that are defined by integer coordinates in
the Euler map, i.e., Just := Im(Z3 ) ⊂ EulerSpace. This is a discrete grid in
EulerSpace or Q3 . The plane spanned by the fifth and third is “copied” one
octave higher by adding to it the octave vector (1, 0, 0) ∈ Q3 . Therefore every
fifth-third plane is generated by adding a number of octaves to the original
plane. A selection of the 12 pitches within one chromatic octave in just tuning
is shown in Figure 28.1, bottom. This is Martin Vogel’s choice, but there are
several options, see [113].
The subspace of just tuning with fifth coordinate t = 0 is the Pythagorean
tuning. It involves only powers of 2 and 3—octave and fifth—as seen in the
tetractys symbol of the Pythagorean school (Section 2.1). The distribution of
thirds
12-tempered
C-chromatic
one octave c
d fifths
d
e
e
f
g
g
a
a
b
octaves b f
b
e
just (Vogel) a
d
C-chromatic
g
c thirds
f
e
b
fifths
a octaves
d
Fig. 28.1. One octave of the 12 chromatic pitches in just (bottom) and 12-tempered
tuning (top).
these 12 pitches looks quite random, but it can be shown that there is exactly
one affine automorphism of Z3 that maps this just C-chromatic set onto itself.
It turns out that this automorphism is essentially the autocomplementarity
symmetry, which we discussed in counterpoint theory in Section 21.2.3. See
[75, Section 24.1.1] for details.
This grid structure of just tuning motivated Leonhard Euler to advance his
substitution theory of human pitch perception. He claimed that humans always
substitute musical intervals with the nearest just intervals. This was necessary
for his gradus suavitatis function, which is only defined for just-tuned intervals,
see our discussion in Musical Example 64. We shall see in Chapter 30 that
Euler’s approach has a serious deficiency.
The second tuning, 12-tempered tuning, is defined√by frequency ratios that
are integer powers of the 12-tempered semitone ratio 12 2, and when we take the
logarithms thereof, we get the pitches log(f0 ) + 12o
log(2), o ∈ Z. One such 12-
tempered octave is shown in Figure 28.1, top. The dramatic difference between
these two tunings makes clear that the mathematics of music theory for just
and 12-tempered tuning looks quite different. It is remarkable that despite this
difference, many results, especially in modulation theory, are quite similar for
both tunings.
28.1 Major and Minor Scales: Zarlino’s Versus Hindemith’s Explanation 243
The following table lists the just coordinates of the just-tuning intervals
(with respect to c, first tone in first column) according to Martin Vogel [113],
together with the interval’s value in Cents and the percent deviation from the
tempered tuning with 100, 200, 300, etc. Cents.
thirds
a e b f
b f c g d
d a e
fifths
C-chromatic
thirds thirds
fifths fifths
Fig. 28.2. C-major, natural C-minor, and C-chromatic octaves in the fifth-third
plane.
inner symmetry
refl
ecti n
on ctio
sve
tran
Fig. 28.3. The inner symmetry A of C-major is the composition of a reflection and
a transvection (a more mathematical word for shearing).
28.2 Comparisons between Pythagorean, Just, and 12-tempered Tuning 245
ˇ “* Exercise 10 Check that A permutes the pitch classes of the C-major scale.
The relation we are looking for is this:
H = Z ◦ A,
which means that both theories are in fact defined by symmetries. Hindemith’s
approach is simply Zarlino’s modulo the inner symmetry of the C-major scale.
A major chord, for example {c, e, g}, is given by a major third c − e and a
perfect fifth c − g, whose intervals are 54 and 32 , respectively. Interval e − g is a
minor third, given by 56 . Ratios are thus given by 2 : 3 : 4 : 5 : 6, called simple
ratios.
Music’s historical complexification, with the introduction of modulations
between different tonalities, requires a system that should be stable with respect
to changes of tonality. But when changing tonalities, there is an unsolved prob-
lem with the just tuning system. For example, if the octave from c is set in just
tuning, the major second c−d is not the same interval as the major second d−e.
The first has a 9/8 frequency ratio, while the second has a (5/4)/(9/8) = 10/9
ratio. Therefore changing from C-major to D-major changes that initial major
second interval. Musical instruments are well tuned in one tonality. However,
while playing in other tonalities, intervals are different, and good ratios are
lost. This is the drawback of the just tuning system!
For these reasons, scientists and musical theorists developed the 12-
tempered system (also erroneously called well-tempered, but this is still an-
other tuning). In this tuning system, the pitch differences between chromatic
and diatonic intervals are canceled. Enharmonically equivalent notes as c and
d now have exactly the same frequency. The chromatic 12-tone scale contains
1
Properties of an oscillating string are studied in classical mechanics, a branch of
physics. In particular, the model of vibrating string is a really useful paradigm in
physics, from classical to modern physics, comprising string theory.
28.3 Chinese Tuning Theory 247
√
12 equal intervals, each
√ one with the frequency ratio size of 12 2. After 12 notes,
we correctly obtain ( 12 2)12 = 2, that is the double of frequency as required by
definition of octave.
Zaiyu Zhu calculated the equal-tempered system in 1584, see Section 2.4. How-
ever, the earliest Chinese tuning system, which we will describe briefly here, has
several correspondences with the Pythagorean system. It is called the thirds-
divide subtract-add (三分损益) method. Like the Pythagorean system, it is
based on the evaluation of frequencies using the length of a portion of vibrating
string.
Summary.
We have often referred to certain types of
structures—sets, monoids, groups, rings, digraphs,
or modules—where there was a shared structural
characteristic: All of these structures have objects
(such as sets, monoids, groups, etc.) and a type of
“function” (set functions, monoid morphisms, di-
graph morphisms, etc.). And all of these functions
can be composed if domains and codomains can
Fig. 29.1. Categories are
be ‘concatenated.’ The common denominator of the smiley of contemporary
these structures is the concept of a category. mathematics.
–Σ–
We have a category Set of sets, a category Grp of groups, etc. Categories
are great fun if one accepts their basic structure. So don’t shy away; it’s easier
than learning to read a score, and you will smile like the smiley in Figure 29.1.
Here is the definition of a category.
g
g◦f
Z
This means that a category is essentially a directed graph (possibly larger
than a set) together with the composition of arrows.
√
Example 35 The category Set of sets has as Ob(Set) the collection of all
sets, while M or(Set) consists of the sets Set(X, Y ) of set functions f : X → Y .
The composition of morphisms here is the classical composition of set functions.
The category Grp has as Ob(Grp) the collection of all groups, while the
morphism collection M or(Grp) has the sets Grp(G, H) of group homomor-
phisms f : G → H as morphisms.
The category M(R) for a commutative ring R has as objects the identity
matrices En = En (R), n > 0 a natural number, and this collection is a set! The
morphisms En → Em are the matrices in Mm,n (R). Composition of morphisms
is the product of matrices, and the identity matrices are also the identical
morphisms. This category is particularly nice since it has no special objects;
the objects are just the identity morphisms. Category theory can in fact be
defined without distinguishing objects from morphisms. This is the modern
point of view: Everything is a morphism. And matrices are a beautiful and
simple example of such a completely “morphic” category.
ModR is the category whose objects are the R-modules for a commutative
ring R, while the morphisms are the R-linear homomorphisms. AffR is the
category whose objects are the R-modules for a commutative ring R, while the
morphisms are the R-affine homomorphisms. ModR is a subcategory of AffR ,
ModR ⊂ AffR ; the objects are the same, but ModR has only the r-linear
homomorphism as morphism sets, a subsystem of all R-affine homomorphisms.
If C is a category, its opposite category C opp is defined as follows: We set
Ob(C ) = Ob(C) and C opp (X, Y ) = C(Y, X). The composition is the same
opp
as for C, but in reversed order, i.e., for f ∈ C opp (X, Y ), g ∈ C opp (Y, Z) we set
f ◦C opp g = g ◦C f .
ˇ “*
Example 71 One of the most important categories in mathematical music
theory is the category LocR of local compositions in R-modules. The objects
are pairs (K, M ) of a subset K = ∅ of an R-module M . The typical examples
are sets K of notes in a score, where the score is thought to be an R-module,
such as REHLD (see Musical Example 61). If (K, M ), (L, N ) are two local
compositions, a morphism f : (K, M ) → (L, N ) is a set map f : K → L such
that there is an affine homomorphism F : M → N with F |K = f . We have seen
many such morphisms, e.g., symmetries, such as inversions or retrogrades that
29 Categories 251
R? : LocR → ModR
with every local composition a module over the composition’s ring R. This helps
us verify the necessary conditions for local compositions to be isomorphic, since
if they are so, then their modules must be isomorphic, too. For example, the
triads {0, 3, 6}, {0, 4, 7} ⊂ Z12 cannot be isomorphic since Z{0, 3, 6} = 3Z12 ,
while Z{0, 4, 7} = Z12 .
For every category C, we have the category C @ (@ for “address,” see below)
of contravariant functors F : C → Setopp as objects and natural transforma-
tions as morphisms. For every object X of C, we define a contravariant functor
@X : C → Setopp by @X(Y ) = C(Y, X). It maps a morphism f : Y → Z to a
set map @X(f ) : C(Z, X) → C(Y, X) : g → g ◦ f . If h : X → Y is a morphism
in C, the natural transformation @h : @X → @Y maps C(Z, X) to C(Z, Y ) via
k → h ◦ k. This is a functor Y : C → C @ . It is the Yoneda functor that was
defined by Japanese computer scientist Nobuo Yoneda in 1956. Its relevance is
the following lemma:
Fig. 29.2. The Matterhorn from the usual perspective (right) and from the Theodul
glacier (left).
Fig. 29.3. Glenn Gould (left) interprets Beethoven without addressing his gestural
creativity, as opposed to Vladimir Horowitz (right), who fully realized this dimension.
Fig. 30.1. The musical glissando on a piano and a violin are different. On a violin,
the pitch changes smoothly, while on the piano we have a discrete set of pitches,
similar to walking down steps as opposed to skiing down a mountain.
(ii) for any non-empty family (Oi )i of closed sets, the intersection i O is
closed,
(iii) for any finite family (Oi )i of closed sets, their union i Oi is closed.
If x ∈ X is an element of a topological space X, an open neighborhood of
x is an open set that contains x.
√
Example 37 The most important example is X = R, where the open sets
O ⊂ R are the subsets such that for every x ∈ O, there is an open interval
]a, b[:= {x|a < x < b} with x ∈]a, b[⊂ O. In particular, open intervals are open
sets, but closed intervals [a, b] = {x|x ∈ R AND a ≤ x ≤ b} are not. See Figure
30.2, left part, for an open interval.
If X, Y are topological spaces, the cartesian product topology is the topol-
ogy on X × Y where the open sets are those O ⊂ X × Y such that for every
x ∈ O, there is a pair of open sets U ⊂ X, V ⊂ Y such that x ∈ U × V ⊂ O.
A similar definition holds for the cartesian product topology of a finite family
(Xi )i of topological spaces Xi .
The usual topology on Rn is the cartesian product topology induced from
the above topology of R. The open sets are those subsets O ⊂ Rn that contain
for every point x = (xi ) ∈ O a cartesian product ]a1 , b1 [× . . .]an , bn [ of intervals
with xi ∈]ai , bi [ for all i = 1, . . . n. See Figure 30.2, right part, for an open set
in R2 .
If X is a topological space, a subset Y ⊂ X is turned into a topological
space by assigning the open sets U ⊂ Y to those sets of shape U = Y ∩ O for
30 Continuity 259
Fig. 30.2. Open sets in R and R2 . In R2 , each element x of the open set U is in a
rectangular open neighborhood that is contained in U.
an open set O ⊂ X of the comprising space. This topology is called the relative
topology.
ˇ “*
Example 73 In musical performance, continuous functions are mandatory
for good rendition. If we want to produce a crescendo between onset time t0
260 30 Continuity
Definition
77 A topological space X is compact
iff any covering family (Xi )i ,
i.e., i Xi = X, has a finite covering subfamily k=1,...m Xik = X.
For example, a cartesian product of closed intervals [ai , bi ], i = 1, . . . n in
Rn is compact.
for any x ∈ Rn and radius e > 0. It can be shown that the topology on Rn
defined by open balls coincides with the product topology defined above.
ˇ “*
Example 74 Distances in Rn play a role in the theory of motivic similarity.
In this context, we consider a musical motive of k notes or k-motive to be a
sequence m = (m1 , m2 , . . . mk ) of elements (the notes) mi ∈ REHLD in a
symbolic space REHLD of onset, pitch, duration, and loudness with real values.
We suppose that the Euclidean distance is selected on REHLD . The set M otk
of all k-motives is then associated with the space (REHLD )k . On this space,
we define a distance between motives m, n by d(m, n) := maxi (d(mi , ni )). It is
easy to verify the axioms of a distance (Definition 78) for M otk .
With this distance among motives, one may consider motivic similarity ei-
ther directly via a measurement of distances or by investigation of balls Be (m)
of k-motives around a given k-motive m. Melodic similarity compares motives
by their topological position, not by symmetries that could transform motives
into each other. This splits musical paradigmatic analysis into two quite differ-
ent subfields: the “para” part—topological similarity, “side by side,” versus the
“deigmatic” part—“pointing to”: transformational relations. See [75, Chapter
22] for more details about the now active research field of motivic topologies.
The topological aspect was unfortunately ignored by music psychologist
Christian von Ehrenfels when he characterized “gestalt” as something that has
supersummativity, the whole is more than the sum of its parts, and transposabil-
ity, you can transpose a gestalt without changing it. He explained his concept
with the example of a melody. But he should have added that a gestalt is
invariant under similarity, you may deform it slightly without changing it.
For function sets Top(X, Y ) one considers the following set of subsets
of functions. One selects a compact set C ⊂ X and an open set O ⊂ Y .
The subset of the compact-open topology on Top(X, Y ) consists of all sets
(C, O) = {f |f ∈ Top(X, Y ) AND f (C) ⊂ O}. The compact-open topology
has the universal property that for any three topological spaces X, Y, Z, where
Y is locally compact Hausdorff, the composition map
Fig. 31.2. The absolute value function |x| is not differentiable for argument x = 0.
√
Example 39 If f = P : R → R is a polynomial P (X) = i ai X i , then it
is differentiable and Df (x) = i>0 iai xi−1 . The R-affine functions T t ◦ M are
differentiable, and we have Df (x) = M . The function sin(x) has D sin(x) =
cos(x), the function cos(x) has D cos(x) = − sin(x).
Fig. 31.3. Tempo relates to the function that sends symbolic score time to physical
time. It is the inverse differential quotient of this function.
1
The symbol de/dE is a traditional writing of D℘E , the derivative then being called
differential quotient.
32
Performance
We shall not deal here with the semiotic aspect of performance, but simply
describe the most elementary ingredients of structure theory. A more complete
description can be found in [75, Parts VIII-XII].
Expressive performance research was first received with much skepticism be-
cause it was believed, above all by traditional performance experts in musi-
cologist circles, that mathematical and musical precision were incompatible.
Or that some performance (machine-made or human) could be mathematically
precise but musically invalid, and that differences in the quantitative perfor-
mances could be irrelevant to the precision of the musical message. This turned
out to be a misunderstanding due to a confusion between structural determi-
nants and the semiotic contents. To put it simply: The relationship between
form and content is a complex one. But it is not true that structural precision
does not matter. It just doesn’t guarantee the transfer of a deeper content.
Already Theodor Wiesengrund Adorno in his correspondence with Walter
Benjamin [2] had recalled1 : “Walter Benjamin hat das Vermögen der Phan-
tasie die Gabe, im unendlich Kleinen zu interpolieren definiert. Das bedeutet
blitzhaft die wahre Interpretation. (...) Im dicht gewobenen Zusammenhang des
Notentextes sind die minimalen Hohlräume zu entdecken, in denen sinnverlei-
hende Interpretation ihre Zuflucht findet. (...) Das Medium künstlerischer Fan-
tasie ist nicht ein Weniger an Genauigkeit, sondern das noch Genauere.” This
wonderfully arcane text can be clarified if we recall that there is an exquisite
science of infinite precision, namely differential and integral calculus. We take
this reference as a starting point for a mathematically rigorous theory of mu-
sical performance.
Musical scores are not yet music: They indicate to performers the points to
reach. Let us briefly discuss some performance-related components of Western
scores. A classic Western score mainly contains information about pitch, onset,
duration, loudness, and timbre.
The names of the notes indicate their pitch class, i.e., c d e f g a b in Anglo-
Saxon notation, c d e f g a h in German notation, and Do Re Mi Fa Sol La Si
1
Walter Benjamin has defined the power of fantasy as the talent to interpolate in the
infinitely small. This immediately means true interpretation/performance. (...) In
the densely woven context of the score’s text minimal cavities are to be discovered,
wherein meaningful interpretation has found its refuge. (...) The medium of artistic
fantasy is not a diminution, but an augmentation of precision.
32.2 Musical Notation for Performance 269
the integral over the inverse tempo function T . This one-dimensional case is
used to set up a formalism to solve the general case. The inverse derivative
D℘−1
E that defines tempo can be generalized to the inverse Jacobian J℘(X)
−1
.
But tempo is a number and a matrix is not a number. To solve this problem,
the mathematical analysis shows that we may apply the matrix J℘(X)−1 to
the vector Δ = (1, 1, 1, 1) ∈ Rehld . This defines a vector
( ℘צX) = J℘(X)−1 .Δ
in REHLD . Similar to the tempo, this is a vector for every argument of the
symbolic space. Such a structure is called a vector field, and in our perfor-
mance theory, we call it the performance field ℘צof ℘. Figure 32.1 shows a
performance field in the two dimensions E, H of onset and pitch. It is defined
from the one-dimensional tempo and intonation fields T, S.
For onset and duration, one gets a tempo-articulation field in the plane of
E and D, see Figure 32.2. This is a consequence of the fact that with tempo,
duration as the difference of onset and offset of a note can also be defined. But
in general, with staccato and legato, duration has a more complex shape.
Similar to the integration of T1 to get the performance onsets, one can
integrate the field ℘צand get the performed point x = ℘(X). This is quite
difficult to explain in detail, but there is an intuitive approach that everybody
32.3 Structure Theory of Performance 271
Fig. 32.1. The performance field for onset and pitch that is defined by the tempo
and intonation fields.
Fig. 32.2. The performance field for onset and duration that is defined by the tempo
field with duration being derived as a difference of offset and onset of a note.
can understand. The integration in the general case involves looking at integral
curves of such a performance field. Imagine the field as being the velocity field
of a water current on the surface of a river. If you are in a boat, your trajectory
will be a curve that is determined by the velocity field of the river. This is
what they call an integral curve. One may calculate the integral curves of
performance fields that go through a selected note X, and the curve will hit
a point X0 within the frame F where one knows the image ℘(X0 ). Together
with this initial value, the curve will then allow us to calculate the image ℘(X).
This has been implemented in the software RUBATO , allowing us to calculate
performance of musical scores, once the performance fields were given.
272 32 Performance
Summary.
Gestures are com-
plex in their common un-
derstanding. The concept
of a gesture has never been
thoroughly defined to this
date, although gestures are
very important in human-
computer interface design,
human expressivity in and Fig. 33.1. Michelangelo’s Divine Gesture from God to
beyond common language, Adam.
and above all in the arts.
Painting, dance, music, theater, and film would not be understood without ges-
tural concepts and processes. We therefore will give a short introduction to the
first mathematical theory of gestures.
–Σ–
Fig. 33.5. Left: Chinese word development for “mountain,” right: some Chinese words
with the same genealogy.
The formalism developed for this simple case can be extended and applied
to any musical performance. It is clear that there is an infinite variety of physical
gestures associated with the same symbolic gesture. For the same symbolic
gesture, in fact, there can be some physical realizations that are more difficult
than others, some physical gestures that correspond more or less to the musical
idea, and so on.
Musical composition requires the inverse process, from physical reality
to symbolic reality. The composer can start from real physical movement—
such as piano improvisation—or “virtual” physical gestures imagined to be the
final result when the music is performed. Starting from physical gestures, the
composer has to find the optimal symbolic approximation of his or her ideas.
It means he or she will answer the question, what are the symbolic indications
whose physical realization is closer to my idea?
ˇ “*
Exercise 12 Play a song/phrase on a piano keyboard three times, choosing
a different gesture each time. Then try to write a score indicating each gesture.
A detailed analysis of gestures is beyond the scope of this book. However,
because of their dramatic importance in music and future development of music,
we will give mathematical definitions and some bibliographic references.
The gestures are curves in space and time. To formally connect the sym-
bolic to the physical, we use methods involving surfaces (world-sheets) (world-
sheets). These surfaces were inspired by the string theory in theoretical physics,
as described in [68]. A more complete discussion of the implication of gestures
in music is given in [85].
An example of transformation from symbolic to physical reality is porta-
mento for string instruments. A glissando, as we know, is a continuous transi-
tion from a starting note to an ending note. A portamento also occurs between
two different notes, but there are some differences in the execution. Ideally, if
the transition from one note to another is instantaneous, as a step-function, a
portamento implies a more smooth transition. This smoothness is realized with
a quick pass through more pitches than simply the starting and ending ones,
similar to the lower and upper level of a step. However, it does not involve all
intermediate pitches, as happens in a glissando.
Portamenti are often used by voice and string instruments. The duration
and pitches involved depend on the style of the chosen music.
ˇ “*
Exercise 14 Sing two different notes with precise intonation, but with no
pitch transition between them. Then try to perform a portamento. Measure
time duration of the first and the second case, and try to find the intermediate
notes that you are singing. Finally, make a graph of your first and your second
attempt, and compare the two images.
33.4 Philosophy of Gestures 279
tagms and contents. And they are always related to the body’s actions; no ab-
stract concept of a gesture, such as a gesture in a musical melody or a thought
gesture, is addressed.
The French tradition of gesture philosophy is charac-
terized by the thesis that gestures constitute a proper on-
tology that is independent of semiotic systems. It typically
precedes them, or is pre-semiotic. As in many other cases
of French philosophy of the 20th century, Paul Valéry is the
figurehead of gesture theory. His famous inscription on the
front of Palais Chaillot in Paris says:
Dans ces murs voués aux merveilles
J’accueille et garde les ouvrages
Fig. 33.8. Paul De la main prodigieuse de l’artiste
Valéry is the fig- Égale et rivale de sa pensée
urehead of French L’une n’est rien sans l’autre
gesture theory (In these walls devoted to the marvels
(1871-1945). I receive and keep the works
Of the artist’s prodigious hand
Equal and rival of his thought
One is nothing without the other.)
Again it is the thinking hand that Valéry invokes, a hand that in its
gestural originality is a full-fledged partner of human thoughts. It is not sur-
prising that Valéry wrote an essay on the philosophy of dance [111] in which he
concludes not with a scholarly description of dance, but a suggestion to start
dancing our thoughts instead of thinking about dance.
Ahead of his time, French mathematician and philosopher Jean Cavaillés
in 1938 stated in [26] a core property of gestures that bypasses any semiotic ba-
sis: “Comprendre, c’est attraper le geste et pouvoir continuer.” (“Understanding
is catching the gesture and being able to continue.”)
Cavaillés’ dancing thought (also shaped in Boulez’s
reflection on gesture in music [24]) was stated with re-
spect to mathematical theories, and as such it was one of
the very first principles of gestural embodiment in mathe-
matics, an idea now quite fashionable through the work of
George Lakoff and Rafael Núñez [58] but also anticipated in
Châtelet’s observation [27] that the Fregean (Gottlob Frege
(1848-1925) was German logician) concept of a function f
in mathematics is a dramatic (and questionable) abstrac-
Fig. 33.9.
tion that replaces the moving gesture from argument x to
Mathematician its functional value f(x) by a kind of disembodied “tele-
and philosopher
portation,” where the evidence of the functional relation is
Jean Cavaillés
(1903-1944).
wrapped and hidden, if not destroyed.
Gestures—except when “tamed” by social codes—are
not signs in a semiotic environment. They are not a realization of Ferdinand
33.5 Mathematical Theory of Gestures in Music 281
→
−
Intuitively, a gesture g : Γ → X is a system of curves in X that are
connected according to the arrows of the skeleton, see Figure 33.11.
ˇ “*
Example 76 The simple finger gesture is the most important example of
an elementary gesture of a pianist’s hand, see Figure 33.12. It is used in fun-
damental research about the transformation of symbolic gestures into physical
gestures, see Section 33.7.
→
− −
→
Definition 83 If g : Γ → X , d : Δ → Y are two gestures, a morphism
h : g → d is a pair h = (t, m) where t : Γ → Δ is a digraph morphism, and
m : X → Y is a continuous map, such that the following diagram commutes.
g →
−
Γ −−−−→ X
⏐ ⏐
⏐ ⏐→
t −m
d →
−
Δ −−−−→ Y
Clearly, morphisms can be composed in the expected way, and this composition
→
−
is associative. Every gesture g : Γ → X has the identity Idg = (IdΓ , IdX ) as
morphism. This setup defines the category Gest of gestures.
√ →
−
Example 40 For every arrow a in the skeleton Γ of a gesture g : Γ → X ,
we may restrict the gesture to that arrow. This defines a morphism of gestures
as shown in the following diagram. It is the restriction of g to the arrow a, and
we shall use it in Section 33.6.
33.6 Hypergestures 283
→
−
g◦a∗
[1] −−−−→
X
⏐ ⏐
a∗
⏐ ⏐−−→
IdX
g →
−
Δ −−−−→ X
33.6 Hypergestures
→
−
The set Γ @X of gestures g : Γ → X can be given the structure of a topological
space as follows. We first deal with the case of Γ = [1], the chain digraph with
∼ →−
two points and one connecting arrow. Clearly, in this case [1]@X → X . The
→
−
set X of arrows in X is the set Top(I, X), and in this case, we have the
compact-open topology that turns it into a topological space.
The general case Γ @X is obtained by looking at all of its arrows a and
looking at the morphism a∗ : [1] → Γ sending the arrow of [1] to a. Each such
a∗ defines a map pa : Γ @X → [1]@X by restriction of a gesture g to g ◦ a∗.
There is a unique topology on Γ @X that turns all these restriction maps pa
into continuous maps. This is the topology we impose on Γ @X.
This construction has the important consequence that we may now con-
sider the gesture space Δ@Γ @X = Δ@(Γ @X). Gestures in this space are
gestures of gestures, which we call hypergestures. This construction may be
iterated to yield higher hypergestures Γ1 @Γ2 @ . . . Γn @X.
284 33 Gestures
Fig. 33.13. First species counterpoint can be viewed either as a hypergesture from
cantus firmus to discantus (left) or as a hypergesture from the first to the subsequent
intervals.
ˇ “*
Example 78 In improvisation, it often is important for a creative step to
change one’s perspective of what is being heard. Here the Escher Theorem can
33.7 Hypergestures in Complex Time 285
help change, for example, a chord sequence in a given rhythm into a rhythm of
chord changes, i.e., to change one’s perception of the generative musical shape.
Fig. 33.14. Playing these notes (left) is physically impossible. The symbolic infor-
mation (rectangular curve of “imaginary” finger movements, left) must be deformed
into a smooth physical curve (right).
The pianist has to deform this set of symbols to produce a physical ren-
dition. This is mathematically achieved using a hypergesture from the finger
gesture in symbolic space to the finger gesture in physical space. This hyperges-
ture is similar to what in modern physics of strings is called a world-sheet. It is
a surface that represents a whole gesture of gestures, starting from the symbolic
286 33 Gestures
Fig. 33.15. Two world-sheets with a complex potential are shown, the left upper one
for the transition from symbols to physics, the right one for the inverse process from
physical performance to symbolic data as represented in a score. The bottom drawing
summarizes the double process from symbols to physics and back.
Our solution uses energy potential transitioning smoothly over time co-
ordinates. We start with imaginary times on the symbolic gesture and move
smoothly to real time for the physical gesture. See [68] for details. The shape
of the world-sheet of such a transition depends on the energy potential that
controls the efforts made by the pianist to achieve this performance. The top
left part of Figure 33.15 shows such a world-sheet of a complex potential. But
we should be aware that the composer may also move in the opposite direc-
tion, starting with a physical action of the hand(s) and then transforming it
into symbolic data, a process that is also achieved when recording a pianist’s
performance via MIDI technology. The top right part of Figure 33.15 shows this
inverse world-sheet. And the bottom drawing summarizes the double process
from symbols to physics and back.
Part VIII
Summary. This chapter contains the solutions of the mathematical and mu-
sical exercises. Each solution number corresponds to the exercise number.
–Σ–
√ a∪b 2a∪b
Solution 5 We have (x, y) ∈ 22 . Therefore a × b ∈ 22 .
√
Solution 6 Looking at the values f × g(x, y) = (f (x), g(y)), if every pair
(u, v) ∈ b × d is hit, then every u ∈ b and every v ∈ d is hit, and vice versa.
So epi for the cartesian product is equivalent to epi for each factor function. A
similar argument proves the mono statement, and since iso is epi AND mono,
the third statement follows.
√
Solution 7 The object Set(a, b) consist of triples (f, a, b), where f ⊂ a × b
is a graph. This means (f, a, b) ∈ 2a×b × {a, b}2 . Therefore Set(a, b) ⊂ 2a×b ×
a×b 2
{a, b}2 or Set(a, b) ∈ 22 ×{a,b} , which is a set.
√
Solution 8 The fact that q and a × b are equipollent follows from these
facts: From the universal property and the hypothesis about q, we imply that
∼
Set(c, q) → Set(c, a × b) for all c. This is true in particular for the singleton set
c = 1 = {0}. But for any set x, Set(1, x) is in bijection with the set of elements
of x. This defines a bijection of q and a × b.
More precisely, let us also show that f is a bijection. Exchanging the roles
of q and a×b, we have a unique g : a×b → q such that fa ◦g = pra , fb ◦g = prb .
The composition f ◦ g : a × b → a × b must be the identity since it is the unique
function guaranteed by the universal property of a × b. Therefore f ◦ g = Ida×b .
Exchanging the roles of q and a × b, we also get g ◦ f = Idq . This means that
these two sets are in bijection with each other.
√
Solution 9 If x∈a fx is well-ordered, take of any index x ∈ a the subset
[fx ] ⊂ x∈a fx of elements that have a fixed coordinate for all indices y = x
and any value at index x. This set is in bijection with fx , and the induced
ordering on [fx ] coincides with the ordering on fx . But as the induced ordering
on [fx ] is a well-ordering, the same holds for <x .
Conversely, if every <x is a well ordering, then for any non-empty subset
S ⊂ x∈a fx , take the smallest index x0 in a such that there are more than one
values of elements in S. Take the smallest x0 -coordinate s(x0 ) of elements of S.
(If there is no such value, there is only one element in S, and this is obviously
its smallest.) Then take the subset Sx0 of S consisting of the elements of S
who have that smallest coordinate s(x0 ) at index x0 . Then take the next larger
index where there is more than one value of elements of Sx0 at that coordinate.
Go on with the construction of a corresponding subset as before, and so on.
Since a is finite, this recursive construction stops after a finite number of steps,
and we get a smallest element of S.
34.1 Solutions of Mathematical Exercises 291
√
Solution 10 The empty set 0 = ∅ is obviously ordinal as it has no ele-
ments. The set 1 = {0} is transitive since its unique element 0 ∈ 1 is a subset.
It is alternative since it has only one element; it is founded since it is its only
non-empty subset, and its element 0 has 0 ∩ 1 = ∅. Check for 2, 3 that they are
ordinal by a similar case-by-case study. Let us see why J = {J} is not ordinal.
It is not founded because it is its only non-empty subset b, refer to property
(iii) in Definition 21. But the only element of b = J is J, and J ∩ J = J, which
is not empty.
√
Solution 11 If m ∈ n is an element of a natural number, it is ordinal by
the characteristic properties of ordinal numbers. It is either 0, hence natural,
or a successor m = r+ . Then property (ii) in Definition 22 is OK, and since n
is ordinal, it is transitive, so m is also a subset of n, and every element of m is
an element of n, so either 0 or a successor. Therefore m is also natural.
√
Solution 12 Composition of recursive functions is associative, as is any
composition of set functions. Let us show that s ◦ r : X N → X N is a re-
cursive function if r, s are so. If f, g ∈ X N such that f |n = g|n, then by
Lemma 6, r(f )|n+ = r(g)|n+ , therefore r(f )|n = r(g)|n, and since s is recur-
sive, s(r(f ))(n) = s(r(g))(n), and we are done.
√ b
Solution 13 Suppose b
a = a , which means ab = a b. Then
b d bc + ad
+ = ,
a c ac
and
b d b c + a d
+ = ,
a c a c
and we have to verify bc+ad
ac = b c+a
a c , i.e., (bc + ad)a c = (b c + a d)ac, but this
d
immediate in view of ab = a b. Changing also the second summand yields the
result for addition. The same method works for multiplication.
This arithmetic extends the integer arithmetic, since for addition and
multiplication we have 1b + 1c = b+c
1 and 1 1 = 1 .
b c bc
√
Solution 14 Take another representative of the first summand, (ai + zi )i ,
where (zi )i is in O. Then the sum is (ai + zi + bi )i = ((ai + bi ) + zi )i , which is
equivalent to (ai + bi )i . For the product, we have ((ai + zi )bi )i = (ai bi + zi bi )i ,
but since (bi )i is bounded, the product sequence (zi bi )i is in O, and we are
done.
√
Solution 15 We construct a Cauchy sequence (ci )i that converges to the
least upper bound as follows. Select any element a ∈ A, take any upper bound
b and set c0 = a. Then take the middle number (b + c0 )/2 and look whether
there is an element in [(b + c0 )/2, b] ∩ A. If yes, then take c1 = (b + c0 )/2, else
292 34 Solutions of Exercises
take c1 = a, and so on—we always take the top half-open interval where there
are elements of A, and of that one the left boundary number. This sequence
converges as it cuts the possible range in half at each step. And it defines the
least upper bound by construction.
√ √ √
Solution 16 Since the non-negative nth root is unique, we have ( n a n b)n =
√ √ √
( n a)n ( n b)n = ab = ( n ab)n , hence the claim.
√
Solution 17 The equalities are obvious for either p, q ∈ Z or p, q = z1 , w1 for
natural√numbers z, w. Then the claim follows immediately from the definition
p
x z = ( z x)p .
√
Solution 18 The first formula is clear, as it is just the rewriting of the
formula according to the rules from Sorite 8. The second formula follows from
Sorite 8 and from the equation i2 = −1.
√ √ √
3.5−i·√ 5
√
3.5−i· 5 3.5
√
5
Solution 19 (3.5 + i · 5)−1 = |3.5+i· 5|2
= 12.25+5 = 17.25 − i 17.25 ≈
0.202899 − i · 0.129627.
√
Solution 20 The map is an injection since its values on words are deter-
mined by values on the letters, as words are products of letters. Conversely,
if we are given any function f : A → N , then the definition f (a1 a2 . . . ak ) =
f (a1 ) ∗ f (a2 ) ∗ . . . f (ak ) is well defined since the letters are uniquely determined
from the words, and the monoid morphism property is immediate.
√
Solution 21 If we set f (a) = 1 for all letters a ∈ A, the value of a word w
is the length of w, i.e., the number of letters that define the word.
√
Solution 22 Here is the complete multiplication table:
∗ i r1 r2 r3 h v d1 d2
i i r1 r2 r3 h v d1 d2
r1 r1 r2 r3 i d1 d2 v h
r2 r2 r3 i r1 v h d2 d1
r3 r3 i r1 r2 d2 d1 h v
h h d2 v d1 i r2 r3 r1
v v d1 h d2 r2 i r1 r3
d1 d1 h d2 v r1 r3 i r2
d2 d2 v d1 h r3 r1 r2 i
√
Solution 23 A minimal set of generators is {r1 , h}.
34.2 Solutions of Musical Exercises 293
√
Solution 24 If f : G → H is injective, then x, y ∈ Ker(f ) implies f (x) =
f (y) = eH , so x = y. Since eG ∈ Ker(f ), we have Ker(f ) = {eG }. Conversely,
if Ker(f ) is trivial, then f (x) = f (y) implies f (xy −1 ) = eH , hence xy −1 = eG ,
i.e., x = y, and f is injective.
√
Solution 25 The cycle representation is (1 11 5 6 7 2 3 10 12 8)(4)(9).
√
Solution 26 The kernel of pr3 : Z12 → Z3 are the elements x12 such that
x3 = 0, i.e., x is a multiple of 3. In x12 this is the group generated by 312 . Same
argument for the kernel of pr4 : Z12 → Z4 .
√
Solution 27 We have r ∗ 0R = r ∗ (0R + 0R ) = r ∗ 0R + r ∗ 0R , and, after
subtracting r ∗ 0R , we get 0R = r ∗ 0R .
√
Solution 28 The sum formula is evident from the definition. For the prod-
uct formula, the monoid element X m has the coefficient that is the sum of all
products an bl of coefficients of X n , X l , respectively, such that X n X l = X n+l =
X m , and this yields the product formula.
√
Solution 29 If f = T t ◦ f0 , then f (0) = t, so T t is determined, and then
f0 = T −t ◦ f is also determined.
ˇ “*
Solution 1 Using the involution property of addition, the equation w + y =
z has the solution w = z + y = {♩, } + {, } = {♩, }.
ˇ “*
Solution 2 The graph g −1 ◦ g consists of the pairs (s, t) of onsets such that
there exists a pitch r with (s, r), (t, r) ∈ g, meaning that there are two onsets
with a repeated pitch.
ˇ “* ∼
Solution 3 The pairs (x, y) in h−1 ◦ g : O12 → O12 are the onset time pairs
such that g(x) = h(y) = p, i.e., they map to the same pitch p in P12 . The
resulting bijection is the following (the image of i is the number below i):
0 1 2 3 4 5 6 7 8 9 10 11
7 8 9 3 2 10 11 4 1 0 5 6
ˇ “* Solution 5 You have to check that all lines (1-simplices) are the case, and
that no other lines are there. Then verify that all 2-simplices (triangular sur-
faces) are the case and no others; then that there are no 3-simplices. Higher
simplices are then automatically non-existent.
For example, for a 1-simplex, we have the intersection of motive 1 and
motive 2 in two points, top and left next to top (Figure 34.1), or for a 2-
simplex the intersection of motives 1,2, and 9 in the point next left to the top
point of the triangle.
ˇ “*
Solution 6 We have ord(K4 ) = 4, and as 3 is not a divisor of 4, no such
group of order 3 is a subgroup of K4 . As a series s generates a subgroup s
of S12 , its order must be a divisor of 12! = ord(S12 ). Therefore orders 13 and
23 are impossible, as these prime numbers do not divide 12! But 12 is possible,
and in fact the series that sends 0 → 1 → 2 → 3 . . . 10 → 11 → 0 has order 12.
ˇ “* Solution 9 The nerve is the union of two 2-simplices, see Figure 34.2.
ˇ “* Solution 10 A makes the permutation (e)(a b)(c g)(f d), see Figure 28.3.
ˇ “*
Solution 11 Both, Z and A permute (c g)(f d), so their composition leaves
these four pitch classes fixed. Then A(a) = b, and Z(b) = a, A(e) = e, and
Z(e) = e, A(b) = a, and Z(a) = b.
34.2 Solutions of Musical Exercises 295
ˇ “*
Solution 14 This exercise is also an open one. You should show what the
curve of the step function and the portamento curve look like. Figure 34.3
shows the portamento curve as a smooth transition from note c to note g.
9. Aphex Twin: ΔMi−1 = −α N n=1 Di [n] j∈C[i] Fij [n − 1] + F extj [n
−1
] . In:
CD Windowlicker, Warp Records WAP105, 1999
10. Babbitt M: Words about Music. Dembski S and Straus J N (eds.), University
of Wisconsin Press, Madison 1987
11. Babbitt M: Some Aspects of Twelve-Tone Composition. In: Hays W (ed.): The
Score and IMA Magazine, 12, 53-61, 1955 (reprinted in “Twentieth Century
Views of Music History”, 364-371, Scribner, New York 1972)
12. Babbitt M: The Structure and Function of Music Theory. College Music Sym-
posium, Vol. 5, 1965 (reprinted in Boretz and Cone, 1972, 10-21)
13. Babbitt M: Twelve-Tone Invariants as Compositional Determinants. Musical
Quarterly, 46, 245-259, 1960
14. Babbitt M: Set Structure as a Compositional Determinant. JMT, 5(2), 72-94,
1961
15. Baker J, D Beach, and J Bernard: Music Theory in Concept and Practice. East-
man Studies in Music, University of Rochester Press, 1997
16. Bazelow A and F Brickel: A Partition Problem Posed by Milton Babbitt. PNM,
14(2), 15(1), 280-293, 1976
17. Beran J and G Mazzola: Timing Microstructure in Schumann’s Träumerei as an
Expression of Harmony, Rhythm, and Motivic Structure in Music Performance.
Computers and Mathematics with Applications, Vol. 39, Issue 5/6, 99-130, 2000
18. Beran J et al.: On the relationship between tempo and quantitative metric,
melodic and harmonic information in Chopin’s Prélude op. 28, no. 4: a statistical
analysis of 30 performances. Journal of Mathematics and Music,Vol 8, No. 3,
225-248, 2014.
19. Bernard J: Chord, Collection, and Set in Twentieth-Century Theory. In: Baker
J et al. this bibliography, 11-52
20. Boissière A: Geste, interprétation, invention selon Pierre Boulez. Revue
DEMéter, déc. 2002, Univ. Lille-3
21. Boretz B and E T Cone: Perspectives on Contemporary Music Theory. W.W.
Norton and Company, New York 1972
22. Boulez P: Jalons (dix ans d’enseignement au Collège de France). Bourgeois,
Paris 1989
23. Boulez P: Musikdenken heute I, II; Darmstädter Beiträge V, VI. Schott, Mainz
1963, 1985.
24. Boulez P: L’Ecriture du geste. Christian Bourgois, Paris 2002
25. Cage J: 4’33”. Edition Peters 1952
26. Cavaillès J: Méthode axiomatique et formalisme, Hermann 1938
27. Châtelet G: Figuring Space. Kluwer 2000
28. Colombo S: I canoni, storia e sviluppo di un enigma. Armelin Musica, Padova
2009
29. Dahlhaus C: Über den Begriff der tonalen Funktion. In: Vogel M (ed.): Beiträge
zur Musiktheorie des 19. Jahrhunderts. Bosse, Regensburg 1966
30. Dubois T: Trattato di Contrappunto e Fuga, Ricordi, Milano 1905. Reprinted
in 2002
31. Euler L: Tentamen novae theoriae musicae (1739). In: Opera Omnia, Ser. III,
Vol. 1 (Ed. Bernoulli, E et al.). Teubner, Stuttgart 1926
32. Euler L: Conjecture sur la raison de quelques dissonances générales reçues dans
la musique (1764). In: Opera Omnia, Ser. III, Vol. 1 (Ed. Bernoulli, E et al.).
Teubner, Stuttgart 1926
33. Euler L: De harmoniae veris principiis per speculum musicum representatis
(1773). In: Opera Omnia, Ser. III, Vol. 1 (Ed. Bernoulli, E et al.). Teubner,
Stuttgart 1926
34. Felver C: Cecil Taylor - All The Notes. A Chris Felver Movie, 2003
35. Finke R A, Th B Ward, S M Smith: Creative Cognition: Theory, Research, and
Applications. MIT Press, Cambridge, MA 1992
36. Fletcher J (ed.): Athanasius Kircher und seine Beziehungen zum gelehrten Eu-
ropa seiner Zeit. Herzog August Bibliohek Wolfenbüttel, 1988
37. Flusser V: Gesten — Versuch einer Phänomenologie. Fischer, Frankfurt am Main
1994
38. Forte A: A Theory of Set-Complexes for Music. JMT, 8(2), 136-183, 1964
39. Forte A: Structure of Atonal Music. Yale University Press, New Haven 1973
40. Forte A: La Set-complex theory : Élevons les enjeux ! Analyse musicale, 4e
trimestre, 80-86, 1989
References 299
98. Rufer J: Die Komposition mit zwölf Tönen. Bärenreiter, Kassel 1966
99. Russell B: A History of Western Philosophy. George Allen & Unwin Ltd, London
1945
100. Sachs K-J: Der Contrapunctus im 14. und 15. Jahrhundert. AMW, Franz
Steiner, Wiesbaden 1974
101. Schmitt J-C: La raison des gestes dans l’Occident médiéval. Gallimard, Paris
1990
102. Schmidt-Biggemann W: Topica universalis. Meiner, Hamburg 1983
103. Schönberg A: Harmonielehre (1911). Universal Edition, Wien 1966
104. Sciarrino S: Le figure della musica da Beethoven a oggi. Ricordi, Milano 1998
105. Song M and Y Pang: A Comprehensive Analysis of “The Scene of Sichuan
Opera.” In: Collected Papers of One Hundred Tenure Professors. Sichuan Peo-
ple’s Publishing House, Chengdu 2014
宋名筑, 庞艳: 管乐与打击乐《折子戏》及钢琴独奏《川剧场景》的作曲技术比较
分析. 四川音乐学院百名老教授论文作品选集, 四川人民出版社, 成都 2014
106. Starr D and R Morris: A General Theory of Combinatoriality and the Aggregate.
PNM, 16(1), 364-389, 16(2), 50-84, 1977-78
107. Starr D: Sets, Invariance and Partitions. JMT, 22(1), 1-42, 1978
108. Straus J N: Introduction to Post-Tonal Theory. Pearson, 3rd ed. 2004
109. Surian E: Manuale di storia della musica. Rugginenti, Milano 1995
110. Tymoczko D: A Geometry of Music. Oxford University Press, 2011
111. Valéry P: Philosophie de la danse (1938). Oeuvres I, Variété, “Téorie poétique
et esthétique”, 1390-1403, Gallimard, Paris 1957
112. Valéry P: Cahiers I-IV (1894-1914). Celeyrette-Pietri N and Robinson-Valéry J
(eds.), Gallimard, Paris 1987
113. Vogel M: Die Lehre von den Tonbeziehungen. Verlag für systematische Musik-
wissenschaft, Bonn-Bad Godesberg 1975
114. van der Waerden B L: Die Pythagoreer. Artemis, Zürich 1979
115. van der Werf H: “Early Western Polyphony” Companion to Medieval & Renais-
sance Music. Oxford University Press 1997
116. Xenakis I: Formalized Music. Pendragon Press (revised edition), New York 1992
117. Xenakis I: Musiques formelles : nouveaux principes formels de composition mu-
sicale. Richard-Masse, Paris 1963
118. Zarlino G: Le istitutioni harmoniche (1573). Facsimile edition: Gregg Press 1966
119. Zarlino G: Dimostrationi harmoniche. 1571
120. Zhu Z: On the Equal Temperament. 1584
121. Zhu Z: A Clear Explanation of That Which Concerns the Equal Temperament.
1595/96
122. Zhu Z: Reflection on Mathematics. 1603
123. Zurlinden, H: Wolfgang Graeser. Beck, München 1935
Index
categories inversion, 76
theory of -, 28 major -, 246
category, 226, 249 mystic -, 179
of gestures, 282 of pitch classes, 157
of groups, 250 prime form of -, 67, 158
of local compositions, 250 seventh -, 183
of matrices, 250 transposition
of modules, 250 class of a -, 157
of sets, 250 transposition-inversion
of topological spaces, 259 class of a -, 157
opposite -, 250 Cicero, 279
theory, 29, 30, 121 class
Cauchy chord -, 28
sequence, 100, 104 equivalence -, 58
Cavaillés, Jean, 280 pitch -, 86, 156
CEMAMU, 23 transposition -, 153
Cent (Ct), 110 Classical Symphony, 237
center, 219 classification
CERN, 6 chord -, 174
CF (cantus firmus), 13, 44, 175 of chords, 65
Châtelet, Gilles, 279 classifier
chain subobject -, 65
digraph, 123 clefs, 12
length of -, 123 clivis, 275
rule, 264 closed
undirected -, 125 interval, 258
chamber a, 110 set, 257
changes closure, 262
chord -, 182 coda, 145
chant codomain, 52, 249
Gregorian -, 273 coefficient, 218
characteristic cogitans
function, 65 res -, 116
cheironomy, 275 Coltrane, John, 181
Cherubini, Luigi, 11 combinatorial
Chew, Elaine, 30 music
chin, 275 theory, 16
Chinese combinatorics, 16
drum notation, 276 commutative
gestural music notation, 275 diagram, 62
language, 275 group, 147
choice monoid, 143
axiom of -, 44, 58 ring, 46
chord, 58 commutativity, 44
changes, 182 compact, 260
class, 28 locally -, 261
classification, 65, 174 compact-open
diminished topology, 261
seventh -, 133 complementation, 157
308 Index
differential keyboard -, 53
quotient, 265
digraph, 122, 143 E
chain -, 123 Eckardt, Jason, 33
final -, 122 Ehrenfels, Christian von, 261
morphism, 123 Eigen, Manfred, 31
spatial -, 281 Eilenberg, Samuel, 29, 251
dimension, 232 Eimert, Herbert, 3, 28, 222
diminished Eine kleine Nachtmusik, 237
7th, 162 electromagnetic
seventh force, 195
chord, 133 element
triad, 160, 179 minimal -, 58
diminuendo, 208 neutral -, 45, 143
direct sum, 226 order of -, 154
direct sums embedding
universal property of -, 228 Euler -, 227
directed empty
graph, 122 set, 41
Dirichlet, Peter Gustav Lejeune, 19 Encarnaçao, José, 31
discantus, 284 epi, 53
discantus (D), 44, 175 equal temperament, 9
disjoint, 44 equal-tempered
dissonance, 177, 245 tuning, 247
dissonant equation, 46, 89
interval, 176 conchoid -, 238
distance Lagrange -, 154
function, 260 linear -, 223, 234
Euclidean - -, 260 Pythagorean -, 99
metrical -, 173 equipollent, 56
distributive, 219 equivalence
law, 205 class, 58
distributivity, 45, 225 relation, 57
Division erhu, 108
Theorem, 85, 211 Escher
divisor, 213 Theorem, 284
dodecaphonic Euclid, 85, 87
series, 75, 126, 148 Euclidean
dodecaphonism, 53 distance function, 260
domain, 52, 249 length, 115
integral -, 214 Euler
dominant cycle, 126
7th, 161 embedding, 227
Dominant (D), 55, 145 number, 100
dualism space, 227
cartesian -, 277 Euler space, 18
duality, 64 Euler’s substitution theory, 262
Dubiel, Joseph, 33 Euler, Leonhard, 17, 227, 231, 262
duration, 231 Examples
310 Index
positive -, 90 jazz
integers harmony, 183
addition of -, 90 just
multiplication of -, 91 tuning, 8, 111, 176, 215, 228, 241, 262
integral, 270
curve, 271 K
domain, 214 Kandisky, Wassily, 235
interpretation Kant, Immanuel, 281
triadic -, 194 Kendon, Adam, 279
intersection, 44 Kepler, Johannes, 101
interval, 58, 62, 75, 90, 144, 152, 176 kernel, 154, 208
closed -, 258 key
consonant -, 176 black -, 175
dichotomy, 177 keyboard -, 53
dissonant -, 176 signature, 175
fifth -, 75 white -, 175
fourth -, 75 keyboard, 111
major second -, 75 duration, 53
major seventh -, 75 key, 53
major sixth -, 75 note, 53
major third -, 75 King, Emily, ix
minor second -, 75 Kircher, Athanasius, 12, 14
minor seventh -, 75 Klein 4-group, 149, 154
minor sixth -, 75 Klumpenhouwer
minor third -, 75 network, 122
open -, 258 Klumpenhouwer net (K-net), 29
prime -, 75 Klumpenhouwer, Henry, 29, 122
tritone -, 75, 232 knowledge
intonation, 260, 265 science, 15
inverse Kronecker
graph, 50 delta, 218
relation, 57 Kronecker, Leopold, 72, 73
inverse element Kunst der Fuge, 21, 26
additive -, 97
multiplicative -, 97 L
inversion, 26, 28, 92, 153, 222 L’essence du bleu, 133, 191
chord -, 76 La Bohème, 10
retrograde -, 222 Lagrange
invertible, 147 equation, 154
involution, 46 Lagrange, Joseph-Louis, 19
IRCAM, 3, 24, 32, 33 Lakoff, George, 280
irreducible language
polynomial, 213 Chinese -, 275
iso, 53 law
isomorphism, 123, 145 distributive -, 205
lazy
J path, 123
Jacobian Le Corbusier, 23
matrix, 270 lead sheet, 182
314 Index
polynomial, 207 Q
algebra, 206 quantized
degree of a -, 211 modulation, 196
function, 208, 259 quantum
irreducible -, 213 modulation -, 196
polyrhythm, 97 quarter tone
portamento, 278 music, 156
positive quarter-tone
integer, 90 piano, 108
real number, 103 quilisma, 275
Quintilian, 279
Posner, Roland, 31
quotient
Pousseur, Henri, 3
differential -, 265
powers, 43
group, 155
pre-semiotic, 280
module, 229
precision ring, 210
mathematical -, 268
musical -, 268 R
pressure, 110 Rahn, John, 26, 27, 33
prime, 176 ratio
factorization, 214 golden -, 101
interval, 75 rational
number, 86, 91, 213, 228 number, 93
prime form of numbers
chord, 67, 158 addition of -, 96
principal multiplication of -, 96
ideal, 210 real
product number, 100
cartesian -, 50, 62 positive -, 103
cartesian - of a family of sets, 65 numbers
of matrices, 218 linear ordering of - -, 104
projection, 50 time, 277
Prokofiev, Sergei, 237 recapitulation, 145
property reflection, 221
reflexive
Archimedean -, 104
relation, 57
universal -, 61
reflexivity, 56
proposition, 40
relation
psychoacoustics, 110
antisymmetric -, 57
Ptolemy, 245 binary -, 57
Puccini, Giacomo, 10 equivalence -, 57
punctum, 275 inverse -, 57
punctus contra punctum, 284 linear -, 58
pure partial -, 58
set, 38 reflexive -, 57
Pythagoras, 1, 5, 111 symmetric -, 57
Pythagorean total -, 57
equation, 99 transitive -, 57
tuning, 8, 176, 245, 247 relative
Index 319
difference, 45 solutions
empty -, 41 of the exercises, 289
final -, 61 sonata, 133
finite -, 58, 75 Appassionata -, 253
initial -, 61 form, 145
open -, 257 Hammerklavier -, 133, 191, 199, 277,
ordered -, 26, 49 285
pair -, 43 song
pitch-class -, 26 form, 145
pure -, 38 Song of Yi II—A Se, 124
theory, 38 Song, Mingzhu, 80, 124
seventh sonification, 234
chord, 183 sort, 40
major -, 176, 246 sound
minor -, 176 technology, 117
sharp , 269 space
shearing, 221 Euler -, 18, 227
Shorter, Wayne, 97 mental -, 116
sign, 273, 279 physical -, 116
signature vector -, 232
key -, 175 span, 178
time -, 94 spatial
signification, 50 digraph, 281
similarity species
motivic, 261 fifth -, 176
simple first -, 176
meter, 94 fourth -, 176
simplex, 129 second -, 176
face of a -, 129 third -, 176
geometric representation of a -, 130 Spectralism, 3
simplicial spiral, 238
morphism, 132 spondee, 145
simply staccato, 270
transitive Stange-Elbe, Joachim, 32
action, 152 statements in mathematics, 40
sinusoidal Steibelt, Daniel, 149
function, 110 Steinitz
sixth Lemma, 233
major -, 176, 246 step
minor -, 176 fundamental -, 192
skeleton semitone -, 99
gesture -, 282 Stockhausen, Karlheinz, 3, 101, 222
snail, 239 Stolberg, Leopold, 145
Society of Mathematics and Compu- Strauss, Johann, 95
tation in Music (MCM), 30, 33, Stravinsky, Igor, 95
34 string
solution theory, 246
approximate -, 99 vibrating -, 2, 246
Solution of Exercise, ix strong
Index 321