Professional Documents
Culture Documents
net/publication/299850277
CITATIONS READS
14 8,013
2 authors, including:
Andrew J. Milne
Western Sydney University
63 PUBLICATIONS 479 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
The learnability and playability of pitch layouts in new musical instruments View project
All content following this page was uploaded by Andrew J. Milne on 12 March 2017.
For most of human history, musical rhythm—as of the music betrays few discrete (and hence few
with all forms of musical performance—has been a rhythmic) features. In the popular music tradition,
fundamentally embodied enterprise, close to dance: analog and digital (computational) sequencers, loop
for example, limbs making beats by striking ob- sequencers (e.g., Acid), and step sequencers have
jects together. In all periods, however, the music often been used to produce many of the charac-
involving temporal repetitions evoking dance has teristic sounds of pop. These sounds include the
coexisted with “unmeasured” music, largely devoid disco beats of the 1970s, “synth groups” of the
of such impulse, such as medieval plainsong, or 1980s, house music and hip-hop in the 1990s, and
the unmeasured music of Couperin. Since the ad- the “glitchy” beats of more recent electronic dance
vent of electronic and computer technology, many music.
creators of classical, jazz, and popular music have Several current computer music platforms (e.g.,
explored both human and artificial (nonhuman, Max and Gibber) make it convenient to generate
often computer-aided) generation of rhythms, for and control freely definable sequences of events
performance and composition. Notable among these and the number of times they are exactly repeated.
explorations in Western classical music are (1) the In contrast to Max and Gibber, which encourage
evasion of rhythmic regularity in some works of pulse flexibility and variation, many pattern-based
Iannis Xenakis and Karlheinz Stockhausen; (2) the languages (e.g., the live-coding languages ixi lang
development of serialized rhythmic structures, of- and Tidal) are primarily based on the idea of
ten more readily realized by computational than fixed isochrony of pulses, and commonly oper-
human performance; (3) irrational rhythms (such ate on bars as unitary objects. (Isochrony means
as those in instrumental composition by Brian evenly spaced in time—all inter-onset intervals
Ferneyhough and the new complexity composers, are identical.) There are also numerous compu-
some of which parallel those of the roughly con- tational “rhythm machines” in use, published
temporaneous free jazz and free improvisation, from and unpublished (e.g., Dean 2003; Sioros et al.
John Coltrane and Cecil Taylor to Derek Bailey 2014). We have not, however, found a rhythm gen-
and Evan Parker); and (4) the establishment of erator like MeanTimes (illustrated in Figure 1),
noise music and its relatives, in which the surface which is capable of producing (1) a nested hierar-
chy of rhythmic streams that can be simultane-
Computer Music Journal, 40:1, pp. 35–53, Spring 2016 ously systematically morphed between isochrony
doi:10.1162/COMJ a 00343 and nonisochrony, and (2) differing levels of
c 2016 Massachusetts Institute of Technology.
⃝ complexity.
former as pulses, and events in the latter as beats. that a rhythm with high metricality or, indeed a
Under this definition, every beat must coincide meter itself, must have some level of organization,
with a pulse, but not every pulse must coincide patterning, or regularity across time. Our focus here
with a beat (because the beat rhythm is a subset will be on temporal patterns of the onsets of discrete
of the pulse rhythm). By extension, it is possible events, and we will mostly ignore variations in their
to have multiple rhythmic levels where each level intensity, spectrum, or individual duration. Similar
nests all lower levels. In such a situation, we arguments will apply across all these domains,
refer to events in the lowest rhythmic level under however, and they may interact to determine the
consideration as beats, and events in all higher levels resulting meter (e.g., through the influence of
as pulses. This hierarchical approach is related to dynamic accents).
that taken by Lerdahl and Jackendoff (1983), but For events to be patterned we have only a
our terminology differs because they term all levels general requirement that they are arranged with
“beats.” Of course, there are slower metrical levels some regularity or symmetry that distinguishes
below beats, such as measures, or phrases that span them from a purely random arrangement. We will,
several measures, or even longer-term structures, however, make some concrete suggestions about
but these are not our concern here. Henceforth, we possibly relevant regularities. Perhaps the most
use the terms event size or event length to specify basic form of regularity is repetition over a period.
the inter-onset interval between that event and the Indeed, periodicity may be reasonably considered
next event on the same level. An event’s duration a prerequisite for metricality, although in much
is used to specify how long that event is actually music the temporal period of a given meter may
played for (which may be less, the same as, or more vary detectably across a piece. As illustrated in
than its size). Figure 2, additional patterning may result from
In this article, our focus will be on rhythms that irregular repetitions of event sizes, or irregular
have a nontrivial degree of metricality. We take repetitions of subsequences of event sizes. There
a relatively nonrestrictive functional approach to may be reflective symmetries where all, or a part
the definition of metricality: We use it to mean of, a rhythm is repeated but in retrograde form.
the extent to which a rhythm has a repeating The rhythm may exhibit some of the organizational
or repeatable pattern and is perceived to have a properties identified by Toussaint (2013), or other
predictable structure. Indeed, we consider a meter principles such as balance (Milne et al. 2015).
to be a mental template or dynamic process that These organizational principles may hold at any
is sufficiently stable to assign probabilities to single rhythmic level or any mixture of levels. They
events’ onset times. In other words, a meter is may also hold for unplayed but mentally induced
a mental representation, not a physical, sounded metrical pulses. For example, suppose a played beat
manifestation (which is a rhythm). This implies rhythm that is somewhat unpatterned is a subset of
in Figure 3 (in this visualization, the imaginary unit meter (both containing M events). In Figure 3, the
i is at 3 o’clock, the real unit 1 is at 12 o’clock, and seven isochronous locations are shown with dashed
angles are measured clockwise from the latter—this vectors, and the clockwise displacement from the
is a diagonal reflection of the conventional math- mth isochronous location to the mth beat is shown
ematical visualization, but results in the familiar by the respective number outside the circle.
clockface type representation usually depicted in The magnitude of the first coefficient, |F x[1]|, is
rhythm-related research). The first element of x, the length of the resultant vector, which is the sum
which is 0 in both of these rhythms, is mapped to 1 of the M previously mentioned unit-length vectors
in the complex plane and so goes to the 12 o’clock whose angles correspond to the respective displace-
position, while successive elements proceed clock- ments. Clearly, the smaller the variance of these
wise around the circle (the angles written around displacements, the longer the resultant vector will
the circle will be explained forthwith). be. When the events are maximally even—which
The next step is to take the discrete Fourier implies all their displacements are identical, so all M
transform of this complex vector, and the magnitude summands are parallel vectors—the normalization
of the first coefficient gives the evenness of the by 1/M ensures that the length of the resultant
pattern: (i.e., the evenness value) is unity. In Figure 3, this
! ! summation of the angular displacement vectors
M ! is depicted by the seven small vectors extending
! ! !! 1 " !
evenness = !F x[1]! = ! z[m] e −2πim/M
! from the center of the circle outwards. Each vector
!M !
m=1 has a length of 1/7 because of the normalization.
! M
! The innermost vector has an angle of 0◦ (vertical)
!1 " !
! 2πi(x[m]−m/M) ! because the displacement between the first beat
=! e !. (1)
!M ! and the first isochronous location is zero; the next
m=1
vector outwards has an angle corresponding to the
The summand in the second line, e2πi(x[m]−m/M) , displacement between the second beat and the sec-
makes it clear that what is being calculated, for each ond isochronous location (8.6◦ in Figure 3a, −21.4◦
value of m, is a unit-length vector whose angle is in Figure 3b); and so on. Note how the vector re-
equivalent to the directional (angular) displacement sulting from the sum of these displacement vectors
between the mth event of the beat structure under is slightly longer in Figure 3a (its length is 0.988)
investigation and the mth event of an isochronous than it is in Figure 3b (its length is 0.924); hence
strategies that may be taken—for example, using is divided for the lowest-level rhythm (in this
balance rather than evenness (Milne et al. 2015). article, these numbers are denoted j and k, but have
We suggest, however, that they are useful principles different designations in MeanTimes’s interface).
relevant to both rhythm production and probably The lengths of these beats are denoted ℓ and s,
to perception. As such, they provide a useful means respectively: Their ratio r = ℓ/s can be smoothly
to reduce the space of rhythms into one that can varied with a slider along the entire continuum
be smoothly parameterized and that also contains a that preserves that rhythm’s structure (i.e., given
wide variety. j and k, the range 1 ≤ r < ∞ within which s ≤ ℓ).
The sizes of ℓ and s are automatically calculated
to ensure the period’s length is invariant across all
MeanTimes: The Application and its Design values of r . It is also possible to selectively sound a
number of “higher-level” well-formed rhythms that
Developing from the analogy between (rhythmic) nest the “Level 0” beat rhythm, and that smoothly
meter and (pitch) scale introduced earlier, and the change as a function of r over its entire range. As
consideration of evenness and well-formedness, discussed earlier, at certain values of r , some of
MeanTimes is constructed as a MIDI rhythm these higher-level rhythms will comprise perfectly
generator providing a rhythmic parallel to the pitch- isochronous pulses.
based manipulations used by the freeware Dynamic As shown in close-up in Figure 4, each rhythmic
Tonality synthesizers and sequencers detailed by level is visualized as a polygon inscribed in a circle. A
Milne, Sethares, and Plamondon (2008), Sethares and “playhead,” depicted as a small disk, rotates around
coworkers (2009), and Prechtl and colleagues (2012). the circle and, whenever it “hits” a polygon vertex,
MeanTimes is built in Cycling ’74’s Max, and can be a MIDI note is sent out with a pitch, duration,
downloaded from www.dynamictonality.com as a channel, and output port that has been designated
standalone application for Windows and Mac OS X. for that polygon by the user.
As detailed subsequently, MeanTimes provides Our approach to parameterizing and explaining
a number of parameters to shape six separate well-formedness is novel because of our focus on the
rhythmic levels of a well-formed hierarchy. The size of r , rather than the size of a generating interval,
user can set the length, in milliseconds, of the which, with the notable exception of Blackwood
period of repetition (denoted d), and the numbers of (1985), is the more usual way to parameterize well-
long beats and short beats into which this period formed pitch-based scales (Wilson 1975; Erlich 2006;
long and short beats ( j, k), and the beat-size ratio level) are displayed in the interface of MeanTimes,
r (all of which can be controlled in MeanTimes’s and they arise from the user-controlled changes in r ,
user interface). It is also trivial to generate different or the other parameters.
“modes” (starting points) for the rhythms by rotating
the well-formed word.
Because r has an infinite range, we scale it A Nested Hierarchy of Rhythmic Levels
by adjusting a parameter t ∈ [0, 1], such that r =
As previously discussed, every well-formed rhythm
1/(1 − t). Hence t = 0 implies r = 1, and t = 1
is nested inside another higher-level well-formed
implies r = ∞. This ensures that when the r -slider
rhythm. This allows a hierarchy of such rhythms
is in its halfway position (i.e., t = 0.5), the resulting
to be played simultaneously or successively: Mean-
r value is 2/1. This is a particularly important
Times’s interface (Figure 1) includes a set of six
beat-size ratio because—as shown in the next
checkboxes to toggle playback of the user-defined
section—it always implies that the next higher-level
beat rhythm, and the next five higher levels of
well-formed rhythm (which may be thought of as
pulses.
comprising pulses, some of which coincide with the
Let each successive rhythmic level be denoted
lower-level beat) is isochronous.
i, where i = 0 is the beat rhythm (whose j and k
As previously discussed, as the value of r is
values are entered by the user), i = 1 is the Level
moved throughout its range, it affects the evenness
1 well-formed pulse rhythm that nests it, i = 2 is
of the resulting rhythm. When r = 1, the long and
the Level 2 well-formed pulse rhythm that nests
short beats are of identical size, so the rhythm is
the Level 1 pulse rhythm, i = 3 is the Level 3 well-
maximally even (isochronous). As r is increased, the
formed pulse rhythm that nests the Level 2 pulse
evenness reduces. For any given r value, however,
rhythm, and so forth. This means that any meter
evenness also varies across different well-formed
is nested inside a hierarchy of higher-level pulse
rhythms. Usually, the evenness of well-formed
rhythms, with each higher-level pulse rhythm level
rhythms, all of the same number of events j + k,
being a superset of the previous.
is ordered by the size of j. That is, the greater the
We now make some more general statements
number of long beats, the greater the evenness of
about the relationships of the well-formed rhythms
that well-formed rhythm at any given r value. This
of successive levels. First, let us consider the
is illustrated in Figure 5, which shows the evenness
numbers of long and short pulses in level i + 1 as a
values for all five-beat well-formed rhythms over
function of level i:
1 ≤ r < ∞. Because the minimal evenness value
differs across rhythms, and also to keep the interface $
simple, we do not allow the user direct control of the ( ji + ki , ji ), ri ≤ 2/1,
( ji+1 , ki+1 ) = (2)
evenness value. The changes in evenness (for each ( ji , ji + ki ), ri ≥ 2/1.
different r0 values. This is illustrated in Figure 6, note that when ri → ∞, the well-formed rhythm is
which is explained forthwith. degenerate and its total number of events reduces
Let us start by considering the beat rhythm, because its short events vanish.
which is shown by the bottom half-diamond. The The level i of any well-formed rhythm is given by
user has control of the beat rhythm’s r0 value, and the number of diamonds the r0 line passes through
the precise locations of some r0 values are indicated to get to it. Hence the diamond labeled 5:2 and 2:5
by the numbers directly above this diamond. As is Level 1; the diamond labeled 5:7 and 7:5 and the
the value of r0 is changed, visualize a vertical line diamond labeled 7:2 and 2:7 are both Level 2; the
passing over the entire height of the figure; any diamond labeled 7:9 and 9:7 and the diamond labeled
horizontal diamond the line passes through is a 9:2 and 2:9 are Level 3; and so forth (each successive
higher-level well-formed rhythm that nests all level is successively colored with alternating black
lower-level rhythms (including the beat rhythm) at and white). At any given level, there will always be
that r0 value. The horizontal extent of every such a well-formed rhythm available (the apparent gaps
well-formed rhythm indicates the range of r0 values at levels 3, 4, and 5 are actually filled by higher-level
over which its own ri values are between 1 and ∞ well-formed rhythms “above” the current figure).
(some ri values for each well-formed rhythm are Let us demonstrate how this works by considering
shown directly above its diamond). The thickness of the nested rhythms that are available when r0 =
each diamond gives a very approximate indication 3/2. At this r0 value, the Level 0 beat rhythm—
of the rhythm’s evenness, which is maximal when represented by the bottom diamond—is a rhythmic
ri = 1 and minimal when ri → ∞ (as was illustrated analogue of the standard pentatonic scale (which
in Figure 5). The numbers of long and short events may help the visualization and auralization). It is
in each well-formed rhythm are indicated by the analogous because the pentatonic scale has two
figures inside each diamond. For example, the large and three small steps and, assuming a standard
label 2:3 means there are two long and three short twelve-tone equal temperament (12TET) tuning,
events. Note how each well-formed pattern swaps a step-size ratio of 3:2 (its large steps are three
its numbers of long and short events as it passes semitones and its small steps are two semitones).
through ri = 1/1, as implied in Equation 2. Also The Level 1 rhythm in the next row up has seven
nonisochronous pulses and nests the five-beat the two-long-, three-short-beat rhythm is illustrated
pattern. It comprises five long and two short pulses, in Figure 7.
and it has a step-size ratio (r value) of 2:1 (and so is In MeanTimes’s interface, the evenness values
a rhythmic analog of the 12TET diatonic scale with for the beat-level and each of the three higher-level
five two-semitone steps and two single-semitone pulses are displayed as bar graphs, which are updated
steps). There is also a Level 2 rhythm that nests both in real time as the r slider is moved. The color of
the previous two levels (shown by the next black the bar also indicates whether each rhythm level
diamond up), which contains twelve pulses. When is coherent (its r value is less than two), and each
r0 = 3/2 for the Level 0 rhythm, r2 = 1 for the Level level’s r value is also displayed.
2 rhythm. This means that all the latter’s pulses are
isochronous, hence this is the rhythmic analog of
the equal-tempered chromatic scale. All higher-level Extensions and Modifications of
rhythms have twelve isochronous pulses. Well-Formed Rhythms
As the value of the beat rhythm’s r0 -slider is
changed, MeanTimes automatically selects the In the earlier section “Well-Formed Words,” we
appropriate well-formed rhythm for each higher formalized the splitting of each long beat into
level, and finds the appropriate event timings by a new long and short beat with the mapping
recursion of Equations 2–4. ℓ '→ ℓs. We could have chosen the opposite ordering:
ℓ '→ sℓ. This implies changing the first morphism
ℓ '→ ℓs and s '→ ℓ (as defined earlier) to ℓ '→ sℓ
Evenness of the Well-Formed Hierarchy and s '→ ℓ; and changing the second morphism
ℓ '→ ℓs and s '→ s (as defined earlier) to ℓ '→ sℓ and
We previously showed, in Figure 5, how the evenness s '→ s. For convenience, we will refer to levels
of each possible five-beat well-formed rhythm resulting from the original morphisms as clockwise,
changes over all r0 values. Clearly, the same process and those originating from the just introduced
also occurs for the higher-level well-formed rhythms alternatives as counterclockwise. Interestingly,
too. In general, each will have an analogous curve clockwise and counterclockwise levels are identical
(maximal at ri = 1, minimal at ri → ∞) but over a up to rotation (or reflection). This means that by
smaller r0 range as levels are ascended. An example, independently changing the directionality of the
for the well-formed rhythms, up to Level 3, that nest levels in a hierarchy, their relative rotations change.
Figure 8
Figure 9
a motivic transformation device, but also of interest ℓℓ, then ℓs, and finally sℓ (proceeding from the last
in terms of the resultant change in metricality and element to the first). Hence, the probability mass
rhythmic transparency. function over all possible 2-tuples is p(ℓℓ) = 3/7,
The r value has no effect on the entropy of event p(ℓs) = 2/7, p(sℓ) = 2/7, and p(ss) = 0. The entropy
sizes, but the actual form of the scale does. This is (in bits) of a )
probability mass function is calculated
shown in Figure 9, which shows the effect on entropy by H( p) = − i pi log2 pi ; hence the entropy for this
of the given modifications of well-formedness. example is 1.56.
The entropies are calculated from probability Stepping through the diatonic pattern from left
distributions over n-tuples of consecutive events, to right, the following 3-tuples occur: ℓℓs, then ℓsℓ,
with n taking values from one to five We will then sℓℓ, then ℓℓℓ, then ℓℓs, then ℓsℓ, and finally sℓℓ.
show how these are calculated by considering the Hence, the probability mass function over 3-tuples is
diatonic pattern of ℓℓsℓℓℓs. Stepping through the p(ℓℓℓ) = 1/7, p(ℓℓs) = 2/7, p(ℓsℓ) = 2/7, p(sℓℓ) = 2/7,
diatonic pattern from left to right, the following p(ℓss) = 0, p(sℓs) = 0, p(ssℓ) = 0, and p(sss) = 0. This
2-tuples occur: ℓℓ, then ℓs, then sℓ, then ℓℓ, then has an entropy of 1.95. And so forth.