Professional Documents
Culture Documents
Citation: Einbond, A., Schwarz, D., Borghesi, R. & Schnell, N. (2016). Introducing
CatOracle: Corpus-based concatenative improvisation with the Audio Oracle algorithm.
Proceedings of the International Computer Music Conference, pp. 140-146.
This version of the publication may differ from the final published
version.
140 Proceedings of the International Computer Music Conference 2016 Proceedings of the International Computer Music Conference 2016 141
synthesis. Paired with the P I P O (Plugin Interface for Pro- 3.2.2 Audio Descriptors
cessing Objects) framework, analysis of audio descriptors
and segmentation can be performed in realtime or in batch By default the analysis subpatches are set to use pipo.basic,
on a whole collection of sound files. providing as descriptors: frequency, energy, periodicity,
autocorrelation, loudness, centroid, spread, skewness, and
We implemented CBCS in realtime in our CataRT sys-
kurtosis. This allows C AT O RACLE to run entirely within
tem, 4 now rebased on M U B U and P I P O (Figure 1). 5
the free M U B U distribution. Other descriptor calculations
may be customized by replacing the P I P O module with
pipo.yin, pipo.moments, or pipo.mfcc. Or, with a software
license, the full range of the I RCAM D ESCRIPTORS library
[12] is available with pipo.ircamdescriptors. 7
A subpatcher with convenient checkboxes for descriptor
selection may be substituted for the existing analysis mod-
ules in C AT O RACLE allowing access to spectral, temporal,
or many other features in any combination (Figure 3).
142 Proceedings of the International Computer Music Conference 2016 Proceedings of the International Computer Music Conference 2016 143
Figure 7: Paradigms of improvisation with C AT O RACLE.
144 Proceedings of the International Computer Music Conference 2016 Proceedings of the International Computer Music Conference 2016 145
5. DISCUSSION AND FURTHER DIRECTIONS Proceedings of SOFSEM99. Springer-Verlag, 1999,
pp. 291–306.
A number of directions for further research could extend
the implications of this project further. [3] G. Assayag, G. Bloch, M. Chemillier, B. M. Juin,
workshop SMARTER MUSIC by Arthur Wagenaar
The possibility of transcribing improvised sequences A. Cont, and S. Dubnov, “Omax brothers: a dynamic
in music notation to be re-interpreted by a human per- topology of agents for improvization learning,” in ACM
former in realtime has not yet been implemented in ex- Multimedia Conference, Santa Barbara, 2006. Almost everyone has a smartphone nowadays, but they are rarely used in a truly
isting computer-assisted improvisation platforms. While creative way. So far, the smartphone's musical potential has been sadly overlooked.
the BACH package offers promising possibilities, further [4] D. Schwarz, “Corpus-Based Concatenative Synthesis,”
development will be necessary to refine the notation of dy- IEEE Signal Processing Magazine, vol. 24, no. 2, pp. Yet it's sonic potential is very high - a fantastic but under-researched area of sound.
namics, playing techniques, and realtime rhythmic quanti- 92–104, 2007. Phones and pads are used as musical controllers, sometimes as simple instruments,
zation before it is useable in performance. [5] B. Lévy, “Principles and Architectures for an Interac- but this use is limited to a relatively small group of electronic music lovers. Whereas a
C ATA RT’ S potential for soundscape texture synthesis has tive and Agnostic Music Improvisation System,” Ph.D. big group of people uses their phone for gaming (mostly to kill idle time), they don't
already been proposed [14]. Could an AO algorithm offer dissertation, Université Pierre et Marie Curie, Paris, as yet play music on a similar scale.
a more “natural” reproduction of a soundscape, in effect 2013. We use our phones without much conscious thought. That I think is strange, because
imitating its behavior by permitting limitless renewal of
non-repetitive textures? While no additional technical ap- [6] S. Dubnov, G. Assayag, and A. Cont, “Audio Oracle: A their influence is enormous. They make us permanently reachable, permanently
paratus is necessary, listening tests must be employed to New Algorithm for Fast Learning of Audio Structures,” trackable, and ever more we tend to outsource our memories and minds to the
evaluate the effectiveness of potential results. in Proc. ICMC, Copenhagen, 2007. machines in our pockets: another very important aspect of mobile technology, that I
So far FO and AO algorithms have been used predom- [7] G. Surges and S. Dubnov, “Feature Selection and feel does not get the attention it deserves.
inantly for musical creation. However their capacity for Composition Using PyOracle,” in AIIDE Conference, The workshop SMARTER MUSIC combines these two elements. It seeks to explore
musical pattern identification and data reduction could also Boston, 2013.
have uses for analysis of existing music, especially elec- new ways in which the smartphone can be used more thoughtfully, more creatively.
troacoustic or timbrally rich music that still offers a chal- [8] G. Assayag, “Keynote Talk: Creative Symbolic Inter- Phones can be so much more than the time consuming, 'empty' machines they are
lenge for existing techniques. In particular, the graphi- action,” in Proc. ICMC, Athens, 2014. now. They can be given a soul, a voice. They can be used to make music.
cal representation of the oracle could be used to visualize
large-scale formal and sonic connections. C AT O RACLE [9] N. Schnell, A. Röbel, D. Schwarz, G. Peeters, and
R. Borghesi, “MuBu & Friends – Assembling Tools for SMARTER MUSIC is the kick-off of a larger project: a new electro-acoustic
could be integrated with existing tools for digital analy-
sis such as I N D ESCRIP or EA NALYSIS [15] to provide an- Content Based Real-Time Interactive Audio Process- composition I'm currently setting up called UNISONO, a 30 minute piece for
other complementary view of musical structure. ing in Max/MSP,” in Proc. ICMC, Montreal, 2009. orchestra and audience with smartphones (to be premiered somewhere '17-'18). Based
Finally, FO and AO are only two of several oracle al- [10] A. Einbond, D. Schwarz, and J. Bresson, “Corpus- on the novel ' The Circle' by Dave Eggers, UNISONO is about the addictive power of
gorithms that could be evaluated. Another recent exam- Based Transcription as an Approach to the Composi- mobile technology, the force of group pressure to join in, and the impact this has on
ple is the Variable Markov Oracle (VMO) [16]. Or a re- tional Control of Timbre,” in Proc. ICMC, Montreal, humanity. The project is a collaboration between myself and students of
lated project is ImproTek [17], exploring the possibility 2009, pp. 223–226.
of using pre-defined structures as templates for context- Music&Technology (HKU, The Netherlands), who will be designing musical
sensitive improvisation. While it could rely on a tonal [11] A. Einbond, C. Trapani, and D. Schwarz, “Precise smartphone applications, based on my compositional and theatrical ideas. These new
structure, like a jazz progression, it could also follow an Pitch Control in Real Time Corpus-Based Concatena- instruments are to be used by both orchestra and audience, playing along. They should
arbitrary trajectory of descriptors in time. Presently im- tive Synthesis,” in Proc. ICMC, Ljubljana, 2012, pp. be truly creative instruments, offering a large scala of sound possibilities, both
plemented in O PEN M USIC, it could exchange data with 584–588.
'bleepy' and 'natural'. The question is: who's in control...? At first the audience is in
C AT O RACLE in the form of OSC messages. These alterna-
tive algorithms should be explored to determine how their [12] G. Peeters, “A large set of audio features for sound charge, playing at will, interacting with eachother and the orchestra. But gradually the
results differ from C AT O RACLE and how they could be description (similarity and classification),” IRCAM, phones will take over, as hidden pre-programming will become apperent and the
Tech. Rep., 2004, unpublished.
musically enriching. sound will become more narrow and forcing. With sound design the piece lets us feel
[13] A. Einbond, C. Trapani, A. Agostini, D. Ghisi, and what technology can do to our minds in general.
Acknowledgments D. Schwarz, “Fine-tuned Control of Concatenative Along this composition there will be a side program where on the one hand ethical
We gratefully thank Séverine Ballon, Pierre Morlet, Ar- Synthesis with CataRT Using the bach Library for
Max,” in Proc. ICMC, Athens, 2014, pp. 1037–1042.
issues (see above) will be discussed more literally and more deeply, and on the other
shia Cont, Benjamin Lévy, Gérard Assayag, Jean Bres- hand the technical possibilities will be demonstrated outside the temporal context of
son, Mikhail Malt, Emmanuel Jourdan, Paola Palumbo, [14] D. Schwarz and N. Schnell, “Descriptor-based Sound
Stephanie Leroy, Pascale Bondu, Aurèlia Ongena, Jérémie
the composition. SMARTER MUSIC will be the starting point of exploring these
Texture Sampling,” in Sound and Music Computing,
Bourgogne, Julien Aleonard, Sylvain Cadars, and Eric de possibilities.
Barcelona, 2010, pp. 510–515.
Gélis. This paper is dedicated to the memory of David
Wessel, mentor and inspiration for this work. [15] P. Couprie and M. Malt, “Representation: From The workshop will consist of two parts: first, a presentation by myself on my ideas
Acoustics to Musical Analysis,” in EMS Network Con- regarding the sonic possibilities of smartphones in general, and the way they are going
ference, Berlin, 2014.
6. REFERENCES to be used in UNISONO in particular. Secondly, an open brainstorm session, where
[16] C. Wang and S. Dubnov, “Guided Music Synthesis participants of ICMC inspire each other and are invited to join us in this promising
[1] N. Donin, “Sonic Imprints: Instrumental Resynthesis with Variable Markov Oracle,” in AIIDE Conference,
in Contemporary Composition,” in Musical Listening sonic field - thus giving it the extra momentum I feel it strongly deserves. Bring your
Raleigh, 2013, pp. 56–62.
in the Age of Technological Reproduction, G. Borio, phone!
Ed. Farnham/Aldershot: Ashgate, 2015, pp. 323–341. [17] J. Nika and M. Chemillier, “ImproteK, integrating har-
monic controls into improvisation in the filiation of
[2] C. Allauzen, M. Crochemore, and M. Raffinot, “Fac- OMax,” in Proc. ICMC, Ljubljana, 2012, pp. 180–187.
tor Oracle: A New Structure for Pattern Matching,” in
146 Proceedings of the International Computer Music Conference 2016 Proceedings of the International Computer Music Conference 2016 147